Adaptive Multiple Feature Method (AMFM) for the early detection of
parenchymal pathology in a smoking population
Renuka Uppaluri1, Geoffrey McLennan2, Paul Enright3,
James R. Standen4, Pamela Boyer-Pfersdorf3,
and Eric A. Hoffman1,5
- 1Department of Radiology
University of Iowa, Iowa City, IA 52242
- 2Department of Internal Medicine
University of Iowa, Iowa City, IA 52242
- 3Department of Internal Medicine
University of Arizona, Tucson, AZ 85724
- 4Department of Radiology
University of Arizona, Tucson, AZ 85724
- 5Department of Biomedical Engineering
University of Iowa, Iowa City, IA 52242
[ home
| search
| contact us
]
Table of Contents
Application of the Adaptive Multiple Feature Method (AMFM) to identify
early changes in a smoking population is discussed. This method
was specifically applied to determine if differences in CT images of smokers (with
normal lung function)
and non-smokers (with normal lung function) could be found through computerized texture analysis.
Results demonstrated
that these groups could be differentiated with over
80.0% accuracy.
Further, differences on CT images
between normal appearing lung from non-smokers (with normal lung function) and normal appearing
lung
from smokers (with abnormal lung function) were also investigated.
These groups were differentiated with over 89.5%
accuracy. In analyzing the whole lung region by region, the AMFM characterized
38.6% of a smoker lung (with normal lung function)
as mild emphysema. We can conclude that the AMFM detects
parenchymal patterns in the lungs of smokers which are different from normal
patterns occurring in healthy non-smokers. These patterns could perhaps indicate
early smoking-related changes.
Tissue characterization, texture, smoking, pulmonary, Bayesian classifier
We have made significant advances in the use of tissue characterization to
classify parenchymal patterns in reconstructed CT data sets. The Adaptive Multiple
Feature Method (AMFM) is a texture classification scheme that has been
successfully applied towards discrimination of various tissue types associated with
pulmonary emphysema, idiopathic pulmonary fibrosis, and sarcoidosis
[1,2,3]. While all our previous studies involved
CT images of subjects with well characterized disease, this study seeks to
evaluate early parenchymal pathology associated with smoking.
This is a challenge because early changes are subtle making them difficult to detect.
Smoking is one of the main causes of debilitating diseases such as emphysema. Emphysema
is a form of chronic obstructive pulmonary disorder (COPD). The onset
of emphysema starts early but it
presents itself clinically at a stage when it is incurable and care is largely supportive.
In recent years, lung volume reduction surgery has become a viable intervention for
end-stage emphysema subjects. However, patient selection criteria are poorly defined and,
more importantly, sensitive measures for outcome evaluation are lacking. Novel anti-elastase
and other anti-inflammatory modulators are being developed to halt the progression
of early emphysema.
Therefore, the
motivation for this study is to identify a means of detecting
changes caused by smoking before clinical symptoms appear. This can then be used either to
advise the patient to adopt lifestyle changes such as smoking cessation or to
determine the outcome of therapeutic interventions. It
should be noted, once established, emphysema may progress even with smoking cessation.
In this study, three groups of subjects are included - healthy non-smokers with normal
lung function, healthy smokers with normal lung function, and smokers with abnormal
lung function or COPD.
The AMFM will be used to demonstrate the
differences between these three groups.
The AMFM is described in detail elsewhere [1]. To summarize, the AMFM is a pattern
recognition algorithm which performs recognition based on a learning (training) stage.
The different patterns are described via a set of texture measures. During the training
stage, a) an optimal subset of features is determined, which best discriminates the
patterns in training set and b) a classification algorithm is trained to perform
the discrimination of the patterns in the training set using the optimal subset of features.
During the testing stage, the accuracy of the AMFM is tested on an independent set
of patterns.
The optimal feature selection is performed by using the divergence measure along with
correlation analysis [4] and the classification is performed by a Bayesian
approach. The texture measures used are derived from various classes. The first order
measures based on the grey level histogram, run-length measures, the co-occurrence
matrix measures, and the geometric fractal dimension computations are presented in
[5,6,1]. In addition to these 17, 10 new measures, developed
by us, are included in this study. They are the percentile measures, also based on the grey
level histogram, and the stochastic fractal dimension measures [7].
All measures used and their abbreviations are provided in Table 1.
Table 1:
Texture measures and their abbreviations
| Feature |
Abbreviation |
| Mean |
MEAN |
| Variance |
VAR |
| Skewness |
SKEW |
| Kurtosis |
KURT |
| Grey level entropy |
GREYENT |
| Short run emphasis |
SRE |
| Long run emphasis |
LRE |
| Grey level non-uniformity |
GLN |
| Run length non-uniformity |
RLN |
| Run Percentage |
RP |
| Angular second Moment |
ASM |
| Inertia |
INER |
| Contrast |
CON |
| Entropy |
ENT |
| Correlation |
CORR |
| Inverse difference moment |
IDM |
| Lowest fifth percentile |
LOW-FIFTH |
| Upper fifth percentile |
UPPER-FIFTH |
| Mean - Lowest fifth percentile |
LOWER-DIFF |
| Upper fifth percentile - mean |
UPPER-DIFF |
| Ratio of UPPER-DIFF to LOWER-DIFF |
DIFF-RATIO |
| Geometric fractal dimension |
GFD |
| Stochastic fractal dimension mean |
SFD MEAN |
| Stochastic fractal dimension variance |
SFD VAR |
| Stochastic fractal dimension skewness |
SFD SKEW |
| Stochastic fractal dimension kurtosis |
SFD KURT |
| Stochastic fractal dimension grey level entropy |
SFD ENT |
|
The subjects included in this study were: 41 healthy non-smokers (NONSMK),
24 smokers (SMK), and 19 smokers with abnormal lung function (COPD).
A Siemens DRH Somatom scanner was used to acquire CT scans of all subjects
in supine position. Two slices (1 mm slice thickness) were acquired at full
inspiration 5 cm above and 5 cm below the carina. The images were reconstructed
to 512
512 pixels and the grey levels were rescaled to lie between 0-2048.
- 1.
- Global analysis:
All three subject groups were used in this experiment. The left and right lungs
from each image were isolated and used individually as the region of interest
for texture computation and classification. Half the lungs were put into a training
set and the remaining were put into a test set.
The goal was to discriminate a
NONSMK lung from SMK lung, a NONSMK lung from a COPD lung, and a SMK lung from
a COPD lung. The ability to discriminate simultaneously between a NONSMK lung, SMK
lung, and COPD lung was also tested.
On the test set, classification accuracy of each subject group and the overall accuracy of all
subject groups was computed for each discrimination task. The classification accuracy
is the percent of samples belonging to a group correctly classified as such. The
overall accuracy is the percent of correctly classified samples of all groups together.
- 2.
- Regional analysis (observer-defined regions):
In this experiment, NONSMK and COPD subject groups were used. A trained observer
identified areas of four tissue types, namely, normal lung from NONSMK subjects
(NN), emphysema-like lung from COPD subjects (EC), ``normal" lung from COPD subjects (NC),
and mild emphysema-like lung from COPD subjects (MC).
In each defined region, pixel blocks of 31
31 overlapping by 15
15
were placed. Half the blocks were put into a training
set and the remaining were put into a test set.
The goal was to discriminate
NN samples from each of EC, NC, and MC samples, individually. The ability to
discriminate between NN from EC, NC, MC simultaneously was also tested.
As before, classification accuracies and overall accuracies were computed.
- 3.
- Regional analysis (whole lung slices):
All three subject groups were included in this experiment. On each CT slice,
31
31 pixel blocks overlapping by 15
15 were translated through
the lung regions and the pattern classification obtained for each block was assigned to
the 15
15 block at its center. The classifier assigned either a NN, EC, NC, or MC
pattern using the training acquired in the previous experiment.
Each block was color-coded based on its pattern and the images were displayed.
The goal of this experiment was to determine what percent of a NONSMK, SMK, and COPD
lung is comprised of each of the four patterns.
The results for this experiment are partially shown in Table 2. The training
and test sets contained 25 samples of NONSMK, SMK, and COPD groups. In
discriminating all three groups simultaneously, the classification accuracies
were 80.0%, 52.0%, and 64.0% for the NONSMK, SMK, and COPD groups, respectively.
The overall accuracy here was 65.3%.
Table 2:
Global analysis results. NONSMK: Healthy non-smokers, SMK: Smokers
with normal lung function, COPD: Smokers with COPD.
| Discrimination |
Classification |
Classification |
Overall |
| task |
Accuracy (Group 1) |
Accuracy (Group 2) |
Accuracy |
| Group 1: NONSMK |
88.0% |
84.0% |
86.0% |
| vs. Group 2: COPD |
|
|
|
| Group 1: NONSMK |
88.0% |
72.0% |
80.0% |
| vs. Group 2: SMK |
|
|
|
| Group 1: SMK |
76.0% |
72.0% |
74.0% |
| vs. Group 2: COPD |
|
|
|
|
The results for this experiment are partially shown in Table 3.
There were 180 NN, 180 EC, 48 NE, and 76 MC samples in the training and
test sets. In discriminating all four groups together, the classification
accuracies were 86.7%, 92.2%, 68.7%, and 77.6%, for the NN, EC, NC, and MC
groups, respectively. The overall accuracy here was 85.5%.
Table 3:
Regional analysis results (observer-defined regions). NN: Normal
lung region from healthy non-smokers, EC: Emphysematous lung region from
smoking COPD subjects, NC: Normal appearing lung region from smoking COPD
subjects, MC: Mild emphysematous lung region from smoking COPD subjects.
| Discrimination |
Classification |
Classification |
Overall |
| task |
Accuracy (Group 1) |
Accuracy (Group 2) |
Accuracy |
| Group 1: NN |
99.4% |
97.8% |
98.6% |
| vs. Group 2: EC |
|
|
|
| Group 1: NN |
94.4% |
70.8% |
89.5% |
| vs. Group 2: NC |
|
|
|
| Group 1: NN |
96.7% |
88.1% |
94.1% |
| vs. Group 2: MC |
|
|
|
|
The results of this experiment are shown in Table 4. The table
shows the distributions of NN, EC, NC, and MC patterns in NONSMK, SMK, and
COPD lungs. Examples of
a NONSMK, SMK, and COPD lung with their computer outputs are
shown in Figure 1.
Figure 1:
Examples of lungs with their color-coded computer output.
(a) An healthy non-smoker's lung (NONSMK) (b) Computer output of NONSMK lung
(c) An smoker's lung who has normal lung function (SMK) (d) Computer output of
SMK lung (e) Lung of a smoker with abnormal lung function (COPD) (f) Computer output of the COPD lung.
NN: Normal
lung region from healthy non-smokers, EC: Emphysematous lung region from
smoking COPD subjects, NC: Normal appearing lung region from smoking COPD
subjects, MC: Mild emphysematous lung region from smoking COPD subjects.
|
Table 4:
Regional analysis results (whole lung slices). NONSMK: Healthy non-smokers, SMK: Smokers
with normal lung function, COPD: Smokers with COPD, NN: Normal
lung region from healthy non-smokers, EC: Emphysematous lung region from
smoking COPD subjects, NC: Normal appearing lung region from smoking COPD
subjects, MC: Mild emphysematous lung region from smoking COPD subjects.
| Subject |
NN |
EC |
NC |
MC |
| group |
|
|
|
|
| NONSMK |
39.7% |
13.9% |
16.5% |
29.9% |
| SMK |
22.3% |
15.2% |
23.9% |
38.6% |
| COPD |
9.5% |
40.3% |
4.8% |
35.4% |
|
There are several issues that need to be addressed in this
study. The first one is the lack of gold standards. Emphysema can be
characterized by bullae and mild emphysema can be characterized by subtle
changes in the lung architecture, visible to the eye on CT. So there are
visual and clinical standards for these. However, there are no standards to
detect pathologies related to smoking. Therefore, our goal was to determine if
such pathologies, that we postulate to occur in the lung as a result of smoking, are detectable
by computerized texture analysis via CT.
The AMFM has demonstrated to be
successful in differentiating normal subjects from subjects with severe emphysema
[1], and more recently with interstitial lung diseases such as
idiopathic pulmonary fibrosis, and sarcoidosis [2]. Since, in this
study, NONSMK (non-smokers with normal lung function) can be differentiated from
SMK (smokers with normal lung function) with 80.0% accuracy, this further
demonstrates that the AMFM is sensitive enough to pick up subtle differences.
Our regional analysis experiments were used to localize and characterize the differences
between the groups. It
was first demonstrated that visually normal appearing lung regions from NONSMK and COPD
subjects (smokers with abnormal lung function)
were differentiated with 89.5% accuracy. An explanation for this could be that
normal appearing lung in COPD subjects appears ``normal" to the observer
only because it is juxtaposed with severely diseased lung. The mild emphysematous
regions and emphysema regions were also discriminated from normal lung of
NONSMK with accuracies higher than 94.0%.
In analyzing whole lung slices,
it was found that the majority of a SMK lung was composed of mild emphysema
and ``normal from COPD", i.e., lung which visually appears normal but is different
from normal lung of non-smokers.
It is also noted that 22.9% of a NONSMK lung
is composed of mild emphysema. We attribute this to the presence of dark regions
juxtaposed to ribs which may be occurring due to beam hardening.
Since such areas were not included in the training set as being normal, the computer
assigned labels of mild emphysema or emphysema. The computer can perform only
as well as the training it receives.
Therefore, representative training examples along with good data acquisition using
standardized protocols are of utmost importance. Futhermore, it is critical that
motion be eliminated during scanning and that lung volume be standardized.
This study also demonstrates the applicability of the AMFM across scanners.
While our previous studies were performed on Imatron C-150 scanner with 3 mm thick slices,
this study uses 1 mm thick slices from Siemens Somatom DRH scanner. It
should be noted that, for best results, the AMFM should be re-trained for each
scanner and for each new set of scanning and reconstruction parameter.
This study shows that a) via CT, the smokers with
normal lung function can be differentiated from healthy non-smokers and smokers
with COPD b) normal appearing lung regions from COPD subjects can be differentiated from
normal lung regions from healthy non-smokers c) in smokers with normal
lung function, there is a higher occurrence of the mild emphysema pattern.
We conclude that the lung from smokers with normal lung function is indeed abnormal,
suggesting that there are changes which can be attributed to smoking. It is our
hypothesis that these changes pre-date emphysema. Longitudinal studies would
be required to answer this, and the AMFM can be used for such a study.
Supported in part by the a contract from the National Library
of Medicine (N01-LM-4-3511 US PHS), NIH NHLBI SCOR (HL-14136), and a fellowship
grant from the Cystic Fibrosis Foundation. Reprint requests to: Eric A. Hoffman, PhD, Department
of Radiology, University of Iowa College of Medicine, 200 Hawkins Dr., Iowa City, IA, 52242.
- 1
-
R. Uppaluri, T. Mitsa, M. Sonka, E. A. Hoffman, and G. McLennan,
``Quantification of pulmonary emphysema from lung computed tomography
images,'' Am. J. Respir. Crit. Care Med. 156(1), pp. 248-254,
1997.
- 2
-
R. Uppaluri, M. Sonka, E. A. Hoffman, T. Mitsa, C. Dayton, G. W. Hunninghake,
and G. McLennan, ``Quantitative assessment in interstitial lung disease using
the multiple feature (MF) method (abstract),'' Am. J. Respir. Crit.
Care Med. 155(4), p. A322, 1997.
- 3
-
R. Uppaluri, P. G. Hartley, E. A. Hoffman, M. Sonka, and G. McLennan,
``Validation of AMFM: A quantitative, regional assessment of pulmonary
parenchymal pathology from HRCT (abstract),'' Radiology 205((P)), p. 481, 1997.
- 4
-
H. C. Andrews, Introduction to mathematical techniques in pattern
recognition, Wiley, New York, 1972.
- 5
-
R. Uppaluri, T. Mitsa, and J. R. Galvin, ``Fractal analysis of high-resolution
CT images as a tool for quantification of lung diseases,'' in Medical
Imaging 1995: Physiology and Function from Multidimensional Images, Vol.
2433, San Diego, CA, pp. 133-142, SPIE, (Bellingham, WA), 1995.
- 6
-
R. Uppaluri, T. Mitsa, E. A. Hoffman, G. McLennan, and M. Sonka, ``Texture
analysis of pulmonary parenchyma in normal and emphysematous lung,'' in Medical Imaging 1996: Physiology and Function from Multidimensional Images,
Vol. 2709, Newport Beach, CA, pp. 456-467, SPIE, (Bellingham, WA), 1996.
- 7
-
R. Uppaluri, Automated analysis of pulmonary parenchyma from CT images.
PhD thesis, University of Iowa, Iowa City, Iowa, USA, 1997.
©1994-2000 Division of Physiologic Imaging, Dept.
of Radiology, Univ. of Iowa
DPI Homepage |
VIDA |
NLM |
Contact Us |
Search
Last modified: Tue May 9 11:35:36 CDT