| Literature DB >> 29065605 |
Fang Yang1, Murat Hamit2, Chuan B Yan2, Juan Yao3, Abdugheni Kutluk2, Xi M Kong2, Sui X Zhang2.
Abstract
Esophageal cancer is one of the fastest rising types of cancers in China. The Kazak nationality is the highest-risk group in Xinjiang. In this work, an effective computer-aided diagnostic system is developed to assist physicians in interpreting digital X-ray image features and improving the quality of diagnosis. The modules of the proposed system include image preprocessing, feature extraction, feature selection, image classification, and performance evaluation. 300 original esophageal X-ray images were resized to a region of interest and then enhanced by the median filter and histogram equalization method. 37 features from textural, frequency, and complexity domains were extracted. Both sequential forward selection and principal component analysis methods were employed to select the discriminative features for classification. Then, support vector machine and K-nearest neighbors were applied to classify the esophageal cancer images with respect to their specific types. The classification performance was evaluated in terms of the area under the receiver operating characteristic curve, accuracy, precision, and recall, respectively. Experimental results show that the classification performance of the proposed system outperforms the conventional visual inspection approaches in terms of diagnostic quality and processing time. Therefore, the proposed computer-aided diagnostic system is promising for the diagnostics of esophageal cancer.Entities:
Mesh:
Year: 2017 PMID: 29065605 PMCID: PMC5394892 DOI: 10.1155/2017/4620732
Source DB: PubMed Journal: J Healthc Eng ISSN: 2040-2295 Impact factor: 2.682
Figure 1Flow chart of the system design.
Figure 2Preprocessing results of the abnormal esophageal X-ray images.
Figure 3Four-level DWT decomposition process.
Details of feature selection by SFS for the first-stage classification process.
| Features | Feature number | ||||
|---|---|---|---|---|---|
| Texture features ( | (0°, 1) |
| 2 | 3 | 4 |
| (45°, 1) | 5 |
|
|
| |
| (90°, 1) |
|
|
| 12 | |
| (135°, 1) | 13 | 14 | 15 | 16 | |
|
| |||||
| Frequency features | 17 | 18 | 19 | 20 | |
| 21 | 22 | 23 |
| ||
|
|
|
|
| ||
|
|
| 31 | 32 | ||
| 33 |
|
| 36 | ||
|
| |||||
| KC features |
| ||||
The numbers in italics are the features selected by SFS.
Details of feature selection by SFS for the second-stage classification process.
| Features | Feature number | ||||
|---|---|---|---|---|---|
| Texture features ( | (0°, 1) |
| 2 | 3 | 4 |
| (45°, 1) | 5 |
| 7 |
| |
| (90°, 1) | 9 |
|
| 12 | |
| (135°, 1) |
| 14 | 15 | 16 | |
|
| |||||
| Frequency features | 17 | 18 | 19 | 20 | |
|
| 22 |
|
| ||
| 25 |
|
|
| ||
|
|
| 31 |
| ||
| 33 | 34 |
| 36 | ||
|
| |||||
| KC features |
| ||||
The numbers in italics are the features selected by SFS.
Details of feature selection by PCA for the two-stage classification process.
| PC | Eigenvalue | Cumulative variance (%) | ||
|---|---|---|---|---|
| First stage | Second stage | First stage | Second stage | |
| PC1 | 11.07 | 15.1 | 35 | 45.8 |
| PC2 | 6.84 | 6.2 | 53.4 | 62.56 |
| PC3 | 5.57 | 4.44 | 68.4 | 74.57 |
| PC4 | 3.4 | 3.24 | 77.6 | 83.32 |
| PC5 | 2.94 | 1.84 | 85.6 | 88.3 |
| PC6 | 1.91 | 1.47 | 90.7 | 92.26 |
Figure 4KNN classification results for various choices of K (%).
Classification performance of SVM and KNN classifiers (%).
| Parameters | All features | SFS selection | PCA selection | ||||
|---|---|---|---|---|---|---|---|
| SVM | KNN | SVM | KNN | SVM | KNN | ||
| AUC | First stage | 94.5 | 93.7 | 97.4 | 95.7 | 95.33 | 94.5 |
| Second stage | 94 | 93.4 | 97 | 94.67 | 95.14 | 94 | |
| Accuracy | First stage | 92.67 | 91.3 | 95 | 93 | 93 | 92.67 |
| Second stage | 91.5 | 90.14 | 94.67 | 92.14 | 92.5 | 91.5 | |
| Precision | First stage | 91 | 90.4 | 94.33 | 92.5 | 91.4 | 91.33 |
| Second stage | 90.67 | 90.33 | 94.14 | 92.33 | 91.67 | 91.4 | |
| Recall | First stage | 91 | 90 | 94 | 92.5 | 91.4 | 91 |
| Second stage | 90.67 | 90.33 | 94.14 | 92 | 91.67 | 91.4 | |
Figure 5Classification performance of the first classification stage (%).
Figure 6Classification performance of the second classification stage (%).
Classification performance of previous studies (%).
| Methods | Classification accuracy |
|---|---|
| GH + Bayes [ | 76.6 |
| WT + Bayes [ | 76.5 |
| GH + GLCM + Bayes [ | 86.7 |
| GLCM + GGCM + PCA + KNN [ | 87.5 |
GH: gray-level histogram; WT: wavelet-based transform; GGCM: gray-gradient co-occurrence matrix.