| Literature DB >> 30621101 |
Xinqi Wang1, Keming Mao2, Lizhe Wang3, Peiyi Yang4, Duo Lu5, Ping He6.
Abstract
Lung cancer is one of the most deadly diseases around the world representing about 26% of all cancers in 2017. The five-year cure rate is only 18% despite great progress in recent diagnosis and treatment. Before diagnosis, lung nodule classification is a key step, especially since automatic classification can help clinicians by providing a valuable opinion. Modern computer vision and machine learning technologies allow very fast and reliable CT image classification. This research area has become very hot for its high efficiency and labor saving. The paper aims to draw a systematic review of the state of the art of automatic classification of lung nodules. This research paper covers published works selected from the Web of Science, IEEEXplore, and DBLP databases up to June 2018. Each paper is critically reviewed based on objective, methodology, research dataset, and performance evaluation. Mainstream algorithms are conveyed and generic structures are summarized. Our work reveals that lung nodule classification based on deep learning becomes dominant for its excellent performance. It is concluded that the consistency of the research objective and integration of data deserves more attention. Moreover, collaborative works among developers, clinicians, and other parties should be strengthened.Entities:
Keywords: computer tomography; lung nodule classification; medical image analysis; pattern recognition
Mesh:
Year: 2019 PMID: 30621101 PMCID: PMC6338921 DOI: 10.3390/s19010194
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Trend of incidence rates for several cancers in the United States from 1975 to 2013. (a) and (b) present the trend on males and females, respectively.
Figure 2Demonstration of the CAD market trend and its market share of lung cancer. (a) prediction of global CAD market, (b) proportion of different CAD systems applications.
Figure 3Problem statements of lung nodule classification in this research.
Figure 4Demonstration of four types of lung nodule CT image, shown from left to right, W, V, J, and P, respectively [12]. Red circles denote the locations of the nodule.
Figure 5High malignancy suspicious cases are given in (a) and low malignancy suspicious cases are given in (b) [13]. Red bounding boxes denote the locations of the nodule.
Figure 6Lung CT image [13]: (a) Original image with nodule (green color), (b) the part of CT image with nodule > 3 mm ROI (green color). Note that the xml file includes the outline of the node (only its boundary points). In this case, the entire nodule is displayed for better understanding and visibility.
Description from LIDC-IDRI.
| Case | Nodule ID | X Loc. | Y Loc. | Z Loc. | Contour of the Nodule | Malignancy |
|---|---|---|---|---|---|---|
| 0007 | Nodule 001 | 194 | 290 | 37 | ((189,280), (188,281), …, (190,281), (189,280)) | 5 |
| 0007 | IL057_159747 | 293 | 266 | 30 | ((289,280), (290,279), …, (288,279), (289,280)) | 5 |
| 0092 | Nodule 004 | 179 | 282 | 12 | ||
| 0092 | Nodule 005 | 361 | 333 | 108 |
Format of lung nodule position.
| Scan | Type | X | Y | Slice |
|---|---|---|---|---|
| W0001 | Nodule | 98 | 218 | 54 |
| W0001 | Nodule | 54 | 224 | 170 |
| W0003 | Nodule | 158 | 356 | 80 |
| W0005 | Nodule | 120 | 247 | 66 |
| W0006 | Nodule | 109 | 258 | 129 |
| W0007 | Nodule | 70 | 224 | 111 |
Figure 7Demonstration of lung CT images from ELCAP [12]: (a) the complete CT images, (b,c) are the part of CT scans. The symbol “1” is the location of nodule.
Format of lung nodule position.
| Database | Sample Number | Classification |
|---|---|---|
| Shanghai Zhongshan hospital database (ZSDB) | CT images from 350 patients | MIA, AAH, AIS, IA |
| SPIE-AAPM Lung CT Challenge [ | 22,489 CT images from 70 series | malignant and benign |
| General Hospital of Guangzhou Military Command (GHGMC) dataset | 180 benign and 120 malignant lung nodules | malignant and benign |
| NSCLC-Radiomics database [ | 13,482 CT images from 89 patients | malignant and benign |
| Lung Nodule Analysis challenge 2016 (LUNA16) [ | 888 CT scans | subset of LIDC-IDRI |
| Danish Lung Nodule Screening Trial (DLCST) [ | CT images from 4104 participants | Nodule and non-nodule |
MIA: Minimally Invasive Adenocarcinoma, AAH: Atypical Adenomatous Hyperplasia, AIS: Adenocarcinoma In Situ, and IA: Invasive Adenocarcinoma.
Main techniques and comparative analysis of the two-type classification. U: user-defined features, G: Generic features, D: deep features, 3D: 3D features, O: other methods. Acc: accuracy, Sen: sensitivity, Spe: specificity.
| Authors | Year | Database | Features | Classifier | Performance |
|---|---|---|---|---|---|
| Xie et al. [ | 2018 | LIDC-IDRI | U, D | ANNs | AUC:0.9665, 0.9445 and 0.8124 |
| Wei et al. [ | 2018 | LIDC-IDRI | U | Spectral clustering | Error rate: 10.9%, 17.5% |
| Wei et al. [ | 2018 | LIDC-IDRI | G | CBIR | AUC: 0.986, Acc: 91.8% |
| Dey et al. [ | 2018 | LIDC-IDRI, Private | D, 3D | CNNs | Acc: 90.4%, AUC: 0.9548 |
| Xie et al. [ | 2018 | LIDC-IDRI | U, D, 3D | CNNs | AUC: 0.9570, Acc: 91.6% |
| Chen et al. [ | 2018 | 72 patients, | U | SVM | Acc: 84%, Sen: 92.85% |
| Gong et al. [ | 2018 | Private | U, 3D | SVM, LDA, Naïve Bayes | AUC: 0.94, 0.90, 0.99 |
| Zhao et al. [ | 2018 | LIDC-IDRI | D | CNNs | Acc: 82.2%, AUC: 0.877 |
| Li et al. [ | 2018 | LIDC-IDRI, private | G | RF | Sen: 92%, AUC: 0.95 |
| Causey et al. [ | 2018 | LIDC-IDRI | U, D,3D | RF | AUC: 0.99 |
| Zhu et al. [ | 2018 | LIDC-IDRI, LUNA16 | D, 3D | CNNs, GBM | Acc: 90.44% |
| Tajbakhsh et al. [ | 2017 | 415 cases, 489 nodules | D | MTANNs, CNNs | AUC: 0.8806 and 0.7755 |
| Shen et al. [ | 2017 | LIDC-IDRI | 3D, D | CNNs | Acc: 87.14%, AUC: 0.93 |
| Hancock et al. [ | 2017 | LIDC-IDRI | U | Linear classifier | Acc: 88.08%, AUC: 0.949 |
| Xie et al. [ | 2017 | LIDC-IDRI | U, D | CNNs | Acc: 93.40% |
| Le et al. [ | 2017 | ZSDB, LIDC-IDRI | U | RF | AUC: 0.9144 and 0.8234 |
| Kang et al. [ | 2017 | LIDC-IDRI | D, 3D | CNNs | Error rate: 4.59% |
| Wei et al. [ | 2017 | LIDC-IDRI | U | CBIR | AUC: 0.751, Acc: 71.3% |
| Jin et al. [ | 2017 | LIDC | D | CDBNs | Acc: 92.83% |
| Song et al. [ | 2017 | LIDC-IDRI | D | CNNs, DNN, SAEs | Acc: 84.15%, Sen: 83.96% |
| Silva et al. [ | 2017 | LIDC-IDRI | D | CNNs | Sen: 94.66%, Spe: 95.14% |
| Nibali et al. [ | 2017 | LIDC-IDRI | D | CNNs | Acc: 89.90% |
| Xu et al. [ | 2017 | LIDC-IDRI | D | SVM | Acc: 89%, AUC: 0.95 |
| Jiang et al. [ | 2017 | LIDC-IDRI | U | SVM, RF, | Acc: 77.29%, 80.07%, 84.21%; AUC: 0.913 |
| Paing et al. [ | 2017 | TCIA [ | U, 3D | SVM | Acc: 90.9% |
| Shen et al. [ | 2017 | LIDC-IDRI | 3D, D | CNNs | Acc: 87.14%, AUC: 0.93 |
| Dhara et al. [ | 2016 | LIDC-IDRI | U, 3D | SVM | AUC: 0.9505, 0.8822 and 0.8848 |
| Yan et al. [ | 2016 | LIDC-IDRI | D, 3D | CNNs | Acc: 86.7%, 87.3%, and 87.4% |
| Sasidhar et al. [ | 2016 | LIDC-IDRI | U, G | SVM | Acc: 92% |
| Htwe et al. [ | 2016 | LIDC-IDRI, SPIE-AAPM | U | Fuzzy system | Sen: 87%, Acc: 78% |
| Dhara et al. [ | 2016 | LIDC-IDRI | U | SVM | AUC: 0.9465 |
| Gierada et al. [ | 2016 | 94 patients, 96 nodules | U, 3D | Regression analysis | AUC: from 0.79 to 0.83 |
| Sergeeva et al. [ | 2016 | LIDC-IDRI | U | KNN | Acc: 81.3% |
| Fernandes et al. [ | 2016 | 754 nodules | U, 3D | SVM | Sen: 87.94%, Spe: 94.32% |
| Shewaye et al. [ | 2016 | LIDC-IDRI, Private | U, G | SVM, KNN, RF, Logistic Regression, AdaBoost | Acc: 82% of malignant and 93% of benign |
| Rendon-Gonzalez et al. [ | 2016 | SPIE-AAPM | U | SVM | Acc: 78.08%, Sen: 84.93% |
| Kim et al. [ | 2016 | Private | U, D | SVM | Acc: 95.5%, Sen: 94.4% |
| Ma et al. [ | 2016 | TCIA | U | RF | Acc: 82.7% |
| Liu et al. [ | 2016 | LIDC-IDRI | D | CNNs | Error rate: 5.41% |
| Felix et al. [ | 2016 | 274 nodules | U, 3D | MLP, KNN, RF | AUC: 0.82 |
| Sun et al. [ | 2016 | LIDC-IDRI | D | CNNs,DBNs SDAE | Acc: 79.76%, 81.19% and 79.29% |
| Wang et al. [ | 2016 | LIDC-IDRI | U | SVM | Acc: 76.1% |
| Huang et al. [ | 2016 | 100 series | U | Logistic regression | Acc: 79%; AUC: 0.81 |
| Song et al. [ | 2016 | LIDC | U | Acc: 83.4% | |
| Xie et al. [ | 2016 | LIDC-IDRI | U, D | CNNs | Acc: 86.79%; |
| Aggarwal et al. [ | 2015 | Private | U | SVM | Acc: 82.32% |
| Narayanan et al. [ | 2015 | LIDC | U | ANNs | Acc: 92.2%, FP: 0.9% |
| Dilger et al. [ | 2015 | 50 nodules | U, G, 3D | ANNs | AUC: 0.935, Acc: 92% |
| Hua et al. [ | 2015 | LIDC | D | CNNs | Sen: 73.4% and 73.3% |
| Kumar et al. [ | 2015 | LIDC-IDRI | D | binary decision tree | Acc: 75.01%, Sen: 83.35% |
| Shen et al. [ | 2015 | LIDC-IDRI | 3D, D | SVM, RF | Acc: 86.84% |
| Tartar et al. [ | 2014 | Private | U | AdaBoost, Bagging, RSS | Sen: 94.7%, 90.0%, 77.8% |
| Dandil et al. [ | 2014 | 47 patients, 128 nodules | U | ANNs | Acc: 90.63%, Sen: 92.30% |
| Huang et al. [ | 2013 | 107 images | U | SVM | Acc: 83.11%, AUC: 0.8437 |
| Dilger et al. [ | 2013 | 27 nodules | U, 3D | NN | Acc: 92.6% |
| Han et al. [ | 2013 | LIDC-IDRI | U, 3D | SVM | AUC: 0.9441 |
| Lin et al. [ | 2013 | 107 scans | U | SVM | AUC: 0.9019, Acc: 88.82% |
| Nascimento et al. [ | 2012 | LIDC | U | SVM | Sen: 85.64, Spe: 97.89% |
| El-Baz et al. [ | 2011 | LIDC | U, 3D | KNN | Acc: 94.4% |
| Chen et al. [ | 2011 | 47 nodules | D | BPNN, RBPNN, LVQNN | Acc: 78.7% |
| El-Baz et al. [ | 2011 | LIDC | U, 3D | KNN | Acc: 93.6% |
| Namin et al. [ | 2010 | LIDC | U, 3D | KNN | Sen: 88% |
| El-Baz et al. [ | 2010 | LIDC | U, 3D | Bayes | Acc: 96.3% |
| Silva et al. [ | 2009 | Private | U, 3D | SVM | Acc: 100%, Spe: 100% |
| Way et al. [ | 2009 | Private | U, 3D | AUC: 0.863 | |
| Antonelli et al. [ | 2008 | 66 nodules | O | Sen: 95%, Spe: 91.33% | |
| Way et al. [ | 2006 | LIDC | U, 3D | LDA | AUC: 0.83 |
| Suzuki et al. [ | 2005 | 489 nodules | D | ANNs | AUC: 0.882 |
| Armato et al. [ | 2003 | 393 scans, 470 nodules | U, 3D | k-means | AUC: 0.79 |
| Lo et al. [ | 2003 | 48 cases | U, 3D | ANNs | AUC: 0.89 |
| Kawata et al. [ | 2003 | 107 cases | U, 3D | Sen: 91.4%, Spe: 51.4% | |
| Kawata et al. [ | 2001 | 248 nodules | U, 3D | k-means, LDA | AUC: 0.97 |
| Kawata et al. [ | 2000 | 210 nodules | U, 3D | k-means, LDA | AUC: 0.97 |
| Wyckoff et al. [ | 2000 | 21 cases | 3D, U | Acc: 81% | |
| McNitt et al. [ | 1999 | 31 cases | U | LDA | Acc: 90.3% |
Main techniques and comparative analysis of the selected four-type classification. U: User-defined features. G: Generic features. D: deep features. 3D: 3D features. O: other methods. Acc: accuracy. Sen: sensitivity. Spe: specificity.
| Author | Year | Database | Features | Classifier | Performance |
|---|---|---|---|---|---|
| Liu et al. [ | 2018 | LIDC-IDRI, ELCAP | U, D, 3D | CNNs | Acc: 92.3% and 90.3% |
| Yuan et al. [ | 2018 | U, G, D, 3D | SVM | Acc: 93.1% and 93.9% | |
| Mao et al. [ | 2018 | ELCAP | U, D | Softmax | Acc: 95.5% |
| Mao et al. [ | 2016 | ELCAP | U | SVM, clustering | Acc: over 90% |
| Mao et al. [ | 2016 | ELCAP | G | Ensemble classifier | Acc: 92% |
| Zhang et al. [ | 2014 | ELCAP | U, G | SVM, pLSA | Acc: 89% |
| Zhang et al. [ | 2014 | ELCAP | O | Acc: about 88% | |
| Zhang et al. [ | 2013 | ELCAP | U | SVM | Acc: 82.5% |
| Zhang et al. [ | 2013 | ELCAP | G | CPMw | Precision: 0.916 |
| Song et al. [ | 2012 | ELCAP | U, G | SVM | Acc: about 87.5% |
| Farag et al. [ | 2010 | ELCAP | G | LDA | Acc: 81.5% |
| Farag et al. [ | 2010 | ELCAP | G | LDA | Acc: 78.23% |
Figure 8Trend of performance for selected papers: (a) the performance of two-type classification. The blue and red boxes indicate the accuracy and AUC, respectively. Each box indicates the worst, best, and median performance. (b) The performance of four-type classification.
Figure 9Trends of the technology used in this field. For the convenience of observation, the 3D feature methods merge into others.