| Literature DB >> 36241922 |
K C Santosh1, Siva Allu2, Sivaramakrishnan Rajaraman3, Sameer Antani3.
Abstract
There has been an explosive growth in research over the last decade exploring machine learning techniques for analyzing chest X-ray (CXR) images for screening cardiopulmonary abnormalities. In particular, we have observed a strong interest in screening for tuberculosis (TB). This interest has coincided with the spectacular advances in deep learning (DL) that is primarily based on convolutional neural networks (CNNs). These advances have resulted in significant research contributions in DL techniques for TB screening using CXR images. We review the research studies published over the last five years (2016-2021). We identify data collections, methodical contributions, and highlight promising methods and challenges. Further, we discuss and compare studies and identify those that offer extension beyond binary decisions for TB, such as region-of-interest localization. In total, we systematically review 54 peer-reviewed research articles and perform meta-analysis.Entities:
Keywords: Chest x-rays; Deep learning; Medical imaging; Systematic review; Tuberculosis
Mesh:
Year: 2022 PMID: 36241922 PMCID: PMC9568934 DOI: 10.1007/s10916-022-01870-8
Source DB: PubMed Journal: J Med Syst ISSN: 0148-5598 Impact factor: 4.920
Fig. 1TB cases in chest X-rays showing a scattering bilateral reticular and nodular shadows more evident at the right middle and lower zone (source: radiopaedia.org, rID: 39290); b bilateral micronodular insterstitial effusion, left apical bronchiectasis at the level of the pulmonary lingula (granulomatous infectius process), and pleural apical thickening (source: radiopaedia.org, rID: 28037); and c secondary TB with apical lung cavities (source: https://www.kumc.edu/)
Fig. 2Workflow representing different phases of the systematic review (source: PRISMA criteria [26])
Dataset collections and their respective sources
| Data collections | Size | Source | |
|---|---|---|---|
| C1. | Thomas Jefferson University Hospital Dataset (JUHD, USA) | 119 | Source not available |
| C2. | Montgomery County Dataset (MC, USA) | 138 | |
| C3. | Infectious Disease Institute Dataset (Kampala, Uganda) | 138 | Availabel upon request [ |
| C4. | National Institute of Tuberculosis and Respiratory Diseases Dataset (NITRD, India) | 153(DA) 153(DB) | |
| C5. | Japanese Society of Radiological Technology Dataset (JSRT, Japan) | 247 | |
| C6. | Kasturba Hospitals Dataset (Manipal, India) | 317 | Available upon request [ |
| C7. | Gugulethu TB Clinic Dataset (Cape Town-South Africa) | 392 | Source not available |
| C8. | Belarus Tuberculosis Dataset (Belarus) | 422 | |
| C9. | Shenzhen Hospital Dataset (SH, China) | 662 | |
| C10. | GF Jooste and Khayelitsha Hospitals Dataset (South Africa) | 677 | Available upon request [ |
| C11. | Sehatmand Zindagi Dataset (Pakistan) (2016-2017) | 694 | Available upon request [ |
| C12. | Eastern Asia Hospital Dataset (EAH) | 864 | Available upon request [ |
| C13. | Medical Surveillance Dataset (Yonsei University, South Korea) | 39,675 | Available upon request [ |
| C14. | Nepal and Cameroon Dataset | 1,196 | Available upon request [ |
| C15. | Indiana Dataset (Indiana, USA) | 4,104 | |
| C16. | Mendeley Dataset (UK) | 5,232 | |
| C17. | First Affiliated Hospitals of Xi’an JiaoTong University Dataset (FAHXJU, China) | 5,344 | Available upon request [ |
| C18. | Pediatric Pneumonia CXR Dataset (USA) | 5,856 | |
| C19. | Sehatmand Zindagi Dataset (Pakistan) (2013-2015) | 6,090 | Available on request [ |
| C20. | Korean Institute of Tuberculosis Dataset (KIT, South Korea) | 10,848 | Source not available |
| C21. | Tuberculosis CXR Dataset (TBX11K, China) | 11,200 | |
| C22. | Open-i Dataset (NLM, USA) | 11,425 | |
| C23. | Sitec Medical Dataset (Philippines) | 14,094 | Source not available |
| C24. | Radiological Society of North America Dataset (RSNA, USA) | 26,684 | |
| C25. | Find and Treat Dataset (UK) | 47,510 | Source not available |
| C26. | Chest X-rayS - NIH (MD, USA) | 112,120 | |
| C27. | AbiyeV Dataset (Turkey) | 120,120 | Available upon request [ |
| C28. | CheXpert CXR Dataset (USA) | 223,648 |
Chest X-ray imaging tools, dataset size and their performance measured in Accuracy (ACC), Area Under the Curve (AUC), Specificity (SPEC), and Sensitivity (SEN)
| Authors (year) | Method | Data collection | Performance | |||
|---|---|---|---|---|---|---|
| (size) | ACC in | AUC | SPEC | SEN | ||
| Hwang et al. (2016) [ | CNN | C2 (138) | 67.40 | 0.88 | − | − |
| C9 (662) | 83.70 | 0.93 | − | − | ||
| C20 (10, 848) | 90.30 | 0.96 | − | − | ||
| Melendez et al. (2016) [ | CAD4TBV3.07 | C7 (392) | − | 0.84 | 0.49 | 0.95 |
| Lakhani and Sundaram (2017) [ | Ensemble: AlexNet | C1, C2, C8 | − | 0.99 | 1.00 | 0.97 |
| GoogLeNet | C9 (1, 007) | |||||
| Lopes and Valiati (2017) [ | CNN: GoogLenet | C2 (138) | 82.60 | 0.93 | − | − |
| VGGNet, ResNet | C9 (662) | 84.70 | 0.90 | − | − | |
| Abiyev and Ma’aitah (2018) [ | CNN | C27 (120, 120) | 92.40 | − | − | − |
| Rajpurkar et al. (2018) [ | CheXNeXt | C26 (112, 120) | − | 0.85 | − | − |
| Melendez et al. (2018) [ | CAD4TB | C25 | − | 0.90 | 0.56 | 0.95 |
| Zaidi et al. (2018) [ | CAD4TBV3.07 | C19 (6, 090) | − | 0.84 | 0.68 | 0.82 |
| Bekar et al. (2018) [ | ViDi | C3 (138) | − | 0.82 | 0.98 | 0.91 |
| Yadav et al. (2018) [ | ResNet | C2,C9, | ||||
| C26 (112, 920) | 94.89 | − | − | − | ||
| Qin et al. (2019) [ | CAD4TB(V6) | C14 (1, 196) | 92.00 | 0.92 | 0.96 | 0.47 |
| Lunit | C14 (1, 196) | 94.00 | 0.94 | 0.97 | 0.58 | |
| qXR | C14 (1, 196) | 94.00 | 0.94 | 0.96 | 0.71 | |
| Ge et al. (2019) [ | D121-BL | C26 (112, 120) | − | 0.84 | − | − |
| Hwa et al. (2019) [ | Ensemble: VGG16 and | C2, C9 (800) | 89.77 | − | 0.88 | 0.91 |
| Inception V3 | ||||||
| Pasa et al. (2019) [ | CNN and SVM | C2, C8 | ||||
| C9 (1, 104) | 86.20 | 0.93 | − | − | ||
| Evangelista and Guedes (2019) [ | Ensemble: Inception | C2, C5 | ||||
| ResNet and VGG | C9 (893) | 93.82 | − | 0.94 | 0.93 | |
| Ahsan et al. (2019) [ | VGG16 | C2, C9 (800) | 81.25 | − | − | − |
| Heo et al. (2019) [ | DCNN | C13 (39, 675) | − | 0.97 | 0.96 | 0.81 |
| Philipsen et al. (2019) [ | CAD4TB | C23 | − | 0.93 | 0.87 | 0.82 |
| Kim et al. (2020) [ | DCNN | C26 | − | 0.87 | 0.76 | 0.85 |
| Nash et al. (2020) [ | qXR | C6 (317) | − | 0.81 | 0.80 | 0.71 |
| Rajpurkar et al. (2020) [ | CheXiad | C10 (677) | 65.00 | − | 0.61 | 0.73 |
| Das et al.(2020) [ | Truncated Inception Net | C2, C9 (800) | 99.92 | 0.99 | 1.00 | 0.93 |
| Sathitratanacheewin et al. (2020) [ | DCNN | C9 (662) | − | 0.98 | 0.82 | 0.72 |
| C26 (112, 120) | − | 0.71 | − | − | ||
| Yoo et al. (2020) [ | CNN | C9 | 80.00 | 0.80 | 0.89 | 0.72 |
| C12 | ||||||
| Sahlol et al. (2020) [ | Mobile Net-AEO | C9 (662) | 90.20 | − | 0.90 | 0.91 |
| C16 (5, 232) | 94.10 | − | 0.97 | 0.87 | ||
| Xie et al. (2020) [ | Faster RCNN | C2 (138) | 92.60 | 0.98 | 0.923 | 0.93 |
| C9 (662) | 90.20 | 0.94 | 0.95 | 0.85 | ||
| C17 (5, 344) | 97.40 | 0.99 | 0.96 | 0.98 | ||
| Rajaraman and Antani (2020) [ | Ensemble: | C9, C15, C18 | 94.10 | 0.99 | 0.96 | 0.93 |
| InceptionResNet-V2, | C24 (37, 306) | |||||
| DenseNet-121 | ||||||
| Inception-V3, VDSNet | ||||||
| Rahman et al. (2020) [ | CheXNet | C2, C8, C9 | 96.40 | − | 0.96 | 0.96 |
| C24 (27, 484) | ||||||
| Guo et al. (2020) [ | Ensemble: VGG16, VGG19, | C9 (662) | 94.50 | − | 0.95 | 0.99 |
| Inception V3, ResNet34, | ||||||
| ResNet50, and ResNet101 | C26 (112, 120) | 95.60 | − | 0.98 | 0.98 | |
| Abideen et al. (2020) [ | B-CNN | C2 (138) | 96.42 | − | − | − |
| C9 (662) | 86.46 | − | − | − | ||
| Murphy et al. (2020) [ | CAD4TBV6 | C19 (6, 090) | − | 0.99 | 0.98 | 0.90 |
| Habib et al. (2020) [ | CAD4TB | C11 (694) | − | 0.78 | 0.42 | 0.91 |
| Rajaraman et al. (2020) [ | Ensemble: VGG-16, VGG-19 | C24, C28 (250, 332) | 91.63 | 0.97 | 0.88 | 0.924 |
| Xception, NASNet-mobile | ||||||
| Inception-V3, MobileNet | ||||||
| DenseNet-121 | ||||||
| Ayaz et al. (2021) [ | Ensemble: Inceptionv3, InceptionResnetv2 | C2 (138) | 93.47 | 0.97 | − | − |
| VGG16, VGG19, MobileNet | ||||||
| ResNet50, Xception | C9(662) | 97.59 | 0.99 | − | − | |
| Rajaraman et al. (2021) [ | ResNet-BS | C2 (138) | 92.30 | 0.96 | 0.97 | 0.88 |
| C9 (662) | 88.79 | 0.95 | 0.89 | 0.88 | ||
In data collection CX( YY), YY refers to YY number of samples used in that study from any specific collection index X
Comparative study on C2 (MC, USA) data collection
| Authors (year) | ACC (in %) | AUC | SPEC | SEN |
|---|---|---|---|---|
| Hwang et al. (2016) [ | 67.40 | 0.88 | − | − |
| Lakhani and Sundaram (2017) [ | − | 0.99 | 1.00 | 0.97 |
| Lopes and Valiati (2017) [ | 82.60 | 0.93 | − | − |
| Yadav et al. (2018) [ | 94.89 | − | − | − |
| Hwa et al. (2019) [ | 89.77 | − | 0.89 | 0.91 |
| Pasa et al. (2019) [ | 86.20 | 0.93 | − | − |
| Evangelista and Guedes (2019) [ | 93.82 | − | 0.94 | 0.93 |
| Ahsan et al. (2019) [ | 81.25 | − | − | − |
| Das et al. (2020) [ | 99.92 | 0.99 | 1.00 | 0.93 |
| Xie et al. (2020) [ | 92.60 | 0.98 | 0.92 | 0.93 |
| Rahman et al. (2020) [ | 96.40 | − | 0.97 | 0.96 |
| Abideen et al. (2020) [ | 96.42 | − | − | − |
| Ayaz et al. (2021) [ | 93.47 | 0.97 | − | − |
| Rajaraman et al. (2021) [ | 92.30 | 0.96 | 0.97 | 0.88 |
Other data collections were employed in addition to C2
Comparative study on C8 (Belarus) data collection
| Authors (year) | ACC (in %) | AUC | SPEC | SEN |
|---|---|---|---|---|
| Lakhani and Sundaram (2017) [ | − | 0.99 | 1.00 | 0.97 |
| Pasa et al. (2019) [ | 86.20 | 0.93 | − | − |
| Rahman et al. (2020) [ | 96.40 | − | 0.97 | 0.96 |
Other data collections were employed in addition to C8
Comparative study on C9 (SH, CHina) data collection
| Authors (year) | ACC (in %) | AUC | SPEC | SEN |
|---|---|---|---|---|
| Hwang et al. (2016) [ | 83.70 | 0.93 | − | − |
| Lakhani and Sundaram (2017) [ | − | 0.99 | 1.00 | 0.97 |
| Lopes and Valiati (2017) [ | 84.70 | 0.90 | − | − |
| Yadav et al. (2018) [ | 94.89 | − | − | − |
| Hwa et al. (2019) [ | 89.77 | − | 0.89 | 0.91 |
| Pasa et al. (2019) [ | 86.20 | 0.93 | − | − |
| Evangelista and Guedes (2019) [ | 93.82 | − | 0.94 | 0.93 |
| Ahsan et al. (2019) [ | 81.25 | − | − | − |
| Das et al.(2020) [ | 99.92 | 0.99 | 1.00 | 0.93 |
| Sathitratanacheewin et al. (2020) [ | − | 0.98 | 0.82 | 0.72 |
| Yoo et al. (2020) [ | 80.00 | 0.80 | 0.89 | 0.72 |
| Sahlol et al. (2020) [ | 90.20 | − | 0.90 | 0.91 |
| Xie et al. (2020) [ | 90.20 | 0.94 | 0.95 | 0.85 |
| Rajaraman and Antani (2020) [ | 94.10 | 0.995 | 0.96 | 0.93 |
| Rahman et al. (2020) [ | 96.40 | − | 0.97 | 0.96 |
| Guo et al. (2020) [ | 94.5 | − | 0.96 | 0.99 |
| Abideen et al. (2020) [ | 86.46 | − | − | − |
| Ayaz et al. (2021) [ | 97.59 | 0.99 | − | − |
| Rajaraman et al. (2021) [ | 88.79 | 0.95 | 0.89 | 0.88 |
Other data collections were employed in addition to C9
Comparative study on C24 (RSNA, USA) data collection
| Authors (year) | ACC (in %) | AUC | SPEC | SEN |
|---|---|---|---|---|
| Rajaraman and Antani (2020) [ | 94.10 | 0.99 | 0.96 | 0.93 |
| Rahman et al. (2020) [ | 96.40 | − | 0.97 | 0.96 |
| Rajaraman et al. (2020) [ | 91.63 | 0.97 | 0.88 | 0.92 |
Other data collections were employed in addition to C24
Comparative study on C26 (MD, USA) data collection
| Authors (year) | ACC (in %) | AUC | SPEC | SEN |
|---|---|---|---|---|
| Rajpurkar et al. (2018) [ | − | 0.85 | − | − |
| Yadav et al. (2018) [ | 94.89 | − | − | − |
| Ge et al. (2019) [ | − | 0.84 | − | − |
| Kim et al. (2020) [ | − | 0.87 | 0.76 | 0.85 |
| Sathitratanacheewin et al. (2020) [ | − | 0.71 | − | − |
| Guo et al. (2020) [ | 95.60 | − | 0.99 | 0.98 |
Other data collections were employed in addition to C26