| Literature DB >> 35137543 |
Zheng Wu1, Fei Wang1, Wei Cao1, Chao Qin1, Xuesi Dong1, Zhuoyu Yang1, Yadi Zheng1, Zilin Luo1, Liang Zhao1, Yiwen Yu1, Yongjie Xu1, Jiang Li1,2, Wei Tang3, Sipeng Shen4,5, Ning Wu3,6, Fengwei Tan7, Ni Li1,2, Jie He1,7.
Abstract
BACKGROUND: Screening with low-dose computed tomography (LDCT) is an efficient way to detect lung cancer at an earlier stage, but has a high false-positive rate. Several pulmonary nodules risk prediction models were developed to solve the problem. This systematic review aimed to compare the quality and accuracy of these models.Entities:
Keywords: early detection and early diagnosis; lung cancer; prediction; pulmonary nodule; screening
Mesh:
Year: 2022 PMID: 35137543 PMCID: PMC8888150 DOI: 10.1111/1759-7714.14333
Source DB: PubMed Journal: Thorac Cancer ISSN: 1759-7706 Impact factor: 3.500
FIGURE 1Flow chart of literature search
FIGURE 2Characters of existing models; (a) size and distribution of training sets used for modeling; (b) number and distribution of existing models; (c) number and distribution of models seeking validation in different ways; (d) number and distribution of models from different regions and data sources; and (e) frequency of risk factors used in traditional final models
FIGURE 3AUCs and confidence intervals of existing models by regions and time periods
Basic information and development of models based on the deep learning algorithm
| First author | Year | Study design | Targeted population | Inclusion criteria of participants | Inclusion criteria of nodules | Sample size | Cases of lung cancer | Data source |
|---|---|---|---|---|---|---|---|---|
| Yoganand Balagurunathan | 2019 | Screening trial | American | 55–74 years old and smoker | ≥4 mm | 244 | 78 | Multicenter |
| Gerard A. Silvestri | 2018 | Cohort study | American and Canadian | >40 years old | 8–30 mm | 178 | 29 | Multicenter |
| Chao Zhang | 2019 | Cohort study | American and Chinese | Unspecified | Unspecified | Multicenter | ||
| Johanna Uthoff | 2019 | Cohort study | American | 363 | 74 | Multicenter | ||
| Ilaria Bonavita | 2020 | Cohort study | American | Unspecified | Unspecified | Multicenter | ||
| Parnian Afshar | 2020 | Cohort study | American | 1010 | Unspecified | Multicenter | ||
| Huafeng Wang | 2018 | Cohort study | American | 1018 | Unspecified | Multicenter | ||
| Jason L. Causey | 2018 | Cohort study | American | 1018 | Unspecified | Multicenter | ||
| Samuel Hawkins 1 | 2016 | Screening trial | American | 55–74 years old and smoker | ≥4 mm | 600 | 200 | Multicenter |
| Samuel Hawkins 2 | 2016 | Screening trial | American | 55–74 years old and smoker | ≥4 mm | 600 | 200 | Multicenter |
| Andrew V. Kossenkov | 2019 | Cohort study | American | smoker | 6–20 mm | 583 | 293 | Multicenter |
| G. A. Soardi | 2015 | Cohort study | American | ≤30 mm | 311 | 199 | Single‐center | |
| Zuohong Wu | 2021 | Cohort study | Chinese | ≤30 mm | 995 | 772 | Single‐center | |
| Stéphane Chauvie | 2020 | Screening trial | Chinese | 45–75 years old and smoker | 234 | 32 | Multicenter | |
| Shulong Li | 2019 | Cohort study | American | 1010 | Unspecified | Multicenter | ||
| Rekka Mastouri | 2021 | Cohort study | American | Unspecified | Unspecified | Multicenter | ||
| Yin‐Chen Hsu | 2020 | Cohort study | Chinese | 836 | 27 | Single‐center | ||
| Jiabao Liu | 2020 | Cohort study | Chinese | 6–30 mm | 879 | 601 | Multicenter | |
| Rahul Paul | 2020 | Cohort study | American | 55–74 years old and smoker | ≥4 mm | 261 | 85 | Multicenter |
| Muahammad Bilal Zia | 2020 | Cohort study | American | 1010 | Unspecified | Multicenter | ||
| Yi‐Ming Xu | 2020 | Cohort study | American | 55–74 years old and smoker | ≥4 mm | 1109 | 926 | Multicenter |
| Subba R. Digumarthy | 2019 | Cohort study | American | 36 | Unspecified | Single‐center | ||
| Yangwei Xiang | 2019 | Cohort study | Chinese | 588 | 462 | Single‐center | ||
| Liting Mao | 2019 | Cohort study | Chinese | 294 | 61 | Single‐center | ||
| Shaun Daly | 2013 | Cohort study | American | 136 | 69 | Single‐center |
Validation of traditional models
| First author | Year | Type of validation | Calibration | Sample size | AUC | Thresholds | Sensitivity | Specificity |
|---|---|---|---|---|---|---|---|---|
| Annette McWilliams | 2013 | External | Excellent | 1090 | 0.970 | 0.05 | 0.71 | 0.96 |
| Barbara Nemesure | 2019 | Internal | Not calibrated | 1455 | 0.860 | 0.73 | 0.81 | |
| Michael W. Marcus | 2019 | Internal | Excellent | 1013 | 0.882 | |||
| Martin T. ammemagi | 2018 | External | Excellent | 3680 | 0.947 | |||
| Vineet K. Raghu | 2019 | External | Not calibrated | 126 | 0.882 | 0.61 | 0.28 | 1.00 |
| Joan E Walter | 2018 | Internal | Excellent | 809 | 0.850 | |||
| Xianfeng Li | 2017 | Internal | Not calibrated | 39 | 0.921 | |||
| Michal Reid | 2019 | External | Excellent | 45 | 0.810 | |||
| Michael K. Gould | 2007 | Internal | Excellent | 375 | 0.790 | |||
| Sungmin Zo | 2020 | Internal | Excellent | 157 | 0.952 | |||
| Xiao‐Bo Chen | 2019 | External | Excellent | 216 | 0.848 | |||
| Stephen J. Swensen | 1997 | Internal | Excellent | 210 | 0.833 | 0.10 | 0.93 | 0.47 |
| 0.40 | 0.51 | 0.90 | ||||||
| Man Zhang | 2015 | Internal | Not calibrated | 120 | 0.910 | 0.55 | 0.87 | 0.85 |
| Bin Zheng 1 | 2015 | Internal | Not calibrated | 198 | 0.808 | |||
| Bin Zheng 2 | 2015 | Internal | Not calibrated | 84 | 0.845 | |||
| Jingsi Dong | 2014 | Internal | Not calibrated | 1679 | 0.935 | |||
| Yun Li | 2012 | External | Not calibrated | 145 | 0.874 | 0.46 | 0.95 | 0.70 |
| Li Yang | 2017 | Internal | Not calibrated | 344 | 0.784 | 0.70 | 0.79 |
AUC, area under curve.
Validation of models based on the deep learning algorithm
| First author | Year | Sample size | Type of validation | AUC | Threshold | Sensitivity | Specificity |
|---|---|---|---|---|---|---|---|
| Yogan and Balagurunathan | 2019 | 235 | Internal | 0.850 | 0.54 | 0.91 | |
| Gerard A. Silvestri | 2018 | 178 | Internal | 0.760 | 0.05 | 0.97 | 0.44 |
| Chao Zhang | 2019 | Unspecified | External | 0.855 | 0.84 | 0.83 | |
| Johanna Uthoff | 2019 | 100 | External | 0.965 | 0.38 | 1.00 | 0.96 |
| Ilaria Bonavita | 2020 | Unspecified | Internal | Unspecified | |||
| Parnian Afshar | 2020 | 1010 | Internal | 0.964 | 0.95 | 0.90 | |
| Huafeng Wang | 2018 | 1018 | Internal | 0.970 | |||
| Jason L. Causey | 2018 | 1018 | Internal | 0.993 | |||
| Samuel Hawkins 1 | 2016 | 600 | Internal | 0.83 | |||
| Samuel Hawkins 2 | 2016 | 600 | Internal | 0.79 | |||
| Andrew V. Kossenkov | 2019 | 158 | External | 0.825 | 0.69 | 0.84 | |
| G. A. Soardi | 2015 | 311 | Internal | 0.893 | |||
| Zuohong Wu | 2021 | 995 | Internal | 0.851 | 0.88 | 0.64 | |
| Stéphane Chauvie | 2020 | 234 | Internal | Unspecified | 0.90 | 1.00 | |
| Shulong Li | 2019 | 1010 | Internal | 0.931 | 0.83 | 0.92 | |
| Rekka Mastouri | 2021 | Unspecified | Internal | 0.92 | 0.92 | 0.92 | |
| Yin‐Chen Hsu | 2020 | 836 | Internal | 0.873 | 0.75 | 0.85 | |
| Jiabao Liu | 2020 | 879 | Internal | 0.938 | 0.58 | 0.84 | 0.91 |
| Rahul Paul | 2020 | 261 | Internal | 0.960 | |||
| Muahammad Bilal Zia | 2020 | 1010 | Internal | Unspecified | 0.91 | 0.91 | |
| Yi‐Ming Xu | 2020 | 1109 | Internal | Unspecified | 0.93 | 0.89 | |
| Subba R. Digumarthy | 2019 | 36 | Internal | 0.708 | |||
| Yangwei Xiang | 2019 | 588 | Internal | 0.890 | 0.90 | 0.80 | |
| Liting Mao | 2019 | 294 | Internal | 0.970 | 0.81 | 0.92 | |
| Shaun Daly | 2013 | 81 | External | 0.676 | 0.95 | 0.25 |
AUC, area under curve.
Basic information and development of traditional models
| First author | Year | Study design | Study method | Target population | Inclusion criteria of participants | Inclusion criteria of nodules | Sample size | Cases of lung cancer | EPVb | Data source |
|---|---|---|---|---|---|---|---|---|---|---|
| Annette McWilliams | 2013 | Screen trial | Logistic regression | Canadian | 50–74 years old | ≥1 mm | 1871 | 102 | 11.33 | Multicenter |
| Barbara Nemesure | 2019 | Cohort study | Cox regression | American | 1469 | 85 | 6.54 | Single‐center | ||
| Michael W. Marcus | 2019 | Screen trial | Logistic regression | English | 50–75 years old | ≥3 mm | 1013 | 52 | 2.60 | Multicenter |
| Martin Tammemagi | 2018 | Screen trial | Logistic regression | Canadian | 50–74 years old | ≥1 mm | 1871 | 111 | 10.10 | Multicenter |
| Vineet K. Raghu | 2019 | Cohort study | Logistic regression | American | Smoker | 92 | 50 | 10.00 | Multicenter | |
| Joan E. Walter | 2018 | Screen trial | Logistic regression | Dutch/Belgian | 50–75 years old and smoker | 809 | 50 | 7.14 | Multicenter | |
| Xianfeng Li | 2017 | Cohort study | Fisher discriminant analysis | Chinese | 20–80 years old | 5–30 mm | 39 | 20 | 1.00 | Single‐center |
| Michal Reid | 2019 | Cohort study | Logistic regression | American | ≥18 years old | ≤30 mm | 301 | 200 | 10.00 | Single‐center |
| Michael K. Gould | 2007 | Cohort study | Logistic regression | American | 7–30 mm | 375 | 204 | 13.60 | Multicenter | |
| Sungmin Zo | 2020 | Cohort study | Logistic regression | Korean | 157 | 90 | 5.29 | Single‐center | ||
| Xiao‐Bo Chen | 2019 | Cohort study | Logistic regression | Chinese | 8–20 mm | 493 | 214 | 11.26 | Single‐center | |
| Stephen J. Swensen | 1997 | Cohort study | Logistic regression | American | 4‐30 mm | 419 | 145 | 8.06 | Single‐center | |
| Man Zhang | 2015 | Cohort study | Logistic regression | Chinese | ≤30 mm | 314 | 248 | 14.59 | Multicenter | |
| Bin Zheng 1 | 2015 | Cohort study | Logistic regression | Chinese | ≤30 mm and GCO | 405 | 367 | 11.84 | Single‐center | |
| Bin Zheng 2 | 2015 | Cohort study | Logistic regression | Chinese | ≤30 mm and GCO ≥50% | 159 | 166 | 5.35 | Single‐center | |
| Jingsi Dong | 2014 | Cohort study | Logistic regression | Chinese | 1679 | 1296 | 58.91 | Single‐center | ||
| Yun Li | 2012 | Cohort study | Logistic regression | Chinese | 371 | 229 | 15.27 | Unspecified | ||
| Li Yang | 2017 | Cohort study | Logistic regression | Chinese | 1078 | 721 | 65.55 | Single‐center |
Approximate number.
EPV, events per variable; GCO, ground glass opacity.
Variables of traditional models
| Variables | First authors of models | ||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Annette McWilliams | Barbara Nemesure | Michael W. Marcus | Martin Tammemagi | Vineet K. Raghu | Joan E. Walter | Xianfeng Li | Michal Reid | Michael K. Gould | Sungmin Zo | Xiao‐Bo Chen | Stephen J. Swensen | Man Zhang | Bin Zheng 1 | Bin Zheng 2 | Jingsi Dong | Yun Li | Li Yang | ||
| Basic character | Age | 0 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | |
| Sex | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | ||||
| Personal history of other cancer | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | |||||||||
| Family history of lung cancer | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | ||||||
| Family history of other cancer | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | ||||||||
| BMI | 0 | 0 | 0 | 0 | |||||||||||||||
| Exposure of asbestos | 0 | 1 | 0 | ||||||||||||||||
| FVC | 1 | ||||||||||||||||||
| History of respiratory diseases | 1 | 1 | 0 | 0 | 0 | 0 | 0 | ||||||||||||
| Smoke | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | |||
| Clinical symptoms | 0 | 0 | 0 | ||||||||||||||||
| Time since previous lung cancer was diagnosed | 0 | ||||||||||||||||||
| FEV1 | 0 | 1 | 1 | ||||||||||||||||
| Biomarkers | Squamous cell carcinoma antigen | 0 | |||||||||||||||||
| NSE | 0 | 0 | |||||||||||||||||
| CEA | 1 | 0 | 0 | 0 | 0 | 0 | 1 | ||||||||||||
| CYFRA21‐1 | 1 | 0 | 1 | 1 | |||||||||||||||
| MiRNA‐21‐5p | 1 | 0 | |||||||||||||||||
| MiR‐574‐5p | 1 | 0 | |||||||||||||||||
| Laboratory indicators | 0 | 0 | |||||||||||||||||
| Ferritin | 0 | ||||||||||||||||||
| Imaging information | Size | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | |||
| Volume | 1 | 1 | 1 | ||||||||||||||||
| Density | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | ||||||||||
| Location | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | |||
| Count | 0 | 0 | 0 | 1 | 0 | ||||||||||||||
| Margin (spiculate) | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | ||||
| Satellite lesions | 1 | 1 | 0 | 0 | 1 | ||||||||||||||
| Calcification | 0 | 0 | 1 | 1 | 0 | 1 | 1 | ||||||||||||
| Cavitation | 0 | 0 | 0 | ||||||||||||||||
| Shape | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | ||||||||
| Enhancement | 1 | 0 | 0 | ||||||||||||||||
| Pleural indentation | 1 | 0 | 0 | 0 | |||||||||||||||
| Bronchus sign | 1 | 0 | |||||||||||||||||
| Vascular signs | 0 | 0 | |||||||||||||||||
| Enphysema | 0 | 0 | 1 | 1 | 0 | 0 | |||||||||||||
| Vessels sign | 0 | ||||||||||||||||||
| Vessel number | 1 | ||||||||||||||||||
| Tracheal signs | |||||||||||||||||||
| Previous CT scan | 0 | ||||||||||||||||||
| Previous X‐ray | 0 | ||||||||||||||||||
| Vacuole signs | |||||||||||||||||||
| Associated pleural effusion | 0 | 0 | |||||||||||||||||
| Enlarged hilar or mediastinal lymph nodes | 0 | 0 | |||||||||||||||||
| Visibility in retrospect | 0 | ||||||||||||||||||
| Carbohydrate antigen | 0 | ||||||||||||||||||
| Neuron‐specific enolase | 0 | ||||||||||||||||||
0 depicts the inclusion of a variable into the model as a candidate variable; 1 depicts retention in the final model.
bBMI, body mass index; FVC, forced vital capacity; FEV1, forced expiratory volume in one second; NSE, neuron‐specific enolase; CEA, carcinoembryonic antigen; CEFRA21‐1, cytokeratin fragment antigen 21‐1; MiR(NA), MicroRNA.
Comparison between existing methods and models based on the deep learning algorithm
| First author | Objects for comparison | Indicators for comparison | Superior methods |
|---|---|---|---|
| Yogan and Balagurunathan | None | ||
| Gerard A. Silvestri | Traditional models | AUCa | Deep learning |
| Gerard A. Silvestri | Clinician | AUC | Deep learning |
| Chao Zhang | Clinician | Accuracy, sensitivity, and specificity | Deep learning |
| Johanna Uthoff | None | ||
| Ilaria Bonavita | Clinician | F1 score | Deep learning |
| Parnian Afshar | None | ||
| Huafeng Wang | None | ||
| Jason L. Causey | Clinician | AUC | Similar |
| Samuel Hawkins 1,2 | Lung‐RADS | AUC | Deep learning |
| Samuel Hawkins 1,2 | Traditional models | AUC | Similar |
| Andrew V. Kossenkov | Traditional models | AUC | Deep learning |
| G. A. Soardi | None | ||
| Zuohong Wu | Traditional models | AUC | Deep learning |
| Stéphane Chauvie | Lung‐RADS | PPVa, sensitivity, and specificity | Deep learning |
| Stéphane Chauvie | Traditional models | PPV, sensitivity, and specificity | Deep learning |
| Shulong Li | None | ||
| Rekka Mastouri | None | ||
| Yin‐Chen Hsu | Lung‐RADS | AUC | Deep learning |
| Jiabao Liu | Clinician | AUC | Deep learning |
| Rahul Paul | None | ||
| Muahammad Bilal Zia | None | ||
| Yi‐Ming Xu | Clinician | Sensitivity | Deep learning |
| Subba R. Digumarthy | None | ||
| Yangwei Xiang | Traditional models | AUC | Deep learning |
| Liting Mao | ACR‐lung RADSa | Accuracy, sensitivity, and specificity | Deep learning |
| Shaun Daly | Traditional models | AUC | Deep learning |
AUC, area under curve; ACR‐Lung‐RADS, American College of Radiology Lung Imaging Reporting and Data System; PPV, positive predictive value.