| Literature DB >> 36010850 |
Gabriele C Forte1, Stephan Altmayer2, Ricardo F Silva3, Mariana T Stefani1, Lucas L Libermann1, Cesar C Cavion4, Ali Youssef5, Reza Forghani5, Jeremy King5, Tan-Lucien Mohamed5, Rubens G F Andrade3,4, Bruno Hochhegger5.
Abstract
We conducted a systematic review and meta-analysis of the diagnostic performance of current deep learning algorithms for the diagnosis of lung cancer. We searched major databases up to June 2022 to include studies that used artificial intelligence to diagnose lung cancer, using the histopathological analysis of true positive cases as a reference. The quality of the included studies was assessed independently by two authors based on the revised Quality Assessment of Diagnostic Accuracy Studies. Six studies were included in the analysis. The pooled sensitivity and specificity were 0.93 (95% CI 0.85-0.98) and 0.68 (95% CI 0.49-0.84), respectively. Despite the significantly high heterogeneity for sensitivity (I2 = 94%, p < 0.01) and specificity (I2 = 99%, p < 0.01), most of it was attributed to the threshold effect. The pooled SROC curve with a bivariate approach yielded an area under the curve (AUC) of 0.90 (95% CI 0.86 to 0.92). The DOR for the studies was 26.7 (95% CI 19.7-36.2) and heterogeneity was 3% (p = 0.40). In this systematic review and meta-analysis, we found that when using the summary point from the SROC, the pooled sensitivity and specificity of DL algorithms for the diagnosis of lung cancer were 93% and 68%, respectively.Entities:
Keywords: CNN; artificial intelligence; deep learning; deep learning networks; lung cancer
Year: 2022 PMID: 36010850 PMCID: PMC9405626 DOI: 10.3390/cancers14163856
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.575
Figure 1Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) flow diagram.
Characteristics of the included studies.
| Author | Year | Country | Study Design | Center | Artificial | Source | Threshold | Reference Standard | Method |
|---|---|---|---|---|---|---|---|---|---|
| Ardila et al. [ | 2019 | USA | retrospective | multicenter | CNN | Lung cancer screening dataset | PPV = 0.11 | Histopathology | External validation |
| Baldwin et al. [ | 2020 | UK | retrospective | multicenter | CNN | Private dataset | FN rate = 0% | Histopathology | External validation |
| Chen et al. [ | 2022 | China | retrospective | single | CNN | Private dataset | Unknown (third party software) | Histopathology | External validation |
| Çoruh et al. [ | 2021 | Turkey | retrospective | single | CNN | Private dataset | Youden index optimal cutoff | Histopathology | External validation |
| Trajanovski et al. [ | 2021 | USA | retrospective | multicenter | CNN | Lung cancer screening dataset | Sensitivity = 93% | Histopathology | External validation |
| Zhang et al. [ | 2019 | China | retrospective | multicenter | CNN | Private dataset | Probability of malignancy > 0.5 | Histopathology | Cross-validation |
CNN: convolutional neural network; PPV: positive predictive value; FN: false negative.
Figure 2Forest plot of the pooled sensitivity of DL in the detection and classification of lung cancer [37,38,39,40,41,42].
Figure 3Forest plot of the pooled specificity of DL in the detection and classification of lung cancer [37,38,39,40,41,42].
Figure 4Summarized receiver-operating curves (SROC) using the bivariate approach.