| Literature DB >> 32970157 |
Hyunsuk Yoo1, Ki Hwan Kim1, Ramandeep Singh2,3, Subba R Digumarthy2,3, Mannudeep K Kalra2,3.
Abstract
Importance: The improvement of pulmonary nodule detection, which is a challenging task when using chest radiographs, may help to elevate the role of chest radiographs for the diagnosis of lung cancer. Objective: To assess the performance of a deep learning-based nodule detection algorithm for the detection of lung cancer on chest radiographs from participants in the National Lung Screening Trial (NLST). Design, Setting, and Participants: This diagnostic study used data from participants in the NLST ro assess the performance of a deep learning-based artificial intelligence (AI) algorithm for the detection of pulmonary nodules and lung cancer on chest radiographs using separate training (in-house) and validation (NLST) data sets. Baseline (T0) posteroanterior chest radiographs from 5485 participants (full T0 data set) were used to assess lung cancer detection performance, and a subset of 577 of these images (nodule data set) were used to assess nodule detection performance. Participants aged 55 to 74 years who currently or formerly (ie, quit within the past 15 years) smoked cigarettes for 30 pack-years or more were enrolled in the NLST at 23 US centers between August 2002 and April 2004. Information on lung cancer diagnoses was collected through December 31, 2009. Analyses were performed between August 20, 2019, and February 14, 2020. Exposures: Abnormality scores produced by the AI algorithm. Main Outcomes and Measures: The performance of an AI algorithm for the detection of lung nodules and lung cancer on radiographs, with lung cancer incidence and mortality as primary end points.Entities:
Year: 2020 PMID: 32970157 PMCID: PMC7516603 DOI: 10.1001/jamanetworkopen.2020.17135
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Participant Characteristics
| Characteristic | No. (%) | |
|---|---|---|
| Full T0 data set | Nodule data set | |
| Total participants, No. | 5485 | 577 |
| Participants with cancer | 48 (0.9) | 48 (8.3) |
| Age, mean (SD) | 61.7 (5.0) | 62.2 (5.1) |
| Sex | ||
| Male | 3030 (55.2) | 322 (55.8) |
| Female | 2455 (44.8) | 255 (44.2) |
| Race | ||
| White | 5145 (93.8) | 539 (93.4) |
| Black or African American | 222 (4.0) | 22 (3.8) |
| Asian | 39 (0.7) | 5 (0.9) |
| American Indian or Alaskan Native | 17 (0.3) | 3 (0.5) |
| Native Hawaiian or other Pacific Islander | 1 (0.02) | 0 |
| >1 race | 36 (0.7) | 6 (1.0) |
| Unavailable | 25 (0.5) | 2 (0.4) |
| Ethnicity | ||
| Hispanic or Latino | 77 (1.4) | 5 (0.9) |
| Not Hispanic or Latino | 5385 (98.2) | 572 (99.1) |
| Unavailable | 23 (0.4) | 0 |
| Smoking status | ||
| Former | 2765 (50.4) | 306 (53.0) |
| Current | 2720 (49.6) | 271 (47.0) |
| Type of chest radiography used | ||
| Screen film | 47 (0.9) | 4 (0.7) |
| Computed | 2861 (52.2) | 296 (51.3) |
| Digital | 2108 (38.4) | 246 (42.6) |
| Thoravision | 465 (8.5) | 31 (5.4) |
| Unavailable | 4 (0.07) | 0 |
| Outcomes | ||
| Follow-up, median (IQR), y | 6.5 (6.1-6.9) | 6.5 (6.1-6.8) |
| Mortality | 380 (6.9) | 53 (9.2) |
Abbreviations: IQR, interquartile range; T0, baseline.
Figure 1. Receiver Operating Characteristic Curve of the Performance of the Artificial Intelligence Algorithm vs NLST Radiologists for the Detection of Noncalcified Nodules in the Nodule Data Set
Colored lines represent results from the artificial intelligence algorithm, and colored Xs represent results from NLST radiologists. AUROC indicates area under the receiver operating characteristic; CR, computed radiography; DR, digital radiography; and NLST, National Lung Screening Trial.
Comparison of Performance of Artificial Intelligence Algorithm vs National Lung Screening Trial Radiologists
| Variable | All images | Digital radiographic images | Computed radiographic images | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AI | NLST nodule | NLST cancer | P value | AI | NLST nodule | NLST cancer | P value | AI | NLST nodule | NLST cancer | P value | ||||
| AI vs NLST nodule | AI vs NLST cancer | AI vs NLST nodule | AI vs NLST cancer | AI vs NLST nodule | AI vs NLST cancer | ||||||||||
| Sensitivity (all cancer detection) | |||||||||||||||
| Nodule data set | 75.0 (62.8-87.2) | 77.1 (65.2-89.0) | 85.4 (75.4-95.4) | .78 | .13 | 76.0 (59.3-92.7) | 68.0 (49.7-86.3) | 80.0 (64.3-95.7) | .41 | .65 | 68.4 (47.5-89.3) | 84.2 (67.8-100.0) | 89.5 (75.7-100.0) | .26 | .10 |
| Full T0 data set | 75.0 (62.8-87.2) | 77.1 (65.2-89.0) | 85.4 (75.4-95.4) | .78 | .13 | 76.0 (59.3-92.7) | 68.0 (49.7-86.3) | 80.0 (64.3-95.7) | .41 | .65 | 68.4 (47.5-89.3) | 84.2 (47.5-89.3) | 89.5 (75.7-100.0) | .26 | .10 |
| Specificity (all cancer detection) | |||||||||||||||
| Nodule data set | 81.7 (78.4-85.0) | 83.4 (80.2-86.5) | 83.9 (80.8-87.1) | .43 | .30 | 91.0 (87.2-94.7) | 80.5 (75.3-85.8) | 82.4 (77.3-87.4) | .001 | .009 | 74.7 (69.6-79.8) | 85.6 (81.4-89.7) | 85.2 (81.0-89.4) | <.001 | <.001 |
| Full T0 data set | 83.3 (82.3-84.3) | 91.2 (90.4-91.9) | 91.5 (90.7-92.2) | <.001 | <.001 | 90.0 (89.7-92.2) | 90.4 (89.1-91.7) | 91.1 (89.9-92.3) | .52 | .82 | 76.7 (75.2-78.3) | 91.6 (90.5-92.6) | 91.4 (90.3-92.4) | <.001 | <.001 |
| Sensitivity (malignant pulmonary nodule detection) | |||||||||||||||
| Nodule data set | 94.1 (86.2-100.0) | 91.2 (81.6-100.0) | 94.1 (86.2-100.0) | .65 | >.99 | 100.0 (100.0-100.0) | 88.2 (72.9-100.0) | 94.1 (82.9-100.0) | .16 | .32 | 85.7 (67.4-100.0) | 92.9 (79.4-100.0) | 92.9 (79.4-100.0) | .56 | .56 |
| Full T0 data set | 94.1 (86.2-100.0) | 91.2 (81.6-100.0) | 94.1 (86.2-100.0) | .65 | >.99 | 100.0 (100.0-100.0) | 88.2 (72.9-100.0) | 94.1 (82.0-100.0) | .16 | .32 | 85.7 (67.4-100.0) | 92.9 (79.4-100.0) | 92.9 (79.4-100.0) | .56 | .56 |
| Specificity (malignant pulmonary nodule detection) | |||||||||||||||
| Nodule data set | 81.4 (78.1-84.7) | 82.7 (79.5-85.9) | 82.7 (79.5-85.9) | .56 | .56 | 90.4 (86.6-94.2) | 80.3 (75.2-85.5) | 81.2 (76.2-86.3) | .002 | .005 | 74.8 (69.8-79.9) | 84.8 (80.6-88.9) | 84.0 (79.8-88.3) | .002 | .003 |
| Full T0 data set | 83.3 (82.3-84.3) | 91.1 (90.3-91.8) | 91.3 (90.6-92.1) | <.001 | <.001 | 90.9 (89.6-92.1) | 90.3 (89.1-91.6) | 91.0 (89.7-92.2) | .53 | .91 | 76.7 (75.2-78.3) | 91.5 (90.4-92.5) | 91.3 (90.2-92.3) | <.001 | <.001 |
Abbreviations: AI, artificial intelligence; NLST, National Lung Screening Trial; NLST cancer, National Lung Screening Trial radiologists using cancer label; NLST nodule, National Lung Screening Trial radiologists using nodule label; T0, baseline.
Figure 2. Frontal Chest Radiographs of Patients With Malignant Pulmonary Nodules Missed by NLST Radiologists But Detected by Artificial Intelligence Algorithm
A, Chest radiograph of woman in her 60s (without AI detection). The woman was diagnosed with lung cancer 86 days after baseline imaging. B, Chest radiograph of woman in her 60s (with AI detection). The AI algorithm detected the missed subtle abnormality (in green, with nodule score of 38%) in the left perihilar region. C, Chest radiograph of man in his 50s (without AI detection). The man was diagnosed with lung cancer 127 days after baseline imaging. D, Chest radiograph of man in his 50s (with AI detection). The AI algorithm detected the missed subcentimeter nodule (in green, with nodule score of 53%) in the right upper lung zone. AI indicates artificial intelligence; and NLST, National Lung Screening Trial.