| Literature DB >> 35031654 |
Akitoshi Shimazaki1, Daiju Ueda2,3, Antoine Choppin4, Akira Yamamoto1, Takashi Honjo1, Yuki Shimahara4, Yukio Miki1.
Abstract
We developed and validated a deep learning (DL)-based model using the segmentation method and assessed its ability to detect lung cancer on chest radiographs. Chest radiographs for use as a training dataset and a test dataset were collected separately from January 2006 to June 2018 at our hospital. The training dataset was used to train and validate the DL-based model with five-fold cross-validation. The model sensitivity and mean false positive indications per image (mFPI) were assessed with the independent test dataset. The training dataset included 629 radiographs with 652 nodules/masses and the test dataset included 151 radiographs with 159 nodules/masses. The DL-based model had a sensitivity of 0.73 with 0.13 mFPI in the test dataset. Sensitivity was lower in lung cancers that overlapped with blind spots such as pulmonary apices, pulmonary hila, chest wall, heart, and sub-diaphragmatic space (0.50-0.64) compared with those in non-overlapped locations (0.87). The dice coefficient for the 159 malignant lesions was on average 0.52. The DL-based model was able to detect lung cancers on chest radiographs, with low mFPI.Entities:
Mesh:
Year: 2022 PMID: 35031654 PMCID: PMC8760245 DOI: 10.1038/s41598-021-04667-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Flowchart of dataset selection.
Dataset demographics.
| Characteristic | Training dataset | Test dataset |
|---|---|---|
| 629 | 151 | |
| Men | 408 (65%) | 94 (62%) |
| Women | 221 (35%) | 57 (38%) |
| Men | 70 ± 8 | 70 ± 8 |
| Women | 69 ± 10 | 69 ± 10 |
| Chest radiographs (n) | 629 | 151 |
| No. of malignant nodules/masses | 652 | 159 |
| Mean nodule/mass size ± SD (mm) | 38 ± 21 | 33 ± 21 |
| ≤ 10 mm | 5 (0.77%) | 6 (3.8%) |
| 11–15 mm | 45 (6.9%) | 20 (13%) |
| 16–20 mm | 68 (10%) | 27 (17%) |
| 21–25 mm | 87 (13%) | 23 (14%) |
| 26–30 mm | 111 (17%) | 19 (12%) |
| 31–40 mm | 133 (20%) | 25 (16%) |
| 41–50 mm | 73 (11%) | 13 (8.1%) |
| > 50 mm | 130 (20%) | 26 (16%) |
| Total | 231 (35%) | 71 (45%) |
| Pulmonary apices | 48 (7.4%) | 21 (13%) |
| Pulmonary hila | 62 (9.5%) | 14 (8.8%) |
| Chest wall | 66 (10%) | 25 (16%) |
| Heart | 39 (6.0%) | 9 (5.7%) |
| Sub-diaphragmatic space | 16 (2.5%) | 2 (1.3%) |
| Right upper | 146 (22%) | 40 (25%) |
| Right middle | 147 (23%) | 21 (13%) |
| Right lower | 105 (16%) | 34 (21%) |
| Left upper | 73 (11%) | 17 (11%) |
| Left middle | 125 (19%) | 36 (23%) |
| Left lower | 56 (8.6%) | 11 (6.9%) |
| Traceable edge nodule | 204 (31%) | 65 (41%) |
| Traceable edge mass | 259 (40%) | 60 (38%) |
| Untraceable edge nodule | 112 (17%) | 30 (19%) |
| Untraceable edge mass | 77 (12%) | 4 (2.5%) |
| Primary lung cancer | 566 (87%) | 136 (86%) |
| Adenocarcinoma | 357 (55%) | 76 (48%) |
| Squamous cell carcinoma | 156 (24%) | 44 (28%) |
| Neuroendocrine carcinoma | 26 (4.0%) | 5 (3.1%) |
| Large cell carcinoma | 4 (0.61%) | 3 (1.9%) |
| Adenosquamous carcinoma | 18 (2.8%) | 1 (0.63%) |
| Sarcomatoid carcinoma | 4 (0.61%) | 7 (4.4%) |
| Salivary gland type carcinoma | 1 (0.15%) | 0 (0%) |
| Metastatic lung cancer | 82 (13%) | 23 (14%) |
| Malignant lymphoma | 4 (0.61%) | 0 (0%) |
Detection and segmentation performance of deep learning-based model in the test dataset.
| Characteristics | Values |
|---|---|
| Total sensitivity | 0.73 (0.66–0.79) |
| Dice coefficient ± SD | 0.52 ± 0.37 |
| ≤ 10 mm | 0.00 (0.00–0.00) |
| 11–15 mm | 0.38 (0.19–0.57) |
| 16–20 mm | 0.52 (0.33–0.70) |
| 21–25 mm | 0.83 (0.65–0.96) |
| 26–30 mm | 0.79 (0.58–0.95) |
| 31–40 mm | 1.00 (1.00–1.00) |
| 41–50 mm | 1.00 (1.00–1.00) |
| > 50 mm | 0.85 (0.69–0.96) |
| Pulmonary apices | 0.52 (0.33–0.71) |
| Pulmonary hila | 0.64 (0.36–0.86) |
| Chest wall | 0.52 (0.32–0.72) |
| Heart | 0.56 (0.22–0.89) |
| Sub-diaphragmatic space | 0.50 (0.00–1.00) |
| Non-overlapped lesions with normal anatomical structures | 0.87 (0.79–0.93) |
| Traceable edge | 0.87 (0.81–0.93) |
| Untraceable edge | 0.21 (0.06–0.35) |
Figure 2Free-response receiver-operating characteristic curve for the test dataset.
False positive output characteristics assessed by radiologists in the test dataset.
| Characteristics | No. of false positives |
|---|---|
| Total | 20 |
| Identified as some kind of structure | 19 (95%) |
| Non-calcified nodule-like output | 9 (45%) |
| Calcified nodule-like output | 4 (20%) |
| Not identified any structure | 1 (5.0%) |
| Calcified lung nodule | 5 (25%) |
| Pulmonary artery | 5 (25%) |
| Reticular opacity | 4 (20%) |
| Pleural plaque | 2 (10%) |
| Rib fracture | 1 (5.0%) |
| Bone island | 1 (5.0%) |
| Hilar lymph node | 1 (5.0%) |
| Not identified any structure | 1 (5.0%) |
| Total | 13 (65%) |
| Pulmonary apices | 4 (20%) |
| Pulmonary hila | 4 (20%) |
| Chest wall | 4 (20%) |
| Heart | 1 (0.5%) |
False negative nodule/mass characteristics in the test dataset.
| Characteristics | No. of false negatives |
|---|---|
| Total | 43 |
| ≤ 10 mm | 6 (14%) |
| 11–15 mm | 12 (28%) |
| 16–20 mm | 13 (30%) |
| 21–25 mm | 4 (9.3%) |
| 26–30 mm | 4 (9.3%) |
| 31–40 mm | 0 (0%) |
| 41–50 mm | 0 (0%) |
| > 50 mm | 4 (9.3%) |
| Total | 32 (74%) |
| Pulmonary apices | 10 (23%) |
| Pulmonary hila | 5 (12%) |
| Chest wall | 12 (28%) |
| Heart | 4 (9.3%) |
| Sub-diaphragmatic space | 1 (2.3%) |
| Right upper | 11 (26%) |
| Right middle | 5 (12%) |
| Right lower | 10 (23%) |
| Left upper | 2 (4.7%) |
| Left middle | 11 (23%) |
| Left lower | 4 (4.7%) |
Figure 3Two representative true positive cases. The images on the left are original images, and those on the right are images output by our model. (a) A 48-year-old woman with a nodule in the right lower lobe that was diagnosed as adenocarcinoma. The nodule was confused with rib and vessels (arrows). The model detected the nodule in the right middle lung field. (b) A 74-year-old woman with a nodule in the left lower lobe that was diagnosed as squamous cell carcinoma. The nodule overlapped with the heart (arrows). The lesion was identifiable by the model because its edges were traceable.
Figure 4Example of one false positive case. The image on the left is an original image, and the image on the right is an image output by our model. An 81-year-old woman with a mass in the right lower lobe that was diagnosed as squamous cell carcinoma. The mass in the right middle lung field (arrows) was carcinoma. Our model detected this lesion, and also detected a slightly calcified nodule in the right lower lung field (arrowhead). This nodule was an old fracture of the right tenth rib, but was misidentified as a malignant lesion because its shape was obscured by overlap with the right eighth rib and breast.
Figure 5Example of one false negative case. The image on the left is a gross image, and the image on the right is an enlarged image of the lesion. A 68-year-old man with a mass in the left lower lobe that was diagnosed as adenocarcinoma. This lesion overlapped with the heart and is only faintly visible (arrows). Our model failed to detect the mass.