| Literature DB >> 33791593 |
Pritam Mukherjee1, Mu Zhou1, Edward Lee2, Anne Schicht3, Yoganand Balagurunathan4, Sandy Napel5, Robert Gillies4, Simon Wong2, Alexander Thieme3, Ann Leung5, Olivier Gevaert1,6.
Abstract
Lung cancer is the most common fatal malignancy in adults worldwide, and non-small cell lung cancer (NSCLC) accounts for 85% of lung cancer diagnoses. Computed tomography (CT) is routinely used in clinical practice to determine lung cancer treatment and assess prognosis. Here, we developed LungNet, a shallow convolutional neural network for predicting outcomes of NSCLC patients. We trained and evaluated LungNet on four independent cohorts of NSCLC patients from four medical centers: Stanford Hospital (n = 129), H. Lee Moffitt Cancer Center and Research Institute (n = 185), MAASTRO Clinic (n = 311) and Charité - Universitätsmedizin (n=84). We show that outcomes from LungNet are predictive of overall survival in all four independent survival cohorts as measured by concordance indices of 0.62, 0.62, 0.62 and 0.58 on cohorts 1, 2, 3, and 4, respectively. Further, the survival model can be used, via transfer learning, for classifying benign vs malignant nodules on the Lung Image Database Consortium (n = 1010), with improved performance (AUC=0.85) versus training from scratch (AUC=0.82). LungNet can be used as a noninvasive predictor for prognosis in NSCLC patients and can facilitate interpretation of CT images for lung cancer stratification and prognostication.Entities:
Year: 2020 PMID: 33791593 PMCID: PMC8008967 DOI: 10.1038/s42256-020-0173-6
Source DB: PubMed Journal: Nat Mach Intell ISSN: 2522-5839
Fig. 1.Illustration of the proposed computational framework: (a) Input data for the two transfer learning tasks: CT images with survival time and CT images with malignancy scores and for a subset of patients the biopsy proven malignancy. (b) Training and validation of LungNet, a convolutional neural network model including transfer learning between the two tasks. (c) Evaluation of the two tasks using Kaplan Meier survival curves, and ROC curves.
Patient demographics: the number in parentheses represents percentage.
| Characteristic | cohort 1 | cohort 2 | cohort 3 | cohort 4 |
|---|---|---|---|---|
| Number of patients | 129 | 185 | 311 | 84 |
| Age (yrs., Mean ± SD) | 69.4 ± 8.47 | 67.65 ± 10.13 | 67.05 ±9.07 | |
| Sex (n Male, %) | 101 (78.3) | 83 (44.3) | 220 (70.7) | 64(76.2) |
| Smoking history | 20(15.5) | 38(20.5) | ||
| Adenocarcinoma | 100(77.5) | 107 (57.8) | 32(10.3) | 36 (42.9) |
| Squamous Carcinoma | 29(22.5) | 50 (27) | 84 (27) | 44 (52.4) |
| Other histology type(s) | 28(15.2) | 195(62.7) | 4 (4.8) | |
| Survival time (days, Mean) | 889 | 1021 | 609 | 944 |
| Survival time (days, SD) | 671 | 504 | 457 | 710 |
| Stage 1 | 67 (51.9) | 97(52.4) | 81 (26.0) | 5 (6.0) |
| Stage 2 | 42 (32.6) | 32(17.3) | 26(8.4) | 10(11.9) |
| Stage 3 | 15(11.6) | 38(20.5) | 73 (23.5) | 69 (82.1) |
| Stage 4 | 5(3.9) | 17(9.7) | 131 (42.1) |
Fig. 2Kaplan-Meier analysis of LungNet. (a) Kaplan-Meier survival performance of LungNet on four lung cancer survival cohorts. For Cohort 1, the LungNet model incorporates clinical features; for the other cohorts, the images-only version of LungNet was used. LungNet demonstrates stratification of low- and high-risk survival subgroups on four independent cohorts. (b) Kaplan-Meier survival performance of clinical-only models on four lung cancer survival cohorts. The median of the predicted risk scores was used to stratify patients into high and low risk groups.
Fig. 3.Kaplan-Meier survival performance of LungNet on early stage cancers. It shows that LungNet can stratify low- and high-risk survival subgroups on three independent cohorts for early stage cancers. For cohort 1, the LungNet model incorporates clinical features; for the other cohorts, the images-only version of LungNet is used.
Fig. 4.Receiver operating characteristic (ROC) curves for maligancy outcome prediction comparison with and without transfer learning for (a) magliancy by radiologist assessment and (b) biopsy-proven for the LIDC-IDRI cohort.
Fig. 5.Visualization of lung nodules and their survival outcomes in 2D space using t-SNE. The output features of LungNet are embedded into a two-dimensional manifold via a t-distributed stochastic neighbor embedding (t-SNE). The color-coded map is created based on the median survival time. High-risk patients (below the median survival threshold) are highlighted in red and clustered to the far right while low-risk patients (above the median survival threshold) are blue and clustered in the bottom left.
CT acquisition parameters: the number in parentheses represents percentage.
| CT parameters | cohort 1 | cohort 2 | cohort 3 | cohort 4 |
|---|---|---|---|---|
| STANDARD | 43 (37.2) | 6(3-2) | 4(4.8) | |
| BONEPLUS | 21(16.3) | |||
| LUNG | 26 (20.2) | 1 (0.5) | 4 (4.8) | |
| B45f | 12(9.3) | |||
| BONE | 5(3.9) | |||
| B40f | 1(0.8) | 83 (44.9) | ||
| B41f | 59 (31.9) | 12 (3.9) | ||
| B30f | 13 (7.0) | 67 (21.5) | 13(15.5) | |
| B19f | 66 (21.2) | |||
| BBlf | 1 (0.8) | 25 (8.0) | 2 (2.4) | |
| B31s | 2(1.1) | 29 (9.3) | 26(31.0) | |
| B18f | 17(5.5) | |||
| Other/NA | 15(11.6) | 21(11.3) | 95 (30.5) | 35 (41.7) |
| 120 | 119 (92.2) | 154(83.2) | 155(49.8) | 81 (96.4) |
| 130 | 4(2.2) | 3(3.6) | ||
| 140 | 27 (14.6) | 67 (21.5) | ||
| Other/NA | 10 (7.8) | 89 (28.6) | ||
| GE | 97(75.2) | 8(4.3) | 6(7.1) | |
| Siemens | 13(10.1) | 169 (91.4) | 222 (71.4) | 50(59.5) |
| Philips | 2 (1.6) | 4(2.2) | 20(23.8) | |
| Toshiba | 1(0.8) | 4(2.2) | 8(9.5) | |
| Other/NA | 89 (28.6) | |||
| ≤1mn | 21 (16.3) | 12 (14.3) | ||
| ϵ (1, 2] mm | 75 (58.1) | 47 (56.0) | ||
| ϵ (2, 3] mm | 15 (11.6) | 18 (9.7) | 311 (100) | 11 (13.1) |
| ϵ (3, 4] mm | 7 (5.4) | 19 (10.3) | 4 (3.1) | |
| ϵ (4, 5] mm | 7 (5.4) | 122 (65.9) | 10 (11.9) | |
| > 5 mm | 4 (3.1) | 26 (14.1) |
Fig. 6.Illustration of LungNet’s convolutional neural network (CNN) architecture. LungNet is a CNN architecture that is designed to address the survival prediction task. In addition, we show that LungNet can be used for the malignancy prediction task using transfer learning. It consists of three 3D convolutional layers with along with a 3D max-pooling layer. Three fully-connected layers were concatenated to reduce feature dimensions. Cox Proportional-hazard loss and Cross-entropy loss functions were used for the survival prediction and malignancy prediction tasks, respectively.