| Literature DB >> 35693275 |
Guangyu Tao1, Dejun Shi2, Lingming Yu1, Chunji Chen3, Zheng Zhang4, Chang Min Park5,6,7,8, Edyta Szurowska9, Yinan Chen1, Rui Wang3, Hong Yu1.
Abstract
Background: Accurate preoperative prediction of the invasiveness of lung nodules on computed tomography (CT) can avoid unnecessary invasive procedures and costs for low-risk patients. While previous studies approached this task using cross-sectional data, this study aimed to utilize the commonly available longitudinal data of lung nodules through sequential modelling based on long short-term memory (LSTM) networks.Entities:
Keywords: Lung nodule; follow-up computed tomography (follow-up CT); long short-term memory (LSTM); sequential modelling; tumor invasiveness
Year: 2022 PMID: 35693275 PMCID: PMC9186168 DOI: 10.21037/tlcr-22-319
Source DB: PubMed Journal: Transl Lung Cancer Res ISSN: 2218-6751
Figure 1Flowchart of the clinical pipeline for managing lung nodules. The red box indicates that the sequential model harnesses all of the follow-up data collected in radiology to predict the preoperative invasiveness of lung nodules and to potentially save patients with non-invasive cancer from undergoing invasive procedures such as needle biopsy and surgical resection.
Figure 2Flowchart of the inclusion and exclusion process for the study population. NSCLC, non-small cell lung cancer; VATS, video-assisted thoracoscopic surgery; CT, computed tomography.
Figure 3The imaging history and hematoxylin & eosin staining (H & E, ×200) of a 56-year-old male patient with a part-solid lung nodule confirmed as invasive adenocarcinoma. The nodule and its surroundings are displayed using a lung window (upper row) and a mediastinal window (lower row) for three unevenly temporally-spaced CT scans, as indicated by the axis at the bottom. represents the C-dimensional imaging features extracted at time point ti [i∊(0,1,2)], and Y2 represents the invasiveness of the nodule in one-hot format (see for more information). CT, computed tomography.
Figure 4Model architecture of the LSTM-based recurrent neural network that receives the semantic features () of length C (C-dimensional input) from an arbitrary number of timepoints (tn) and predicts the probabilities of a nodule being invasive or less invasive (P2, indicating two classes). CE was used to compute the loss between the model prediction and ground truth Y2. represents the output feature vector of length K at time point ti from the corresponding LSTM layer. We experimentally investigated either the vector at the final time point or the mean of all F at all available timepoints as inputs for the FC head. The FC head consisted of a batch normalization layer, fully connected layer, and a SoftMax layer projecting the length K vector to length two vectors as a prediction. LSTM, long short-term memory; FC, fully connected; CE, cross entropy.
Characteristics of the lung nodules
| Characteristics | Less invasive cancers | Invasive cancers | Statistics (𝜒2/ |
|---|---|---|---|
| Gender (N) | 9.38** | ||
| Male | 11 | 47 | |
| Female | 48 | 65 | |
| Age (years) | 56.14±12.24 | 60.56±8.56 | −2.76** |
| Nodule location | 1.71 | ||
| Left upper | 14 | 33 | |
| Left lower | 7 | 10 | |
| Right upper | 26 | 42 | |
| Right middle | 3 | 9 | |
| Right lower | 9 | 12 | |
| Nodule morphology | 15.70** | ||
| Solid | 1 | 20 | |
| Part-solid | 0 | 9 | |
| Pure ground-glass | 58 | 83 | |
| Nodule feature on baseline | |||
| | 0.84±0.44 | 1.13±0.67 | −3.07** |
| | 0.70±0.35 | 0.85±0.43 | −2.22* |
| | 0.01±0.08 | 0.28±0.68 | −3.07** |
| | 0.01±0.06 | 0.20±0.45 | −3.18** |
| V on lung window (cm3) | 0.47±1.13 | 1.07±2.90 | −1.52 |
| | −506.43±169.04 | −387.30±275.33 | −3.03** |
| Volume doubling time (years Approx.) | 41.54±57.32 | 21.71±45.46 | 2.47* |
| Duration of follow-ups (days) | 703.88±556.66 | 827.18±524.56 | −1.43 |
| Number of follow-ups (N) | 4.39 | ||
| 1 | 1 | 1 | |
| 2 | 14 | 16 | |
| 3 | 6 | 20 | |
| 4 | 13 | 21 | |
| 5 | 7 | 18 | |
| 6 | 18 | 36 |
*, P<0.05; **, P<0.01. L and L are the nodule lengths in the major axis and minor axis on the axial view. CT, computed tomography; ROI, region of interest. Values are mean ± standard deviation.
Figure 5Scatter plot of the size (mean of the major axis length and minor axis length using a lung window setting) of individual nodules during each CT examination visit in the invasive group (A) and less-invasive group (B). CT, computed tomography.
Figure 6ROC curves of all classifiers for nodule invasiveness developed in this study. (A) Comparison of three logistic regression-based classifiers built on nodule features from different time points. (B) Comparison of several LSTM-based classifiers built with different hyperparameters. The details of these models can be found in the results section. (C) Comparison between the best logistic regression model and the best LSTM model. LR, logistic regression; AUC, area under the curve; LSTM, long short-term memory; ROC, receiver operating characteristics.
The performance of all classifiers experimented with in this study.
| Classifiers | Input variables | AUC (95% CI)d | Accuracy | Sensitivity | Specificity | |
|---|---|---|---|---|---|---|
| Number of variablesc | Timepoint(s) | |||||
| Logistic regression | 11 | Baseline | Failed to fit | |||
| 10 | Baseline | 0.716 (0.636–0.795)* | 0.678 | 0.804 | 0.441 | |
| 10 | Pre-surgery | 0.885 (0.831–0.939)* | 0.807 | 0.848 | 0.729 | |
| 10 | Two ends | 0.947 (0.908–0.986)* | 0.906 | 0.938 | 0.847 | |
| LSTM | ||||||
| D1a; Fmeanb | 11 | Two ends | 0.676 (0.595–0.756)* | 0.585 | 0.688 | 0.390 |
| D1; Fmean | 11 | All visits | 0.753 (0.680–0.826)* | 0.673 | 0.759 | 0.508 |
| D2; Fmean | 11 | All visits | 0.910 (0.868–0.951)* | 0.825 | 0.893 | 0.695 |
| D2; Flast | 11 | All visits | 0.955 (0.927–0.982)* | 0.871 | 0.893 | 0.830 |
| D2; Flast | 10 | All visits | 0.951 (0.922–0.981)* | 0.883 | 0.929 | 0.780 |
| D2; Flast | 7 | All visits | 0.977 (0.961–0.994)* | 0.900 | 0.920 | 0.864 |
| D2; Flast | 6 | All visits | 0.982 (0.966–0.997)* | 0.924 | 0.946 | 0.881 |
a, D1 stands for unidirectional LSTM; D2 stands for bidirectional LSTM; b, means that the embedding features to the LSTM from all time points were averaged first and then sent to the FC head, while Flast indicates that only the embedding features from the last time point were sent to the FC head for classification; c, each nodule was characterized by the 11 variables specified in the Methods section. Using logistic regression, only 10 variables were found to fit the model, and the volume on lung window setting (Vlw) was excluded. For the LSTM, we experimented with four combinations of variables: all 11 variables, 10 variables with Vlw removed, seven with the four unchanging ones (nodule location, nodule morphology, patient gender, and patient age) removed, and 6 variables with Vlw also removed from the set of seven variables; d, Delong’s test was used to compare the AUC (95% CI) performance of all classifiers with that of the logistic regression using 10 feature inputs from both the baseline visit and the preoperative visit (LR_N10@2Ends). Statistically significant differences were labeled as: *, P<0.001. AUC, area under the curve; CI, confidence interval; LSTM, long short-term memory.