| Literature DB >> 35311112 |
Maosong Ye1, Lin Tong1,2, Xiaoxuan Zheng3,4, Hui Wang5,6, Haining Zhou7, Xiaoli Zhu8, Chengzhi Zhou9, Peige Zhao10, Yan Wang11, Qi Wang12, Li Bai13, Zhigang Cai14, Feng-Ming Spring Kong15, Yuehong Wang16, Yafei Li17, Mingxiang Feng18, Xin Ye19,20, Dawei Yang1, Zilong Liu1, Quncheng Zhang6, Ziqi Wang6, Shuhua Han8, Lihong Sun11, Ningning Zhao11, Zubin Yu21, Juncheng Zhang19,20, Xiaoju Zhang6, Ruth L Katz22, Jiayuan Sun3,4, Chunxue Bai1.
Abstract
Lung cancer is the leading cause of cancer-related deaths worldwide and in China. Screening for lung cancer by low dose computed tomography (LDCT) can reduce mortality but has resulted in a dramatic rise in the incidence of indeterminate pulmonary nodules, which presents a major diagnostic challenge for clinicians regarding their underlying pathology and can lead to overdiagnosis. To address the significant gap in evaluating pulmonary nodules, we conducted a prospective study to develop a prediction model for individuals at intermediate to high risk of developing lung cancer. Univariate and multivariate logistic analyses were applied to the training cohort (n = 560) to develop an early lung cancer prediction model. The results indicated that a model integrating clinical characteristics (age and smoking history), radiological characteristics of pulmonary nodules (nodule diameter, nodule count, upper lobe location, malignant sign at the nodule edge, subsolid status), artificial intelligence analysis of LDCT data, and liquid biopsy achieved the best diagnostic performance in the training cohort (sensitivity 89.53%, specificity 81.31%, area under the curve [AUC] = 0.880). In the independent validation cohort (n = 168), this model had an AUC of 0.895, which was greater than that of the Mayo Clinic Model (AUC = 0.772) and Veterans' Affairs Model (AUC = 0.740). These results were significantly better for predicting the presence of cancer than radiological features and artificial intelligence risk scores alone. Applying this classifier prospectively may lead to improved early lung cancer diagnosis and early treatment for patients with malignant nodules while sparing patients with benign entities from unnecessary and potentially harmful surgery. Clinical Trial Registration Number: ChiCTR1900026233, URL: http://www.chictr.org.cn/showproj.aspx?proj=43370.Entities:
Keywords: artificial intelligence; early diagnosis; liquid biopsy; lung cancer; prediction model
Year: 2022 PMID: 35311112 PMCID: PMC8924612 DOI: 10.3389/fonc.2022.853801
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1Schematic Diagram of the Study Design.
Figure 2End-to-end deep convolutional neural network-based Artificial Intelligence low-dose computed tomography analysis toll development procedures, (A) A three-dimensional (3D) U-net-based convolutional neural network was used for the segmentation of lung nodules to identify suspicious nodules; (B) the 3D patches of the suspicious nodules were cropped and forwarded to a false-positive reduction network to discriminate the true clinically positive nodules from the false-positive nodules; (C) the patches that were labeled as positive were forwarded to a convolutional neural network-based classifier to determine whether the nodule was malignant or benign.
Figure 3Sample process procedures of liquid biopsy via 4-color fluorescent in situ hybridization (FISH) assay. (A) Peripheral blood from patients with indeterminate or high-risk nodules. (B) The peripheral blood mononuclear cells layer was isolated after configuration. (C) The peripheral blood mononuclear cells were applied to a glass slide. (D) Hybridization with 4-color FISH probes. (E) The result of the assay, scanned with a Duet microscope system.
Clinical characteristics of the study participants.
| Variables | Benign Nodule | Malignant Nodule | |||
|---|---|---|---|---|---|
| Training Cohort | Validation Cohort | Training Cohort | Validation Cohort | ||
| n = 135 | n = 63 | n = 425 | n = 105 | ||
| Age, y, mean, range | 55 (18–81) | 57 (30–82) | 60 (25–82) | 57 (25–81) | |
| Sex, no. of participants (%) | |||||
| Male | 76 (56%) | 37 (59%) | 204 (48%) | 46 (44%) | |
| Female | 59 (44%) | 26 (41%) | 221 (52%) | 59 (56%) | |
| Smoking history, no. of participants (%) | |||||
| Current or past smoker^ | 25 (19%) | 17 (27%) | 251 (59%) | 73 (70%) | |
| Nonsmoker | 110 (81%) | 46 (73%) | 174 (41%) | 32 (30%) | |
| Family history, no. of participants (%) | |||||
| Yes | 8 (6%) | 24 (38%) | 42 (10%) | 39 (37%) | |
| No | 127 (94%) | 39 (62%) | 383 (90%) | 66 (63%) | |
| Diameter of the nodule, millimeter, mean, range | 12 (1–29) | 9 (2–23) | 17 (1–30) | 14 (4–30) | |
| Nodule count, no. of participants (%) | |||||
| Single | 80 (59%) | 19 (30%) | 335 (79%) | 33 (31%) | |
| Multiple | 55 (41%) | 44 (70%) | 90 (21%) | 72 (69%) | |
| Type of nodule, no. of participants (%) | |||||
| Solid | 97 (72%) | 27 (43%) | 207 (49%) | 20 (19%) | |
| Subsolid | 38 (28%) | 36 (57%) | 218 (51%) | 85 (81%) | |
| Nodule location, no. of participants (%) | |||||
| Upper lobe | 61 (45%) | 24 (38%) | 251 (59%) | 68 (65%) | |
| Non-upper lobe | 74 (55%) | 39 (62%) | 174 (41%) | 37 (35%) | |
| Nodule edge, no. of participants (%) | |||||
| Entirely smooth | 85 (63%) | 35 (56%) | 144 (34%) | 16 (15%) | |
| Malignant signs* | 50 (37%) | 28 (44%) | 281 (66%) | 89 (85%) | |
| Malignant subtypes | |||||
| Adenocarcinoma | 361 (85%) | 97 (92%) | |||
| Squamous cell carcinoma | 23 (5%) | 3 (3%) | |||
| Others | 41 (10%) | 5 (5%) | |||
| Cancer stage | |||||
| IA1 | 103 (24%) | 45 (43%) | |||
| IA2 | 176 (42%) | 45 (43%) | |||
| IA3 | 146 (34%) | 15 (14%) | |||
^Current and past smokers were identified as 20 pack-years and a quit time of <15 years, respectively.
*Signs of malignancy indicate nodules with one or more of the following: lobulation, spiculation, vacuole sign, pleural indentation, vessel convergence sign, or other radiological signs of malignancy.
Figure 4(A) The area under the curve (AUC) of AI was 0.740 in the overall cohort. (B) The AUC of liquid biopsy was 0.765 on the overall cohort. (C) The sensitivity was 82.8%, and the specificity was 80.95 in the independent validation cohort for the best performing model (model 4). (D) In the validation cohort, the areas under the curve were 0.895, 0.772, and 0.740 for model 4, the Mayo Clinic Model, and the VA model, respectively.
Univariate analyses of predictors of malignancy.
| Variable | Odds Ratio(95% CI) | P-value |
|---|---|---|
| Age* | 1.041 (1.024–1.059) | <0.001 |
| Sex | 0.717 (0.485–1.058) | 0.094 |
| Current or past smoking* | 6.347 (3.946–10.210) | <0.001 |
| Family history | 1.741 (0.796–3.806) | 0.165 |
| Nodule diameter* | 1.106 (1.073–1.140) | <0.001 |
| Nodule count* | 0.786 (0.703–0.879) | <0.001 |
| Subsolid status* | 2.713 (1.780–4.133) | <0.001 |
| Upper lobe* | 1.750 (1.85–2.585) | 0.005 |
| Malignant signs at the nodule edge* | 3.247 (2.159–4.882) | <0.001 |
| AI risk score* | 36.891 (15.745–86.441) | <0.001 |
| Liquid biopsy result* | 1.379 (1.260–1.511) | <0.001 |
*Indicates significantly associated with lung cancer.
Figure 5Correlation Heat Map of Individual Predictors in the Training Cohort.
Ten-fold cross validation result of classifiers with different predictors.
| Predictors | Sensitivity (mean, 95% CI) | Specificity (mean, 95% CI) | AUC (mean, 95% CI) | |
|---|---|---|---|---|
| Model 1 | Diameter + nodule count + subsolid status + upper lobe location + malignant signs at the nodule edge | 89.01% (82.00–96.03%) | 62.52% (50.33–74.70%) | 0.769 (0.719–0.820) |
| Model 2 | Diameter + nodule count + subsolid status + upper lobe location + malignant signs at the nodule edge + AI risk score | 89.18% (81.30–97.09%) | 65.96% (53.13–78.80%) | 0.791 (0.737–0.845) |
| Model 3 | Age + smoking + diameter + nodule count + subsolid status + upper lobe location + malignant signs at the nodule edge + liquid biopsy result | 86.29% (77.32–95.25%) | 83.25% (76.70–89.80%) | 0.872 (0.846–0.900) |
| Model 4 | Age + smoking + diameter + nodule count + subsolid status + upper lobe location + malignant signs at the nodule edge + AI risk score + liquid biopsy result | 89.53% (81.79–97.26%) | 81.31% (76.43–86.18%) | 0.880 (0.852–0.910) |
CI, confidence interval; AUC, area under the curve.