| Literature DB >> 34258160 |
Quan-Xing Liu1, Dong Zhou1, Tian-Cheng Han2, Xiao Lu1, Bing Hou1, Man-Yuan Li1, Gui-Xue Yang1, Qing-Yuan Li2, Zhi-Hua Pei2, Yuan-Yuan Hong2, Ya-Xi Zhang2, Wei-Zhi Chen2, Hong Zheng1, Ji He2, Ji-Gang Dai1.
Abstract
Addressing the high false-positive rate of conventional low-dose computed tomography (LDCT) for lung cancer diagnosis, the efficacy of incorporating blood-based noninvasive testing for assisting practicing clinician's decision making in diagnosis of pulmonary nodules (PNs) is investigated. In this prospective observative study, next generation sequencing- (NGS-) based cell-free DNA (cfDNA) mutation profiling, NGS-based cfDNA methylation profiling, and blood-based protein cancer biomarker testing are performed for patients with PNs, who are diagnosed as high-risk patients through LDCT and subsequently undergo surgical resections, with tissue sections pathologically examined and classified. Using pathological classification as the gold standard, statistical and machine learning methods are used to select molecular markers associated with tissue's malignant classification based on a 98-patient discovery cohort (28 benign and 70 malignant), and to construct an integrative multianalytical model for tissue malignancy prediction. Predictive models based on individual testing platforms have shown varying levels of performance, while their final integrative model produces an area under the receiver operating characteristic curve (AUC) of 0.85. The model's performance is further confirmed on a 29-patient independent validation cohort (14 benign and 15 malignant, with power > 0.90), reproducing AUC of 0.86, which translates to an overall sensitivity of 80% and specificity of 85.7%.Entities:
Keywords: cfDNA methylation; cfDNA mutations; circulating tumor DNA; lung cancer diagnosis; machine learning; protein cancer biomarkers; pulmonary nodules
Mesh:
Substances:
Year: 2021 PMID: 34258160 PMCID: PMC8261512 DOI: 10.1002/advs.202100104
Source DB: PubMed Journal: Adv Sci (Weinh) ISSN: 2198-3844 Impact factor: 16.806
Figure 1Overview of the patient cohort used in our study.
Summary of patient demographics and clinical profiles in our study
| Discovery cohort [ | Independent validation cohort [ | ||||
|---|---|---|---|---|---|
| Benign [ | Malignant [ | Benign [ | Malignant [ | ||
| Patient age | 51.8±8.0 | 58.7±8.1 | 52.5±7.8 | 59.1±9.0 | |
| Gender | Male | 16 (57.1%) | 39 (55.7%) | 7 (50%) | 3 (20%) |
| Female | 12 (42.9%) | 31 (44.3%) | 7 (50%) | 12 (80%) | |
| Smoking history | Smokers | 11 (39.3%) | 30 (42.9%) | 3 (21.4%) | 2 (13.3%) |
| Nonsmokers | 17 (60.7%) | 40 (57.1%) | 11 (78.6%) | 13 (86.7%) | |
| Drinking history | Drinking | 2 (7.1%) | 13 (18.6%) | 2 (14.3%) | 1 (6.7%) |
| Nondrinking | 26 (92.9%) | 57 (81.4%) | 12 (85.7%) | 14 (93.3%) | |
| Family cancer history | Yes | 3 (10.7%) | 17 (24.3%) | 3 (21.4%) | 1 (6.7%) |
| No | 25 (89.3%) | 53 (75.7%) | 11 (78.6%) | 14 (93.3%) | |
| Nodule type | Solid | 24 (85.7%) | 52 (74.3%) | 14 (100%) | 14 (93.3%) |
| Part‐solid | 4 (14.3%) | 18 (25.7%) | 0 (0%) | 1 (6.7%) | |
| Nodule length [cm] | 1.71±0.62 | 2.06±0.58 | 1.94±0.63 | 2.05±0.60 | |
| Nodule width [cm] | 1.34±0.53 | 1.67±0.53 | 1.45±0.60 | 1.63±0.60 | |
| Pathology assessment | Carcinoid | 1 (1.4%) | 0 (0%) | ||
| Squamous cell carcinoma | 6 (8.6%) | 0 (0%) | |||
| Large cell carcinoma | 2 (2.9%) | 0 (0%) | |||
| Small cell lung cancer | 3 (4.3%) | 0 (0%) | |||
| Adenocarcinoma | 58 (82.9%) | 15 (100%) | |||
| Hamartoma | 5 (17.9%) | 1 (7.1%) | |||
| Atypical adenomatous hyperplasia of the lung | 1 (3.6%) | 0 (0%) | |||
| Tuberculosis | 18 (64.3%) | 8 (57.1%) | |||
| Granulomatous inflammation | 1 (3.6%) | 3 (21.4%) | |||
| Epithelioid vascular mesothelioma | 1 (3.6%) | 0 (0%) | |||
| Sclerosing alveolar cell tumor | 2 (7.1%) | 1 (7.1%) | |||
| Castleman disease | 0 (0%) | 1 (7.1%) | |||
| Stage | Ia | 19 (27.1%) | 6 (40%) | ||
| Ib | 41 (58.6%) | 9 (60%) | |||
| IIb | 4 (5.7%) | 0 (0%) | |||
| IIIa | 4 (5.7%) | 0 (0%) | |||
| IIIb | 2 (2.9%) | 0 (0%) | |||
Figure 2Performance of various predictive models in forms of receiver operating characteristic (ROC) curves and area under curve (AUC) scores. The performance of various predictive models based on different feature sets, namely clinical features, protein biomarkers, cfDNA mutations, cfDNA methylation, and integrated multianalytical model, respectively. The ROC curves of models are shown as lines of different colors. AUC and the 95% CI of each model are shown in the legend. a) Models’ performance on the 98‐patient discovery cohort. b) Models’ performance on the 29‐patient independent validation cohort.
Figure 3cfDNA methylation profiles of all patients in the discovery cohort and the independent validation cohort, represented using the 30‐MCB feature set. Colored bars indicate the average methylation level for the corresponding MCB.
Figure 4Predictive outcome of various models in comparison to the gold standard pathology assessment on the independent validation cohort. Samples were sorted according to nodule length.
Figure 5Performance comparison between PET/CT and our integrative multianalytical model, in forms of receiver operating characteristic (ROC) curves and area under curve (AUC) scores.