| Literature DB >> 31434796 |
Xuesi Dong1,2, Ruyang Zhang2,3,4,5, Jieyu He2, Linjing Lai2, Raphael N Alolga6, Sipeng Shen2,3,4, Ying Zhu2, Dongfang You2, Lijuan Lin2, Chao Chen2, Yang Zhao2, Weiwei Duan2, Li Su3,4, Andrea Shafer7, Moran Salama8, Thomas Fleischer9, Maria Moksnes Bjaanæs9, Anna Karlsson10, Maria Planck10, Rui Wang5, Johan Staaf10, Åslaug Helland9,11, Manel Esteller8, Yongyue Wei2,3,4, Feng Chen1,2,4,12, David C Christiani3,4,7.
Abstract
Limited studies have focused on developing prognostic models with trans-omics biomarkers for early-stage lung adenocarcinoma (LUAD). We performed integrative analysis of clinical information, DNA methylation, and gene expression data using 825 early-stage LUAD patients from 5 cohorts. Ranger algorithm was used to screen prognosis-associated biomarkers, which were confirmed with a validation phase. Clinical and biomarker information was fused using an iCluster plus algorithm, which significantly distinguished patients into high- and low-mortality risk groups (Pdiscovery = 0.01 and Pvalidation = 2.71×10-3). Further, potential functional DNA methylation-gene expression-overall survival pathways were evaluated by causal mediation analysis. The effect of DNA methylation level on LUAD survival was significantly mediated through gene expression level. By adding DNA methylation and gene expression biomarkers to a model of only clinical data, the AUCs of the trans-omics model improved by 18.3% (to 87.2%) and 16.4% (to 85.3%) in discovery and validation phases, respectively. Further, concordance index of the nomogram was 0.81 and 0.77 in discovery and validation phases, respectively. Based on systematic review of published literatures, our model was superior to all existing models for early-stage LUAD. In summary, our trans-omics model may help physicians accurately identify patients with high mortality risk.Entities:
Keywords: DNA methylation; early-stage; gene expression; lung adenocarcinoma; prognostic prediction
Mesh:
Substances:
Year: 2019 PMID: 31434796 PMCID: PMC6738411 DOI: 10.18632/aging.102189
Source DB: PubMed Journal: Aging (Albany NY) ISSN: 1945-4589 Impact factor: 5.682
Baseline characteristics of the study population.
| Age (years) | 67.1 ± 9.9 | 65.6 ± 10.5 | 65.5 ± 9.3 | 66.1 ± 10.4 | 65.4 ± 9.8 | 65.7 ± 9.6 |
| Gender, n (%) | ||||||
| Female | 50 (52.1) | 89 (48.6) | 62 (46.6) | 35 (43.2) | 152 (45.8) | 388 (47.0) |
| Smoking status, n(%) | ||||||
| Never | 17 (17.7) | 28 (15.6) | 17 (12.8) | 17 (21.0) | 47 (14.6) | 126 (15.5) |
| Former | 52 (54.2) | 97 (53.9) | 74 (55.6) | 39 (48.1) | 194 (60.2) | 456 (56.2) |
| Current | 27 (28.1) | 55 (30.6) | 42 (31.6) | 25 (30.9) | 81 (25.2) | 230 (28.3) |
| Clinical stage, n(%) | ||||||
| I | 72 (75.0) | 151 (82.5) | 93 (69.9) | 74 (91.4) | 230 (69.3) | 620 (75.2) |
| II | 24 (25.0) | 32 (17.5) | 40 (30.1) | 7 (8.6) | 102 (30.7) | 205 (24.8) |
| Chemotherapy, n(%) | ||||||
| Yes | 4 (4.2) | 14 (7.7) | 31 (23.3) | 4 (4.9) | 20 (6.0) | 73 (8.8) |
| No | 92 (95.8) | 142 (77.6) | 102 (76.7) | 50 (61.7) | 109 (32.8) | 495 (60.0) |
| Unknown | 0 | 27 | 0 | 27 | 203 | 257 |
| Radiotherapy, n(%) | ||||||
| Yes | 12 (12.5) | 8 (8.9) | 1 (0.8) | 0 (0.0) | 6 (4.7) | 27 (4.8) |
| No | 84 (87.5) | 148 (91.1) | 132 (99.2) | 54 (100.0) | 123 (95.3) | 541 (95.2) |
| Unknown | 0 | 27 | 0 | 27 | 203 | 257 |
| Adjuvant therapy, n(%) | ||||||
| Yes | 14 (14.5) | 21 (13.4) | 32 (24.0) | 4 (7.4) | 25 (19.3) | 96 (16.9) |
| No | 82 (85.5) | 135 (86.6) | 101 (76.0) | 50 (92.6) | 104 (80.7) | 472 (83.1) |
| Unknown | 0 | 27 | 0 | 27 | 203 | 257 |
| Survival year | ||||||
| Median survival year | 7.1 | 9.6 | 7.2 | 7.1 | 4.4 | 7.4 |
| Censored ratea, % | 0.3 | 58.5 | 68.4 | 40.7 | 80.7 | 63.4% |
a Censored rate is proportion of samples lost to follow-up or alive at end of the study.
TCGA: The Cancer Genome Atlas
Figure 1Out of bag (OOB) error rate derived from weighted random forest analysis. Top 62 and 38 DNA methylation probes in the discovery (A) and validation phases (B) reached a minimum OOB error rate. Top 9 and 13 mRNAs in the discovery (C) and validation phases (D) reached a minimum OOB error rate.
Figure 2Direct and indirect effects of DNA methylation on lung adenocarcinoma survival mediated through gene expression in casual mediation analysis. DNA methylation risk score (MRS) and gene expression risk score (GRS) were calculated by linear combination with a weighted ln(HRadjusted) of identified probes.
Figure 3Kaplan-Meier (KM) survial curves of high- and low-mortality risk groups divided by iCluster. Classification ability of clinical information for discovery (A) and validation phases (B). Distinction ability of clinical information adding trans-omics biomarkers of DNA methylation and gene expression for the discovery (C) and validation phases (D).
Figure 4Time-dependent receiver operating characteristic (ROC). ROC was used to evaluate the performance of prognostic models for 3-year (A) and 5-year (B) overall survival prediction in the discovery phase. ROC also was used to evaluate the performance of prognostic models for 3-year (C) and 5-year (D) overall survival prediction in the validation phase. C: clinical model; C+M+G: clinical, DNA methylation, and gene expression model.
Figure 5Nomogram constructed with clinical (red font) and trans-omics biomarkers (blue and green font) for overall survival. The probability of each predictor can be converted into the points axis in the top of the nomogram. The summary of these points of each predictor corresponded the total points at the bottom of the nomogram. After adding the points of each predictor in the total points axis, a patient’s probability of survival (3- and 5-year) can be found at the bottom of the nomogram. For example, if a patient got a score (e.g. 500), the 3-year survival probability will be corresponding to 0.80.
Figure 6Flow chart of the study.