| Literature DB >> 31853300 |
Rongjiong Zheng1, Haiqi Xu1, Wenjie Mao1, Zhennan Du1, Mingming Wang1, Meiling Hu2, Xiaolong Gu1.
Abstract
Lung adenocarcinoma (LACA) is the leading cause of cancer-associated death worldwide. The present study intended to identify DNA methylation patterns that may serve as diagnostic and prognostic biomarkers for LACA. Data on DNA methylation and the survival data of the patients of LACA were obtained from The Cancer Genome Atlas. Kaplan-Meier curves and receiver operating characteristic curve analysis were utilized to build diagnostic and prognostic models. A total of 13 CpG sites were identified and validated as the optimal diagnostic and prognostic signature for overall survival. It was concluded that the CpG-based signature is a reliable predictor for the diagnosis and prognosis of patients with LACA. Copyright: © Zheng et al.Entities:
Keywords: lung adenocarcinoma; methylation; prognostic signature
Year: 2019 PMID: 31853300 PMCID: PMC6909784 DOI: 10.3892/etm.2019.8200
Source DB: PubMed Journal: Exp Ther Med ISSN: 1792-0981 Impact factor: 2.447
Baseline clinical characteristics of the patients in the primary and validation cohorts.
| Characteristic | Primary cohort (n=220) | Validation cohort (n=440) | P-value |
|---|---|---|---|
| Age at diagnosis (years) | 66.6±9.9 | 65.4±10.1 | 0.86 |
| Sex | 1.00 | ||
| Female | 120 (54.5) | 240 (54.5) | |
| Male | 100 (45.5) | 200 (45.5) | |
| Ethnicity | 0.08 | ||
| Caucasian | 176 (80.0) | 340 (77.3) | |
| Of African descent | 22 (10.0) | 50 (11.4) | |
| Others[ | 22 (10.0) | 50 (11.4) | |
| Stage | 0.45 | ||
| I/II | 175 (77.3) | 336 (76.4) | |
| III/IV | 40 (18.2) | 97 (22.0) | |
| NA | 5 (2.3) | 7 (1.6) | |
| Survival status | 0.86 | ||
| Alive | 139 (63.2) | 275 (62.5) | |
| Dead | 81 (36.8) | 165 (37.5) |
Others includes patients of Native American, Asian and unknown descent/ethnicity. Values are expressed as the mean ± standard deviation or n (%). NA, not available.
Figure 1.Volcano plot of differentially methylated loci. The red dots represent hypermethylated and the green dots represent hypomethylated loci. FC, fold change; FDR, false discovery rate.
Top 5 hyper- and hypomethylated sites of differential methylation.
| A, Hypermethylated sites | |||
|---|---|---|---|
| Composite | Log FC | adj.P-val | Gene |
| cg16306898 | 4.07515 | 1.28×10−10 | TMEM240 |
| cg00648301 | 4.04696 | 3.99×10−12 | INSM1 |
| cg01869632 | 4.034999 | 8.17×10−3 | DUSP26 |
| cg18837178 | 4.034999 | 8.17×10−3 | LINC01194 |
| cg22449330 | 4.034999 | 8.17×10−3 | WDPCP |
| cg05100666 | −4.80454 | 1.76×10−3 | BRD9 |
| cg15998127 | −4.80454 | 1.76×10−3 | HDAC4 |
| cg07764932 | −4.17089 | 1.87×10−7 | ARHGAP6 |
| cg12581354 | −4.17089 | 1.87×10−7 | RP11-175P13.2 |
| cg27649653 | −4.17089 | 1.87×10−7 | AC010642.1 |
FC, fold change; adj.P-val, adjusted P-value. TMEM240, transmembrane protein 240; INSM1, INSM transcriptional repressor 1; DUSP26, dual specificity phosphatase 26; LINC01194, long intergenic non-protein coding RNA 1194; WDPCP, WD repeat containing planar cell polarity effector; BRD9, bromodomain containing 9; HDAC4, histone deacetylase 4; ARHGAP6, Rho GTPase activating protein 6.
Figure 2.Kaplan-Meier survival curves of the methylated sites in (A) the primary cohort and (B) the validation cohort. The red data-points represent high risk group and the blue data-points represent low risk group. HR, hazard ratio.
Methylation loci significantly associated with survival.
| Composite | Chromosome | Start | End | Gene |
|---|---|---|---|---|
| cg00002719 | 1 | 169427468 | 169427469 | CCDC181 |
| cg02769743 | 1 | 9608344 | 9608345 | TMEM201 |
| cg05239163 | 1 | 154218790 | 154218791 | C1orf43 |
| cg05507908 | 5 | 75237454 | 75237455 | ANKRD31 |
| cg07918170 | 17 | 82932272 | 82932273 | TBCD |
| cg08213398 | 11 | 9722579 | 9722580 | SWAP70 |
| cg08516516 | 5 | 115816795 | 115816796 | CDO1 |
| cg08623223 | 8 | 11283906 | 11283907 | AF131216.1 |
| cg12748948 | 1 | 21779802 | 21779803 | USP48 |
| cg14904034 | 2 | 10389150 | 10389151 | HPCAL1 |
| cg16007456 | 1 | 100539082 | 100539083 | GPR88 |
| ch.6.2958553R | 6 | 152198831 | 152198831 | SYNE1 |
| cg19868631 | 7 | 54542083 | 54542084 | VSTM2A |
CCDC181, coiled-coil domain containing 181; TMEM201, transmembrane protein 201; C1orf43, chromosome 1 open reading frame 43; ANKRD31, ankyrin repeat domain 31; TBCD, tubulin folding cofactor D; SWAP70, switching B cell complex subunit SWAP70; CDO1, cysteine dioxygenase type 1; USP48, ubiquitin specific peptidase 48; HPCAL1, hippocalcin like 1; GPR88, G protein-coupled receptor 88; SYNE1, spectrin repeat containing nuclear envelope protein 1; VSTM2A, V-set and transmembrane domain containing 2A.
Figure 3.ROC curve analysis for the survival prediction by methylated sites in (A) the primary cohort and (B) the validation cohort. ROC, receiver operating characteristic; AUC, area under the ROC curve.
Figure 4.Clustering analyses of significantly enriched biological process terms. The x-axis represents the number of genes accumulated in the pathways. Darker color indicates higher significance (smaller P-value). GO, Gene Ontology.