| Literature DB >> 36111050 |
Shuying Wang1,2, Xiaoxing Liang3, Ruomi Guo4, Jiao Gong5, Xiaolong Zhong6, Yulin Liu6, Deqing Wang1,2, Yanmei Hao3, Bo Hu5.
Abstract
Background: Molecular typing based on deoxyribonucleic acid (DNA) methylation and gene expression can extend understandings of the molecular mechanisms involved in lung adenocarcinoma (LUAD) and enhance current diagnostic, treatment, and prognosis prediction approaches.Entities:
Keywords: Gene Expression Omnibus (GEO); Lung adenocarcinoma (LUAD); The Cancer Genome Atlas (TCGA); cancer subtypes; methylation
Year: 2022 PMID: 36111050 PMCID: PMC9469139 DOI: 10.21037/atm-22-3340
Source DB: PubMed Journal: Ann Transl Med ISSN: 2305-5839
The clinical characteristics of the 2 subtypes from the validation data set
| Characteristics | Subtype 1 (n=37) | Subtype 2 (n=21) | P |
|---|---|---|---|
| Age (years), mean (SD) | 69.41 (9.05) | 66.29 (9.06) | 0.213 |
| Gender (male), n (%) | 8 (21.6) | 5 (23.8) | 1.000 |
| Smoking status (never), n (%) | 22 (59.5) | 7 (33.3) | 0.101 |
| Stage, n (%) | 0.003 | ||
| Stage I | 28 (75.7) | 6 (28.6) | |
| Stage II | 3 (8.1) | 8 (38.1) | |
| Stage III | 6 (16.2) | 6 (28.6) | |
| Stage IV | 0 (0.0) | 1 (4.8) | |
| Recurrence (yes), n (%) | 10 (27.0) | 9 (42.9) | 0.345 |
| KRAS mutation type, n (%) | 0.757 | ||
| G12A | 1 (2.9) | 1 (4.8) | |
| G12C | 5 (14.3) | 2 (9.5) | |
| G12D | 4 (11.4) | 1 (4.8) | |
| G12E | 0 (0.0) | 1 (4.8) | |
| G12V | 3 (8.6) | 2 (9.5) | |
| G13V | 1 (2.9) | 0 (0.0) | |
| WT | 21 (60.0) | 14 (66.7) |
SD, standard deviation; WT, wild type.
Figure 1The work flow for the data analysis in this study. LUAD, lung adenocarcinoma; TCGA, The Cancer Genome Atlas; NMF, non-negative matrix factorization; GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes.
Figure 2The number of aberrant regulated genes and regulated methylation sites. FC, fold change; PCA, principal component analysis.
Figure 3The optimal number of cluster K and TCGA-ID for each subtype. (A) The optimal number of cluster K. (B) TCGA-ID for each subtype. TCGA, The Cancer Genome Atlas; ID, identity document.
Figure 4A total of 2 subtypes were obtained by unsupervised learning. (A) Silhouette plots for the identified cancer subtypes. (B) Survival R package cancer subtypes. (C) Heatmap of the sample similarity matrix. LUAD, lung adenocarcinoma.
Figure 5Heatmap of the correlations between the DEGs and methylation sites and clinical characteristics in each subtype. Pathologic_M: pathological metastasis; Pathologic_T: primary pathological tumor; Pathologic_N: pathological lymph node status. DEGs, differentially expressed genes.
The clinical characteristics of the 2 subtypes from the training data set
| Characteristics | Group 1 (n=240) | Group 2 (n=214) | P |
|---|---|---|---|
| Patient status (death), n (%) | 102 (42.5) | 61 (28.5) | 0.003 |
| Survival time (months), mean (SD) | 27.0 (27.9) | 33.3 (32.4) | 0.02 |
| Age (years), mean (SD) | 63.40 (10.57) | 66.63 (9.62) | 0.001 |
| Sex (male), n (%) | 129 (53.8) | 82 (38.3) | 0.001 |
| Recurrence | 0.22 | ||
| Missing | 39 (16.3) | 33 (15.4) | |
| No | 129 (53.8) | 131 (61.2) | |
| Yes | 72 (30.0) | 50 (23.4) | |
| Pathologic_M | 0.38 | ||
| Missing | 72 (30.0) | 75 (35.0) | |
| M0 | 156 (65.0) | 132 (61.7) | |
| M1 | 12 (5.0) | 7 (3.3) | |
| Pathologic_N | 0.001 | ||
| Missing | 2 (0.8) | 9 (4.2) | |
| N0 | 143 (59.6) | 157 (73.4) | |
| N1 | 52 (21.7) | 28 (13.1) | |
| N2 | 42 (17.5) | 20 (9.3) | |
| N3 | 1 (0.4) | 0 (0.0) | |
| Pathologic_T | 0.007 | ||
| Missing | 1 (0.4) | 2 (0.9) | |
| T1 | 65 (27.1) | 91 (42.5) | |
| T2 | 145 (60.4) | 94 (43.9) | |
| T3 | 21 (8.8) | 19 (8.9) | |
| T4 | 8 (3.3) | 8 (3.7) | |
| Tumor_stage | 0.001 | ||
| Missing | 1 (0.4) | 4 (1.9) | |
| Stage I | 112 (46.7) | 138 (64.5) | |
| Stage II | 70 (29.2) | 39 (18.2) | |
| Stage III | 45 (18.8) | 25 (11.7) | |
| Stage IV | 12 (5.0) | 8 (3.7) |
SD, standard deviation; Pathologic_M, pathological metastasis; Pathologic_T, primary pathological tumor; Pathologic_N, pathological lymph node status.