| Literature DB >> 32957968 |
Jiannan Liu1, Chuanpeng Dong1,2, Guanglong Jiang1,2,3, Xiaoyu Lu1,2, Yunlong Liu2,3, Huanmei Wu4,5.
Abstract
BACKGROUND: Colon cancer is one of the leading causes of cancer deaths in the USA and around the world. Molecular level characters, such as gene expression levels and mutations, may provide profound information for precision treatment apart from pathological indicators. Transcription factors function as critical regulators in all aspects of cell life, but transcription factors-based biomarkers for colon cancer prognosis were still rare and necessary.Entities:
Keywords: Cancer prognosis; Colon cancer; Machine learning; Transcription factor
Year: 2020 PMID: 32957968 PMCID: PMC7504662 DOI: 10.1186/s12920-020-00775-0
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Summary of the general clinicopathologic characteristics of patients in both training and testing datasets
| Characteristic | TCGA ( | GSE39582 ( | GSE17536 ( | GSE37892 ( | GSE17537 ( |
|---|---|---|---|---|---|
| N (%) | N (%) | N (%) | N (%) | N (%) | |
| Age (years) | |||||
| Median | 66 | 68 | 66 | 68 | 62 |
| Range | 31–90 | 22–97 | 26–92 | 22–97 | 23–94 |
| < 65 | 166 (38.2) | 211 (37.5) | 78 (44.1) | 54 (41.5) | 32 (58.2) |
| ≥ 65 | 269 (51.8) | 351 (62.3) | 99 (55.9) | 76 (58.5) | 23 (41.8) |
| Sex | |||||
| Male | 202 (46.4) | 309 (54.9) | 96 (54.2) | 69 (53.1) | 26 (47.3) |
| Female | 233 (53.6) | 253 (44.9) | 81 (45.8) | 61 (46.9) | 29 (52.7) |
| T Statusa | |||||
| T1–2 | 86 (19.8) | 56 (9.9) | |||
| T3–4 | 345 (79.3) | 483 (85.8) | |||
| N Statusa | |||||
| N0 | 254 (58.4) | 299 (53.1) | |||
| N1 | 100 (23.0) | 133 (23.6) | |||
| N2 | 78 (17.9) | 98 (17.4) | |||
| M Statusa | |||||
| M0 | 318 (73.1) | 479 (85.1) | |||
| M1 | 60 (13.8) | 61 (10.8) | |||
| MX | 47 (10.8) | 2 (0.4) | |||
| Stage | |||||
| I | 73 (16.8) | 32 (5.7) | 24 (13.6) | 4 (7.3) | |
| II | 167 (38.4) | 262 (46.5) | 57 (32.2) | 73 (56.2) | 15 (27.3) |
| III | 124 (28.5) | 204 (36.2) | 57 (32.2) | 57 (43.8) | 19 (34.5) |
| IV | 60 (13.8) | 60 (10.7) | 39 (22) | 17 (30.9) | |
aT status Describes the size of primary tissue and whether it has invaded nearby tissue, N status Describes nearby lymph nodes that are involved, M status Describes distant metastasis
Fig. 1Workflow of this study. (TFs Screening, Predictive Modeling; Model Validation)
Fig. 2The RF results of the prognosis TFs for the Depth and relative frequency
Fig. 3Information on five prognostic TFs finally selected for building the prediction model
Fig. 4A multivariate linear regression model based on expression of five TFs. A. The patient survival (follow-up distribution) and selected genes expression profile, among with the calculated risk scores; B. The KM curve for predicted high-risk subgroup and low-risk subgroup using TCGA COAD dataset
Fig. 5The KM curves of the overall survival probabilities for four independent validation datasets for predicted high-risk subgroups and low-risk subgroups
Fig. 6Enrichment plots for the top four enriched gene pathways according to the GSEA results. GSEA is performed on TCGA COAD dataset