| Literature DB >> 35637462 |
Ming-Hung Shen1,2,3, Chi-Jung Huang4,5, Thien-Fiew Ho6, Chih-Yi Liu7, Ying-Yih Shih8, Ching-Shui Huang9,10, Chi-Cheng Huang11,12,13.
Abstract
BACKGROUND: The aim of the study was to enhance colorectal cancer prognostication by integrating single nucleotide polymorphism (SNP) and gene expression (GE) microarrays for genomic and transcriptional alteration detection; genes with concurrent gains and losses were used to develop a prognostic signature.Entities:
Keywords: Colorectal cancer; Concurrent gene signature; Copy number variation; Gene expression; Microarray
Mesh:
Year: 2022 PMID: 35637462 PMCID: PMC9150289 DOI: 10.1186/s12885-022-09627-9
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.638
Fig. 1Overview of study design (CRC: colorectal cancer, GE: gene expression, SNP: single nucleotide polymorphism, b/w: between, TCGA-COAD: The Cancer Genome Atlas colon adenocarcinoma, RFS: relapse-free survival, OS: overall survival, GSE: Gene Expression Omnibus series accession number)
Fig. 2Frequency plot of CNV among 32 Taiwanese CRC patients (CNV: copy number variation, CRC: colorectal cancer)
The number of prognostic concurrent genes (probe sets) within each training dataset
| Dataset | Number of prognostic probe-sets | Percentage of prognostic probe-sets |
|---|---|---|
| GSE12945 | 25 | 1.25% |
| GSE14333 | 251 | 12.55% |
| GSE17538 | 53 | 2.65% |
| GSE18088 | 1005 | 50.25% |
| GSE18105 | 48 | 2.4% |
| GSE39582 | 1 | 0.05% |
| GSE5206 | 338 | 16.9% |
| GSE64857 | 68 | 3.4% |
| GSE9384 | 168 | 8.4% |
| TCGA_COAD | 43 | 2.15% |
The number of consensual concurrent genes among training datasets
| Number of consensual datasets | Number of concurrent genes |
|---|---|
| 5 | 1 |
| 4 | 4 |
| 3 | 44 |
| 2 | 241 |
| 1 | 1365 |
Fig. 3A to E Relapse-free/overall survival analysis from microarray datasets of GSE12945 (A top left), GSE14333 (B top right), GSE17538 (C middle left), TCGA_COAD (D middle right), and GSE39582 (E bottom) with leave-one-out cross-validation. The high−/low-risk group was defined by the 50th percentile of the prognostic index score determined by the supervised principal component composed of 49 concurrent genes. All survival times were measured in months, except for 3D, which was measured in days
Summary of censored case numbers in high- and low-risk groups and area under the curve (AUC) from time-dependent receiver operating characteristic (ROC) curve from survival analysis (gene only model, RFS Relapse-free survival, OS Overall survival, CRC Colorectal cancer, GEO Gene expression omnibus)
| CRC case number | GEO | Outcome | Predicted high−/low-risk patients | Censored in high-risk group | Censored in low-risk group | Landmark | AUC |
|---|---|---|---|---|---|---|---|
| 51 | GSE12945 | RFS | 24/27 | 22(91.7%) | 25(92.6%) | 49 | 0.672 |
| 226 | GSE14333 | RFS | 113/113 | 74(65.5%) | 102(90.3%) | 38.5 | 0.643 |
| 200 | GSE17538 | RFS | 101/99 | 64(63.4%) | 81(81.8%) | 39 | 0.63 |
| 174 | TCGA_COAD | OS | 89/85 | 79(88.8%) | 79(92.9%) | 30.5 | 0.824 |
| 557 | GSE39582 | RFS | 279/278 | 182(65.2%) | 198(71.2%) | 43 | 0.519 |
CRC risk predictive accuracy of recurrence, metastasis, or death by the concurrent gene prognostic model
| CRC case number | GEO | Outcome | Compound covariate (%) | Diagonal linear discriminant (%) | 1-NN (%) | 3-NN (%) | Nearest centroid (%) | SVM (%) | Bayesian compound covariate (%) | CV-AUC from Bayesian compound covariate |
|---|---|---|---|---|---|---|---|---|---|---|
| 100 | GSE5206 | Recurrence | 82 | 83 | 75 | 80 | 80 | 77 | 86 | 0.833 |
| 70 | GSE9348 | Metastasis | 70 | 76 | 79 | 81 | 70 | 73 | 85 | 0.768 |
| 53 | GSE18088 | Relapse | 75 | 75 | 75 | 75 | 77 | 72 | 89 | 0.863 |
| 111 | GSE18105 | Metastasis | 57 | 53 | 66 | 60 | 56 | 61 | N/A | 0.548 |
| 75 | GSE64857 | Recurrence | 61 | 60 | 64 | 64 | 63 | 57 | 81 | 0.684 |
CRC Colorectal cancer, GEO Gene expression omnibus, NN Neural network, SVM Support vector machine, CV-AUC Cross-validation area under receiver operating characteristic curve, N/A Not applicable