| Literature DB >> 32150672 |
Quan Chen1, Peng Gao1, Yongxi Song1, Xuanzhang Huang1, Qiong Xiao1, Xiaowan Chen1, Xinger Lv1, Zhenning Wang1.
Abstract
It is critical to identify patients with stage II and III colorectal cancer (CRC) who will benefit from adjuvant chemotherapy (ACT) after curative surgery, while the only use of clinical factors is insufficient to predict this beneficial effect. In this study, we performed genetic algorithm (GA) to select ACT candidate genes, and built a predictive model of support vector machine (SVM) using gene expression profiles from the Gene Expression Omnibus database. The model contained four ACT candidate genes (EDEM1, MVD, SEMA5B, and WWP2) and TNM stage (stage II or III). After using Subpopulation Treatment Effect Pattern Plot to determine the optimal cutoff value of predictive scores, the validated patients from The Cancer Genome Atlas database can be divided into the predictive ACT-benefit/-futile groups. Patients in the predictive ACT-benefit group with 5-fluorouracil (5-Fu)-based ACT had significantly longer relapse-free survival (RFS) compared to those without ACT (P = .015); However, the difference in RFS in the predictive ACT-futile group was insignificant (P = .596). The multivariable analysis found that the predictive groups were significantly associated with the effect of ACT (Pinteraction = .011). Consequently, we developed a predictive model based on the SVM and GA algorithm which was further validated to define patients who benefit from ACT on recurrence.Entities:
Keywords: adjuvant chemotherapy; colorectal cancer; gene profile; machine learning
Mesh:
Substances:
Year: 2020 PMID: 32150672 PMCID: PMC7196071 DOI: 10.1002/cam4.2952
Source DB: PubMed Journal: Cancer Med ISSN: 2045-7634 Impact factor: 4.452
Datasets used in this study
| Datasets | Type | Tissue | Platform | TNM Stage | ACT | Samples | Median follow‐up |
|---|---|---|---|---|---|---|---|
| TCGA | mRNA | CRC tissue | HiSeqV2 | II‐III | 5‐Fu based | 138 | 72.53 |
|
| mRNA | CRC tissue | GPL570 | II‐III | 5‐Fu based | 145 | 44.97 |
|
| mRNA | CRC tissue | GPL570 | II‐III | 5‐Fu based | 31 | 52.86 |
|
| mRNA | CRC tissue | GPL570 | II‐III | 5‐Fu plus folinic acid | 392 | 56.00 |
HiSeqV2 = IlluminaHiSeq_RNASeqV2.
GPL570 = GPL570[HG‐U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array.
Median follow‐up for relapse‐free survival time.
FIGURE 1Outline of the SVM‐GA model flow
Comparison of clinical factors between patients who received ACT and patients who did not receive ACT in all datasets
| Case number (N%) |
| ||
|---|---|---|---|
| Patients who received ACT | Patients who did not receive ACT | ||
| Age | <.001 | ||
| <55 | 77 (26.64%) | 45 (11.66%) | |
| 55‐65 | 83 (28.72%) | 62 (16.06%) | |
| 65‐75 | 95 (32.88%) | 136 (35.23%) | |
| >75 | 34 (11.67%) | 143 (37.05%) | |
| Race | .054 | ||
| White | 22 (55.00%) | 56 (57.14%) | |
| Black/African | 7 (17.50%) | 4 (4.08%) | |
| Asian | 1 (2.50%) | 2 (2.05%) | |
| Unknown | 10 (25.00%) | 36 (36.73%) | |
| Gender | .290 | ||
| Male | 162 (52.42%) | 224 (56.42%) | |
| Female | 147 (47.58%) | 173 (43.58%) | |
| Site | .184 | ||
| Colon | 292 (94.50%) | 365 (91.94%) | |
| Rectum | 17 (5.50%) | 32 (8.06%) | |
| Grade | .787 | ||
| I | 1 (5.00%) | 1 (9.09%) | |
| II | 15 (75.00%) | 7 (63.64%) | |
| III | 4 (20.00%) | 3 (27.27%) | |
| Histological type | .709 | ||
| AC | 36 (90.00%) | 86 (87.76%) | |
| MC | 4 (10.00%) | 12 (12.24%) | |
| T stage | <.001 | ||
| T1 | 4 (1.69%) | 1 (0.33%) | |
| T2 | 37 (15.68%) | 95 (31.05%) | |
| T3 | 160 (67.79%) | 168 (54.89%) | |
| T4 | 35 (14.84%) | 42 (13.73%) | |
| N stage | <.001 | ||
| N0 | 56 (24.03%) | 161 (53.14%) | |
| N1 | 88 (37.77%) | 111 (36.63%) | |
| N2 | 73 (31.33%) | 22 (7.26%) | |
| N3 | 16 (6.87%) | 9 (2.97%) | |
| TNM Stage | <.001 | ||
| II | 86 (27.84%) | 315 (79.35%) | |
| III | 223 (72.16%) | 82 (20.65%) | |
Abbreviations: AC, adenocarcinoma; Black/African, Black or African American; MC, mucinous adenocarcinoma.
Lack of information in some series: 31 patients, lack of age information (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE29621); 164 patients, lack of T stage information (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE14333, http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE29621); 170 patients, lack of N stage information (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE14333, http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE29621, http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE39582); Grade information was only provided in http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE29621 series; histological type and race information were only provided in TCGA series.
Grade I = well differentiated; grade II = moderately differentiated; grade III = poorly differentiated.
P values were made by χ 2‐test.
FIGURE 2Establishment of the SVM‐GA model using the training cohort. A, Results of genetic algorithm (GA). The iterations of each variable in GA are presented in the longitudinal axes and the selected variables in the SVM‐GA model are presented in the transverse axes. B, SVM‐GA model predictive scores distribution, patient relapse‐free survival time, and expression heatmap
FIGURE 3STEPP analysis between the concentrated continuous variables (predicted values of SVM model) and the effect of adjuvant chemotherapy (ACT) in the training cohort. A, Relapse‐free survival (RFS) rates at 36 mo of patients with ACT and surgery only according to patients’ subpopulations clustered by predictive values. B, Hazard ratio of patients with ACT according to patients’ subpopulations clustered by predictive values (solid line) with a 95% confidence interval (dashed lines)
FIGURE 4Relapse‐free survival (RFS) in the predictive adjuvant chemotherapy (ACT) groups in the test cohort. In total, 138 patients with CRC from TCGA database are included in these analyses. A, RFS in the predictive ACT‐benefit group. B, RFS in the predictive ACT‐futile group. 95%CI, 95% confidence interval; HR, hazard ratio
FIGURE 5Association between the relapse‐free survival (RFS) and predictive adjuvant chemotherapy (ACT) groups or clinical characteristics in the test cohort. A, Univariate analysis. B, Multivariate analysis adjusted for age, sex, TNM stage, and predictive groups. P values for association between clinical characteristics and ACT‐benefit group
Baseline characteristics before and after propensity score analysis in the test cohort
| Characteristic | Before matching | After matching | ||||
|---|---|---|---|---|---|---|
|
Patients who received ACT (N = 40) |
Patients who did not receive ACT (N = 98) |
|
Patients who received ACT (N = 29) |
Patients who did not receive ACT (N = 44) |
| |
| Age | <.001 | .072 | ||||
| <55 | 14 (35.00%) | 12 (12.24%) | 5 (17.24%) | 11 (24.99%) | ||
| 55‐65 | 12 (30.00%) | 16 (16.33%) | 11 (37.93%) | 7 (15.91%) | ||
| 65‐75 | 11 (27.50%) | 38 (38.78%) | 10 (34.48%) | 13 (29.55%) | ||
| >75 | 3 (7.50%) | 32 (32.65%) | 3 (10.35%) | 13 (29.55%) | ||
| Gender | .004 | .669 | ||||
| Male | 15 (37.50%) | 63 (64.28%) | 14 (48.27%) | 19 (43.18%) | ||
| Female | 25 (62.50%) | 35 (35.72%) | 15 (51.73%) | 25 (56.82%) | ||
| T stage | .019 | .342 | ||||
| T1 | 2 (5.00%) | 1 (1.02%) | 1 (3.45%) | 1 (2.27%) | ||
| T2 | 31 (77.50%) | 92 (93.88%) | 22 (75.86%) | 39 (88.64%) | ||
| T3 | 7 (17.50%) | 5 (5.10%) | 6 (20.69%) | 4 (9.09%) | ||
| N stage | <.001 | .197 | ||||
| N1 | 11 (27.50%) | 80 (81.64%) | 11 (37.93%) | 26 (59.10%) | ||
| N2 | 17 (42.50%) | 9 (9.18%) | 10 (34.48%) | 9 (20.45%) | ||
| N3 | 12 (30.00%) | 9 (9.18%) | 8 (27.59%) | 9 (20.45%) | ||
| TNM stage | <.001 | .077 | ||||
| II | 11 (27.50%) | 80 (81.64%) | 11 (37.93%) | 26 (59.09%) | ||
| III | 29 (72.50%) | 18 (18.36%) | 18 (62.07%) | 18 (40.91%) | ||
P values were made by χ 2‐test.
FIGURE 6Relapse‐free survival (RFS) in the predictive adjuvant chemotherapy (ACT) groups after PS matching in the test cohort. In total, 73 patients with CRC from TCGA database are included in these analyses. A, RFS in the predictive ACT‐benefit group. B, RFS in the predictive ACT‐futile group. 95%CI, 95% confidence interval; HR, hazard ratio