| Literature DB >> 35991579 |
Aaron Curtis1,2, Yajun Yu1,2, Megan Carey1, Patrick Parfrey3, Yildiz E Yilmaz1,3,4, Sevtap Savas1,2,5.
Abstract
Background: SNP interactions may explain the variable outcome risk among colorectal cancer patients. Examining SNP interactions is challenging, especially with large datasets. Multifactor Dimensionality Reduction (MDR)-based programs may address this problem.Entities:
Keywords: MDR; MMPs; SNP-SNP interactions; VEGF family; colorectal cancer; prognostic markers; protein interaction network
Year: 2022 PMID: 35991579 PMCID: PMC9385108 DOI: 10.3389/fgene.2022.902217
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1This figure demonstrates the overall workflow of the analyses performed. Multivariate Cox-regression and univariate Kaplan-Meier analyses were used to verify the Cox-MDR results and assess the associations of the identified genotype groups with clinical outcomes, whereas multivariate logistic regression was used to verify the GMDR 0.9 results and the association of the identified genotype groups with clinical outcomes.
Multivariable Cox regression analysis result for the significant 1-way Cox-MDR model in the MMP dataset (overall survival).
| Top model SNP | High risk genotypes |
| HR | 95% CI (lower-upper) |
|---|---|---|---|---|
| rs11225388_GA | AA | 0.002 | 0.591 | 0.425–0.821 |
CI: confidence interval; HR: hazards ratio; SNP: single nucleotide polymorphism. HR calculated for low risk genotypes (GG + GA) versus high-risk genotype (AA).
Multivariable logistic regression analysis results for the significant 2-way and 3-way GMDR 0.9 models in the MMP dataset (overall survival).
| Top model SNPs | High risk genotypes |
| OR | 95% CI (lower-upper) |
|---|---|---|---|---|
| rs7817382_GA and rs2254207_CA | (0AA,1CA), (0AA,2CC), (1GA,0AA), (1GA,2CC), (2GG,1CA) | 4.4194E-06 | 3.266 | 1.971–5.414 |
| rs2664369_GT, rs11225332_CT and rs11639960_GA | (0TT,0TT,2GG), (0TT,1CT,1GA), (0TT,1CT,2GG), (0TT,2CC,1GA), (1GT,0TT,0AA), (1GT,0TT,1GA), (1GT,1CT,2GG), (1GT,2CC,2GG), (2GG,0TT,0AA), (2GG,1CT,2GG), (2GG,2CC,0AA), (2GG,2CC,2GG) | 1.2929E-08 | 4.503 | 2.681–7.563 |
CI: confidence interval; OR: odds ratio; SNP: single nucleotide polymorphism.
Alleles are given in the order major allele minor allele. 0,1,2 refer to additive coding, i.e. dosage of the minor allele (0 = 0 copies of the minor allele, 1 = 1 copies of the minor allele, 2 = 2 copies of the minor allele).
Permutation testing and multivariable Cox-regression analysis results for the top Cox-MDR models in the VEGF interaction network set analyses (disease specific survival).
| Interactor set | Top model SNP(s) | High risk genotypes | Permutation | Cox regression | HR | 95% CI (lower-upper) |
|---|---|---|---|---|---|---|
| 1-way | ||||||
|
| ||||||
| |
| 1 (TG),2 (TT) | 0.273 |
|
|
|
| |
| 1 (GA) | 0.201 |
|
|
|
| |
| 1 (CA) | 0.346 |
|
|
|
| |
| 0 (TT) | 0.07 |
|
|
|
| |
| 0 (CC),2 (TT) | 0.181 |
|
|
|
| |
| 1 (CA),2 (CC) | 0.098 |
|
|
|
| |
| 1 (AC),2 (AA) | 0.032 | 0.003 | 1.977 | 1.265–3.089 |
|
| ||||||
| |
| 1 (GA) | 0.045 | 0.298 | 1.256 | 0.818–1.928 |
| 2-way | ||||||
| |
| (0 [CC],0 [AA]), (1 [TC],1 [GA]) (0 [CC],2 [GG]) (2 [TT],2 [GG]) | 0.392 |
|
|
|
| |
| (1 [TC],0 [TT]) (0 [CC],1 [CT]) (2 [TT],1 [CT]) (0 [CC],2 [CC]) (2 [TT],2 [CC]) | 0.225 |
|
|
|
| |
| (0 [AA],0 [CC]) (2 [GG],0 [CC]) (1 [GA],1 [AC]) (0 [AA],2 [AA]) (1 [GA],2 [AA]) | 0.146 |
|
|
|
| |
| (1 [TC],0 [GG]) (2 [TT],0 [GG]) (0 [CC],1 [AG]) (0 [CC],2 [AA]) (1 [TC],2 [AA]) | 0.128 |
|
|
|
| |
| (2 [CC],0 [GG]) (0 [AA],1 [TG]) (1 [CA],2 [TT]) (2 [CC],2 [TT]) | 0.389 |
|
|
|
| |
| (0 [CC],0 [GG]) (1 [TC],0 [GG]) (0 [CC],1 [AG]) (1 [TC],2 [AA]) | 0.004 | 2.03E-06 | 3.147 | 1.961–5.050 |
| |
| (0 [AA],0 [GG]) (1 [GA],2 [AA]) | 0.527 |
|
|
|
| 3-way | ||||||
| |
| (0 [TT],0 [CC],0 [TT]) (1 [CT],0 [CC],0 [TT]) (0 [TT],1 [TC],0 [TT]) (0 [TT],2 [TT],0 [TT]) (0 [TT],0 [CC],1 [CT]) (1 [CT],1 [TC],1 [CT]) (2 [CC],1 [TC],1 [CT]) (0 [TT],2 [TT],1 [CT]) (1 [CT],2 [TT],1 [CT]) (2 [CC],2 [TT],1 [CT]) (0 [TT],0 [CC],2 [CC]) (2 [CC],0 [CC],2 [CC]) (1 [CT],2 [TT],2 [CC]) | 0.058 |
|
|
|
| |
| (1 [CA],0 [GG],0 [TT]) (0 [AA],1 [AG],0 [TT]) (2 [CC],1 [AG],0 [TT]) (1 [CA],2 [AA],0 [TT]) (0 [AA],0 [GG],1 [CT]) (2 [CC],0 [GG],1 [CT]) (0 [AA],1 [AG],1 [CT]) (1 [CA],2 [AA],1 [CT]) (2 [CC],2 [AA],1 [CT]) (1 [CA],1 [AG],2 [CC]) (2 [CC],2 [AA],2 [CC]) | 0.217 |
|
|
|
| |
| (2 [CC],0 [GG],0 [CC]) (0 [TT],1 [AG],0 [CC]) (2 [CC],1 [AG],0 [CC]) (1 [CT],2 [AA],0 [CC]) (1 [CT],0 [GG],1 [TC]) (2 [CC],0 [GG],1 [TC]) (1 [CT],1 [AG],1 [TC]) (2 [CC],1 [AG],1 [TC]) (1 [CT],2 [AA],1 [TC]) (0 [TT],0 [GG],2 [TT]) (2 [CC],0 [GG],2 [TT]) (1 [CT],1 [AG],2 [TT]) | 0.229 |
|
|
|
| |
| (1 [CT],0 [CC],0 [GG]) (1 [CT],1 [TC],0 [GG]) (2 [CC],1 [TC],0 [GG]) (0 [TT],2 [TT],0 [GG]) (0 [TT],0 [CC],1 [AG]) (0 [TT],0 [CC],2 [AA]) (2 [CC],0 [CC],2 [AA]) (0 [TT],1 [TC],2 [AA]) (1 [CT],1 [TC],2 [AA]) | 0.097 |
|
|
|
| |
| (0 [GG],0 [TT],0 [CC]) (1 [AG],0 [TT],0 [CC]) (2 [AA],0 [TT],0 [CC]) (0 [GG],1 [GT],0 [CC]) (2 [AA],0 [TT],1 [TC]) (0 [GG],1 [GT],1 [TC]) (0 [GG],2 [GG],1 [TC]) (1 [AG],2 [GG],1 [TC]) (1 [AG],0 [TT],2 [TT]) (2 [AA],0 [TT],2 [TT]) (1 [AG],2 [GG],2 [TT]) (2 [AA],2 [GG],2 [TT]) | 0.12 |
|
|
|
| |
| (1 [TC],0 [GG],0 [GG]) (0 [CC],1 [AG],0 [GG]) (1 [TC],1 [AG],0 [GG]) (0 [CC],2 [AA],0 [GG]) (0 [CC],0 [GG],1 [AG]) (0 [CC],1 [AG],1 [AG]) (0 [CC],2 [AA],1 [AG]) (1 [TC],2 [AA],1 [AG]) (1 [TC],0 [GG],2 [AA]) (2 [TT],0 [GG],2 [AA]) (1 [TC],1 [AG],2 [AA]) (0 [CC],2 [AA],2 [AA]) (2 [TT],2 [AA],2 [AA]) | 0.007 | 2.21E-09 | 5.004 | 2.952–8.481 |
| |
| (1 [GA],0 [AA],0 [TT]) (0 [AA],1 [CA],0 [TT]) (0 [AA],2 [CC],0 [TT]) (2 [GG],2 [CC],0 [TT]) (1 [GA],0 [AA],1 [CT]) (0 [AA],1 [CA],1 [CT]) (2 [GG],1 [CA],1 [CT]) (0 [AA],0 [AA],2 [CC]) (2 [GG],0 [AA],2 [CC]) (2 [GG],1 [CA],2 [CC]) | 0.253 |
|
|
|
CI: confidence interval; HR: hazards ratio; SNP: single nucleotide polymorphism.
0, 1, and 2 in the High Risk Genotype column refer to additive coding, where the number refers to the number of minor alleles in the genotype.
Square brackets in the Top Model SNPs column indicate major and minor alleles for each SNP; which the first letter represents the minor allele and the second letter represents the major allele. In the high risk genotypes column, the three items enclosed in parentheses signify the genotypes of the combination of SNPs which was found to be high risk by Cox-MDR. Commas separate the genotypes for each SNP in the order in which they appear in the corresponding Top Model SNPs entry. Whenever a SNP with a main effect was identified in 1-way analysis, the analysis was repeated with that SNP removed from the dataset (i.e. successive iterations). FLT1 is also known as VEGFR1; KDR is also known as VEGFR2; FLT4 is also known as VEGFR3; and PGF is also known as PIGF.
FIGURE 2Log-rank p = 1.02619688760668E-12. Red: high risk genotype combinations: (TC,GG,GG), (CC,AG,GG), (TC,AG,GG), (CC,AA,GG), (CC,GG,AG), (CC,AG,AG), (CC,AA,AG), (TC,AA,AG), (TC,GG,AA), (TT,GG,AA), (TC,AG,AA), (CC,AA,AA), and (TT,AA,AA). Blue: all other genotype combinations. The vertical lines on the curves denote the censored patients (e.g. patients alive at the last follow up time). X and Y axis show the follow-up time (in years; rounded) and cumulative survival, respectively.
Multivariable logistic regression analysis results for the top GMDR 0.9 models in the VEGF interaction network set analyses (disease-specific survival).
| Interaction set | Top model SNP(s) | High risk genotypes | Permutation | Logistic regression | OR | 95% CI (lower-upper) |
|---|---|---|---|---|---|---|
| 1-way | ||||||
|
| ||||||
| |
| 0 [GG],2 [TT] | 0.014 | 0.010 | 2.399 | 1.230–4.679 |
| |
| 0 [AA],2 [GG] | 0.02 | 0.032 | 2.015 | 1.062–3.822 |
| |
| 1 [CT],2 [CC] | 0.358 |
|
| |
| |
| 0 [TT] | <0.001 | 0.002 | 2.941 | 1.468–5.891 |
| |
| 0 [CC],2 [TT] | <0.001 | 1.442E-04 | 3.662 | 1.875–7.152 |
| |
| 0 [TT] | 0.005 | 0.004 | 2.616 | 1.369–4.997 |
| |
| 1 [AC],2 [AA] | 0.045 | 0.011 | 2.359 | 1.222–4.554 |
|
| ||||||
| |
| 0 [AA] | 0.022 | 0.012 | 1.984 | 0.673–5.847 |
| |
| 1 [TG],2 [TT] | 0.067 |
|
| |
| |
| 1 [TC],2 [TT] | 0.116 |
|
| |
| |
| 0 [CC] | 0.001 | 3.425E-04 | 3.467 | 1.756–6.848 |
| |
| 1 [TG],2 [TT] | 0.007 | 0.006 | 2.924 | 1.362–6.278 |
| |
| 0 [GG] | 0.069 |
|
| |
|
| ||||||
| |
| 0 [TT] | 0.051 |
|
| |
| |
| 1 [TG],2 [TT] | 0.079 |
|
| |
| |
| 1 [GT] | 0.007 | 0.011 | 2.243 | 1.207–4.169 |
|
| ||||||
| |
| 0 [GG] | 0.043 | 0.009 | 2.317 | 1.235–4.346 |
|
| ||||||
| |
| 1 [TC] | 0.225 |
|
| |
| 2-way | ||||||
| |
| (0 [GG],2 [AA]) (1 [AG],0 [GG]) (2 [AA],0 [GG]) (2 [AA],1 [AG]) | <0.001 | 3.180E-05 | 4.387 | 2.186–8.805 |
| |
| (0 [TT],1 [CT]) (1 [CT],0 [TT]) (2 [CC],1 [CT]) | 0.018 | 7.082E-05 | 3.696 | 1.940–7.044 |
| |
| (0 [CC],1 [TC]) (1 [TC],0 [CC]) (1 [TC],2 [TT]) (2 [TT],1 [TC]) | 0.002 | 1.335E-04 | 3.827 | 1.922–7.620 |
| |
| (0 [CC],0 [AA]) (1 [TC],1 [CA]) (2 [TT],1 [CA]) | 0.003 | 1.852E-04 | 3.361 | 1.780–6.346 |
| |
| (0 [CC],1 [GA]) (1 [TC],0 [AA]) | <0.001 | 1.213E-05 | 4.542 | 2.306–8.947 |
| |
| (0 [CC],0 [CC]) (1 [TC],1 [TC]) (1 [TC],2 [TT]) (2 [TT],2 [TT]) | <0.001 | 3.872E-05 | 3.892 | 2.037–7.433 |
| |
| (0 [CC],1 [TC]) (1 [TC],0 [CC]) (1 [TC],2 [TT]) (2 [TT],0 [CC]) | 0.055 |
|
| |
| 3-way | ||||||
| |
| (0 [GG],0 [GG],1 [TC]) (0 [GG],0 [GG],2 [TT]) (0 [GG],1 [AG],1 [TC]) (0 [GG],2 [AA],0[CC]) (0 [GG],2 [AA],2 [TT]) (1 [CG],1 [AG],0 [CC]) (1 [CG],1 [AG],2 [TT]) (1 [CG],2 [AA],0 [CC]) (2 [CC],0 [GG],1 [TC]) (2 [CC],0 [GG],2 [TT]) (2 [CC],1 [AG],2 [TT]) (2 [CC],2 [AA],2 [TT]) | 0.001 | 2.146E-07 | 9.322 | 4.010–21.672 |
| |
| (0 [TT],0 [AA],1 [TG]) (0 [TT],0 [AA],2 [TT]) (0 [TT],1 [GA],2 [TT]) (0 [TT],2 [GG],1 [TG]) (1 [CT],0 [AA],2 [TT]) (1 [CT],1 [GA],0 [GG]) (1 [CT],2 [GG],0 [GG]) (1 [CT],2 [GG],1 [TG]) (2 [CC],1 [GA],0 [GG]) (2 [CC],1 [GA],1 [TG]) (2 [CC],1 [GA],2 [TT]) | 0.007 | 2.404E-09 | 9.991 | 4.691–21.276 |
| |
| (0 [AA],0 [TT],1 [TG]) (0 [AA],1 [CT],0 [GG]) (0 [AA],2 [CC],0 [GG]) (0 [AA],2 [CC],1 [TG]) (1 [GA],0 [TT],0 [GG]) (1 [GA],1 [CT],1 [TG]) (1 [GA],2 [CC],2 [TT]) (2 [GG],0 [TT],1 [TG]) (2 [GG],1 [CT],1 [TG]) (2 [GG],1 [CT],2 [TT]) | 0.038 | 4.028E-06 | 5.418 | 2.642–11.114 |
| |
| (0 [GG],0 [AA],0 [TT]) (0 [GG],0 [AA],2 [AA]) (0 [GG],1 [GA],0 [TT]) (0 [GG],1 [GA],1 [AT]) (0 [GG],2 [GG],0 [TT]) (1 [AG],0 [AA],0 [TT]) (1 [AG],1 [GA],1 [AT]) (2 [AA],2 [GG],0 [TT]) (2 [AA],2 [GG],2 [AA]) | <0.001 | 3.505E-08 | 14.855 | 5.693–38.761 |
| |
| (1 [TG],0 [CC],1 [GT]) (1 [TG],1 [TC],0 [TT]) (1 [TG],1 [TC],2 [GG]) (1 [TG],2 [TT],0 [TT]) (1 [TG],2 [TT],1 [GT]) (1 [TG],2 [TT],2 [GG]) (2 [TT],0 [CC],2 [GG]) (2 [TT],1 [TC],1 [GT]) (2 [TT],1 [TC],2 [GG]) (2 [TT],2 [TT],1 [GT]) | <0.001 | 7.062E-09 | 8.712 | 4.186–18.129 |
| |
| (0 [CC],0 [TT],0 [CC]) (0 [CC],1 [CT],0 [CC]) (1 [TC],0 [TT],1 [TC]) (1 [TC],0 [TT],2 [TT]) (1 [TC],1 [CT],0 [CC]) (2 [TT],0 [TT],2 [TT]) (2 [TT],1 [CT],0 [CC]) (2 [TT],1 [CT],2 [TT]) | <0.001 | 5.721E-08 | 8.030 | 3.784–17.038 |
| |
| (0 [GG],0 [GG],0 [CC]) (0 [GG],0 [GG],1 [TC]) (0 [GG],1 [AG],0 [CC]) (0 [GG],1 [AG],2 [TT]) (0 [GG],2 [AA],0 [CC]) (0 [GG],2 [AA],2 [TT]) (1 [AG],0 [GG],2 [TT]) (1 [AG],1 [AG],1 [TC]) (1 [AG],2 [AA],1 [TC]) (1 [AG],2 [AA],2 [TT]) (2 [AA],0 [GG],0 [CC]) (2 [AA],2 [AA],0 [CC]) | <0.001 | 4.218E-07 | 12.996 | 4.812–35.103 |
CI: confidence interval; OR: odds ratio; SNP: single nucleotide polymorphism.
0, 1, and 2 in the High Risk Genotype column refer to additive coding, where the number refers to the number of minor alleles in the genotype.
Square brackets in the Top Model SNPs column indicate major and minor alleles for each SNP; in which the first letter represents the minor allele and the second letter represents the major allele. The High risk genotypes column lists genotypes which were found by GMDR 0.9 to be high risk for poor survival. High-risk genotypes have the following format: the items between each pair of parentheses specify a genotype which is high risk for poor survival according to the GMDR output, presented in the order of the SNPs listed in the Top model SNP column. e.g. for top model SNPs FLT1.rs3936415 [AG]_FLT1.rs11149523 [AG]_NRP1.rs2073320 [TC], genotypes (0 [GG],0 [GG],0 [CC]), rs3936415 = GG, rs11149523 = GG, and rs2073320 = CC were classified as high risk by the GMDR 0.9 procedure. Whenever a SNP with a main effect was identified in 1-way analysis, the analysis was repeated with that SNP removed from the dataset (i.e. successive iterations). FLT1 is also known as VEGFR1; KDR is also known as VEGFR2; FLT4 is also known as VEGFR3; and PGF is also known as PIGF.
FIGURE 3Log-rank p = 6.61897020900234E-07. Red: High risk genotypes: (TT,AA,TG), (TT,AA,TT), (TT,GA,TT), (TT,GG,TG), (CT,AA,TT), (CT,GA,GG), (CT,GG,GG), (CT,GG,TG), (CC,GA,GG), (CC,GA,TG), and (CC,GA,TT). Blue: All others except (CT,GG,TT) and (CC,GG,TT)The vertical lines on the curves denote the censored patients (e.g. patients alive at the last follow up time). X and Y axis show the follow-up time (in years; rounded) and cumulative survival, respectively.SNP-SNP interactions in survival outcomes.