| Literature DB >> 33142733 |
Heike Deutelmoser1,2, Justo Lorenzo Bermejo2, Axel Benner3, Korbinian Weigl4, Hanla A Park5, Mariam Haffa1,6, Esther Herpel7,8, Martin Schneider9, Cornelia M Ulrich1,10,11, Michael Hoffmeister4, Jenny Chang-Claude5,12, Hermann Brenner1,4,13, Dominique Scherer1,2.
Abstract
Colorectal cancer (CRC) survival has environmental and inherited components. The expression of specific genes can be inferred based on individual genotypes-so called expression quantitative trait loci. In this study, we used the PrediXcan method to predict gene expression in normal colon tissue using individual genotype data from 91 CRC patients and examined the correlation ρ between predicted and measured gene expression levels. Out of 5434 predicted genes, 58% showed a negative ρ value and only 16% presented a ρ higher than 0.10. We subsequently investigated the association between genotype-based gene expression in colon tissue for genes with ρ > 0.10 and survival of 4436 CRC patients. We identified an inverse association between the predicted expression of ARID3B and CRC-specific survival for patients with a body mass index greater than or equal to 30 kg/m2 (HR (hazard ratio) = 0.66 for an expression higher vs. lower than the median, p = 0.005). This association was validated using genotype and clinical data from the UK Biobank (HR = 0.74, p = 0.04). In addition to the identification of ARID3B expression in normal colon tissue as a candidate prognostic biomarker for obese CRC patients, our study illustrates the challenges of genotype-based prediction of gene expression, and the advantage of reassessing the prediction accuracy in a subset of the study population using measured gene expression data.Entities:
Keywords: PrediXcan; colorectal cancer; genotype-based gene expression; survival
Mesh:
Substances:
Year: 2020 PMID: 33142733 PMCID: PMC7662650 DOI: 10.3390/ijms21218150
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Data overview for the examination of the correlation ρ between predicted and measured gene expression. Abbreviations: CRC: colorectal cancer; SNPs: single nucleotide polymorphisms.
Figure 2(a) Scatter plot of the mean measured versus the mean predicted expression values for 5434 genes with estimated Spearman correlation coefficients ρ < 0 in red, 0 ≤ ρ ≤ 0.1 in black, and ρ > 0.1 in green; (b,c) measured versus predicted expression values for the genes TRIM4 (b), and PYGL (c). The linear regression lines are also shown with their corresponding 95% confidence bands.
Characteristics of the study population.
| Overall Survival | CRC-Specific Survival | |||||||
|---|---|---|---|---|---|---|---|---|
| Variable | Level | Patients | Deaths | HR 1 (95% CI) |
| Deaths | HR 1 (95% CI) |
|
| Age at | <60 | 946 | 257 | Ref. | <2 × 10−16 | 209 | Ref. | 0.0003 |
| Diagnosis | 60–69 | 1339 | 466 | 1.25 (1.07−1.46) | 314 | 1.06 (0.89−1.27) | ||
| (years) | 70–79 | 1489 | 638 | 357 | ||||
| >80 | 662 | 429 | 173 | |||||
| Gender | Male | 2685 | 1111 | Ref. | 0.16 | 626 | Ref. | 0.47 |
| Female | 1751 | 679 | 1.07 (0.97−1.18) | 427 | 0.96 (0.85−1.08) | |||
| CRC stage | I | 1024 | 235 | Ref. | <2 × 10−16 | 40 | Ref. | <2 × 10−16 |
| II | 1345 | 434 | 1.49 (1.27−1.74) | 148 | 2.96 (2.09−4.20) | |||
| III | 1426 | 575 | 354 | |||||
| IV | 641 | 546 | 511 | |||||
| Tumor site | Colon | 2665 | 1069 | Ref. | 0.61 | 598 | Ref. | 0.03 |
| Rectum | 1771 | 721 | 1.03 (0.93–1.13) | 455 | 1.15 (1.02−1.30) | |||
| Body mass | <18.5 | 121 | 71 | 1.74 (1.37−2.23) | 0.0006 | 41 | 1.62 (1.17−2.23) | 0.01 |
| index | 18.5−24.9 | 1592 | 675 | Ref. | 396 | Ref. | ||
| (kg/m2) | 25–29.9 | 1871 | 726 | 0.87 (0.78−0.96) | 432 | 0.88 (0.77−1.01) | ||
| ≥30 | 852 | 318 | 184 | |||||
| Diabetes | No | 3594 | 1360 | Ref. | <8 × 10−13 | 844 | Ref. | 0.09 |
| Yes | 811 | 412 | 1.50 (1.34−1.67) | 200 | 1.14 (0.98−1.33) | |||
| Regular | No | 3272 | 1298 | Ref. | 0.05 | 793 | Ref. | 0.31 |
| NSAID use | Yes | 1105 | 469 | 1.11 (1.00−1.24) | 245 | 0.93 (0.80−1.07) | ||
| Smoking | Never | 1921 | 840 | Ref. | 0.04 | 515 | Ref. | 0.07 |
| Former | 1769 | 684 | 0.90 (0.81−1.00) | 367 | 0.79 (0.69−0.90) | |||
| Current | 672 | 263 | 169 | |||||
| Alcohol | No intake | 1320 | 606 | Ref. | 0.0003 | 339 | Ref. | 0.01 |
| intake | 0.1−5.6 | 856 | 318 | 0.75 (0.66−0.86) | 202 | 0.87 (0.73−1.03) | ||
| (g/day) | 5.7−13.2 | 770 | 280 | 175 | ||||
| 13.3−28.5 | 775 | 301 | 177 | |||||
| ≥28.6 | 669 | 269 | 148 | |||||
1 Hazard ratios adjusted for age, sex, CRC stage, and tumor site. Abbreviations: HR: hazard ratio; CI: confidence interval; CRC: colorectal cancer; Ref.: reference category; NSAID: non-steroidal anti-inflammatory drug.
Figure 3(a,b) Volcano plots showing the results from survival analyses of 852 CRC patients with a BMI ≥ 30, and 863 genes with a correlation between predicted and measured expression values higher than 0.10. The blue dots indicate results for the gene ARID3B. (a) Overall survival; (b) CRC-specific survival; (c,d) cumulative probability of death due to CRC for patients with a low (lower than the median) and high (higher than the median) ARID3B expression based on individual genotypes for patients with BMI ≥ 30. The number of CRC patients at risk is shown in the lower part of each panel; (c) Aalen-Johansen probability curves in the identification cohort (“Darmkrebs: Chancen der Verhuetung durch Screening” (DACHS) study); (d) Aalen–Johansen probability curves in the validation cohort (UK Biobank). Abbreviations: CRC: colorectal cancer.