Noha Sharafeldin, Martha L Slattery, Qi Liu, Conrado Franco-Villalobos, Bette J Caan, John D Potter, Yutaka Yasui.
Abstract
Characterization of gene-environment interactions (GEIs) in cancer is limited. We aimed to identify GEIs in rectal cancer, focusing on a relevant biologic process, the angiogenesis pathway, and on relevant environmental exposures: cigarette smoking, alcohol consumption, and animal protein intake. We analyzed data from 747 rectal cancer cases and 956 controls from the Diet, Activity and Lifestyle as a Risk Factor for Rectal Cancer study. We applied a 3-step analysis approach: first, we searched for interactions among single nucleotide polymorphisms on the pathway genes; second, we searched for interactions among the genes, with both steps using Logic regression; third, we examined the GEIs significant at the 5% level using logistic regression for cancer risk and Cox proportional hazards models for survival. A permutation-based test was used for multiple testing adjustment. After adjusting for multiple testing, we identified 8 significant GEIs associated with risk among 6 genes: TNF (OR = 1.85, 95% CI: 1.10, 3.11), TLR4 (OR = 2.34, 95% CI: 1.38, 3.98), and EGR2 (OR = 2.23, 95% CI: 1.04, 4.78) with smoking; IGF1R (OR = 1.69, 95% CI: 1.04, 2.72), TLR4 (OR = 2.10, 95% CI: 1.22, 3.60), and EGR2 (OR = 2.12, 95% CI: 1.01, 4.46) with alcohol; and PDGFB (OR = 1.75, 95% CI: 1.04, 2.92) and MMP1 (OR = 2.44, 95% CI: 1.24, 4.81) with protein. Five GEIs were associated with survival at the 5% significance level but not after multiple testing adjustment: CXCR1 (HR = 2.06, 95% CI: 1.13, 3.75) with smoking; and KDR (HR = 4.36, 95% CI: 1.62, 11.73), TLR2 (HR = 9.06, 95% CI: 1.14, 72.11), EGR2 (HR = 2.45, 95% CI: 1.42, 4.22), and EGFR (HR = 6.33, 95% CI: 1.95, 20.54) with protein. GEIs between angiogenesis genes and smoking, alcohol, and animal protein impact rectal cancer risk. Our results support the importance of considering the biologic hypothesis to characterize GEIs associated with cancer outcomes.
Keywords: angiogenesis; cancer risk; cancer survival; candidate gene-pathway; gene-environment interactions; rectal cancer
Year: 2017 PMID: 28956832 PMCID: PMC5664647 DOI: 10.3390/ijerph14101146
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
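The abstract states that a permutation-based test was used for multiple testing adjustment but does not describe the procedure. The following is a minimal sketch of one common variant, a single-step max-statistic permutation adjustment, and is not necessarily the authors' method. The function name `max_stat_adjusted_p`, the choice of test statistic (absolute difference in carrier proportion between cases and controls), and the toy data are all assumptions made for illustration.

```python
import random


def max_stat_adjusted_p(labels, features, n_perm=2000, seed=0):
    """Single-step max-statistic permutation adjustment (illustrative only).

    labels   : list of 0/1 case-control indicators
    features : list of 0/1 lists, one per hypothesis being tested
    Returns one adjusted p-value per feature.
    """
    def stat(lab, feat):
        # Absolute difference in carrier proportion, cases vs. controls.
        n_cases = sum(lab)
        n_controls = len(lab) - n_cases
        carriers_cases = sum(f for f, y in zip(feat, lab) if y == 1)
        carriers_controls = sum(feat) - carriers_cases
        return abs(carriers_cases / n_cases - carriers_controls / n_controls)

    observed = [stat(labels, f) for f in features]
    rng = random.Random(seed)
    permuted = list(labels)
    exceed = [0] * len(features)
    for _ in range(n_perm):
        rng.shuffle(permuted)  # break any label-feature association
        max_stat = max(stat(permuted, f) for f in features)
        for j, obs in enumerate(observed):
            if max_stat >= obs:
                exceed[j] += 1
    # The +1 terms count the observed data as one of the permutations.
    return [(e + 1) / (n_perm + 1) for e in exceed]


# Hypothetical toy data: 20 cases followed by 20 controls.
labels = [1] * 20 + [0] * 20
feature_a = list(labels)
feature_a[0], feature_a[39] = 0, 1          # strongly associated with case status
feature_b = [i % 2 for i in range(40)]      # balanced across cases and controls
adjusted = max_stat_adjusted_p(labels, [feature_a, feature_b])
print(adjusted)
```

Comparing each observed statistic with the permutation distribution of the maximum across all tests controls the family-wise error rate without assuming the tests are independent, which matters when several interaction terms share the same genes or exposures.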
Figure 1. Working Figure of the Angiogenesis Pathway Genes. Key gene components of pathway in blue frames: VEGF = vascular endothelial growth factor; FLT1 = vascular endothelial growth factor receptor 1; KDR = vascular endothelial growth factor receptor 2; HIF-1 = hypoxia-inducible factor 1; PDGF = platelet-derived growth factor; TEK = TEK receptor tyrosine kinase; TGFβ = Transforming growth factor, beta; TGFβR = Transforming growth factor, beta receptor; IGFR = insulin-like growth factor receptor. Secondary interacting genes of pathway in black frames: NF-kB = nuclear factor of kappa light polypeptide gene enhancer in B-cells; CXCL8 = C-X-C motif chemokine ligand 8; CXCR1 = C-X-C motif chemokine receptor 1; CXCR2 = C-X-C motif chemokine receptor 2; IL-1A = interleukin-1, alpha; IL-1B = interleukin-1, beta; TNF = tumor necrosis factor; MMPs (MMP1, MMP3, MMP7, MMP9) = matrix metallopeptidases; BMPs (BMP1, BMP2, BMP4, BMPR1A, BMPR1B, BMPR2, GDF10) = bone morphogenetic proteins; TLRs (TLR2, TLR3, TLR4) = toll-like receptors; EGR2 = early growth response 2; EGFR = epidermal growth factor receptor; IRS1 = insulin receptor substrate 1; VDR = Vitamin D Receptor. Environmental factors in green text.
Summary of the 3-step candidate pathway gene-environment interaction approach.
| Analysis Step | Interaction of Interest | Variable of Interest | Model | Specific Procedures | Product |
|---|---|---|---|---|---|
| Step 1: Summarize gene effects | SNP-set interaction within gene | SNPs on each gene separately | Logic regression with logit link/fitting exponential survival models | Cross-validation to determine optimal model size | Gene-specific trees (GSTs) |
| Step 2: Summarize pathway effects | Gene-set interaction within pathway | All GSTs on the pathway | Logic regression with logit link/fitting exponential survival models | Cross-validation to determine optimal model size | Pathway Trees |
| Step 3: Test gene-environment interaction | Gene-environment interaction within pathway | a. Sub-pathway specific GSTxE * b. Full pathway GSTxE *,§ | Logistic regression model §/Cox Proportional Hazards model ¥ | Statistical significance testing | Pathway GEIs |
* GSTxE, gene-specific tree—environment interaction; § Models adjusted for age, sex, race, study center, pathway trees; ¥ Models adjusted for age, sex, race, study center, pathway tree, stratified by cancer stage.
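Step 3 tests each gene-environment interaction with a logistic regression model. As a rough illustration of what the interaction term measures, the sketch below computes a crude (unadjusted) interaction odds ratio from stratified 2x2 counts. In a saturated logistic model with a binary gene-specific tree indicator, a binary exposure, and their product, the exponentiated product coefficient equals the ratio of the two stratum-specific odds ratios, and its Wald standard error on the log scale is the square root of the summed reciprocal cell counts. The counts and function names here are hypothetical, and the study's models were additionally adjusted for age, sex, race, study center, and pathway trees, so this does not reproduce the published estimates.

```python
import math


def odds_ratio(a, b, c, d):
    """OR from a 2x2 table.

    a = carrier cases, b = carrier controls,
    c = non-carrier cases, d = non-carrier controls.
    """
    return (a * d) / (b * c)


def interaction_or(unexposed, exposed, z=1.96):
    """Crude multiplicative interaction OR with a Wald 95% CI.

    unexposed, exposed: (a, b, c, d) tuples for each exposure stratum.
    """
    or_unexposed = odds_ratio(*unexposed)
    or_exposed = odds_ratio(*exposed)
    or_int = or_exposed / or_unexposed
    # Variance of log(OR_int) in the saturated model: sum of 1/count over 8 cells.
    se = math.sqrt(sum(1.0 / n for n in unexposed + exposed))
    lower = math.exp(math.log(or_int) - z * se)
    upper = math.exp(math.log(or_int) + z * se)
    return or_unexposed, or_exposed, or_int, (lower, upper)


# Hypothetical counts, not taken from the study.
low_exposure = (40, 50, 60, 50)
high_exposure = (60, 40, 40, 60)
or_low, or_high, or_int, ci = interaction_or(low_exposure, high_exposure)
print(f"OR (low) = {or_low:.2f}, OR (high) = {or_high:.2f}, "
      f"OR_INT = {or_int:.2f}, 95% CI = ({ci[0]:.2f}, {ci[1]:.2f})")
```

An interaction OR above 1 means the gene effect is stronger in the exposed stratum than in the reference stratum, which is how the "Ref" rows in the results tables should be read.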
Demographic characteristics of study participants.
| Characteristic | Rectal Cancer Cases (n = 747) | Controls (n = 956) | p-Value |
|---|---|---|---|
| Age (Mean, SD) | 61.2 (10.8) | 62.1 (10.6) | 0.09 |
| Sex | |||
| Male | 447 (59.8%) | 542 (56.7%) | |
| Female | 300 (40.2%) | 414 (43.3%) | 0.19 |
| Race | |||
| White non-Hispanic | 617 (82.6%) | 821 (85.9%) | |
| Other | 130 (17.4%) | 135 (14.1%) | 0.06 |
| Education | |||
| ≤High School | 266 (35.6%) | 324 (33.9%) | |
| >High School | 481 (64.4%) | 632 (66.1%) | 0.46 |
| Marital Status | |||
| Married | 556 (74.4%) | 730 (76.4%) | |
| Other | 191 (25.6%) | 226 (23.6%) | 0.36 |
| Annual Income | |||
| ≤30 K | 206 (29.9%) | 238 (27.4%) | |
| >30 K | 483 (70.1%) | 632 (72.6%) | 0.27 |
| Cigarette Smoking | |||
| Non-smoker | 346 (46.3%) | 485 (50.7%) | |
| ≤20 pack-years | 158 (21.2%) | 240 (25.1%) | |
| >20 pack-years | 243 (32.5%) | 231 (24.2%) | 0.001 |
| Alcohol | |||
| Non/Moderate | 556 (74.4%) | 759 (79.4%) | |
| Heavy | 191 (25.6%) | 197 (20.6%) | 0.02 |
| Animal/Vegetable Protein Ratio | |||
| Low | 242 (32.4%) | 363 (38.0%) | |
| High | 505 (67.6%) | 593 (62.0%) | 0.02 |
| Center | |||
| Utah | 270 (36.1%) | 366 (38.3%) | |
| Northern California | 477 (63.9%) | 590 (61.7%) | 0.37 |
| Cancer Stage | |||
| In-situ | 20 (2.7%) | ||
| Local | 395 (52.9%) | ||
| Regional | 255 (34.1%) | ||
| Distant | 63 (8.4%) | ||
| Unknown | 14 (1.9%) |
Effects of gene-environment interactions significant at 5% level between gene-specific trees and environmental factors on rectal cancer risk.
| GST | Gene | Chr. | Cases (%) | Control (%) | Gene OR a (95% CI) | Env. Factor | Category | N (%) b | Gene OR by Env. Factor (95% CI) | ORINT c (95% CI) | PINT d |
|---|---|---|---|---|---|---|---|---|---|---|---|
| rs4821877 (CC or CT) | PDGFB | 22q13.1 | 610 (80.7%) | 746 (77.6%) | 1.21 (0.95, 1.54) | Protein e | Low | 612 (35.6%) | 0.85 (0.57, 1.26) | Ref | |
| | | | | | | | High | 1106 (64.4%) | 1.47 (1.08, 2.00) | 1.75 (1.04, 2.92) | 0.034 |
| rs2139924 (AA) | IGF1R | 15q26.3 | 243 (30.4%) | 287 (28.5%) | 0.93 (0.77, 1.00) | Alcohol | Non/Moderate | 1396 (77.4%) | 0.82 (0.66, 1.02) | Ref | |
| | | | | | | | Heavy | 408 (22.6%) | 1.36 (0.91, 2.05) | 1.69 (1.04, 2.72) | 0.033 |
| rs1800630 (CA or AA) | TNF | 6p21.33 | 240 (31.8%) | 267 (27.8%) | 1.19 (0.96, 1.47) | Smoking | Non | 834 (48.7%) | 0.95 (0.70, 1.30) | Ref | |
| | | | | | | | <20 PY | 400 (23.4%) | 1.13 (0.71, 1.81) | 1.14 (0.65, 2.01) | 0.644 |
| | | | | | | | ≥20 PY | 477 (27.9%) | 1.68 (1.11, 2.54) | 1.85 (1.10, 3.11) | 0.021 |
| rs470215 (TT or TC) | MMP1 | 11q22.2 | 715 (90.3%) | 880 (87.9%) | 1.23 (0.89, 1.69) | Protein e | Low | 640 (35.7%) | 0.67 (0.40, 1.14) | Ref | |
| | | | | | | | High | 1153 (64.3%) | 1.78 (1.18, 2.70) | 2.44 (1.24, 4.81) | 0.010 |
| rs1927911 (CC) | TLR4 | 9q33.1 | 396 (52.4%) | 495 (51.5%) | 0.93 (0.76, 1.15) | Smoking | Non | 834 (48.7%) | 0.80 (0.59, 1.08) | Ref | |
| | | | | | | | <20 PY | 400 (23.4%) | 0.65 (0.41, 1.04) | 0.99 (0.57, 1.74) | 0.980 |
| rs11536889 (GG) | | | 546 (72.2%) | 684 (71.1%) | | | ≥20 PY | 477 (27.9%) | 1.33 (0.90, 1.98) | 2.34 (1.38, 3.98) | 0.002 |
| rs1927911 (CT or TT) | TLR4 | 9q33.1 | 360 (47.6%) | 467 (48.5%) | 1.07 (0.87, 1.32) | Alcohol | Non/Moderate | 1326 (77.2%) | 0.95 (0.75, 1.21) | Ref | |
| rs11536889 (GC or CC) | | | 210 (27.8%) | 278 (28.9%) | | | Heavy | 391 (22.8%) | 1.58 (1.01, 2.47) | 2.10 (1.22, 3.60) | 0.007 |
| rs2295814 (GA or AA) | EGR2 | 10q21.3 | 106 (14.0%) | 115 (12.0%) | 1.11 (0.83, 1.49) | Smoking | Non | 834 (48.7%) | 0.90 (0.58, 1.37) | Ref | |
| | | | | | | | <20 PY | 400 (23.4%) | 1.21 (0.65, 2.28) | 1.84 (0.83, 4.09) | 0.130 |
| | | | | | | | ≥20 PY | 477 (27.9%) | 1.53 (0.89, 2.65) | 2.23 (1.04, 4.78) | 0.040 |
| rs2295814 (GG) | EGR2 | 10q21.3 | 650 (86.0%) | 847 (88.0%) | 0.90 (0.67, 1.20) | Alcohol | Non/Moderate | 1326 (77.2%) | 0.81 (0.57, 1.14) | Ref | |
| | | | | | | | Heavy | 391 (22.8%) | 1.21 (0.69, 2.11) | 2.12 (1.01, 4.46) | 0.048 |
Abbreviations: GST, Gene-Specific Tree; Chr., Chromosome; Env., Environmental; PY, pack-years; OR, odds ratio; CI, confidence interval. a Gene odds ratios were adjusted for age, sex, race, study center, pathway trees; b N (%) frequency of the environmental variable within subjects with the GST; c ORINT: Interaction Odds Ratio; d PINT: Interaction p-value; e Animal/Vegetable Protein Ratio.
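The interaction p-values in the risk table can be sanity-checked against the reported interaction odds ratios and confidence intervals. The sketch below assumes the intervals are symmetric Wald intervals on the log-odds scale, which is an assumption about how they were computed rather than something stated in the source. It backs out the standard error from the interval width and recomputes a two-sided p-value. The function name `wald_p_from_ci` is hypothetical, and the three input rows are copied from the reported results.

```python
import math


def wald_p_from_ci(estimate, lower, upper, z_crit=1.959964):
    """Recover SE, z, and two-sided p from a ratio estimate and its 95% CI.

    Assumes a Wald interval that is symmetric on the log scale.
    """
    se = (math.log(upper) - math.log(lower)) / (2 * z_crit)
    z = math.log(estimate) / se
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail probability
    return se, z, p


# (OR_INT, lower, upper, reported P_INT) as given in the reported results.
reported = {
    "PDGFB x protein (high)": (1.75, 1.04, 2.92, 0.034),
    "TNF x smoking (>=20 PY)": (1.85, 1.10, 3.11, 0.021),
    "MMP1 x protein (high)": (2.44, 1.24, 4.81, 0.010),
}

recomputed = {}
for label, (or_int, lo, hi, p_reported) in reported.items():
    se, z, p = wald_p_from_ci(or_int, lo, hi)
    recomputed[label] = p
    print(f"{label}: SE = {se:.3f}, z = {z:.2f}, "
          f"recomputed p = {p:.3f}, reported p = {p_reported:.3f}")
```

Small differences between recomputed and reported p-values are expected because the published odds ratios and interval limits are rounded to two decimals.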
Effects of gene-environment interactions significant at 5% level between gene-specific trees and environmental factors on rectal cancer survival.
| GST | Gene | Chr. | Cases (%) | Gene HR a (95% CI) | Env. Factor | Category | N (%) b | Gene HR by Env. Factor a (95% CI) | HRINT c (95% CI) | PINT d |
|---|---|---|---|---|---|---|---|---|---|---|
| rs6838752 (TT or TC) | KDR | 4q12 | 705 (93.6%) | 0.89 (0.55, 1.45) | Protein e | Low | 258 (32.4%) | 0.44 (0.21, 0.91) | Ref | |
| | | | | | | High | 538 (67.6%) | 1.43 (0.73, 2.83) | 4.12 (1.52, 11.13) | 0.005 |
| rs1008562 (GG) | CXCR1 | 2q35 | 211 (27.9%) | 1.17 (0.89, 1.53) | Smoking | Non | 348 (46.2%) | 1.04 (0.68, 1.60) | Ref | |
| | | | | | | <20 PY | 160 (21.2%) | 0.88 (0.44, 1.75) | 0.96 (0.46, 1.98) | 0.905 |
| | | | | | | ≥20 PY | 245 (32.5%) | 1.88 (1.20, 2.95) | 2.05 (1.12, 3.76) | 0.019 |
| rs7656411 (GG) | TLR2 | 4q31.3 | 61 (8.1%) | 0.83 (0.48, 1.44) | Protein e | Low | 244 (32.3%) | 0.13 (0.02, 0.98) | Ref | |
| | | | | | | High | 512 (67.7%) | 1.33 (0.74, 2.38) | 8.69 (1.09, 69.12) | 0.041 |
| rs224082 (GA or AA) | EGR2 | 10q21.3 | 455 (60.2%) | 0.72 (0.56, 0.92) | Protein e | Low | 244 (32.3%) | 0.39 (0.25, 0.62) | Ref | |
| | | | | | | High | 512 (67.7%) | 0.93 (0.68, 1.23) | 2.41 (1.40, 4.15) | 0.002 |
| rs17151957 (AA) | EGFR | 7p11.2 | 41 (6.5%) | 1.82 (1.16, 2.88) | Protein e | Low | 244 (32.3%) | 0.54 (0.19, 1.53) | Ref | |
| | | | | | | High | 512 (67.7%) | 3.37 (1.95, 5.82) | 5.84 (1.80, 18.94) | 0.003 |
Abbreviations: GST, Gene-Specific Tree; Chr., Chromosome; Env., Environmental; PY, pack-years; HR, hazard ratio; CI, confidence interval. a Gene hazard ratios were adjusted for age, sex, race, study center, pathway tree, baseline hazard stratified by cancer stage; b N (%) frequency of the environmental variable within subjects with the GST; c HRINT: Interaction hazards ratio; d PINT: Interaction p-value; e Animal/Vegetable Protein Ratio.