| Literature DB >> 22206016 |
Rakhshan Ihsan1, Pradeep Singh Chauhan, Ashwani Kumar Mishra, Dhirendra Singh Yadav, Mishi Kaushal, Jagannath Dev Sharma, Eric Zomawia, Yogesh Verma, Sujala Kapur, Sunita Saxena.
Abstract
Complex disease such as cancer results from interactions of multiple genetic and environmental factors. Studying these factors singularly cannot explain the underlying pathogenetic mechanism of the disease. Multi-analytical approach, including logistic regression (LR), classification and regression tree (CART) and multifactor dimensionality reduction (MDR), was applied in 188 lung cancer cases and 290 controls to explore high order interactions among xenobiotic metabolizing genes and environmental risk factors. Smoking was identified as the predominant risk factor by all three analytical approaches. Individually, CYP1A1*2A polymorphism was significantly associated with increased lung cancer risk (OR = 1.69;95%CI = 1.11-2.59,p = 0.01), whereas EPHX1 Tyr113His and SULT1A1 Arg213His conferred reduced risk (OR = 0.40;95%CI = 0.25-0.65,p<0.001 and OR = 0.51;95%CI = 0.33-0.78,p = 0.002 respectively). In smokers, EPHX1 Tyr113His and SULT1A1 Arg213His polymorphisms reduced the risk of lung cancer, whereas CYP1A1*2A, CYP1A1*2C and GSTP1 Ile105Val imparted increased risk in non-smokers only. While exploring non-linear interactions through CART analysis, smokers carrying the combination of EPHX1 113TC (Tyr/His), SULT1A1 213GG (Arg/Arg) or AA (His/His) and GSTM1 null genotypes showed the highest risk for lung cancer (OR = 3.73;95%CI = 1.33-10.55,p = 0.006), whereas combined effect of CYP1A1*2A 6235CC or TC, SULT1A1 213GG (Arg/Arg) and betel quid chewing showed maximum risk in non-smokers (OR = 2.93;95%CI = 1.15-7.51,p = 0.01). MDR analysis identified two distinct predictor models for the risk of lung cancer in smokers (tobacco chewing, EPHX1 Tyr113His, and SULT1A1 Arg213His) and non-smokers (CYP1A1*2A, GSTP1 Ile105Val and SULT1A1 Arg213His) with testing balance accuracy (TBA) of 0.6436 and 0.6677 respectively. Interaction entropy interpretations of MDR results showed non-additive interactions of tobacco chewing with SULT1A1 Arg213His and EPHX1 Tyr113His in smokers and SULT1A1 Arg213His with GSTP1 Ile105Val and CYP1A1*2C in nonsmokers. These results identified distinct gene-gene and gene environment interactions in smokers and non-smokers, which confirms the importance of multifactorial interaction in risk assessment of lung cancer.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22206016 PMCID: PMC3242784 DOI: 10.1371/journal.pone.0029431
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Association of genotypes of xenobiotic metabolizing genes and environmental risk factors with lung cancer susceptibility.
| FACTORS | CATEGORIES | GENOTYPE | CASES | CONTROLS | OR (95% C.I.) | p value | ||
| Genetic Factors | n | % | n | % | ||||
|
| TT | T6235T | 55 | 29.3 | 122 | 42.1 | 1..0 | |
| TC | T6235C | 103 | 54.8 | 124 | 42.8 | 1.69 (1.11–2.59) | 0.01 | |
| CC | C6235C | 30 | 16.0 | 44 | 15.2 | 1.53 (0.84–2.78) | 0.15 | |
|
| AA | Ile462Ile | 122 | 64.9 | 206 | 71.0 | 1..0 | |
| AG | Ile462Val | 56 | 29.8 | 77 | 26.6 | 1.16 (0.75–1.80) | 0.48 | |
| GG | Val462Val | 10 | 5.3 | 7 | 2.4 | 2.18 (0.78–6.09) | 0.13 | |
|
| TT | Tyr113Tyr | 82 | 43.6 | 94 | 32.4 | 1.0 | |
| TC | Tyr113His | 51 | 27.1 | 133 | 45.9 | 0.40 (0.25–0.65) | <0.001 | |
| CC | His113His | 55 | 29.3 | 63 | 21.7 | 1.00 (0.60–1.67) | 0.98 | |
|
| AA | His139His | 121 | 64.4 | 212 | 73.1 | 1.0 | |
| AG | His139Arg | 59 | 31.4 | 70 | 24.1 | 1.45 (0.92–2.27) | 0.10 | |
| GG | Arg139Arg | 8 | 4.3 | 8 | 2.8 | 2.41 (0.79–7.36) | 0.12 | |
|
| Wild Type | Present | 122 | 64.9 | 177 | 61.0 | 1.0 | |
| Null | Null Genotype | 66 | 35.1 | 113 | 39.0 | 0.95 (0.63–1.41) | 0.80 | |
|
| Wild Type | Present | 155 | 82.4 | 217 | 74.8 | 1.0 | |
| Null | Null Genotype | 33 | 17.6 | 73 | 25.2 | 0.62 (0.38–1.02) | 0.06 | |
|
| AA | Ile105Ile | 102 | 54.3 | 179 | 61.7 | 1.0 | |
| AG | Ile105Val | 77 | 41.0 | 96 | 33.1 | 1.46 (0.95–2.23) | 0.07 | |
| GG | Val105Val | 9 | 4.8 | 15 | 5.2 | 1.09 (0.43–2.77) | 0.84 | |
|
| GG | Arg213Arg | 123 | 65.4 | 153 | 52.8 | 1.0 | |
| GA | Arg213His | 50 | 26.6 | 116 | 40.0 | 0.51 (0.33–0.78) | 0.002 | |
| AA | His213His | 15 | 8.0 | 21 | 7.2 | 0.87 (0.42–1.82) | 0.72 | |
|
| ||||||||
|
| Non-smokers | 56 | 29.8 | 151 | 52.1 | 1.0 | ||
| Smokers | 132 | 70.2 | 139 | 47.9 | 3.06 (1.94–4.83) | <0.001 | ||
|
| Non-chewers | 92 | 48.9 | 172 | 59.3 | 1.0 | ||
| Chewers | 96 | 51.1 | 118 | 40.7 | 1.24 (0.82–1.85) | 0.293 | ||
|
| Non chewers | 52 | 27.7 | 131 | 45.2 | 1.0 | ||
| Chewers | 136 | 72.3 | 159 | 54.8 | 1.86 (1.21–2.84) | 0.004 | ||
|
| Non-alcoholic | 135 | 71.8 | 207 | 71.4 | 1.0 | ||
| Alcoholic | 53 | 28.2 | 83 | 28.6 | 0.87 (0.56–1.37) | 0.57 | ||
χ2 significant; p<0.05.
ORs adjusted for all environmental factors.
ORs adjusted for all genetic factors.
*Significant after p-value adjustment for multiple comparision (Sidak correction).
Distribution of CYP1A1 and EPHX1 haplotype frequency among lung cancer cases and controls.
| HAPLOTYPE | CASE ( 376) | CONTROL ( 580) | χ2 | P value | OR (95%CI) | P value | D' | |||
| % | n | % | n | |||||||
|
| TA | 53.34 | 201 | 60.80 | 352 | 5.00 | 0.02 | 1.00 | 0.72 | |
| TG | 3.31 | 12 | 2.65 | 16 | 0.21 | 0.64 | 1.31 (0.57–3.00) | 0.50 | ||
| CA | 26.45 | 99 | 23.51 | 137 | 0.95 | 0.32 | 1.26 (0.91–1.74) | 0.15 | ||
| CG | 16.90 | 64 | 13.04 | 75 | 2.94 | 0.08 | 1.49 (1.00–2.21) | 0.04 | ||
|
| TA | 44.79 | 168 | 45.04 | 262 | 0.05 | 0.81 | 1.00 | 0.21 | |
| TG | 12.39 | 47 | 10.31 | 59 | 1.57 | 0.20 | 1.23(0.78–1.94) | 0.30 | ||
| CA | 35.26 | 133 | 40.13 | 233 | 1.82 | 0.17 | 0.88 (0.65–1.19) | 0.42 | ||
| CG | 7.56 | 28 | 4.52 | 26 | 2.64 | 0.10 | 1.67 (0.91–3.06) | 0.07 | ||
D' Linkage Disequilibrium.
Main effects of genotypes on lung cancer risk stratified by smoking.
| Polymorphism | Genotype | Smoker | Non Smoker | ||
| Case/Control (n,%) | OR (95% C.I.),p value | Case/Control (n,%) | OR (95% C.I.),p value | ||
|
| TT | 44(33.3)/57(41.0) | 1.0 | 11(19.6)/65(28.6) | 1.0 |
| TC | 74(56.1)/61(43.9) | 1.45(0.84–2.50),0.17 | 29(51.8)/63(41.7) | 2.88(1.22–6.81),0.016 | |
| CC | 14(10.6)/21(15.1) | 0.83(0.36–1.91),0.66 | 16(28.6)/23(15.2) | 4.35(1.47–12.84),0.008 | |
|
| AA | 86(65.2)/93(66.9) | 1.0 | 36(64.3)/113(74.8) | 1.0 |
| AG | 40(30.3)/40(28.8) | 1.14(0.65–2.02),0.63 | 16(28.6)/37(24.5) | 1.53(0.67–3.48),0.30 | |
| GG | 6(4.5)/6(4.3) | 1.71(0.43–6.74),0.43 | 4(7.1)/1(0.7) | 11.81(1.24–111.98),0.03 | |
|
| TT | 60(45.5)/41(29.5) | 1.0 | 22(39.3)/53(35.1) | 1.0 |
| TC | 35(26.5)/71(51.1) | 0.34(0.18–0.63),0.001 | 16(28.6)/62(41.1) | 0.62(0.25–1.54),0.30 | |
| CC | 37(28.0)/27(19.4) | 1.14(0.57–2.29),0.69 | 18(32.1)/36(23.8) | 1.03(0.41–2.56),0.94 | |
|
| AA | 80(60.6)/103(74.1) | 1.0 | 41(73.2)/109(72.2) | 1.0 |
| AG | 48(36.4)/32(23.0) | 1.92(1.07–3.45),0.02 | 11(19.6)/38(25.2) | 0.98(0.41–2.36),0.98 | |
| GG | 4(3.0)/4(2.9) | 1.39(0.31–6.25),0.66 | 4(7.1)/4(2.6) | 4.25(0.54–33.15),0.16 | |
|
| WildType | 91(68.9)/86(61.9) | 1.0 | 31(55.4)/91(60.3) | 1.0 |
| Null | 41(31.1)/53(38.1) | 0.87(0.51–1.48),0.62 | 25(44.6)/60(39.7) | 1.25(0.61–2.54),0.53 | |
|
| WildType | 106(80.3)/104(74.8) | 1.0 | 49(87.5)/113(74.8) | 1.0 |
| Null | 26(19.7)/35(25.2) | 0.75(0.40–1.41),0.37 | 7(12.5)/38(25.2) | 0.48(0.19–1.20),0.11 | |
|
| AA | 69(52.3)/77(55.4) | 1.0 | 33(58.9)/102(67.5) | 1.0 |
| AG | 54(40.9)/55(39.6) | 1.35(0.77–2.36),0.29 | 23(41.1)/41(27.2) | 2.40(1.15–5.03),0.01 | |
| GG | 9(6.8)/7(5.0) | 1.49(0.49–4.56),0.47 | 0/8(5.3) | NA | |
|
| GG | 84 (63.6)/69 (49.6) | 1.0 | 39(69.6)/84(55.6) | 1.0 |
| GA | 35 (26.5)/58 (41.7) | 0.45(0.25–0.80),0.007 | 15(26.8)/58(38.4) | 0.54(0.24–1.19),0.13 | |
| AA | 13 (9.8)/12 (8.6) | 1.11(0.45–2.74),0.81 | 2(3.6)/9(6.0) | 0.48(0.09–2.54),0.39 | |
*p values adjusted for tobacco chewing, betel quid chewing and alcohol consumption.
Significant after p-value adjustment for multiple comparision (Sidak correction).
Figure 1Classification and regression tree model for xenobiotic metabolizing gene polymorphisms and environmental risk factors.
Terminal nodes are thick bordered. *W: Wild type genotype; V: Variant genotype, TN: Terminal Node, #p value <0.05
Results of MDR analysis.
| No. of Locus | Model | p value (χ2 test) | TBA | p-value | CVC | p-value | |
|
| |||||||
| 1st order | Smk | p < 0.0001 | 0.6114 | <0.001 | 10 | 0.391 | |
| 2nd order | Smk Ex3 | p < 0.0001 | 0.6407 | <0.001 | 10 | 0.391 | |
| 3rd order | Smk Ex3 Ex4 | p < 0.0001 | 0.6497 | <0.001 | 10 | 0.391 | |
|
|
|
|
|
|
|
| |
|
| |||||||
| 1st order | Ex3 | p < 0.0001 | 0.6228 | 0.012 | 10 | 0.402 | |
| 2nd order | Tbc Ex3 | p < 0.0001 | 0.6105 | <0.02 | 9 | 0.623 | |
|
|
|
|
|
|
|
| |
| 4th order | Tbc Alc Ex3 SULT | p < 0.0001 | 0.6268 | <0.008 | 7 | 0.846 | |
|
| |||||||
| 1st order | 2A | p = 0.0019 | 0.6170 | 0.09 | 10 | 0.372 | |
| 2nd order | 2A SULT | p = 0.0004 | 0.5562 | 0.46 | 8 | 0.734 | |
|
|
|
|
|
|
|
| |
| 4th order | 2A 2C P1 SULT | p < 0.0001 | 0.6439 | <0.021 | 10 | 0.372 |
*1,000-fold permutation test. **Best models selected with maximum cross-validation consistency (CVC) and maximum testing balance accuracy (TBA). Labels: Smk: smoking, Ex3: EPHX1 Tyr113His, Ex4: EPHX1 His139Arg, SULT: SULT1A1 Arg213His, Tbc: tobacco chewing, Alc: alcohol consumption, 2A: CYP1A1*2A, P1: GSTP1 Ile105Val, 2C: CYP1A1*2C.
False positive report probability and odds ratio for best model of MDR analysis.
| OR (95% CI) p value | OR = 1.5 | Prior Probability | OR = 2.0 | Prior Probability | |||||||||||
| Power | 10−1 | 10−2 | 10−3 | 10−4 | 10−5 | 10−6 | Power | 10−1 | 10−2 | 10−3 | 10−4 | 10−5 | 10−6 | ||
|
| 4.93 (3.32–7.33) p < 0.0001 | 0.0001 |
|
|
|
|
|
| 0.0001 |
|
|
|
|
|
|
|
| 4.55 (2.69–7.69) p < 0.0001 | 0.0001 |
|
| 0.472 | 0.900 | 0.989 | 0.989 | 0.001 |
|
|
|
| 0.588 | 0.588 |
|
| 7.32 (3.24–6.53) p < 0.0001 | 0.0001 |
| 0.708 | 0.961 | 0.996 | 1.00 | 1.00 | 0.001 |
|
| 0.650 | 0.949 | 0.995 | 0.995 |
Prior probabilities ranging from 0.1 to 10−6, with the estimated statistical power to detect an OR of 1.5 or 2.0 with α level equal to the observed p-value.
Bold type indicates the FPRP for the most likely prior probabilities i.e. a noteworthy association at the 0.2 FPRP.
Figure 2Interaction entropy graphs.
The interaction model describes the percentage of the entropy (information gain) removed by each variable (main effect: represented by nodes) and by each pairwise combination of attributes (interaction effect: represented by connections). Attributes are selected on the basis of MDR results obtained in case of (A) Smokers and (B) Non smokers. Labels: Ex3: EPHX1 Tyr113His, Alc: alcohol consumption, Tbc: Tobacco chewing, SULT: SULT1A1 Arg213His, 2A: CYP1A1*2A, 2C: CYP1A1*2C, P1: GSTP1 Ile105Val.