| Literature DB >> 24391818 |
Evangelina López de Maturana1, Yuanqing Ye2, M Luz Calle3, Nathaniel Rothman4, Víctor Urrea3, Manolis Kogevinas5, Sandra Petrus6, Stephen J Chanock4, Adonina Tardón7, Montserrat García-Closas4, Anna González-Neira1, Gemma Vellalta8, Alfredo Carrato9, Arcadi Navarro10, Belén Lorente-Galdós11, Debra T Silverman4, Francisco X Real12, Xifeng Wu2, Núria Malats1.
Abstract
The relationship between inflammation and cancer is well established in several tumor types, including bladder cancer. We performed an association study between 886 inflammatory-gene variants and bladder cancer risk in 1,047 cases and 988 controls from the Spanish Bladder Cancer (SBC)/EPICURO Study. A preliminary exploration with the widely used univariate logistic regression approach did not identify any significant SNP after correcting for multiple testing. We further applied two more comprehensive methods to capture the complexity of bladder cancer genetic susceptibility: Bayesian Threshold LASSO (BTL), a regularized regression method, and AUC-Random Forest, a machine-learning algorithm. Both approaches explore the joint effect of markers. BTL analysis identified a signature of 37 SNPs in 34 genes showing an association with bladder cancer. AUC-RF detected an optimal predictive subset of 56 SNPs. 13 SNPs were identified by both methods in the total population. Using resources from the Texas Bladder Cancer study we were able to replicate 30% of the SNPs assessed. The associations between inflammatory SNPs and bladder cancer were reexamined among non-smokers to eliminate the effect of tobacco, one of the strongest and most prevalent environmental risk factor for this tumor. A 9 SNP-signature was detected by BTL. Here we report, for the first time, a set of SNP in inflammatory genes jointly associated with bladder cancer risk. These results highlight the importance of the complex structure of genetic susceptibility associated with cancer risk.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24391818 PMCID: PMC3877090 DOI: 10.1371/journal.pone.0083745
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Characteristic profile of the studied population.
| Total population | Non-smoker subset | ||||||||
| Cases | Controls | Cases | Controls | ||||||
| n = 1047 | (%) | n = 988 | % | n = 147 | % | n = 277 | % | ||
|
| |||||||||
| Male | 915 | (87) | 873 | (88) | 52 | (35) | 180 | (65) | |
| Female | 132 | (13) | 115 | (12) | 95 | (65) | 97 | (35) | |
|
| |||||||||
| Never smokers | 147 | (14) | 277 | (28) | 147 | (100) | 277 | (100) | |
| Occasional smokers | 44 | (4) | 79 | (8) | |||||
| Former smokers | 400 | (38) | 361 | (37) | |||||
| Current smokers | 450 | (43) | 267 | (27) | |||||
|
| |||||||||
| Barcelona | 182 | (17) | 196 | (20) | 22 | (15) | 45 | (16) | |
| Vallès/Bages | 171 | (16) | 157 | (16) | 21 | (14) | 38 | (14) | |
| Elche | 77 | (7) | 79 | (8) | 15 | (10) | 27 | (10) | |
| Asturias | 180 | (17) | 146 | (15) | 26 | (18) | 35 | (13) | |
| Tenerife | 437 | (42) | 410 | (41) | 63 | (43) | 132 | (48) | |
Figure 1Histogram of the posterior probabilities of having a positive (negative) SNP effect by Bayesian Threshold LASSO model (BTL) in the total population.
The dot point line indicates the cut-off point of 80% above which SNPs were considered.
Risk estimates of Bayesian Threshold LASSO model (BTL), considering a posterior probability higher than 80%, and from logistic regression for the total population.
| SNP | Gene | Type | Position | Alleles | ORaa_AA
| Post prob | ORaa_AA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| rs3091312 |
| downstream | 3p21.31 | A/T | 0.86 | 90.93 | 0.69 | 0.013 |
|
|
|
|
|
|
|
|
|
|
| rs2236757 |
| intronic | 21q22.11 | A/G | 1.16 | 89.77 | 1.43 | 0.015 |
| rs8193036 |
| upstream | 6p12.2 | C/T | 0.88 | 88.50 | 0.74 | 0.048 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| rs150126 |
| intronic | 6q15 | G/A | 0.88 | 87.74 | 0.68 | 0.011 |
| rs3789928 |
| intronic | 10q24.1 | G/C | 0.88 | 87.63 | 0.78 | 0.069 |
| rs7209435 |
| intronic | 17q23.3 | T/C | 1.13 | 86.29 | 1.29 | 0.095 |
| rs2020902 |
| splice site | 1p36.21 | T/C | 0.88 | 86.19 | 0.61 | 0.007 |
|
|
|
|
|
|
|
|
|
|
| rs7101 |
| 5’ UTR | 14q24.3 | C/T | 0.89 | 85.93 | 0.71 | 0.023 |
| rs12357751 |
| intronic | 10q24.1 | C/T | 1.14 | 85.67 | 1.36 | 0.040 |
|
|
|
|
|
|
|
|
|
|
| rs2737191 |
| upstream | 9q33.1 | A/G | 1.12 | 84.53 | 1.28 | 0.088 |
|
|
|
|
|
|
|
|
|
|
| rs10878176 |
| intronic | 12q14.2 | G/C | 0.90 | 83.88 | 0.70 | 0.014 |
| rs17226566 |
| intronic | 8q21.11 | T/C | 0.90 | 83.47 | 0.77 | 0.108 |
| rs744120 |
| upstream | 17q25.3 | C/G | 0.90 | 83.27 | 0.69 | 0.014 |
| rs723279 |
| intronic | 18q22.2 | G/A | 1.12 | 83.16 | 1.28 | 0.097 |
| rs13428 |
|
| 13q12.12 | G/C | 1.11 | 83.11 | 1.26 | 0.094 |
| rs11046349 |
| 3’ UTR | 12p13.31 | T/G | 0.91 | 82.17 | 0.60 | 0.014 |
| rs11602147 |
| intronic | 11q22.2 | C/G | 1.10 | 81.98 | 1.23 | 0.148 |
| rs1063169 |
| intronic | 14q24.3 | G/T | 1.12 | 81.88 | 1.56 | 0.018 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| rs17461269 |
| intronic | 4q31.21 | T/A | 0.91 | 81.22 | 0.76 | 0.072 |
| rs3765535 |
| intronic | 13q32.1 | A/G | 1.12 | 81.21 | 1.41 | 0.091 |
| rs11888 |
| 3’ UTR | 19p13.11 | T/C | 0.92 | 81.15 | 0.84 | 0.212 |
| rs10882755 |
| intronic | 10q24.1 | A/G | 1.13 | 80.76 | 1.41 | 0.058 |
|
|
|
|
|
|
|
|
|
|
| rs8049804 |
| intergenic | 17q22 | A/C | 1.10 | 80.33 | 1.18 | 0.239 |
| rs4871857 |
| non synonymous coding | 8p21.3 | G/C | 0.93 | 80.22 | 0.80 | 0.082 |
a Posterior mean of the OR, calculated from the BTL analyses Similar values for the median were obtained for each SNP.
b It corresponds to , where .
c OR obtained from the adjusted logistic regression.
d p-value of the trend obtained from the adjusted logistic regression.
SNPs also selected by AUC-RF are bold-faced.
Relative variable importance for the top 12 polymorphisms selected by AUC-RF in the total population.
| rs number | Gene | Type | Alleles | Position | Relative variable importance |
| rs2286662 |
| non synonymous coding | A/G | 19p13.11 | 20.8% |
| rs8192284 |
| non_synonymous coding | A/C | 1q21.3 | 16.4% |
| rs7104333 |
| downstream | A/G | 11q12.2 | 16.3% |
| rs288980 |
| intronic | C/T | 18q11.2 | 15.7% |
| rs3087455 |
| intronic | A/C | 4q35.1 | 15.6% |
| rs11655650 |
| intronic | C/T | 17q25.3 | 15.6% |
| rs3213427 |
| 3’ UTR | T/C | 12p13.31 | 15.3% |
| rs3136701 |
| intronic | C/G | 1p13.1 | 15.0% |
| rs4765621 |
| intronic | G/A | 12q24.31 | 14.8% |
| rs5498 |
| coding unknown | A/G | 19p13.2 | 14.8% |
| rs2839488 |
| intronic | C/G | 21q22.3 | 14.6% |
| rs1937845 |
| 5’ UTR | G/A | 10p15.1 | 14.4% |
Calculated by dividing the raw variable importance measurement by that with the highest MDG, that of smoking status.
Figure 2Venn diagrams showing the overlapping between the SNPs selected by Bayesian Threshold model (BTL) and AUC-Random Forest (AUC-RF).
(A) Number of SNPs detected by each method in the total population. (B) Number of SNPs detected by each method in the non-smoker subset. (C) Number of common SNPs detected by BTL in the total population and non-smoker subset, with posterior probabilities of at least 80% and 75% of having an effect different from 0. (D) Number of SNPs detected by AUC-RF in both the total population and the non-smoker subset.
Risk estimates from Bayesian Threshold LASSO model (BTL), considering a posterior probability of 75%, and from logistic regression analyses among non-smokers.
| rs number | Gene | Type | Position | Alleles | ORaa_AA
| Post prob | ORaa_AA
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| rs20432 |
| intronic | 1q31.1 | T/G | 0.91 | 75.17 | 0.28 | 0.003 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Posterior mean of the OR, calculated from the BTL analyses. Similar values for the median were obtained for each SNP.
It corresponds to , where .
OR obtained from the adjusted logistic regression.
p-value of the trend obtained from the adjusted logistic regression.
SNPs also selected by AUC-RF are bold-faced.