| Literature DB >> 23626673 |
Giuseppe Matullo1, Simonetta Guarrera, Marta Betti, Giovanni Fiorito, Daniela Ferrante, Floriana Voglino, Gemma Cadby, Cornelia Di Gaetano, Fabio Rosa, Alessia Russo, Ari Hirvonen, Elisabetta Casalone, Sara Tunesi, Marina Padoan, Mara Giordano, Anna Aspesi, Caterina Casadio, Francesco Ardissone, Enrico Ruffini, Pier Giacomo Betta, Roberta Libener, Roberto Guaschino, Ezio Piccolini, Monica Neri, Arthur W B Musk, Nicholas H de Klerk, Jennie Hui, John Beilby, Alan L James, Jenette Creaney, Bruce W Robinson, Sutapa Mukherjee, Lyle J Palmer, Dario Mirabelli, Donatella Ugolini, Stefano Bonassi, Corrado Magnani, Irma Dianzani.
Abstract
Asbestos exposure is the main risk factor for malignant pleural mesothelioma (MPM), a rare aggressive tumor. Nevertheless, only 5-17% of those exposed to asbestos develop MPM, suggesting the involvement of other environmental and genetic risk factors. To identify the genetic risk factors that may contribute to the development of MPM, we conducted a genome-wide association study (GWAS; 370,000 genotyped SNPs, 5 million imputed SNPs) in Italy, among 407 MPM cases and 389 controls with a complete history of asbestos exposure. A replication study was also undertaken and included 428 MPM cases and 1269 controls from Australia. Although no single marker reached the genome-wide significance threshold, several associations were supported by haplotype-, chromosomal region-, gene- and gene-ontology process-based analyses. Most of these SNPs were located in regions reported to harbor aberrant alterations in mesothelioma (SLC7A14, THRB, CEBP350, ADAMTS2, ETV1, PVT1 and MMP14 genes), causing at most a 2-3-fold increase in MPM risk. The Australian replication study showed significant associations in five of these chromosomal regions (3q26.2, 4q32.1, 7p22.2, 14q11.2, 15q14). Multivariate analysis suggested an independent contribution of 10 genetic variants, with an Area Under the ROC Curve (AUC) of 0.76 when only exposure and covariates were included in the model, and of 0.86 when the genetic component was also included, with a substantial increase of asbestos exposure risk estimation (odds ratio, OR: 45.28, 95% confidence interval, CI: 21.52-95.28). These results showed that genetic risk factors may play an additional role in the development of MPM, and that these should be taken into account to better estimate individual MPM risk in individuals who have been exposed to asbestos.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23626673 PMCID: PMC3634031 DOI: 10.1371/journal.pone.0061253
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary statistics of all the subjects included in the Italian GWAS.
| CASALE M. | TURIN | GENOA | ALL SAMPLE | |||||
| Cases | Controls | Cases | Controls | Cases | Controls | Cases | Controls | |
| N (%) | N (%) | N (%) | N (%) | N (%) | N (%) | N (%) | N (%) | |
| Eligible | 241 (48.88) | 252 (51.12) | 91 (61.9) | 56 (38.1) | 75 (48.08) | 81 (51.92) | 407 (51.13) | 389 (48.87) |
| After QC filtering | 230 (49.25) | 237 (50.75) | 89 (61.81) | 55 (38.19) | 73 (49.32) | 75 (50.68) | 392 (51.65) | 367 (48.35) |
|
| ||||||||
|
| 155 (67.39) | 162 (68.35) | 62 (69.66) | 38 (69.09) | 67 (91.78) | 56 (74.67) | 284 (72.45) | 256 (69.75) |
|
| 75 (32.61) | 75 (31.65) | 27 (30.34) | 17 (30.91) | 6 (8.22) | 19 (25.33) | 108 (27.55) | 111 (30.25) |
|
| ||||||||
|
| 204 (90.27) | 186 (78.81) | 62 (69.66) | 35 (63.64) | 54 (78.26) | 53 (76.81) | 320 (83,33) | 274 (76,11) |
|
| 6 (2.65) | 12 (5.08) | 5 (5.62) | 1 (1.82) | 8 (11.59) | 5 (7.25) | 19 (4,95) | 18 (5) |
|
| 14 (6.19) | 33 (13.98) | 16 (17.98) | 17 (30.91) | 4 (5.8) | 4 (5.8) | 34 (8.85) | 54 (15) |
|
| 0 (0) | 2 (0.85) | 3 (3.37) | 2 (3.64) | 1 (1.45) | 3 (4.35) | 4 (1.04) | 7 (1.94) |
|
| 2 (0.88) | 3 (1.27) | 3 (3.37) | 0 (0) | 2 (2.9) | 4 (5.8) | 7 (1.82) | 7 (1.94) |
|
| ||||||||
|
| 4 (2.06) | 54 (22.78) | 3 (3.37) | 18 (32.73) | 10 (13.7) | 41 (54.67) | 17 (4.87) | 113 (30.79) |
|
| 106 (54.64) | 103 (43.46) | 33 (37.08) | 25 (45.45) | 7 (9.59) | 22 (29.33) | 146 (41.01) | 150 (40.87) |
|
| 84 (43.3) | 80 (33.76) | 53 (59.55) | 12 (21.82) | 56 (76.71) | 12 (16) | 193 (54.21) | 104 (28.34) |
|
| 66.46±10.81 | 66.42±12.26 | 68.53±9.28 | 68.70±7.69 | 64.16±13.70 | 63.44±14.47 | 66.5±11.01 | 66.12±12.06 |
Figure 1Manhattan plot of genotyped SNPs from logistic additive model.
A) all samples, B) exposed samples.
Italian top 12 genotyped SNP list (2-tailed logistic regression, n = 759 overall, n = 593 exposed only).
| CHR Location | SNP | Ref. Allele | OR (95% CI) | P | Typed | Gene Name | Left Gene | Right Gene | Group |
| 6q21 | rs742109 | A | 0.55(0.43–0.71) | 2.70×10−6 | Genotyped |
|
| OVERALL | |
| 3q26.2 | rs7632718 | A | 1.83(1.42–2.37) | 3.71×10−6 | Genotyped |
|
|
| EXPOSED |
| 3p24.2 | rs9833191 | C | 0.54(0.41–0.71) | 7.67×10−6 | Genotyped |
|
|
| EXPOSED |
| 5q23.1 | rs1508805 | A | 1.85(1.41–2.44) | 1.04×10−5 | Genotyped |
|
| EXPOSED | |
| 1q25.2 | rs2501618 | A | 2.18(1.53–3.10) | 1.49×10−5 | Genotyped |
|
|
| EXPOSED |
| 5q35.3 | rs4701085 | G | 1.84(1.39–2.44) | 1.93×10−5 | Genotyped |
|
|
| EXPOSED |
| 4q22.1 | rs4290865 | A | 1.98(1.44–2.71) | 2.16×10−5 | Genotyped |
|
| EXPOSED | |
| 13q14.3 | rs9536579 | A | 0.54(0.40–0.72) | 2.33×10−5 | Genotyped |
|
| OVERALL | |
| 7p21.2 | rs3801094 | A | 1.75(1.35–2.27) | 2.52×10−5 | Genotyped |
|
|
| OVERALL |
| 8q24.21 | rs7841347 | A | 0.60(0.47–0.76) | 2.60×10−5 | Genotyped |
|
|
| OVERALL |
| 15q21.1 | rs10519201 | A | 2.36(1.57–3.56) | 3.82×10−5 | Genotyped |
|
|
| EXPOSED |
| 22q12.3 | rs5756444 | G | 0.60(0.47–0.76) | 3.95×10−5 | Genotyped |
|
| EXPOSED |
Region-, Gene- and GO process-based analysis on top SNPs (1-tailed binomial test, n = 759, alpha 0.0025, alpha = 0.01, alpha = 0.025, respectively).
| Region/Gene/GO processes based | Cytogenetic Band | Position (from - to) | Number of SNPs | Significant SNPs | P |
| - | 1q25.2 | (178192161–178267165) | 5 | 4 | 8.31×10−4 |
| - | 3p24.2 | (24311166–24397755) | 17 | 7 | 3.86×10−4 |
| - | 3q26.2 | (171668688–171738200) | 12 | 6 | 9.47×10−5 |
| - | 4q22.1 | (92842088–92925574) | 11 | 3 | 0.05 |
| - | 4q32.1 | (160680345–160763147) | 11 | 3 | 0.04 |
| - | 5q23.1 | (120950796–121034917) | 11 | 3 | 0.08 |
| - | 5q35.2 | (173515657–173599925) | 16 | 4 | 7.23×10−3 |
| - | 5q35.3 | (178559043–178654962) | 19 | 5 | 0.01 |
| - | 6q21 | (106656091–106738553) | 18 | 5 | 8.00×10−3 |
| - | 7p21.2 | (13877273–13974190) | 20 | 6 | 4.36×10−3 |
| - | 7p22.2 | (4339181–4436371) | 17 | 9 | 5.96×10−5 |
| - | 8q24.21 | (128837336–128935399) | 7 | 6 | 1.04×10−4 |
| - | 9p24.1 | (5363441–5453988) | 12 | 5 | 0.02 |
| - | 12q23.3 | (107375486–107461372) | 13 | 7 | 5.78×10−5 |
| - | 13q14.3 | (53429288–53513774) | 12 | 4 | 0.02 |
| - | 14q11.2 | (22334110–22425388) | 13 | 2 | 0.14 |
| - | 15q14 | (34381353–34470568) | 13 | 5 | 2.04×10−3 |
| - | 15q21.1 | (46959609–47047893) | 18 | 2 | 0.23 |
| - | 19q13.42 | (59189856–59266559) | 9 | 1 | 0.47 |
| - | 22q12.3 | (35660028–35754794) | 19 | 5 | 0.03 |
|
| 1q25.2 | (179933906–180093734) | 17 | 2 | 0.31 |
|
| 3p24.2 | (24162088–24541232) | 54 | 15 | 2.29×10−5 |
|
| 3q26.2 | (170167538–171715102) | 13 | 2 | 0.16 |
|
| 7p22.2 | (3341374–4303003) | 90 | 5 | 0.61 |
|
| 8q24.21 | (128808953–129119976) | 34 | 7 | 0.02 |
|
| - | - | 197 | 19 | 4.65×10−3 |
|
| - | - | 470 | 32 | 0.04 |
Figure 2Regional association plots for 4 of the most consistent chromosome regions.
a. 3p24.2, b. 8q24.21, c. 14q11.2, d. 7p22.2. Consistency was based on haplotype, gene-, region- and pathway analysis. Each SNP is plotted with respect to its chromosomal location (x axis) and its log10 transformed P value (y axis on the left) for associations with MPM. The tall blue spikes indicate the recombination rate (y axis on the right) at that region of the chromosome. The red-outlined diamond indicate the index SNP and other diamond indicate the genotyped SNPs, the squares indicate imputed SNPs using as reference 1000 Genomes Pilot 1 CEU population. LD values were calculated only on our control population.
Nested multivariate logistic regression models: 1) model 1, without genetic component; 2) model 2, with genetic component.
| MODEL 1 | MODEL 2 | ||||||||
| OR | OR_95L | OR_95U | P | OR | OR_95L | OR_95U | P | GENETIC MODEL | |
| LOW vs NO EXPOSURE | 8.01 | 4.41 | 14.54 | 8.52×10−12 | 15.31 | 7.78 | 30.14 | 2.86×10−15 | - |
| HIGH vs NO EXPOSURE | 17.33 | 9.28 | 32.37 | <2×10−16 | 45.28 | 21.52 | 95.28 | <2×10−16 | - |
| CLUSTER 2 vs 1 | 1.76 | 1.1 | 2.79 | 1.74×10−02 | 2.21 | 1.29 | 3.79 | 4.09×10−03 | - |
| rs2501618 | - | - | - | - | 2.23 | 1.47 | 3.37 | 1.52×10−04 | dominant |
| rs9833191 | - | - | - | - | 0.55 | 0.41 | 0.73 | 4.39×10−05 | additive |
| rs7632718 | - | - | - | - | 1.85 | 1.41 | 2.42 | 9.07×10−06 | additive |
| rs4701085 | - | - | - | - | 2.05 | 1.41 | 2.97 | 1.75×10−04 | dominant |
| rs73034881 | - | - | - | - | 0.44 | 0.29 | 0.67 | 1.12×10−04 | additive |
| rs3801094 | - | - | - | - | 1.86 | 1.39 | 2.48 | 2.78×10−05 | additive |
| rs7841347 | - | - | - | - | 0.51 | 0.39 | 0.67 | 1.56×10−06 | additive |
| rs10815216 | - | - | - | - | 0.41 | 0.27 | 0.60 | 8.53×10−06 | dominant |
| rs2236304 | - | - | - | - | 1.72 | 1.19 | 2.51 | 4.39×10−03 | dominant |
| rs7178364 | - | - | - | - | 0.45 | 0.28 | 0.71 | 5.66×10−04 | dominant |
adjusted for age, gender and center of recruitment.
MODEL 1: AIC = 871.3, AUC = 0.76.
MODEL 2: AIC = 730.27, AUC = 0.86.
Figure 3Receiver Operating Curves (ROC) for the two multivariate models including asbestos exposure 1) without and 2) with the 10 most robust and significant genetic variants.
Regional replication of Italian top signals in the Australian study for 5 out of the 20 regions.
| Cytogenetic Band | BP_start | BP_end | p Binomial test | p Binomial test | Meta-analysis |
| 3q26.2 | 171668688 | 171738200 | 9.47338E-05 | 0.01643691 | 1.61×10−5 |
| 4q32.1 | 160680345 | 160763147 | 0.042137914 | 0.000649 | 3.15×10−4 |
| 7p22.2 | 4339181 | 4436371 | 5.95584E-05 | 0.01403811 | 1.26×10−5 |
| 14q11.2 | 22334110 | 22425388 | 0.139471486 | 0.00100497 | 1.38×10−3 |
| 15q14 | 34381353 | 34470568 | 0.002040183 | 0.01305659 | 3.07×10−4 |
(1-tailed binomial test and meta-analysis).
NCBI36/hg18.
Italian study.
Australian study.
Figure 4eQTL: PVT1 and MYC gene-expression levels in blood cells across rs78941347 genotypes.