| Literature DB >> 22675575 |
Marine Germain1, Noémie Saut, Tiphaine Oudot-Mellakh, Luc Letenneur, Anne-Marie Dupuy, Marion Bertrand, Marie-Christine Alessi, Jean-Charles Lambert, Diana Zelenika, Joseph Emmerich, Laurence Tiret, Francois Cambien, Mark Lathrop, Philippe Amouyel, Pierre-Emmanuel Morange, David-Alexandre Trégouët.
Abstract
By applying an imputation strategy based on the 1000 Genomes project to two genome-wide association studies (GWAS), we detected a susceptibility locus for venous thrombosis on chromosome 11p11.2 that was missed by previous GWAS analyses that had been conducted on the same datasets. A comprehensive linkage disequilibrium and haplotype analysis of the whole locus where twelve SNPs exhibited association p-values lower than 2.23 10(-11) and the use of independent case-control samples demonstrated that the culprit variant was a rare variant located ~1 Mb away from the original hits, not tagged by current genome-wide genotyping arrays and even not well imputed in the original GWAS samples. This variant was in fact the rs1799963, also known as the FII G20210A prothrombin mutation. This work may be of major interest not only for its scientific impact but also for its methodological findings.Entities:
Mesh:
Year: 2012 PMID: 22675575 PMCID: PMC3366937 DOI: 10.1371/journal.pone.0038538
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Case-control samples available in this work.
(1) These individuals were typed with the Illumina Sentrix HumanHap300 beadchip containing 317,319 SNPs among which 291,872 satisfied the quality control (QC) criteria (Trégouët et al. (2009) Blood 113: 5298–5303). (2) These individuals were typed with the Illumina 610-Quad and Illumina 660W-Quad beadchips. Among the 551,141 SNPs common to both assays, 491,258 SNPs satisfied the QC criteria (Germain et al (2011) Plos One 6: e25581). (a) 812 VT patients of the MARTHA study were part of the GWAS(2) VT sample. (b) The FV Leiden and FII G20210A mutations were genotyped in the GWAS patients as part of the study design where patients homozygous for these mutations were excluded.
Genome-wide significant association (p<1.×10−8) observed at the 11p11.2 locus.
| GWAS from 2 | GWAS from 1 | Combined | ||||||||||||
| Locus | Position | SNP(a) | MAF(b) | logOR | se (LogOR) | P(c) | r2 (d) | MAF | logOR | se (LogOR) | P | r2 | P(e) | |
| MYBPC3 | 47373425 | rs2856656 | A/G | 0.032 | 1.598 | 0.257 | 2.76 10–12 | .8237 | 0.016 | 1.242 | 0.347 | 4.82 10−4 | .7624 | 1.03 10−12 |
| SPI1 | 47389771 | chr11:47389771 | C/G | 0.035 | 1.628 | 0.247 | 1.09 10−13 | .8371 | 0.018 | 1.044 | 0.328 | 1.92 10−3 | .7719 | 6.97 10−13 |
| CELF1 | 47565138 | chr11:47565138 | T/C | 0.031 | 1.748 | 0.27 | 5.13 10−13 | .7389 | 0.015 | 1.075 | 0.367 | 4.21 10−3 | .7162 | 3.64 10−12 |
| 47574081 | rs60206633 | C/G | 0.031 | 1.73 | 0.267 | 5.47 10−13 | .7444 | 0.016 | 1.061 | 0.364 | 4.41 10−3 | .7211 | 3.69 10−12 | |
| KBTBD4 | 47598481 | chr11:47598481 | G/A | 0.031 | 1.735 | 0.268 | 4.30 10−13 | .7584 | 0.016 | 0.964 | 0.368 | 1.06 10−2 | .7286 | 1.24 10−11 |
| FNBP4 | 47764906 | rs78261087 | G/A | 0.034 | 1.539 | 0.24 | 1.08 10−12 | .8358 | 0.017 | 0.864 | 0.339 | 1.29 10−2 | .7912 | 1.99 10−11 |
| 47783614 | chr11:47783614 | T/C | 0.034 | 1.528 | 0.239 | 1.11 10−12 | .8417 | 0.017 | 0.858 | 0.336 | 1.30 10−2 | .7957 | 2.23 10−11 | |
| NUP160 | 47818310 | rs58883118 | C/T | 0.035 | 1.526 | 0.237 | 8.22 10−13 | .8520 | 0.017 | 0.86 | 0.332 | 1.18 10−2 | .8166 | 1.52 10−11 |
| 47920993 | chr11:47920993 | G/T | 0.034 | 1.411 | 0.232 | 2.06 10−11 | .8796 | 0.016 | 1.048 | 0.333 | 2.19 10−3 | .8525 | 1.13 10−11 | |
| 47947894 | chr11:47947894 | C/T | 0.022 | 2.153 | 0.351 | 4.27 10−12 | .6464 | 0.011 | 1.45 | 0.465 | 2.42 10−3 | .6459 | 1.25 10−11 | |
| PTPRJ | 48052066 | rs117784795 | C/T | 0.039 | 1.315 | 0.207 | 2.67 10−12 | .9417 | 0.020 | 0.775 | 0.29 | 9.28 10−3 | .8986 | 1.78 10−11 |
| 48064194 | chr11:48064194 | G/A | 0.038 | 1.311 | 0.207 | 2.94 10−12 | .9746 | 0.020 | 0.856 | 0.296 | 4.99 10−3 | .9286 | 7.52 10−12 | |
(a) SNPs for which no rsID has yet been allocated are named according to their position on chromosome 11.
(b) Minor Allele Frequency.
(c) P-value of the association between imputed SNPs and VT risk, after adjusting for principal components.
(d) Imputation quality criterion (r2).
(e) Combined meta-analysis p-value obtained using the Mantel-Haenszel inverse-variance weighting method.
Figure 2Pairwise linkage disequilibrium r2 between genotyped SNPs at the 11p11.2 locus over the 47,373,425–48,064,194 bp region in the second GWAS study (Germain et al. Plos One 2011).
Association of haplotypes derived from rs2856650, rs3740689 and rs10769258 with VT risk in two GWAS.
| Polymorphisms | GWAS from | GWAS from | ||||||
| Haplotype Frequencies | OR [95%CI] | Haplotype Frequencies | OR [95%CI] | |||||
| rs2856650 | rs3740689 | rs10769258 | Controls (n = 1002) | Cases (n = 1542) | Controls (n = 1218) | Cases (n = 411) | ||
| G | G | T | 0.356 | 0.329 | reference | 0.332 | 0.335 | reference |
| G | A | T | 0.282 | 0.283 | 1.07 [0.93–1.24] p = 0.316 | 0.298 | 0.301 | 0.99 [0.82–1.22] p = 0.983 |
| G | A | C | 0.049 | 0.041 | 0.91 [0.68–1.21] p = 0.521 | 0.046 | 0.037 | 0.79 [0.51–1.23] p = 0.292 |
| A | G | T | 0.021 | 0.058 | 2.99 [2.02–4.44] p = 4.23 10−8 | 0.017 | 0.040 | 2.37 [1.36–4.15] p = 2.39 10−3 |
| A | A | C | 0.260 | 0.260 | 1.07 [0.93–1.24] p = 0.341 | 0.275 | 0.261 | 0.94 [0.76–1.15] p = 0.545 |
| A | A | T | 0.019 | 0.017 | 0.99 [0.59–1.66] p = 0.964 | 0.019 | 0.013 | 0.65 [0.27–1.57] p = 0.335 |
| Global test of haplotypic association | χ 2 = 40.38 with 5 df p = 1.25 10−7 | χ2 = 11.24 with 5 df p = 0.046 | ||||||
The meta-analysis of the AGT haplotype-associated ORs obtained in the two GWAS samples lead to an overall OR of 2.78 [2.01–3.81] (p = 4.72 10−10). In a combined analysis of the individual-level genotype data of the two GWAS, the AGT haplotype frequency was estimated to 0.054 and 0.019 in cases and controls, respectively. This led to a combined OR of 3.03 [2.23–4.10] (p = 8.96 10−13) compared to the most frequent GGT haplotype (with estimated frequency 0.330 and 0.343 in cases and controls, respectively).
Association of MYPBC3 rs2856656 and CELF1 rs60206633 with VT in the FARIVE study.
| rs2856656 | Controls | Cases | rs60206633 | Controls | Cases |
| AA | 510 (93%) | 487 (88%) | CC | 529 (94%) | 527 (92%) |
| AG | 38 (7%) | 66 (12%) | CG | 30 (6%) | 46 (8%) |
| GG | 2 (<1%) | 2 (<1%) | GG | 1 (<1%) | 2 (<1%) |
| MAF | 0.038 | 0.063 | 0.029 | 0.044 | |
| Cochran-Armitage Trend test | p = 0.008 | p = 0.062 | |||
Minor Allele Frequencies.
Genotype distribution of the MYPBC3 rs2856656 according to VT status and carrier-ship of F5/F2 mutations in the MARTHA and FARIVE studies.
| MARTHA Controls | MARTHA VT Cases | |||||
| rs2856656 | Non carriers | FV Leiden carriers | FII G20210A carriers | Non carriers | FV Leiden carrier | FII G20210A carriers |
| AA | 458 (97%) | 174 (99%) | 49 (35%) | 563 (99%) | 377 (98%) | 53 (33%) |
| AG | 13 (3%) | 2 (1%) | 91 (65%) | 6 (1%) | 8 (2%) | 106 (67%) |
| GG | 1 (<1%) | 0 (-) | 1 (<1%) | 1 (<1%) | 0 (-) | 0 (-) |
| MAF | 0.016 | 0.006 | 0.333 | 0.007 | 0.010 | 0.333 |
Minor Allele Frequencies.
Association of the imputed rs2856656 with VT risk in two GWAS datasets.
| Allelic Odds Ratio [95% Confidence Interval | GWAS from (1) 419 cases/1,228 controls | GWAS from (2) 1,542 cases/1,110 controls |
| Crude | 3.462 [1.754–6.835] p = 4.82 10−4 | 4.943 [2.987–8.180] p = 2.76 10−12 |
| Adjusted | ||
| - for rs1799963 imputed in controls and cases | 3.211 [1.632–6.313] p = 7.22 10−4 | 5.469 [3.348–8.932] p = 1.13 10−11 |
| - for rs1799963 imputed in controls and genotyped in cases | 1.264 [0.469–3.402] p = 0.643 | 1.732 [0.864–3.474] p = 0.122 |
Figure 3Box-Plot representation of the imputed dose at rs1799963 (FII G20210A) according to measured genotypes in a sample of 1,961 VT cases.
The imputed dose in the 419 VT cases of the first GWAS is shown in blue while results obtained in the 1,542 VT part of the second GWAS are shown in red.
Figure 4Pairwise linkage disequilibrium r2 between genotyped SNPs mapping from F2 to PTPRJ genes at the 11p11.2 locus in a sample of 1,542 VT cases.
Figure 5Box-Plot representation of the imputation quality (r2) according to the minor allele frequency of the SNPs inferred from 1000G 2010-08 release.
Box-plot derived from the imputation analysis of the largest GWAS (2).