| Literature DB >> 28787443 |
Abdou Mousas1,2, Georgios Ntritsos3, Ming-Huei Chen4, Ci Song4, Jennifer E Huffman4, Ioanna Tzoulaki3,5, Paul Elliott5, Bruce M Psaty6,7, Paul L Auer8, Andrew D Johnson4, Evangelos Evangelou3,5, Guillaume Lettre1,2, Alexander P Reiner9,10.
Abstract
The identification of rare coding or splice site variants remains the most straightforward strategy to link genes with human phenotypes. Here, we analyzed the association between 137,086 rare (minor allele frequency (MAF) <1%) coding or splice site variants and 15 hematological traits in up to 308,572 participants. We found 56 such rare coding or splice site variants at P<5x10-8, including 31 that are associated with a blood-cell phenotype for the first time. All but one of these 31 new independent variants map to loci previously implicated in hematopoiesis by genome-wide association studies (GWAS). This includes a rare splice acceptor variant (rs146597587, MAF = 0.5%) in interleukin 33 (IL33) associated with reduced eosinophil count (P = 2.4x10-23), and lower risk of asthma (P = 2.6x10-7, odds ratio [95% confidence interval] = 0.56 [0.45-0.70]) and allergic rhinitis (P = 4.2x10-4, odds ratio = 0.55 [0.39-0.76]). The single new locus identified in our study is defined by a rare p.Arg172Gly missense variant (rs145535174, MAF = 0.05%) in plasminogen (PLG) associated with increased platelet count (P = 6.8x10-9), and decreased D-dimer concentration (P = 0.018) and platelet reactivity (P<0.03). Finally, our results indicate that searching for rare coding or splice site variants in very large sample sizes can help prioritize causal genes at many GWAS loci associated with complex human diseases and traits.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28787443 PMCID: PMC5560754 DOI: 10.1371/journal.pgen.1006925
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
New rare coding or splice site variants associated with hematological traits.
Chromosome and genomic coordinates are on build hg19 of the human genome. Allele frequencies and directions of effect are for allele A2. Effect sizes (BETA and SE) are in standard deviation units. Phet is the Cochran’s Q test heterogeneity P-value. We provide P-values in the UK Biobank (UKBB) without (non-conditional) and with (conditional) correction for the known blood-cell variants [1]. Detailed results of the conditional analyses are in . Other phenotypes are hematological traits associated at P<5x10-8 in the meta-analyses. In the Comment column, we consider SNPs previously associated with blood-cell traits that are located 500 kb on either side of the reported rare variant. a, variants that are statistically dependent of known blood-cell trait variants; b, variants that are statistically independent from known blood-cell variants, but that are located at known blood-cell trait loci; c, new blood-cell trait locus.
| SNPID | CHR | Gene | Annotation (VEP) | A1/A2 | Freq (A2) | Main phenotype | BETA (SE) | UKBB | UKBB | Other phenotypes | Comment | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| rs148916169a | 1 (36932463) | missense | A/G | 0.0066 | WBC | 0.1308 (0.0181) | 4.62E-13 | 0.16 | 6.09E-07 | 0.61 | Neutro | Known WBC locus. | |
| rs138903557b | 2 (24245713) | missense | C/G | 0.0046 | MCV | -0.126 (0.0231) | 4.76E-08 | 0.48 | 2.70E-07 | 0.000032 | Locus associated with WBC and RBC traits. | ||
| rs147820690b | 2 (160735174) | missense | C/T | 0.0026 | PLT | 0.1654 (0.0287) | 8.19E-09 | 0.83 | 0.0043 | 0.00069 | Locus associated with PLT and WBC traits. | ||
| rs116274727b | 2 (192701265) | missense | T/C | 0.0061 | MPV | 0.1623 (0.0215) | 3.95E-14 | 0.97 | 7.77E-08 | 9.57E-12 | Known PLT locus. | ||
| rs28910273b | 3 (142188337) | missense | A/C | 0.0067 | MCV | -0.1318 (0.0185) | 1.02E-12 | 0.17 | 1.70E-08 | 3.74E-10 | MCH | Known RBC locus. | |
| rs77208665b | 3 (142274770) | missense | T/C | 0.0056 | MCV | -0.1171 (0.0208) | 1.91E-08 | 0.089 | 1.38E-05 | 5.16E-07 | |||
| rs151053159b | 5 (1078832) | missense | C/T | 0.0034 | RDW | -0.2013 (0.0278) | 4.77E-13 | 0.99 | 9.81E-10 | 0.0012 | Locus associated with WBC and RBC traits. | ||
| rs121434346a | 5 (1212453) | missense | G/A | 0.0042 | RDW | -0.1379 (0.0246) | 2.15E-08 | 0.65 | 2.06E-05 | 0.63 | Locus associated with WBC and RBC traits. | ||
| rs145535174c | 6 (161134124) | missense | A/G | 0.0005 | PLT | 0.34 (0.0587) | 6.83E-09 | 1.0 | 0.00017 | 3.62E-05 | Novel locus | ||
| rs74848966a | 7 (100365467) | missense | G/A | 0.0044 | MCH | 0.1522 (0.0263) | 7.02E-09 | 0.26 | 2.20E-06 | 0.27 | |||
| rs141547371a | 9 (214606) | missense | A/C | 0.0027 | MPV | 0.3022 (0.0479) | 2.87E-10 | 0.37 | 5.12E-08 | 0.012 | Known PLT locus. | ||
| rs146597587a | 9 (6255967) | splice acceptor | G/C | 0.0047 | Eosin | -0.2504 (0.0252) | 2.40E-23 | 0.12 | 1.02E-17 | 0.88 | Known WBC locus. | ||
| rs146879704b | 9 (114886569) | missense | C/T | 0.0037 | HGB | 0.1509 (0.0254) | 2.92E-09 | 0.16 | 1.87E-06 | 3.00E-05 | Known RBC locus. | ||
| rs141547732a | 9 (136280025) | missense | G/A | 0.0064 | RBC | 0.1195 (0.0195) | 8.47E-10 | 0.59 | 8.05E-07 | 0.40 | |||
| rs71508957a | 10 (64927837) | missense | C/T | 0.0085 | MPV | 0.1142 (0.0175) | 7.54E-11 | 0.19 | 4.14E-08 | 0.030 | PLT | Locus associated with WBC, PLT and RBC traits. | |
| rs61748606a | 11 (230474) | missense | G/T | 0.0060 | MPV | -0.1624 (0.0274) | 3.05E-09 | 0.32 | 3.74E-10 | 0.46 | Locus associated with WBC, PLT and RBC traits. | ||
| rs138326449b | 11 (116701354) | splice donor | G/A | 0.0020 | RDW | 0.2857 (0.0444) | 1.19E-10 | 0.018 | 4.48E-08 | 2.72E-09 | Known PLT locus. This rare variants is associated with triglyceride levels and coronary artery disease. | ||
| rs150349412a | 12 (112184086) | missense | G/A | 0.0027 | PLT | 0.176 (0.0318) | 3.01E-08 | 0.32 | 2.19E-06 | 0.65 | |||
| rs145120027a | 12 (122439451) | missense | T/C | 0.0011 | MPV | 0.3647 (0.0591) | 6.70E-10 | 0.039 | 2.02E-09 | 0.84 | Locus associated with PLT and RBC traits. | ||
| rs17881033a | 12 (122763670) | missense | C/A | 0.0091 | MPV | 0.1792 (0.0171) | 9.90E-26 | 0.087 | 2.44E-20 | 0.66 | |||
| rs151322438a | 12 (123335398) | missense | C/T | 0.0019 | MPV | 0.3258 (0.0407) | 1.16E-15 | 0.26 | 1.21E-13 | 0.20 | |||
| rs146030737a | 13 (28626716) | missense | C/T | 0.0057 | Mono | 0.1475 (0.0249) | 3.15E-09 | 0.67 | 3.81E-06 | 0.52 | Known WBC locus. | ||
| rs182782800a | 13 (73319139) | missense | G/A | 0.0038 | MCV | -0.1802 (0.024) | 5.50E-14 | 0.53 | 3.26E-09 | 0.012 | MCH | Known RBC locus. | |
| rs13888768b | 14 (103568488) | missense | G/A | 0.0023 | MPV | -0.2783 (0.0468) | 2.72E-09 | 0.23 | 6.37E-07 | 1.36E-10 | Known PLT locus. | ||
| rs148718670a | 14 (103574815) | missense | G/A | 0.0032 | MPV | -0.1677 (0.0295) | 1.28E-08 | 0.057 | 4.24E-06 | 0.0034 | |||
| rs184575290b | 15 (80191280) | missense | T/A | 0.0035 | Mono | 0.2164 (0.0302) | 7.28E-13 | 0.0048 | 1.57E-10 | 9.30E-09 | Known WBC locus. | ||
| rs57268939b | 16 (319547) | missense | T/C | 0.0005 | MCH | -0.5633 (0.0922) | 9.83E-10 | 0.034 | 6.05E-11 | 1.85E-09 | MCV | α-globin locus, previously associated with RBC traits. | |
| rs147810715b | 16 (30999491) | missense | G/C | 0.0004 | MCH | -0.4918 (0.0838) | 4.39E-09 | 0.63 | 3.22E-08 | 4.74E-08 | Known RBC locus. | ||
| rs35266519 a | 17 (38062390) | missense | C/T | 0.0089 | Neutro | -0.101 (0.0173) | 4.84E-09 | 0.61 | 3.18E-07 | 0.32 | Known WBC locus. | ||
| rs150420714b | 19 (50017538) | missense | G/C | 0.0083 | HCT | 0.1033 (0.0167) | 6.26E-10 | 0.43 | 9.77E-05 | 2.76E-05 | HGB, | Known RBC locus. | |
| rs201148397b | 22 (37482458) | missense | C/A | 0.0021 | MCH | 0.2542 (0.0371) | 7.12E-12 | 0.031 | 3.89E-09 | 8.86E-05 | MCV | Locus associated with PLT and RBC traits. |
Association results between PLG-rs145535174 and hemostatic and coagulation factors in the Framingham Heart Study.
The direction of the effects (BETA) is given for the rare G-allele (on forward strand). SE, standard error; ADP, Adenosine diphosphate; PAI-1, plasminogen activator inhibitor type 1; tPA, tissue plasminogen activator; FVII, factor VII; vWF, von Willebrand factor. We highlight in bold nominal P-values <0.05.
| Trait | Sample size | Freq | BETA (SE) | |
|---|---|---|---|---|
| Epinephrine 0.5uM Platelet Reactivity, % aggregation | 1877 (6) | 0.0016 | -1.092 (0.432) | |
| Epinephrine 1.0uM Platelet Reactivity, % aggregation | 2009 (7) | 0.0017 | -0.774 (0.402) | 0.054 |
| Epinephrine 3.0uM Platelet Reactivity, % aggregation | 1131 (6) | 0.0027 | -0.286(0.426) | 0.502 |
| Epinephrine EC50 Platelet Reactivity, | 2199 (8) | 0.0018 | 0.737 (0.381) | 0.053 |
| Collagen lag time, second | 2152 (8) | 0.0019 | 0.435 (0.383) | 0.256 |
| ADP 1.0uM Platelet Reactivity, % aggregation | 2392 (7) | 0.0015 | -0.920 (0.415) | |
| ADP 3.0uM Platelet Reactivity, % aggregation | 2555 (8) | 0.0016 | -0.451 (0.375) | 0.229 |
| ADP 5.0uM Platelet Reactivity, % aggregation | 1686 (7) | 0.0021 | -0.426 (0.408) | 0.297 |
| ADP EC50 Platelet Reactivity, | 2205 (8) | 0.0018 | 0.3818 (0.378) | 0.313 |
| PAI-1, ng/ml | 6441 (14) | 0.0011 | -0.339 (0.197) | 0.086 |
| tPA, ng/ml | 2613 (9) | 0.0017 | -0.116 (0.152) | 0.446 |
| D-dimer, ng/ml | 2874 (8) | 0.0005 | -0.479 (0.202) | |
| Fibrinogen, g/L | 6711 (16) | 0.0012 | -0.059 (0.047) | 0.207 |
| FVII, % antigen | 2620 (9) | 0.0017 | 2.261 (5.563) | 0.684 |
| vWF % antigen | 2621 (9) | 0.0017 | 21.187 (14.551) | 0.145 |
Association between IL33-rs146597587 and asthma, allergic rhinitis, or endometriosis risk in the UK Biobank.
All analyses were corrected for age, sex, and the first 10 principal components. The direction of the odds ratio is for the rare allele (rs146597587, C-allele on forward strand). The conditional P-values were calculated in a logistic model accounting for genotypes at common single nucleotide polymorphisms (SNPs) previously associated with asthma, allergic rhinitis, or endometriosis: rs343496, rs7032572, rs72699186, rs1342326, rs2381416, rs928413, rs10975519.
| Disease | Ncases | Nctrls | Freq | Odds ratio | 95% confidence interval | ||
|---|---|---|---|---|---|---|---|
| Asthma | 17,565 | 118,119 | 0.0048 | 0.56 | 0.45–0.70 | 2.60x10-7 | 1.22x10-6 |
| Allergic rhinitis | 7,918 | 127,766 | 0.0048 | 0.55 | 0.39–0.76 | 4.21x10-4 | 7.4x10-4 |
| Endometriosis | 1,589 | 134,095 | 0.0048 | 0.86 | 0.46–1.49 | 0.63 | 0.67 |
Pathway analyses using DAVID with genes that carry rare coding or splice site variants associated with blood-cell traits.
We only present biological pathways with a false discovery rate (FDR) <10%.
| Biological pathway or term | Fold Enrichment | P-value | FDR | Genes |
|---|---|---|---|---|
| Iron ion homeostasis | 121.3 | 0.00023 | 0.26 | |
| Cellular iron ion homeostasis | 85.3 | 0.00046 | 0.53 | |
| Ferrous iron import into cell | 1051.5 | 0.00175 | 1.99 | |
| Cellular response to iron ion | 525.8 | 0.00351 | 3.95 | |
| HFE-transferrin receptor complex | 279.3 | 0.00667 | 6.00 | |
| Disulfide bond | 3.2 | 0.00895 | 9.32 | |
| Cytokine-cytokine receptor interaction | 19.1 | 0.00040 | 0.28 | |
| Neutrophil chemotaxis | 99.1 | 0.00031 | 0.34 | |
| Cytokine-mediated signaling pathway | 41.1 | 0.00177 | 1.94 | |
| Integral component of plasma membrane | 6.6 | 0.00272 | 1.98 | |
| Plasma membrane | 3.3 | 0.00294 | 2.15 | |
| Receptor | 6.4 | 0.00292 | 2.63 | |
| Topological domain:Extracellular | 4.3 | 0.00316 | 2.72 | |
| Cell membrane | 4.0 | 0.00461 | 4.11 | |
| Disulfide bond | 3.7 | 0.00632 | 5.60 | |
| Membrane | 2.3 | 0.00732 | 6.46 | |
| Topological domain:Cytoplasmic | 3.5 | 0.00833 | 7.04 | |
| Transmembrane helix | 2.6 | 0.01030 | 8.99 | |
| Transmembrane | 2.6 | 0.01043 | 9.10 | |
| Blood coagulation | 26.4 | 0.00033 | 0.40 | |
| Erythrocytosis, somatic | 505.5 | 0.00329 | 2.10 | |
| Myelofibrosis, somatic | 337.0 | 0.00494 | 3.14 | |
| Pleckstrin homology-like domain | 9.4 | 0.00682 | 7.13 | |
| Phosphoprotein | 1.8 | 0.00723 | 7.31 | |