Literature DB >> 28787443

Rare coding variants pinpoint genes that control human hematological traits.

Abdou Mousas1,2, Georgios Ntritsos3, Ming-Huei Chen4, Ci Song4, Jennifer E Huffman4, Ioanna Tzoulaki3,5, Paul Elliott5, Bruce M Psaty6,7, Paul L Auer8, Andrew D Johnson4, Evangelos Evangelou3,5, Guillaume Lettre1,2, Alexander P Reiner9,10.   

Abstract

The identification of rare coding or splice site variants remains the most straightforward strategy to link genes with human phenotypes. Here, we analyzed the association between 137,086 rare (minor allele frequency (MAF) <1%) coding or splice site variants and 15 hematological traits in up to 308,572 participants. We found 56 such rare coding or splice site variants at P<5x10-8, including 31 that are associated with a blood-cell phenotype for the first time. All but one of these 31 new independent variants map to loci previously implicated in hematopoiesis by genome-wide association studies (GWAS). This includes a rare splice acceptor variant (rs146597587, MAF = 0.5%) in interleukin 33 (IL33) associated with reduced eosinophil count (P = 2.4x10-23), and lower risk of asthma (P = 2.6x10-7, odds ratio [95% confidence interval] = 0.56 [0.45-0.70]) and allergic rhinitis (P = 4.2x10-4, odds ratio = 0.55 [0.39-0.76]). The single new locus identified in our study is defined by a rare p.Arg172Gly missense variant (rs145535174, MAF = 0.05%) in plasminogen (PLG) associated with increased platelet count (P = 6.8x10-9), and decreased D-dimer concentration (P = 0.018) and platelet reactivity (P<0.03). Finally, our results indicate that searching for rare coding or splice site variants in very large sample sizes can help prioritize causal genes at many GWAS loci associated with complex human diseases and traits.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28787443      PMCID: PMC5560754          DOI: 10.1371/journal.pgen.1006925

Source DB:  PubMed          Journal:  PLoS Genet        ISSN: 1553-7390            Impact factor:   5.917


Introduction

Although genome-wide association studies (GWAS) have identified thousands of associations with the number or distributional characteristics of red blood cells, white blood cells, and platelets [1], the genes responsible for this phenotypic variation remain elusive at most of these loci. For the most part, the difficulty in connecting association signals with genes resides in the fact that most GWAS variants are non-coding and in linkage disequilibrium (LD) with many other variants. This problem is not specific to hematological traits, but rather a general bottleneck in the functional characterization and clinical translation of most GWAS findings for common human diseases. Causal gene identification is important to shed light onto disease pathophysiology, but also to develop new, genetically-guided, therapeutic targets. The discovery of rare variants that alter amino acid sequence or splicing is a strong indication that the mutated genes are involved in the phenotypes of interest. This approach is the cornerstone of Mendelian genetics. For complex human phenotypes, the discovery of rare coding variants associated with blood lipid levels [2, 3], coronary artery disease [4, 5], or more recently height [6] has helped prioritize causal genes and identify new drug targets (e.g. PCSK9 for atherosclerosis). For blood-cell related phenotypes, the identification of rare missense variants in the chemokine receptor gene CXCR2 [7] or the sphingosine-1 phosphate signalling gene S1PR4 [8] has highlighted the importance that these biological pathways play in neutrophil count variation in humans. Although extremely useful tools to pinpoint causal genes, the discovery of rare variants of modest effect sizes have previously been limited because they require very large sample sizes. Here, we took advantage of genotype data at 137,086 rare (minor allele frequency (MAF) <1%) variants in 308,572 participants to find new genes implicated in human hematopoiesis. We identified 56 rare coding variants, including 31 new variants, which highlight specific genes that contribute to inter-individual variation of red blood cell (RBC), white blood cells (WBC), and platelet (PLT) traits.

Results

31 new rare coding or splice site variants associated with blood-cell traits

In an effort to identify rare coding (defined in this study as missense or nonsense, excluding synonymous) or splice site variants that could implicate new genes in human hematopoiesis, we performed meta-analyses at 137,086 variants for 15 blood-cell traits in 308,572 participants (). These analyses found 81 missense or splice site variants at P<5x10-8 with a minor allele frequency (MAF) <1% across the tested studies. We excluded 25 variants because they were not present in phase 1 of the Blood-Cell Consortium (BCX1) dataset (N = 2), the direction of effect was discordant across studies (N = 11), or they mapped to the HLA region (N = 15)(). As positive controls to validate our approach, 25 rare coding or splice site variants previously associated with hematological traits were genome-wide significant in the current analyses () [1, 8–11]. In addition, we found 31 new rare coding or splice site variants in 29 different genes associated with blood-cell trait variation ( and ). The two variants in ATR are in strong linkage disequilibrium (LD), as are the two variants in EXOC3L4 (D′>0.8).

New rare coding or splice site variants associated with hematological traits.

Chromosome and genomic coordinates are on build hg19 of the human genome. Allele frequencies and directions of effect are for allele A2. Effect sizes (BETA and SE) are in standard deviation units. Phet is the Cochran’s Q test heterogeneity P-value. We provide P-values in the UK Biobank (UKBB) without (non-conditional) and with (conditional) correction for the known blood-cell variants [1]. Detailed results of the conditional analyses are in . Other phenotypes are hematological traits associated at P<5x10-8 in the meta-analyses. In the Comment column, we consider SNPs previously associated with blood-cell traits that are located 500 kb on either side of the reported rare variant. a, variants that are statistically dependent of known blood-cell trait variants; b, variants that are statistically independent from known blood-cell variants, but that are located at known blood-cell trait loci; c, new blood-cell trait locus. All 31 variants are novel discoveries because they were not identified in previous large-scale association studies for blood-cell traits. However, although the variants are new, many of them map to known blood-cell trait loci. Indeed, using conditional analyses in the UK Biobank, we could sub-divide these 31 variants into three categories: 16 variants are in LD with markers previously reported to associate with hematological traits (Pcond >0.002 in and ), 14 variants are not in LD with known blood-cell variants, but are located within 500 kb of one of these previously reported variants, and one variant (PLG-rs145535174) defines a new locus associated with PLT count variation in humans.

A rare missense variant in plasminogen associates with platelet count, D-dimer concentration, and platelet reactivity

Plasminogen (PLG) encodes the precursor of plasmin, an important proteolytic enzyme that degrades many plasma proteins and fibrin blood clots. The rare G-allele at this new PLG variant (rs145535174, MAF = 0.05%) is associated with increased PLT count (+20x109 platelets/l, P = 6.8x10-9). It replaces an arginine residue at position 172 into a glycine, an amino acid change that is predicted to be probably damaging by Polyphen and deleterious by SIFT [12, 13]. This genomic position is highly conserved, as evidenced by a GERP score >200 and a CADD score of 23.8, ranking this particular amino acid change in PLG among the top 1% of the most deleterious substitutions in the human genome [14]. Given the role of plasmin in hemostasis, we tested the association between rs145535174 and several hemostatic and coagulation factors in the FHS. Although we only found a maximum of 16 carriers of the rare allele (varying depending on overlapping phenotypes), we could detect nominal associations between rs145535174-G and decreased D-dimer concentration and platelet reactivity (). Because D-dimer is a by-product of fibrin clot degradation by plasmin, this result is consistent with rs145535174-G being a PLG loss-of-function variant. Since plasminogen/plasmin activity also plays a role in thrombotic diseases, we tested if the G-allele at PLG-rs145535174 increases the risk of myocardial infarction (MI), stroke, or venous thromboembolism (VTE) in the UK Biobank (). This variant was not associated with MI or stroke risk (P>0.05). However, there was an association with increased VTE risk in the UK Biobank (P = 0.0087, odds ratio = 4.01), although it was not replicated in the Montreal Heart Institute Biobank and the Women’s Health Initiative (P>0.05, ). The absence of genetic associations between PLG-rs145535174 and thrombotic events could simply reflect our limited statistical power given the rarity of the G-allele.

Association results between PLG-rs145535174 and hemostatic and coagulation factors in the Framingham Heart Study.

The direction of the effects (BETA) is given for the rare G-allele (on forward strand). SE, standard error; ADP, Adenosine diphosphate; PAI-1, plasminogen activator inhibitor type 1; tPA, tissue plasminogen activator; FVII, factor VII; vWF, von Willebrand factor. We highlight in bold nominal P-values <0.05.

Blood-cell trait-associated rare variants prioritize genes at GWAS loci

When we considered genes with known hematopoietic functions, we found little evidence that the rare coding or splice site variants found here implicated different potentially causal genes than candidate genes from previous blood-cell trait GWAS (). In other words, we have, with one exception, no loci with two strong blood-cell trait candidate genes. On chromosome 11, we found an association between mean platelet volume (MPV) and a rare coding variant in SIRT3, a gene implicated in platelet biology [15]. At the same locus, Gieger et al. had nominated PSMD13 as a candidate platelet gene on the basis of reduction in plasmatocyte numbers in Drosophila [16]. We also noted few examples where rare coding or splice site variants implicated the same genes than those prioritized by GWAS SNPs due to expression quantitative trait loci effects (e.g. SLC12A7, FLT3, TMPRSS6)(). Conditional analyses identified 16 rare variants in LD with previously reported blood-cell variants (). Despite this dependence due to LD, at some of these loci, we may have identified a rare variant with even greater biologic plausibility for the blood-cell trait association than the previously identified LD surrogate. Indeed, whereas all 16 variants from our study are coding or splice site variants, most LD surrogate variants are non-coding variants (). To clarify whether the new () or previously known [1] blood-cell trait variants are more likely causal variants, we performed the reciprocal conditional analyses in which we conditioned the previously known blood-cell trait variants on genotypes at the newly described 31 rare coding or splice site variants (). Comparison of the conditional results ( and ) highlighted a few loci where the common variants are better causal candidates (e.g. WDR66, CLIP1, HIP1R). There were also a few loci where the new rare coding or splice site variants were statistically equivalent to rare non-coding variants previously reported (e.g. IL33, FLT3, BORA). We did not find examples of rare variants completely explaining the association signals at known common variants, as we would expect given limited LD. On chromosome 1, we found a rare missense variant in CSF3R (rs148916169, MAF = 0.7%) associated with total WBC and neutrophil counts. Although this variant is correlated with common and low-frequency intronic variants (), it is the first missense variant to directly implicate CSF3R in WBC count variation in the general population. CSF3R encodes the receptor for CSF3, a cytokine that controls the proliferation and differentiation of granulocytes, and is also mutated in a severe form of congenital neutropenia (MIM# 617014) [17]. Although rare loss-of-function mutations in CSF3R are associated with reduced neutrophil levels, the rare CSF3R missense variant found in our experiment might represent a gain-of-function since its rare allele is associated with higher WBC and neutrophil counts. Another compelling example is the identification of a rare splice acceptor variant in IL33 (rs146597587, MAF = 0.5%) associated with reduced eosinophil count (). This likely functional IL33 variant maps to a locus with three non-coding variants, including a rare intergenic variant, associated with WBC traits () [1]. Interestingly, GWAS have associated IL33 common non-coding variants with risk for asthma [18], allergic rhinitis [19], and endometriosis [20]. In the UK Biobank, we found that the rare C-allele at IL33-rs146597587 associated with reduced eosinophil count is also associated with lower risk of asthma (P = 2.60x10-7, odds ratio = 0.56) and allergic rhinitis (P = 4.21x10-4, odds ratio = 0.55) (). These associations remained significant after accounting for the known GWAS variants previously identified at this locus (). Our finding strongly reinforces the clinical importance that eosinophils play in the pathophysiology of these diseases, and highlight the IL33 pathway as a promising therapeutic target. However, in a mediation analysis in the UK Biobank in which we corrected for eosinophil count, we found that the protective effect of IL33-rs146597587 was weaker but remained significant (Pmediation = 3.4x10-5). This suggests that IL33 may influence asthma risk in part by controlling blood eosinophil count, but also through currently unknown additional biological pathways. Finally, we note that the same rare IL33 variant was recently shown to associate with reduced eosinophil count and protection from asthma in the Icelandic population [21]. Because there is no sample overlap, our study represents a strong replication of this original report: in a meta-analysis of both studies, association results now reach genome-wide significance (in 24,030 asthma cases and 421,096 controls, P = 1.4x10-10, odds ratio = 0.54).

Association between IL33-rs146597587 and asthma, allergic rhinitis, or endometriosis risk in the UK Biobank.

All analyses were corrected for age, sex, and the first 10 principal components. The direction of the odds ratio is for the rare allele (rs146597587, C-allele on forward strand). The conditional P-values were calculated in a logistic model accounting for genotypes at common single nucleotide polymorphisms (SNPs) previously associated with asthma, allergic rhinitis, or endometriosis: rs343496, rs7032572, rs72699186, rs1342326, rs2381416, rs928413, rs10975519. We found 14 rare coding or splice site variants that appear to be associated with blood-cell traits independently of other common and/or rare variants previously identified ( and ). Although there are a few well-known blood-cell-related genes, such as SLC12A7 and TMPRSS6, most of the highlighted genes do not have assigned functions in hematopoiesis (e.g. MFSD2B, SDPR, C9orf66, SUSD1, EXOC3L4)(). We found two correlated rare missense variants in ATR (rs28910273, MAF = 0.6%; rs77208665, MAF = 0.7%), which encodes a checkpoint protein that coordinates cellular responses to replication stress and DNA damage. ATR is mutated in Seckel syndrome 1 (MIM #210600), which is characterized by hematological defects in some patients [22]. We also uncovered a rare splice acceptor variant in the apolipoprotein gene APOC3 (rs138326449, MAF = 0.2%) associated with RDW, a RBC trait that is considered a non-specific inflammatory marker and a predictor of cardiovascular diseases [23]. The same APOC3 variant has previously been associated with lower triglyceride levels and coronary artery disease risk [5, 24].

Biological pathway analyses with blood-cell trait genes

We performed three biological pathway-enrichment analyses using 58 genes that harbour a rare coding or splice site variants associated with RBC, WBC, or PLT traits ( and ). In total, we found six, 13, and five biological pathways or terms enriched for genes with variants associated with RBC, WBC, and PLT, respectively, at a false discovery rate <10% (). Most of these pathways were highly redundant in terms of gene content. For instance, five of the six RBC-enriched pathways are involved in iron homeostasis. For WBC traits, there were many significant inter-connected biological pathways that implicated six genes: CSF3R, CXCR2, S1PR4, IL17RA, AMICA1/JAML, and FLT3 (). These genes highlight the importance that genetic variation plays on cytokine signalling and chemotaxis to modulate circulating numbers of immune cells in the blood.

Pathway analyses using DAVID with genes that carry rare coding or splice site variants associated with blood-cell traits.

We only present biological pathways with a false discovery rate (FDR) <10%.

Discussion

Using a large sample of >300,000 participants, we identified 31 new missense or splice site variants associated with blood-cell traits. These include a rare p.Arg172Gly missense of plasminogen associated with higher platelet count, lower D-dimer concentration and lower platelet reactivity and a rare splice acceptor variant of IL33 associated with lower eosinophil count and lower risk of asthma and allergic rhinitis. At other genomic loci, our findings may prioritize the causal gene(s) that can be pursued with molecular techniques to further dissect and define the mechanism of association. Because phenotypic variance explained is determined by the effect size and allele frequency of the variants, the rare coding and splice site variants identified here do not have a large contribution to the heritability of blood-cell traits. For instance, the rare IL33-rs146597587 variant explains only 0.06% of the variation in eosinophil count. The chromosome 2p23 region spans several genes and contains six common, non-coding variants associated previously with WBC or RBC traits [1]. We identified a new rare coding variant in this region associated with lower MCV located in MFSD2B, a transporter gene of unknown function that is highly and specifically expressed in blood and bone marrow cells, particularly of the erythroid lineage [25]. As another example, at the LY75-CD302 locus, four common non-coding variants and a low-frequency missense variant have been associated with PLT and WBC traits [10]. A second novel rare LY75-CD302 missense variant was associated with higher PLT count. The LY75-CD302 locus represents a naturally occurring read-through transcript between the lymphocyte antigen 75 (LY75) and CD302 genes, and is expressed in leukocyte and hematopoietic stem and progenitor cells [25]. Alternative splicing results in fusion LY75-CD302 gene products that are expressed during dendritic cell maturation [26] and Hodgkin’s lymphoma cell lines [27]. Another example of a potential read-through transcript involves a gene-rich region on chromosome 15, where several common variants were recently associated with WBC traits [1]. At this locus, we discovered a rare coding variant (rs184575290) located near the ST20 termination codon associated with higher monocyte count. This locus represents a naturally occurring read-through transcript that produces a fusion protein between ST20 and the neighboring MTHFS gene. ST20 is highly expressed in myeloid cells, while the fusion product is expressed at lower levels in blood neutrophils and dendritic cells of the skin [25]. We found a rare missense variant within SDPR, a phosphatidylserine-binding protein originally isolated from human platelets [28], that may be involved in modulating activation of the platelet protein kinase C substrate pleckstrin and mediating the downstream effects of platelet granule secretion [29]. Common missense variants of EXOC3L4, which has no known function, have been previously associated with both platelet and liver enzyme traits [30]. In progenitor blood-cell expression data from the BLUEPRINT Project [31], there is a large increase in EXOC3L4 expression in megakaryocyte progenitors. Several rare, non-coding variants at the chromosome 13q21 region have been associated with RBC traits [1]. We identified a rare coding variant of BORA, an activator of the protein kinase Aurora A involved in centrosome maturation and spindle assembly during mitosis and whose expression pattern during hematopoiesis increases in RBC and PLT precursors [31]. Matriptase-2, encoded by TMPRSS6, has recently emerged through both complex trait and Mendelian disorders as an important genetic regulator of RBC and iron metabolism [32]. We report here the first rare coding variant of TMPRSS6 (p.V280L/p.V289L) associated with RBC traits in individuals unselected for hematologic disease. Other TMPRSS6 coding variants have been reported in patients with familial iron-refractory iron-deficiency anemia (IRIDA), a rare autosomal-recessive disorder characterized by hypochromic microcytic anemia and impaired iron balance [33]. Plasminogen plays an important role in fibrinolysis as well as wound healing, cell migration, and tissue re-modeling. Congenital plasminogen deficiency is a rare autosomal recessive disorder [34]. Severely affected individuals (type I plasminogen deficiency) can exhibit defective extracellular fibrin clearance during wound healing, leading to “ligneous conjunctivitis'' or thick, pseudomembranes on conjunctival and other mucosal surfaces. The p.Arg172Gly variant of plasminogen associated with higher PLT count is located in the first kringle domain, which mediates binding of plasminogen to fibrin and cell surfaces [35]. The observed association of the PLG p.Arg172Gly variant with lower D-dimer is consistent with reduced fibrinolysis, which might result from reduced circulating plasminogen, plasmin activity, or substrate binding. Overall, we did not observe an association between PLG p.Arg172Gly and risk of thrombotic disease. Similarly, mutations associated with congenital plasminogen deficiency do not appear to be a strong risk factor for VTE [36-38]. The reasons for the association of the PLG p.Arg172Gly variant with higher PLT count and lower platelet reactivity are not readily apparent. In vitro, plasmin has been reported to proteolytically inactivate thrombopoietin [39]. Plasmin also is capable of activating platelets through protease-activated receptors (PAR)-1 and -4 [40, 41]. Thus, it is possible that the PLG loss-of-function variant may influence thrombopoietin-induced platelet production, or PAR-induced platelet aggregation, respectively. Finally, it is worth noting that moderately reduced platelet count is a feature of a congenital platelet disorder characterized by a gain-of-function fibrinolysis defect due to increased expression and storage of urokinase plasminogen activator (PLAU) during megakaryocyte differentiation [42]. In conclusion, focusing on 137,086 rare coding or splice site variants in 308,572 participants, we discovered 31 new variants associated with blood-cell traits. Thirty of these 31 variants map to loci previously implicated by GWAS for hematological phenotypes. Because we used the ExomeChip or the UK Biobank array, there is an enrichment of GWAS signals among the loci tested, although many of the loci identified here did not harbour blood-cell trait genetic associations at the time the arrays were designed. As discussed, many of these associations prioritize strong candidate genes at these loci. Our study was limited by the content of the genotyping arrays utilized. As we expand our genetic experiments to complete genome sequence in very large cohorts, we anticipate that we will uncover more rare coding variants and, maybe more importantly for gene identification within GWAS loci, additional independent variants in the same candidate genes. Together with a recent report for adult height [6], our findings reinforce the importance that rare variants play in the architecture of complex human phenotypes.

Methods

Ethical statement

Written, informed consent was obtained for all participants in accordance with the Declaration of Helsinki. This project was also reviewed and approved by the Montreal Heart Institute Ethics Committee and the different recruiting centers (approval number 2014–1707).

Statistical analyses

Single variant association results considered in this effort were obtained from participants of European ancestry using an additive genetic model. All phenotypes were corrected for confounding factors (see below) and normalized using inverse normal transformation. The final analyses included samples from BCX1 (up to 157,622 participants), AIRWAVE (up to 14,866), and the first release of the UK Biobank (up to 136,084). Genotypes for BCX1 and AIRWAVE were obtained from the Illumina ExomeChip array; the content is available at: http://genome.sph.umich.edu/wiki/Exome_Chip_Design. The ExomeChip includes ~250,000 exonic variants obtained from whole-exome sequencing of ~12,000 participants. The UK Biobank samples were genotyped on the UK Biobank Axiom array; the content of the arrays is available at: http://www.ukbiobank.ac.uk/scientists-3/uk-biobank-axiom-array/. Whereas the Axiom array targets >800,000 variants, we only analyzed exonic variants that overlap with the content of the ExomeChip. Phenotypic information, and ExomeChip quality-control steps and association results from cohorts involved in the BCX1 Consortium and AIRWAVE have been described elsewhere [10, 11, 43]. Briefly, we excluded variants with low genotyping success rate (<95%, except for WHI that used a cutoff <90%). Samples with call rate <95% after joint or zCALL calling and with outlying heterozygosity rate were also excluded. Other exclusions were deviation from Hardy-Weinberg equilibrium (P<1x10-6) and gender mismatch. We performed principal component analysis (PCA) or multidimensional scaling (MDS) and excluded sample outliers from the resulting plots, using populations from the 1000 Genomes Project to anchor these analyses. Prior to performing meta-analyses of the association results provided by each participating study, we ran the EasyQC protocol [44] to check for population allele frequency deviations and proper trait transformation in each cohort. In terms of hematological phenotypes, we excluded individuals with blood cancer, leukemia, lymphoma, bone marrow transplant, congenital or hereditary anemia, HIV, end-stage kidney disease, dialysis, splenectomy, and cirrhosis, and those with extreme measurements of RBC traits. We also excluded individuals on erythropoietin treatment or chemotherapy. Additionally, we excluded pregnant women and individuals with acute medical illness at the time the complete blood count (CBC) was done. For all traits, within each study, we adjusted for age, age-squared, gender, the first 10 principal components, and, where applicable, other study-specific covariates such as study center using a linear regression model. Within each study, we then applied inverse normal transformation on the residuals and tested the phenotypes for association with the ExomeChip variants using either RVtests (version 20140416) [45] or RAREMETALWORKER.0.4.9 [46]. A description of methods and quality-control procedures for the blood-cell data for the first release of the UK Biobank can also be found elsewhere [1]. Description of the exome-component of the genotyping arrays used for the UK Biobank samples can be found at: http://www.ukbiobank.ac.uk/scientists-3/uk-biobank-axiom-array/. In the UK Biobank, we modelled blood-cell traits using the following covariates: age, sex, menopause status for women, assessment centre where the blood samples were collected, machine counter that processed the blood samples, month, day of the week, time inside the day that the samples were collected, self-reported ethnic background of the individuals, smoking status and smoking frequency, alcohol drinker status, alcohol intake frequency, height and weight. In the first release of the UK Biobank, we tested the association between directly genotyped or imputed variants and normalized hematological traits with BOLT-LMM [47]. We meta-analyzed results from BCX1, AIRWAVE and the UK Biobank using inverse-variance weighting as implemented in METAL [48]. We limited our analyses to variants with a mean minor allele frequency (MAF) <1% in the meta-analyses that are annotated as coding (missense or nonsense) or splice site (acceptor or donor) using ENSEMBL’s Variant Effect Predictor (VEP). Furthermore, the variants had to be present on the Illumina exome array used by the BCX1 studies. In total, we tested 137,086 variants against 15 blood-cell traits. These phenotypes are divided between the main three cell types found in blood: red blood cells (red blood cell count (RBC count), hemoglobin concentration (HGB), hematocrit (HCT), mean corpuscular hemoglobin (MCH), mean corpuscular volume (MCV), mean corpuscular hemoglobin concentration (MCHC), and RBC distribution width (RDW)), white blood cells (total white blood cell count (WBC count), neutrophil count (Neutro), lymphocyte count (Lympho), monocyte count (Mono), basophil count (Baso), and eosinophil count (Eosin)), and platelets (platelet count (PLT count) and mean platelet volume (MPV)). The meta-analysis results are available at: http://www.mhi-humangenetics.org/en/resources. We used a conservative α = 5x10-8 to declare statistical significance. At this statistical threshold, our sample size (N = 308,572) provides >99% power to discover variants with MAF = 0.1% that explain >0.03% of the phenotypic variance.

Conditional analyses

To test whether the 31 rare variants newly identified in the meta-analyses are associated with blood-cell traits independently of other known genetic loci, we regressed out the effect of the known blood-cell variants previously identified in the first release of the UK Biobank [1] from the normalized blood-cell phenotypes. All these analyses were done per phenotype; that is we fitted 15 different models for each of the 15 blood-cell phenotypes tested. We then re-tested in the UK Biobank (using linear regression implemented in R) the association between the “residual” blood-cell phenotypes and genotypes at the rare variants identified in the meta-analyses. For instance, for hemoglobin, we conditioned the new rare coding or splice site hemoglobin variants on all variants (across the genome) previously reported to associate with hemoglobin levels. We provide the complete list of variants on which we conditioned in per blood-cell trait. If the rare variants were not significantly associated with the residual phenotypes, we then ran pairwise conditional analyses to identify which previously known variants at the locus accounted for the association signal identified in this project. To declare statistical independence from previously reported hematological trait association signals, we used α = 0.002 (Bonferroni correction for 31 variants tested).

Secondary analyses for the rare PLG-rs145535174 variant

We sought association of the PLG variant (rs145535174) with platelet reactivity, as well as hemostatic and coagulation factors in the Framingham Heart Study (FHS) [49]. Genotyping for rs145535174 was conducted using the Illumina Human Exome BeadChip v.1.0 (Illumina, Inc., San Diego, CA) [50]. Multiple measurements were assessed for platelet reactivity [51], including maximum percentage platelet aggregation in response to agonists, i.e. ADP and epinephrine; minimal concentration of each agonist to produce a >50% aggregation response (EC50); and lag time in response to collagen stimulation. Hemostatic factors and coagulation factors, including antigens of plasminogen activator inhibitor-1 (PAI-1), tissue plasminogen activator (tPA), D-dimer, clotting factor VII (FVII) and von Willebrand factor (VWF), were measured using enzyme-linked immunosorbent assays [52-54] while fibrinogen levels were assessed using the Clauss method [52]. Association analyses in the FHS were conducted using either RAREMETALWORKER (http://genome.sph.umich.edu/wiki/RAREMETALWORKER) or seqMeta (http://cran.r-project.org/web/packages/seqMeta/index.html). A linear mixed effects model that accounts for familial correlation was used and adjustments were made for age, sex and principal components. The phenotypes were log-transformed or inverse normal transformed, as needed. We tested the association between the rare G-allele at PLG-rs145535174 and myocardial infarction (MI) and stroke in the UK Biobank. We used the “Health and Medical History” records to identify MI and stroke cases. For MI, we used the search terms “heart attack”, “myocardial infarction”, “acute myocardial infarction”, “subsequent myocardial infarction” and “old myocardial infarction” to retrieve affected individuals. We used the terms “stroke”, “ischaemic stroke” and “cerebral infarction” to define stroke cases. As controls, we excluded UK Biobank participants with MI, stroke or transient ischemic attack, percutaneous coronary intervention, coronary artery bypass graft surgery, peripheral vascular disease, congestive heart failure, and angina. For analysis of venous thromboembolism (VTE), we identified cases in the UK Biobank, the Montreal Heart Institute Biobank, and the Women’s Health Initiative as individuals with pulmonary embolism or deep vein thrombosis. We tested the genetic association by logistic regression in PLINK or R, correcting for age, sex and the first ten principal components when available.

Secondary analyses for the rare IL33-rs146597587 variant

We identified asthma, allergic rhinitis (hay fever), and endometriosis cases using the detailed “Health and Medical History” UK Biobank participant records. All other individuals were assigned as controls. We tested the genetic association by logistic regression in R, correcting for age, sex and the first ten principal components. To determine if the rare IL33-rs146597587 variant is independent from the previously reported common SNPs at the locus, we conditioned on genotypes at these variants (rs343496, rs7032572, rs72699186, rs1342326, rs2381416, rs928413, rs10975519) and re-run the logistic regression model.

Pathway analyses

We used the default parameters in DAVID [55] to perform biological term and pathway enrichment analyses. For these bioinformatic analyses, we used as reference set all genes with at least one rare coding or splice site variant tested in the meta-analyses. We retrieved genes associated with RBC, WBC, or PLT traits from (new independent variants from our study) and (known positive controls), and tested their enrichment in biological pathways in comparison with the reference set. Due to the relatively low number of genes that were used as input for this kind of analysis, we lowered the initial, minimum number of genes in a seeding group to 3 (default = 4) to ensure that the clustering algorithm will include as many genes as possible into functional groups. All other parameters were left at their default values.

Blood-Cell Consortium (BCX) membership and additional funding information.

Additional funding information for the following studies: MHI Biobank, BioMe, Health 2006/2008, GeneSTAR, BIOVU, MESA, ARIC, JHS, WHI, HANDLS, GSK-STABILITY, SOLID TIMI-52, CHS, Framingham Heart Study, SHIP/SHIP-TREND, NWIGM, FINCAVAS, YFS, CARDIA, HABC, EGCUT, AIRWAVE. (DOCX) Click here for additional data file.

Coding or splice site variants with a minor allele frequency <1% that were excluded during quality-control.

Chromosomes and positions are on build hg19 of the human genome. The direction of the effect sizes (Beta) is for allele A2. Beta and standard errors (SE) are in standard deviation units. Mono, monocyte; WBC, white blood cell count; MCH, mean corpuscular hemoglobin; Lympho, lymphocyte; MPV, mean platelet volume; HGB, hemoglobin; RBC, red blood cell count; Baso, basophil. (DOCX) Click here for additional data file.

Coding or splice site variants with a minor allele frequency <1% that were previously reported to associate with blood-cell traits.

Chromosomes and positions are on build hg19 of the human genome. The direction of the effect sizes (Beta) is for allele A2. Beta and standard errors (SE) are in standard deviation units. MPV, mean platelet volume; MCH, mean corpuscular hemoglobin; RDW, red blood cell distribution width; PLT, platelet count; WBC, white blood cell count; MCHC, mean corpuscular hemoglobin concentration; Mono, monocyte; Neutro, neutrophil; HGB, hemoglobin. (DOCX) Click here for additional data file.

Study-level association results for the 31 novel rare or splice site variants.

The direction of the BETA is for the A2 allele. There are slight differences between the UK Biobank association results presented here and those in because of the model used for conditional analyses in . (XLSX) Click here for additional data file.

Functional annotation of the 31 new independent rare (minor allele frequency <1%) variants identified in this study that are associated with hematological traits.

(XLSX) Click here for additional data file.

List of variants for condtional analyses.

For each blood-cell trait, we conditioned the association of the novel rare or coding variants on genotypes at these known blood-cell trait variants. (XLSX) Click here for additional data file.

Conditional results in the UK Biobank (UKBB).

When Pcond >0.002 (Bonferroni correction for 31 variants tested), we performed pairwise conditional analyses with markers at the locus to identify the variant that account for the association signal. This variant is listed in the Tagging variant column, along with its minor allele frequency and functional annotation. MPV, mean platelet volume; MCH, mean corpuscular hemoglobin; RDW, red blood cell distribution width; PLT, platelet count; WBC, white blood cell count; MCHC, mean corpuscular hemoglobin concentration; Mono, monocyte; Neutro, neutrophil; HGB, hemoglobin; MCV, mean corpuscular volume; Eosin, eosinophil; RBC, red blood cell count; HCT, hematocrit. (DOCX) Click here for additional data file.

Association between PLG-rs145535174 and thrombotic events.

All analyses were corrected for age and sex. UKBB, UK Biobank; MHIBB, Montreal Heart Institute Biobank; WHI, Women’s Health Initiative. The direction of the odds ratios is for the rare G-allele at rs145535174 (allele A2). (DOCX) Click here for additional data file.

Prioritization of candidate genes.

For each of the 31 novel rare coding or splice site variants presented in this study, we queried the corresponding loci in previous GWAS of blood-cell traits and highlighted previously prioritized candidate genes based on functional annotation (missense, splice site) or regulatory (eQTL) effect in the GTEx database. (DOCX) Click here for additional data file.

Reciprocal conditional analysis results in the UK Biobank (UKBB).

For each of the 31 variants in Table 1, we conditioned the variants from Astle et al. Associated with the same trait and located on the same chromosome. For instance for the IL33 locus, we conditioned all eosinophil-associated SNPs on chromosome 9 that were reported by Astle et al. with genotypes at the rare IL33 splice site variant (rs146597587).
Table 1

New rare coding or splice site variants associated with hematological traits.

Chromosome and genomic coordinates are on build hg19 of the human genome. Allele frequencies and directions of effect are for allele A2. Effect sizes (BETA and SE) are in standard deviation units. Phet is the Cochran’s Q test heterogeneity P-value. We provide P-values in the UK Biobank (UKBB) without (non-conditional) and with (conditional) correction for the known blood-cell variants [1]. Detailed results of the conditional analyses are in . Other phenotypes are hematological traits associated at P<5x10-8 in the meta-analyses. In the Comment column, we consider SNPs previously associated with blood-cell traits that are located 500 kb on either side of the reported rare variant. a, variants that are statistically dependent of known blood-cell trait variants; b, variants that are statistically independent from known blood-cell variants, but that are located at known blood-cell trait loci; c, new blood-cell trait locus.

SNPIDCHR(POS)GeneAnnotation (VEP)A1/A2Freq (A2)Main phenotypeBETA (SE)P-valuePhetUKBB Pnon-condUKBB PcondOther phenotypesComment
rs148916169a1 (36932463)CSF3RmissenseA/G0.0066WBC0.1308 (0.0181)4.62E-130.166.09E-070.61NeutroKnown WBC locus.
rs138903557b2 (24245713)MFSD2BmissenseC/G0.0046MCV-0.126 (0.0231)4.76E-080.482.70E-070.000032Locus associated with WBC and RBC traits.
rs147820690b2 (160735174)LY75-CD302missenseC/T0.0026PLT0.1654 (0.0287)8.19E-090.830.00430.00069Locus associated with PLT and WBC traits.
rs116274727b2 (192701265)SDPRmissenseT/C0.0061MPV0.1623 (0.0215)3.95E-140.977.77E-089.57E-12Known PLT locus.
rs28910273b3 (142188337)ATRmissenseA/C0.0067MCV-0.1318 (0.0185)1.02E-120.171.70E-083.74E-10MCHKnown RBC locus.
rs77208665b3 (142274770)ATRmissenseT/C0.0056MCV-0.1171 (0.0208)1.91E-080.0891.38E-055.16E-07
rs151053159b5 (1078832)SLC12A7missenseC/T0.0034RDW-0.2013 (0.0278)4.77E-130.999.81E-100.0012Locus associated with WBC and RBC traits.
rs121434346a5 (1212453)SLC6A19missenseG/A0.0042RDW-0.1379 (0.0246)2.15E-080.652.06E-050.63Locus associated with WBC and RBC traits.
rs145535174c6 (161134124)PLGmissenseA/G0.0005PLT0.34 (0.0587)6.83E-091.00.000173.62E-05Novel locus
rs74848966a7 (100365467)ZANmissenseG/A0.0044MCH0.1522 (0.0263)7.02E-090.262.20E-060.27EPO/TFR2 locus, previously associated with RBC traits.
rs141547371a9 (214606)C9orf66missenseA/C0.0027MPV0.3022 (0.0479)2.87E-100.375.12E-080.012Known PLT locus.
rs146597587a9 (6255967)IL33splice acceptorG/C0.0047Eosin-0.2504 (0.0252)2.40E-230.121.02E-170.88Known WBC locus.
rs146879704b9 (114886569)SUSD1missenseC/T0.0037HGB0.1509 (0.0254)2.92E-090.161.87E-063.00E-05Known RBC locus.
rs141547732a9 (136280025)REXO4missenseG/A0.0064RBC0.1195 (0.0195)8.47E-100.598.05E-070.40ABO locus, previously associated with RBC traits.
rs71508957a10 (64927837)JMJD1CmissenseC/T0.0085MPV0.1142 (0.0175)7.54E-110.194.14E-080.030PLTLocus associated with WBC, PLT and RBC traits.
rs61748606a11 (230474)SIRT3missenseG/T0.0060MPV-0.1624 (0.0274)3.05E-090.323.74E-100.46Locus associated with WBC, PLT and RBC traits.
rs138326449b11 (116701354)APOC3splice donorG/A0.0020RDW0.2857 (0.0444)1.19E-100.0184.48E-082.72E-09Known PLT locus. This rare variants is associated with triglyceride levels and coronary artery disease.
rs150349412a12 (112184086)ACAD10missenseG/A0.0027PLT0.176 (0.0318)3.01E-080.322.19E-060.65SH2B3 locus, previously associated with RBC, WBC and PLT traits.
rs145120027a12 (122439451)WDR66missenseT/C0.0011MPV0.3647 (0.0591)6.70E-100.0392.02E-090.84Locus associated with PLT and RBC traits.
rs17881033a12 (122763670)CLIP1missenseC/A0.0091MPV0.1792 (0.0171)9.90E-260.0872.44E-200.66
rs151322438a12 (123335398)HIP1RmissenseC/T0.0019MPV0.3258 (0.0407)1.16E-150.261.21E-130.20
rs146030737a13 (28626716)FLT3missenseC/T0.0057Mono0.1475 (0.0249)3.15E-090.673.81E-060.52Known WBC locus.
rs182782800a13 (73319139)BORAmissenseG/A0.0038MCV-0.1802 (0.024)5.50E-140.533.26E-090.012MCHKnown RBC locus.
rs13888768b14 (103568488)EXOC3L4missenseG/A0.0023MPV-0.2783 (0.0468)2.72E-090.236.37E-071.36E-10Known PLT locus.
rs148718670a14 (103574815)EXOC3L4missenseG/A0.0032MPV-0.1677 (0.0295)1.28E-080.0574.24E-060.0034
rs184575290b15 (80191280)ST20missenseT/A0.0035Mono0.2164 (0.0302)7.28E-130.00481.57E-109.30E-09Known WBC locus.
rs57268939b16 (319547)FAM234AmissenseT/C0.0005MCH-0.5633 (0.0922)9.83E-100.0346.05E-111.85E-09MCVα-globin locus, previously associated with RBC traits.
rs147810715b16 (30999491)HSD3B7missenseG/C0.0004MCH-0.4918 (0.0838)4.39E-090.633.22E-084.74E-08Known RBC locus.
rs35266519 a17 (38062390)GSDMBmissenseC/T0.0089Neutro-0.101 (0.0173)4.84E-090.613.18E-070.32Known WBC locus.
rs150420714b19 (50017538)FCGRTmissenseG/C0.0083HCT0.1033 (0.0167)6.26E-100.439.77E-052.76E-05HGB,RBCKnown RBC locus.
rs201148397b22 (37482458)TMPRSS6missenseC/A0.0021MCH0.2542 (0.0371)7.12E-120.0313.89E-098.86E-05MCVLocus associated with PLT and RBC traits.
(XLSX) Click here for additional data file.

Quantile-quantile plots for rare-variant association results (minor allele frequency <1%).

We present association results for 15 hematological traits analyzed at 137,086 variants in up to 308,572 participants. (JPG) Click here for additional data file.
Table 2

Association results between PLG-rs145535174 and hemostatic and coagulation factors in the Framingham Heart Study.

The direction of the effects (BETA) is given for the rare G-allele (on forward strand). SE, standard error; ADP, Adenosine diphosphate; PAI-1, plasminogen activator inhibitor type 1; tPA, tissue plasminogen activator; FVII, factor VII; vWF, von Willebrand factor. We highlight in bold nominal P-values <0.05.

TraitSample size(# carriers)Freq(G-allele)BETA (SE)P-value
Epinephrine 0.5uM Platelet Reactivity, % aggregation1877 (6)0.0016-1.092 (0.432)0.012
Epinephrine 1.0uM Platelet Reactivity, % aggregation2009 (7)0.0017-0.774 (0.402)0.054
Epinephrine 3.0uM Platelet Reactivity, % aggregation1131 (6)0.0027-0.286(0.426)0.502
Epinephrine EC50 Platelet Reactivity, uM2199 (8)0.00180.737 (0.381)0.053
Collagen lag time, second2152 (8)0.00190.435 (0.383)0.256
ADP 1.0uM Platelet Reactivity, % aggregation2392 (7)0.0015-0.920 (0.415)0.026
ADP 3.0uM Platelet Reactivity, % aggregation2555 (8)0.0016-0.451 (0.375)0.229
ADP 5.0uM Platelet Reactivity, % aggregation1686 (7)0.0021-0.426 (0.408)0.297
ADP EC50 Platelet Reactivity, uM2205 (8)0.00180.3818 (0.378)0.313
PAI-1, ng/ml6441 (14)0.0011-0.339 (0.197)0.086
tPA, ng/ml2613 (9)0.0017-0.116 (0.152)0.446
D-dimer, ng/ml2874 (8)0.0005-0.479 (0.202)0.018
Fibrinogen, g/L6711 (16)0.0012-0.059 (0.047)0.207
FVII, % antigen2620 (9)0.00172.261 (5.563)0.684
vWF % antigen2621 (9)0.001721.187 (14.551)0.145
Table 3

Association between IL33-rs146597587 and asthma, allergic rhinitis, or endometriosis risk in the UK Biobank.

All analyses were corrected for age, sex, and the first 10 principal components. The direction of the odds ratio is for the rare allele (rs146597587, C-allele on forward strand). The conditional P-values were calculated in a logistic model accounting for genotypes at common single nucleotide polymorphisms (SNPs) previously associated with asthma, allergic rhinitis, or endometriosis: rs343496, rs7032572, rs72699186, rs1342326, rs2381416, rs928413, rs10975519.

DiseaseNcasesNctrlsFreq(C-allele)Odds ratio95% confidence intervalP-valueP-value (conditional)
Asthma17,565118,1190.00480.560.45–0.702.60x10-71.22x10-6
Allergic rhinitis7,918127,7660.00480.550.39–0.764.21x10-47.4x10-4
Endometriosis1,589134,0950.00480.860.46–1.490.630.67
Table 4

Pathway analyses using DAVID with genes that carry rare coding or splice site variants associated with blood-cell traits.

We only present biological pathways with a false discovery rate (FDR) <10%.

Biological pathway or termFold EnrichmentP-valueFDRGenes
Red blood cell
Iron ion homeostasis121.30.000230.26TF, TFR2, TMPRSS6
Cellular iron ion homeostasis85.30.000460.53TF, TFR2, TMPRSS6
Ferrous iron import into cell1051.50.001751.99TF, TFR2
Cellular response to iron ion525.80.003513.95TF, TFR2
HFE-transferrin receptor complex279.30.006676.00TF, TFR2
Disulfide bond3.20.008959.32TF, TFR2, SUSD1, ZAN, FCGRT, TMPRSS6, EPO
White blood cells
Cytokine-cytokine receptor interaction19.10.000400.28FLT3, CSF3R, CXCR2, IL17RA
Neutrophil chemotaxis99.10.000310.34CSF3R, CXCR2, JAML
Cytokine-mediated signaling pathway41.10.001771.94FLT3, CSF3R, IL17RA
Integral component of plasma membrane6.60.002721.98FLT3, S1PR4, CSF3R, CXCR2, IL17RA
Plasma membrane3.30.002942.15GSDMB, FLT3, S1PR4, CSF3R, CXCR2, JAML, IL17RA
Receptor6.40.002922.63FLT3, S1PR4, CSF3R, CXCR2, IL17RA
Topological domain:Extracellular4.30.003162.72FLT3, S1PR4, CSF3R, CXCR2, JAML, IL17RA
Cell membrane4.00.004614.11GSDMB, S1PR4, CSF3R, CXCR2, JAML, IL17RA
Disulfide bond3.70.006325.60GSDMB, FLT3, CSF3R, CXCR2, JAML, IL17RA
Membrane2.30.007326.46GSDMB, FLT3, S1PR4, CSF3R, CXCR2, JAML, IL33, IL17RA
Topological domain:Cytoplasmic3.50.008337.04FLT3, S1PR4, CSF3R, CXCR2, JAML, IL17RA
Transmembrane helix2.60.010308.99FLT3, S1PR4, CSF3R, CXCR2, JAML, IL33, IL17RA
Transmembrane2.60.010439.10FLT3, S1PR4, CSF3R, CXCR2, JAML, IL33, IL17RA
Platelet
Blood coagulation26.40.000330.40SH2B3, JAK2, JMJD1C, PLG
Erythrocytosis, somatic505.50.003292.10SH2B3, JAK2
Myelofibrosis, somatic337.00.004943.14SH2B3, JAK2
Pleckstrin homology-like domain9.40.006827.13PLEK, SH2B3, JAK2, KALRN
Phosphoprotein1.80.007237.31PLEK, HIP1R, IQGAP2, PLG, EXOC3L4, CKAP2L, SDPR, CLIP1, SH2B3, JAK2, JMJD1C, TUBB1, LY75-CD302, KALRN
  53 in total

1.  Plasmin desensitization of the PAR1 thrombin receptor: kinetics, sites of truncation, and implications for thrombolytic therapy.

Authors:  A Kuliopulos; L Covic; S K Seeley; P J Sheridan; J Helin; C E Costello
Journal:  Biochemistry       Date:  1999-04-06       Impact factor: 3.162

2.  Actin mutations in dilated cardiomyopathy, a heritable form of heart failure.

Authors:  T M Olson; V V Michels; S N Thibodeau; Y S Tai; M T Keating
Journal:  Science       Date:  1998-05-01       Impact factor: 47.728

3.  Sirtuin Inhibition Induces Apoptosis-like Changes in Platelets and Thrombocytopenia.

Authors:  Sharda Kumari; Susheel N Chaurasia; Manasa K Nayak; Ram L Mallick; Debabrata Dash
Journal:  J Biol Chem       Date:  2015-03-31       Impact factor: 5.157

Review 4.  Quebec platelet disorder: update on pathogenesis, diagnosis, and treatment.

Authors:  Jessica Blavignac; Natalia Bunimov; Georges E Rivard; Catherine P M Hayward
Journal:  Semin Thromb Hemost       Date:  2011-11-18       Impact factor: 4.180

5.  Predicting functional effect of human missense mutations using PolyPhen-2.

Authors:  Ivan Adzhubei; Daniel M Jordan; Shamil R Sunyaev
Journal:  Curr Protoc Hum Genet       Date:  2013-01

6.  Rare and low-frequency variants and their association with plasma levels of fibrinogen, FVII, FVIII, and vWF.

Authors:  Jennifer E Huffman; Paul S de Vries; Alanna C Morrison; Maria Sabater-Lleal; Tim Kacprowski; Paul L Auer; Jennifer A Brody; Daniel I Chasman; Ming-Huei Chen; Xiuqing Guo; Li-An Lin; Riccardo E Marioni; Martina Müller-Nurasyid; Lisa R Yanek; Nathan Pankratz; Megan L Grove; Moniek P M de Maat; Mary Cushman; Kerri L Wiggins; Lihong Qi; Bengt Sennblad; Sarah E Harris; Ozren Polasek; Helene Riess; Fernando Rivadeneira; Lynda M Rose; Anuj Goel; Kent D Taylor; Alexander Teumer; André G Uitterlinden; Dhananjay Vaidya; Jie Yao; Weihong Tang; Daniel Levy; Melanie Waldenberger; Diane M Becker; Aaron R Folsom; Franco Giulianini; Andreas Greinacher; Albert Hofman; Chiang-Ching Huang; Charles Kooperberg; Angela Silveira; John M Starr; Konstantin Strauch; Rona J Strawbridge; Alan F Wright; Barbara McKnight; Oscar H Franco; Neil Zakai; Rasika A Mathias; Bruce M Psaty; Paul M Ridker; Geoffrey H Tofler; Uwe Völker; Hugh Watkins; Myriam Fornage; Anders Hamsten; Ian J Deary; Eric Boerwinkle; Wolfgang Koenig; Jerome I Rotter; Caroline Hayward; Abbas Dehghan; Alex P Reiner; Christopher J O'Donnell; Nicholas L Smith
Journal:  Blood       Date:  2015-06-23       Impact factor: 22.113

7.  A promoter-level mammalian expression atlas.

Authors:  Alistair R R Forrest; Hideya Kawaji; Michael Rehli; J Kenneth Baillie; Michiel J L de Hoon; Vanja Haberle; Timo Lassmann; Ivan V Kulakovskiy; Marina Lizio; Masayoshi Itoh; Robin Andersson; Christopher J Mungall; Terrence F Meehan; Sebastian Schmeier; Nicolas Bertin; Mette Jørgensen; Emmanuel Dimont; Erik Arner; Christian Schmidl; Ulf Schaefer; Yulia A Medvedeva; Charles Plessy; Morana Vitezic; Jessica Severin; Colin A Semple; Yuri Ishizu; Robert S Young; Margherita Francescatto; Intikhab Alam; Davide Albanese; Gabriel M Altschuler; Takahiro Arakawa; John A C Archer; Peter Arner; Magda Babina; Sarah Rennie; Piotr J Balwierz; Anthony G Beckhouse; Swati Pradhan-Bhatt; Judith A Blake; Antje Blumenthal; Beatrice Bodega; Alessandro Bonetti; James Briggs; Frank Brombacher; A Maxwell Burroughs; Andrea Califano; Carlo V Cannistraci; Daniel Carbajo; Yun Chen; Marco Chierici; Yari Ciani; Hans C Clevers; Emiliano Dalla; Carrie A Davis; Michael Detmar; Alexander D Diehl; Taeko Dohi; Finn Drabløs; Albert S B Edge; Matthias Edinger; Karl Ekwall; Mitsuhiro Endoh; Hideki Enomoto; Michela Fagiolini; Lynsey Fairbairn; Hai Fang; Mary C Farach-Carson; Geoffrey J Faulkner; Alexander V Favorov; Malcolm E Fisher; Martin C Frith; Rie Fujita; Shiro Fukuda; Cesare Furlanello; Masaaki Furino; Jun-ichi Furusawa; Teunis B Geijtenbeek; Andrew P Gibson; Thomas Gingeras; Daniel Goldowitz; Julian Gough; Sven Guhl; Reto Guler; Stefano Gustincich; Thomas J Ha; Masahide Hamaguchi; Mitsuko Hara; Matthias Harbers; Jayson Harshbarger; Akira Hasegawa; Yuki Hasegawa; Takehiro Hashimoto; Meenhard Herlyn; Kelly J Hitchens; Shannan J Ho Sui; Oliver M Hofmann; Ilka Hoof; Furni Hori; Lukasz Huminiecki; Kei Iida; Tomokatsu Ikawa; Boris R Jankovic; Hui Jia; Anagha Joshi; Giuseppe Jurman; Bogumil Kaczkowski; Chieko Kai; Kaoru Kaida; Ai Kaiho; Kazuhiro Kajiyama; Mutsumi Kanamori-Katayama; Artem S Kasianov; Takeya Kasukawa; Shintaro Katayama; Sachi Kato; Shuji Kawaguchi; Hiroshi Kawamoto; Yuki I Kawamura; Tsugumi Kawashima; Judith S Kempfle; Tony J Kenna; Juha Kere; Levon M Khachigian; Toshio Kitamura; S Peter Klinken; Alan J Knox; Miki Kojima; Soichi Kojima; Naoto Kondo; Haruhiko Koseki; Shigeo Koyasu; Sarah Krampitz; Atsutaka Kubosaki; Andrew T Kwon; Jeroen F J Laros; Weonju Lee; Andreas Lennartsson; Kang Li; Berit Lilje; Leonard Lipovich; Alan Mackay-Sim; Ri-ichiroh Manabe; Jessica C Mar; Benoit Marchand; Anthony Mathelier; Niklas Mejhert; Alison Meynert; Yosuke Mizuno; David A de Lima Morais; Hiromasa Morikawa; Mitsuru Morimoto; Kazuyo Moro; Efthymios Motakis; Hozumi Motohashi; Christine L Mummery; Mitsuyoshi Murata; Sayaka Nagao-Sato; Yutaka Nakachi; Fumio Nakahara; Toshiyuki Nakamura; Yukio Nakamura; Kenichi Nakazato; Erik van Nimwegen; Noriko Ninomiya; Hiromi Nishiyori; Shohei Noma; Shohei Noma; Tadasuke Noazaki; Soichi Ogishima; Naganari Ohkura; Hiroko Ohimiya; Hiroshi Ohno; Mitsuhiro Ohshima; Mariko Okada-Hatakeyama; Yasushi Okazaki; Valerio Orlando; Dmitry A Ovchinnikov; Arnab Pain; Robert Passier; Margaret Patrikakis; Helena Persson; Silvano Piazza; James G D Prendergast; Owen J L Rackham; Jordan A Ramilowski; Mamoon Rashid; Timothy Ravasi; Patrizia Rizzu; Marco Roncador; Sugata Roy; Morten B Rye; Eri Saijyo; Antti Sajantila; Akiko Saka; Shimon Sakaguchi; Mizuho Sakai; Hiroki Sato; Suzana Savvi; Alka Saxena; Claudio Schneider; Erik A Schultes; Gundula G Schulze-Tanzil; Anita Schwegmann; Thierry Sengstag; Guojun Sheng; Hisashi Shimoji; Yishai Shimoni; Jay W Shin; Christophe Simon; Daisuke Sugiyama; Takaai Sugiyama; Masanori Suzuki; Naoko Suzuki; Rolf K Swoboda; Peter A C 't Hoen; Michihira Tagami; Naoko Takahashi; Jun Takai; Hiroshi Tanaka; Hideki Tatsukawa; Zuotian Tatum; Mark Thompson; Hiroo Toyodo; Tetsuro Toyoda; Elvind Valen; Marc van de Wetering; Linda M van den Berg; Roberto Verado; Dipti Vijayan; Ilya E Vorontsov; Wyeth W Wasserman; Shoko Watanabe; Christine A Wells; Louise N Winteringham; Ernst Wolvetang; Emily J Wood; Yoko Yamaguchi; Masayuki Yamamoto; Misako Yoneda; Yohei Yonekura; Shigehiro Yoshida; Susan E Zabierowski; Peter G Zhang; Xiaobei Zhao; Silvia Zucchelli; Kim M Summers; Harukazu Suzuki; Carsten O Daub; Jun Kawai; Peter Heutink; Winston Hide; Tom C Freeman; Boris Lenhard; Vladimir B Bajic; Martin S Taylor; Vsevolod J Makeev; Albin Sandelin; David A Hume; Piero Carninci; Yoshihide Hayashizaki
Journal:  Nature       Date:  2014-03-27       Impact factor: 49.962

8.  Meta-analysis of rare and common exome chip variants identifies S1PR4 and other loci influencing blood cell traits.

Authors: 
Journal:  Nat Genet       Date:  2016-07-11       Impact factor: 38.330

9.  A rare IL33 loss-of-function mutation reduces blood eosinophil counts and protects from asthma.

Authors:  Dirk Smith; Hannes Helgason; Patrick Sulem; Unnur Steina Bjornsdottir; Ai Ching Lim; Gardar Sveinbjornsson; Haruki Hasegawa; Michael Brown; Randal R Ketchem; Monica Gavala; Logan Garrett; Adalbjorg Jonasdottir; Aslaug Jonasdottir; Asgeir Sigurdsson; Olafur T Magnusson; Gudmundur I Eyjolfsson; Isleifur Olafsson; Pall Torfi Onundarson; Olof Sigurdardottir; David Gislason; Thorarinn Gislason; Bjorn Runar Ludviksson; Dora Ludviksdottir; H Marike Boezen; Andrea Heinzmann; Marcus Krueger; Celeste Porsbjerg; Tarunveer S Ahluwalia; Johannes Waage; Vibeke Backer; Klaus A Deichmann; Gerard H Koppelman; Klaus Bønnelykke; Hans Bisgaard; Gisli Masson; Unnur Thorsteinsdottir; Daniel F Gudbjartsson; James A Johnston; Ingileif Jonsdottir; Kari Stefansson
Journal:  PLoS Genet       Date:  2017-03-08       Impact factor: 5.917

10.  Rare and low-frequency coding variants in CXCR2 and other genes are associated with hematological traits.

Authors:  Paul L Auer; Alexander Teumer; Ursula Schick; Andrew O'Shaughnessy; Ken Sin Lo; Nathalie Chami; Chris Carlson; Simon de Denus; Marie-Pierre Dubé; Jeff Haessler; Rebecca D Jackson; Charles Kooperberg; Louis-Philippe Lemieux Perreault; Matthias Nauck; Ulrike Peters; John D Rioux; Frank Schmidt; Valérie Turcot; Uwe Völker; Henry Völzke; Andreas Greinacher; Li Hsu; Jean-Claude Tardif; George A Diaz; Alexander P Reiner; Guillaume Lettre
Journal:  Nat Genet       Date:  2014-04-28       Impact factor: 38.330

View more
  17 in total

Review 1.  Deciphering the Emerging Complexities of Molecular Mechanisms at GWAS Loci.

Authors:  Maren E Cannon; Karen L Mohlke
Journal:  Am J Hum Genet       Date:  2018-11-01       Impact factor: 11.025

2.  Whole-Genome Sequencing in Severe Chronic Obstructive Pulmonary Disease.

Authors:  Dmitry Prokopenko; Phuwanat Sakornsakolpat; Heide Loehlein Fier; Dandi Qiao; Margaret M Parker; Merry-Lynn N McDonald; Ani Manichaikul; Stephen S Rich; R Graham Barr; Christopher J Williams; Mark L Brantly; Christoph Lange; Terri H Beaty; James D Crapo; Edwin K Silverman; Michael H Cho
Journal:  Am J Respir Cell Mol Biol       Date:  2018-11       Impact factor: 6.914

3.  14q32 and let-7 microRNAs regulate transcriptional networks in fetal and adult human erythroblasts.

Authors:  Samuel Lessard; Mélissa Beaudoin; Stuart H Orkin; Daniel E Bauer; Guillaume Lettre
Journal:  Hum Mol Genet       Date:  2018-04-15       Impact factor: 6.150

4.  eSCAN: scan regulatory regions for aggregate association testing using whole-genome sequencing data.

Authors:  Yingxi Yang; Quan Sun; Le Huang; Jai G Broome; Adolfo Correa; Alexander Reiner; Laura M Raffield; Yuchen Yang; Yun Li
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

5.  Whole genome sequence analysis of platelet traits in the NHLBI Trans-Omics for Precision Medicine (TOPMed) initiative.

Authors:  Amarise Little; Yao Hu; Quan Sun; Deepti Jain; Jai Broome; Ming-Huei Chen; Florian Thibord; Caitlin McHugh; Praveen Surendran; Thomas W Blackwell; Jennifer A Brody; Arunoday Bhan; Nathalie Chami; Paul S de Vries; Lynette Ekunwe; Nancy Heard-Costa; Brian D Hobbs; Ani Manichaikul; Jee-Young Moon; Michael H Preuss; Kathleen Ryan; Zhe Wang; Marsha Wheeler; Lisa R Yanek; Goncalo R Abecasis; Laura Almasy; Terri H Beaty; Lewis C Becker; John Blangero; Eric Boerwinkle; Adam S Butterworth; Hélène Choquet; Adolfo Correa; Joanne E Curran; Nauder Faraday; Myriam Fornage; David C Glahn; Lifang Hou; Eric Jorgenson; Charles Kooperberg; Joshua P Lewis; Donald M Lloyd-Jones; Ruth J F Loos; Yuan-I Min; Braxton D Mitchell; Alanna C Morrison; Deborah A Nickerson; Kari E North; Jeffrey R O'Connell; Nathan Pankratz; Bruce M Psaty; Ramachandran S Vasan; Stephen S Rich; Jerome I Rotter; Albert V Smith; Nicholas L Smith; Hua Tang; Russell P Tracy; Matthew P Conomos; Cecelia A Laurie; Rasika A Mathias; Yun Li; Paul L Auer; Timothy Thornton; Alexander P Reiner; Andrew D Johnson; Laura M Raffield
Journal:  Hum Mol Genet       Date:  2022-02-03       Impact factor: 5.121

6.  Estimation of non-null SNP effect size distributions enables the detection of enriched genes underlying complex traits.

Authors:  Wei Cheng; Sohini Ramachandran; Lorin Crawford
Journal:  PLoS Genet       Date:  2020-06-15       Impact factor: 5.917

7.  Whole-genome sequencing association analysis of quantitative red blood cell phenotypes: The NHLBI TOPMed program.

Authors:  Yao Hu; Adrienne M Stilp; Caitlin P McHugh; Shuquan Rao; Deepti Jain; Xiuwen Zheng; John Lane; Sébastian Méric de Bellefon; Laura M Raffield; Ming-Huei Chen; Lisa R Yanek; Marsha Wheeler; Yao Yao; Chunyan Ren; Jai Broome; Jee-Young Moon; Paul S de Vries; Brian D Hobbs; Quan Sun; Praveen Surendran; Jennifer A Brody; Thomas W Blackwell; Hélène Choquet; Kathleen Ryan; Ravindranath Duggirala; Nancy Heard-Costa; Zhe Wang; Nathalie Chami; Michael H Preuss; Nancy Min; Lynette Ekunwe; Leslie A Lange; Mary Cushman; Nauder Faraday; Joanne E Curran; Laura Almasy; Kousik Kundu; Albert V Smith; Stacey Gabriel; Jerome I Rotter; Myriam Fornage; Donald M Lloyd-Jones; Ramachandran S Vasan; Nicholas L Smith; Kari E North; Eric Boerwinkle; Lewis C Becker; Joshua P Lewis; Goncalo R Abecasis; Lifang Hou; Jeffrey R O'Connell; Alanna C Morrison; Terri H Beaty; Robert Kaplan; Adolfo Correa; John Blangero; Eric Jorgenson; Bruce M Psaty; Charles Kooperberg; Russell T Walton; Benjamin P Kleinstiver; Hua Tang; Ruth J F Loos; Nicole Soranzo; Adam S Butterworth; Debbie Nickerson; Stephen S Rich; Braxton D Mitchell; Andrew D Johnson; Paul L Auer; Yun Li; Rasika A Mathias; Guillaume Lettre; Nathan Pankratz; Cathy C Laurie; Cecelia A Laurie; Daniel E Bauer; Matthew P Conomos; Alexander P Reiner
Journal:  Am J Hum Genet       Date:  2021-04-21       Impact factor: 11.025

8.  A large-scale exome array analysis of venous thromboembolism.

Authors:  Sara Lindström; Jennifer A Brody; Constance Turman; Marine Germain; Traci M Bartz; Erin N Smith; Ming-Huei Chen; Marja Puurunen; Daniel Chasman; Jeffrey Hassler; Nathan Pankratz; Saonli Basu; Weihua Guan; Beata Gyorgy; Manal Ibrahim; Jean-Philippe Empana; Robert Olaso; Rebecca Jackson; Sigrid K Braekkan; Barbara McKnight; Jean-Francois Deleuze; Cristopher J O'Donnell; Xavier Jouven; Kelly A Frazer; Bruce M Psaty; Kerri L Wiggins; Kent Taylor; Alexander P Reiner; Susan R Heckbert; Charles Kooperberg; Paul Ridker; John-Bjarne Hansen; Weihong Tang; Andrew D Johnson; Pierre-Emmanuel Morange; David A Trégouët; Peter Kraft; Nicholas L Smith; Christopher Kabrhel
Journal:  Genet Epidemiol       Date:  2019-01-19       Impact factor: 2.344

9.  Heterozygous germ line CSF3R variants as risk alleles for development of hematologic malignancies.

Authors:  Amy M Trottier; Lawrence J Druhan; Ira L Kraft; Amanda Lance; Simone Feurstein; Maria Helgeson; Jeremy P Segal; Soma Das; Belinda R Avalos; Lucy A Godley
Journal:  Blood Adv       Date:  2020-10-27

10.  An Asian-specific MPL genetic variant alters JAK-STAT signaling and influences platelet count in the population.

Authors:  Pengfei Sun; Wei Zhou; Yi Fu; Chloe Y Y Cheung; Yujun Dong; Min-Lee Yang; He Zhang; Jia Jia; Yong Huo; Cristen J Willer; Y Eugene Chen; Clara S Tang; Hung-Fat Tse; Karen S L Lam; Wei Gao; Ming Xu; Haiyi Yu; Pak Chung Sham; Yan Zhang; Santhi K Ganesh
Journal:  Hum Mol Genet       Date:  2021-05-28       Impact factor: 6.150

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.