| Literature DB >> 26366553 |
Fabrice Danjou1, Magdalena Zoledziewska1, Carlo Sidore1,2,3, Maristella Steri1, Fabio Busonero1,2,4, Andrea Maschio1,2,4, Antonella Mulas1,3, Lucia Perseu1, Susanna Barella5, Eleonora Porcu1,2,3, Giorgio Pistis1,2,3, Maristella Pitzalis1, Mauro Pala1, Stephan Menzel6, Sarah Metrustry7, Timothy D Spector7, Lidia Leoni8, Andrea Angius1,8, Manuela Uda1, Paolo Moi5,9, Swee Lay Thein6,10, Renzo Galanello5,9, Gonçalo R Abecasis2, David Schlessinger11, Serena Sanna1, Francesco Cucca1,3.
Abstract
We report genome-wide association study results for the levels of A1, A2 and fetal hemoglobins, analyzed for the first time concurrently. Integrating high-density array genotyping and whole-genome sequencing in a large general population cohort from Sardinia, we detected 23 associations at 10 loci. Five signals are due to variants at previously undetected loci: MPHOSPH9, PLTP-PCIF1, ZFPM1 (FOG1), NFIX and CCND3. Among the signals at known loci, ten are new lead variants and four are new independent signals. Half of all variants also showed pleiotropic associations with different hemoglobins, which further corroborated some of the detected associations and identified features of coordinated hemoglobin species production.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26366553 PMCID: PMC4627580 DOI: 10.1038/ng.3307
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Figure 1Association at the globin clusters
Schematic representation of association results in the genomic context of the β-globin (panel a) and α-globin (panel b) gene clusters. For each hemoglobin, the markers associated are positioned with + or – corresponding to an increase or decrease in the corresponding trait by the effective allele (as in Table 1). Symbol is larger if the marker is associated at genome-wide level or smaller if it results from the analysis of pleiotropic effects. The β039 mutation and –α 3.7 type I deletion as well as relevant genes and the locus control region hypersensitivity sites (HS) are indicated. Finally, at the bottom of each panel is represented the linkage disequilibrium (r2) profile for the region in Sardinia, with colors ranging from high (red), to intermediate (green), and low (blue).
Most significant independent association results from single variant tests for hemoglobin A1, A2 and fetal
The table shows the most significant association results (all results are corrected for β0 mutations observed in the HBB gene, and results on the α-globin gene cluster are adjusted for the −α 3.7 deletion type I, see Online Methods). Novel signals are shown in bold while variants refining previously reported signals are in italic. At each locus, we indicated the chromosome and genomic position (hg19 build), the rs ID when available, the effect allele tested for association (EA) and the other allele at the SNP (OA), the imputation accuracy (RSQR), the SNP effect allele frequency (EAF) and the regression coefficients. We then indicated whether the SNP is also linked the other hemoglobin forms (p < 0.01), and indicated the direction of the effect allele (+ for increasing the levels of Hb, - for decreasing). The candidate genes likely to be modulated by the lead SNP are also reported along with their inclusion criteria, as described in Online Methods (p = position, c = coding, e = eQTL, o = OMIM, b = biological). Where “α-globin gene cluster” is mentioned we refer to NPRL3, HBZ, HBQ1, HBA1, HBA2 and HBM genes; while for “β-globin gene cluster” we refer to HBB, HBD, HBBP1, HBG1, HBG2 and HBE1 genes. Association coefficients for males and females are reported in Supplementary Table 11.
| Shared effects | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Traits (units) and loci # | Candidate genes | chr:position | rsID from dbsnp142 | Alleles (EA/OA) | RSQR | EAF | Effect (StdErr) | p-value | HbA1 | HbA2 | HbF |
| HbA1 (g/dl) | |||||||||||
| locus1 | α-globin gene cluster(p,o,b); | rs570013781 |
|
|
|
|
| − | − | − | |
| α-globin gene cluster (p,o,b); |
|
|
|
|
|
|
|
| |||
|
| |||||||||||
|
|
|
|
|
|
|
| − | ||||
|
| |||||||||||
|
|
|
|
|
|
|
|
| ||||
|
| |||||||||||
| HbA2 | |||||||||||
|
| |||||||||||
| β-globin gene cluster(p,o,b); |
|
|
|
|
|
| − | ||||
| β-globin gene cluster (p,o,b); |
|
|
|
|
|
| − |
| |||
| β-globin gene cluster (p,o,b); |
|
|
|
|
|
| − | − | |||
| β-globin gene cluster (p,o,b); |
|
|
|
|
|
|
| ||||
| β-globin gene cluster (p,o,b); |
|
|
|
|
|
|
|
|
| ||
|
| |||||||||||
| locus2 | α-globin gene cluster (p,o,b); |
|
|
|
|
|
|
|
|
| |
| α-globin gene cluster (p,o,b); |
|
|
|
|
|
|
|
| |||
| α-globin gene cluster (p,o,b); |
|
|
|
|
|
|
| ||||
|
| |||||||||||
|
|
|
|
|
|
|
|
| ||||
|
| |||||||||||
| locus4 (%) | 6:135418916 | rs7776054 | G/A | Genotyped | 0.210 | 0.1762 (0.020) | 3.71×10−19 | + | + | ||
|
| |||||||||||
|
|
|
|
|
|
|
| |||||
|
| |||||||||||
|
|
|
|
|
|
|
| |||||
|
| |||||||||||
| HbF (g/dl) | |||||||||||
| locus1 | 2:60720951 | rs4671393 | A/G | 1.00 | 0.136 | 0.578 (0.023) | 2.60×10−130 | + | |||
|
|
|
|
|
|
| − | |||||
|
| |||||||||||
| locus2 | 6:135419018 | rs9399137 | C/T | Genotyped | 0.205 | 0.4202 (0.020) | 1.09×10−93 | + | + | ||
|
|
|
|
|
|
|
| |||||
|
| |||||||||||
| β-globin gene cluster (p,o,b); |
|
|
|
|
|
|
| ||||
| β-globin gene cluster (p,o,b); |
|
|
|
|
|
|
|
| − | ||
|
| |||||||||||
|
|
|
|
|
|
|
| |||||
= association results locally corrected for the −α 3.7 deletion type I (NG_000006.1:g.34164_37967del3804) (see Supplementary Note).
= first time associated to the trait and in a novel locus.
= first time associated to the trait in a previously reported locus.
= signal refining a previously reported signal.
= result not found using the 1000 Genomes reference panel.
cond. = obtained by conditional analysis on variants reported on the upper rows for the considered locus.
Figure 2Diagram of genome-wide associated loci
Representation of genome-wide significant findings on hemoglobin levels in relation to their contribution to the phenotypic variation (variance explained, panel a) or to their individual impact (effect size, panel b). At each step, the length of the black bar represents the magnitude of variance explained (panel a) or effect size (panel b) for each trait, locus, gene and variant. The bars are connected by colored bands to their sub-components (loci for each trait, genes for each locus, variants for each gene). Three colors (yellow, green and blue) represent the 3 hemoglobin forms (HbA1, HbA2 and HbF respectively), and for loci or genes affecting more than one hemoglobin: gray combines HbA1 and HbA2, cyan combines HbA2 and HbF, and light gray represents effects common to all 3 hemoglobin forms. Each panel is drawn to show loci in order of their importance, i.e. from the largest to smallest amount of explained phenotypic variance (panel a) or effect size (panel b). The variance explained by each locus was calculated fitting a regression model including all variants at that locus, while the effect size for a locus is the sum of effect sizes of all variants in that locus (Supplementary Table 3 reports effect sizes for such joint models). For variants associated with more than one trait the maximum value is used. Markers are reported as chromosome:position when an rs ID was not available; and when an intergenic region is involved instead of a single gene, we show nearby genes within brackets.
Replication of novel loci
The table describes association in the TwinsUK cohort (N = 4,131 individuals). For each SNP, we indicated the associated hemoglobin tested, the number of samples analysed, the imputation accuracy according to the IMPUTE-INFO metric, the effect allele tested for association (EA) and the other allele at the SNP (OA), the SNP effect allele frequency (EAF) and the regression coefficients. The last column explains the reason for the SNPs not being tested.
| Traits (units) and loci # from | SNP | Candidate genes | INFO score | Alleles (EA/OA) | EAF | Effect (StdErr) | p-value | Notes |
|---|---|---|---|---|---|---|---|---|
| HbA1 (g/dl) | ||||||||
| locus3 |
|
| - | - | - | - | - | Not imputable because absent in 1000 Genomes; at the moment, Sardinian specific. |
|
| ||||||||
| HbA2 (%) | ||||||||
| locus3 |
|
| 0.843 | G/T | 0.011 | 0.442 (0.118) | 1.73×10−04 | . |
| locus5 |
|
| 0.994 | C/T | 0.185 | 0.132 (0.029) | 6.98×10−06 | |
| locus6 |
|
| - | - | - | - | - | Not imputable because absent in 1000 Genomes; detected in the NHLBI GO Exome Sequencing Project (ESP). |
|
| ||||||||
| HbF (%) | ||||||||
| locus4 |
|
| 0.294 | T/C | 0.000 | - | - | Imputed as monomorphic in TwinsUK cohort. |