Literature DB >> 19862010

Multiple loci influence erythrocyte phenotypes in the CHARGE Consortium.

Santhi K Ganesh1, Neil A Zakai, Frank J A van Rooij, Nicole Soranzo, Albert V Smith, Michael A Nalls, Ming-Huei Chen, Anna Kottgen, Nicole L Glazer, Abbas Dehghan, Brigitte Kuhnel, Thor Aspelund, Qiong Yang, Toshiko Tanaka, Andrew Jaffe, Joshua C M Bis, Germaine C Verwoert, Alexander Teumer, Caroline S Fox, Jack M Guralnik, Georg B Ehret, Kenneth Rice, Janine F Felix, Augusto Rendon, Gudny Eiriksdottir, Daniel Levy, Kushang V Patel, Eric Boerwinkle, Jerome I Rotter, Albert Hofman, Jennifer G Sambrook, Dena G Hernandez, Gang Zheng, Stefania Bandinelli, Andrew B Singleton, Josef Coresh, Thomas Lumley, André G Uitterlinden, Janine M Vangils, Lenore J Launer, L Adrienne Cupples, Ben A Oostra, Jaap-Jan Zwaginga, Willem H Ouwehand, Swee-Lay Thein, Christa Meisinger, Panos Deloukas, Matthias Nauck, Tim D Spector, Christian Gieger, Vilmundur Gudnason, Cornelia M van Duijn, Bruce M Psaty, Luigi Ferrucci, Aravinda Chakravarti, Andreas Greinacher, Christopher J O'Donnell, Jacqueline C M Witteman, Susan Furth, Mary Cushman, Tamara B Harris, Jing-Ping Lin.   

Abstract

Measurements of erythrocytes within the blood are important clinical traits and can indicate various hematological disorders. We report here genome-wide association studies (GWAS) for six erythrocyte traits, including hemoglobin concentration (Hb), hematocrit (Hct), mean corpuscular volume (MCV), mean corpuscular hemoglobin (MCH), mean corpuscular hemoglobin concentration (MCHC) and red blood cell count (RBC). We performed an initial GWAS in cohorts of the CHARGE Consortium totaling 24,167 individuals of European ancestry and replication in additional independent cohorts of the HaemGen Consortium totaling 9,456 individuals. We identified 23 loci significantly associated with these traits in a meta-analysis of the discovery and replication cohorts (combined P values ranging from 5 x 10(-8) to 7 x 10(-86)). Our findings include loci previously associated with these traits (HBS1L-MYB, HFE, TMPRSS6, TFR2, SPTA1) as well as new associations (EPO, TFRC, SH2B3 and 15 other loci). This study has identified new determinants of erythrocyte traits, offering insight into common variants underlying variation in erythrocyte measures.

Entities:  

Mesh:

Year:  2009        PMID: 19862010      PMCID: PMC2778265          DOI: 10.1038/ng.466

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


Introduction

Red blood cell disorders such as anemia and erythrocytosis are broadly associated with multiple comorbid conditions including hypertension and other cardiovascular diseases, yet the genetic determinants of erythrocyte traits in the general population are poorly defined. Erythrocytes, which comprise approximately 40% - 50% of blood volume, are a key component for the transport of oxygen and carbon dioxide for cellular respiration. In clinical practice, measures of erythrocyte quantity, size and composition are routinely tested to diagnose and monitor hematologic diseases as well as the overall health of patients. Variation in erythrocyte measures even within normal ranges are related to other non-hematologic diseases and mortality1-3. Erythrocyte production and quality are under various environmental and genetic influences. While environmental exposures, dietary intake of vitamins and iron, and the anemia of chronic disease contribute substantially to abnormalities of erythrocyte measures, the heritability of erythrocyte traits ranges from 40% - 90%4-6. Disorders of hemoglobin production and hemoglobinopathies are some of the most common genetic diseases in the world, owing to natural selection. Some known low-frequency Mendelian variants also lead to inter-individual variability in erythrocyte traits in the general population7, 8. Candidate gene studies have identified a few non-hemoglobin loci, including EPOR and HBS1L, related to variation in erythrocyte traits8-10. Early genome-wide association and linkage studies of erythrocyte measures, which have identified a few associations, such as at chromosome 6q23, lacked statistical power for association and genetic resolution for testing competing hypotheses6, 11-13. To investigate genetic determinants of erythrocyte traits in the general population, we carried out genome-wide association studies and meta-analysis within multiple community-based cohorts comprising the CHARGE consortium14, followed by replication in independent samples. We identified 23 genetic loci associated with these erythrocyte traits. We further extend these findings to investigate possible links between these traits and vascular diseases, reporting associations of a few of the 23 loci identified with blood pressure and hypertension.

Results

CHARGE Consortium study samples

The total sample size for the individual cohort genome-wide association analysis and the CHARGE meta-analysis was 24,167 (the Age, Gene/Environment Susceptibility Reykjavik Study (AGES), N=3,205; the Atherosclerosis Risk in Communities Study (ARIC), N=7,803; the Cardiovascular Health Study (CHS), N=3,256; the Framingham Heart Study (FHS), N=3,359; and the Rotterdam Study (RS), N=5,523). We also included the Invecchiare in Chianti Study (InCHIANTI, N=1,021), an Italian cohort study, in these analyses. Characteristics of the study participants, including age, sex and trait summaries, are presented in Table 1.
Table 1

CHARGE cohort description

AGESARICCHSFHSRSInCHIANTI
Number of individuals eligible for GWAS321981273275338155231206
Percent women585361546056
Mean age (yrs)51 (6)54 (6)72 (5)38 (9)68 (8)68 (16)
Hb (g/dL)13.40(1.45)14.80(1.02)14.11(1.23)14.46(1.37)14.12(1.28)13.77(1.35)
Hct (% )40.35(3.53)43.60(2.86)42.14(3.54)43.00(3.9)41.36( 3.35)40.63(3.49)
MCH (picogram)30.91(1.69)NANA30.64(1.85)30.16(1.81)3.05(1.99)
MCHC (%)33.57(0.70)NANA33.67(0.91)34.17(1.15)33.84(1.05)
MCV (femtoliter)92.08(4.49)90.7(4.22)NA90.7(5.0)88.29(4.30)90.22(4.84)
RBC (1M cell/cmm)4.39(0.41)NANA4.73(0.46)4.69 (0.44)4.51(0.43)
Years of baseline examinations1968-19911987-19891989-901971-19751990-19931998
Years of DNA collection2002-20061987-19981989-19901996-19991990-19931998-2001

Sample sizes and summary statistics of covariates and erythrocyte traits measured in each cohort in CHARGE. Erythrocyte traits are abbreviated as: hemoglobin concentration (Hgb), hematocrit (Hct), mean corpuscular hemoglobin (MCH), mean corpuscular hemoglobin concentration (MCHC), mean corpuscular volume (MCV), erythrocyte count (RBC). The sample sizes presented are the range of numbers of individuals with both genotype and erythrocyte measures, after the exclusion of individuals with values beyond three standard deviations of the population mean for each erythrocyte trait.

Values for age and each erythrocyte trait are presented as mean(sd).

Meta-analysis of genome-wide association studies for six erythrocyte traits in the CHARGE Consortium

We studied six erythrocyte traits: hemoglobin concentration (Hgb); hematocrit (Hct); mean corpuscular volume (MCV); mean corpuscular hemoglobin (MCH); mean corpuscular hemoglobin concentration (MCHC); and red blood cell count (RBC), as defined in Supplementary Table 1. When cohort results were combined, 831 SNP associations at 23 independent loci (r2 < 0.3 between loci) across the six traits reached the genome-wide (GW) significance threshold of P < 5×10-8. The -log10(P value) genome-wide association plots for the meta-analysis of each of the 6 traits are shown in Figure 1. Corresponding QQ-plots are shown in Supplementary Figure 1a and the genomic control lambda (λGC) values in Supplementary Table 2. The genomic control inflation factor post-meta-analysis, which was not corrected at the meta-analysis level, showed no systematic inflation (Hgb λGC = 1.066; Hct λGC = 1.045; MCH λGC = 1.014; MCHC λGC = 0.995; MCV λGC = 1.029; and RBC λGC = 1.029; Supplementary Table 2). The meta-analysis results for all traits are summarized in Table 2, which is organized by the 23 independent loci and includes gene annotation information for each locus. The table also lists for each trait the number of SNPs exceeding the GW significance level. Altogether, there were 45 trait-locus combinations with at least one GW significant SNP. The complete set of SNP associations identified by the CHARGE meta-analysis is provided in Supplementary Table 3. Replication and further analysis focused on the 45 SNPs that gave the smallest P values for each of the 45 trait-locus findings in CHARGE.
Figure 1

Overview of CHARGE meta-analysis results for six erythrocyte traits: hemoglobin concentration (Hgb), hematocrit (Hct), mean corpuscular hemoglobin (MCH), mean corpuscular hemoglobin concentration (MCHC), mean corpuscular volume (MCV), erythrocyte count (RBC). -log10 (P value) is plotted on the y-axis against genomic position of each SNP. Genomic loci with significant association (P < 5 × 10-8) are plotted in red, and loci with suggestive evidence are in blue (P < 4 × 10-7).

Table 2

CHARGE discoverymeta-analysis results, ordered by genomic locus

# SNPs per traitGene Annotation

Locus#ChrHgbHctMCHMCHCMCVRBCIn RefGeneRefGeneswithin60kbClosestRefGene
11q23.1000800SPTA1OR10Z1; SPTA1; OR10X1; OR6Y1
22p21360000PRKCEPRKCE
32p16.10000270BCL11ABCL11A
43q290040110TFRCTFRC
54q12000080nonenoneKIT
66p22.24937201330HFE; LRRC16; SCGN; SLC17A1;SLC17A3; SLC17A2; TRIM38;HIST1H4B; ZNF322AHFE; SCGN; LRRC16; SLC 17A1;SLC17A2; SLC17A3; SLC17A4;ZNF322A; ABT1; TRIM38
76p21.100510650PRICKLE4;FRS3;TFEB;MED20;USP49;CCND3;BYSLPRICKLE4; FRS3; PGC; TFEB;MED20; USP49; CCND3; BYSL;TAF8; PGC; FRS3
86q21000050noneCD164
96q23.301443138324HBS1L; MYBALDH8A1; HBS1L; MYB
106q24.10090130nonenoneCITED2
117p12.2000050IKZF1IKZF1
127q22.1020022TFR2; ZANGNB2; PCOLCE; FBXO24; TFR2;ACTL6B; HRBL; MOSPD3; LRCH4;ZAN; EPO; POP7; PERQ1; EPHB4
137q36.1130000PRKAG2PRKAG2
149p24.10090190RCL1RCL1; AK3
1510q11.21000040MARCH8MARCH8; ALOX5
1610q21.3130000HK1HK1
1712q24.121090000SH2B3; ATXN2; c12orf30; PTPN11SH2B3; ATXN2; BRAP; ACAD10;ERP29; TMEM116; C12orf30;TRAFD1; C12orf30; RPL6; PTPN11
1814q23.3000090MAX;FNTBMAX; RAB15; FNTB
1916p13.3001010ITFG3LUC7L; PDIA2; AXIN1; ITFG3;RGS11; ARHGDIG
2019p13.130060250MAN2B1; RTBDN; MAST1; DNASE2;GCDH; FARSAMAN2B1; MORG1; ZNF490; TNPO2;DHPS; FBXW9; ZNF791; C19orf56;JUNB; HOOK2; PRDX2; RNASEH2A;RTBDN; GADD45GIP1; KLF1;FARSA; RAD23A; CALR; MAST1;GCDH; DNASE2; DAND5; NFIX
2120q13.2300000nonenoneTSHZ2
2222q12.313290360C22orf33; TST; MPST ;TMPRSS6IL2RB; C22orf33; KCTD17; TST;TMPRSS6; MPST
2322q13.330000120TMEM112B; NCAPH2; SCO2; ECGF1TMEM112B; ADM2; MIOX; ECGF1;KLHDC7B; LOC440836; SAPS2;SCO2; NCAPH2; SBF1; MAPK8IP2;MIOX; CHKB; CPT1B

CHARGE meta-analysis results, showing the chromosomal position of each locus identified the number of SNPs identified within each and for each erythrocyte traitwith meta-analysis P value < 5×10-8. Annotation for SNPs within genes (InRefGene), within +/- 60kb of annotated RefGenes (RefGenewithin60kb), or in cases where no annotated gene was identified within 60kb, the nearest gene is reported (ClosestRefGene).

Independent replication

Replication of the 45 SNPs was conducted using a meta-analysis of association data in 9,456 independent European-ancestry individuals from five population-based cohorts in the HaemGen Consortium (Supplementary Note). A joint analysis of the HaemGen and CHARGE data showed a decrease in P values for all but two SNPs selected for replication. For one of the two SNPs (rs1800562) that did not show an improvement in P value when associated with Hct, the association to the Hgb trait was significant after Bonferroni correction, and for the second SNP (rs4466998), the association to MCV in the joint analysis of CHARGE and HaemGen data remained genome wide significant (P = 4.91 × 10-8). Significant independent replication for at least one trait was observed at 13 of 23 loci, using a Bonferroni-corrected significance threshold of P < 0.0011, or 0.05/45. Taking the joint meta-analysis results in sum, these data provide supportive evidence that the 23 loci from the discovery meta-analysis are true positives. Table 3 provides the full replication results, including beta coefficients, standard errors, and P values for the primary CHARGE findings, the HaemGen replication, and a combined meta-analysis of the two consortia for the 45 CHARGE trait-locus SNPs.
Table 3

CHARGE meta-analysis results, ordered by locus and trait, and HaemGen replication analysis

CHARGEHaemGenCHARGE + HaemGen

Locus#TraitSNPIDChrPhysPosmin_allMAFGene% VarBetas.e.PBetas.e.PBetas.e.P
2Hctrs10168349246208555C0.341667PRKCE0.16%0.2010.02831.176E-120.15240.04710.0012120.18750.02383.748E-15
6Hctrs1800562626201120A0.041667HFE0.09%0.37470.06087.204E-100.11670.10040.2450.30730.05132.035E-09
9Hctrs94837886135477194C0.177966HBS1L/MYB0.13%0.21720.03283.551E-110.21470.0527 4.55E-05 0.21660.02742.811E-15
12Hctrs73858047100073906C0.377358TFR 20.17%-0.15920.02862.745E-08-0.12690.04780.00799-0.1510.02424.45E-10
13Hctrs102240027151045974G0.258333PRKAG20.22%0.16910.02991.492E-080.27270.0493 3.08E-08 0.19630.02526.045E-15
16Hctrs169262461070763398T0.110169HK10.13%0.3370.05134.986E-110.30940.0960.001270.33150.04459.636E-14
17Hctrs1106598712110556807G0.341667SH 2B3/ATXN 20.15%-0.18090.02883.343E-10-0.14380.04650.001983-0.1710.02411.363E-12
22Hctrs24134502235800170T0.381818TMPRSS60.10%-0.1620.02796.333E-09-0.20820.0467 8.13E-06 -0.17360.02361.846E-13
2Hgbrs10495928246206670G0.341667PRKCE0.10%0.0110.0115.926E-100.05370.0152 0.000414 0.06310.00887.052E-13
6Hgbrs1800562626201120A0.041667HFE0.17%0.02240.02243.592E-160.11670.0322 0.000295 0.16170.01825.737E-19
13Hgbrs102240027151045974G0.258333PRKAG20.14%0.0110.0114.605E-080.09640.0164 4.19E-09 0.07140.00913.025E-15
16Hgbrs169262461070763398T0.110169HK10.21%0.01920.01921.036E-090.08630.03290.0087210.10990.01642.116E-11
17Hgbrs1106598712111076069A0.333333TRAFD10.16%0.01070.01071.345E-100.04270.01510.0045910.05850.00861.159E-11
21Hgbrs60135092050751758A0.183673TSHZ20.15%-0.06990.01161.963E-09-0.04790.02060.01999-0.06460.011.054E-10
22Hgbrs8557912235792882A0.391667TMPRSS60.20%-0.09620.0112.044E-18-0.08450.0157 7.97E-08 -0.09230.00893.25E-25
4MCHrs119150823197293536A0.425TFRC0.24%0.00410.00074.888E-090.00350.0009 5.66E-05 0.00380.00057.729E-13
6MCHrs1408272625950930G0.033898SLC17A30.41%-0.01340.00151.369E-18-0.01830.0019 2.37E-22 -0.01530.00123.868E-39
7MCHrs9349205642033137A0.196429CCND3/BYSL0.29%-0.00660.00081.785E-16-0.00370.0009 2.11E-05 -0.00530.00068.198E-20
9MCHrs77760546135460609G0.220339HBS1L/MYB1.02%-0.00920.00081.976E-33-0.01070.0009 4.33E-36 -0.00990.00067.356E-69
10MCHrs6287516139880112C0.491667CITED20.34%-0.00490.00073.84E-13-0.00340.0008 7.24E-06 -0.00430.00051.262E-17
14MCHrs1075865894846877A0.186441RCL10.18%-0.00480.00081.634E-08-0.00480.0009 5.14E-07 -0.00480.00062.166E-14
19MCHrs112279416249156A0.224138ITFG 30.28%0.00540.0012.992E-080.00360.00120.002990.00470.00072.675E-10
20MCHrs110858241912862547G0.366667GCDH0.20%-0.00410.00078.105E-09-0.0030.0009 0.000532 -0.00370.00051.415E-11
22MCHrs24134502235800170T0.381818TMPRSS60.41%-0.0060.00078.818E-17-0.00680.0008 5.34E-18 -0.00640.00058.77E-34
1MCHCrs8577211156879172A0.316667SPTA10.33%-0.00220.00043.414E-09-0.00130.00040.00266-0.00180.00031.033E-10
9MCHCrs93731246135464902C0.220339HBS1L/MYB0.30%-0.00230.00046.486E-10-0.00180.00042.6E-05-0.00210.00037.003E-14
3MCVrs2540917260462263C0.433333BCL11A0.24%-0.00310.00052.127E-11-0.00220.0006 0.000192 -0.00280.00041.125E-14
4MCVrs98592603197284944C0.35TFRC0.23%0.0030.00053.246E-100.0030.0008 0.000166 0.0030.00048.499E-14
5MCVrs172629455102519G0.116667KIT0.27%-0.00430.00071.36E-09-0.00510.001 2.22E-07 -0.00460.00069.816E-16
6MCVrs1800562626201120A0.041667HFE0.58%0.01150.00111.425E-270.01370.0015 5.02E-20 0.01220.00091.012E-46
7MCVrs9349205642033137A0.196429CCND3/BYSL0.58%-0.00550.00051.756E-24-0.00430.0008 8.48E-08 -0.00510.00041.121E-31
8MCVrs93740806109723113C0.375CD1640.22%-0.00260.00053.695E-08-0.00170.00060.003738-0.00230.00044.198E-10
9MCVrs48954416135468266G0.225HBS1L/MYB1.12%-0.0080.00051.004E-57-0.00830.0008 3.03E-27 -0.00810.00047.241E-86
10MCVrs6433816139881116A0.489362CITED20.50%-0.00390.00052.663E-18-0.00370.0007 1.63E-07 -0.00390.00044.665E-25
11MCVrs12718597750395922A0.275IKZF10.26%0.00320.00058.138E-120.00180.00070.01450.00280.00044.689E-13
12MCVrs77868777100051951G0.208333TFR 20.13%0.00320.00055.452E-090.00240.00080.0020810.0030.00042.543E-11
14MCVrs1075865894846877A0.186441RCL10.29%-0.00410.00064.354E-13-0.00450.0008 4.65E-08 -0.00430.00053.184E-20
15MCVrs112395501045344735G0.228814MARCH80.15%-0.00280.00051.873E-08-0.00220.00070.003496-0.00260.00041.346E-10
18MCVrs44669981464545293A0.478723FNTB0.17%0.00270.00058.925E-090.00080.00060.20610.0020.00044.907E-08
19MCVrs718902016244804T0.431034ITFG30.19%-0.00310.00051.081E-09-0.00240.0007 0.000871 -0.00290.00041.819E-12
20MCVrs72550451912793269A0.25RTBDN0.27%-0.00370.00061.233E-11-0.00180.00080.0266-0.00320.00042.173E-12
22MCVrs24134502235800170T0.381818TMPRSS60.65%-0.00540.00051.078E-30-0.00460.0007 7.62E-11 -0.00520.00042.772E-41
23MCVrs1317942249318618A0.166667ECGF10.22%-0.00440.00062.189E-13-0.00290.00090.001795-0.0040.00051.033E-15
9RBCrs94837886135461324G0.225HBS1L/MYB0.65%0.01410.00163.115E-190.01550.0013 2.19E-30 0.01410.0011.148E-47
12RBCrs20756717100183042A0.225EPO0.20%0.00860.00163.058E-080.00470.00160.0033830.00680.00111.123E-09

Replication test results for the lead SNP per locus and per erythrocyte trait (45 SNPs). Results are organized by trait, the by locus, as indicated in Table 1. Results from a combined CHARGE and HaemGen Consortium meta-analysis are presented. Minor allele frequency (MAF) is presented based on HapMap CEU. % Var indicates the percent of variance explained by the lead SNP in the corresponding trait-locus. P values in bold font meet a Bonferroni-corrected significance threshold for replication of P < 0.0011 (0.05/45). Units were Hgb g/dl, Hct %, MCH picogram, MCHC g/dL, MCV femtoliter, RBC 1 M cells/ccm.

For each lead SNP in the 23 independent loci, percent variance explained for each of the lead SNPs in the corresponding trait is provided in Table 3, averaging the percent variance explained for each SNP across the CHARGE cohorts. The combination of lead SNPs from each of the trait loci showed that average percent variance explained by the combination of lead SNPs, beyond the variance explained by age and gender, was 1.14% of Hgb variation (rs10495928, rs1800562, rs10224002, rs16926246, rs11065987, rs6013509, rs855791); 1.16% of Hct variation (rs10168349, rs1800562, rs7385804, rs10224002, rs16926246, rs11065987, rs9483788, rs2413450); 4.53% of MCH variation (rs11915082, rs1408272, rs9349205, rs7776054, rs628751, rs10758658, rs1122794, rs11085824, rs2413450); 0.63% of MCHC variation (rs857721, rs9373124); 5.98% of MCV variation (rs2540917, rs9859260, rs172629, rs1800562, rs9349205, rs9374080, rs4895441, rs643381, rs12718597, rs7786877, rs10758658, rs11239550, rs4466998, rs7189020, rs7255045, rs2413450, rs131794); and 0.85% of RBC variation (rs9483788, rs2075671).

Annotation of associated loci

For 20 of the identified loci, top associated SNPs were identified within a +/- 60 Kb window of a RefSeq gene (Table 2). For three loci, chromosomes 4q12, 6q24.1, 20q13.2, no genes were identified within this window, with the nearest genes approximately 116 Kb, 89 Mb, and 50 Mb away, respectively. Of the 23 loci, previously reported mutations or genetic associations for erythrocyte traits, markers of iron status or fetal hemoglobin levels, have been noted at six loci containing the genes HFE, TFR2, TMPRSS6, SPTA1, HBS1L-MYB, and BCL11A. Most of the remaining loci have not previously been reported to be associated with erythrocyte traits, though several genes are known to have important roles in erythrocyte biology or erythropoiesis. Genes identified near the associated loci, and their associated erythrocyte traits, are presented in Table 2 and Figure 2. Gene annotations, including gene information, known genetic mutations causing hematologic and non-hematologic diseases, and previously defined roles in hematologic and cardiovascular systems are listed in Supplementary Table 4. We confirmed the association of the known C282Y (rs1800562) and H63D (rs1799945) mutations in the HFE gene, mutations that are already known to underlie hereditary hemochromatosis, with Hgb, Hct, MCH and MCV.
Figure 2

Results of the CHARGE meta-analysis are organized into a Venn diagram, demonstrating overlap of loci meeting a genome-wide significance threshold of P < 5×10-8.

Gene expression in blood and endothelial cell lines

RNA expression levels for genes within a 1 Mb interval of each of the 23 loci we identified are presented in Supplementary Figure 2, for erythroid bodies (EBs), human umbilical vein endothelial cells (HUVECs), and seven other blood cell lines. For the top associated locus, chromosome 6q23.3, four genes were identified (ALDH8A1, HBS1L, MYB, AHI1) within the 1 Mb interval. A heatmap showing gene expression levels for each of these four genes is shown in Figure 3, demonstrating approximately 2-fold expression of MYB in EBs compared to other cell lines. Gene expression was detected for genes in multiple loci (Supplementary Figure 2). Since the identification of gene expression in biologically relevant tissues provides a rationale for prioritization of candidate genes for further genetic or functional investigations, we noted broad categories of expression patterns in EBs and HUVECs. The most highly expressed genes in EBs were in chromosomes 3q29 (TFRC), 6p22.2 (HIST1H4C, which is near HFE), 6p21.1 (CCND3), 10q21.3 (HK1), 12q24.12 (RPL6P27), 16p13.3 (HBZ, HBA1), and 22q12.3 (TST, RAC2). The most highly expressed genes in HUVECs were in chromosomes 6p22.2 (HIST14HC), 7q22.1 (SERPINE1), and 22q13.3 (MFNG).
Figure 3

Gene expression in blood and endothelial cells for genes in the chromosome 6q23.3 region. (a) SNPs in the locus are plotted against recombination rates as observed in HapMap CEU, using a window of +/-500kb around the lead SNP identified in this locus, which is plotted in blue. SNPs identified by the CHARGE meta-analyses are colored according to correlation with the lead SNP (r2 ≥ 0.8 red; 0.5 ≤ r2 < 0.8 orange; 0.2 ≤ r2 < 0.5 yellow; r2 < 0.2 white; no r2 value provided). The P value for the lead SNP in this region is provided. (b) A heatmap of gene expression levels in nine blood and endothelial cell lines is shown, including all genes, as annotated by ENSEMBL 54, within the +/- 500kb window of the locus (MK = megakaryocyte; EB = erythroid bodies; HUVEC = human umbilical vein endothelial cells; CD14 = monocytes; CD66b = granulocytes; CD19 = B lymphocytes; CD56 = NK cells; CD8 = Tc lymphocytes; CD4 = Th lymphocytes).

Blood pressure analysis of SNPs associated with erythrocyte traits

The results of association testing for the 45 lead SNPs from the CHARGE analysis of erythrocyte traits within the 23 loci are summarized in Supplementary Table 5a. We identified associations at a Bonferroni-corrected significance threshold P < 0.00135 (0.05/37, since 37 of the 45 SNPs are unique) in chromosomes 12q24.1 (SH2B3) and 7q36.1 (PRKAG2). The previously reported association of the chromosome 12q24.1 (SH2B3) locus with systolic blood pressure (SBP) and diastolic blood pressure (DBP) was the most significant association (rs1106598, SBP P = 1.2×10-6, HTN P = 0.0035; rs1763023, DBP P = 4.2×10-8). In the reported BP and HTN analysis, signals at this locus spanned 700kb from rs3184504 to rs1106618815, and association signals in Hgb and Hct spanned 987kb, from rs3184504 in SH2B3 to rs11066301 in PTPN11 and contained multiple genes15. Inspection of RNA expression data (Supplementary Figure 2) showed that in this region, SH2B3 and ATXN2 show high levels of gene expression in erythroid bodies and endothelial cells. Nominal associations (P < 0.05) were identified in the chromosomes 6p22.2 (HFE), 6q24.1, 7q22.1 (TFR2), and 20q12.3 (Supplementary Table 5a), and results of the evaluation of SNPs associated with BP and hypertension are presented in Supplementary Table 5b.

Discussion

In this meta-analysis of genome-wide association data from 24,167 European-ancestry individuals from six cohort studies in the CHARGE Consortium, we identified 23 loci associated with at least one of the six erythrocyte traits Hgb, Hct, MCH, MCHC, MCV, and RBC. We sought evidence for replication in an independent analysis of data from 9,456 European-ancestry individuals in the HaemGen Consortium. In the joint meta-analysis, merging CHARGE and HaemGen data, all 23 loci had P values less than P < 5×10-8, implying strong associations that merit further study. Among the 23 loci, six were previously known QTLs, and 17 are novel loci, some of which contain genes known to be involved with iron homeostasis, erythropoiesis, globin synthesis and erythrocyte membrane function. Finally, an investigation of possible links between these erythrocyte traits and blood pressure and hypertension confirmed overlap at the previously known SH2B3 locus, and identified additional suggestive associations, none of which met a genome-wide significance threshold. The six erythrocyte traits studied included some that are highly correlated, and as expected, we observed a high degree of concordance in the results across the six traits. Among the many genome-wide significant associations identified, the patterns of association are likely to reflect the correlations among these related traits. Interestingly, MCHC, a ratio of Hgb and Hct, two directly measured traits, is uniquely associated with chromosome 1q23.1 (SPTA1), a gene with several rare mutations known to cause deformation of erythrocytes16, 17. Reviewing the results in total, we observed there were generally three patterns of significant associations among the six traits (Figure 2). Results were generally similar for: (1) Hgb and Hct, which are mainly quantitative measures of hemoglobin in the blood; (2) MCH and MCV, representing erythrocyte size and quantity of hemoglobin per erythrocyte; and (3) MCHC, the ratio of Hgb to Hct, which appears somewhat distinct from the other traits. Across the six traits studied, the strongest signal was found in the HBS1L/MYB locus on chromosome 6q23, which was observed for five of the six individual traits (Hct, MCH, MCHC, MCV, RBC) at the genome-wide significance level. This locus also provided a modest but non-significant result for Hgb (rs4895441, P = 4.8×10-4). In addition, the Hgb/Hct and MCH/MCV patterns overlapped for associations in the chromosome 6p22.2 (HFE), 22q12.3 (TMPRSS6) and 7q22.1 (TFR2/EPO) loci. The RBC results represent a subset of the overlap between the Hgb/Hct and MCH/MCV patterns, with associations observed in the 7q22.1 (TFR2/EPO) and 6q23 (HBS1L/MYB) loci. Across the erythrocyte traits, overlap occurs where known patterns of traits characterize various clinically observed hematologic diseases, providing a possible context in which to interpret the overlap of associations. We annotated and categorized the findings of our analyses by association with known genetic disorders, biologic function, or altered function of the hematopoietic system, to assist with interpretation of the findings (Supplementary Table 4). We here consider these multiple findings in light of their potential role in several processes critical to erythrocyte biology, including iron homeostasis, erythrocyte membrane function, erythropoiesis and globin synthesis.

Iron homeostasis

We identified genome-wide significant association of SNPs within the HFE gene with Hgb, Hct, MCH, and MCV. C282Y mutation in HFE is the principal cause of hereditary hemochromatosis, a common autosomal recessive iron overload disease in individuals of northern European descent18. This mutation was associated with increased MCV and Hgb concentrations in a study of individuals drawn from a hemochromatosis and iron overload screening study7, and this variant was the lead association result for both Hgb and Hct in our study. Heterozygotes for either allele do not manifest clinical iron overload but may display an increased iron uptake and “resistance” to anemia, and the C282Y mutation may increase risk of coronary heart disease by increasing iron stores and lipid oxidation19, 20. The HFE gene induces expression of the iron regulatory hormone hepcidin. Hepcidin has recently emerged as the likely link between the inflammatory response and the handling of iron for erythropoiesis by both downregulating the absorption of iron in the intestine and by inhibiting the release of iron from macrophages21-24. SNPs within the TMPRSS6 gene were associated with Hgb, Hct, MCH and MCV. TMPRSS6 was identified by linkage and association studies in five families and two sporadic cases with iron-refractory iron deficiency anemia, a rare Mendelian disease25. TMPRSS6 encodes a type II transmembrane serine protease produced by the liver that regulates the expression of hepcidin25. The transferrin receptor (encoded by TFRC(TFR1)) and transferrin receptor 2 (TFR2) are highly homologous type II trans-membrane proteins in the transferrin protein family. SNPs within TFRC were associated with MCH and MCV, and SNPs within TFR2 were associated with Hct and MCV. Reduced TFRC expression is associated with anemia26. Existing evidence indicates that TFR2 is also a modulator of hepcidin expression, and mutations in TFR2 cause hemochromatosis type 327.

Erythropoiesis and globin synthesis

Two loci, chromosome 2p16.1 (BCL11A) associated with MCV, and chromosome 6q23.3 (HBS1L-MYB) associated with all traits except for Hgb, are related to variation in fetal hemoglobin and hemoglobin beta levels28, 29. BCL11A is an oncogene related to B-cell malignancies30 and regulates fetal hemoglobin expression31. BCL11A is expressed in erythroid precursors, and we observed BCL11A expression in EBs (Supplementary Figure 2) making it a biologically plausible candidate gene for erythrocyte trait variation32. A healthy population study showed polymorphisms in HBS1L and MYB influences erythrocyte, platelet, and monocyte counts10. Although the role of HBS1L is unknown, MYB has been associated with proliferation, survival, and differentiation of hematopoietic progenitor cells33, 34. MYB is also associated with eosinophil counts in blood and atopic asthma35. There are strong associations between SNPs within this locus and multiple erythrocyte traits (lead SNP rs4895441, Hgb P = 4.8×10-4; Hct P = 9.7×10-10; MCH P = 7.8×10-32; MCHC P = 4.5×10-9; MCV P = 1.0×10-57; and RBC P = 2.2×10-15). These strong genetic effects may explain why several prior linkage analyses of erythrocyte traits have identified this chromosomal region6, 11, 13. SNPs within SH2B3 are associated with Hct and Hgb. Interestingly, the SNPs within this gene are associated with blood pressure, myocardial infarction, type 1 diabetes, and celiac disease15, 35-39. SH2B3 is expressed in hematopoietic precursor cells and increases hematopoietic progenitors of erythroid, megakaryocytic, and myeloid lineages35, 40. SH2B3 is also expressed in vascular endothelium, where it promotes inflammation and may thereby contribute to vascular disease. Expression in different cell lineages and tissues may underlie the diverse pleiotropic consequences of SH2B3 on hematopoietic traits, autoimmune diseases, and vascular diseases. In the same locus, the PTPN11 gene product interacts with the transcription factor SHP2, which has an essential role in blood development that has been demonstrated in a murine Shp2-/- model41, and PTPN11 mutations cause Noonan’s and LEOPARD syndromes and juvenile myelomonocytic leukemia42-44. Lastly, we identified associations for Hct, MCV and RBC near the EPO gene. Erythropoietin, a glycoprotein hormone that controls erythropoiesis, is the first human recombinant hematopoietic protein approved for human use and is now used widely for the treatment of anemia. EPO variants have previously been described in association with diabetic retinal and renal vascular complications45 but not with erythrocyte traits.

Erythrocyte membrane

SNPs within the SPTA1 gene were associated with MCHC. SPTA1 encodes erythroid spectrin, a protein in the erythrocyte membrane, and is essential in determining the shape and deformability of erythrocytes. Spectrin mutations have been previously associated with hemolytic anemia, elliptocytosis, spherocytosis, and propoikilocytosis, but not with variations in MCHC outside of disease states16, 17.

Gene expression

The gene expression data may be viewed as additional annotation of the 23 loci we identified, confirming which genes are expressed in cell types of interest. These data may be used to generate further specific hypotheses that can then be tested at a functional and molecular level. Multiple lines of evidence formed the basis for our rationale to study the relationship of the SNPs identified through the study of erythrocyte traits to blood pressure (BP) and hypertension. Prior studies have shown that Hgb and Hct levels are associated with increased risk for hypertension and a variety of other vascular diseases and mortality1-3, 46-49. From a rheologic perspective, blood viscosity depends largely on Hgb or Hct levels and is a determinant of blood pressure50-53. There is an inverse relationship between viscosity and vascular blood flow54, and elevated Hct thereby hampers organ perfusion. Given these findings, we are intrigued by the overlap between the association results for Hgb and Hct from our study and the recently reported associations for BP and hypertension15, 36. We observed overlap of associations in the chromosome 12q24.12 region across a 987 kb linkage disequilibrium block, containing SH2B3, ATXN2, BRAP, C12orf03, TRAFD1, ACAD10, TMEM116 and PTPN11. We also identified associations within the 7q36.1 region containing PRKAG2, which does not have specifically known hematologic or vascular roles, but mutations in this gene cause cardiomyopathy and cardiac conduction system disorders55, 56. Neither causality nor independence of these associations is necessarily supported by these findings. However, these associations suggest that common genetic bases may underlie some of the correlation seen between erythrocyte, BP and hypertension traits. Further confirmation in large independent cohorts may provide stronger evidence for the strength and consistency of the associations with hypertension. Limitations of this study include restriction of the discovery and replication analyses to individual of European ancestry. Several spectrin and globin mutations have been identified in African American kindreds and the prevalence of hemoglobinopathies of various types is generally higher among individuals of African ancestry, highlighting the need for further investigation of these findings in individuals of non-European-ancestry57. As with any meta-analysis of genome-wide association results across different cohorts, population structure and other sources of heterogeneity may have caused false positives or false negatives. To assess population structure, we examined the per-cohort λGC, demonstrating that these values were consistently below 1.08, and we applied genomic control to the cohort-level test statistics. The final meta-analyses also showed no systematic inflation of the distribution of the final association statistics. In the replication analysis, for those loci that did not meet a conservative replication test, power may have been limited, and many are likely to improve with additional study. Finally, the interpretation of multiple analyses of correlated traits requires caution, particularly in attributing causality or independence of effects. We take the findings from our analyses of the six erythrocyte traits to indicate a set of loci that are of interest with regards to erythrocyte production, homeostasis and function. Specific differences in association patterns may highlight different pathways, and to understand this more deeply, further studies are needed. In summary, we have identified and validated common variants at several known and novel loci that influence the levels of six clinically relevant red blood cell measures in population-based cohorts. These QTLs have implications for understanding a variety of hematologic diseases as well as correlates of erythrocyte traits, such as BP and hypertension. Further studies are warranted to define these variants in the extremes of the distributions of these traits and ethnically diverse populations and to understand the functional impact of variants at the implicated candidate genes.
  64 in total

1.  Genetic and environmental causes of variation in basal levels of blood cells.

Authors:  D M Evans; I H Frazer; N G Martin
Journal:  Twin Res       Date:  1999-12

2.  Genomic control for association studies.

Authors:  B Devlin; K Roeder
Journal:  Biometrics       Date:  1999-12       Impact factor: 2.571

3.  Etiology of differences in hematocrit between males and females: sequence-based polymorphisms in erythropoietin and its receptor.

Authors:  S M Zeng; J Yankowitz; J A Widness; R G Strauss
Journal:  J Gend Specif Med       Date:  2001

4.  Abnormal mesoderm patterning in mouse embryos mutant for the SH2 tyrosine phosphatase Shp-2.

Authors:  T M Saxton; M Henkemeyer; S Gasca; R Shen; D J Rossi; F Shalaby; G S Feng; T Pawson
Journal:  EMBO J       Date:  1997-05-01       Impact factor: 11.598

5.  Identification of a gene responsible for familial Wolff-Parkinson-White syndrome.

Authors:  M H Gollob; M S Green; A S Tang; T Gollob; A Karibe; A S Ali Hassan ; F Ahmad; R Lozado; G Shah; L Fananapazir; L L Bachinski; R Roberts; A S Hassan
Journal:  N Engl J Med       Date:  2001-06-14       Impact factor: 91.245

6.  Novel PRKAG2 mutation responsible for the genetic syndrome of ventricular preexcitation and conduction system disease with childhood onset and absence of cardiac hypertrophy.

Authors:  M H Gollob; J J Seger; T N Gollob; T Tapscott; O Gonzales; L Bachinski; R Roberts
Journal:  Circulation       Date:  2001-12-18       Impact factor: 29.690

7.  Anemia as a risk factor for cardiovascular disease in The Atherosclerosis Risk in Communities (ARIC) study.

Authors:  Mark J Sarnak; Hocine Tighiouart; Guruprasad Manjunath; Bonnie MacLeod; John Griffith; Deeb Salem; Andrew S Levey
Journal:  J Am Coll Cardiol       Date:  2002-07-03       Impact factor: 24.094

8.  Stat5 regulates cellular iron uptake of erythroid cells via IRP-2 and TfR-1.

Authors:  Marc A Kerenyi; Florian Grebien; Helmuth Gehart; Manfred Schifrer; Matthias Artaker; Boris Kovacic; Hartmut Beug; Richard Moriggl; Ernst W Müllner
Journal:  Blood       Date:  2008-08-11       Impact factor: 22.113

9.  Hemochromatosis due to mutations in transferrin receptor 2.

Authors:  Antonella Roetto; Filomena Daraio; Federica Alberti; Paolo Porporato; Angelita Calì; Marco De Gobbi; Clara Camaschella
Journal:  Blood Cells Mol Dis       Date:  2002 Nov-Dec       Impact factor: 3.039

10.  Cytokine signaling and hematopoietic homeostasis are disrupted in Lnk-deficient mice.

Authors:  Laura Velazquez; Alec M Cheng; Heather E Fleming; Caren Furlonger; Shirly Vesely; Alan Bernstein; Christopher J Paige; Tony Pawson
Journal:  J Exp Med       Date:  2002-06-17       Impact factor: 14.307

View more
  195 in total

1.  Natural selection on EPAS1 (HIF2alpha) associated with low hemoglobin concentration in Tibetan highlanders.

Authors:  Cynthia M Beall; Gianpiero L Cavalleri; Libin Deng; Robert C Elston; Yang Gao; Jo Knight; Chaohua Li; Jiang Chuan Li; Yu Liang; Mark McCormack; Hugh E Montgomery; Hao Pan; Peter A Robbins; Kevin V Shianna; Siu Cheung Tam; Ngodrop Tsering; Krishna R Veeramah; Wei Wang; Puchung Wangdui; Michael E Weale; Yaomin Xu; Zhe Xu; Ling Yang; M Justin Zaman; Changqing Zeng; Li Zhang; Xianglong Zhang; Pingcuo Zhaxi; Yong Tang Zheng
Journal:  Proc Natl Acad Sci U S A       Date:  2010-06-07       Impact factor: 11.205

2.  Severe anemia in the Nan mutant mouse caused by sequence-selective disruption of erythroid Kruppel-like factor.

Authors:  Miroslawa Siatecka; Kenneth E Sahr; Sabra G Andersen; Mihaly Mezei; James J Bieker; Luanne L Peters
Journal:  Proc Natl Acad Sci U S A       Date:  2010-08-09       Impact factor: 11.205

3.  Genome-wide association studies and large-scale collaborations in epidemiology.

Authors:  Bruce M Psaty; Albert Hofman
Journal:  Eur J Epidemiol       Date:  2010-07-11       Impact factor: 8.082

4.  Identification of a common variant in the TFR2 gene implicated in the physiological regulation of serum iron levels.

Authors:  Irene Pichler; Cosetta Minelli; Serena Sanna; Toshiko Tanaka; Christine Schwienbacher; Silvia Naitza; Eleonora Porcu; Cristian Pattaro; Fabio Busonero; Alessandra Zanon; Andrea Maschio; Scott A Melville; Maria Grazia Piras; Dan L Longo; Jack Guralnik; Dena Hernandez; Stefania Bandinelli; Elmar Aigner; Anthony T Murphy; Victor Wroblewski; Fabio Marroni; Igor Theurl; Carsten Gnewuch; Eric Schadt; Manfred Mitterer; David Schlessinger; Luigi Ferrucci; Derrick R Witcher; Andrew A Hicks; Günter Weiss; Manuela Uda; Peter P Pramstaller
Journal:  Hum Mol Genet       Date:  2010-12-28       Impact factor: 6.150

Review 5.  Genome-wide significant associations for variants with minor allele frequency of 5% or less--an overview: A HuGE review.

Authors:  Orestis A Panagiotou; Evangelos Evangelou; John P A Ioannidis
Journal:  Am J Epidemiol       Date:  2010-09-28       Impact factor: 4.897

6.  The Rotterdam Study: 2016 objectives and design update.

Authors:  Albert Hofman; Guy G O Brusselle; Sarwa Darwish Murad; Cornelia M van Duijn; Oscar H Franco; André Goedegebure; M Arfan Ikram; Caroline C W Klaver; Tamar E C Nijsten; Robin P Peeters; Bruno H Ch Stricker; Henning W Tiemeier; André G Uitterlinden; Meike W Vernooij
Journal:  Eur J Epidemiol       Date:  2015-09-19       Impact factor: 8.082

7.  Toenail iron, genetic determinants of iron status, and the risk of glioma.

Authors:  Gabriella M Anic; Melissa H Madden; Reid C Thompson; L Burton Nabors; Jeffrey J Olson; Renato V Larocca; James E Browning; John D Brockman; Peter A Forsyth; Kathleen M Egan
Journal:  Cancer Causes Control       Date:  2013-08-31       Impact factor: 2.506

8.  TMEM14C is required for erythroid mitochondrial heme metabolism.

Authors:  Yvette Y Yien; Raymond F Robledo; Iman J Schultz; Naoko Takahashi-Makise; Babette Gwynn; Daniel E Bauer; Abhishek Dass; Gloria Yi; Liangtao Li; Gordon J Hildick-Smith; Jeffrey D Cooney; Eric L Pierce; Kyla Mohler; Tamara A Dailey; Non Miyata; Paul D Kingsley; Caterina Garone; Shilpa M Hattangadi; Hui Huang; Wen Chen; Ellen M Keenan; Dhvanit I Shah; Thorsten M Schlaeger; Salvatore DiMauro; Stuart H Orkin; Alan B Cantor; James Palis; Carla M Koehler; Harvey F Lodish; Jerry Kaplan; Diane M Ward; Harry A Dailey; John D Phillips; Luanne L Peters; Barry H Paw
Journal:  J Clin Invest       Date:  2014-08-26       Impact factor: 14.808

9.  ITPA gene variants protect against anaemia in patients treated for chronic hepatitis C.

Authors:  Jacques Fellay; Alexander J Thompson; Dongliang Ge; Curtis E Gumbs; Thomas J Urban; Kevin V Shianna; Latasha D Little; Ping Qiu; Arthur H Bertelsen; Mark Watson; Amelia Warner; Andrew J Muir; Clifford Brass; Janice Albrecht; Mark Sulkowski; John G McHutchison; David B Goldstein
Journal:  Nature       Date:  2010-02-21       Impact factor: 49.962

Review 10.  12q24 locus association with type 1 diabetes: SH2B3 or ATXN2?

Authors:  Georg Auburger; Suzana Gispert; Suna Lahut; Ozgür Omür; Ewa Damrath; Melanie Heck; Nazlı Başak
Journal:  World J Diabetes       Date:  2014-06-15
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.