| Literature DB >> 23431002 |
Ana Moleirinho1, Susana Seixas, Alexandra M Lopes, Celeste Bento, Maria J Prata, António Amorim.
Abstract
Human hemoglobins, the oxygen carriers in the blood, are composed by two α-like and two β-like globin monomers. The β-globin gene cluster located at 11p15.5 comprises one pseudogene and five genes whose expression undergoes two critical switches: the embryonic-to-fetal and fetal-to-adult transition. HBD encodes the δ-globin chain of the minor adult hemoglobin (HbA2), which is assumed to be physiologically irrelevant. Paradoxically, reduced diversity levels have been reported for this gene. In this study, we sought a detailed portrait of the genetic variation within the β-globin cluster in a large human population panel from different geographic backgrounds. We resequenced the coding and noncoding regions of the two adult β-globin genes (HBD and HBB) in European and African populations, and analyzed the data from the β-globin cluster (HBE, HBG2, HBG1, HBBP1, HBD, and HBB) in 1,092 individuals representing 14 populations sequenced as part of the 1000 Genomes Project. Additionally, we assessed the diversity levels in nonhuman primates using chimpanzee sequence data provided by the PanMap Project. Comprehensive analyses, based on classic neutrality tests, empirical and haplotype-based studies, revealed that HBD and its neighbor pseudogene HBBP1 have mainly evolved under purifying selection, suggesting that their roles are essential and nonredundant. Moreover, in the light of recent studies on the chromatin conformation of the β-globin cluster, we present evidence sustaining that the strong functional constraints underlying the decreased contemporary diversity at these two regions were not driven by protein function but instead are likely due to a regulatory role in ontogenic switches of gene expression.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23431002 PMCID: PMC3622298 DOI: 10.1093/gbe/evt029
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
F(A) Schematic representation of the β-globin cluster located at chromosome 11p15.5. Top: The relative position of the β-globin genes and the LCR in the cluster. Bottom: δ-globin (HBD) and β-globin (HBB) gene organization (exons are represented by grey squares). The arrows in the lower diagram indicate the extent of the segments surveyed in the resequencing study of the European and African populations. Small black arrows indicate the five DNase I hypersensitive sites (HS) encompassed by the LCR located from approximately 6 to 18 kb upstream of ε- globin (HBE). (B) HBD and HBB haplotypes as inferred by PHASE v.2.02. The ancestral state at each site was inferred from ortholog nonhuman primate sequences. From the 25 sites, only 2 lacked a previously associated reference identification code in public databases (dbSNP and Exome Sequencing Project release ESP5400) and were unique to the Portuguese population. Coding variants are labeled. These include the following: one synonymous amino acid replacement in HBB (H3H) for the PT population; three synonymous replacements in HBB (H3H, L69L, and V134V) for the CEU population; one synonymous replacement in HBD (H98H) and one synonymous in HBB (H3H) and two nonsynonymous replacements (E7K–HbC allele, E7V–HbS allele) in HBB for the YRI population. SNP identifiers as in dbSNP and their chromosomal position based on GRCh37 version are indicated in each column.
Summary Statistics of Population Variation for the Two Adult β-globin Genes
| Population | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| π | θw | TD | π | θw | TD | ||||||
| PT | 50 | 1,879 | 1 | 0.21 | 1.18 | −1.10 | 1,843 | 7 | 7.82 | 8.49 | −0.21 |
| CEU | 46 | 1,879 | 1 | 0.23 | 1.21 | −1.11 | 1,843 | 8 | 8.23 | 9.89 | −0.46 |
| YRI | 46 | 1,879 | 8 | 4.55 | 9.66 | −1.53 | 1,843 | 9 | 8.11 | 11.12 | −0.77 |
aNumber of chromosomes.
bTotal number of sites surveyed.
cPolymorphic sites.
dNucleotide diversity (×104) (Nei and Li 1979).
eWatterson θ per site (×104) (Watterson 1975).
fTajima’s D statistic (Tajima 1989).
*P ≤ 0.001 according to the constant size model, to the best fit model from Schaffner et al. (2005) and to the best fit model from Gutenkunst et al. (2009).
**P ≤ 0.05 according to the constant size model and to the best fit model from Gutenkunst et al. (2009).
FHBD and HBB gene genealogies as estimated by Genetree. Time is scaled in millions of years (Myr). Solid circles represent nucleotide substitutions. The number below each branch of the trees represents the chromosomes observed for each haplotype, and in the lower diagram this information is split by population.
Summary Statistics of Population Variation for the β-globin Cluster Genes Using the 1,000 Genomes Project Data
| CEU | YRI | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| π | θw | TD | π | θw | TD | ||||||
| 1,843 | 170 | 7 | 8.28 | 6.65 | 0.53 | 176 | 11 | 8.49 | 10.39 | −0.44 | |
| 1,879 | 4 | 0.31 | 3.73 | −1.67 | 10 | 6.53 | 9.26 | −0.70 | |||
| 1,935 | 11 | 8.75 | 9.96 | −0.29 | 14 | 6.15 | 12.59 | −1.30 | |||
| 1,926 | 10 | 11.79 | 9.09 | 0.71 | 18 | 21.78 | 16.27 | 0.90 | |||
| 1,933 | 11 | 17.47 | 9.97 | 1.83 | 16 | 22.09 | 14.41 | 1.39 | |||
| 1,862 | 4 | 5.39 | 3.76 | 0.79 | 13 | 7.39 | 12.15 | −0.98 | |||
aTotal number of sites surveyed.
bNumber of chromosomes.
cPolymorphic sites.
dNucleotide diversity (×104) (Nei and Li 1979).
eWatterson θ per site (×104) (Watterson 1975).
fTajima’s D statistic (Tajima 1989).
FSliding window analysis of the genomic region encompassing the β-globin gene cluster. Data were obtained from 1000 Genomes Project, representing 1,092 individuals from 14 populations: three African (ASW, LWK, and YRI), five European (CEU, FIN, GBR, IBS, and TSI), three Asian (CHB, CHS, and JPT) and three American-admixed populations (CLM, MXL, and PUR). Both π (A) and Tajima’s D (B) were calculated in 2 kb windows with increments of 150 bp.
Evolutionary Rates Based on Jukes–Cantor Distance
| Exons | Introns | |||||
|---|---|---|---|---|---|---|
| Div | Mutation Rate | Div | Mutation Rate | |||
| Year | Generation | Year | Generation | |||
| 0.80 | 0.74 × 10−9 | 1.84 × 10−8 | 0.99 | 0.91 × 10−9 | 2.28 × 10−8 | |
| 0.26 | 0.24 × 10−9 | 0.60 × 10−8 | 1.75 | 1.62 × 10−9 | 4.06 × 10−8 | |
| 1.45 | 1.34 × 10−9 | 3.36 × 10−8 | 1.04 | 0.96 × 10−9 | 2.41 × 10−8 | |
aAverage nucleotide divergence between human and chimpanzee.
bMutation rate per site.