| Literature DB >> 22607059 |
Santiago Rodriguez1, Dylan M Williams, Philip A I Guthrie, Wendy L McArdle, George Davey Smith, David M Evans, Tom R Gaunt, Ian N M Day.
Abstract
Haptoglobin binds free haemoglobin that prevents oxidative damage produced by haemolysis. There is a copy number variant (CNV) in the haptoglobin gene (HP) consisting of two alleles, Hp1 (no duplication), and Hp2 (1.7kb duplication involving two exons). The spread of the Hp2 allele is believed to have taken place under selective pressures conferred by malaria resistance. However, molecular evidence is lacking and Hp did not emerge in genomewide SNPs surveys for evidence of selection. In Europe, there is geographical constancy of Hp2 frequency, indicative of absence of clinal pressures and that modern day European alleles represent a "snapshot" of their out-of-Africa migrations. In this work we test for signatures of natural selection acting on the Hp CNV in a sample from the UK population (Avon Longitudinal Study of Parents and Children, ALSPAC). We present here heterozygosity decay, pairwise F(ST) values observed between ALSPAC and 301 populations from all five populated continents, extended haplotype homozygosity analyses involving the CNV and 80 SNPs surrounding the CNV ~500kb in each direction, and linkage disequilibrium and pairwise haplotypic analyses involving 160 SNPs on chromosome 16q22.1. Taken together, our results represent the first molecular analysis of natural selection in the Hp CNV genetic region.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22607059 PMCID: PMC3963445 DOI: 10.1111/j.1469-1809.2012.00716.x
Source DB: PubMed Journal: Ann Hum Genet ISSN: 0003-4800 Impact factor: 1.670
Haptoglobin Genotype and Allele Frequencies in 400 ALSPAC Samples.
| Genotype Frequency | ||
|---|---|---|
| Hp1/1 | 55 | |
| Hp1/2 | 154 | |
| Hp2/2 | 156 | |
| Fails | 35 | |
| Number | Allele frequency | |
| Hp1 | 264 | 0.362 |
| Hp2 | 466 | 0.638 |
Figure 1The r2 values observed between the HP CNV (87) and neighbouring SNPs taken from Haploview.
Figure 2Polymorphism across ∼2 Mb centred on the HP CNV observed in HapMap samples from Europeans (CEU), Africans (YRI), Chinese (CHB) and Japanese (JPT). A decay in heterozygosity around the HP CNV was observed in all population samples. (A) Pattern of heterozygosity variation observed in CEU for a ∼2Mb interval centred on the HP CNV. (B) Pattern of heterozygosity variation observed in CEU, YRI, CHB and JPT for the same ∼2 Mb interval. (C) Heterozygosity decay in a 300Kb region centred on the HP CNV. The two vertical lines represent the interval where HP is located. The asterisk in Figure 1c represents the location of the HP CNV.
Descriptive Statistics of Pairwise FST Values Observed between ALSPAC and each of the 301 Populations Across Five Continents with Information available for the HP CNV (Carter and Worwood, 2007).
| Minimum | Maximum | Mean | Std. Deviation | ||
|---|---|---|---|---|---|
| ALSPAC vs. Africa | 41 | 0 | 0.3672 | 0.1046 | 0.0891 |
| ALSPAC vs. America | 42 | 0 | 0.3848 | 0.1032 | 0.1137 |
| ALSPAC vs. Asia | 99 | 0 | 0.1837 | 0.0291 | 0.0343 |
| ALSPAC vs. Europe | 91 | 0 | 0.0322 | 0.0032 | 0.0055 |
| ALSPAC vs. Oceania | 28 | 0.0025 | 0.3582 | 0.1443 | 0.0952 |
Descriptive statistics of pairwise FST values observed between populations within each continent for the 301 populations with information available for the HP CNV (Carter and Worwood, 2007).
| Continent | Minimum | Maximum | Average | S.D. | |
|---|---|---|---|---|---|
| Africa | 41 | 0 | 0.594 | 0.072 | 0.090 |
| America | 42 | 0 | 0.549 | 0.111 | 0.120 |
| Asia | 99 | 0 | 0.558 | 0.029 | 0.044 |
| Europe | 91 | 0 | 0.107 | 0.004 | 0.008 |
| Oceania | 28 | 0 | 0.652 | 0.124 | 0.159 |
Figure 3EHH analysis of the HP CNV and neighboring SNPs. (A) Frequency of the four haplotypes defined by a core haplotype including the HP CNV. G and A are the ancestral alleles for SNPs rs2000999 and rs152837, respectively. In the first position, A corresponds to Hp1 and C corresponds to Hp2; (B) Bifurcation diagrams for each of the four haplotypes. Division of the diagram reflects breakdown of LD; (C) EHH plotted for the core haplotype at ∼500 kb in both directions from the core haplotype. The decay of EHH for haplotype Hp2-G-G is markedly different from the other haplotypes; (D) Significance of EHH and rEHH values for haplotype Hp2-G-G.
ANOVA test comparing mean minor allele frequencies (MAF) between groups of SNPs stratified by level of disequilibrium with the Hp2 allele.
| SNP | Mean q (95% CI) | S.D. | Minimum q | Maximum q | |
|---|---|---|---|---|---|
| (a) D′ < 0.75 | 148 | 0.28 (0.26,0.30) | 0.12 | 0.05 | 0.50 |
| (b) D′≥ 0.75 | 12 | 0.13 (0.10,0.17) | 0.06 | 0.04 | 0.22 |
P < 0.001.
Based on one way ANOVA.
Haplotype frequencies for SNPs in high LD with the HP genotype. 10 SNPs show alleles associating more closely with the Hp2 allele (labels B-J and L), whilst two are more closely associated with the Hp1 allele (labels A and K).
| SNP code | Haplotype ( | Observed haplotype frequency | D′ | Position on chromosome 16 | Min. allele frequency | ||
|---|---|---|---|---|---|---|---|
| (A) | rs17665900 | 2,G | 0.624 | 0.81 | 0.16 | 69915112 | 0.878 |
| 2,A | 0.015 | −0.81 | 0.122 | ||||
| 1,G | 0.255 | −0.81 | 0.878 | ||||
| 1,A | 0.107 | 0.81 | 0.122 | ||||
| (B) | rs1424241 | 2,C | 0.475 | −0.79 | 0.08 | 69966408 | 0.824 |
| 2,T | 0.162 | 0.79 | 0.176 | ||||
| 1,C | 0.349 | 0.79 | 0.824 | ||||
| 1,T | 0.013 | −0.79 | 0.176 | ||||
| (C) | rs2000999 | 2,G | 0.433 | −0.89 | 0.12 | 69995594 | 0.786 |
| 2,A | 0.205 | 0.89 | 0.214 | ||||
| 1,G | 0.353 | 0.89 | 0.786 | ||||
| 1,A | 0.009 | −0.89 | 0.214 | ||||
| (D) | rs152837 | 2,A | 0.567 | −1.00 | 0.04 | 70005252 | 0.929 |
| 2,G | 0.071 | 1.00 | 0.071 | ||||
| 1,A | 0.362 | 1.00 | 0.929 | ||||
| 1,G | 0.000 | −1.00 | 0.071 | ||||
| (E) | rs152828 | 2,G | 0.540 | −1.00 | 0.06 | 70011387 | 0.901 |
| 2,A | 0.099 | 1.00 | 0.099 | ||||
| 1,G | 0.362 | 1.00 | 0.901 | ||||
| 1,A | 0.000 | −1.00 | 0.099 | ||||
| (F) | rs217180 | 2,G | 0.529 | −1.00 | 0.07 | 70072130 | 0.890 |
| 2,A | 0.110 | 1.00 | 0.110 | ||||
| 1,G | 0.362 | 1.00 | 0.890 | ||||
| 1,A | 0.000 | −1.00 | 0.110 | ||||
| (G) | rs12926250 | 2,G | 0.529 | −0.83 | 0.05 | 70100817 | 0.884 |
| 2,T | 0.109 | 0.83 | 0.116 | ||||
| 1,G | 0.355 | 0.83 | 0.884 | ||||
| 1,T | 0.007 | −0.83 | 0.116 | ||||
| (H) | rs12928056 | 2,C | 0.527 | −0.84 | 0.05 | 70113075 | 0.882 |
| 2,A | 0.111 | 0.84 | 0.118 | ||||
| 1,C | 0.355 | 0.84 | 0.882 | ||||
| 1,A | 0.007 | −0.84 | 0.118 | ||||
| (I) | rs11646048 | 2,A | 0.439 | −0.90 | 0.12 | 70179233 | 0.793 |
| 2,G | 0.200 | 0.90 | 0.207 | ||||
| 1,A | 0.355 | 0.90 | 0.793 | ||||
| 1,G | 0.007 | −0.90 | 0.207 | ||||
| (J) | rs9940976 | 2,A | 0.431 | −0.82 | 0.11 | 70199327 | 0.778 |
| 2,C | 0.207 | 0.82 | 0.222 | ||||
| 1,A | 0.347 | 0.82 | 0.778 | ||||
| 1,C | 0.015 | −0.82 | 0.222 | ||||
| (K) | rs726887 | 2,G | 0.636 | 0.85 | 0.06 | 70239298 | 0.955 |
| 2,T | 0.004 | −0.85 | 0.045 | ||||
| 1,G | 0.319 | −0.85 | 0.955 | ||||
| 1,T | 0.041 | 0.85 | 0.045 | ||||
| (L) | rs212165 | 2,A | 0.535 | −0.82 | 0.05 | 70356964 | 0.889 |
| 2,G | 0.104 | 0.82 | 0.111 | ||||
| 1,A | 0.354 | 0.82 | 0.889 | ||||
| 1,G | 0.007 | −0.82 | 0.111 |
Age estimates of the HP CNV and 12 SNPs in high LD (D′ > 0.79) with HP genotype
| SNP code | SNP allele | Min allele frequency (q) | Allele age estimates | ||||
|---|---|---|---|---|---|---|---|
| Scaled time | Generations | Age (years) | |||||
| (A) | rs17665900 | A | 0.122 | 0.58 | 5844 | 116,875 | |
| (B) | rs1424241 | T | 0.176 | 0.74 | 7417 | 148,332 | |
| (C) | rs2000999 | A | 0.214 | 0.84 | 8388 | 167,761 | |
| (D) | rs152837 | G | 0.071 | 0.41 | 4052 | 81,046 | |
| (E) | rs152828 | A | 0.099 | 0.51 | 5069 | 101,386 | |
| (F) | rs217180 | A | 0.110 | 0.54 | 5443 | 108,850 | |
| (G) | rs12926250 | T | 0.116 | 0.57 | 5668 | 113,354 | |
| (H) | rs12928056 | A | 0.118 | 0.57 | 5712 | 114,241 | |
| (I) | rs11646048 | G | 0.207 | 0.82 | 8219 | 164,380 | |
| (J) | rs9940976 | C | 0.222 | 0.86 | 8587 | 171,748 | |
| (K) | rs726887 | T | 0.045 | 0.29 | 2938 | 58,760 | |
| (L) | rs212165 | G | 0.111 | 0.55 | 5488 | 109,760 | |
| (M) | 1 | 0.362 | 1.15 | 11,531 | 230,616 | ||
Scaled estimate based on formula by Slatkin and Rannala (Slatkin and Rannala, 2000).
Based on the minimum populations size before recent modern human growth (N= 10,000).
Assuming a generation span of 20 years.