| Literature DB >> 19180233 |
David Reich1, Michael A Nalls, W H Linda Kao, Ermeg L Akylbekova, Arti Tandon, Nick Patterson, James Mullikin, Wen-Chi Hsueh, Ching-Yu Cheng, Josef Coresh, Eric Boerwinkle, Man Li, Alicja Waliszewska, Julie Neubauer, Rongling Li, Tennille S Leak, Lynette Ekunwe, Joe C Files, Cheryl L Hardy, Joseph M Zmuda, Herman A Taylor, Elad Ziv, Tamara B Harris, James G Wilson.
Abstract
Persistently low white blood cell count (WBC) and neutrophil count is a well-described phenomenon in persons of African ancestry, whose etiology remains unknown. We recently used admixture mapping to identify an approximately 1-megabase region on chromosome 1, where ancestry status (African or European) almost entirely accounted for the difference in WBC between African Americans and European Americans. To identify the specific genetic change responsible for this association, we analyzed genotype and phenotype data from 6,005 African Americans from the Jackson Heart Study (JHS), the Health, Aging and Body Composition (Health ABC) Study, and the Atherosclerosis Risk in Communities (ARIC) Study. We demonstrate that the causal variant must be at least 91% different in frequency between West Africans and European Americans. An excellent candidate is the Duffy Null polymorphism (SNP rs2814778 at chromosome 1q23.2), which is the only polymorphism in the region known to be so differentiated in frequency and is already known to protect against Plasmodium vivax malaria. We confirm that rs2814778 is predictive of WBC and neutrophil count in African Americans above beyond the previously described admixture association (P = 3.8 x 10(-5)), establishing a novel phenotype for this genetic variant.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19180233 PMCID: PMC2628742 DOI: 10.1371/journal.pgen.1000360
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Relationship between ancestry and the distribution of neutrophil count.
(A) Distribution of normally transformed absolute neutrophil count for the three classes of genotype at rs2814778. Individuals who are homozygous for the null allele have distinctly lower neutrophil count (−0.35±0.89 standard deviations compared with the mean) than individuals who are carriers for the functional allele (0.76±0.89). We were able to place constraints on the frequency of the high neutrophil count allele in (B) West Africans, and (C) European Americans by assuming that the observed distributions of neutrophil count for each ancestry class (which we marked in practice by the genotype at rs2814778) are a mixture of distributions specified by the underlying allele frequency. The results indicate a 99% probability that the frequency is <4.9% in Africans and also a 99% probability that the frequency is >95.2% in Europeans.
Comparison of phenotypic characteristics for the four sets of samples used in this study.
| Number of samples | No. homozygous for null rs2814778 allele | Age (range) | BMI (range) | % Female | Average percentage of European ancestry | Total WBC | Neutrophils | Bands | Lymphocytes | Monocytes | Eosinophils | Basophils | Ratio of absolute neutrophil count in carriers of Eur. anc. to non-carriers | Z-score for association of European ancestry carriers to neut. level | Corr. coefficient ρ (stand. error) between Eur. anc. carrier and neut. level | |
|
| 1,969 | 1,349 | 52 (21–84) | 32 (15–66) | 60% | 18.1% | 5.58 | 3.09 | .004 | 1.93 | .387 | .138 | .030 | 1.58 | 29.0 | .519 (.016) |
|
| 2,476 | 1,749 | 54 (44–66) | 30 (14–66) | 60% | 17.7% | 5.73 | 2.85 | .158 | 2.13 | .347 | .167 | .035 | 1.65 | 30.4 | .497 (.014) |
|
| 902 | 617 | 59 (50–74) | 31 (18–59) | 67% | 17.8% | 5.46 | 2.87 | .080 | 1.94 | .359 | .145 | .034 | 1.64 | 21.5 | .568 (.021) |
|
| 658 | 396 | 73 (69–80) | 29 (16–43) | 62% | 21.3% | 5.59 | 2.96 | Not avail | 1.93 | .473 | .172 | .059 | 1.62 | 16.6 | .532 (.026) |
|
| 6,005 | 3,958 | 56 (21–84) | 30 (14–66) | 61% | 18.2% | 5.63 | 2.94 | .088 | 2.02 | .376 | .155 | .036 | 1.62 | 49.7 | .519 (.009) |
Note: For the differential white blood cell counts, we present absolute values, obtained by multiplying the differential counts (expressed as a percentage of total WBC) by the total WBC. The Health ABC study did not obtain band counts.
*: For the JHS-ARIC overlap samples, the values of all phenotypes (age, BMI, and differential white blood cell counts) are averaged between the JHS and ARIC baseline measurements, taken an average of 14 years apart. Analyses in all other groups use a single measurement at the cohort baseline.
§: Average percentage of European ancestry was estimated using the ANCESTRYMAP software. European ancestry “carrier status” was defined by rs2814778 genotype.
†: Neutrophil count is used to assess the genetic association, since it accounts for essentially all the correlation to the chromosome 1 locus and thus provides the best measure of association (text and Table 2). Association analyses are carried out on the normally transformed rank-ordered values of the absolute neutrophil count. Standard errors on the correlation coefficients are obtained by a leave-1-out jackknife analysis.
Effect of the chromosome 1 locus on white blood cell counts.
| Raw values of differential white blood cell counts | Rank-order transformation of differential WBC counts | Correlation (ρ) of WBC rank-order to ancestry | ||||||||
| Z-scores for association tests | ||||||||||
| African ancestry homo-zygotes (n = 4,111) (std. err.) | Afr/Eur ancestry hetero-zygotes (n = 1,647) (std. err.) | European ancestry homo-zygotes (n = 247) (std. err.) | Ratio of 1,894 Eur ancestry carriers to 4,111 Afr ancestry homozygotes | Afr anc. homozy-gotes vs. Afr/Eur hetero-zygotes | Afr anc. homozy-gotes vs. Eur anc. homo-zygotes | Afr/Eur anc heter-ozygotes vs. Eur anc. Homo-zygotes | Afr anc. Homo-zygotes vs. Eur ancestry carrier | ρ between European ancestry carrier and phenotype | ρ between Eur ancestry carrier and phenotype correcting for neut. level | |
|
| 5.085 (.024) | 6.798 (.045) | 6.847 (.109) | 1.34 | 39.4 | 19.4 | 0.7 | 41.6 | .458 | Not significant |
|
| 2.459 (.017) | 3.982 (.034) | 4.013 (.084) | 1.62 | 47.2 | 23.2 | 0.6 | 49.7 | .519 | Not applicable |
|
| 0.080 (.004) | 0.107 (.008) | 0.100 (.017) | 1.32 | 0.6 | 0.1 | −0.1 | 0.6 | Not significant | Not significant |
|
| 1.995 (.012) | 2.06 (.018) | 2.081 (.048) | 1.03 | 3.9 | 2.0 | 0.3 | 4.2 | .054 | Not significant |
|
| 0.351 (.003) | 0.427 (.004) | 0.438 (.012) | 1.22 | 16.9 | 7.6 | 0.5 | 17.8 | .222 | .025 (P = .05) |
|
| 0.149 (.002) | 0.169 (.004) | 0.157 (.007) | 1.13 | 6.1 | 2.0 | −0.9 | 6.2 | .080 | Not significant |
|
| 0.034 (.001) | 0.041 (.001) | 0.038 (.002) | 1.17 | 6.7 | 1.1 | −1.7 | 6.5 | .085 | −.034 (P = .009) |
*: European ancestry at the locus has an essentially dominant effect on white blood cell counts. None of the differential counts phenotypes shows a significant difference between the 1,647 FY−/FY+ heterozygotes and 247 FY+/FY+ homozygotes after correcting for 7 hypotheses tested (Bonferroni corrected P = 0.48). However, all white blood cell counts except bands show significant associations comparing the 4,111 FY−/FY− homozygotes to the 1,894 carriers of the FY+ allele (Z-scores between 4.2 and 49.7 standard deviations).
†: After controlling for neutrophil level, there is weak additional evidence for association to the chromosome 1 locus only for monocytes and basophils. For subsequent mapping analyses we only focused on neutrophil levels, as adding these other residuals did not substantially increase the strength of the association. We do not estimate correlation coefficients of the associations that are not significant.
Stratification of samples into 12 groups for admixture mapping analysis.
| Standard deviations compared with population mean | Samples | Risk model used (factor change in European ancestry vs. genome average) | LOD score at rs2814778 |
| <−1.5 | 400 | 0.04 | 46.6 |
| −1.5 to −1.0 | 552 | 0.05 | 64.8 |
| −1.0 to −.75 | 408 | 0.11 | 35.2 |
| −.75 to −.5 | 493 | 0.19 | 28.6 |
| −.5 to −.25 | 557 | 0.3 | 22.9 |
| −.25 to 0 | 593 | 0.56 | 7.1 |
| 0 to .25 | 593 | 0.98 | 0.1 |
| .25 to .5 | 558 | 1.6 | 4.6 |
| .5 to .75 | 492 | 2.5 | 17.5 |
| .75 to 1 | 408 | 3.5 | 27.8 |
| 1 to 1.5 | 552 | 5 | 53.8 |
| >1.5 | 399 | 6.5 | 54.0 |
| Sum across all | 6,005 | 363.1 |
Figure 2Admixture association defines a 451 kb region containing the risk allele.
The LOD score for admixture association to neutrophil count shows a peak of 363.1, and a 99% confidence interval of 155.957–156.408 Mb (the region where the LOD score is within 1.44 of its maximum). The known genes under the peak are obtained using a screenshot of the “Known Genes” track from the UCSC genome browser (http://genome.ucsc.edu).
Reduced neutrophil count is more associated to the Duffy null polymorphism than to ancestry.
| χ2 value for association of neutrophil count to a predictor (n = 5,997 individuals passing analysis quality filters) | |
|
| 1883.2 |
|
| 1867.9 |
|
| 1883.6 |
|
| 15.7 (P = 3.8×10−5) |
|
| 0.4 (P = 0.74) |
*: For the association tests, we used ancestry estimates that included the genotype of the SNP. This ensures that any association at the SNP is above and beyond that expected from ancestry alone.
Figure 3Fine mapping reveals rs2814778 as the only significant association.
(A) Results of case-control association analysis for 193 SNPs genotyped in 148 individuals with low neutrophil count (<2,100/mm3), which we compared with 74 controls with high neutrophil count (5,000–9,000/mm3). All samples were selected to have a confident estimate of all African ancestry at the chromosome 1 locus (>99% probability) based on ANCESTRYMAP analysis at flanking markers outside the admixture peak. (B) HapMap SNPs of >5% minor allele frequency are well captured by this genotyping. We find that 94% of West African SNPs and 96% of European American SNPs are correlated with r2>0.8 to one of the SNPs we genotyped.
Figure 4Ancestry analysis of BAC clones that are the source of the human genome reference sequence across the chromosome 1 locus.
The human genome reference sequence across the admixture peak is pieced together from 5 BAC clones, which turn out to be a mosaic of European and African ancestry. To determine ancestry, we examined the haplotype of the human genome reference sequence for 284 SNPs for which data are available from the International Haplotype Map Project, and then output the ratio of the number of perfect matches to the reference sequence haplotype in 120 European American to the number of perfect matches in 120 West African chromosomes (conservatively adding 1 to the counts). Values above 10 indicate strong (>10∶1) evidence for a European haplotype, and values below 0.1 indicate strong (<1∶10) evidence for an African haplotype. RPCI-11, the individual who is the source of ∼74% of the human genome reference sequence, has two clones of entirely African and two clones of entirely European ancestry at this locus, strongly indicating that the human genome reference sequence is primarily that of an African American. We included these BAC clones in our search for polymorphisms across the admixture peak that were consistent with being fixed in frequency between European and African populations.