| Literature DB >> 21775986 |
Anjali G Hinch1, Arti Tandon, Nick Patterson, Yunli Song, Nadin Rohland, Cameron D Palmer, Gary K Chen, Kai Wang, Sarah G Buxbaum, Ermeg L Akylbekova, Melinda C Aldrich, Christine B Ambrosone, Christopher Amos, Elisa V Bandera, Sonja I Berndt, Leslie Bernstein, William J Blot, Cathryn H Bock, Eric Boerwinkle, Qiuyin Cai, Neil Caporaso, Graham Casey, L Adrienne Cupples, Sandra L Deming, W Ryan Diver, Jasmin Divers, Myriam Fornage, Elizabeth M Gillanders, Joseph Glessner, Curtis C Harris, Jennifer J Hu, Sue A Ingles, William Isaacs, Esther M John, W H Linda Kao, Brendan Keating, Rick A Kittles, Laurence N Kolonel, Emma Larkin, Loic Le Marchand, Lorna H McNeill, Robert C Millikan, Adam Murphy, Solomon Musani, Christine Neslund-Dudas, Sarah Nyante, George J Papanicolaou, Michael F Press, Bruce M Psaty, Alex P Reiner, Stephen S Rich, Jorge L Rodriguez-Gil, Jerome I Rotter, Benjamin A Rybicki, Ann G Schwartz, Lisa B Signorello, Margaret Spitz, Sara S Strom, Michael J Thun, Margaret A Tucker, Zhaoming Wang, John K Wiencke, John S Witte, Margaret Wrensch, Xifeng Wu, Yuko Yamamura, Krista A Zanetti, Wei Zheng, Regina G Ziegler, Xiaofeng Zhu, Susan Redline, Joel N Hirschhorn, Brian E Henderson, Herman A Taylor, Alkes L Price, Hakon Hakonarson, Stephen J Chanock, Christopher A Haiman, James G Wilson, David Reich, Simon R Myers.
Abstract
Recombination, together with mutation, gives rise to genetic variation in populations. Here we leverage the recent mixture of people of African and European ancestry in the Americas to build a genetic map measuring the probability of crossing over at each position in the genome, based on about 2.1 million crossovers in 30,000 unrelated African Americans. At intervals of more than three megabases it is nearly identical to a map built in Europeans. At finer scales it differs significantly, and we identify about 2,500 recombination hotspots that are active in people of West African ancestry but nearly inactive in Europeans. The probability of a crossover at these hotspots is almost fully controlled by the alleles an individual carries at PRDM9 (P value < 10(-245)). We identify a 17-base-pair DNA sequence motif that is enriched in these hotspots, and is an excellent match to the predicted binding target of PRDM9 alleles common in West Africans and rare in Europeans. Sites of this motif are predicted to be risk loci for disease-causing genomic rearrangements in individuals carrying these alleles. More generally, this map provides a resource for research in human genetic variation and evolution.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21775986 PMCID: PMC3154982 DOI: 10.1038/nature10336
Source DB: PubMed Journal: Nature ISSN: 0028-0836 Impact factor: 49.962
Figure 1Building an African American genetic map. (A) HAPMIX detection of crossovers between segments of inferred ancestry is illustrated in a father-mother-child trio. Black segments show inferred crossovers; arrows showtransmission of ancestral crossovers from parent to child, Purple/green segments show de novo events (paternal/maternal origin respectively), corresponding to events identified directly using two additional children (bottom,“Pedigree inferred”).(B) The AA Map localizes five hotspots in a region of the MHC whose positions (blue) were previously mapped by sperm typing[1]. (C) Comparison of maps shows a hotspot at 33.1Mb in the African-derived AA and YRI maps, but not the deCODE and CEU maps (all maps smoothed to 10kb).
Genetic map assessments at different size scales
| Scale | ρ - Pearson correlation of the AA map | Est. correlation of | Est. coefficient of variation | ||
|---|---|---|---|---|---|
| Combined LD | CEU | YRI | |||
| 3 kb | 0.75 (0.63) | 0.66 (0.58) | 0.71 (0.53) | 0.93 | 1.41 |
| 10 kb | 0.82 (0.74) | 0.73 (0.70) | 0.78 (0.65) | 0.96 | 0.73 |
| 30 kb | 0.86 (0.83) | 0.78 (0.78) | 0.83 (0.74) | 0.98 | 0.36 |
| 100 kb | 0.91 (0.89) | 0.84 (0.85) | 0.87 (0.81) | 0.99 | 0.17 |
| 300 kb | 0.94 (0.93) | 0.89 (0.90) | 0.92 (0.88) | 1.00 | 0.08 |
| 1 Mb | 0.97 (0.96) | 0.94 (0.94) | 0.95 (0.95) | 1.00 | 0.04 |
| 3 Mb | 0.98 (0.98) | 0.97 (0.97) | 0.98 (0.97) | 1.00 | 0.02 |
Note: The numbers in this table are restricted to the autosomes and genomic segments more than 5 Mb from the telomere.
The Combined map is the HapMap2 population-averaged LD-based map18[18].
The standard error of the map at each size scale is determined by the posterior probability distribution of the MCMC.
Figure 2Association of PRDM9 genetic variation with hotspot activity. (A) A GWAS measuring association of the “African-enrichment” (AE) phenotype shows a single genome-wide significant peak at PRDM9, with rs6889665 the best associated SNP. (B) Relationship between alleles at the rs6889665 and predicted binding target of the PRDM9 zinc finger array[9] for West African and European samples. The alleles are grouped into 8 clusters according to their best-matching region to the 13-bp motif, and annotated by the number of bases matching the motif. The African-enriched rs6889665 “C” allele always co-occurs with motifs with a poor (5/8) match to the 13-mer. (C) Gene tree32 of the LD block containing the PRDM9 ZF array (Methods); numbered circles show SNPs and significant P-values for association, after conditioning on rs6889665.
Figure 3A sequence motif specifying the positions of African-enriched hotspots. (A) Logo plot showing a degenerate 17-bp hotspot motif, with stack height proportional to -log P-value, and relative letter height proportional to the mean crossover rate increase given each base. Below is the bioinformatic PRDM9 binding prediction for the rs6889665 AE associated alleles (from Figure 2B), matching the motif at 10/11 bases (lines). (B) Average crossover rate (in 2 kb sliding windows) in the AA (red line) and deCODE (black line) maps surrounding the 500 strongest motif matches. (C) In seven rs6889665 “CC” individuals from the pedigree study, we localized 82 crossovers to within 10 kb, and plot average AA, YRI, deCODE and CEU map rates. There is no strong peak above local background in the deCODE or CEU maps.