| Literature DB >> 24917567 |
Kaye N Ballantyne1, Arwin Ralf, Rachid Aboukhalid, Niaz M Achakzai, Maria J Anjos, Qasim Ayub, Jože Balažic, Jack Ballantyne, David J Ballard, Burkhard Berger, Cecilia Bobillo, Mehdi Bouabdellah, Helen Burri, Tomas Capal, Stefano Caratti, Jorge Cárdenas, François Cartault, Elizeu F Carvalho, Monica Carvalho, Baowen Cheng, Michael D Coble, David Comas, Daniel Corach, Maria E D'Amato, Sean Davison, Peter de Knijff, Maria Corazon A De Ungria, Ronny Decorte, Tadeusz Dobosz, Berit M Dupuy, Samir Elmrghni, Mateusz Gliwiński, Sara C Gomes, Laurens Grol, Cordula Haas, Erin Hanson, Jürgen Henke, Lotte Henke, Fabiola Herrera-Rodríguez, Carolyn R Hill, Gunilla Holmlund, Katsuya Honda, Uta-Dorothee Immel, Shota Inokuchi, Mark A Jobling, Mahmoud Kaddura, Jong S Kim, Soon H Kim, Wook Kim, Turi E King, Eva Klausriegler, Daniel Kling, Lejla Kovačević, Leda Kovatsi, Paweł Krajewski, Sergey Kravchenko, Maarten H D Larmuseau, Eun Young Lee, Ruediger Lessig, Ludmila A Livshits, Damir Marjanović, Marek Minarik, Natsuko Mizuno, Helena Moreira, Niels Morling, Meeta Mukherjee, Patrick Munier, Javaregowda Nagaraju, Franz Neuhuber, Shengjie Nie, Premlaphat Nilasitsataporn, Takeki Nishi, Hye H Oh, Jill Olofsson, Valerio Onofri, Jukka U Palo, Horolma Pamjav, Walther Parson, Michal Petlach, Christopher Phillips, Rafal Ploski, Samayamantri P R Prasad, Dragan Primorac, Gludhug A Purnomo, Josephine Purps, Hector Rangel-Villalobos, Krzysztof Rębała, Budsaba Rerkamnuaychoke, Danel Rey Gonzalez, Carlo Robino, Lutz Roewer, Alexandra Rosa, Antti Sajantila, Andrea Sala, Jazelyn M Salvador, Paula Sanz, Cornelia Schmitt, Anil K Sharma, Dayse A Silva, Kyoung-Jin Shin, Titia Sijen, Miriam Sirker, Daniela Siváková, Vedrana Skaro, Carlos Solano-Matamoros, Luis Souto, Vlastimil Stenzl, Herawati Sudoyo, Denise Syndercombe-Court, Adriano Tagliabracci, Duncan Taylor, Andreas Tillmar, Iosif S Tsybovsky, Chris Tyler-Smith, Kristiaan J van der Gaag, Daniel Vanek, Antónia Völgyi, Denise Ward, Patricia Willemse, Eric P H Yap, Rita Y Y Yong, Irena Zupanič Pajnič, Manfred Kayser.
Abstract
Relevant for various areas of human genetics, Y-chromosomal short tandem repeats (Y-STRs) are commonly used for testing close paternal relationships among individuals and populations, and for male lineage identification. However, even the widely used 17-loci Yfiler set cannot resolve individuals and populations completely. Here, 52 centers generated quality-controlled data of 13 rapidly mutating (RM) Y-STRs in 14,644 related and unrelated males from 111 worldwide populations. Strikingly, >99% of the 12,272 unrelated males were completely individualized. Haplotype diversity was extremely high (global: 0.9999985, regional: 0.99836-0.9999988). Haplotype sharing between populations was almost absent except for six (0.05%) of the 12,156 haplotypes. Haplotype sharing within populations was generally rare (0.8% nonunique haplotypes), significantly lower in urban (0.9%) than rural (2.1%) and highest in endogamous groups (14.3%). Analysis of molecular variance revealed 99.98% of variation within populations, 0.018% among populations within groups, and 0.002% among groups. Of the 2,372 newly and 156 previously typed male relative pairs, 29% were differentiated including 27% of the 2,378 father-son pairs. Relative to Yfiler, haplotype diversity was increased in 86% of the populations tested and overall male relative differentiation was raised by 23.5%. Our study demonstrates the value of RM Y-STRs in identifying and separating unrelated and related males and provides a reference database.Entities:
Keywords: RM Y-STRs; Y-STRs; Y-chromosome; forensic; haplotypes; paternal lineage
Mesh:
Year: 2014 PMID: 24917567 PMCID: PMC4145662 DOI: 10.1002/humu.22599
Source DB: PubMed Journal: Hum Mutat ISSN: 1059-7794 Impact factor: 4.878
RM Y-STR Haplotype Characteristics in a Global Set of 12,272 Individuals from 111 Populations Summarized for Regional Groups
| Group | Individuals | Populations | Haplotypes | Haplotype diversity | θ | Average number of differing loci |
|---|---|---|---|---|---|---|
| Sub-Saharan Africa | 303 | 5 | 300 | 0.99993 | 0.00006 | 16.5 |
| North Africa | 452 | 4 | 445 | 0.99992 | 0.00010 | 16.2 |
| Middle East | 100 | 1 | 100 | 1.0 | 0 | 15.26 |
| Central Asia | 86 | 1 | 80 | 0.99836 | 0.0012 | 16.18 |
| South Asia | 661 | 8 | 644 | 0.99992 | 0.00009 | 17.5 |
| East Asia | 967 | 7 | 964 | 0.999994 | 0.000006 | 17.3 |
| South East Asia | 634 | 6 | 630 | 0.99998 | 0.00002 | 16.6 |
| Aboriginal Australian | 100 | 1 | 96 | 0.99919 | 0.00069 | 16.8 |
| Native American | 365 | 10 | 357 | 0.99986 | 0.00015 | 17.4 |
| Admixed Native American | 764 | 12 | 758 | 0.99998 | 0.00002 | 17.5 |
| European | 5,618 | 38 | 5599 | 0.9999988 | 0.000001 | 17.4 |
| Migrant Sub-Saharan Africa | 663 | 5 | 659 | 0.99998 | 0.000004 | 16.3 |
| Migrant European | 731 | 3 | 730 | 0.999996 | 0.000004 | 17.0 |
| Migrant Asian | 649 | 7 | 645 | 0.99998 | 0.00002 | 16.7 |
| Bhutan | 78 | 2 | 56 | 0.99434 | 0.00794 | 13.9 |
| Biaka Pygmy | 101 | 1 | 94 | 0.99822 | 0.00226 | 17.5 |
| Global | 12,272 | 111 | 12,156 | 0.9999985 | 0.00000238 | 16.5 |
Bhutan and Biaka Pygmies were not assigned to a larger geographic group as the individuals sampled were ascertained differently (Bhutan), or are known to have different population characteristics to all other study populations (Biaka, see Materials and Methods and DNA Samples).
Figure 1Two-dimensional plot of MDS analysis of Slatkin's linearized F values for RM Y-STR haplotypes in a global sample of 12,272 individuals from 111 populations (stress 0.07462). Smaller inset MDS shows the effect of equalized sample size (N = 100 individuals or fewer per population, stress = 0.02416).
Comparison of RM Y-STR and Yfiler Haplotype Characteristics in a Global Set of 7,784 Individuals from 65 Populations Summarized for Regional Groups
| Group | Individuals | Populations | Yfiler haplotypes | RM Y-STR haplotypes | Yfiler haplotype diversity | RM Y-STR haplotype diversity | Yfiler average percentage of differing loci | RM Y-STR average percentage of differing loci |
|---|---|---|---|---|---|---|---|---|
| North Africa | 193 | 2 | 173 | 189 | 0.99792 | 0.99973 | 59% | 73% |
| Central Asia | 83 | 1 | 67 | 77 | 0.99060 | 0.99824 | 63% | 77% |
| South Asia | 497 | 6 | 450 | 483 | 0.99951 | 0.99988 | 68% | 83% |
| East Asia | 633 | 5 | 580 | 630 | 0.99952 | 0.99999 | 64% | 82% |
| South East Asia | 200 | 2 | 175 | 198 | 0.99759 | 0.99990 | 66% | 81% |
| Aboriginal Australian | 100 | 1 | 74 | 96 | 0.99152 | 0.99919 | 68% | 80% |
| Native American | 279 | 8 | 233 | 275 | 0.99761 | 0.99988 | 61% | 82% |
| Admixed Native American | 458 | 6 | 444 | 454 | 0.99986 | 0.99996 | 67% | 81% |
| European | 4,041 | 25 | 3,696 | 4,025 | 0.99991 | 0.999998 | 66% | 82% |
| Migrant Sub-Saharan Africa | 442 | 3 | 407 | 439 | 0.99909 | 0.99997 | 63% | 81% |
| Migrant European | 552 | 3 | 541 | 551 | 0.99993 | 0.99999 | 63% | 81% |
| Migrant Asian | 205 | 2 | 176 | 203 | 0.99737 | 0.99990 | 66% | 83% |
| Biaka Pygmies | 101 | 1 | 83 | 94 | 0.99505 | 0.99822 | 65% | 83% |
| Global | 7,784 | 65 | 6,975 | 7,714 | 0.99995 | 0.999997 | 65% | 81% |
Figure 2Proportion of individuals with haplotypes shared within populations (right) and between populations (left) for Yfiler (light blue bars) and the RM Y-STR set (dark red bars).
Figure 3Geographic representation of pairwise between-population haplotype sharing. Blue lines connect population pairs showing shared haplotypes for A: Yfiler, and B: the RM Y-STRs set. Smaller insets show Europe enlarged.
Figure 4Weighted median-joining networks from single-copy Y-STRs for 1,000 individuals randomly selected from the total dataset, with regional geographic groups indicated by colors (see inset legend). A: Based on 15-loci Yfiler haplotypes (excluding DYS385a/b). B: Based on 10-loci RM Y-STR haplotypes (excluding DYF399S1, DYF403S1, and DYF404S1).
RM Y-STR Haplotype Characteristics in a Global Set of 12,272 Individuals from 111 Populations Summarized for Groups Based on Urbanization Status
| Population classification | Populations | Individuals | Haplotypes | Singletons | Haplotype diversity | Proportion of nonunique haplotypes |
|---|---|---|---|---|---|---|
| Endogamous | 7 | 558 | 512 | 478 | 0.999614 | 0.143 |
| Rural | 20 | 1,369 | 1,354 | 1,340 | 0.999982 | 0.021 |
| Urban | 51 | 7,198 | 7,164 | 7,131 | 0.999999 | 0.009 |
| Mixed | 26 | 2,379 | 2,361 | 2,342 | 0.999994 | 0.015 |
| Unknown | 7 | 770 | 767 | 764 | 0.999990 | 0.008 |
Comparison of RM Y-STRs and Yfiler Haplotype Characteristics in a Global Set of 7,784 Individuals from 65 Populations Summarized for Groups Based on Urbanization Status
| Population classification | Yfiler ht | RM Y-STR ht | Yfiler singletons | RM Y-STR singletons | Yfiler haplotype diversity | RM Y-STR haplotype diversity | Yfiler proportion of nonunique haplotypes | RM Y-STR proportion of nonunique haplotypes | Yfiler percentage average allelic difference | RM Y-STR percentage average allelic difference | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Endogamous | 466 | 419 | 445 | 380 | 427 | 0.99947 | 0.99978 | 0.101 | 0.045 | 68% | 83% |
| Rural | 714 | 602 | 708 | 537 | 703 | 0.99916 | 0.99997 | 0.157 | 0.008 | 70% | 85% |
| Urban | 4,974 | 4,577 | 4,951 | 4,318 | 4,929 | 0.99994 | 0.999998 | 0.080 | 0.005 | 68% | 83% |
| Mixed | 1,422 | 1,332 | 1,407 | 1,267 | 1,392 | 0.99988 | 0.99999 | 0.063 | 0.011 | 67% | 84% |
| Unknown | 208 | 181 | 205 | 169 | 202 | 0.99684 | 0.99986 | 0.130 | 0.014 | 64% | 83% |
Figure 5Empirical male relative differentiation using 2,528 paternal relative pairs separated by one to 20 meioses for Yfiler (light blue bars) and the RM Y-STR set (dark red bars). The data combine the new 2,372 relative pairs from the current study with the previously used 156 relative pairs [Ballantyne et al., 2012]. Values above the bars indicate the absolute number of relative pairs the estimated differentiation rate is based upon. Error bars represent binomial confidence intervals.