| Literature DB >> 24854874 |
Josephine Purps1, Sabine Siegert2, Sascha Willuweit1, Marion Nagy1, Cíntia Alves3, Renato Salazar4, Sheila M T Angustia5, Lorna H Santos5, Katja Anslinger6, Birgit Bayer6, Qasim Ayub7, Wei Wei7, Yali Xue7, Chris Tyler-Smith7, Miriam Baeta Bafalluy8, Begoña Martínez-Jarreta8, Balazs Egyed9, Beate Balitzki10, Sibylle Tschumi10, David Ballard11, Denise Syndercombe Court11, Xinia Barrantes12, Gerhard Bäßler13, Tina Wiest13, Burkhard Berger14, Harald Niederstätter14, Walther Parson15, Carey Davis16, Bruce Budowle17, Helen Burri18, Urs Borer18, Christoph Koller18, Elizeu F Carvalho19, Patricia M Domingues19, Wafaa Takash Chamoun20, Michael D Coble21, Carolyn R Hill21, Daniel Corach22, Mariela Caputo22, Maria E D'Amato23, Sean Davison23, Ronny Decorte24, Maarten H D Larmuseau24, Claudio Ottoni24, Olga Rickards25, Di Lu26, Chengtao Jiang26, Tadeusz Dobosz27, Anna Jonkisz27, William E Frank28, Ivana Furac29, Christian Gehrig30, Vincent Castella30, Branka Grskovic31, Cordula Haas32, Jana Wobst32, Gavrilo Hadzic33, Katja Drobnic33, Katsuya Honda34, Yiping Hou35, Di Zhou35, Yan Li35, Shengping Hu36, Shenglan Chen36, Uta-Dorothee Immel37, Rüdiger Lessig37, Zlatko Jakovski38, Tanja Ilievska38, Anja E Klann39, Cristina Cano García39, Peter de Knijff40, Thirsa Kraaijenbrink40, Aikaterini Kondili41, Penelope Miniati41, Maria Vouropoulou41, Lejla Kovacevic42, Damir Marjanovic42, Iris Lindner43, Issam Mansour44, Mouayyad Al-Azem44, Ansar El Andari44, Miguel Marino45, Sandra Furfuro45, Laura Locarno45, Pablo Martín46, Gracia M Luque46, Antonio Alonso46, Luís Souto Miranda47, Helena Moreira47, Natsuko Mizuno48, Yasuki Iwashima48, Rodrigo S Moura Neto49, Tatiana L S Nogueira50, Rosane Silva51, Marina Nastainczyk-Wulf52, Jeanett Edelmann52, Michael Kohl52, Shengjie Nie53, Xianping Wang54, Baowen Cheng55, Carolina Núñez56, Marian Martínez de Pancorbo56, Jill K Olofsson57, Niels Morling57, Valerio Onofri58, Adriano Tagliabracci58, Horolma Pamjav59, Antonia Volgyi59, Gusztav Barany59, Ryszard Pawlowski60, Agnieszka Maciejewska60, Susi Pelotti61, Witold Pepinski62, Monica Abreu-Glowacka63, Christopher Phillips64, Jorge Cárdenas64, Danel Rey-Gonzalez64, Antonio Salas64, Francesca Brisighelli65, Cristian Capelli66, Ulises Toscanini67, Andrea Piccinini68, Marilidia Piglionica69, Stefania L Baldassarra69, Rafal Ploski70, Magdalena Konarzewska70, Emila Jastrzebska71, Carlo Robino72, Antti Sajantila73, Jukka U Palo74, Evelyn Guevara74, Jazelyn Salvador75, Maria Corazon De Ungria75, Jae Joseph Russell Rodriguez76, Ulrike Schmidt77, Nicola Schlauderer77, Pekka Saukko78, Peter M Schneider79, Miriam Sirker79, Kyoung-Jin Shin80, Yu Na Oh80, Iulia Skitsa81, Alexandra Ampati81, Tobi-Gail Smith82, Lina Solis de Calvit83, Vlastimil Stenzl84, Thomas Capal84, Andreas Tillmar85, Helena Nilsson85, Stefania Turrina86, Domenico De Leo86, Andrea Verzeletti87, Venusia Cortellini87, Jon H Wetton88, Gareth M Gwynne88, Mark A Jobling88, Martin R Whittle89, Denilce R Sumita89, Paulina Wolańska-Nowak90, Rita Y Y Yong91, Michael Krawczak92, Michael Nothnagel2, Lutz Roewer93.
Abstract
In a worldwide collaborative effort, 19,630 Y-chromosomes were sampled from 129 different populations in 51 countries. These chromosomes were typed for 23 short-tandem repeat (STR) loci (DYS19, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS385ab, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635, GATAH4, DYS481, DYS533, DYS549, DYS570, DYS576, and DYS643) and using the PowerPlex Y23 System (PPY23, Promega Corporation, Madison, WI). Locus-specific allelic spectra of these markers were determined and a consistently high level of allelic diversity was observed. A considerable number of null, duplicate and off-ladder alleles were revealed. Standard single-locus and haplotype-based parameters were calculated and compared between subsets of Y-STR markers established for forensic casework. The PPY23 marker set provides substantially stronger discriminatory power than other available kits but at the same time reveals the same general patterns of population structure as other marker sets. A strong correlation was observed between the number of Y-STRs included in a marker set and some of the forensic parameters under study. Interestingly a weak but consistent trend toward smaller genetic distances resulting from larger numbers of markers became apparent.Entities:
Keywords: AMOVA; Database; Discriminatory power; Gene diversity; Population structure
Mesh:
Year: 2014 PMID: 24854874 PMCID: PMC4127773 DOI: 10.1016/j.fsigen.2014.04.008
Source DB: PubMed Journal: Forensic Sci Int Genet ISSN: 1872-4973 Impact factor: 4.882
Fig. 1Geographic distribution of sampling sites. Male samples (n = 19,630) were collected at 129 sites on five continents.
Fig. 2Allele distribution of PPY23-specific loci.
Fig. 3Ranking of PPY23 markers by gene diversity (GD). For the calculation of GD, DYS385ab was treated as a single marker. DYS389II.I equals the difference between DYS389II and DYS389I. PPY23-specific markers are given in bold. (a) Rank within the whole data set; (b) rank within continental residency groups, i.e. Africa (orange; n = 445), Asia (red; n = 3458), Europe (magenta; n = 11,968), Latin America (blue; n = 1183) or North America (green; n = 2576). (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 4Marker-wise frequency of variant alleles. PPY23-specific markers are given in bold. Light gray: copy number variants; gray: intermediate alleles; dark gray: null allele.
Number of distinct haplotypes for the PPY23 marker panel overall and for five groups defined by continental residency.
| Overall | Africa | Asia | Europe | Latin America | North America | |
|---|---|---|---|---|---|---|
| 19,630 haplotypes | 445 haplotypes | 3458 haplotypes | 11,968 haplotypes | 1183 haplotypes | 2576 haplotypes | |
| 129 populations | 6 populations | 22 populations | 72 populations | 14 populations | 15 populations | |
| 18,237 (92.9%) | 337 (75.7%) | 3293 (95.2%) | 11,185 (93.5%) | 1094 (92.5%) | 2378 (92.3%) | |
| 531 | 27 | 65 | 314 | 38 | 67 | |
| 64 | 6 | 9 | 32 | 3 | 12 | |
| 16 | 4 | 2 | 7 | 1 | 3 | |
| 6 | 1 | 3 | 2 | |||
| 2 | – | 1 | 1 | |||
| 2 | 1 | – | ||||
| 1 | 1 | – | ||||
| – | – | |||||
| – | 1 | |||||
| 1 | ||||||
| HD | 0.999995 | 0.998704 | 0.999983 | 0.999992 | 0.999924 | 0.999953 |
| MP | 5.63 × 10−5 | 3.54 × 10−3 | 3.07 × 10−4 | 9.11 × 10−5 | 9.21 × 10−4 | 4.35 × 10−4 |
| MP as 1 in … | 17,760 | 283 | 3262 | 10,975 | 1086 | 2298 |
| # Distinct haplotypes | 18,860 | 377 | 3369 | 11,543 | 1136 | 2463 |
| # Population specific haplotypes | – | 376 | 3367 | 11,516 | 1127 | 2446 |
| DC | 0.9608 | 0.8472 | 0.9743 | 0.9645 | 0.9603 | 0.9561 |
HD, haplotype diversity; MP, match probability; DC, discrimination capacity. Number and origin of individuals sharing a distinct haplotype.
Northern Alaska [Inupiat] 5; Northern Alaska [Inupiat] 5; Krakow, Poland [Polish] 1; Netherlands [Dutch] 4; Finland [Finnish] 5; Estonia [Estonian] 1; Finland [Finnish] 3; Turku, Finland [Finnish] 1; Kinyawa, Kenya [Maasai] 5.
Northern Alaska, USA [Inupiat] 6; Estonia [Estonian] 1; Finland [Finnish] 3; Western Alaska, USA [Yupik] 2.
South Africa [Xhosa] 7; Baranya, Hungary [Romani] 2; Hungary [Hungarian] 2; Hungary [Romani] 2; USA [European American] 1.
Kinyawa, Kenya [Maasai] 8.
Central Alaska, USA [Athapaskan] 1; Estonia [Estonian] 1; Finland [Finnish] 6; Turku, Finland [Finnish] 3.
# Haplotypes observed.
Diversity values and forensic parameter estimates for five forensic marker panels.
| # Haplotypes observed | Forensic marker set | ||||
|---|---|---|---|---|---|
| 19,630 samples | MHT | SWGDAM | PPY12 | Yfiler | PPY23 |
| 9 marker | 11 marker | 12 marker | 17 marker | 23 marker | |
| 6083 (31.0%) | 8495 (43.3%) | 9092 (46.3%) | 15,263 (77.8%) | 18,237 (92.9%) | |
| 1131 | 1227 | 1260 | 1064 | 531 | |
| 435 | 436 | 416 | 256 | 64 | |
| 226 | 199 | 196 | 94 | 16 | |
| 114 | 101 | 106 | 63 | 6 | |
| 86 | 85 | 85 | 21 | 2 | |
| 63 | 51 | 50 | 12 | 2 | |
| 43 | 50 | 41 | 12 | 1 | |
| 29 | 29 | 34 | 9 | – | |
| 31 | 21 | 24 | 4 | – | |
| 22 | 24 | 28 | 5 | 1 | |
| 21 | 21 | 16 | 3 | ||
| 14 | 14 | 9 | 2 | ||
| 14 | 15 | 12 | 1 | ||
| 15 | 8 | 4 | 2 | ||
| 13 | 3 | 5 | – | ||
| 11 | 3 | 2 | 2 | ||
| 9 | 2 | 7 | 1 | ||
| 4 | 9 | 4 | – | ||
| 6 | – | 3 | 2 | ||
| 3 | 4 | 3 | 1 | ||
| 3 | 4 | 2 | 1 | ||
| 5 | 1 | – | – | ||
| 2 | 4 | 4 | – | ||
| 3 | 1 | 4 | 1 | ||
| 3 | 1 | 1 | – | ||
| 3 | 4 | 2 | – | ||
| n = 28 | 2 | 1 | 1 | – | |
| 3 | 1 | 1 | – | ||
| 2 | 1 | 2 | – | ||
| 13 | 11 | 7 | 1 | ||
| 9 | 8 | 11 | |||
| 5 | 4 | 6 | |||
| 5 | 7 | 3 | |||
| n ∈ (70, 80] | 5 | 1 | – | ||
| 2 | – | – | |||
| 2 | 2 | 1 | |||
| 2 | – | – | |||
| 1 | – | 1 | |||
| – | – | 1 | |||
| – | 1 | – | |||
| – | 1 | – | |||
| – | – | 1 | |||
| 1 | – | – | |||
| – | 1 | – | |||
| 1 | – | – | |||
| – | – | 1 | |||
| 1 | – | ||||
| – | 1 | ||||
| 1 | |||||
| 1 | |||||
| HD | 0.998033 | 0.999199 | 0.999394 | 0.999962 | 0.999995 |
| MP | 2.02 × 10−3 | 8.52 × 10−4 | 6.57 × 10−4 | 8.94 × 10−5 | 5.63 × 10−5 |
| MP as 1 in … | 496 | 1175 | 1522 | 11,189 | 17,760 |
| # Distinct haplotypes | 8448 | 10,852 | 11,446 | 16,820 | 18,860 |
| DC | 0.4304 | 0.5528 | 0.5831 | 0.8569 | 0.9608 |
Diversity values and forensic parameter estimates for the PPY23 panel were taken from Table 1.
MHT, minimal haplotype; SWGDAM, Scientific Working Group for DNA Analysis Methods; PPY12, PowerPlex®Y12; Yfiler, Yfiler®kit; PPY23, PowerPlex®Y23; HD, haplotype diversity; MP, match probability; DC, discrimination capacity.
Forensic parameter estimates for short STRs <220 bp included in the Yfiler and PPY23 marker panels.
| Yfiler short | PPY23 short | |
|---|---|---|
| HD | 0.998569 | 0.999695 |
| MP | 1.48 × 10−3 | 3.56 × 10−4 |
| MP as 1 in … | 675 | 2809 |
| # distinct haplotypes | 6418 | 11,702 |
| DC | 0.3269 | 0.5961 |
HD, haplotype diversity; MP, match probability; DC, discrimination capacity
Includes markers DYS456, DYS389I, DYS458, DYS19, DYS393, DYS391, GATAH4, DYS437.
Includes markers DYS576, DYS389I, DYS391, DYS481, DYS570, DYS635, DYS393, DYS458.
Pairwise RST value estimates (below the diagonal) and corresponding p value (above the diagonal) between meta-populations defined by continental residency for five forensic marker panels.
| Marker set | Continent | Africa | Asia | Europe | Latin America | North America |
|---|---|---|---|---|---|---|
| MHT | Africa | + | + | + | + | |
| Asia | 0.33675 | + | + | + | ||
| Europe | 0.31720 | 0.07390 | + | + | ||
| Latin America | 0.31534 | 0.07258 | 0.04588 | + | ||
| North America | 0.23944 | 0.05755 | 0.02525 | 0.01369 | ||
| SWGDAM | Africa | + | + | + | + | |
| Asia | 0.29365 | + | + | + | ||
| Europe | 0.25428 | 0.07710 | + | + | ||
| Latin America | 0.27268 | 0.08041 | 0.03879 | + | ||
| North America | 0.20579 | 0.06960 | 0.02477 | 0.01083 | ||
| PPY12 | Africa | + | + | + | + | |
| Asia | 0.28688 | + | + | + | ||
| Europe | 0.25591 | 0.08023 | + | + | ||
| Latin America | 0.26814 | 0.07821 | 0.03884 | + | ||
| North America | 0.20601 | 0.06886 | 0.02465 | 0.01036 | ||
| Yfiler | Africa | + | + | + | + | |
| Asia | 0.22740 | + | + | + | ||
| Europe | 0.22823 | 0.10203 | + | + | ||
| Latin America | 0.23016 | 0.08182 | 0.02695 | + | ||
| North America | 0.1755 | 0.07394 | 0.01914 | 0.0081 | ||
| PPY23 | Africa | + | + | + | + | |
| Asia | 0.26082 | + | + | + | ||
| Europe | 0.28376 | 0.06978 | + | + | ||
| Latin America | 0.28108 | 0.05315 | 0.01653 | + | ||
| North America | 0.21391 | 0.04687 | 0.01842 | 0.00895 | ||
Pairwise genetic distances (RST value estimates, below the diagonal) were obtained by an analysis of molecular variance (AMOVA). Significance of genetic distances (above the diagonal) was tested by 10,000 permutations. All p values were <10−4 (highlighted by +). MHT, minimal haplotype; SWGDAM, Scientific Working Group for DNA Analysis Methods; PPY12, PowerPlex®Y12; Yfiler, Yfiler®kit; PPY23, PowerPlex®Y23.
Fig. 5Genetic distances between populations. Pairwise RST values for PPY23 were calculated between all 129 populations and grouped by continental residency.
Fig. 6Population structure revealed by PPY23. Interpolated maps depict the first MDS components of pairwise RST values for PPY23. (a) Two-dimensional MDS analysis for the whole data set (129 populations); (b) four-dimensional MDS analysis for populations of European residency and ancestry alone (68 populations). Sample locations are marked in white. Color coding is on an arbitrary rainbow scale that assigns yellow and magenta to the opposite ends of the scale.