Literature DB >> 17058101

Variation of 52 new Y-STR loci in the Y Chromosome Consortium worldwide panel of 76 diverse individuals.

Si-Keun Lim1, Yali Xue, Emma J Parkin, Chris Tyler-Smith.   

Abstract

We have established 16 small multiplex reactions of two-four loci to amplify 52 recently described single-copy simple Y-STRs and typed these loci in a worldwide panel of 74 diverse men and two women. Two Y-STRs were found to be commonly multicopy in this sample set and were excluded from the study. Of the remaining 50, four (DYS481, DYS570, DYS576 and DYS643) showed higher diversities than the commonly used loci and can potentially provide increased haplotype discrimination in both forensic and anthropological work. Ten loci showed occasional missing alleles, duplicated peaks or intermediate-sized alleles.

Entities:  

Mesh:

Year:  2006        PMID: 17058101      PMCID: PMC1794629          DOI: 10.1007/s00414-006-0124-8

Source DB:  PubMed          Journal:  Int J Legal Med        ISSN: 0937-9827            Impact factor:   2.686


Introduction

Y-STRs have key roles in the fields of forensic genetics, anthropological genetics and genealogy because of their ability to discriminate between male lineages and provide information about the relationships between them [1, 2]. The Y chromosome haplotype reference database [3] provides a widely used compilation of haplotype information constructed from a “minimal haplotype” of nine loci or a “minHt + SWGDAM core set” of 11 loci (http://www.yhrd.org/index.html). Some applications, however, require more Y-STRs. For example, a study of ∼1,000 men from east Asia found that almost 3% (27/1,003) shared the same 16-STR haplotype [4] and thus would not be distinguished by standard analyses. Most of the STRs on the Y chromosome have now been identified [5], and a set of 52 was highlighted that seemed particularly useful because their unit size was ≥3, they were single-copy, had a simple structure and showed variation in a set of eight diverse men. These additional loci proved to be useful in the east Asian study where 46 of them allowed a male lineage characteristic of the Qing Dynasty to be defined [4], but they clearly varied considerably in their diversity [4, 5] and may vary in other properties that affect their usefulness as well. In addition, it may often be impractical or impossible to type such a large number of markers. Further studies of these loci are therefore needed to identify the most useful subset. US population data for 16 of them have been presented [6], but data from other loci and populations are lacking. We have therefore established multiplex typing procedures for all of them and examined their variation in the Y Chromosome Consortium (YCC) worldwide panel of men [7].

Materials and methods

The YCC panel consists of 74 male and two female DNAs; the men may be broken down into 26 from Africa, 26 from Asia and the Americas and 22 from Europe or the Middle East. In addition, the haplogroup R individual previously typed with all of the new markers [5] was included in this study to facilitate consistent allele calling. DNA was amplified before use with the GenomiPhi whole genome amplification kit (Amersham Biosciences, Amersham, UK) according to the manufacturer’s recommendations. A total of 52 polymorphic simple single-copy Y-STRs [5] were included in the present study. The published primers had been designed to operate under a common set of conditions and were therefore used in this study, except that a G was added to the 5’ end of the unlabelled primer if it was not already present to facilitate non-templated addition of an A to the labelled product strand [8]. Loci were tested in silico for potential interactions between primers using the AutoDimer software [9], and suitable sets were assembled into small multiplexes for experimental assessment resulting in 16 multiplexes each consisting of 2–4 loci (Table S1). Polymerase chain reactions (PCRs) were set up in 20 μl volumes containing 1× PCR buffer (Invitrogen, Paisley, UK), 1.75 mM MgCl2, 200 μM deoxynucleotide triphosphates (dNTPs; Amersham Biosciences), 1.0 unit of Platinum Taq DNA polymerase (5 U/μl, Invitrogen) with 10 pg–2 ng whole-genome-amplified DNA and primer pairs at the concentrations shown in Table S1. Thermal cycling was carried out in an MJ Research (Genetic Research Instrumentation, Braintree, UK) DNA Engine Tetrad™ 2 starting with denaturation at 95°C for 15 min, followed by 20 cycles of touchdown PCR: 94°C for 30 s, 70°C for 45 s, 72°C for 1 min, with a 1°C decrease in annealing temperature every cycle and then 15 cycles of standard PCR (94°C for 30 s, 50°C for 45 s, 72°C for 1 min) and finishing with extension at 60°C for 45 min and storage at 4°C. Products were analysed by mixing 1 μl of PCR product with 15 μl Hi–Di formamide and 0.2 μl size marker (CXR 60–400 bases, Promega UK, Southampton, UK) and running on 36 cm × 50 μm capillaries containing POP-4 polymer (Applied Biosystems) on an ABI Prism 3100 Genetic Analyzer (Applied Biosystems, Warrington, UK). Electrophoresis was carried out at 3 kV for 3 s followed by 15 kV for 45 min with a run temperature of 60°C. Allele sizes were measured using GeneMapper v3.0 (Applied Biosystems). Most loci were sequenced because of the lack of previous sequence data, to confirm previous results or to investigate the structure of intermediate-sized sizes. Such alleles were amplified using unlabelled primers and sequenced by the Wellcome Trust Sanger Institute small-scale sequencing facility using standard methods.

Results

The 52 Y-STRs were examined in the 76 YCC samples and haplogroup R control individual, but the analyses presented in this paper (Tables S2, Tables S3) are based only on the YCC data to facilitate comparisons with other YCC results [10]. As expected, no specific products were obtained from the two female YCC samples in the size range examined, and single peaks were seen in all males for 40 of the STRs. The other 12 loci showed more complex patterns (Table 1). Products from four loci were missing in one (DYS525, DYS589, DYS636) or two (DYS556) individuals. These findings were reproducible and occurred in multiplex reactions that successfully amplified other loci, so that they may represent null alleles, but their structural basis remains to be determined, and they were treated conservatively as missing data in our analyses.
Table 1

Loci showing multiple peaks, missing peaks or intermediate alleles

LocusIntermediate alleleaComments
DYF386S1Two peaks in many individuals, excluded from analysis
DYF390S1Two peaks in many individuals, excluded from analysis
DYS448Two peaks in two individuals
DYS52210 (U2Ains)Intermediate-sized allele. This insertion converts the reference sequence, which can be written ATAG ATG (ATAG)10 into ATAG AT A G (ATAG)10, which therefore has 12 copies of the ATAG repeat, but differs from the regular allele 12
DYS525No product in one individual, two peaks in one individual
DYS53111 (D6Tins)Intermediate-sized allele
DYS549Two peaks in one individual
DYS556No product in two individuals
DYS567Two peaks in two individuals
DYS576Two peaks in two individuals
DYS589No product in one individual
DYS636No product in one individual

U upstream, D downstream, ins insertion

aNomenclature based on the standard recommendations [1]

Loci showing multiple peaks, missing peaks or intermediate alleles U upstream, D downstream, ins insertion aNomenclature based on the standard recommendations [1] Two peaks were observed in many individuals for DYF390S1 and DYF386S1, and we interpreted these as duplicated loci that happened to have the same sized alleles in the small number of individuals examined before [5]; these two STRs were excluded from subsequent analyses. Five loci also showed two peaks of similar height in one (DYS525, DYS549) or two (DYS488, DYS567, DYS576) individuals, which may reflect rare duplications or somatic mutations in the YCC cell lines. In addition, two loci showed fragment sizes that did not fall into the expected size classes: DYS522 in one individual and DYS531 in 11 individuals corresponding precisely to haplogroup Q [7] and thus representing a variant characteristic of this haplogroup. The structural basis of these variants was determined by sequencing and found to arise from insertion events in the flanking sequences between the STRs and the primers (Table 1). Null alleles, occasional duplications and intermediate alleles have been found in the standard Y-STRs [1], and so we concluded that 50 of the 52 new Y-STRs merited further consideration as loci for wider use. We next examined the variation of these 50 STRs. The number of alleles ranged from two to 11, the diversity from 0.05 to 0.90 and the variance from 0.04 to 7.89 (Table 2). All of these characteristics were correlated, probably because of their common dependence on the repeat count. To interpret the values obtained, we have compared them with published data on the standard single-copy loci in the YCC panel [10]. Of the new loci, four (DYS481, DYS570, DYS576 and DYS643) showed higher diversity than the most variable standard locus DYS390 (diversity = 0.79) and 15 showed higher diversity than DYS393 (diversity = 0.66; Table 2). The discrimination of haplotypes that are not distinguished by the commonly used markers is a particularly useful property. As reported [10], eight pairs of YCC individuals carry haplotypes that are identical when the standard minimal set of Y-STRs is used. Two of these are from different populations (Mbuti Pygmy/Bantu speaker; English/German) and these were distinguished by seven and nine of the new loci, respectively. The other six pairs are from isolated populations, and these were distinguished by 2, 1, 1, 0, 0 and 0, respectively, of the new markers (Table S4). Although a total of 15 loci contribute to this increased discrimination, all of the five distinguishable haplotypes could be separated using just two of the most variable loci, DYS570 and DYS576.
Table 2

Variation of 50 new Y-STR loci in the YCC panel

LocusMean repeat countNumber of allelesDiversityVariance
DYS48123.3110.907.87
DYS57017.6110.863.89
DYS57617.370.822.16
DYS64311.190.822.33
DYS48515.970.781.92
DYF406S110.670.751.26
DYS52211.260.741.03
DYS58912.360.731.07
DYS53311.260.721.03
DYS54912.150.720.92
DYS50512.160.711.00
DYS50811.480.711.44
DYS5259.970.710.95
DYS53111.040.690.39
DYS55611.760.671.03
DYS57210.350.640.61
DYS56511.560.640.71
DYS5949.570.630.87
DYS54011.440.620.52
DYS48713.260.621.38
DYS51110.950.620.73
DYS5739.940.620.73
DYS61712.450.621.06
DYS49515.350.610.76
DYS56710.460.610.72
DYS49714.240.600.51
DYS48813.460.580.91
DYS49211.650.560.45
DYS49012.970.562.43
DYS56811.170.560.92
DYS63611.350.560.49
DYS53710.940.530.43
DYS61811.840.510.37
DYS63810.940.490.34
DYS49112.150.460.40
DYS5788.140.410.40
DYS64011.230.390.39
DYS47611.240.360.29
DYS4949.040.350.28
DYS6419.640.330.93
DYS5549.050.310.36
DYS57510.040.250.22
DYS5838.030.220.12
DYS5907.930.220.23
DYS4807.930.220.16
DYS56911.030.180.11
DYS5309.120.170.09
DYS5809.140.110.19
DYS4728.020.080.04
DYS5799.020.050.89

Loci are ordered according to their diversity.

Variation of 50 new Y-STR loci in the YCC panel Loci are ordered according to their diversity.

Discussion

We have investigated the properties of 52 new Y-STRs in a diverse worldwide set of males. We found that two of the Y-STRs were multicopy and thus not well suited to some applications and that the remaining 50 loci differed substantially in their properties. Our measurements of allele numbers, diversity and variance were overall consistent with the previous report [5]; correlation coefficients (R2 values) were 0.47, 0.58 and 0.67, respectively, but differed for some individual loci. The most variable Y-STR, in all respects, was DYS481, and this was not previously considered in detail because sequence data were not available before. Several other loci (e.g., DYS570, DYS576 and DYS643) may be particularly useful for increasing discrimination in forensic work, and the simple structure and mutational properties of this set make them the markers of choice for many population genetic studies. This is illustrated by considering the correlation between mean repeat count and variance in repeat number of the 50 simple loci: it was far higher (R2 = 0.67) than the value reported for complex Y-STRs (R2 = 0.34, [5]), suggesting that the simple STRs have simpler mutational mechanisms and may lead to more precise dates of lineages. The data in Table 2 and Table S3 now provide a basis for choosing the best simple loci and assembling them into a high-level multiplex reaction for more extensive population screening. Below is the link to the electronic supplementary material. Multiplex organization and primer concentrations (DOC 102 kb) PCR product size range and allele range (DOC 92 kb) Haplotypes of the YCC DNAs (XLS 58 kb) (Note: this table is provided as an Excel file and is the table mentioned in CE7. We transformed it into text because some journals insist on this, but the text version is difficult to interpret as CE5 highlights. The loci are ordered according to their positions in multiplexes 1–16 in Table S1 and Table S2 , and it seems most consistent to keep the same order for Table S3 ). Subdivision of minimal haplotypes by new Y-STRs (DOC 87 kb)
  9 in total

1.  A nomenclature system for the tree of human Y-chromosomal binary haplogroups.

Authors: 
Journal:  Genome Res       Date:  2002-02       Impact factor: 9.043

2.  Online reference database of European Y-chromosomal short tandem repeat (STR) haplotypes.

Authors:  L Roewer; M Krawczak; S Willuweit; M Nagy; C Alves; A Amorim; K Anslinger; C Augustin; A Betz; E Bosch; A Cagliá; A Carracedo; D Corach; A F Dekairelle; T Dobosz; B M Dupuy; S Füredi; C Gehrig; L Gusmaõ; J Henke; L Henke; M Hidding; C Hohoff; B Hoste; M A Jobling; H J Kärgel; P de Knijff; R Lessig; E Liebeherr; M Lorente; B Martínez-Jarreta; P Nievas; M Nowak; W Parson; V L Pascali; G Penacino; R Ploski; B Rolf; A Sala; U Schmidt; C Schmitt; P M Schneider; R Szibor; J Teifel-Greding; M Kayser
Journal:  Forensic Sci Int       Date:  2001-05-15       Impact factor: 2.395

3.  A comprehensive survey of human Y-chromosomal microsatellites.

Authors:  Manfred Kayser; Ralf Kittler; Axel Erler; Minttu Hedman; Andrew C Lee; Aisha Mohyuddin; S Qasim Mehdi; Zoë Rosser; Mark Stoneking; Mark A Jobling; Antti Sajantila; Chris Tyler-Smith
Journal:  Am J Hum Genet       Date:  2004-06       Impact factor: 11.025

4.  AutoDimer: a screening tool for primer-dimer and hairpin structures.

Authors:  Peter M Vallone; John M Butler
Journal:  Biotechniques       Date:  2004-08       Impact factor: 1.993

5.  Recent spread of a Y-chromosomal lineage in northern China and Mongolia.

Authors:  Yali Xue; Tatiana Zerjal; Weidong Bao; Suling Zhu; Si-Keun Lim; Qunfang Shu; Jiujin Xu; Ruofu Du; Songbin Fu; Pu Li; Huanming Yang; Chris Tyler-Smith
Journal:  Am J Hum Genet       Date:  2005-10-19       Impact factor: 11.025

6.  Allele frequencies for 27 Y-STR loci with U.S. Caucasian, African American, and Hispanic samples.

Authors:  John M Butler; Amy E Decker; Peter M Vallone; Margaret C Kline
Journal:  Forensic Sci Int       Date:  2006-01-27       Impact factor: 2.395

Review 7.  DNA Commission of the International Society of Forensic Genetics (ISFG): an update of the recommendations on the use of Y-STRs in forensic analysis.

Authors:  L Gusmão; J M Butler; A Carracedo; P Gill; M Kayser; W R Mayr; N Morling; M Prinz; L Roewer; C Tyler-Smith; P M Schneider
Journal:  Int J Legal Med       Date:  2006-07       Impact factor: 2.686

8.  Forensic value of 14 novel STRs on the human Y chromosome.

Authors:  Alan J Redd; Al B Agellon; Veronica A Kearney; Veronica A Contreras; Tatiana Karafet; Hwayong Park; Peter de Knijff; John M Butler; Michael F Hammer
Journal:  Forensic Sci Int       Date:  2002-12-04       Impact factor: 2.395

9.  Modulation of non-templated nucleotide addition by Taq DNA polymerase: primer modifications that facilitate genotyping.

Authors:  M J Brownstein; J D Carpten; J R Smith
Journal:  Biotechniques       Date:  1996-06       Impact factor: 1.993

  9 in total
  13 in total

1.  Mutation rate estimates for 110 Y-chromosome STRs combining population and father-son pair data.

Authors:  Concetta Burgarella; Miguel Navascués
Journal:  Eur J Hum Genet       Date:  2010-09-08       Impact factor: 4.246

2.  Diversity of five novel Y-STR loci and their application in studies of north Chinese populations.

Authors:  Zhaoyang Xu; Haiming Sun; Yang Yu; Yan Jin; Xaingning Meng; Donglin Sun; Jing Bai; Feng Chen; Songbin Fu
Journal:  J Genet       Date:  2010-04       Impact factor: 1.166

3.  Evaluation of miniY-STR multiplex PCR systems for extended 16 Y-STR loci.

Authors:  H Asamura; S Fujimori; M Ota; T Oki; H Fukushima
Journal:  Int J Legal Med       Date:  2007-09-26       Impact factor: 2.686

4.  Genetic sub-structure in western Mediterranean populations revealed by 12 Y-chromosome STR loci.

Authors:  V Rodríguez; C Tomàs; J J Sánchez; J A Castro; M M Ramon; A Barbaro; N Morling; A Picornell
Journal:  Int J Legal Med       Date:  2008-12-06       Impact factor: 2.686

5.  Africans in Yorkshire? The deepest-rooting clade of the Y phylogeny within an English genealogy.

Authors:  Turi E King; Emma J Parkin; Geoff Swinfield; Fulvio Cruciani; Rosaria Scozzari; Alexandra Rosa; Si-Keun Lim; Yali Xue; Chris Tyler-Smith; Mark A Jobling
Journal:  Eur J Hum Genet       Date:  2007-01-24       Impact factor: 4.246

6.  Novel Y-chromosome short tandem repeat sequence variation for loci DYS710, DYS518, DYS385, DYS644, DYS612, DYS626, DYS504, DYS481, DYS447 and DYS449.

Authors:  Mohaimin Kasu; Jamie Fredericks; Mischa Fraser; Christiaan Labuschagne; Mpasi Lesaoana; Maria Eugenia D'Amato
Journal:  Int J Legal Med       Date:  2019-04-13       Impact factor: 2.686

7.  Extended Y chromosome haplotypes resolve multiple and unique lineages of the Jewish priesthood.

Authors:  Michael F Hammer; Doron M Behar; Tatiana M Karafet; Fernando L Mendez; Brian Hallmark; Tamar Erez; Lev A Zhivotovsky; Saharon Rosset; Karl Skorecki
Journal:  Hum Genet       Date:  2009-08-08       Impact factor: 4.132

8.  Improving global and regional resolution of male lineage differentiation by simple single-copy Y-chromosomal short tandem repeat polymorphisms.

Authors:  Mark Vermeulen; Andreas Wollstein; Kristiaan van der Gaag; Oscar Lao; Yali Xue; Qiuju Wang; Lutz Roewer; Hans Knoblauch; Chris Tyler-Smith; Peter de Knijff; Manfred Kayser
Journal:  Forensic Sci Int Genet       Date:  2009-02-23       Impact factor: 4.882

9.  Human Y chromosome base-substitution mutation rate measured by direct sequencing in a deep-rooting pedigree.

Authors:  Yali Xue; Qiuju Wang; Quan Long; Bee Ling Ng; Harold Swerdlow; John Burton; Carl Skuce; Ruth Taylor; Zahra Abdellah; Yali Zhao; Daniel G MacArthur; Michael A Quail; Nigel P Carter; Huanming Yang; Chris Tyler-Smith
Journal:  Curr Biol       Date:  2009-08-27       Impact factor: 10.834

10.  A worldwide survey of human male demographic history based on Y-SNP and Y-STR data from the HGDP-CEPH populations.

Authors:  Wentao Shi; Qasim Ayub; Mark Vermeulen; Rong-guang Shao; Sofia Zuniga; Kristiaan van der Gaag; Peter de Knijff; Manfred Kayser; Yali Xue; Chris Tyler-Smith
Journal:  Mol Biol Evol       Date:  2009-10-12       Impact factor: 16.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.