Literature DB >> 29739989

Haplotypic polymorphisms and mutation rate estimates of 22 Y-chromosome STRs in the Northern Chinese Han father-son pairs.

Yaran Yang1, Weini Wang2, Feng Cheng3, Man Chen1,4, Tong Chen1,4, Jing Zhao1,4, Chong Chen5, Yan Shi5, Chen Li6, Chuguang Chen6, Yacheng Liu5, Jiangwei Yan7,8,9.   

Abstract

Y chromosome Short tandem repeats (Y-STRs) analysis has been widely used in forensic identification, kinship testing, and population evolution. An accurate understanding of haplotype and mutation rate will benefit these applications. In this work, we analyzed 1123 male samples from Northern Chinese Han population which including 578 DNA-confirmed father-son pairs at 22 Y-STRs loci. A total of 537 haplotypes were observed and the overall haplotype diversity was calculated as 1.0000 ± 0.0001. Except that only two haplotypes were observed twice, all the rest of the 535 were unique. Furthermore, totally 47 mutations were observed during 13,872 paternal meiosis. The mutation rate for each locus estimates ranged from 0.0 to 15.6 × 10-3 with an average mutation rate 3.4 × 10-3 (95% CI 2.5-4.5 × 10-3). Among the 22 loci, DYS449, DYS389 II and DYS458 are the most prone to mutations. This study adds to the growing data on Y-STR haplotype diversity and mutation rates and could be very useful for population and forensic genetics.

Entities:  

Mesh:

Year:  2018        PMID: 29739989      PMCID: PMC5940815          DOI: 10.1038/s41598-018-25362-3

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Y chromosome Short tandem repeats (Y-STRs) are widely used in genetic epidemiology[1], forensic genetics[2] and human migration[3] because of its paternal inheritance and human population structuring[4]. However, just the same as autosomal STR, Y-STRs also have high mutation rates[5]. Therefore, reliable estimates of mutation rates of Y-STRs are a prerequisite for the accurate application based on Y-STR analysis. Several studies on estimating Y-STR mutation rates had been reported, such as investigating the father–son pairs from confirmed paternity[6], male individuals from deep-rooted pedigrees[7], genotyping sperm cells[8], and using Y-STR population data with known history[9]. Of these approaches, estimating Y-STR mutation rates through the direct observation of allelic transmission between father and son is the most accurate, as long as large numbers of meiosis could be investigated. In this study, We determined the haplotypes and mutation rates for the 22 Y-STRs, DYS19, DYS385a/b, DYS388, DYS389I/II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438, DYS439, DYS444, DYS447, DYS448, DYS449, DYS456, DYS458, DYS522, DYS527a/b, DYS635 and Y-GATA-H4 in 1123 Northern Chinese Han male individuals from 578 father–son pairs.

Materials and Methods

Samples and DNA extraction

Blood samples were collected from 1123 healthy Northern Chinese Han male individuals. Among these individuals, there had 578 father–son pairs. All father–son pairs were confirmed by using autosomal STRs typing based on 39 autosomal STRs by using MicroreaderTM 21 ID and 23 SP system (Microread Genetics Incorporation, China), with a minimum paternity probability of 99.99%. All individuals signed the informed consent before participating in this study. Genomic DNA was extracted using Chelex resin method[10]. The quantity of DNA was quantified by Qubit® Quantitation System (Invitrogen, CA, USA). All experiments of this study were carried out in accordance with the guidelines and regulations of the Ethical Committee of Beijing Institute of Genomics, Chinese Academy of Sciences (Protocol name: A study on the Haplotypic polymorphisms and mutation rate estimates of Y-chromosome STRs in father–son pairs. No. 2016033).

Multiplex PCR amplification and genotyping

The samples were amplified 22 Y-STR loci (DYS19, DYS385a/b, DYS388, DYS389I/II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438, DYS439, DYS444, DYS447, DYS448, DYS449, DYS456, DYS458, DYS522, DYS527a/b, DYS635 and Y-GATA-H4) using AGCU™ Y24 Plus amplification kit (AGCU ScienTech Incorporation, Wuxi, China) following manufacturer’s recommendations. All the PCR amplification was proceeded respectively in an ABI PRISM® GeneAmp® 9700 thermal cycler. PCR products were detected on ABI PRISM® 3130xl Genetic Analyzer according to the manufacturer’s recommendations. Electrophoretic result was analyzed using GeneMapper® ID-X software.

Quality control

A male DNA sample 9948 (Promega Corporation, WI, USA) and female DNA sample 9947 A (Promega Corporation) were used as reference and negative control for each batch of genotyping. All the experiments were carried out at the laboratory accredited by the China National Accreditation Service (CNAS) and strictly followed the recommendations on the analysis of Y-STRs by DNA Commission of the International Society of Forensic Genetics (ISFG)[11].

Statistical analysis

Haplotype and allele frequencies were calculated by the gene counting method. Gene diversity (GD) for each locus was calculated using the formula: GD = [n (1 − ∑pi2)]/(n − 1), where n is the number of alleles, pi is the frequency of the ith allele. Discrimination capacity (DC) was determined as DC = Ndiff/N, where Ndiff and N was the number of different haplotypes and the sample size, repectivly. Haplotype diversity (HD) and Standard error (SE) was calculated according to Nei’s formula[12]. Mutation rates were calculated as the number of mutations divided by the number of Meiosis. Confidence intervals (CI) were estimated from the binominal standard deviation[13]. In mutation counting, there were two father–son pairs where one-step mutation seen for both DYS389I and DYS389II, for instance (13, 29) → (14, 30). These were treated as one mutation instead of two because DYS389I is part of the sequence called DYS389II. According to the repeat numbers of alleles per locus, the alleles were categorized into short (25%), medium (50%) and long class (25%) as described by Ge et al.[14], used to evaluate the relationship between allele size and corresponding mutation rate.

Results and Discussion

Allele frequencies and gene diversity

Allele frequencies and gene diversity values for each locus are listed in Supplemental Table 1. A total of 190 alleles were detected at 22 Y-STR loci with the allele frequencies ranged from 0.0018 to 0.7676. The number of alleles at each locus ranged from 4 for Y_GATA_H4 to 15 for DYS447. At two multi-copy loci DYS385a/b and DYS527a/b, 71 allelic combinations with 16 separated alleles and 40 allelic combinations with 11 alleles were observed, repectively. Single-copy locus DYS449 and multi-copy locus DYS385a/b showed the highest GD values as 0.8883 and 0.9658, respectively. Except DYS391 and DYS438, the GD values for the other 20 loci were all obove 0.5, which suggests high polymorphisms in the Northern Chinese Han population.

Haplotype diversity

The haplotype distribution in a sample of 539 unrelated Northern Chinese Han males for the 22 Y-STRs is shown in the Supplementary Table 2. Haplotype diversity and forensic parameter based on various combinations of Y-STRs, such as minimal haplotype, extended haplotype, Y filer and this study are shown in Table 1. Within these unrelated 539 Northern Chinese Han individuals, a total of 537 haplotypes were observed at the 22-loci resolution and the haplotype diversity value of was 0.99998. 535 haplotypes (99.63%) were observed once and only 2 were observed twice (0.37%). 492 haplotypes were observed at the minimal haplotype STRs resolution (DYS19, DYS389I/II, DYS390, DYS391, DYS392, DYS393, and DYS385). For the extended haplotype STRs (minimal haplotype STRs, DYS438 and DYS439), 517 haplotypes were observed.
Table 1

Haplotype diversity and forensic parameters for different combinations of Y-STRs in 539 unrelated Northern Chinese Han.

No. of observed haplotypesCombined Y-STRs
Minimal haplotypeExtended haplotypeY filerThis study
1(unique)458500531535
2261342
34300
43100
51000
Number of haplotypes492517535537
Unique haplotypes458500531535
Fraction of unique haplotypes93.09%96.71%99.25%99.63%
Discrimination capacity0.9995370.9997980.9999630.999977
Haplotype diversity ± SD0.9995 ± 0.00020.9998 ± 0.00011.0000 ± 0.00011.0000 ± 0.0001
Haplotype diversity and forensic parameters for different combinations of Y-STRs in 539 unrelated Northern Chinese Han. In the case of 17 Y-STRs (extended haplotype STRs, DYS437, DYS448, DYS456, DYS458, DYS635, and GATA H4.1) from AmpFlSTR® Yfiler™ kit, 535 haplotypes were observed with the haplotype diversity value of 0.99996. From these, 531 haplotypes (99.25%) were observed once, 4 were observed twice (0.75%). Although the number of unique haplotype increased when additional Y-STR loci were combined, however, in this study, only 2 unique haplotype were increased with 5 loci were added compared with Y filer. This suggest that to achieve the goal for high haplotype resolution for Y-STR analysis, selecting appropriate loci, such as the Rapidly mutating Y-STRs[15], should be considered.

Variant alleles

Thirty four copy number variants were detected in 1123 males. Variant alleles were confirmed by re-amplification and genotyping. Null alleles were observed at DYS448 (6 father–son pairs), DYS19 (1 father–son pair) and DYS527a/b (1 father–son pair). Primers were designed for larger PCR fragments of these 3 loci, but failed to produce amplicons in the test samples (data not shown). DYS448 is located within the azoospermia factor c gene (AZFc) in the distal euchromatic part of the Y chromosome long arm. AZFc consists almost entirely of very long direct and inverted repeats. Therefore, it is prone to partial deletions or duplications by rearrangements[16]. The DYS448 null allele has been reported by several studies[17-20]. The relatively high frequencies of the DYS448 null allele in Asians suggest giving careful consideration to the use of DYS448 for commercial genotyping and further database construction in Asians. Triplications were observed at DYS527a/b (8 father–son pairs) and DYS385 a/b (1 father–son pair). These variants are not rare in forensic casework and they should be interpreted carefully to exclude mixed profiles. These variants have been considered due to non-allelic, homologous recombination[21].

Mutation rates

In this study, 578 meiosis from fathers to sons were observed, in which 47 mutations were found at all the studied loci except DYS47, DYS438, DYS447, DYS522, and DYS388 (Table 2). There are no more than one locus mutations in the same father-son pair. Except one three-step mutation occurred at DYS449 (32 → 29), all remaining mutations were single step, namely, 97.9% mutations were one step. This finding is consistent with the general notion that the majority of mutations comprise single step repeat gain or loss due to strand slippage during replication[22]. Among these 47 mutations, 26 mutations (i.e., 55.3%) gained repeats, and 21 mutations (i.e., 44.7%) lost repeats. Hence, the data herein support that mutations at these Y chromosome microsatellites do not have any contraction or expansion bias.
Table 2

The mutation and rates for the 22 Y-STR loci studied in Northern Chinese Han.

LocusNo. of meiosisMutation countNo. of mutations with repeat gainsNo. of mutations with repeat lossesMutation rate × 10−395% CI × 10−3
DYS19578111.70–9.6
DYS385a/b11564223.50.9–8.8
DYS389I5782113.50.4–13.2
DYS389II5785328.70.3–21.3
DYS3905783215.21.1–16.0
DYS391578111.70–10.2
DYS392578111.70–10.2
DYS393578111.70–10.2
DYS437578000–6.4
DYS438578000–6.4
DYS439578446.91.9–17.6
DYS448578111.70–9.6
DYS456578111.70–10.2
DYS4585785328.70.3–21.3
DYS635578111.70–10.2
Y_GATA_H4578111.70–10.2
DYS44957894515.67.6–31.1
DYS447578000–6.4
DYS522578000–6.4
DYS388578000–6.4
DYS444578223.50.4–13.2
DYS527a/b11565144.31.4–10.1
Summary138724726213.42.5–4.5
The mutation and rates for the 22 Y-STR loci studied in Northern Chinese Han. The average mutation rate across these 22 Y-STR loci was 0.0034 (95% confidence interval (CI), 0.0025–0.0045), which was close to the average mutation rates across 16 Y-STR markers of the Texas populations (i.e., 0.0021) by Ge et al.[14] and the South China Han population (i.e., 0.0023) by Weng et al.[23] The mutation rates of the 22 Y-STR loci ranged from 0.0000 (95% CI, 0.0000–0.0064) to 0.0156 (95% CI, 0.0076–0.0311). Mutation counts and rates by relative allele sizes (short, moderate, and long) for each locus is shown in Table 3. In the Northern Chinese Han population, the mutation rate of long alleles (6.9 × 10−3) is significantly greater than short (1.9 × 10−3) and moderate (2.5 × 10−3) alleles. Therefore, the longer alleles are more likely to be mutated than short alleles, which is consistent with the previous studies[14,23,24].
Table 3

Mutation counts and rates by relative allele sizes (short, moderate, and long) for each locus.

LocusMotifAllele rangeShortModerateLong
DYS19TAGA10–191
DYS385a/bGAAA7–284
DYS389I(TCTG)(TCTA)9–172
DYS389II(TCTG)(TCTA)(TCTG)(TCTA)24–345
DYS390(TCTA)(TCTG)17–283
DYS391TCTA6–141
DYS392TAT6–171
DYS393AGAT9–171
DYS437TCTA13–17
DYS438TTTTC6–14
DYS439AGAT9–1413
DYS448AGAGAT20–261
DYS456AGAT13–181
DYS458GAAA13–2023
DYS635TSTA compound17–271
Y_GATA_H4TAGA8–131
DYS449TTTC26–369
DYS447TAAWA compound22–29
DYS522GATA8–17
DYS388ATT10–18
DYS444TAGA11–1511
DYS527a/b(GAAA)(GGAA)15–285
No. of mutations61823
No. of alleles320073463326
Mutation rate × 10−31.92.56.9
95% CI × 10−30.7–4.11.5–3.94.4–10.4

The allele sizes are generally categorized into 25%, 50%, and 25% quantiles for short, moderate and long allele sizes, respectively.

Mutation counts and rates by relative allele sizes (short, moderate, and long) for each locus. The allele sizes are generally categorized into 25%, 50%, and 25% quantiles for short, moderate and long allele sizes, respectively. It is more accurate to estimate the Y-STR mutation rate is by testing a large number of meiosis from father-son pairs. Ballantyne et al.[25] provided Y-STR mutation rates for a large number of Y-STR markers in a reasonably large number of up to 2000 DNA-confirmed father-son pairs collected from the Germany and Poland. Burgella et al.[26] performed a meta-analysis to estimate the mutation rate for 110 Y-STRs combining population and father–son pair data. A comparison of our data to these published rates was shown in Table 4. The mutation rates for most of the shared loci were similar except DYS449, which was 1.9 × 10−3 reported by Burgarella and only approximately one eighth and one seventh of our and Ballantyne’s study.
Table 4

Comparison between the mutation rates of this study and other two published studies with large number of meiosis.

LocusThis studyBallantyne et al.[25]Burgarella et al.[26]
No. of meiosisMutation countMutation rate × 10−395% CI × 10−3No. of meiosisMutation countMutation rate × 10−395% CI × 10−3No. of meiosisMutation countMutation rate × 10−395% CI × 10−3
DYS1957811.70–9.6175674.42.0–8.214632322.21.6–3.1
DYS385 a/b*115643.50.9–8.8176232.10.6–5.1////
161564.11.8–8.1
DYS389 I57823.50.4–13.2175195.52.7–9.712651322.51.8–3.6
DYS389 II57858.70.3–21.3174363.81.6–7.5////
DYS39057835.21.1–16.0175821.50.4–4.114131302.11.5–3.0
DYS39157811.70–10.2175953.21.3–6.713995382.71.8–3.7
DYS39257811.70–10.2172819.70.1–3.21394860.40.2–0.9
DYS39357811.70–10.2175032.10.6–5125761310.6–1.8
DYS437578000–6.4176021.50.4–4.19238101.10.6–2
DYS438578000–6.41751110.1–3.2933940.40.2–1.1
DYS43957846.91.9–17.6173663.81.6–7.59313515.84.2–7.2
DYS44857811.70–9.61747000.01–2.16655111.70.9–3
DYS45657811.70–10.2175784.92.4–96664304.53.2–6.4
DYS45857858.70.3–21.31756148.44.8–13.46684466.95.2–9.2
DYS63557811.70–10.2173263.91.6–7.67434233.12.1–4.6
Y_GATA_H457811.70–10.2175553.21.3–6.67618212.81.8–4.2
DYS449578915.67.6–31.116171912.27.5–18.536971.99.2–38.6
DYS447578000–6.4172232.10.6–5.165834.61.6–13.3
DYS522578000–6.41620110.2–3.4555000–6.9
DYS388578000–6.41635000.02–2.3239410.40.02–2.4
DYS44457823.50.4–13.2177595.52.7–9.780000–45.8
DYS527 a/b115654.31.4–10.1////////

*The mutation rate for DYS385 a/b was calculated separatly as DYS385 a and DYS385 b by Ballantyne’s study.

/The data were absent due to these loci were not included in the study.

Comparison between the mutation rates of this study and other two published studies with large number of meiosis. *The mutation rate for DYS385 a/b was calculated separatly as DYS385 a and DYS385 b by Ballantyne’s study. /The data were absent due to these loci were not included in the study.

Conclusion

In this study, we investigated the haplotype diversity and estimated mutation rates for 22 Y-STRs in 578 father–son pairs in a Northern Chinese Han population. We detected 537 distinct haplotypes in 539 male individuals, which indicating a high power to distinguish unrelated male individuals. Furthermore, totally 47 mutations were observed during 13,872 paternal meiosis. The mutation rate for each locus estimates ranged from 0.0 to 15.6 × 10−3 with an average mutation rate 3.4 × 10−3 (95% CI 2.5–4.5 × 10−3). This study adds to the growing data on Y-STR haplotype diversity and mutation rates. It could be very useful for population and forensic genetics. However, to obtain precise knowledge of haplotype and mutation rate, more number of meiosis analyses involving more Y-STRs loci should be performed. Supplementary table 1 Supplementary table 2
  23 in total

1.  Characteristics and frequency of germline mutations at microsatellite loci from the human Y chromosome, as revealed by direct observation in father/son pairs.

Authors:  M Kayser; L Roewer; M Hedman; L Henke; J Henke; S Brauer; C Krüger; M Krawczak; M Nagy; T Dobosz; R Szibor; P de Knijff; M Stoneking; A Sajantila
Journal:  Am J Hum Genet       Date:  2000-04-06       Impact factor: 11.025

2.  Mutation rates at two human Y-chromosomal microsatellite loci using small pool PCR techniques.

Authors:  U Holtkemper; B Rolf; C Hohoff; P Forster; B Brinkmann
Journal:  Hum Mol Genet       Date:  2001-03-15       Impact factor: 6.150

3.  Mutations at Y-STR loci: implications for paternity testing and forensic analysis.

Authors:  M Kayser; A Sajantila
Journal:  Forensic Sci Int       Date:  2001-05-15       Impact factor: 2.395

4.  The effective mutation rate at Y chromosome short tandem repeats, with application to human population-divergence time.

Authors:  Lev A Zhivotovsky; Peter A Underhill; Cengiz Cinnioğlu; Manfred Kayser; Bharti Morar; Toomas Kivisild; Rosaria Scozzari; Fulvio Cruciani; Giovanni Destro-Bisol; Gabriella Spedini; Geoffrey K Chambers; Rene J Herrera; Kiau Kiun Yong; David Gresham; Ivailo Tournev; Marcus W Feldman; Luba Kalaydjieva
Journal:  Am J Hum Genet       Date:  2003-12-19       Impact factor: 11.025

Review 5.  The human Y chromosome: an evolutionary marker comes of age.

Authors:  Mark A Jobling; Chris Tyler-Smith
Journal:  Nat Rev Genet       Date:  2003-08       Impact factor: 53.242

6.  16 Y chromosomal STR haplotypes in Japanese.

Authors:  Natsuko Mizuno; Hiroaki Nakahara; Kazumasa Sekiguchi; Kanako Yoshida; Minoru Nakano; Kentaro Kasai
Journal:  Forensic Sci Int       Date:  2007-03-12       Impact factor: 2.395

7.  Twelve short tandem repeat loci Y chromosome haplotypes: genetic analysis on populations residing in North America.

Authors:  Bruce Budowle; Mike Adamowicz; Xavier G Aranda; Charles Barna; Ranajit Chakraborty; Dan Cheswick; Bradley Dafoe; Arthur Eisenberg; Roger Frappier; Ann Marie Gross; Carll Ladd; Hee-Suk Lee; Scott C Milne; Carole Meyers; Mechthild Prinz; Melanie L Richard; Gabriela Saldanha; Amy A Tierney; Lori Viculis; Benjamin E Krenke
Journal:  Forensic Sci Int       Date:  2005-05-28       Impact factor: 2.395

8.  Estimating Y chromosome specific microsatellite mutation frequencies using deep rooting pedigrees.

Authors:  E Heyer; J Puymirat; P Dieltjes; E Bakker; P de Knijff
Journal:  Hum Mol Genet       Date:  1997-05       Impact factor: 6.150

9.  Mutation rates at 16 Y-chromosome STRs in the South China Han population.

Authors:  Weixia Weng; Hong Liu; Shuanglin Li; Jianye Ge; Huijun Wang; Chao Liu
Journal:  Int J Legal Med       Date:  2012-10-23       Impact factor: 2.686

10.  Comprehensive mutation analysis of 17 Y-chromosomal short tandem repeat polymorphisms included in the AmpFlSTR Yfiler PCR amplification kit.

Authors:  Miriam Goedbloed; Mark Vermeulen; Rixun N Fang; Maria Lembring; Andreas Wollstein; Kaye Ballantyne; Oscar Lao; Silke Brauer; Carmen Krüger; Lutz Roewer; Rüdiger Lessig; Rafal Ploski; Tadeusz Dobosz; Lotte Henke; Jürgen Henke; Manohar R Furtado; Manfred Kayser
Journal:  Int J Legal Med       Date:  2009-03-26       Impact factor: 2.686

View more
  2 in total

1.  Empirical Evidence on Enhanced Mutation Rates of 19 RM-YSTRs for Differentiating Paternal Lineages.

Authors:  Faqeeha Javed; Muhammad Shafique; Dennis McNevin; Muhammad Usama Javed; Abida Shehzadi; Ahmad Ali Shahid
Journal:  Genes (Basel)       Date:  2022-05-26       Impact factor: 4.141

2.  Development and validation of a novel 133-plex forensic STR panel (52 STRs and 81 Y-STRs) using single-end 400 bp massive parallel sequencing.

Authors:  Haoliang Fan; Lingxiang Wang; Changhui Liu; Xiaoyu Lu; Xuding Xu; Kai Ru; Pingming Qiu; Chao Liu; Shao-Qing Wen
Journal:  Int J Legal Med       Date:  2021-11-06       Impact factor: 2.791

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.