| Literature DB >> 32438591 |
Yuxiang Zhou1, Yining Yao1, Baonian Liu1, Qinrui Yang1, Zhihan Zhou1, Chengchen Shao1, Shilin Li2, Qiqun Tang3, Jianhui Xie1.
Abstract
Y-chromosomal short tandem repeats (Y-STRs) are widely used in human research for the evaluation of population substructure or population differentiation. Previous studies show that several haplotype sets can be used for the evaluation of population differentiation. However, little is known about whether each Y-STR in these sets performs well during this procedure. In this study, a total of 20,927 haplotypes of a Yfiler Plus set were collected from 41 global populations. Different configurations were observed in multidimensional scaling (MDS) plots based on pairwise genetic distances evaluated using a Yfiler set and a Yfiler Plus set, respectively. Subsequently, 23 single-copy Y-STRs were characterized in the evaluation of population differentiation using the mean of allele frequency difference (mAFD) between populations. Our results indicated that DYS392 had the largest mAFD value (0.3802) and YGATAH4 had the smallest value (0.1845). On the whole, larger pairwise genetic distances could be obtained using the set with the top fifteen markers from these 23 single-copy Y-STRs, and clear clustering or separation of populations could be observed in the MDS plot in comparison with those using the set with the minimum fifteen markers. In conclusion, the mAFD value is reliable to characterize Y-STRs for efficiency in the evaluation of population differentiation.Entities:
Keywords: Y-STR; allele frequency difference; multidimensional scaling; pairwise genetic distance; population differentiation
Mesh:
Year: 2020 PMID: 32438591 PMCID: PMC7290957 DOI: 10.3390/genes11050566
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1The multidimensional scaling (MDS) plots obtained based on pairwise genetic distances between populations with a Yfiler set (A) and a Yfiler Plus set (B), respectively.
Figure 2Pairwise populations are ranked in an ascending order of pairwise genetic distances with a Yfiler set and are set as the horizontal axis. Pairwise genetic distances with a Yfiler set and a Yfiler Plus set are indicated, respectively.
Allele frequency difference (mAFD) values and mutation rates of 23 Y-chromosomal short tandem repeats (Y-STRs).
| Loci | mAFD | Mutation Rates # | Binomial 95%CI | |
|---|---|---|---|---|
| 1 |
| 0.3802 | 4.96 × 10−4 | 2.14 × 10−4–9.77 × 10−4 |
| 2 |
| 0.3507 | 3.51 × 10−4 | 0.96 × 10−4–8.99 × 10−4 |
| 3 |
| 0.3379 | 4.21 × 10−3 | 2.97 × 10−3–5.80 × 10−3 |
| 4 | DYS481 | 0.3359 | 4.88 × 10−3 | 2.52 × 10−3–8.51 × 10−3 |
| 5 | DYS449 | 0.3271 | 9.44 × 10−3 | 5.78 × 10−3–1.45 × 10−2 |
| 6 |
| 0.3257 | 1.39 × 10−3 | 6.92 × 10−4–2.48 × 10−3 |
| 7 | DYS533 | 0.3104 | 3.68 × 10−3 | 1.68 × 10−3–6.97 × 10−3 |
| 8 |
| 0.3053 | 2.20 × 10−3 | 1.55 × 10−3–3.04 × 10−3 |
| 9 |
| 0.2878 | 1.07 × 10−3 | 6.11 × 10−4–1.74 × 10−3 |
| 10 | DYS518 | 0.2786 | 1.46 × 10−2 | 9.86 × 10−3–2.08 × 10−2 |
| 11 | DYS627 | 0.2666 | 1.32 × 10−2 | 8.95 × 10−3–1.88 × 10−2 |
| 12 |
| 0.2504 | 2.08 × 10−3 | 1.44 × 10−3–2.91 × 10−3 |
| 13 |
| 0.2494 | 2.72 × 10−3 | 1.96 × 10−3–3.69 × 10−3 |
| 14 | DYS570 | 0.2372 | 1.14 × 10−2 | 7.67 × 10−3–1.62 × 10−2 |
| 15 |
| 0.2345 | 6.17 × 10−3 | 4.57 × 10−3–8.15 × 10−3 |
| 16 | DYS460 | 0.2278 | 5.82 × 10−3 | 2.80 × 10−3–1.69 × 10−2 |
| 17 |
| 0.2273 | 1.32 × 10−3 | 7.39 × 10−4–2.18 × 10−3 |
| 18 |
| 0.2211 | 5.46 × 10−3 | 4.19 × 10−3–6.99 × 10−3 |
| 19 | DYS576 | 0.2136 | 1.26 × 10−2 | 8.86 × 10−3–1.73 × 10−2 |
| 20 |
| 0.2131 | 4.33 × 10−3 | 3.34 × 10−3–5.51 × 10−3 |
| 21 |
| 0.2106 | 4.41 × 10−3 | 3.07 × 10−3–6.12 × 10−3 |
| 22 |
| 0.1891 | 2.53 × 10−3 | 1.82 × 10−3–3.43 × 10−3 |
| 23 |
| 0.1845 | 3.01 × 10−3 | 1.98 × 10−3–4.38 × 10−3 |
# The data of mutation rates is from the Y Chromosome Haplotype Reference Database (YHRD) website (release 61, Jun 2019) [19]. Y-STRs in the Yfiler set are indicated in bold.
Figure 3The mAFD values and mutation rates of 23 Y-STRs. Blue line: mAFD; Orange line: Mutation rates.
Figure 4Pairwise populations are ranked in an ascending order of pairwise genetic distances with the 15-marker set of the minimum mAFD values, and are set as the horizontal axis. Pairwise genetic distances with the 15-marker set of the largest mAFD values and of the minimum mAFD values are indicated, respectively.
Figure 5The MDS plots obtained based on pairwise genetic distances between populations with the marker sets of the largest (A) and the minimum (B) mAFD values, respectively.