| Literature DB >> 35166077 |
Mi Hyeon Moon1,2, Sae Rom Hong1, Kyoung-Jin Shin1,3.
Abstract
BACKGROUND: Rapidly mutating (RM) Y-chromosomal short tandem repeats (Y-STRs) have been demonstrated to increase the possibility of distinguishing between male relatives due to a higher mutation rate than conventional Y-STRs. Massively parallel sequencing (MPS) can be useful for forensic DNA typing as it allows the detection of sequence variants of many forensic markers. Here, we present sequence variations of 31 Y-STRs including nine RM Y-STRs (DYF387S1, DYF399S1, DYF404S1, DYS449, DYS518, DYS570, DYS576, DYS612, and DYS627), their frequencies, distribution, and the gain in the number of alleles using MPS.Entities:
Keywords: Massively Parallel Sequencing; RM Y-STR; Sequence Variation; Y-STR
Mesh:
Year: 2022 PMID: 35166077 PMCID: PMC8845103 DOI: 10.3346/jkms.2022.37.e40
Source DB: PubMed Journal: J Korean Med Sci ISSN: 1011-8934 Impact factor: 2.153
Fig. 1Comparison of the number of observed alleles by CE and MPS analysis of 31 Y-STRs across four populations (n = 220). The blue bar presents the number of alleles by size-based method and the red bar presents the number of alleles observed by repeat region variations resulted from MPS method. The green bar presents the number of alleles observed by flanking region SNP resulted from MPS method. The nine RM Y-STRs used in this study are listed at the top of the figure.
CE = capillary electrophoresis, MPS = massively parallel sequencing, SNP = single nucleotide polymorphism, STR = short tandem repeat, RM = rapidly mutating.
Frequencies of sequence-based motif across four populations
| Markera | Motif structureb | Flanking SNP | Population frequencyc | ||||
|---|---|---|---|---|---|---|---|
| AfAm | Cauc | Hisp | Kor | ||||
|
| |||||||
| 18–24 | [GAAA]3 N7 [GAAA]n | rs4306075 | 0.333 | 0.333 | 0.242 | 0.361 | |
| 19–27 | [GAAA]3 N7 [GAAA]n | rs878949651 | 0.137 | 0.367 | 0.418 | 0.278 | |
| 21–26 | [GAAA]3 N7 [GAAA]n |
| - | - | 0.003 | ||
| 17.1–29.1 | [GAAA]3 N8 [GAAA]n | 0.137 | 0.286 | 0.319 | 0.348 | ||
| All other motifs | - | 0.014 | 0.021 | 0.010 | |||
|
| |||||||
| 9–19 | [TTTC]n | 1.000 | 0.989 | 0.988 | 1.000 | ||
| 13.2, 14.2 | [TTTC]n TC [TTTC]2 | - | 0.011 | 0.012 | - | ||
| All other motifs | - | - | - | - | |||
|
| |||||||
| 25–34 | [TTTC]n N50 [TTTC]m | 1.000 | 0.980 | 1.000 | 0.942 | ||
| 29 | [TTTC]14 TCTC N50 [TTTC]14 | - | 0.020 | - | - | ||
| 31 | [TTTC]2 TATC [TTTC]12 N50 [TTTC]16 | - | - | - | 0.010 | ||
| 31 | CTTC [TTTC]15 N50 [TTTC]15 | - | - | - | 0.010 | ||
| 30.2 | [TTTC]16 N50 [TTTC]10 TT [TTTC]4 | - | - | - | 0.038 | ||
| All other motifs | - | - | - | - | |||
|
| |||||||
| 34–45 | [AAAG]3 GAAG [AAAG]n GGAG [AAAG]4 gaagag [AAAG]m | 1.000 | 0.960 | 0.960 | 0.971 | ||
| 39 | [AAAG]3 GAAG [AAAG]n GGAG [AAAG]4 gaagag [AAAG]m [GAAG]o | - | 0.020 | 0.020 | - | ||
| 38 | [AAAG]3 GAAG [AAAG]16 GGAG [AAAG]4 gaagag [AAAG]2 AAAA [AAAG]10 | - | 0.020 | - | - | ||
| 37 | [AAAG]3 GAAG [AAAG]18 gaagag [AAAG]15 | - | - | 0.020 | - | ||
| 37.2, 38.2 | [AAAG]3 GAAG [AAAG]2 AA [AAAG]n GGAG [AAAG]4 gaagag [AAAG]m | - | - | - | 0.029 | ||
| All other motifs | - | - | - | - | |||
|
| |||||||
| 13–23 | [TTTC]n | 1.000 | 0.980 | 0.916 | 0.992 | ||
| 19 | TTCC [TTTC]18 | - | 0.020 | 0.021 | - | ||
| 17 | [TTTC]9 TTTG [TTTC]7 | - | - | 0.021 | - | ||
| 22 | [TTTC]5 TCTC [TTTC]16 | - | - | 0.042 | - | ||
| All other motifs | - | - | - | 0.008 | |||
|
| |||||||
| 27–41 | [CCT]5 CTT [TCT]4 CCT [TCT]n | 1.000 | 1.000 | 1.000 | 0.895 | ||
| 32–40 | [CCT]5 CTT [TCT]3 TTT CCT [TCT]n | - | - | - | 0.105 | ||
| All other motifs | - | - | - | - | |||
|
| |||||||
| 15–25 | [AGAG]3 [AAAG]n | 1.000 | 0.980 | 1.000 | 0.990 | ||
| 19.2, 21.2 | [AGAG]n AG [AAAG]m | - | 0.020 | - | 0.010 | ||
| All other motifs | - | - | - | - | |||
|
| |||||||
| 9–22 | [GAAA]n | 1.000 | 1.000 | 0.957 | 1.000 | ||
| 16 | [GAAA]2 TAAA [GAAA]13 | - | - | 0.032 | - | ||
| 13.2 | [GAAA]2 AA [GAAA]11 | - | - | 0.011 | - | ||
| All other motifs | - | - | - | - | |||
|
| |||||||
| 21–27 | [TAGA]4 CAGA [TAGA]n [CAGA]m | 1.000 | 0.940 | 1.000 | 1.000 | ||
| 25 | [TAGA]4 CAGA [TAGA]n [CAGA]m | rs766823340 | - | 0.040 | - | - | |
| 24 | [TAGA]16 [CAGA]8 | - | 0.020 | - | - | ||
| All other motifs | - | - | - | - | |||
|
| |||||||
| 10–16 | [AGAT]n | 1.000 | 0.860 | 0.979 | 1.000 | ||
| 13 | CGAT [AGAT]12 | - | 0.140 | 0.021 | - | ||
| All other motifs | - | - | - | - | |||
|
| |||||||
| 13–17 | [TCTA]n [TCTG]m [TCTA]4 | 0.412 | 1.000 | 1.000 | 1.000 | ||
| 13, 14 | [TCTA]n [TCTG]m [TCTA]4 | rs9786886 |
| - | - | - | |
| All other motifs | - | - | - | - | |||
|
| |||||||
| 9–14 | [TTTTC]n | 1.000 | 1.000 | 0.980 | 0.895 | ||
| 10 | [TTTTC]10 | rs760613324 | - | - | 0.020 | - | |
| 10, 11 | TTTTC TTTTA [TTTTC]n | - | - | - | 0.105 | ||
| All other motifs | - | - | - | - | |||
|
| |||||||
| 17–23 | [AGAGAT]n N42 [AGAGAT]m | 1.000 | 1.000 | 0.979 | 1.000 | ||
| 20 | AGAGAT AGTGAT [AGAGAT]n N42 [AGAGAT]m | - | - | 0.021 | - | ||
| All other motifs | - | - | - | - | |||
|
| |||||||
| 13–21 | [GAAA]n | 1.000 | 0.860 | 0.937 | 1.000 | ||
| 15–20 | [GAAA]n GGAA | - | 0.120 | 0.042 | - | ||
| 17 | [GAAA]10 GGAA [GAAA]6 | - | 0.020 | - | - | ||
| 18.2 | [GAAA]16 AA [GAAA]2 | - | - | 0.021 | - | ||
| All other motifs | - | - | - | - | |||
|
| |||||||
| 17–29 | [CTT]n | 1.000 | 0.960 | 0.979 | 0.971 | ||
| 21, 23, 25 | CTG [CTT]n | - | 0.040 | 0.021 | - | ||
| 20, 23, 25 | [CTT]n | rs370750300 | - | - | - | 0.029 | |
| All other motifs | - | - | - | - | |||
|
| |||||||
| 19–28 | [TAGA]n [TACA]2 [TAGA]2 [TACA]2 [TAGA]2 [TACA]2 [TAGA]4 | 0.615 | 0.102 | 0.469 | 0.036 | ||
| 17–25 | [TAGA]n [TACA]2 [TAGA]2 [TACA]2 [TAGA]4 | 0.385 | 0.888 | 0.531 |
| ||
| All other motifs | - | - | - | - | |||
|
| |||||||
| Del | TTCTC AC TTCTC | - | - | - |
| ||
| Ins | TTCTC AC [TTCTC]2 | 1.000 | 1.000 | 1.000 | 0.210 | ||
| All other motifs | - | - | - | - | |||
SNP = single nucleotide polymorphism, AfAm = African Americans, Cau = Caucasians, Hisp = Hispanics, Kor = Korean, RM Y-STR = rapidly mutating Y-chromosomal short tandem repeat.
aMarkers are sorted in ascending order from RM Y-STRs. RM Y-STRs are marked with an asterisk; bThe sequence-based motif obtained from massively parallel sequencing method was organized as reported by Gettings et al.14 Motifs with more than 1% frequency at least in one population in each marker are indicated. Variable stretches are marked “n, m and o” and “all other motifs” is the sum of frequencies of less than 1% motif; cThe differences with more than 20% of the frequency compared to other populations are indicated as bold.
Fig. 2Alignment of three sequence fragments of multi-copy DYF399S1 locus and SNP variation annotation. Bracketed motifs in repeat regions are counted for allele designation and lowercase letters in repeat regions are not counted as repeats. The left side shows the coordinates of the sequence in the Y chromosome. Gray boxes indicate nucleotides with sequence differences between fragments. The red letters indicate the SNPs that differentiate them into various alleles in the sequence-based analysis of DYF399S1. The rs number of each SNP is also shown (GRCh38 coordinates).
SNP = single nucleotide polymorphism.