| Literature DB >> 33630413 |
Hui Li1,2, Cheng Zhang3, Guoqing Song4, Ke Ma2, Yu Cao2, Xueying Zhao5, Qinrui Yang1, Jianhui Xie1.
Abstract
BACKGROUND: Massively parallel sequencing (MPS) is a promising supplementary method for forensic casework in short tandem repeats (STRs) genotyping, owing to several advantageous features in comparison to traditional capillary electrophoresis (CE). However, the application of MPS in casework requires accessible datasets from the worldwide population to enrich the allele frequencies of sequence-based STR genotypes.Entities:
Keywords: MPS; STR; flanking region; forensic genetics; population genetics
Mesh:
Year: 2021 PMID: 33630413 PMCID: PMC8123751 DOI: 10.1002/mgg3.1626
Source DB: PubMed Journal: Mol Genet Genomic Med ISSN: 2324-9269 Impact factor: 2.183
The number of unique alleles observed at 26 A‐STRs by LB, by RSB, and by FSB
| Locus | LB | RSB | FSB | Increase | %Increase |
|---|---|---|---|---|---|
| D12S391 | 11 | 30 | 30 | 19 | 172.73 |
| D21S11 | 13 | 32 | 33 | 20 | 153.85 |
| D13S317 | 8 | 9 | 20 | 12 | 150.00 |
| D2S1338 | 11 | 25 | 25 | 14 | 127.27 |
| D7S820 | 8 | 10 | 18 | 10 | 125.00 |
| D3S1358 | 6 | 13 | 13 | 7 | 116.67 |
| D20S482 | 6 | 8 | 13 | 7 | 116.67 |
| D8S1179 | 9 | 17 | 17 | 8 | 88.89 |
| D16S539 | 7 | 7 | 13 | 6 | 85.71 |
| D2S441 | 7 | 10 | 12 | 5 | 71.42 |
| D1S1656 | 9 | 14 | 14 | 5 | 55.56 |
| D9S1122 | 8 | 12 | 12 | 4 | 50.00 |
| D5S818 | 7 | 7 | 10 | 3 | 42.86 |
| D10S1248 | 6 | 6 | 8 | 2 | 33.33 |
| vWA | 8 | 10 | 10 | 2 | 25.00 |
| D4S2408 | 5 | 6 | 6 | 1 | 20.00 |
| D6S1043 | 15 | 17 | 17 | 2 | 13.33 |
| FGA | 15 | 16 | 16 | 1 | 6.67 |
| D17S1301 | 6 | 6 | 6 | 0 | 0.00 |
| D18S51 | 13 | 13 | 13 | 0 | 0.00 |
| D19S433 | 11 | 11 | 11 | 0 | 0.00 |
| CSF1PO | 8 | 8 | 8 | 0 | 0.00 |
| PentaD | 9 | 9 | 9 | 0 | 0.00 |
| PentaE | 18 | 18 | 18 | 0 | 0.00 |
| TH01 | 6 | 6 | 6 | 0 | 0.00 |
| TPOX | 5 | 5 | 5 | 0 | 0.00 |
| Total | 235 | 325 | 363 | 128 | 54.47 |
Abbreviations: A‐STRs, Autosomal STRs; FSB, sequence‐based alleles with flanking region; LB, length‐based alleles; RSB, sequence‐based alleles without flanking region.
FIGURE 1Allele diversity of A‐STRs based on length or sequence (with or without flanking regions)
The number of unique alleles observed at 24 Y‐STRs and 7 X‐STRs by LB, by RSB, and by FSB.
| Locus | LB | RSB | FSB | Increase | %Increase |
|---|---|---|---|---|---|
| DYS389II | 7 | 21 | 21 | 14 | 200.00 |
| DYF387S1 | 9 | 26 | 26 | 17 | 188.89 |
| DYS390 | 7 | 14 | 16 | 9 | 128.57 |
| DYS437 | 4 | 7 | 8 | 4 | 100.00 |
| DYS448 | 6 | 12 | 12 | 6 | 100.00 |
| Y‐GATA‐H4 | 4 | 4 | 8 | 4 | 100.00 |
| DYS635 | 6 | 10 | 10 | 4 | 66.67 |
| DYS612 | 9 | 12 | 12 | 3 | 33.33 |
| DYS460 | 4 | 4 | 5 | 1 | 25.00 |
| DYS19 | 5 | 5 | 6 | 1 | 20.00 |
| DYS389I | 5 | 6 | 6 | 1 | 20.00 |
| DYS439 | 5 | 6 | 6 | 1 | 20.00 |
| DYS481 | 13 | 15 | 15 | 2 | 15.38 |
| DYS385a‐b | 12 | 12 | 12 | 0 | 0.00 |
| DYS391 | 3 | 3 | 3 | 0 | 0.00 |
| DYS392 | 6 | 6 | 6 | 0 | 0.00 |
| DYS438 | 4 | 4 | 4 | 0 | 0.00 |
| DYS505 | 4 | 4 | 4 | 0 | 0.00 |
| DYS522 | 5 | 5 | 5 | 0 | 0.00 |
| DYS533 | 5 | 5 | 5 | 0 | 0.00 |
| DYS549 | 4 | 4 | 4 | 0 | 0.00 |
| DYS570 | 7 | 7 | 7 | 0 | 0.00 |
| DYS576 | 5 | 5 | 5 | 0 | 0.00 |
| DYS643 | 6 | 6 | 6 | 0 | 0.00 |
| Total | 145 | 203 | 212 | 67 | 46.215 |
Abbreviations: FSB, sequence‐based alleles with flanking region; LB, length‐based alleles; RSB, sequence‐based alleles without flanking region; Y‐STRs, Y chromosome STRs.
The number of unique alleles observed at 7 X‐STRs by LB, by RSB, and by FSB.
| Locus | LB | RSB | FSB | Increase | %Increase |
|---|---|---|---|---|---|
| DXS10135 | 20 | 34 | 34 | 14 | 70.00 |
| DXS7132 | 8 | 10 | 10 | 2 | 25.00 |
| DXS10103 | 6 | 7 | 7 | 1 | 16.67 |
| DXS8378 | 6 | 6 | 6 | 0 | 0.00 |
| DXS10074 | 9 | 9 | 9 | 0 | 0.00 |
| DXS7423 | 5 | 5 | 5 | 0 | 0.00 |
| HPRTB | 7 | 7 | 7 | 0 | 0.00 |
| Total | 61 | 78 | 78 | 17 | 27.87 |
Abbreviations: FSB, sequence‐based alleles with flanking region; LB, length‐based alleles; RSB, sequence‐based alleles without flanking region; X‐STRs, X chromosome STRs.
FIGURE 2Allele diversity of Y‐ and X‐STRs based on length or sequence (with or without flanking regions)
Thirty‐five novel alleles of STR loci observed in this study.
| Locus | MPS allele (following the nomenclature recommended by IFSG) |
|---|---|
| D5S818 | D5S818 [CE6]‐GRCh38‐Chr5‐123775543‐123775606 [ATCT]6 |
| D6S1043 | D6S1043 [CE20.3]‐GRCh38‐Chr6‐91740160‐91740292 [ATCT]6[ATGT][ATCT]2[ATC][ATCT]11 91740273‐A |
| D6S1043 | D6S1043 [CE21.3]‐GRCh38‐Chr6‐91740160‐91740292 [ATCT]6[ATGT][ATCT]2[ATC][ATCT]12 91740273‐A |
| D7S820 | D7S820 [CE11]‐GRCh38‐Chr7‐84160191‐84160297 [TATC]9[TGTC][TATC] 84160204‐A |
| D9S1122 | D9S1122 [CE17]‐GRCh38‐Chr9‐77073809‐77073880 [TAGA][TCGA][TAGA]15 |
| D10S1248 | D10S1248 [CE14]‐GRCh38‐Chr10 129294226‐129294318 [GGAA]14 129294238‐A |
| D10S1248 | D10S1248 [CE15]‐GRCh38‐Chr10 129294226‐129294318 [GGAA]15 129294243‐A |
| D12S391 | D12S391 [CE21]‐GRCh38‐Chr12‐12296981‐12297189 [AGAT]4[AGGT][AGAT]9[AGAC]6[AGAT] |
| D13S317 | D13S317 [CE14]‐GRCh38‐Chr13‐82147986‐82148107 [TATC]14 82148069‐T 82148073‐T |
| D13S317 | D13S317 [CE15]‐GRCh38‐Chr13‐82147986‐82148107 [TATC]15 82148069‐T |
| D16S539 | D16S539 [CE14]‐GRCh38‐Chr16‐86352664‐86352781 [GATA]14 86352761‐C |
| D18S51 | D18S51 [CE7]‐GRCh38‐Chr18‐63281662‐63281796 [AGAA]7 |
| D20S482 | D20S482 [CE14]‐GRCh38‐Chr20‐4525674‐4525771 [AGAT]4[ATAT][AGAT]9 |
| D20S482 | D20S482 [CE15]‐GRCh38‐Chr20‐4525674‐4525771 [AGAT]14[AGAC] |
| D21S11 | D21S11 [CE29]‐GRCh38‐Chr21‐19181939‐19182111 [TCTA]6[TCTG]5[TCTA]3 TA [TCTA]3 TCA[TCTA]2 TCCATA [TCTA]10 19182101‐T |
| D21S11 | D21S11 [CE30.2]‐GRCh38‐Chr21‐19181939‐19182111 [TCTA]5[TCTG]7[TCTA]3 TA [TCTA]2 TCA[TCTA]2 TCCATA [TCTA]10 TA[TCTA] |
| D21S11 | D21S11 [CE31.2]‐GRCh38‐Chr21‐19181939‐19182111 [TCTA]5[TCTG]7[TCTA]2 TA [TCTA]3 TCA[TCTA]2 TCCATA [TCTA]11 TA[TCTA] |
| D21S11 | D21S11 [CE33.2]‐GRCh38‐Chr21‐19181939‐19182111 [TCTA]5[TCTG]6[TCTA]3 TA [TCTA]4 TCA[TCTA]2 TCCATA [TCTA]12 TA[TCTA] |
| FGA | FGA [CE29]‐GRCh38‐Chr4‐154587713‐154587840 [GGAA]2[GGAG][AAAG]21[AGAA][AAAA][GAAA]3 |
| DYS19 | DYS19 [CE14]‐GRCh38‐ChrY‐9684267‐9684443 [TCTA]11 CCTA [TCTA]3 9684269‐T |
| DYS390 | DYS390 [CE19]‐ChrY‐GRCh38 15162096‐15163170 [TAGA]11[CAGA]8 |
| DYS390 | DYS390 [CE23]‐ChrY‐GRCh38 15162096‐15163170 [TAGA]3[CAGA][TAGA]10[CAGA]9 |
| DYS389I | DYS389I [CE13]‐ChrY‐GRCh38 12500387‐12500513 [TAGA]9[CAGA]4 |
| DYS389II | DYS389II [CE29]‐ChrY‐GRCh38 12500448‐12500633 [TAGA]9[CAGA]4N48 [TAGA]11[CAGA]5 |
| DYS439 | DYS439 [CE11]‐ChrY‐GRCh38 12403461‐12403587 [GATA]11 12403513‐G 12403514‐A 12403515‐T |
| DYS460 | DYS460 [CE9]‐ChrY‐GRCh38 18888810‐18889046 [TATC]9 1888811‐G |
| DYS481 | DYS481 [CE17]‐ChrY‐GRCh38 8558313‐8558408 [CTT]17 |
| DYS635 | DYS635 [CE23]‐ChrY‐GRCh38 12258755‐12258975 [TAGA]12 [TACA]3 [TAGA]2 [TACA]2 [TAGA]4 |
| DYF387S1 | DYF387S1 [CE40]‐GRCh38‐ChrY‐23785347‐23785521 [AAAG]3[GTAG][GAAG]4[AAAG]2[GAAG][AAAG]2[GAAG]8[AAAG]19 |
| DYF387S1 | DYF387S1 [CE42]‐GRCh38‐ChrY‐23785347‐23785521 [AAAG]3[GTAG][GAAG]4[AAAG]2[GAAG][AAAG]2[GAAG]10[AAAG]19 |
| Y‐GATA‐H4 | Y‐GATA‐H4 [CE10]‐ChrY‐GRCh38 16631624‐16631759 [TCTA]10 16631721‐T |
| DXS10135 | DXS10135 [CE14]‐GRCh38‐ChrX‐9338302‐9338520 [AAGA]3 N7 [AAGA]10[AAAG] |
| DXS10135 | DXS10135 [CE26]‐GRCh38‐ChrX‐9338302‐9338520 [AAGA]3 N7 [AAGA]15[AAGG][AAGA]4[AAGG][AAGA][AAAG] |
| DXS10135 | DXS10135 [CE30]‐GRCh38‐ChrX‐9338302‐9338520 [AAGA]3 N7 [AAGA]19[AAGG]2[AAGA]3[AAGG][AAGA][AAAG] |
| DXS7132 | DXS7132 [CE16]‐GRCh38‐ChrX‐65435623‐65435778 [TAGA]15[CAGA] |
Abbreviations: CE, capillary electrophoresis; MPS, massively parallel sequencing.
SNPs and InDels observed in flanking regions at 58 STRs using UAS.
| Locus | Variation | Position (GRCh38/hg38) | dbSNP ID | Wild | Mutant | Count |
|---|---|---|---|---|---|---|
| D2S441 | SNP | Chr2: 68011922 | rs74640515 | G | A | 25 |
| D5S818 | SNP | Chr5: 123775552 | rs73801920 | C | A | 54 |
| D6S1043 | SNP | Chr6: 91740273 | rs529713981 | G | A | 4 |
| D7S820 | SNP | Chr7: 84160204 | rs7789995 | T | A | 222 |
| D7S820 | SNP | Chr7: 84160286 | rs16887642 | G | A | 42 |
| D10S1248 | SNP | Chr10: 129294238 | rs1279061683 | G | A | 1 |
| D10S1248 | SNP | Chr10: 129294243 | rs563636310 | T | A | 1 |
| D13S317 | SNP | Chr13: 82148069 | rs9546005 | A | T | 120 |
| D13S317 | SNP | Chr13: 82148073 | rs202043589 | A | T | 23 |
| D16S539 | SNP | Chr16: 86352692 | rs563997442 | C | G | 2 |
| D16S539 | SNP | Chr16: 86352761 | rs11642858 | A | C | 77 |
| D20S482 | SNP | Chr20: 4525681 | rs561985213 | G | A | 2 |
| D20S482 | SNP | Chr20: 4525680 | rs77560248 | C | T | 29 |
| D21S11 | SNP | Chr21: 19182101 | rs1051967683 | C | T | 1 |
| vWA | SNP | Chr12: 5983970 | rs75219269 | A | G | 1 |
| DYS19 | SNP | ChrY: 9684269 | Null | G | T | 1 |
| DYS390 | SNP | ChrY: 15163163 | rs758940870 | T | C | 2 |
| DYS437 | SNP | ChrY: 12346421 | Null | G | A | 9 |
| DYS439 | SNP | ChrY: 12403513 | rs1042036966 | A | G | 1 |
| DYS439 | SNP | ChrY: 12403514 | Null | G | A | 1 |
| DYS439 | SNP | ChrY: 12403515 | Null | A | T | 1 |
| DYS460 | SNP | ChrY: 1888811 | Null | T | G | 1 |
| Y‐GATA‐H4 | SNP | ChrY: 16631721 | rs765275581 | C | T | 1 |
| Y‐GATA‐H4 | SNP | ChrY: 16631756 | Null | A | G | 59 |
| DXS10135 | Deletion | ChrX: 9338410–9338416 | rs201630737 | AAGAAGA | AGA | 1 |
Abbreviations: dbSNP, Single‐Nucleotide Polymorphism database; Null, No record in dbSNP.