| Literature DB >> 32699278 |
Dan Peng1,2, Yinming Zhang1,2, Han Ren1,2, Haixia Li1,2, Ran Li1,2, Xuefeng Shen1,2, Nana Wang1,2, Erwen Huang3,4, Riga Wu5,6, Hongyu Sun7,8.
Abstract
Massively parallel sequencing (MPS) has rapidly become a promising method for forensic DNA typing, due to its ability to detect a large number of markers and samples simultaneously in a single reaction, and sequence information can be obtained directly. In the present study, two kinds of forensic genetic markers, short tandem repeat (STR) and identity-informative single nucleotide polymorphism (iiSNP) were analyzed simultaneously using ForenSeq DNA Signature Prep Kit, a commercially available kit on MPS platform. A total of 152 DNA markers, including 27 autosomal STR (A-STR) loci, 24 Y chromosomal STR (Y-STR) loci, 7 X chromosomal STR (X-STR) loci and 94 iiSNP loci were genotyped for 107 Tibetan individuals (53 males and 54 females). Compared with length-based STR typing methods, 112 more A-STR alleles, 41 more Y-STR alleles, and 24 more X-STR alleles were observed at 17 A-STRs, 9 Y-STRs, and 5 X-STRs using sequence-based approaches. Thirty-nine novel sequence variations were observed at 20 STR loci. When the flanking regions were also analyzed in addition to target SNPs at the 94 iiSNPs, 38 more alleles were identified. Our study provided an adequate genotype and frequencies data of the two types of genetic markers for forensic practice. Moreover, we also proved that this panel is highly polymorphic and informative in Tibetan population, and should be efficient in forensic kinship testing and personal identification cases.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32699278 PMCID: PMC7376188 DOI: 10.1038/s41598-020-69137-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Allele coverage ratios of 27 A-STRs.
Novel alleles observed in 107 Tibetan samples.
| Locus | LB Allele | Allele name (using nomenclature according to Parson et al. 2016) | Count |
|---|---|---|---|
| FGA | 26 | FGA [CE26]-Chr4-GRCh38 154,587,736–154,587,823 [GGAA]2 GGAG [AAAG]16 AGAG AAAG AGAA AAAA [GAAA]3 | 1 |
| FGA | 27 | FGA [CE27]-Chr4-GRCh38 154,587,736–154,587,823 [GGAA]2 GGAG [AAAG]17 AGAG AAAG AGAA AAAA [GAAA]3 | 1 |
| FGA | 29 | FGA [CE29]-Chr4-GRCh38 154,587,736–154,587,823 [GGAA]2 GGAG [AAAG]19 AGAG AAAG AGAA AAAA [GAAA]3 | 1 |
| D6S1043 | 16 | D6S1043 [CE16]-Chr6-GRCh38 91,740,225–91,740,272 [ATCT]16 | 1 |
| D6S1043 | 19.3 | D6S1043 [CE19.3]-Chr6-GRCh38 91,740,225–91,740,272 [ATCT]5 ATGT [ATCT]2 ATC [ATCT]11 91,740,273-A | 1 |
| D7S820 | 10.1 | D7S820 [CE10.1]-Chr7-GRCh38 84,160,226–84,160,277 [TATC]10 84,160,204-A; 84,160,204.1A | 1 |
| D7S820 | 11 | D7S820 [CE11]-Chr7-GRCh38 84,160,226–84,160,277 [TATC]9 TGTC TATC 84,160,204-A | 1 |
| D8S1179 | 8 | D8S1179 [CE8]-Chr8-GRCh38 124,894,865–124,894,916 [TCTA]7 TCAA | 1 |
| vWA | 16 | vWA [CE16]-Chr12-GRCh38 5,983,977–5,984,044 [TAGA]10 [CAGA]5 TAGA | 4 |
| vWA | 17 | vWA [CE17]-Chr12-GRCh38 5,983,977–5,984,044 [TAGA]12 [CAGA]4 CAGA | 1 |
| D12S391 | 25 | D12S391 [CE25]-Chr12-GRCh38 12,297,020–12,297,095 [AGAT]16 [AGAC]3 AGAA [AGAC]4 AGAT | 2 |
| D16S539 | 8 | D16S539 [CE8]-Chr16-GRCh38 86,352,702–86,352,745 [GATA]8 86,352,692-G | 3 |
| D16S539 | 8 | D16S539 [CE8]-Chr16-GRCh38 86,352,702–86,352,745 [GATA]8 86,352,761-C | 1 |
| D19S433 | 12.2 | D19S433 [CE12.2]-Chr19-GRCh38 29,926,235–29,926,298 [CCTT]11 cctt CCTT tt CCTT | 1 |
| D21S11 | 28 | D21S11 [CE28]-Chr21-GRCh38 19,181,973–19,182,099 [TCTA]5 [TCTG]5 [TCTA]3 ta [TCTA]2 tca [TCTA]2 tccata [TCTA]11 | 2 |
| D21S11 | 29 | D21S11 [CE29]-Chr21-GRCh38 19,181,973–19,182,099 [TCTA]7 [TCTG]4 [TCTA]3 ta [TCTA]3 tca [TCTA]2 tccata [TCTA]10 | 1 |
| D21S11 | 29 | D21S11 [CE29]-Chr21-GRCh38 19,181,973–19,182,099 [TCTA]6 [TCTG]5 [TCTA]3 ta [TCTA]3 tca [TCTA]2 tccata [TCTA]10 19,182,101-T | 2 |
| D21S11 | 32.2 | D21S11 [CE32.2]-Chr21-GRCh38 19,181,973–19,182,099 [TCTA]5 [TCTG]7 [TCTA]3 ta [TCTA]3 tca [TCTA]2 tccata [TCTA]11 TA TCTA | 1 |
| D21S11 | 33.2 | D21S11 [CE33.2]-Chr21-GRCh38 19,181,973–19,182,099 [TCTA]5 [TCTG]6 [TCTA]3 ta [TCTA]4 tca [TCTA]2 tccata [TCTA]12 TA TCTA | 2 |
| DYS481 | 29 | DYS481 [CE29]-ChrY-GRCh38 8,558,337–8,558,402 [CTT]29 8,558,336-T | 1 |
| DYS612 | 36 | DYS612 [CE36]-ChrY-GRCh38 13,640,728–13,640,835 [CCT]5 CTT [TCT]4 CCT [TCT]25 13,640,861-C | 1 |
| DYS390 | 23 | DYS390 [CE23]-ChrY-GRCh38 15,163,067–15,163,162 [TAGA]4 CAGA [TAGA]9 [CAGA]9 15,163,163-C | 1 |
| DYS390 | 26 | DYS390 [CE26]-ChrY-GRCh38 15,163,067–15,163,162 [TAGA]4 CAGA [TAGA]12 [CAGA]9 | 2 |
| DYS390 | 27 | DYS390 [CE27]-ChrY-GRCh38 15,163,067–15,163,162 [TAGA]4 CAGA [TAGA]13 [CAGA]9 | 1 |
| DYS390 | 27 | DYS390 [CE27]-ChrY-GRCh38 15,163,067–15,163,162 [TAGA]4 CAGA [TAGA]12 [CAGA]10 | 3 |
| Y-GATA-H4 | 10 | Y-GATA-H4 [CE10]-ChrY-GRCh38 16,631,673–16,631,720 [TCTA]10 16,631,756-G | 7 |
| Y-GATA-H4 | 11 | Y-GATA-H4 [CE11]-ChrY-GRCh38 16,631,673–16,631,720 [TCTA]11 16,631,756-G | 15 |
| Y-GATA-H4 | 12 | Y-GATA-H4 [CE12]-ChrY-GRCh38 16,631,673–16,631,720 [TCTA]12 16,631,756-G | 1 |
| DYS460 | 9 | DYS460 [CE9]-ChrY-GRCh38 18,888,956–18,888,995 [CTAT]9 18,888,914-T; 18,888,949-T | 1 |
| DYS448 | 17 | DYS448 [CE17]-ChrY-GRCh38 22,218,923–22,219,078 [AGAGAT]10 N42 [AGAGAT]7 | 4 |
| DYS448 | 17 | DYS448 [CE17]-ChrY-GRCh38 22,218,923–22,219,078 [AGAGAT]9 N42 [AGAGAT]8 | 1 |
| DYS448 | 18 | DYS448 [CE18]-ChrY-GRCh38 22,218,923–22,219,078 [AGAGAT]11 N36 [AGAGAT]8 22,218,995–22,219,000 DEL | 1 |
| DYF387S1 | 39 | DYF387S1 [CE39]-ChrY-GRCh38 25,884,581–25,884,724 [CTTT]18 [CTTC]8 [CTTT]2 CTTC [CTTT]2 [CTTC]4 CTAC [CTTT]3 | 2 |
| DXS8378 | 10 | DXS8378 [CE10]-ChrX-GRCh38 9,402,262–9,402,301 [ATAG]10 9,402,257-G | 1 |
| DXS8378 | 10 | DXS8378 [CE10]-ChrX-GRCh38 9,402,262–9,402,301 [ATAG]5 ACAG [ATAG]4 | 1 |
| DXS7132 | 10 | DXS7132 [CE10]-ChrX-GRCh38 65,435,647–65,435,702 [TAGA]10 | 1 |
| DXS10074 | 15.3 | DXS10074 [CE15.3]-ChrX-GRCh38 67,757,345–67,757,400 [AAGA]10 AAA [AAGA]2 AAGG [AAGA]2 | 4 |
| DXS10074 | 16.3 | DXS10074 [CE16.3]-ChrX-GRCh38 67,757,345–67,757,400 [AAGA]11 AAA [AAGA]2 AAGG [AAGA]2 | 2 |
| DXS10103 | 21 | DXS10103 [CE21]-ChrX-GRCh38 134,284,959–134,285,038 [TAGA]2 ctga CAGA [TAGA]13 [CAGA]4 TAGA | 4 |
Figure 2Comparison of length-based and sequence-based counting of alleles for 58 STRs. Differential shading in the columns indicates the number of alleles based on length (white), the number of alleles increased based on the sequence in the repeat region only (black), the sequence in the flanking region only (dots), and the sequence in both repeat region and flanking region (stripe).
SNPs and InDels observed in STR flanking regions.
| Position* | Locus | Chromosome | STR locus | Upstream/downstream | Allele | Frequency |
|---|---|---|---|---|---|---|
| 68,011,922 | rs74640515 | 2 | D2S441 | Upstream | G>A | 0.0888 |
| 123,775,552 | rs73801920 | 5 | D5S818 | Upstream | C>A | 0.1636 |
| 91,740,273 | rs529713981 | 6 | D6S1043 | Downstream | G>A | 0.0047 |
| 84,160,204 | rs7789995 | 7 | D7S820 | Upstream | T>A | 0.9439 |
| 84,160,286 | rs16887642 | 7 | D7S820 | Downstream | G>A | 0.1869 |
| 84,160,205–84,160,212 | rs1463708262 | 7 | D7S820 | Upstream | Dup A (insertion) | 0.0047 |
| 5,983,970 | rs75219269 | 12 | vWA | Upstream | A>G | 0.1916 |
| 82,148,069 | rs9546005 | 13 | D13S317 | Downstream | A>T | 0.5514 |
| 82,148,073 | rs202043589 | 13 | D13S317 | Downstream | A>T | 0.0794 |
| 86,352,692 | rs563997442 | 16 | D16S539 | Upstream | C>G | 0.0140 |
| 86,352,761 | rs11642858 | 16 | D16S539 | Downstream | A>C | 0.3551 |
| 4,525,681 | rs561985213 | 20 | D20S482 | Upstream | G>A | 0.0093 |
| 4,525,680 | rs77560248 | 20 | D20S482 | Upstream | C>T | 0.1308 |
| 19,182,101 | rs1051967683 | 21 | D21S11 | Downstream | C>T | 0.0093 |
| 7,547,499 | rs371507752 | Y | DYS522 | Upstream | C>T | 0.0189 |
| 8,558,336 | rs370750300 | Y | DYS481 | Upstream | G>T | 0.0189 |
| 12,346,421 | NULL** | Y | DYS437 | Downstream | G>A | 0.1698 |
| 13,640,861 | rs555095027 | Y | DYS612 | Downstream | T>C | 0.0189 |
| 15,163,163 | rs758940870 | Y | DYS390 | Downstream | T>C | 0.0189 |
| 16,631,756 | NULL | Y | Y-GATA-H4 | Downstream | A>G | 0.4340 |
| 18,888,914 | NULL | Y | DYS460 | Upstream | A>T | 0.0189 |
| 18,888,949 | NULL | Y | DYS460 | Upstream | C>T | 0.0189 |
| 22,218,995–22,219,000 | NULL | Y | DYS448 | Repeat region (Not counted) | Del ATAGAG | 0.0189 |
| 9,402,257 | rs867174547 | X | DXS8378 | Upstream | A>G | 0.0062 |
| 65,435,703 | rs778986795 | X | DXS7132 | Downstream | C>T | 0.0435 |
| 67,757,322 | rs56195635 | X | DXS10074 | Upstream | C>G | 0.0062 |
| 134,284,967 | rs754666041 | X | DXS10103 | Repeat region (Not counted) | C>T | 0.0062 |
*: GRCh 38.
**: No record in dbSNP and 1,000 Genomes.
Combined forensic parameters of datasets used in this study.
| 26 A-STRs (length based) | 26 A-STRs (sequence based) | 94 iiSNPs (target SNPs) | 94 iiSNPs (full sequences) | 26 A-STRs (length based) + 94 iiSNPs (target SNPs) | 26 A-STRs (sequence based) + 94 iiSNPs (full sequences) | |
|---|---|---|---|---|---|---|
| CMP | 1.943E−30 | 3.218E−34 | 6.319E−35 | 3.351E−37 | 1.223E−64 | 1.074E−70 |
| TDP | 1−(1.943E−30) | 1−(3.218E−34) | 1−(6.319E−35) | 1−(3.351E−37) | 1−(1.223E−64) | 1−(1.074E−70) |
| CPEduo | 0.9999996 | 0.99999998 | 0.9999 | 0.99995 | 0.99999999995 | 0.9999999999992 |
| CPEtrio | 0.99999999996 | 0.9999999999995 | 0.99999995 | 0.999999993 | 1−(2.031E−18) | 1−(3.802E−21) |
Figure 3Instances of InDel in D7S820.
Figure 4Instances of reference alleles and three categories of variants. RRVO represents the internal sequence variations present in repeat region which is different from the reference allele. FRVO represents the sequence variations with flanking region variations only, while the repeat region is the same as the reference allele. RRFR stands for the sequence with both repeat motif variations and flanking region variations.