| Literature DB >> 31831766 |
Wenshen Dai1, Yajiao Pan2, Xiaochen Sun3, Riga Wu4,5, Luo Li2, Dongming Yang3.
Abstract
The development of massively parallel sequencing (MPS) has quickly changed forensic short tandem repeat (STR) genotyping. By providing detailed sequence information, MPS technology may be used as an alternative or additional method to overcome the limitations of capillary electrophoresis-based STR profiling. Most current NGS processes are labour-intensive with regard to library preparation and require high-quality DNA template. In this study, a 16-plex STR typing system (SeqType®R16) was used to achieve direct library preparation without DNA extraction and adaptor ligation. The efficiency of this system was tested in 601 individuals, including 593 old blood samples from the Chinese Han population and eight positive controls. It took approximately 4 hours for library preparation, including blood direct multiplex PCR (1.5 hours), mixing of the product (15 minutes), single tube purification (2 hours) and quantification (15 minutes). The results showed that MPS presented a broader allele range and higher discrimination power. Except for FGA and D19S433, the allele number almost doubled or more than doubled at all complex STR loci and simple STR loci, including D13S317, D16S539, D5S818, and D7S820. The range of discrimination power increased from 0.8008-0.9572 to 0.8401-0.9753, and the culminated matching probability decreased from 1.7 × 10-15 to 1.1 × 10-17.Entities:
Mesh:
Year: 2019 PMID: 31831766 PMCID: PMC6908607 DOI: 10.1038/s41598-019-55282-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Data quality of the three sequencing runs.
| Chip ID | Sample number | Final reads | TF alignment | Reads passed length filteringa | Barcode and primer sortingb | Downstream barcode identificationc | Matched reads | Reads of each sample (mean ± std.dev) | Reads of per locus per sample(mean ± std.dev) | Reads of per locus per sample | Sample% reported more than 15 loci |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 150 | 4.4 M | 95% | 4.27 M | 4.0 M | 3.7 M | 3.6 M | 24,053 ± 24,318 | 1,503 ± 1,520 | 352~4,466 | 94% |
| 2 | 151 | 4.4 M | 97% | 4.24 M | 4.1 M | 3.7 M | 3.7 M | 24,244 ± 26,667 | 1,515 ± 1,667 | 381~4,018 | 95% |
| 3 | 150 | 4.3 M | 96% | 4.08 M | 3.9 M | 2.5 M | 2.4 M | 15,712 ± 14,271 | 982 ± 892 | 183~2,311 | 97% |
| 4 | 150 | 3.6 M | 94% | 3.47 M | 3.2 M | 2.9 M | 2.9 M | 19,273 ± 18,228 | 1,205 ± 1,139 | 283~3,214 | 93% |
| Average | 150 | 4.2 M | 95.5% | 4.02 M | 3.8 M | 3.2 M | 3.1 M | 20,826 ± 21,689 | 1,301 ± 1,355 | 359~3,503 | 94.8% |
aLength filtering was performed by giving up the reads less than 60 bp.
bBarcode and primer sorting was processed by perfect matching to barcode (10 bp) and forward primer sequences (19–23 bp).
cDownstream barcodes were specifically chosen sequences (5–10 bp) downstream of the short tandem repeat (STR) repeat region. Detection of this downstream barcode may help to obtain reads that cover the entire STR repeat region. Perfect matching was applied in this filtering step.
Figure 1Summary of allele balance of 15 STRs and amelogenin.
Figure 2Distribution of read coverage for the 16 loci and the proportion of genotyped samples in each locus.
Discordance between capillary electrophoresis (CE) and next-generation sequencing (NGS) genotyping methods.
| Sample ID | STR | CE | NGS |
|---|---|---|---|
| 4591 | D18S51 | 15,18.1 | 15(61), x(74) |
| 4816 | D7S820 | 10.1,12 | 10(50), 12(63) |
The inconsistent allele call is shown in bold italics. The number of reads for each allele by NGS typing is listed in parentheses.
The number of alleles obtained by capillary electrophoresis-based (CE) short tandem repeat (STR) genotyping compared to next-generation sequencing (NGS)-STR genotyping.
| Locus | Repeat type | Alleles number identified by CE | Alleles number identified by NGS | Increase rate |
|---|---|---|---|---|
| D2S1338 | Compound | 12 | 41 | 241.7% |
| D21S11 | Compound | 16 | 44 | 175.0% |
| vWA | Compound | 10 | 22 | 120.0% |
| D8S1179 | Compound | 10 | 21 | 110.0% |
| D3S1358 | Compound | 9 | 17 | 88.9% |
| FGA | Compound | 19 | 21 | 10.5% |
| D19S433 | Compound | 14 | 15 | 7.1% |
| D13S317 | Simple | 8 | 23 | 187.5% |
| D5S818 | Simple | 9 | 19 | 111.1% |
| D7S820 | Simple | 9 | 16 | 77.8% |
| D16S539 | Simple | 8 | 14 | 75.0% |
| CSF1PO | Simple | 10 | 11 | 10.0% |
| D18S51 | Simple | 17 | 17 | 0.0% |
| TH01 | Simple | 6 | 6 | 0.0% |
| TPOX | Simple | 7 | 7 | 0.0% |
Forensic parameters of 15 STRs in the Chinese Han population obtained by sequence compared to length.
| STR | NGS | CE | ||
|---|---|---|---|---|
| DP | Hexp | DP | Hexp | |
| D2S1338 | 0.973 | 0.890 | 0.949 | 0.849 |
| D3S1358 | 0.862 | 0.760 | 0.801 | 0.711 |
| D8S1179 | 0.975 | 0.893 | 0.938 | 0.833 |
| VWA | 0.925 | 0.816 | 0.912 | 0.802 |
| D21S11 | 0.975 | 0.897 | 0.910 | 0.806 |
| D19S433 | 0.933 | 0.829 | 0.931 | 0.827 |
| FGA | 0.957 | 0.861 | 0.957 | 0.861 |
| D13S317 | 0.957 | 0.862 | 0.921 | 0.812 |
| D5S818 | 0.932 | 0.828 | 0.870 | 0.764 |
| D16S539 | 0.934 | 0.831 | 0.891 | 0.781 |
| D7S820 | 0.901 | 0.795 | 0.882 | 0.774 |
| CSFIPO | 0.840 | 0.746 | 0.840 | 0.746 |
| D18S51 | 0.943 | 0.843 | 0.943 | 0.843 |
| TH01 | 0.679 | 0.648 | 0.679 | 0.648 |
| TPOX | 0.609 | 0.612 | 0.609 | 0.612 |