| Literature DB >> 32244632 |
Olivier Tytgat1,2, Yannick Gansemans1, Jana Weymaere1, Kaat Rubben1, Dieter Deforce1, Filip Van Nieuwerburgh1.
Abstract
Nanopore sequencing for forensic short tandem repeats (STR) genotyping comes with the advantages associated with massively parallel sequencing (MPS) without the need for a high up-front device cost, but genotyping is inaccurate, partially due to the occurrence of homopolymers in STR loci. The goal of this study was to apply the latest progress in nanopore sequencing by Oxford Nanopore Technologies in the field of STR genotyping. The experiments were performed using the state of the art R9.4 flow cell and the most recent R10 flow cell, which was specifically designed to improve consensus accuracy of homopolymers. Two single-contributor samples and one mixture sample were genotyped using Illumina sequencing, Nanopore R9.4 sequencing, and Nanopore R10 sequencing. The accuracy of genotyping was comparable for both types of flow cells, although the R10 flow cell provided improved data quality for loci characterized by the presence of homopolymers. We identify locus-dependent characteristics hindering accurate STR genotyping, providing insights for the design of a panel of STR loci suited for nanopore sequencing. Repeat number, the number of different reference alleles for the locus, repeat pattern complexity, flanking region complexity, and the presence of homopolymers are identified as unfavorable locus characteristics. For single-contributor samples and for a limited set of the commonly used STR loci, nanopore sequencing could be applied. However, the technology is not mature enough yet for implementation in routine forensic workflows.Entities:
Keywords: DNA fingerprinting; Oxford Nanopore sequencing; forensic; massively parallel sequencing; short tandem repeats
Year: 2020 PMID: 32244632 PMCID: PMC7230633 DOI: 10.3390/genes11040381
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Average distribution of the reads over the loci. Red refers to Illumina data, orange to Nanopore R9.4, and yellow to Nanopore R10.
Percentage of locus-categorized reads that aligned to a true allele and possible mapping—accuracy related characteristics of the STR loci.
| % True-Allele Alignment * | Homopolymer Length (bp) | Complexity Repeat Pattern | Allele Range | |||
|---|---|---|---|---|---|---|
| Allele | Illumina | R9.4 | R10 | |||
| Amel | 100.00 | 85.97 | 82.21 | N/A | N/A | X-Y |
| TPOX | 97.16 | 91.65 | 79.43 | 2 | Low | 8:12 |
| THO1 | 94.75 | 68.08 | 69.26 | 2 | Low | 6:10 |
| D7S820 | 88.47 | 70.31 | 59.45 | 0 | Low | 7:14 |
| D13S317 | 89.80 | 69.11 | 57.25 | 0 | Low | 8:15 |
| D8S1179 | 93.63 | 78.21 | 55.31 | 0 | Medium | 8:17 |
| D16S539 | 91.75 | 75.36 | 54.86 | 0 | Low | 8:14 |
| D5S818 | 86.12 | 55.04 | 51.91 | 0 | Low | 8:14 |
| D3S1358 | 91.66 | 44.55 | 44.20 | 0 | Medium | 13:19 |
| D18S51 | 85.32 | 32.52 | 44.10 | 3 | Low | 10:22 |
| FGA | 81.03 | 27.95 | 39.67 | 3 | High | 18:28 |
| vWA | 84.77 | 30.54 | 37.92 | 0 | High | 14:20 |
| CD4 | 97.83 | 32.62 | 37.74 | 4 | Low | 4:11 |
| D21S11 | 89.94 | 26.59 | 18.87 | 0 | High | 27:36 |
| SE33 | 65.35 | 9.96 | 11.44 | 3 | High | 12:34 |
|
|
|
|
| |||
%: Percentage; *: Alignment to stutter alleles is not considered as true alignment.
Figure 2Distribution of locus-categorized reads per locus. Left bars are R9.4 reads and right bars R10. Black refers to non-alignment, yellow to alignment to multiple alleles (XA-tag), red to non-true alignment, and green to true-allele alignment.
Figure 3Relative frequency of uniquely aligned reads per locus. Grey indicates R9.4 data and black indicates R10 data. Green-colored bars indicate true positive alleles and red-colored bars indicate drop-ins or dropouts.
Figure 4Genotyping results after alignment. Green indicates correct genotyping, red indicates incorrect genotyping, and yellow indicates correct genotyping but complicated by one or more other highly represented alleles.