Literature DB >> 29525576

Use of the LUS in sequence allele designations to facilitate probabilistic genotyping of NGS-based STR typing results.

Rebecca S Just1, Jodi A Irwin2.   

Abstract

Some of the expected advantages of next generation sequencing (NGS) for short tandem repeat (STR) typing include enhanced mixture detection and genotype resolution via sequence variation among non-homologous alleles of the same length. However, at the same time that NGS methods for forensic DNA typing have advanced in recent years, many caseworking laboratories have implemented or are transitioning to probabilistic genotyping to assist the interpretation of complex autosomal STR typing results. Current probabilistic software programs are designed for length-based data, and were not intended to accommodate sequence strings as the product input. Yet to leverage the benefits of NGS for enhanced genotyping and mixture deconvolution, the sequence variation among same-length products must be utilized in some form. Here, we propose use of the longest uninterrupted stretch (LUS) in allele designations as a simple method to represent sequence variation within the STR repeat regions and facilitate - in the nearterm - probabilistic interpretation of NGS-based typing results. An examination of published population data indicated that a reference LUS region is straightforward to define for most autosomal STR loci, and that using repeat unit plus LUS length as the allele designator can represent greater than 80% of the alleles detected by sequencing. A proof of concept study performed using a freely available probabilistic software demonstrated that the LUS length can be used in allele designations when a program does not require alleles to be integers, and that utilizing sequence information improves interpretation of both single-source and mixed contributor STR typing results as compared to using repeat unit information alone. The LUS concept for allele designation maintains the repeat-based allele nomenclature that will permit backward compatibility to extant STR databases, and the LUS lengths themselves will be concordant regardless of the NGS assay or analysis tools employed. Further, these biologically based, easy-to-derive designations uphold clear relationships between parent alleles and their stutter products, enabling analysis in fully continuous probabilistic programs that model stutter while avoiding the algorithmic complexities that come with string based searches. Though using repeat unit plus LUS length as the allele designator does not capture variation that occurs outside of the core repeat regions, this straightforward approach would permit the large majority of known STR sequence variation to be used for mixture deconvolution and, in turn, result in more informative mixture statistics in the near term. Ultimately, the method could bridge the gap from current length-based probabilistic systems to facilitate broader adoption of NGS by forensic DNA testing laboratories.
Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.

Keywords:  Longest uninterrupted stretch (LUS); Mixture; Next generation sequencing (NGS); Probabilistic genotyping; Sequence variation; Short tandem repeat (STR)

Mesh:

Year:  2018        PMID: 29525576     DOI: 10.1016/j.fsigen.2018.02.016

Source DB:  PubMed          Journal:  Forensic Sci Int Genet        ISSN: 1872-4973            Impact factor:   4.882


  4 in total

1.  Massively parallel sequencing analysis of nondegraded and degraded DNA mixtures using the ForenSeq™ system in combination with EuroForMix software.

Authors:  Hsiao-Lin Hwa; Ming-Yih Wu; Wan-Chia Chung; Tsang-Ming Ko; Chih-Peng Lin; Hsiang-I Yin; Tsui-Ting Lee; James Chun-I Lee
Journal:  Int J Legal Med       Date:  2018-10-29       Impact factor: 2.686

2.  Report from the STRAND Working Group on the 2019 STR sequence nomenclature meeting.

Authors:  Katherine Butler Gettings; David Ballard; Martin Bodner; Lisa A Borsuk; Jonathan L King; Walther Parson; Christopher Phillips
Journal:  Forensic Sci Int Genet       Date:  2019-09-21       Impact factor: 4.882

Review 3.  A Review of Probabilistic Genotyping Systems: EuroForMix, DNAStatistX and STRmix™.

Authors:  Peter Gill; Corina Benschop; John Buckleton; Øyvind Bleka; Duncan Taylor
Journal:  Genes (Basel)       Date:  2021-09-30       Impact factor: 4.096

4.  Identification of sequence polymorphisms at 58 STRs and 94 iiSNPs in a Tibetan population using massively parallel sequencing.

Authors:  Dan Peng; Yinming Zhang; Han Ren; Haixia Li; Ran Li; Xuefeng Shen; Nana Wang; Erwen Huang; Riga Wu; Hongyu Sun
Journal:  Sci Rep       Date:  2020-07-22       Impact factor: 4.379

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.