Literature DB >> 31698330

Forensic STR allele extraction using a machine learning paradigm.

Yao-Yuan Liu1, David Welch2, Ryan England3, Janet Stacey4, SallyAnn Harbison5.   

Abstract

We present a machine learning approach to short tandem repeat (STR) sequence detection and extraction from massively parallel sequencing data called Fragsifier. Using this approach, STRs are detected on each read by first locating the longest repeat stretches followed by locus prediction using k-mers in a machine learning sequence model. This is followed by reference flanking sequence alignment to determine precise STR boundaries. We show that Fragsifier produces genotypes that are concordant with profiles obtained using capillary electrophoresis (CE), and also compared the results with that of STRait Razor and the ForenSeq UAS. The data pre-processing and training of the sequence classifier is readily scripted, allowing the analyst to experiment with different thresholds, datasets and loci of interest, and different machine learning models.
Copyright © 2019 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Bioinformatics; Machine learning; Massively parallel sequencing; STR extraction

Mesh:

Year:  2019        PMID: 31698330     DOI: 10.1016/j.fsigen.2019.102194

Source DB:  PubMed          Journal:  Forensic Sci Int Genet        ISSN: 1872-4973            Impact factor:   4.882


  1 in total

1.  Systematic Selection of Age-Associated mRNA Markers and the Development of Predicted Models for Forensic Age Inference by Three Machine Learning Methods.

Authors:  Xiaoye Jin; Zheng Ren; Hongling Zhang; Qiyan Wang; Yubo Liu; Jingyan Ji; Jiang Huang
Journal:  Front Genet       Date:  2022-07-01       Impact factor: 4.772

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.