Literature DB >> 31173943

BOCS: DNA k-mer content and scoring for rapid genetic biomarker identification at low coverage.

Lee E Korshoj1, Prashant Nagpal2.   

Abstract

A single, inexpensive diagnostic test capable of rapidly identifying a wide range of genetic biomarkers would prove invaluable in precision medicine. Previous work has demonstrated the potential for high-throughput, label-free detection of A-G-C-T content in DNA k-mers, providing an alternative to single-letter sequencing while also having inherent lossy data compression and massively parallel data acquisition. Here, we apply a new bioinformatics algorithm - block optical content scoring (BOCS) - capable of using the high-throughput content k-mers for rapid, broad-spectrum identification of genetic biomarkers. BOCS uses content-based sequence alignment for probabilistic mapping of k-mer contents to gene sequences within a biomarker database, resulting in a probability ranking of genes on a content score. Simulations of the BOCS algorithm reveal high accuracy for identification of single antibiotic resistance genes, even in the presence of significant sequencing errors (100% accuracy for no sequencing errors, and >90% accuracy for sequencing errors at 20%), and at well below full coverage of the genes. Simulations for detecting multiple resistance genes within a methicillin-resistant Staphylococcus aureus (MRSA) strain showed 100% accuracy at an average gene coverage of merely 0.515, when the k-mer lengths were variable and with 4% sequencing error within the k-mer blocks. Extension of BOCS to cancer and other genetic diseases met or exceeded the results for resistance genes. Combined with a high-throughput content-based sequencing technique, the BOCS algorithm potentiates a test capable of rapid diagnosis and profiling of genetic biomarkers ranging from antibiotic resistance to cancer and other genetic diseases.
Copyright © 2019 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Biomarker detection; DNA sequencing; High-throughput diagnostics; Multidrug-resistant bacteria; Optical sequencing; Raman spectroscopy

Mesh:

Substances:

Year:  2019        PMID: 31173943     DOI: 10.1016/j.compbiomed.2019.05.022

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  1 in total

1.  Analysis of Identification Method for Bacterial Species and Antibiotic Resistance Genes Using Optical Data From DNA Oligomers.

Authors:  Ryan L Wood; Tanner Jensen; Cindi Wadsworth; Mark Clement; Prashant Nagpal; William G Pitt
Journal:  Front Microbiol       Date:  2020-02-20       Impact factor: 5.640

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.