Literature DB >> 17881407

Effect of the mutation rate and background size on the quality of pathogen identification.

Chris Reed1, Viacheslav Fofanov, Catherine Putonti, Sergei Chumakov, Tom Slezak, Yuriy Fofanov.   

Abstract

MOTIVATION: Genomic-based methods have significant potential for fast and accurate identification of organisms or even genes of interest in complex environmental samples (air, water, soil, food, etc.), especially when isolation of the target organism cannot be performed by a variety of reasons. Despite this potential, the presence of the unknown, variable and usually large quantities of background DNA can cause interference resulting in false positive outcomes.
RESULTS: In order to estimate how the genomic diversity of the background (total length of all of the different genomes present in the background), target length and target mutation rate affect the probability of misidentifications, we introduce a mathematical definition for the quality of an individual signature in the presence of a background based on its length and number of mismatches needed to transform the signature into the closest subsequence present in the background. This definition, in conjunction with a probabilistic framework, allows one to predict the minimal signature length required to identify the target in the presence of different sizes of backgrounds and the effect of the target's mutation rate on the quality of its identification. The model assumptions and predictions were validated using both Monte Carlo simulations and real genomic data examples. The proposed model can be used to determine appropriate signature lengths for various combinations of target and background genome sizes. It also predicted that any genomic signatures will be unable to identify target if its mutation rate is >5%. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2007        PMID: 17881407     DOI: 10.1093/bioinformatics/btm420

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  3 in total

1.  A high-throughput pipeline for designing microarray-based pathogen diagnostic assays.

Authors:  Ravi Vijaya Satya; Nela Zavaljevski; Kamal Kumar; Jaques Reifman
Journal:  BMC Bioinformatics       Date:  2008-04-10       Impact factor: 3.169

2.  Targeted amplification for enhanced detection of biothreat agents by next-generation sequencing.

Authors:  Shea N Gardner; Kenneth G Frey; Cassie L Redden; James B Thissen; Jonathan E Allen; Adam F Allred; Matthew D Dyer; Vishwesh P Mokashi; Tom R Slezak
Journal:  BMC Res Notes       Date:  2015-11-16

3.  The effects of glass surfaces and probe GC content on signal intensities of a 60-mer diagnostic microarray.

Authors:  Xiaoyang Mo; Qinghua Wu; Junjian Hu; Wenli Ma; Min Wei; Wuzhou Yuan; Yuequn Wang; Yongqin Li; Yun Deng; Xiushan Wu
Journal:  Ann Microbiol       Date:  2008       Impact factor: 2.112

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.