Literature DB >> 16403797

Bayesian classifiers for detecting HGT using fixed and variable order markov models of genomic signatures.

Daniel Dalevi1, Devdatt Dubhashi, Malte Hermansson.   

Abstract

MOTIVATION: Analyses of genomic signatures are gaining attention as they allow studies of species-specific relationships without involving alignments of homologous sequences. A naïve Bayesian classifier was built to discriminate between different bacterial compositions of short oligomers, also known as DNA words. The classifier has proven successful in identifying foreign genes in Neisseria meningitis. In this study we extend the classifier approach using either a fixed higher order Markov model (Mk) or a variable length Markov model (VLMk).
RESULTS: We propose a simple algorithm to lock a variable length Markov model to a certain number of parameters and show that the use of Markov models greatly increases the flexibility and accuracy in prediction to that of a naïve model. We also test the integrity of classifiers in terms of false-negatives and give estimates of the minimal sizes of training data. We end the report by proposing a method to reject a false hypothesis of horizontal gene transfer. AVAILABILITY: Software and Supplementary information available at www.cs.chalmers.se/~dalevi/genetic_sign_classifiers/.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16403797     DOI: 10.1093/bioinformatics/btk029

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  13 in total

1.  Normal and compound poisson approximations for pattern occurrences in NGS reads.

Authors:  Zhiyuan Zhai; Gesine Reinert; Kai Song; Michael S Waterman; Yihui Luan; Fengzhu Sun
Journal:  J Comput Biol       Date:  2012-06       Impact factor: 1.479

Review 2.  A bioinformatician's guide to metagenomics.

Authors:  Victor Kunin; Alex Copeland; Alla Lapidus; Konstantinos Mavromatis; Philip Hugenholtz
Journal:  Microbiol Mol Biol Rev       Date:  2008-12       Impact factor: 11.056

3.  New powerful statistics for alignment-free sequence comparison under a pattern transfer model.

Authors:  Xuemei Liu; Lin Wan; Jing Li; Gesine Reinert; Michael S Waterman; Fengzhu Sun
Journal:  J Theor Biol       Date:  2011-06-25       Impact factor: 2.691

4.  RAIphy: phylogenetic classification of metagenomics samples using iterative refinement of relative abundance index profiles.

Authors:  Ozkan U Nalbantoglu; Samuel F Way; Steven H Hinrichs; Khalid Sayood
Journal:  BMC Bioinformatics       Date:  2011-01-31       Impact factor: 3.169

5.  The IncP-1 plasmid backbone adapts to different host bacterial species and evolves through homologous recombination.

Authors:  Peter Norberg; Maria Bergström; Vinay Jethava; Devdatt Dubhashi; Malte Hermansson
Journal:  Nat Commun       Date:  2011-04-05       Impact factor: 14.919

6.  DarkHorse: a method for genome-wide prediction of horizontal gene transfer.

Authors:  Sheila Podell; Terry Gaasterland
Journal:  Genome Biol       Date:  2007       Impact factor: 13.583

7.  Community-wide analysis of microbial genome sequence signatures.

Authors:  Gregory J Dick; Anders F Andersson; Brett J Baker; Sheri L Simmons; Brian C Thomas; A Pepper Yelton; Jillian F Banfield
Journal:  Genome Biol       Date:  2009-08-21       Impact factor: 13.583

8.  Estimating DNA coverage and abundance in metagenomes using a gamma approximation.

Authors:  Sean D Hooper; Daniel Dalevi; Amrita Pati; Konstantinos Mavromatis; Natalia N Ivanova; Nikos C Kyrpides
Journal:  Bioinformatics       Date:  2009-12-14       Impact factor: 6.937

9.  Comparison of metagenomic samples using sequence signatures.

Authors:  Bai Jiang; Kai Song; Jie Ren; Minghua Deng; Fengzhu Sun; Xuegong Zhang
Journal:  BMC Genomics       Date:  2012-12-27       Impact factor: 3.969

10.  ESPRIT: estimating species richness using large collections of 16S rRNA pyrosequences.

Authors:  Yijun Sun; Yunpeng Cai; Li Liu; Fahong Yu; Michael L Farrell; William McKendree; William Farmerie
Journal:  Nucleic Acids Res       Date:  2009-05-05       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.