| Literature DB >> 20011152 |
Fahong Yu, Yijun Sun, Li Liu, William Farmerie.
Abstract
GSTaxClassifier (Genomic Signature based Taxonomic Classifier) is a program for metagenomics analysis of shotgun DNA sequences. The program includes a simple but effective algorithm, a modification of the Bayesian method, to predict the most probable genomic origins of sequences at different taxonomical ranks, on the basis of genome databases;a function to generate genomic profiles of reference sequences with tri-, tetra-, penta-, and hexa-nucleotide motifs for setting a user-defined database; two different formats (tabular- and tree-based summaries) to display taxonomic predictions with improved analytical methods; and effective ways to retrieve, search, and summarize results by integrating the predictions into the NCBI tree-based taxonomic information.GSTaxClassifier takes input nucleotide sequences and using a modified Bayesian model evaluates the genomic signatures between metagenomic query sequences and reference genome databases. The simulation studies of a numerical data sets showed that GSTaxClassifier could serve as a useful program for metagenomics studies, which is freely available at http://helix2.biotech.ufl.edu:26878/metagenomics/.Entities:
Keywords: Bayesian method; Genomic signature; meta-genomics; taxonomy
Year: 2009 PMID: 20011152 PMCID: PMC2770370 DOI: 10.6026/97320630004046
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1Influence of motif lengths on prediction accuracy of sequences from bacteria.