Literature DB >> 15731209

Self-organizing and self-correcting classifications of biological data.

George M Garrity1, Timothy G Lilburn.   

Abstract

MOTIVATION: Rapid, automated means of organizing biological data are required if we hope to keep abreast of the flood of data emanating from sequencing, microarray and similar high-throughput analyses. Faced with the need to validate the annotation of thousands of sequences and to generate biologically meaningful classifications based on the sequence data, we turned to statistical methods in order to automate these processes.
RESULTS: An algorithm for automated classification based on evolutionary distance data was written in S. The algorithm was tested on a dataset of 1436 small subunit ribosomal RNA sequences and was able to classify the sequences according to an extant scheme, use statistical measurements of group membership to detect sequences that were misclassified within this scheme and produce a new classification. In this study, the use of the algorithm to address problems in prokaryotic taxonomy is discussed. AVAILABILITY: S-Plus is available from Insightful, Inc. An S-Plus implementation of the algorithm and the associated data are available at http://taxoweb.mmg.msu.edu/datasets

Mesh:

Substances:

Year:  2005        PMID: 15731209     DOI: 10.1093/bioinformatics/bti346

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  4 in total

Review 1.  Biodiversity of Intestinal Lactic Acid Bacteria in the Healthy Population.

Authors:  Marika Mikelsaar; Epp Sepp; Jelena Štšepetova; Epp Songisepp; Reet Mändar
Journal:  Adv Exp Med Biol       Date:  2016       Impact factor: 2.622

Review 2.  A New Genomics-Driven Taxonomy of Bacteria and Archaea: Are We There Yet?

Authors:  George M Garrity
Journal:  J Clin Microbiol       Date:  2016-05-18       Impact factor: 5.948

3.  Identification of gene expression patterns using planned linear contrasts.

Authors:  Hao Li; Constance L Wood; Yushu Liu; Thomas V Getchell; Marilyn L Getchell; Arnold J Stromberg
Journal:  BMC Bioinformatics       Date:  2006-05-05       Impact factor: 3.169

4.  The Ribosomal Database Project: improved alignments and new tools for rRNA analysis.

Authors:  J R Cole; Q Wang; E Cardenas; J Fish; B Chai; R J Farris; A S Kulam-Syed-Mohideen; D M McGarrell; T Marsh; G M Garrity; J M Tiedje
Journal:  Nucleic Acids Res       Date:  2008-11-12       Impact factor: 16.971

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.