TBC: a clustering algorithm based on prokaryotic taxonomy.

Jae-Hak Lee, Hana Yi, Yoon-Seong Jeon, Sungho Won, Jongsik Chun.

Abstract

High-throughput DNA sequencing technologies have revolutionized the study of microbial ecology. Massive sequencing of PCR amplicons of the 16S rRNA gene has been widely used to understand the microbial community structure of a variety of environmental samples. The resulting sequencing reads are clustered into operational taxonomic units (OTUs), which are then used to calculate various statistical indices that represent the degree of species diversity in a given sample. Several algorithms have been developed to perform this task, but they tend to produce different outcomes. Herein, we propose a novel sequence clustering algorithm, Taxonomy-Based Clustering (TBC). This algorithm incorporates the basic concept of prokaryotic taxonomy, in which only comparisons to the type strain are made and used to form species, while omitting full-scale multiple sequence alignment. The clustering quality of the proposed method was compared with those of MOTHUR, BLASTClust, ESPRIT-Tree, CD-HIT, and UCLUST. A comprehensive comparison using three different experimental datasets produced by pyrosequencing demonstrated that the clustering obtained using TBC is comparable to those obtained using MOTHUR and ESPRIT-Tree and is computationally efficient. The program was written in Java and is available from http://sw.ezbiocloud.net/tbc.
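
The idea of comparing each read only to a cluster representative (the analogue of a type strain), rather than aligning all reads against each other, can be illustrated with a minimal sketch. This is not the TBC program and does not reproduce its algorithm; it is a generic greedy, representative-based clustering written under several assumptions that are not in the abstract: the function names are hypothetical, `difflib.SequenceMatcher` stands in for a proper pairwise alignment identity, the 0.97 threshold is only a conventional choice for 16S OTUs, and the most abundant unique sequence is tried first as a representative. The Shannon index is included as one example of a diversity index computed from the resulting clusters.

```python
from collections import Counter
from difflib import SequenceMatcher
from math import log


def similarity(a: str, b: str) -> float:
    """Crude stand-in for pairwise alignment identity (assumption, not TBC's measure)."""
    return SequenceMatcher(None, a, b, autojunk=False).ratio()


def cluster_by_representative(reads, threshold=0.97):
    """Greedy clustering in which a read is compared only to cluster representatives.

    No multiple sequence alignment and no all-versus-all comparison is performed.
    """
    # Assumption: abundant unique sequences are tried first, so they become representatives.
    counts = Counter(reads)
    clusters = []  # each cluster: {"rep": sequence, "members": [(sequence, count), ...]}
    for seq, n in counts.most_common():
        for cluster in clusters:
            if similarity(seq, cluster["rep"]) >= threshold:
                cluster["members"].append((seq, n))
                break
        else:
            # No representative is close enough: this read founds a new cluster.
            clusters.append({"rep": seq, "members": [(seq, n)]})
    return clusters


def cluster_sizes(clusters):
    """Number of reads assigned to each cluster."""
    return [sum(n for _, n in c["members"]) for c in clusters]


def shannon_index(clusters):
    """Shannon diversity H = -sum(p_i * ln p_i) over cluster (OTU) proportions."""
    sizes = cluster_sizes(clusters)
    total = sum(sizes)
    return -sum((s / total) * log(s / total) for s in sizes)


if __name__ == "__main__":
    # Toy reads (hypothetical data): one abundant sequence, a one-base variant, one distinct sequence.
    reads = ["ACGT" * 10] * 3 + ["ACGT" * 9 + "ACGA"] + ["TTGGCCAA" * 5] * 2
    otus = cluster_by_representative(reads)
    print(len(otus), cluster_sizes(otus), round(shannon_index(otus), 4))
```

Because each read is compared with at most one sequence per existing cluster, the number of comparisons grows with reads times clusters rather than with the square of the number of reads, which is the general reason representative-based schemes tend to be cheaper than full pairwise or multiple-alignment approaches.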

Year: 2012    PMID: 22538644    DOI: 10.1007/s12275-012-1214-6

Source DB: PubMed    Journal: J Microbiol    ISSN: 1225-8873    Impact factor: 3.422


