Literature DB >> 31894129

A critical analysis of state-of-the-art metagenomics OTU clustering algorithms.

Ashaq Hussain Bhat1, Puniethaa Prabhu, Kalpana Balakrishnan.   

Abstract

Taxonomic profiling, using hyper-variable regions of 16S rRNA, is one of the important goals in metagenomics analysis. Operational taxonomic unit (OTU) clustering algorithms are the important tools to perform taxonomic profiling by grouping 16S rRNA sequence reads into OTU clusters. Presently various OTU clustering algorithms are available within different pipelines, even some pipelines have implemented more than one clustering algorithms, but there is less literature available for the relative performance and features of these algorithms. This makes the choice of using these methods unclear. In this study five current state-of-the-art OTU clustering algorithms (CDHIT, Mothur's Average Neighbour, SUMACLUST, Swarm, and UCLUST) have been comprehensively evaluated on the metagenomics sequencing data. It was found that in all the datasets, Mothur's average neighbour and Swarm created more number of OTU clusters. Based on normalized mutual information (NMI) and normalized information difference (NID), Swarm and Mothur's average neighbour showed better clustering qualities than others. But in terms of time complexity the greedy algorithms (SUMACLUST, CDHIT, and UCLUST) performed well. So there is a trade-off between quality and time, and it is necessary while analysing large size of 16S rRNA gene sequencing data.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31894129

Source DB:  PubMed          Journal:  J Biosci        ISSN: 0250-5991            Impact factor:   1.826


  36 in total

1.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.

Authors:  Weizhong Li; Adam Godzik
Journal:  Bioinformatics       Date:  2006-05-26       Impact factor: 6.937

Review 2.  Bacterial evolution.

Authors:  C R Woese
Journal:  Microbiol Rev       Date:  1987-06

3.  A large-scale benchmark study of existing algorithms for taxonomy-independent microbial community analysis.

Authors:  Yijun Sun; Yunpeng Cai; Susan M Huse; Rob Knight; William G Farmerie; Xiaoyu Wang; Volker Mai
Journal:  Brief Bioinform       Date:  2011-04-27       Impact factor: 11.622

4.  Diversity of human vaginal bacterial communities and associations with clinically defined bacterial vaginosis.

Authors:  Brian B Oakley; Tina L Fiedler; Jeanne M Marrazzo; David N Fredricks
Journal:  Appl Environ Microbiol       Date:  2008-05-16       Impact factor: 4.792

5.  VSEARCH: a versatile open source tool for metagenomics.

Authors:  Torbjørn Rognes; Tomáš Flouri; Ben Nichols; Christopher Quince; Frédéric Mahé
Journal:  PeerJ       Date:  2016-10-18       Impact factor: 2.984

6.  Clustering 16S rRNA for OTU prediction: a method of unsupervised Bayesian clustering.

Authors:  Xiaolin Hao; Rui Jiang; Ting Chen
Journal:  Bioinformatics       Date:  2011-01-13       Impact factor: 6.937

7.  An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea.

Authors:  Daniel McDonald; Morgan N Price; Julia Goodrich; Eric P Nawrocki; Todd Z DeSantis; Alexander Probst; Gary L Andersen; Rob Knight; Philip Hugenholtz
Journal:  ISME J       Date:  2011-12-01       Impact factor: 10.302

8.  MICCA: a complete and accurate software for taxonomic profiling of metagenomic data.

Authors:  Davide Albanese; Paolo Fontana; Carlotta De Filippo; Duccio Cavalieri; Claudio Donati
Journal:  Sci Rep       Date:  2015-05-19       Impact factor: 4.379

9.  A comparison of methods for clustering 16S rRNA sequences into OTUs.

Authors:  Wei Chen; Clarence K Zhang; Yongmei Cheng; Shaowu Zhang; Hongyu Zhao
Journal:  PLoS One       Date:  2013-08-13       Impact factor: 3.240

10.  A comparison of sequencing platforms and bioinformatics pipelines for compositional analysis of the gut microbiome.

Authors:  Imane Allali; Jason W Arnold; Jeffrey Roach; Maria Belen Cadenas; Natasha Butz; Hosni M Hassan; Matthew Koci; Anne Ballou; Mary Mendoza; Rizwana Ali; M Andrea Azcarate-Peril
Journal:  BMC Microbiol       Date:  2017-09-13       Impact factor: 3.605

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.