MtHc: a motif-based hierarchical method for clustering massive 16S rRNA sequences into OTUs.

Ze-Gang Wei, Shao-Wu Zhang.

Abstract

The recent sequencing revolution driven by high-throughput technologies has led to a rapid accumulation of 16S rRNA sequences for microbial communities. Clustering short sequences into operational taxonomic units (OTUs) is a crucial initial step in analyzing metagenomic data. Although many methods have been proposed for OTU inference, a major challenge is balancing inference accuracy against computational efficiency. To address this challenge, we present a novel motif-based hierarchical method (MtHc) for clustering massive 16S rRNA sequences into OTUs with high clustering accuracy and low memory usage. Suppose all the 16S rRNA sequences are used to construct a complete weighted network, in which sequences are viewed as nodes, each pair of sequences is connected by an imaginary edge, and the distance between a pair of sequences is the weight of that edge. MtHc consists of three main phases. First, it heuristically searches for a motif, defined as an n-node sub-graph (n = 3, 4, 5 in the present study) in which the distance between any two nodes is less than a threshold. Second, it uses the motif as a seed to form candidate clusters by computing the distances between the other sequences and the motif. Finally, it hierarchically merges the candidate clusters to generate the OTUs, calculating only the distances between the motifs of two clusters. In comparisons with existing methods on several simulated and real-life metagenomic datasets, we demonstrate that MtHc has higher clustering performance, lower memory usage, and greater robustness to parameter settings, and that it handles large-scale metagenomic datasets more effectively. The MtHc software is freely available for download to academic users.
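The three phases described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' MtHc implementation; it only shows the general shape of a motif-seeded, hierarchically merged clustering under several assumptions that the abstract does not specify: the distance is a plain mismatch fraction over equal-length, pre-aligned reads; a sequence joins a candidate cluster only if it is within the threshold of every motif member; and two clusters merge when the average distance between their motif members falls below the same threshold. The names `mismatch_distance`, `find_motif`, and `motif_cluster` are hypothetical.

```python
from itertools import combinations


def mismatch_distance(a, b):
    """Fraction of mismatched positions (assumption: equal-length, aligned reads)."""
    return sum(x != y for x, y in zip(a, b)) / len(a)


def find_motif(seqs, pool, n, threshold, dist):
    """Phase 1: greedily grow an n-node motif whose members are pairwise
    closer than `threshold`. Returns a list of indices, or None."""
    for seed in pool:
        motif = [seed]
        for cand in pool:
            if cand != seed and all(dist(seqs[cand], seqs[m]) < threshold for m in motif):
                motif.append(cand)
                if len(motif) == n:
                    return motif
    return None


def motif_cluster(seqs, n=3, threshold=0.03, dist=mismatch_distance):
    unassigned = list(range(len(seqs)))
    clusters = []  # each cluster keeps its motif and its full member list

    # Phases 1 and 2: find a motif, then recruit sequences close to all of it.
    while True:
        motif = find_motif(seqs, unassigned, n, threshold, dist)
        if motif is None:
            break
        members = [
            i for i in unassigned
            if i in motif or all(dist(seqs[i], seqs[m]) < threshold for m in motif)
        ]
        clusters.append({"motif": motif, "members": members})
        taken = set(members)
        unassigned = [i for i in unassigned if i not in taken]

    # Sequences that fit no motif become singleton clusters (assumption).
    for i in unassigned:
        clusters.append({"motif": [i], "members": [i]})

    def motif_dist(c1, c2):
        # Average distance between motif members only, never all members.
        pairs = [(a, b) for a in c1["motif"] for b in c2["motif"]]
        return sum(dist(seqs[a], seqs[b]) for a, b in pairs) / len(pairs)

    # Phase 3: repeatedly merge the closest pair of clusters under the threshold.
    while True:
        best = None
        for i, j in combinations(range(len(clusters)), 2):
            d = motif_dist(clusters[i], clusters[j])
            if d < threshold and (best is None or d < best[0]):
                best = (d, i, j)
        if best is None:
            break
        _, i, j = best
        clusters[i]["members"] += clusters[j]["members"]
        clusters[i]["motif"] += clusters[j]["motif"]
        del clusters[j]

    return [sorted(c["members"]) for c in clusters]


# Toy data: two tight groups and one outlier (10-mers, so the threshold is loose).
demo = [
    "AAAAAAAAAA", "AAAAAAAAAT", "AAAAAAAATA",
    "CCCCCCCCCC", "CCCCCCCCCG", "CCCCCCCCGC",
    "GGGGGTTTTT",
]
otus = motif_cluster(demo, n=3, threshold=0.25)
print(otus)
```

Because the merge step compares motif members rather than every member of each cluster, its cost depends on the motif size n and the number of candidate clusters, not on the total number of sequences. That is the property the abstract points to when it says the merge calculates only the distances between motifs.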

Year:  2015        PMID: 25912934     DOI: 10.1039/c5mb00089k

Source DB:  PubMed          Journal:  Mol Biosyst        ISSN: 1742-2051


  7 in total

Review 1.  A clinician's guide to microbiome analysis.

Authors:  Marcus J Claesson; Adam G Clooney; Paul W O'Toole
Journal:  Nat Rev Gastroenterol Hepatol       Date:  2017-08-09       Impact factor: 46.802

Review 2.  Factoring the intestinal microbiome into the pathogenesis of autoimmune hepatitis.

Authors:  Albert J Czaja
Journal:  World J Gastroenterol       Date:  2016-11-14       Impact factor: 5.742

3.  Deciphering bacterial community changes in zucker diabetic fatty rats based on 16S rRNA gene sequences analysis.

Authors:  Chunyan Gu; Ye Yang; Hong Xiang; Shu Li; Lina Liang; Hua Sui; Libin Zhan; Xiaoguang Lu
Journal:  Oncotarget       Date:  2016-08-02

4.  NPBSS: a new PacBio sequencing simulator for generating the continuous long reads with an empirical model.

Authors:  Ze-Gang Wei; Shao-Wu Zhang
Journal:  BMC Bioinformatics       Date:  2018-05-22       Impact factor: 3.169

5.  The gut microbiota and immune checkpoint inhibitors.

Authors:  Audrey Humphries; Adil Daud
Journal:  Hum Vaccin Immunother       Date:  2018-04-09       Impact factor: 3.452

6.  Comparison of Methods for Picking the Operational Taxonomic Units From Amplicon Sequences.

Authors:  Ze-Gang Wei; Xiao-Dan Zhang; Ming Cao; Fei Liu; Yu Qian; Shao-Wu Zhang
Journal:  Front Microbiol       Date:  2021-03-24       Impact factor: 5.640

7.  Polysaccharides from Chrysanthemum morifolium Ramat ameliorate colitis rats by modulating the intestinal microbiota community.

Authors:  Jin-Hua Tao; Jin-Ao Duan; Shu Jiang; Nan-Nan Feng; Wen-Qian Qiu; Yong Ling
Journal:  Oncotarget       Date:  2017-08-24