Literature DB >> 22843983

Meta-Storms: efficient search for similar microbial communities based on a novel indexing scheme and similarity score for metagenomic data.

Xiaoquan Su1, Jian Xu, Kang Ning.   

Abstract

BACKGROUND: It has long been intriguing scientists to effectively compare different microbial communities (also referred as 'metagenomic samples' here) in a large scale: given a set of unknown samples, find similar metagenomic samples from a large repository and examine how similar these samples are. With the current metagenomic samples accumulated, it is possible to build a database of metagenomic samples of interests. Any metagenomic samples could then be searched against this database to find the most similar metagenomic sample(s). However, on one hand, current databases with a large number of metagenomic samples mostly serve as data repositories that offer few functionalities for analysis; and on the other hand, methods to measure the similarity of metagenomic data work well only for small set of samples by pairwise comparison. It is not yet clear, how to efficiently search for metagenomic samples against a large metagenomic database.
RESULTS: In this study, we have proposed a novel method, Meta-Storms, that could systematically and efficiently organize and search metagenomic data. It includes the following components: (i) creating a database of metagenomic samples based on their taxonomical annotations, (ii) efficient indexing of samples in the database based on a hierarchical taxonomy indexing strategy, (iii) searching for a metagenomic sample against the database by a fast scoring function based on quantitative phylogeny and (iv) managing database by index export, index import, data insertion, data deletion and database merging. We have collected more than 1300 metagenomic data from the public domain and in-house facilities, and tested the Meta-Storms method on these datasets. Our experimental results show that Meta-Storms is capable of database creation and effective searching for a large number of metagenomic samples, and it could achieve similar accuracies compared with the current popular significance testing-based methods.
CONCLUSION: Meta-Storms method would serve as a suitable database management and search system to quickly identify similar metagenomic samples from a large pool of samples. CONTACT: ningkang@qibebt.ac.cn SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Mesh:

Year:  2012        PMID: 22843983     DOI: 10.1093/bioinformatics/bts470

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  19 in total

1.  Meta-Prism 2.0: Enabling algorithm and web server for ultra-fast, memory-efficient, and accurate analysis among millions of microbial community samples.

Authors:  Kai Kang; Hui Chong; Kang Ning
Journal:  Gigascience       Date:  2022-07-28       Impact factor: 7.658

2.  The predictive power of saliva electrolytes exceeds that of saliva microbiomes in diagnosing early childhood caries.

Authors:  Ying Zhang; Shi Huang; Songbo Jia; Zheng Sun; Shanshan Li; Fan Li; Lijuan Zhang; Jie Lu; Kaixuan Tan; Fei Teng; Fang Yang
Journal:  J Oral Microbiol       Date:  2021-05-13       Impact factor: 5.474

3.  Exploration and retrieval of whole-metagenome sequencing samples.

Authors:  Sohan Seth; Niko Välimäki; Samuel Kaski; Antti Honkela
Journal:  Bioinformatics       Date:  2014-05-19       Impact factor: 6.937

4.  Microbial community pattern detection in human body habitats via ensemble clustering framework.

Authors:  Peng Yang; Xiaoquan Su; Le Ou-Yang; Hon-Nian Chua; Xiao-Li Li; Kang Ning
Journal:  BMC Syst Biol       Date:  2014-12-08

5.  Exploring neighborhoods in the metagenome universe.

Authors:  Kathrin P Aßhauer; Heiner Klingenberg; Thomas Lingner; Peter Meinicke
Journal:  Int J Mol Sci       Date:  2014-07-14       Impact factor: 5.923

6.  Equivalent input produces different output in the UniFrac significance test.

Authors:  Jeffrey R Long; Vanessa Pittet; Brett Trost; Qingxiang Yan; David Vickers; Monique Haakensen; Anthony Kusalik
Journal:  BMC Bioinformatics       Date:  2014-08-13       Impact factor: 3.169

7.  Assessment of quality control approaches for metagenomic data analysis.

Authors:  Qian Zhou; Xiaoquan Su; Kang Ning
Journal:  Sci Rep       Date:  2014-11-07       Impact factor: 4.379

8.  Parallel-META 3: Comprehensive taxonomical and functional analysis platform for efficient comparison of microbial communities.

Authors:  Gongchao Jing; Zheng Sun; Honglei Wang; Yanhai Gong; Shi Huang; Kang Ning; Jian Xu; Xiaoquan Su
Journal:  Sci Rep       Date:  2017-01-12       Impact factor: 4.379

9.  MetaBoot: a machine learning framework of taxonomical biomarker discovery for different microbial communities based on metagenomic data.

Authors:  Xiaojun Wang; Xiaoquan Su; Xinping Cui; Kang Ning
Journal:  PeerJ       Date:  2015-07-07       Impact factor: 2.984

10.  Rapid comparison and correlation analysis among massive number of microbial community samples based on MDV data model.

Authors:  Xiaoquan Su; Jianqiang Hu; Shi Huang; Kang Ning
Journal:  Sci Rep       Date:  2014-09-17       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.