Literature DB >> 24369151

Phylogeny-based classification of microbial communities.

Olga Tanaseichuk1, James Borneman, Tao Jiang.   

Abstract

MOTIVATION: Next-generation sequencing coupled with metagenomics has led to the rapid growth of sequence databases and enabled a new branch of microbiology called comparative metagenomics. Comparative metagenomic analysis studies compositional patterns within and between different environments providing a deep insight into the structure and function of complex microbial communities. It is a fast growing field that requires the development of novel supervised learning techniques for addressing challenges associated with metagenomic data, e.g. sensitivity to the choice of sequence similarity cutoff used to define operational taxonomic units (OTUs), high dimensionality and sparsity of the data and so forth. On the other hand, the natural properties of microbial community data may provide useful information about the structure of the data. For example, similarity between species encoded by a phylogenetic tree captures the relationship between OTUs and may be useful for the analysis of complex microbial datasets where the diversity patterns comprise features at multiple taxonomic levels. Even though some of the challenges have been addressed by learning algorithms in the literature, none of the available methods take advantage of the inherent properties of metagenomic data.
RESULTS: We proposed a novel supervised classification method for microbial community samples, where each sample is represented as a set of OTU frequencies, which takes advantage of the natural structure in microbial community data encoded by a phylogenetic tree. This model allows us to take advantage of environment-specific compositional patterns that may contain features at multiple granularity levels. Our method is based on the multinomial logistic regression model with a tree-guided penalty function. Additionally, we proposed a new simulation framework for generating 16S ribosomal RNA gene read counts that may be useful in comparative metagenomics research. Our experimental results on simulated and real data show that the phylogenetic information used in our method improves the classification accuracy.
AVAILABILITY AND IMPLEMENTATION: http://www.cs.ucr.edu/~tanaseio/metaphyl.htm.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 24369151     DOI: 10.1093/bioinformatics/btt700

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  7 in total

1.  Multilevel regularized regression for simultaneous taxa selection and network construction with metagenomic count data.

Authors:  Zhenqiu Liu; Fengzhu Sun; Jonathan Braun; Dermot P B McGovern; Steven Piantadosi
Journal:  Bioinformatics       Date:  2014-11-20       Impact factor: 6.937

2.  KERNEL-PENALIZED REGRESSION FOR ANALYSIS OF MICROBIOME DATA.

Authors:  Timothy W Randolph; Sen Zhao; Wade Copeland; Meredith Hullar; Ali Shojaie
Journal:  Ann Appl Stat       Date:  2018-03-09       Impact factor: 2.083

3.  Natural bacterial communities serve as quantitative geochemical biosensors.

Authors:  Mark B Smith; Andrea M Rocha; Chris S Smillie; Scott W Olesen; Charles Paradis; Liyou Wu; James H Campbell; Julian L Fortney; Tonia L Mehlhorn; Kenneth A Lowe; Jennifer E Earles; Jana Phillips; Steve M Techtmann; Dominique C Joyner; Dwayne A Elias; Kathryn L Bailey; Richard A Hurt; Sarah P Preheim; Matthew C Sanders; Joy Yang; Marcella A Mueller; Scott Brooks; David B Watson; Ping Zhang; Zhili He; Eric A Dubinsky; Paul D Adams; Adam P Arkin; Matthew W Fields; Jizhong Zhou; Eric J Alm; Terry C Hazen
Journal:  MBio       Date:  2015-05-12       Impact factor: 7.867

4.  Explaining diversity in metagenomic datasets by phylogenetic-based feature weighting.

Authors:  Davide Albanese; Carlotta De Filippo; Duccio Cavalieri; Claudio Donati
Journal:  PLoS Comput Biol       Date:  2015-03-27       Impact factor: 4.475

5.  Metagenomic Analysis of Silage.

Authors:  Richard K Tennant; Christine M Sambles; Georgina E Diffey; Karen A Moore; John Love
Journal:  J Vis Exp       Date:  2017-01-13       Impact factor: 1.355

6.  Principal Amalgamation Analysis for Microbiome Data.

Authors:  Yan Li; Gen Li; Kun Chen
Journal:  Genes (Basel)       Date:  2022-06-24       Impact factor: 4.141

7.  Predictive Modeling of Microbiome Data Using a Phylogeny-Regularized Generalized Linear Mixed Model.

Authors:  Jian Xiao; Li Chen; Stephen Johnson; Yue Yu; Xianyang Zhang; Jun Chen
Journal:  Front Microbiol       Date:  2018-06-27       Impact factor: 5.640

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.