Literature DB >> 25256572

A two-stage statistical procedure for feature selection and comparison in functional analysis of metagenomes.

Naruekamol Pookhao1, Michael B Sohn1, Qike Li1, Isaac Jenkins1, Ruofei Du1, Hongmei Jiang1, Lingling An2.   

Abstract

MOTIVATION: With the advance of new sequencing technologies producing massive short reads data, metagenomics is rapidly growing, especially in the fields of environmental biology and medical science. The metagenomic data are not only high dimensional with large number of features and limited number of samples but also complex with a large number of zeros and skewed distribution. Efficient computational and statistical tools are needed to deal with these unique characteristics of metagenomic sequencing data. In metagenomic studies, one main objective is to assess whether and how multiple microbial communities differ under various environmental conditions.
RESULTS: We propose a two-stage statistical procedure for selecting informative features and identifying differentially abundant features between two or more groups of microbial communities. In the functional analysis of metagenomes, the features may refer to the pathways, subsystems, functional roles and so on. In the first stage of the proposed procedure, the informative features are selected using elastic net as reducing the dimension of metagenomic data. In the second stage, the differentially abundant features are detected using generalized linear models with a negative binomial distribution. Compared with other available methods, the proposed approach demonstrates better performance for most of the comprehensive simulation studies. The new method is also applied to two real metagenomic datasets related to human health. Our findings are consistent with those in previous reports. AVAILABILITY: R code and two example datasets are available at http://cals.arizona.edu/∼anling/software.htm. SUPPLEMENTARY INFORMATION: Supplementary file is available at Bioinformatics online.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2014        PMID: 25256572      PMCID: PMC4287949          DOI: 10.1093/bioinformatics/btu635

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  24 in total

1.  The future of microbial metagenomics (or is ignorance bliss?).

Authors:  Jack A Gilbert; Folker Meyer; Mark J Bailey
Journal:  ISME J       Date:  2010-11-25       Impact factor: 10.302

Review 2.  A bioinformatician's guide to metagenomics.

Authors:  Victor Kunin; Alex Copeland; Alla Lapidus; Konstantinos Mavromatis; Philip Hugenholtz
Journal:  Microbiol Mol Biol Rev       Date:  2008-12       Impact factor: 11.056

3.  Integrative analysis of environmental sequences using MEGAN4.

Authors:  Daniel H Huson; Suparna Mitra; Hans-Joachim Ruscheweyh; Nico Weber; Stephan C Schuster
Journal:  Genome Res       Date:  2011-06-20       Impact factor: 9.043

4.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

5.  Regulation of the stringent response is the essential function of the conserved bacterial G protein CgtA in Vibrio cholerae.

Authors:  David M Raskin; Nicholas Judson; John J Mekalanos
Journal:  Proc Natl Acad Sci U S A       Date:  2007-03-05       Impact factor: 11.205

6.  Differential expression analysis for sequence count data.

Authors:  Simon Anders; Wolfgang Huber
Journal:  Genome Biol       Date:  2010-10-27       Impact factor: 13.583

Review 7.  Exploring prokaryotic diversity in the genomic era.

Authors:  Philip Hugenholtz
Journal:  Genome Biol       Date:  2002-01-29       Impact factor: 13.583

8.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

9.  Statistical methods for detecting differentially abundant features in clinical metagenomic samples.

Authors:  James Robert White; Niranjan Nagarajan; Mihai Pop
Journal:  PLoS Comput Biol       Date:  2009-04-10       Impact factor: 4.475

10.  Methods for comparative metagenomics.

Authors:  Daniel H Huson; Daniel C Richter; Suparna Mitra; Alexander F Auch; Stephan C Schuster
Journal:  BMC Bioinformatics       Date:  2009-01-30       Impact factor: 3.169

View more
  8 in total

1.  Testing for differentially expressed genetic pathways with single-subject N-of-1 data in the presence of inter-gene correlation.

Authors:  A Grant Schissler; Walter W Piegorsch; Yves A Lussier
Journal:  Stat Methods Med Res       Date:  2017-05-29       Impact factor: 3.021

Review 2.  Statistical challenges in longitudinal microbiome data analysis.

Authors:  Saritha Kodikara; Susan Ellul; Kim-Anh Lê Cao
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

Review 3.  Recent Advances in the Etiopathogenesis of Inflammatory Bowel Disease: The Role of Omics.

Authors:  Eleni Stylianou
Journal:  Mol Diagn Ther       Date:  2018-02       Impact factor: 4.074

4.  Combining whole-genome shotgun sequencing and rRNA gene amplicon analyses to improve detection of microbe-microbe interaction networks in plant leaves.

Authors:  Julian Regalado; Derek S Lundberg; Oliver Deusch; Sonja Kersten; Talia Karasov; Karin Poersch; Gautam Shirsekar; Detlef Weigel
Journal:  ISME J       Date:  2020-05-13       Impact factor: 10.302

5.  Negative Binomial Mixed Models for Analyzing Longitudinal Microbiome Data.

Authors:  Xinyan Zhang; Yu-Fang Pei; Lei Zhang; Boyi Guo; Amanda H Pendegraft; Wenzhuo Zhuang; Nengjun Yi
Journal:  Front Microbiol       Date:  2018-07-26       Impact factor: 5.640

6.  Functional gene categories differentiate maize leaf drought-related microbial epiphytic communities.

Authors:  Barbara A Methe; David Hiltbrand; Jeffrey Roach; Wenwei Xu; Stuart G Gordon; Brad W Goodner; Ann E Stapleton
Journal:  PLoS One       Date:  2020-09-18       Impact factor: 3.240

Review 7.  The Road to Metagenomics: From Microbiology to DNA Sequencing Technologies and Bioinformatics.

Authors:  Alejandra Escobar-Zepeda; Arturo Vera-Ponce de León; Alejandro Sanchez-Flores
Journal:  Front Genet       Date:  2015-12-17       Impact factor: 4.599

8.  Paucibacterial Microbiome and Resident DNA Virome of the Healthy Conjunctiva.

Authors:  Thuy Doan; Lakshmi Akileswaran; Dallin Andersen; Benjamin Johnson; Narae Ko; Angira Shrestha; Valery Shestopalov; Cecilia S Lee; Aaron Y Lee; Russell N Van Gelder
Journal:  Invest Ophthalmol Vis Sci       Date:  2016-10-01       Impact factor: 4.799

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.