Literature DB >> 25568281

Selection of models for the analysis of risk-factor trees: leveraging biological knowledge to mine large sets of risk factors with application to microbiome data.

Qunyuan Zhang1, Haley Abel1, Alan Wells1, Petra Lenzini1, Felicia Gomez1, Michael A Province1, Alan A Templeton2, George M Weinstock1, Nita H Salzman1, Ingrid B Borecki1.   

Abstract

MOTIVATION: Establishment of a statistical association between microbiome features and clinical outcomes is of growing interest because of the potential for yielding insights into biological mechanisms and pathogenesis. Extracting microbiome features that are relevant for a disease is challenging and existing variable selection methods are limited due to large number of risk factor variables from microbiome sequence data and their complex biological structure.
RESULTS: We propose a tree-based scanning method, Selection of Models for the Analysis of Risk factor Trees (referred to as SMART-scan), for identifying taxonomic groups that are associated with a disease or trait. SMART-scan is a model selection technique that uses a predefined taxonomy to organize the large pool of possible predictors into optimized groups, and hierarchically searches and determines variable groups for association test. We investigate the statistical properties of SMART-scan through simulations, in comparison to a regular single-variable analysis and three commonly-used variable selection methods, stepwise regression, least absolute shrinkage and selection operator (LASSO) and classification and regression tree (CART). When there are taxonomic group effects in the data, SMART-scan can significantly increase power by using bacterial taxonomic information to split large numbers of variables into groups. Through an application to microbiome data from a vervet monkey diet experiment, we demonstrate that SMART-scan can identify important phenotype-associated taxonomic features missed by single-variable analysis, stepwise regression, LASSO and CART.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25568281      PMCID: PMC4426830          DOI: 10.1093/bioinformatics/btu855

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  28 in total

1.  Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB.

Authors:  T Z DeSantis; P Hugenholtz; N Larsen; M Rojas; E L Brodie; K Keller; T Huber; D Dalevi; P Hu; G L Andersen
Journal:  Appl Environ Microbiol       Date:  2006-07       Impact factor: 4.792

2.  Identification of important regressor groups, subgroups and individuals via regularization methods: application to gut microbiome data.

Authors:  Tanya P Garcia; Samuel Müller; Raymond J Carroll; Rosemary L Walzem
Journal:  Bioinformatics       Date:  2013-10-24       Impact factor: 6.937

3.  Comparative methods with sampling error and within-species variation: contrasts revisited and revised.

Authors:  Joseph Felsenstein
Journal:  Am Nat       Date:  2008-06       Impact factor: 3.926

4.  A cladistic analysis of phenotypic associations with haplotypes inferred from restriction endonuclease mapping. IV. Nested analyses with cladogram uncertainty and recombination.

Authors:  A R Templeton; C F Sing
Journal:  Genetics       Date:  1993-06       Impact factor: 4.562

5.  The detection of disease clustering and a generalized regression approach.

Authors:  N Mantel
Journal:  Cancer Res       Date:  1967-02       Impact factor: 12.701

6.  Significant genotype by diet (G × D) interaction effects on cardiometabolic responses to a pedigree-wide, dietary challenge in vervet monkeys (Chlorocebus aethiops sabaeus).

Authors:  Venkata S Voruganti; Matthew J Jorgensen; Jay R Kaplan; Kylie Kavanagh; Larry L Rudel; Ryan Temel; Lynn A Fairbanks; Anthony G Comuzzie
Journal:  Am J Primatol       Date:  2013-01-11       Impact factor: 2.371

7.  Tree scanning: a method for using haplotype trees in phenotype/genotype association studies.

Authors:  Alan R Templeton; Taylor Maxwell; David Posada; Jari H Stengård; Eric Boerwinkle; Charles F Sing
Journal:  Genetics       Date:  2004-09-15       Impact factor: 4.562

8.  A framework for human microbiome research.

Authors: 
Journal:  Nature       Date:  2012-06-13       Impact factor: 49.962

9.  Hypothesis testing and power calculations for taxonomic-based human microbiome data.

Authors:  Patricio S La Rosa; J Paul Brooks; Elena Deych; Edward L Boone; David J Edwards; Qin Wang; Erica Sodergren; George Weinstock; William D Shannon
Journal:  PLoS One       Date:  2012-12-20       Impact factor: 3.240

10.  Statistical methods for detecting differentially abundant features in clinical metagenomic samples.

Authors:  James Robert White; Niranjan Nagarajan; Mihai Pop
Journal:  PLoS Comput Biol       Date:  2009-04-10       Impact factor: 4.475

View more
  5 in total

Review 1.  Tools for Analysis of the Microbiome.

Authors:  Jessica Galloway-Peña; Blake Hanson
Journal:  Dig Dis Sci       Date:  2020-03       Impact factor: 3.199

Review 2.  Revolution of Alzheimer Precision Neurology. Passageway of Systems Biology and Neurophysiology.

Authors:  Harald Hampel; Nicola Toschi; Claudio Babiloni; Filippo Baldacci; Keith L Black; Arun L W Bokde; René S Bun; Francesco Cacciola; Enrica Cavedo; Patrizia A Chiesa; Olivier Colliot; Cristina-Maria Coman; Bruno Dubois; Andrea Duggento; Stanley Durrleman; Maria-Teresa Ferretti; Nathalie George; Remy Genthon; Marie-Odile Habert; Karl Herholz; Yosef Koronyo; Maya Koronyo-Hamaoui; Foudil Lamari; Todd Langevin; Stéphane Lehéricy; Jean Lorenceau; Christian Neri; Robert Nisticò; Francis Nyasse-Messene; Craig Ritchie; Simone Rossi; Emiliano Santarnecchi; Olaf Sporns; Steven R Verdooner; Andrea Vergallo; Nicolas Villain; Erfan Younesi; Francesco Garaci; Simone Lista
Journal:  J Alzheimers Dis       Date:  2018       Impact factor: 4.472

3.  An autoregressive logistic model to predict the reciprocal effects of oviductal fluid components on in vitro spermophagy by neutrophils in cattle.

Authors:  Rasoul Kowsar; Behrooz Keshtegar; Mohamed A Marey; Akio Miyamoto
Journal:  Sci Rep       Date:  2017-06-30       Impact factor: 4.379

4.  Application of Taxonomic Modeling to Microbiota Data Mining for Detection of Helminth Infection in Global Populations.

Authors:  Mahbaneh Eshaghzadeh Torbati; Makedonka Mitreva; Vanathi Gopalakrishnan
Journal:  Data (Basel)       Date:  2016-12-13

5.  Decision Tree Algorithm-Generated Single-Nucleotide Polymorphism Barcodes of rbcL Genes for 38 Brassicaceae Species Tagging.

Authors:  Cheng-Hong Yang; Kuo-Chuan Wu; Li-Yeh Chuang; Hsueh-Wei Chang
Journal:  Evol Bioinform Online       Date:  2018-03-05       Impact factor: 1.625

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.