Literature DB >> 22825854

Adverse subpopulation regression for multivariate outcomes with high-dimensional predictors.

Bin Zhu1, David B Dunson, Allison E Ashley-Koch.   

Abstract

Biomedical studies have a common interest in assessing relationships between multiple related health outcomes and high-dimensional predictors. For example, in reproductive epidemiology, one may collect pregnancy outcomes such as length of gestation and birth weight and predictors such as single nucleotide polymorphisms in multiple candidate genes and environmental exposures. In such settings, there is a need for simple yet flexible methods for selecting true predictors of adverse health responses from a high-dimensional set of candidate predictors. To address this problem, one may either consider linear regression models for the continuous outcomes or convert these outcomes into binary indicators of adverse responses using predefined cutoffs. The former strategy has the disadvantage of often leading to a poorly fitting model that does not predict risk well, whereas the latter approach can be very sensitive to the cutoff choice. As a simple yet flexible alternative, we propose a method for adverse subpopulation regression, which relies on a two-component latent class model, with the dominant component corresponding to (presumed) healthy individuals and the risk of falling in the minority component characterized via a logistic regression. The logistic regression model is designed to accommodate high-dimensional predictors, as occur in studies with a large number of gene by environment interactions, through the use of a flexible nonparametric multiple shrinkage approach. The Gibbs sampler is developed for posterior computation. We evaluate the methods with the use of simulation studies and apply these to a genetic epidemiology study of pregnancy outcomes.
Copyright © 2012 John Wiley & Sons, Ltd.

Entities:  

Mesh:

Year:  2012        PMID: 22825854      PMCID: PMC3712761          DOI: 10.1002/sim.5520

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  19 in total

1.  Classification of births by birth weight and gestational age: an application of multivariate mixture models.

Authors:  T B Gage
Journal:  Ann Hum Biol       Date:  2003 Sep-Oct       Impact factor: 1.533

2.  Bayesian multivariate logistic regression.

Authors:  Sean M O'Brien; David B Dunson
Journal:  Biometrics       Date:  2004-09       Impact factor: 2.571

3.  Fixed and random effects selection in linear and logistic models.

Authors:  Satkartar K Kinney; David B Dunson
Journal:  Biometrics       Date:  2007-04-02       Impact factor: 2.571

4.  Genome-wide association analysis by lasso penalized logistic regression.

Authors:  Tong Tong Wu; Yi Fang Chen; Trevor Hastie; Eric Sobel; Kenneth Lange
Journal:  Bioinformatics       Date:  2009-01-28       Impact factor: 6.937

Review 5.  Environmental contributions to disparities in pregnancy outcomes.

Authors:  Marie Lynn Miranda; Pamela Maxson; Sharon Edwards
Journal:  Epidemiol Rev       Date:  2009-10-21       Impact factor: 6.222

6.  Joint Bayesian analysis of birthweight and censored gestational age using finite mixture models.

Authors:  Scott L Schwartz; Alan E Gelfand; Marie L Miranda
Journal:  Stat Med       Date:  2010-07-20       Impact factor: 2.373

7.  Maternal vitamin D receptor genetic variation contributes to infant birthweight among black mothers.

Authors:  Geeta K Swamy; Melanie E Garrett; Marie Lynn Miranda; Allison E Ashley-Koch
Journal:  Am J Med Genet A       Date:  2011-05-05       Impact factor: 2.802

8.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

9.  Strong rules for discarding predictors in lasso-type problems.

Authors:  Robert Tibshirani; Jacob Bien; Jerome Friedman; Trevor Hastie; Noah Simon; Jonathan Taylor; Ryan J Tibshirani
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2012-03       Impact factor: 4.488

10.  A Bayesian framework to account for complex non-genetic factors in gene expression levels greatly increases power in eQTL studies.

Authors:  Oliver Stegle; Leopold Parts; Richard Durbin; John Winn
Journal:  PLoS Comput Biol       Date:  2010-05-06       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.