Yanming Li, Bin Nan, Ji Zhu.
Abstract
We propose a multivariate sparse group lasso variable selection and estimation method for data with high-dimensional predictors as well as high-dimensional response variables. The method is carried out through a penalized multivariate multiple linear regression model with an arbitrary group structure for the regression coefficient matrix. It is well suited to biological studies that seek associations between multiple traits and multiple predictors, where each trait and each predictor is embedded in biological functional groups such as genes, pathways, or brain regions. The method effectively removes unimportant groups as well as unimportant individual coefficients within important groups, particularly for large-p, small-n problems, and it flexibly handles complex group structures, including overlapping, nested, and multilevel hierarchical structures. The method is evaluated through extensive simulations, with comparisons to the conventional lasso and group lasso methods, and is applied to an eQTL association study.
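To make the penalty structure described in the abstract concrete, the following is a minimal sketch of a sparse group lasso objective for a coefficient matrix with arbitrary, possibly overlapping groups. It is an illustration under stated assumptions, not the authors' implementation: the function name `msgl_objective`, the 1/(2n) loss scaling, and the use of a single weight for all groups are hypothetical choices, since the abstract does not specify them.

```python
import numpy as np

def msgl_objective(X, Y, B, groups, lam_group, lam_l1):
    """Squared-error loss plus a group penalty and an elementwise L1 penalty.

    X: (n, p) predictors, Y: (n, q) responses, B: (p, q) coefficients.
    groups: list of boolean masks shaped like B; masks may overlap or nest.
    lam_group, lam_l1: non-negative tuning parameters (assumed shared
    across groups here; group-specific weights are also possible).
    """
    n = X.shape[0]
    resid = Y - X @ B
    # Least-squares loss; the 1/(2n) scaling is an assumption of this sketch.
    loss = 0.5 / n * np.sum(resid ** 2)
    # Group penalty: Euclidean norm of the coefficients in each group,
    # which can shrink an entire group to zero.
    group_pen = sum(np.sqrt(np.sum(B[mask] ** 2)) for mask in groups)
    # Elementwise penalty: encourages zeros inside groups that are kept.
    l1_pen = np.sum(np.abs(B))
    return loss + lam_group * group_pen + lam_l1 * l1_pen
```

The group term is what removes unimportant groups as a whole, while the L1 term removes individual coefficients within the groups that remain. Because each group is just a mask over entries of the coefficient matrix, overlapping or nested structures need no special handling when evaluating the objective, although minimizing it does.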
Keywords: Coordinate descent algorithm; eQTL; Genetic association; High-dimensional data; Oracle inequalities; Sparsity
Year: 2015 PMID: 25732839 PMCID: PMC4479976 DOI: 10.1111/biom.12292
Source DB: PubMed Journal: Biometrics ISSN: 0006-341X Impact factor: 2.571