Incorporating group correlations in genome-wide association studies using smoothed group Lasso.

Jin Liu, Jian Huang, Shuangge Ma, Kai Wang.

Abstract

In genome-wide association studies, penalization is an important approach for identifying genetic markers associated with disease. Motivated by the fact that single nucleotide polymorphisms have a natural grouping structure and, more importantly, that such groups are correlated, we propose a new penalization method for group variable selection that properly accommodates the correlation between adjacent groups. This method combines the group Lasso penalty with a quadratic penalty on the difference of regression coefficients of adjacent groups. The new method is referred to as smoothed group Lasso (SGL). It encourages group sparsity and smooths regression coefficients for adjacent groups. Canonical correlations are used in the weights between groups in the quadratic difference penalty. We first derive a group coordinate descent (GCD) algorithm for computing the solution path under the linear regression model. The SGL method is further extended to logistic regression for binary responses. With the assistance of the majorization-minimization algorithm, SGL-penalized logistic regression turns out to be an iteratively penalized least-squares problem. We also suggest conducting principal component analysis to reduce the dimensionality within groups. Simulation studies are used to evaluate the finite-sample performance. Comparison with group Lasso shows that SGL is more effective in selecting true positives. Two datasets are analyzed using the SGL method.
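To make the structure of the penalty concrete, the following is a minimal sketch and not the authors' implementation. The abstract states only that SGL combines a group Lasso term with a weighted quadratic penalty on differences between adjacent groups, with weights tied to canonical correlations. The exact functional form used here (group norms scaled by the square root of group size, a factor of one half on the quadratic term) and the names `canonical_correlation`, `sgl_penalty`, `lam1`, and `lam2` are assumptions made for illustration.

```python
import numpy as np


def canonical_correlation(Xa, Xb):
    """Largest canonical correlation between two column blocks.

    Assumes both blocks have full column rank (hypothetical helper).
    """
    # Center each block, then build orthonormal bases of the column spaces.
    Xa = Xa - Xa.mean(axis=0)
    Xb = Xb - Xb.mean(axis=0)
    Qa, _ = np.linalg.qr(Xa)
    Qb, _ = np.linalg.qr(Xb)
    # The singular values of Qa^T Qb are the canonical correlations.
    s = np.linalg.svd(Qa.T @ Qb, compute_uv=False)
    return float(min(s[0], 1.0))  # clip tiny floating-point overshoot


def sgl_penalty(betas, weights, lam1, lam2):
    """Illustrative SGL-style penalty (assumed form, see lead-in).

    betas   : list of 1-D arrays, one coefficient vector per group,
              ordered so that neighbours in the list are adjacent groups.
    weights : array of length len(betas) - 1, one weight per adjacent pair.
    """
    sizes = np.array([b.size for b in betas], dtype=float)
    norms = np.array([np.linalg.norm(b) for b in betas])
    # Group Lasso term: encourages whole groups to be zero.
    group_lasso = lam1 * np.sum(np.sqrt(sizes) * norms)
    # Quadratic term: pulls size-scaled norms of adjacent groups together.
    scaled = norms / np.sqrt(sizes)
    smooth = 0.5 * lam2 * np.sum(np.asarray(weights) * np.diff(scaled) ** 2)
    return group_lasso + smooth
```

In this sketch the weights for `sgl_penalty` could be filled in by calling `canonical_correlation` on the genotype columns of each pair of adjacent groups, so that strongly correlated neighbours are smoothed more heavily than weakly correlated ones.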

Year:  2012        PMID: 22988281      PMCID: PMC3590928          DOI: 10.1093/biostatistics/kxs034

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  12 in total

1.  Structured gene-environment interaction analysis.

Authors:  Mengyun Wu; Qingzhao Zhang; Shuangge Ma
Journal:  Biometrics       Date:  2019-10-09       Impact factor: 2.571

2.  Prediction-Oriented Marker Selection (PROMISE): With Application to High-Dimensional Regression.

Authors:  Soyeon Kim; Veerabhadran Baladandayuthapani; J Jack Lee
Journal:  Stat Biosci       Date:  2016-09-26

3.  Integrative analysis of high-throughput cancer studies with contrasted penalization.

Authors:  Xingjie Shi; Jin Liu; Jian Huang; Yong Zhou; BenChang Shia; Shuangge Ma
Journal:  Genet Epidemiol       Date:  2014-01-06       Impact factor: 2.135

4.  Structure-Leveraged Methods in Breast Cancer Risk Prediction.

Authors:  Jun Fan; Yirong Wu; Ming Yuan; David Page; Jie Liu; Irene M Ong; Peggy Peissig; Elizabeth Burnside
Journal:  J Mach Learn Res       Date:  2016-12       Impact factor: 3.654

5.  Time-varying Hazards Model for Incorporating Irregularly Measured, High-Dimensional Biomarkers.

Authors:  Xiang Li; Quefeng Li; Donglin Zeng; Karen Marder; Jane Paulsen; Yuanjia Wang
Journal:  Stat Sin       Date:  2020-07       Impact factor: 1.261

6.  IGESS: a statistical approach to integrating individual-level genotype data and summary statistics in genome-wide association studies.

Authors:  Mingwei Dai; Jingsi Ming; Mingxuan Cai; Jin Liu; Can Yang; Xiang Wan; Zongben Xu
Journal:  Bioinformatics       Date:  2017-09-15       Impact factor: 6.937

7.  Molecular pathway identification using biological network-regularized logistic models.

Authors:  Wen Zhang; Ying-Wooi Wan; Genevera I Allen; Kaifang Pang; Matthew L Anderson; Zhandong Liu
Journal:  BMC Genomics       Date:  2013-12-09       Impact factor: 3.969

8.  Signatures for mass spectrometry data quality.

Authors:  Brett G Amidan; Daniel J Orton; Brian L Lamarche; Matthew E Monroe; Ronald J Moore; Alexander M Venzin; Richard D Smith; Landon H Sego; Mark F Tardiff; Samuel H Payne
Journal:  J Proteome Res       Date:  2014-03-24       Impact factor: 4.466

9.  Correspondence between fMRI and SNP data by group sparse canonical correlation analysis.

Authors:  Dongdong Lin; Vince D Calhoun; Yu-Ping Wang
Journal:  Med Image Anal       Date:  2013-10-31       Impact factor: 8.545

10.  Efficient network-guided multi-locus association mapping with graph cuts.

Authors:  Chloé-Agathe Azencott; Dominik Grimm; Mahito Sugiyama; Yoshinobu Kawahara; Karsten M Borgwardt
Journal:  Bioinformatics       Date:  2013-07-01       Impact factor: 6.937
