Literature DB >> 20122189

Identifying main effects and epistatic interactions from large-scale SNP data via adaptive group Lasso.

Can Yang1, Xiang Wan, Qiang Yang, Hong Xue, Weichuan Yu.   

Abstract

BACKGROUND: Single nucleotide polymorphism (SNP) based association studies aim at identifying SNPs associated with phenotypes, for example, complex diseases. The associated SNPs may influence the disease risk individually (main effects) or behave jointly (epistatic interactions). For the analysis of high throughput data, the main difficulty is that the number of SNPs far exceeds the number of samples. This difficulty is amplified when identifying interactions.
RESULTS: In this paper, we propose an Adaptive Group Lasso (AGL) model for large-scale association studies. Our model enables us to analyze SNPs and their interactions simultaneously. We achieve this by introducing a sparsity constraint in our model based on the fact that only a small fraction of SNPs is disease-associated. In order to reduce the number of false positive findings, we develop an adaptive reweighting scheme to enhance sparsity. In addition, our method treats SNPs and their interactions as factors, and identifies them in a grouped manner. Thus, it is flexible to analyze various disease models, especially for interaction detection. However, due to the intensive computation when millions of interaction terms needs to be searched in the model fitting, our method needs to combined with some filtering methods when applied to genome-wide data for detecting interactions.
CONCLUSION: By using a wide range of simulated datasets and a real dataset from WTCCC, we demonstrate the advantages of our method.

Entities:  

Mesh:

Year:  2010        PMID: 20122189      PMCID: PMC3203332          DOI: 10.1186/1471-2105-11-S1-S18

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  22 in total

1.  A combinatorial partitioning method to identify multilocus genotypic partitions that predict quantitative trait variation.

Authors:  M R Nelson; S L Kardia; R E Ferrell; C F Sing
Journal:  Genome Res       Date:  2001-03       Impact factor: 9.043

2.  A complete enumeration and classification of two-locus disease models.

Authors:  W Li; J Reich
Journal:  Hum Hered       Date:  2000 Nov-Dec       Impact factor: 0.444

3.  A balanced accuracy function for epistasis modeling in imbalanced datasets using multifactor dimensionality reduction.

Authors:  Digna R Velez; Bill C White; Alison A Motsinger; William S Bush; Marylyn D Ritchie; Scott M Williams; Jason H Moore
Journal:  Genet Epidemiol       Date:  2007-05       Impact factor: 2.135

4.  Pathway-based approaches for analysis of genomewide association studies.

Authors:  Kai Wang; Mingyao Li; Maja Bucan
Journal:  Am J Hum Genet       Date:  2007-12       Impact factor: 11.025

5.  SNPHarvester: a filtering-based approach for detecting epistatic interactions in genome-wide association studies.

Authors:  Can Yang; Zengyou He; Xiang Wan; Qiang Yang; Hong Xue; Weichuan Yu
Journal:  Bioinformatics       Date:  2008-12-19       Impact factor: 6.937

6.  Genome-wide association analysis by lasso penalized logistic regression.

Authors:  Tong Tong Wu; Yi Fang Chen; Trevor Hastie; Eric Sobel; Kenneth Lange
Journal:  Bioinformatics       Date:  2009-01-28       Impact factor: 6.937

7.  Association of genetic variations of genes encoding thrombospondin, type 1, domain-containing 4 and 7A with low bone mineral density in Japanese women with osteoporosis.

Authors:  Seijiro Mori; Ikuyo Kou; Hidenori Sato; Mitsuru Emi; Hideki Ito; Takayuki Hosoi; Shiro Ikegawa
Journal:  J Hum Genet       Date:  2008-05-17       Impact factor: 3.172

Review 8.  Detecting gene-gene interactions that underlie human diseases.

Authors:  Heather J Cordell
Journal:  Nat Rev Genet       Date:  2009-06       Impact factor: 53.242

9.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls.

Authors: 
Journal:  Nature       Date:  2007-06-07       Impact factor: 49.962

10.  Simultaneous analysis of all SNPs in genome-wide and re-sequencing association studies.

Authors:  Clive J Hoggart; John C Whittaker; Maria De Iorio; David J Balding
Journal:  PLoS Genet       Date:  2008-07-25       Impact factor: 5.917

View more
  19 in total

1.  A FAST ALGORITHM FOR DETECTING GENE-GENE INTERACTIONS IN GENOME-WIDE ASSOCIATION STUDIES.

Authors:  Jiahan Li; Wei Zhong; Runze Li; Rongling Wu
Journal:  Ann Appl Stat       Date:  2014       Impact factor: 2.083

2.  A comparison of multifactor dimensionality reduction and L1-penalized regression to identify gene-gene interactions in genetic association studies.

Authors:  Stacey Winham; Chong Wang; Alison A Motsinger-Reif
Journal:  Stat Appl Genet Mol Biol       Date:  2011-01-06

3.  PoCos: Population Covering Locus Sets for Risk Assessment in Complex Diseases.

Authors:  Marzieh Ayati; Mehmet Koyutürk
Journal:  PLoS Comput Biol       Date:  2016-11-11       Impact factor: 4.475

4.  Fine-mapping additive and dominant SNP effects using group-LASSO and fractional resample model averaging.

Authors:  Jeremy Sabourin; Andrew B Nobel; William Valdar
Journal:  Genet Epidemiol       Date:  2014-11-21       Impact factor: 2.135

5.  A penalized robust semiparametric approach for gene-environment interactions.

Authors:  Cen Wu; Xingjie Shi; Yuehua Cui; Shuangge Ma
Journal:  Stat Med       Date:  2015-08-03       Impact factor: 2.373

6.  PUMA: a unified framework for penalized multiple regression analysis of GWAS data.

Authors:  Gabriel E Hoffman; Benjamin A Logsdon; Jason G Mezey
Journal:  PLoS Comput Biol       Date:  2013-06-27       Impact factor: 4.475

7.  Genome-wide interaction-based association analysis identified multiple new susceptibility Loci for common diseases.

Authors:  Yang Liu; Haiming Xu; Suchao Chen; Xianfeng Chen; Zhenguo Zhang; Zhihong Zhu; Xueying Qin; Landian Hu; Jun Zhu; Guo-Ping Zhao; Xiangyin Kong
Journal:  PLoS Genet       Date:  2011-03-17       Impact factor: 5.917

8.  Comparative analysis of methods for detecting interacting loci.

Authors:  Li Chen; Guoqiang Yu; Carl D Langefeld; David J Miller; Richard T Guy; Jayaram Raghuram; Xiguo Yuan; David M Herrington; Yue Wang
Journal:  BMC Genomics       Date:  2011-07-05       Impact factor: 3.969

Review 9.  Sparse models for correlative and integrative analysis of imaging and genetic data.

Authors:  Dongdong Lin; Hongbao Cao; Vince D Calhoun; Yu-Ping Wang
Journal:  J Neurosci Methods       Date:  2014-09-09       Impact factor: 2.390

10.  eQTL Epistasis - Challenges and Computational Approaches.

Authors:  Yang Huang; Stefan Wuchty; Teresa M Przytycka
Journal:  Front Genet       Date:  2013-05-31       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.