Literature DB >> 22944714

Cluster-localized sparse logistic regression for SNP data.

Harald Binder1, Tina Müller, Holger Schwender, Klaus Golka, Michael Steffens, Jan G Hengstler, Katja Ickstadt, Martin Schumacher.   

Abstract

The task of analyzing high-dimensional single nucleotide polymorphism (SNP) data in a case-control design using multivariable techniques has only recently been tackled. While many available approaches investigate only main effects in a high-dimensional setting, we propose a more flexible technique, cluster-localized regression (CLR), based on localized logistic regression models, that allows different SNPs to have an effect for different groups of individuals. Separate multivariable regression models are fitted for the different groups of individuals by incorporating weights into componentwise boosting, which provides simultaneous variable selection, hence sparse fits. For model fitting, these groups of individuals are identified using a clustering approach, where each group may be defined via different SNPs. This allows for representing complex interaction patterns, such as compositional epistasis, that might not be detected by a single main effects model. In a simulation study, the CLR approach results in improved prediction performance, compared to the main effects approach, and identification of important SNPs in several scenarios. Improved prediction performance is also obtained for an application example considering urinary bladder cancer. Some of the identified SNPs are predictive for all individuals, while others are only relevant for a specific group. Together with the sets of SNPs that define the groups, potential interaction patterns are uncovered.

Entities:  

Mesh:

Year:  2012        PMID: 22944714     DOI: 10.1515/1544-6115.1694

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  9 in total

Review 1.  Big data in medical science--a biostatistical view.

Authors:  Harald Binder; Maria Blettner
Journal:  Dtsch Arztebl Int       Date:  2015-02-27       Impact factor: 5.594

2.  A weighting approach for judging the effect of patient strata on high-dimensional risk prediction signatures.

Authors:  Veronika Weyer; Harald Binder
Journal:  BMC Bioinformatics       Date:  2015-09-15       Impact factor: 3.169

3.  Genetic variants confer susceptibility to urinary bladder cancer: an updated list of confirmed polymorphisms.

Authors:  Silvia Selinski
Journal:  EXCLI J       Date:  2012-11-14       Impact factor: 4.068

4.  Identifying Prognostic SNPs in Clinical Cohorts: Complementing Univariate Analyses by Resampling and Multivariable Modeling.

Authors:  Stefanie Hieke; Axel Benner; Richard F Schlenk; Martin Schumacher; Lars Bullinger; Harald Binder
Journal:  PLoS One       Date:  2016-05-09       Impact factor: 3.240

5.  Model-based optimization of subgroup weights for survival analysis.

Authors:  Jakob Richter; Katrin Madjar; Jörg Rahnenführer
Journal:  Bioinformatics       Date:  2019-07-15       Impact factor: 6.937

6.  Weighted Cox regression for the prediction of heterogeneous patient subgroups.

Authors:  Katrin Madjar; Jörg Rahnenführer
Journal:  BMC Med Inform Decis Mak       Date:  2021-12-07       Impact factor: 2.796

7.  The post GWAS era: strategies to identify gene-gene and gene-environment interactions in urinary bladder cancer.

Authors:  Silvia Selinski
Journal:  EXCLI J       Date:  2014-11-03       Impact factor: 4.068

8.  Relevance of genetic disposition versus environmental exposure for cancer risk: an old controversy revisited with novel methods.

Authors:  H M Bolt
Journal:  EXCLI J       Date:  2013-01-28       Impact factor: 4.068

9.  Interaction of genetic variants towards increased cancer risk.

Authors:  Seddik Hammad
Journal:  EXCLI J       Date:  2013-07-09       Impact factor: 4.068

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.