Literature DB >> 29077795

Group spike-and-slab lasso generalized linear models for disease prediction and associated genes detection by incorporating pathway information.

Zaixiang Tang1,2,3,4, Yueping Shen1,2, Yan Li4, Xinyan Zhang4, Jia Wen5, Chen'ao Qian6, Wenzhuo Zhuang7, Xinghua Shi5, Nengjun Yi4.   

Abstract

Motivation: Large-scale molecular data have been increasingly used as an important resource for prognostic prediction of diseases and detection of associated genes. However, standard approaches for omics data analysis ignore the group structure among genes encoded in functional relationships or pathway information.
Results: We propose new Bayesian hierarchical generalized linear models, called group spike-and-slab lasso GLMs, for predicting disease outcomes and detecting associated genes by incorporating large-scale molecular data and group structures. The proposed model employs a mixture double-exponential prior for coefficients that induces self-adaptive shrinkage amount on different coefficients. The group information is incorporated into the model by setting group-specific parameters. We have developed a fast and stable deterministic algorithm to fit the proposed hierarchal GLMs, which can perform variable selection within groups. We assess the performance of the proposed method on several simulated scenarios, by varying the overlap among groups, group size, number of non-null groups, and the correlation within group. Compared with existing methods, the proposed method provides not only more accurate estimates of the parameters but also better prediction. We further demonstrate the application of the proposed procedure on three cancer datasets by utilizing pathway structures of genes. Our results show that the proposed method generates powerful models for predicting disease outcomes and detecting associated genes. Availability and implementation: The methods have been implemented in a freely available R package BhGLM (http://www.ssg.uab.edu/bhglm/). Contact: nyi@uab.edu. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2018        PMID: 29077795      PMCID: PMC5860634          DOI: 10.1093/bioinformatics/btx684

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  30 in total

1.  clusterProfiler: an R package for comparing biological themes among gene clusters.

Authors:  Guangchuang Yu; Li-Gen Wang; Yanyan Han; Qing-Yu He
Journal:  OMICS       Date:  2012-03-28

2.  Pre-validation and inference in microarrays.

Authors:  Robert J Tibshirani; Brad Efron
Journal:  Stat Appl Genet Mol Biol       Date:  2002-08-22

3.  Agglomerative joint clustering of metabolic data with spike at zero: A Bayesian perspective.

Authors:  Vahid Partovi Nia; Mostafa Ghannad-Rezaie
Journal:  Biom J       Date:  2015-06-22       Impact factor: 2.207

4.  The group exponential lasso for bi-level variable selection.

Authors:  Patrick Breheny
Journal:  Biometrics       Date:  2015-03-13       Impact factor: 2.571

5.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

6.  Fast identification of biological pathways associated with a quantitative trait using group lasso with overlaps.

Authors:  Matt Silver; Giovanni Montana
Journal:  Stat Appl Genet Mol Biol       Date:  2012-01-06

7.  The spike-and-slab lasso Cox model for survival prediction and associated genes detection.

Authors:  Zaixiang Tang; Yueping Shen; Xinyan Zhang; Nengjun Yi
Journal:  Bioinformatics       Date:  2017-09-15       Impact factor: 6.937

8.  A systematic evaluation of high-dimensional, ensemble-based regression for exploring large model spaces in microbiome analyses.

Authors:  Jyoti Shankar; Sebastian Szpakowski; Norma V Solis; Stephanie Mounaud; Hong Liu; Liliana Losada; William C Nierman; Scott G Filler
Journal:  BMC Bioinformatics       Date:  2015-02-01       Impact factor: 3.169

9.  Assessing the clinical utility of cancer genomic and proteomic data across tumor types.

Authors:  Yuan Yuan; Eliezer M Van Allen; Larsson Omberg; Nikhil Wagle; Ali Amin-Mansour; Artem Sokolov; Lauren A Byers; Yanxun Xu; Kenneth R Hess; Lixia Diao; Leng Han; Xuelin Huang; Michael S Lawrence; John N Weinstein; Josh M Stuart; Gordon B Mills; Levi A Garraway; Adam A Margolin; Gad Getz; Han Liang
Journal:  Nat Biotechnol       Date:  2014-06-22       Impact factor: 54.908

10.  Overlapping Group Logistic Regression with Applications to Genetic Pathway Selection.

Authors:  Yaohui Zeng; Patrick Breheny
Journal:  Cancer Inform       Date:  2016-09-15
View more
  7 in total

1.  BhGLM: Bayesian hierarchical GLMs and survival models, with applications to genomics and epidemiology.

Authors:  Nengjun Yi; Zaixiang Tang; Xinyan Zhang; Boyi Guo
Journal:  Bioinformatics       Date:  2019-04-15       Impact factor: 6.937

2.  WNT pathway signaling is associated with microvascular injury and predicts kidney transplant failure.

Authors:  Michael E Seifert; Joseph P Gaut; Boyi Guo; Sanjay Jain; Andrew F Malone; Feargal Geraghty; Deborah L Della Manna; Eddy S Yang; Nengjun Yi; Daniel C Brennan; Roslyn B Mannon
Journal:  Am J Transplant       Date:  2019-05-10       Impact factor: 8.086

3.  How Can Gene-Expression Information Improve Prognostic Prediction in TCGA Cancers: An Empirical Comparison Study on Regularization and Mixed Cox Models.

Authors:  Xinghao Yu; Ting Wang; Shuiping Huang; Ping Zeng
Journal:  Front Genet       Date:  2020-08-21       Impact factor: 4.599

4.  Jackknife Model Averaging Prediction Methods for Complex Phenotypes with Gene Expression Levels by Integrating External Pathway Information.

Authors:  Xinghao Yu; Lishun Xiao; Ping Zeng; Shuiping Huang
Journal:  Comput Math Methods Med       Date:  2019-04-08       Impact factor: 2.238

5.  Structured Genome-Wide Association Studies with Bayesian Hierarchical Variable Selection.

Authors:  Yize Zhao; Hongtu Zhu; Zhaohua Lu; Rebecca C Knickmeyer; Fei Zou
Journal:  Genetics       Date:  2019-04-22       Impact factor: 4.562

6.  Predicting Grating Orientations With Cross-Frequency Coupling and Least Absolute Shrinkage and Selection Operator in V1 and V4 of Rhesus Monkeys.

Authors:  Zhaohui Li; Yue Du; Youben Xiao; Liyong Yin
Journal:  Front Comput Neurosci       Date:  2021-01-25       Impact factor: 2.380

7.  Overlapping group screening for detection of gene-gene interactions: application to gene expression profiles with survival trait.

Authors:  Jie-Huei Wang; Yi-Hau Chen
Journal:  BMC Bioinformatics       Date:  2018-09-21       Impact factor: 3.169

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.