Literature DB >> 27799277

The Spike-and-Slab Lasso Generalized Linear Models for Prediction and Associated Genes Detection.

Zaixiang Tang1,2,3, Yueping Shen1,2, Xinyan Zhang3, Nengjun Yi4.   

Abstract

Large-scale "omics" data have been increasingly used as an important resource for prognostic prediction of diseases and detection of associated genes. However, there are considerable challenges in analyzing high-dimensional molecular data, including the large number of potential molecular predictors, limited number of samples, and small effect of each predictor. We propose new Bayesian hierarchical generalized linear models, called spike-and-slab lasso GLMs, for prognostic prediction and detection of associated genes using large-scale molecular data. The proposed model employs a spike-and-slab mixture double-exponential prior for coefficients that can induce weak shrinkage on large coefficients, and strong shrinkage on irrelevant coefficients. We have developed a fast and stable algorithm to fit large-scale hierarchal GLMs by incorporating expectation-maximization (EM) steps into the fast cyclic coordinate descent algorithm. The proposed approach integrates nice features of two popular methods, i.e., penalized lasso and Bayesian spike-and-slab variable selection. The performance of the proposed method is assessed via extensive simulation studies. The results show that the proposed approach can provide not only more accurate estimates of the parameters, but also better prediction. We demonstrate the proposed procedure on two cancer data sets: a well-known breast cancer data set consisting of 295 tumors, and expression data of 4919 genes; and the ovarian cancer data set from TCGA with 362 tumors, and expression data of 5336 genes. Our analyses show that the proposed procedure can generate powerful models for predicting outcomes and detecting associated genes. The methods have been implemented in a freely available R package BhGLM (http://www.ssg.uab.edu/bhglm/).
Copyright © 2017 by the Genetics Society of America.

Entities:  

Keywords:  cancer; double-exponential distribution; generalized linear model; outcome prediction; spike-and-slab lasso

Mesh:

Year:  2016        PMID: 27799277      PMCID: PMC5223525          DOI: 10.1534/genetics.116.192195

Source DB:  PubMed          Journal:  Genetics        ISSN: 0016-6731            Impact factor:   4.562


  25 in total

1.  Pre-validation and inference in microarrays.

Authors:  Robert J Tibshirani; Brad Efron
Journal:  Stat Appl Genet Mol Biol       Date:  2002-08-22

2.  Predicting survival from microarray data--a comparative study.

Authors:  H M Bøvelstad; S Nygård; H L Størvold; M Aldrin; Ø Borgan; A Frigessi; O C Lingjaerde
Journal:  Bioinformatics       Date:  2007-06-06       Impact factor: 6.937

3.  Agglomerative joint clustering of metabolic data with spike at zero: A Bayesian perspective.

Authors:  Vahid Partovi Nia; Mostafa Ghannad-Rezaie
Journal:  Biom J       Date:  2015-06-22       Impact factor: 2.207

4.  Gene expression profiling predicts clinical outcome of breast cancer.

Authors:  Laura J van 't Veer; Hongyue Dai; Marc J van de Vijver; Yudong D He; Augustinus A M Hart; Mao Mao; Hans L Peterse; Karin van der Kooy; Matthew J Marton; Anke T Witteveen; George J Schreiber; Ron M Kerkhoven; Chris Roberts; Peter S Linsley; René Bernards; Stephen H Friend
Journal:  Nature       Date:  2002-01-31       Impact factor: 49.962

5.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

6.  Super-sparse principal component analyses for high-throughput genomic data.

Authors:  Donghwan Lee; Woojoo Lee; Youngjo Lee; Yudi Pawitan
Journal:  BMC Bioinformatics       Date:  2010-06-02       Impact factor: 3.169

7.  Combining multidimensional genomic measurements for predicting cancer prognosis: observations from TCGA.

Authors:  Qing Zhao; Xingjie Shi; Yang Xie; Jian Huang; BenChang Shia; Shuangge Ma
Journal:  Brief Bioinform       Date:  2014-03-13       Impact factor: 13.994

8.  Classification of microarray data using gene networks.

Authors:  Franck Rapaport; Andrei Zinovyev; Marie Dutreix; Emmanuel Barillot; Jean-Philippe Vert
Journal:  BMC Bioinformatics       Date:  2007-02-01       Impact factor: 3.169

9.  A systematic evaluation of high-dimensional, ensemble-based regression for exploring large model spaces in microbiome analyses.

Authors:  Jyoti Shankar; Sebastian Szpakowski; Norma V Solis; Stephanie Mounaud; Hong Liu; Liliana Losada; William C Nierman; Scott G Filler
Journal:  BMC Bioinformatics       Date:  2015-02-01       Impact factor: 3.169

10.  Assessing the clinical utility of cancer genomic and proteomic data across tumor types.

Authors:  Yuan Yuan; Eliezer M Van Allen; Larsson Omberg; Nikhil Wagle; Ali Amin-Mansour; Artem Sokolov; Lauren A Byers; Yanxun Xu; Kenneth R Hess; Lixia Diao; Leng Han; Xuelin Huang; Michael S Lawrence; John N Weinstein; Josh M Stuart; Gordon B Mills; Levi A Garraway; Adam A Margolin; Gad Getz; Han Liang
Journal:  Nat Biotechnol       Date:  2014-06-22       Impact factor: 54.908

View more
  14 in total

1.  BhGLM: Bayesian hierarchical GLMs and survival models, with applications to genomics and epidemiology.

Authors:  Nengjun Yi; Zaixiang Tang; Xinyan Zhang; Boyi Guo
Journal:  Bioinformatics       Date:  2019-04-15       Impact factor: 6.937

2.  Group spike-and-slab lasso generalized linear models for disease prediction and associated genes detection by incorporating pathway information.

Authors:  Zaixiang Tang; Yueping Shen; Yan Li; Xinyan Zhang; Jia Wen; Chen'ao Qian; Wenzhuo Zhuang; Xinghua Shi; Nengjun Yi
Journal:  Bioinformatics       Date:  2018-03-15       Impact factor: 6.937

3.  RCRdiff: A fully integrated Bayesian method for differential expression analysis using raw NanoString nCounter data.

Authors:  Can Xu; Xinlei Wang; Johan Lim; Guanghua Xiao; Yang Xie
Journal:  Stat Med       Date:  2021-11-12       Impact factor: 2.373

4.  Integration of immune and hypoxia gene signatures improves the prediction of radiosensitivity in breast cancer.

Authors:  Derui Yan; Shang Cai; Lu Bai; Zixuan Du; Huijun Li; Peng Sun; Jianping Cao; Nengjun Yi; Song-Bai Liu; Zaixiang Tang
Journal:  Am J Cancer Res       Date:  2022-03-15       Impact factor: 6.166

5.  Semiparametric Bayesian variable selection for gene-environment interactions.

Authors:  Jie Ren; Fei Zhou; Xiaoxi Li; Qi Chen; Hongmei Zhang; Shuangge Ma; Yu Jiang; Cen Wu
Journal:  Stat Med       Date:  2019-12-21       Impact factor: 2.373

6.  The spike-and-slab lasso Cox model for survival prediction and associated genes detection.

Authors:  Zaixiang Tang; Yueping Shen; Xinyan Zhang; Nengjun Yi
Journal:  Bioinformatics       Date:  2017-09-15       Impact factor: 6.937

Review 7.  The Application of Bayesian Methods in Cancer Prognosis and Prediction.

Authors:  Jiadong Chu; N A Sun; Wei Hu; Xuanli Chen; Nengjun Yi; Yueping Shen
Journal:  Cancer Genomics Proteomics       Date:  2022 Jan-Feb       Impact factor: 4.069

8.  Development and Validation of a Radiosensitivity Prediction Model for Lower Grade Glioma Based on Spike-and-Slab Lasso.

Authors:  Zixuan Du; Shang Cai; Derui Yan; Huijun Li; Xinyan Zhang; Wei Yang; Jianping Cao; Nengjun Yi; Zaixiang Tang
Journal:  Front Oncol       Date:  2021-07-30       Impact factor: 6.244

9.  Simple Quasi-Bayes Approach for Modeling Mean Medical Costs.

Authors:  Grace Yoon; Wenxin Jiang; Lei Liu; Ya-Chen Tina Shih
Journal:  Int J Biostat       Date:  2019-06-05       Impact factor: 1.829

10.  Fast and flexible linear mixed models for genome-wide genetics.

Authors:  Daniel E Runcie; Lorin Crawford
Journal:  PLoS Genet       Date:  2019-02-08       Impact factor: 5.917

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.