Literature DB >> 24297531

The Stream algorithm: computationally efficient ridge-regression via Bayesian model averaging, and applications to pharmacogenomic prediction of cancer cell line sensitivity.

Elias Chaibub Neto1, In Sock Jang, Stephen H Friend, Adam A Margolin.   

Abstract

Computational efficiency is important for learning algorithms operating in the "large p, small n" setting. In computational biology, the analysis of data sets containing tens of thousands of features ("large p"), but only a few hundred samples ("small n"), is nowadays routine, and regularized regression approaches such as ridge-regression, lasso, and elastic-net are popular choices. In this paper we propose a novel and highly efficient Bayesian inference method for fitting ridge-regression. Our method is fully analytical, and bypasses the need for expensive tuning parameter optimization, via cross-validation, by employing Bayesian model averaging over the grid of tuning parameters. Additional computational efficiency is achieved by adopting the singular value decomposition reparametrization of the ridge-regression model, replacing computationally expensive inversions of large p × p matrices by efficient inversions of small and diagonal n × n matrices. We show in simulation studies and in the analysis of two large cancer cell line data panels that our algorithm achieves slightly better predictive performance than cross-validated ridge-regression while requiring only a fraction of the computation time. Furthermore, in comparisons based on the cell line data sets, our algorithm systematically out-performs the lasso in both predictive performance and computation time, and shows equivalent predictive performance, but considerably smaller computation time, than the elastic-net.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24297531      PMCID: PMC3911888     

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  5 in total

1.  The MicroArray Quality Control (MAQC)-II study of common practices for the development and validation of microarray-based predictive models.

Authors:  Leming Shi; Gregory Campbell; Wendell D Jones; Fabien Campagne; Zhining Wen; Stephen J Walker; Zhenqiang Su; Tzu-Ming Chu; Federico M Goodsaid; Lajos Pusztai; John D Shaughnessy; André Oberthuer; Russell S Thomas; Richard S Paules; Mark Fielden; Bart Barlogie; Weijie Chen; Pan Du; Matthias Fischer; Cesare Furlanello; Brandon D Gallas; Xijin Ge; Dalila B Megherbi; W Fraser Symmans; May D Wang; John Zhang; Hans Bitter; Benedikt Brors; Pierre R Bushel; Max Bylesjo; Minjun Chen; Jie Cheng; Jing Cheng; Jeff Chou; Timothy S Davison; Mauro Delorenzi; Youping Deng; Viswanath Devanarayan; David J Dix; Joaquin Dopazo; Kevin C Dorff; Fathi Elloumi; Jianqing Fan; Shicai Fan; Xiaohui Fan; Hong Fang; Nina Gonzaludo; Kenneth R Hess; Huixiao Hong; Jun Huan; Rafael A Irizarry; Richard Judson; Dilafruz Juraeva; Samir Lababidi; Christophe G Lambert; Li Li; Yanen Li; Zhen Li; Simon M Lin; Guozhen Liu; Edward K Lobenhofer; Jun Luo; Wen Luo; Matthew N McCall; Yuri Nikolsky; Gene A Pennello; Roger G Perkins; Reena Philip; Vlad Popovici; Nathan D Price; Feng Qian; Andreas Scherer; Tieliu Shi; Weiwei Shi; Jaeyun Sung; Danielle Thierry-Mieg; Jean Thierry-Mieg; Venkata Thodima; Johan Trygg; Lakshmi Vishnuvajjala; Sue Jane Wang; Jianping Wu; Yichao Wu; Qian Xie; Waleed A Yousef; Liang Zhang; Xuegong Zhang; Sheng Zhong; Yiming Zhou; Sheng Zhu; Dhivya Arasappan; Wenjun Bao; Anne Bergstrom Lucas; Frank Berthold; Richard J Brennan; Andreas Buness; Jennifer G Catalano; Chang Chang; Rong Chen; Yiyu Cheng; Jian Cui; Wendy Czika; Francesca Demichelis; Xutao Deng; Damir Dosymbekov; Roland Eils; Yang Feng; Jennifer Fostel; Stephanie Fulmer-Smentek; James C Fuscoe; Laurent Gatto; Weigong Ge; Darlene R Goldstein; Li Guo; Donald N Halbert; Jing Han; Stephen C Harris; Christos Hatzis; Damir Herman; Jianping Huang; Roderick V Jensen; Rui Jiang; Charles D Johnson; Giuseppe Jurman; Yvonne Kahlert; Sadik A Khuder; Matthias Kohl; Jianying Li; Li Li; Menglong Li; Quan-Zhen Li; Shao Li; Zhiguang Li; Jie Liu; Ying Liu; Zhichao Liu; Lu Meng; Manuel Madera; Francisco Martinez-Murillo; Ignacio Medina; Joseph Meehan; Kelci Miclaus; Richard A Moffitt; David Montaner; Piali Mukherjee; George J Mulligan; Padraic Neville; Tatiana Nikolskaya; Baitang Ning; Grier P Page; Joel Parker; R Mitchell Parry; Xuejun Peng; Ron L Peterson; John H Phan; Brian Quanz; Yi Ren; Samantha Riccadonna; Alan H Roter; Frank W Samuelson; Martin M Schumacher; Joseph D Shambaugh; Qiang Shi; Richard Shippy; Shengzhu Si; Aaron Smalter; Christos Sotiriou; Mat Soukup; Frank Staedtler; Guido Steiner; Todd H Stokes; Qinglan Sun; Pei-Yi Tan; Rong Tang; Zivana Tezak; Brett Thorn; Marina Tsyganova; Yaron Turpaz; Silvia C Vega; Roberto Visintainer; Juergen von Frese; Charles Wang; Eric Wang; Junwei Wang; Wei Wang; Frank Westermann; James C Willey; Matthew Woods; Shujian Wu; Nianqing Xiao; Joshua Xu; Lei Xu; Lun Yang; Xiao Zeng; Jialu Zhang; Li Zhang; Min Zhang; Chen Zhao; Raj K Puri; Uwe Scherf; Weida Tong; Russell D Wolfinger
Journal:  Nat Biotechnol       Date:  2010-07-30       Impact factor: 54.908

2.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

3.  The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity.

Authors:  Jordi Barretina; Giordano Caponigro; Nicolas Stransky; Kavitha Venkatesan; Adam A Margolin; Sungjoon Kim; Christopher J Wilson; Joseph Lehár; Gregory V Kryukov; Dmitriy Sonkin; Anupama Reddy; Manway Liu; Lauren Murray; Michael F Berger; John E Monahan; Paula Morais; Jodi Meltzer; Adam Korejwa; Judit Jané-Valbuena; Felipa A Mapa; Joseph Thibault; Eva Bric-Furlong; Pichai Raman; Aaron Shipway; Ingo H Engels; Jill Cheng; Guoying K Yu; Jianjun Yu; Peter Aspesi; Melanie de Silva; Kalpana Jagtap; Michael D Jones; Li Wang; Charles Hatton; Emanuele Palescandolo; Supriya Gupta; Scott Mahan; Carrie Sougnez; Robert C Onofrio; Ted Liefeld; Laura MacConaill; Wendy Winckler; Michael Reich; Nanxin Li; Jill P Mesirov; Stacey B Gabriel; Gad Getz; Kristin Ardlie; Vivien Chan; Vic E Myer; Barbara L Weber; Jeff Porter; Markus Warmuth; Peter Finan; Jennifer L Harris; Matthew Meyerson; Todd R Golub; Michael P Morrissey; William R Sellers; Robert Schlegel; Levi A Garraway
Journal:  Nature       Date:  2012-03-28       Impact factor: 49.962

4.  Systematic identification of genomic markers of drug sensitivity in cancer cells.

Authors:  Mathew J Garnett; Elena J Edelman; Sonja J Heidorn; Chris D Greenman; Anahita Dastur; King Wai Lau; Patricia Greninger; I Richard Thompson; Xi Luo; Jorge Soares; Qingsong Liu; Francesco Iorio; Didier Surdez; Li Chen; Randy J Milano; Graham R Bignell; Ah T Tam; Helen Davies; Jesse A Stevenson; Syd Barthorpe; Stephen R Lutz; Fiona Kogera; Karl Lawrence; Anne McLaren-Douglas; Xeni Mitropoulos; Tatiana Mironenko; Helen Thi; Laura Richardson; Wenjun Zhou; Frances Jewitt; Tinghu Zhang; Patrick O'Brien; Jessica L Boisvert; Stacey Price; Wooyoung Hur; Wanjuan Yang; Xianming Deng; Adam Butler; Hwan Geun Choi; Jae Won Chang; Jose Baselga; Ivan Stamenkovic; Jeffrey A Engelman; Sreenath V Sharma; Olivier Delattre; Julio Saez-Rodriguez; Nathanael S Gray; Jeffrey Settleman; P Andrew Futreal; Daniel A Haber; Michael R Stratton; Sridhar Ramaswamy; Ultan McDermott; Cyril H Benes
Journal:  Nature       Date:  2012-03-28       Impact factor: 49.962

5.  Profiling critical cancer gene mutations in clinical tumor samples.

Authors:  Laura E MacConaill; Catarina D Campbell; Sarah M Kehoe; Adam J Bass; Charles Hatton; Lili Niu; Matt Davis; Keluo Yao; Megan Hanna; Chandrani Mondal; Lauren Luongo; Caroline M Emery; Alissa C Baker; Juliet Philips; Deborah J Goff; Michelangelo Fiorentino; Mark A Rubin; Kornelia Polyak; Jennifer Chan; Yuexiang Wang; Jonathan A Fletcher; Sandro Santagata; Gianni Corso; Franco Roviello; Ramesh Shivdasani; Mark W Kieran; Keith L Ligon; Charles D Stiles; William C Hahn; Matthew L Meyerson; Levi A Garraway
Journal:  PLoS One       Date:  2009-11-18       Impact factor: 3.240

  5 in total
  10 in total

1.  Estimation of QT interval prolongation through model-averaging.

Authors:  Peter L Bonate
Journal:  J Pharmacokinet Pharmacodyn       Date:  2017-04-18       Impact factor: 2.745

2.  Comprehensive anticancer drug response prediction based on a simple cell line-drug complex network model.

Authors:  Dong Wei; Chuanying Liu; Xiaoqi Zheng; Yushuang Li
Journal:  BMC Bioinformatics       Date:  2019-01-22       Impact factor: 3.169

3.  Evaluating the consistency of large-scale pharmacogenomic studies.

Authors:  Raziur Rahman; Saugato Rahman Dhruba; Kevin Matlock; Carlos De-Niz; Souparno Ghosh; Ranadip Pal
Journal:  Brief Bioinform       Date:  2019-09-27       Impact factor: 11.622

4.  Stepwise group sparse regression (SGSR): gene-set-based pharmacogenomic predictive models with stepwise selection of functional priors.

Authors:  In Sock Jang; Rodrigo Dienstmann; Adam A Margolin; Justin Guinney
Journal:  Pac Symp Biocomput       Date:  2015

5.  Ensembled machine learning framework for drug sensitivity prediction.

Authors:  Aman Sharma; Rinkle Rani
Journal:  IET Syst Biol       Date:  2020-02       Impact factor: 1.615

6.  An overview of machine learning methods for monotherapy drug response prediction.

Authors:  Farzaneh Firoozbakht; Behnam Yousefi; Benno Schwikowski
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

7.  Drug susceptibility prediction against a panel of drugs using kernelized Bayesian multitask learning.

Authors:  Mehmet Gönen; Adam A Margolin
Journal:  Bioinformatics       Date:  2014-09-01       Impact factor: 6.937

8.  Simulation studies as designed experiments: the comparison of penalized regression models in the "large p, small n" setting.

Authors:  Elias Chaibub Neto; J Christopher Bare; Adam A Margolin
Journal:  PLoS One       Date:  2014-10-07       Impact factor: 3.240

Review 9.  Computational models for predicting drug responses in cancer research.

Authors:  Francisco Azuaje
Journal:  Brief Bioinform       Date:  2017-09-01       Impact factor: 11.622

10.  Integrated Genomic, Epigenomic, and Expression Analyses of Ovarian Cancer Cell Lines.

Authors:  Eniko Papp; Dorothy Hallberg; Gottfried E Konecny; Daniel C Bruhm; Vilmos Adleff; Michaël Noë; Ioannis Kagiampakis; Doreen Palsgrove; Dylan Conklin; Yasuto Kinose; James R White; Michael F Press; Ronny Drapkin; Hariharan Easwaran; Stephen B Baylin; Dennis Slamon; Victor E Velculescu; Robert B Scharpf
Journal:  Cell Rep       Date:  2018-11-27       Impact factor: 9.423

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.