MOTIVATION: The prediction performance of Cox proportional hazard model suffers when there are only few uncensored events in the training data. RESULTS: We propose a Sparse-Group regularized Cox regression method to improve the prediction performance of large-scale and high-dimensional survival data with few observed events. Our approach is applicable when there is one or more other survival responses that 1. has a large number of observed events; 2. share a common set of associated predictors with the rare event response. This scenario is common in the UK Biobank (Sudlow et al., 2015) dataset where records for a large number of common and less prevalent diseases of the same set of individuals are available. By analyzing these responses together, we hope to achieve higher prediction performance than when they are analyzed individually. To make this approach practical for large-scale data, we developed an accelerated proximal gradient optimization algorithm as well as a screening procedure inspired by Qian et al. (2020). AVAILABILITY: https://github.com/rivas-lab/multisnpnet-Cox. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: The prediction performance of Cox proportional hazard model suffers when there are only few uncensored events in the training data. RESULTS: We propose a Sparse-Group regularized Cox regression method to improve the prediction performance of large-scale and high-dimensional survival data with few observed events. Our approach is applicable when there is one or more other survival responses that 1. has a large number of observed events; 2. share a common set of associated predictors with the rare event response. This scenario is common in the UK Biobank (Sudlow et al., 2015) dataset where records for a large number of common and less prevalent diseases of the same set of individuals are available. By analyzing these responses together, we hope to achieve higher prediction performance than when they are analyzed individually. To make this approach practical for large-scale data, we developed an accelerated proximal gradient optimization algorithm as well as a screening procedure inspired by Qian et al. (2020). AVAILABILITY: https://github.com/rivas-lab/multisnpnet-Cox. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Robert Tibshirani; Jacob Bien; Jerome Friedman; Trevor Hastie; Noah Simon; Jonathan Taylor; Ryan J Tibshirani Journal: J R Stat Soc Series B Stat Methodol Date: 2012-03 Impact factor: 4.488
Authors: Junyang Qian; Yosuke Tanigawa; Wenfei Du; Matthew Aguirre; Chris Chang; Robert Tibshirani; Manuel A Rivas; Trevor Hastie Journal: PLoS Genet Date: 2020-10-23 Impact factor: 5.917
Authors: Manuel A Rivas; Matti Pirinen; Donald F Conrad; Monkol Lek; Emily K Tsang; Konrad J Karczewski; Julian B Maller; Kimberly R Kukurba; David S DeLuca; Menachem Fromer; Pedro G Ferreira; Kevin S Smith; Rui Zhang; Fengmei Zhao; Eric Banks; Ryan Poplin; Douglas M Ruderfer; Shaun M Purcell; Taru Tukiainen; Eric V Minikel; Peter D Stenson; David N Cooper; Katharine H Huang; Timothy J Sullivan; Jared Nedzel; Carlos D Bustamante; Jin Billy Li; Mark J Daly; Roderic Guigo; Peter Donnelly; Kristin Ardlie; Michael Sammeth; Emmanouil T Dermitzakis; Mark I McCarthy; Stephen B Montgomery; Tuuli Lappalainen; Daniel G MacArthur Journal: Science Date: 2015-05-08 Impact factor: 47.728
Authors: Cathie Sudlow; John Gallacher; Naomi Allen; Valerie Beral; Paul Burton; John Danesh; Paul Downey; Paul Elliott; Jane Green; Martin Landray; Bette Liu; Paul Matthews; Giok Ong; Jill Pell; Alan Silman; Alan Young; Tim Sprosen; Tim Peakman; Rory Collins Journal: PLoS Med Date: 2015-03-31 Impact factor: 11.069
Authors: Christopher C Chang; Carson C Chow; Laurent Cam Tellier; Shashaank Vattikuti; Shaun M Purcell; James J Lee Journal: Gigascience Date: 2015-02-25 Impact factor: 6.524
Authors: Christopher DeBoever; Yosuke Tanigawa; Malene E Lindholm; Greg McInnes; Adam Lavertu; Erik Ingelsson; Chris Chang; Euan A Ashley; Carlos D Bustamante; Mark J Daly; Manuel A Rivas Journal: Nat Commun Date: 2018-04-24 Impact factor: 14.919
Authors: Nasa Sinnott-Armstrong; Yosuke Tanigawa; Manuel A Rivas; David Amar; Nina Mars; Christian Benner; Matthew Aguirre; Guhan Ram Venkataraman; Michael Wainberg; Hanna M Ollila; Tuomo Kiiskinen; Aki S Havulinna; James P Pirruccello; Junyang Qian; Anna Shcherbina; Fatima Rodriguez; Themistocles L Assimes; Vineeta Agarwala; Robert Tibshirani; Trevor Hastie; Samuli Ripatti; Jonathan K Pritchard; Mark J Daly Journal: Nat Genet Date: 2021-01-18 Impact factor: 38.330