Literature DB >> 31545341

A fast divide-and-conquer sparse Cox regression.

Yan Wang1,2, Chuan Hong3, Nathan Palmer3, Qian Di1, Joel Schwartz1, Isaac Kohane3, Tianxi Cai2,3.   

Abstract

We propose a computationally and statistically efficient divide-and-conquer (DAC) algorithm to fit sparse Cox regression to massive datasets where the sample size $n_0$ is exceedingly large and the covariate dimension $p$ is not small but $n_0\gg p$. The proposed algorithm achieves computational efficiency through a one-step linear approximation followed by a least square approximation to the partial likelihood (PL). These sequences of linearization enable us to maximize the PL with only a small subset and perform penalized estimation via a fast approximation to the PL. The algorithm is applicable for the analysis of both time-independent and time-dependent survival data. Simulations suggest that the proposed DAC algorithm substantially outperforms the full sample-based estimators and the existing DAC algorithm with respect to the computational speed, while it achieves similar statistical efficiency as the full sample-based estimators. The proposed algorithm was applied to extraordinarily large survival datasets for the prediction of heart failure-specific readmission within 30 days among Medicare heart failure patients.
© The Author 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Keywords:  Cox proportional hazards model; Distributed learning; Divide-and-conquer; Least square approximation; Shrinkage estimation; Variable selection

Year:  2021        PMID: 31545341      PMCID: PMC8036003          DOI: 10.1093/biostatistics/kxz036

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  17 in total

1.  L1 penalized estimation in the Cox proportional hazards model.

Authors:  Jelle J Goeman
Journal:  Biom J       Date:  2010-02       Impact factor: 2.207

2.  Association of Short-term Exposure to Air Pollution With Mortality in Older Adults.

Authors:  Qian Di; Lingzhen Dai; Yun Wang; Antonella Zanobetti; Christine Choirat; Joel D Schwartz; Francesca Dominici
Journal:  JAMA       Date:  2017-12-26       Impact factor: 56.272

3.  Prediction of hospital readmission for heart failure: development of a simple risk score based on administrative data.

Authors:  E F Philbin; T G DiSalvo
Journal:  J Am Coll Cardiol       Date:  1999-05       Impact factor: 24.094

4.  COORDINATE DESCENT ALGORITHMS FOR NONCONVEX PENALIZED REGRESSION, WITH APPLICATIONS TO BIOLOGICAL FEATURE SELECTION.

Authors:  Patrick Breheny; Jian Huang
Journal:  Ann Appl Stat       Date:  2011-01-01       Impact factor: 2.083

5.  Coding algorithms for defining comorbidities in ICD-9-CM and ICD-10 administrative data.

Authors:  Hude Quan; Vijaya Sundararajan; Patricia Halfon; Andrew Fong; Bernard Burnand; Jean-Christophe Luthi; L Duncan Saunders; Cynthia A Beck; Thomas E Feasby; William A Ghali
Journal:  Med Care       Date:  2005-11       Impact factor: 2.983

6.  Trends in heart failure incidence and survival in a community-based population.

Authors:  Véronique L Roger; Susan A Weston; Margaret M Redfield; Jens P Hellermann-Homan; Jill Killian; Barbara P Yawn; Steven J Jacobsen
Journal:  JAMA       Date:  2004-07-21       Impact factor: 56.272

7.  Relation of heart failure hospitalization to exposure to fine particulate air pollution.

Authors:  C Arden Pope; Dale G Renlund; Abdallah G Kfoury; Heidi T May; Benjamin D Horne
Journal:  Am J Cardiol       Date:  2008-08-27       Impact factor: 2.778

8.  Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data.

Authors:  Joshua C Denny; Lisa Bastarache; Marylyn D Ritchie; Robert J Carroll; Raquel Zink; Jonathan D Mosley; Julie R Field; Jill M Pulley; Andrea H Ramirez; Erica Bowton; Melissa A Basford; David S Carrell; Peggy L Peissig; Abel N Kho; Jennifer A Pacheco; Luke V Rasmussen; David R Crosslin; Paul K Crane; Jyotishman Pathak; Suzette J Bielinski; Sarah A Pendergrass; Hua Xu; Lucia A Hindorff; Rongling Li; Teri A Manolio; Christopher G Chute; Rex L Chisholm; Eric B Larson; Gail P Jarvik; Murray H Brilliant; Catherine A McCarty; Iftikhar J Kullo; Jonathan L Haines; Dana C Crawford; Daniel R Masys; Dan M Roden
Journal:  Nat Biotechnol       Date:  2013-12       Impact factor: 54.908

9.  Generating survival times to simulate Cox proportional hazards models with time-varying covariates.

Authors:  Peter C Austin
Journal:  Stat Med       Date:  2012-07-04       Impact factor: 2.373

Review 10.  Review and evaluation of penalised regression methods for risk prediction in low-dimensional data with few events.

Authors:  Menelaos Pavlou; Gareth Ambler; Shaun Seaman; Maria De Iorio; Rumana Z Omar
Journal:  Stat Med       Date:  2015-10-29       Impact factor: 2.373

View more
  7 in total

1.  Online Updating of Survival Analysis.

Authors:  Jing Wu; Ming-Hui Chen; Elizabeth D Schifano; Jun Yan
Journal:  J Comput Graph Stat       Date:  2021-03-08       Impact factor: 2.302

2.  Scalable Algorithms for Large Competing Risks Data.

Authors:  Eric S Kawaguchi; Jenny I Shen; Marc A Suchard; Gang Li
Journal:  J Comput Graph Stat       Date:  2020-12-11       Impact factor: 1.884

3.  ODACH: a one-shot distributed algorithm for Cox model with heterogeneous multi-center data.

Authors:  Chongliang Luo; Rui Duan; Adam C Naj; Henry R Kranzler; Jiang Bian; Yong Chen
Journal:  Sci Rep       Date:  2022-04-22       Impact factor: 4.996

4.  Distributed Simultaneous Inference in Generalized Linear Models via Confidence Distribution.

Authors:  Lu Tang; Ling Zhou; Peter X-K Song
Journal:  J Multivar Anal       Date:  2019-11-28       Impact factor: 1.473

5.  Sampling-based estimation for massive survival data with additive hazards model.

Authors:  Lulu Zuo; Haixiang Zhang; HaiYing Wang; Lei Liu
Journal:  Stat Med       Date:  2020-11-03       Impact factor: 2.373

6.  Accurate training of the Cox proportional hazards model on vertically-partitioned data while preserving privacy.

Authors:  Bart Kamphorst; Thomas Rooijakkers; Thijs Veugen; Matteo Cellamare; Daan Knoors
Journal:  BMC Med Inform Decis Mak       Date:  2022-02-24       Impact factor: 2.796

7.  Default risk prediction and feature extraction using a penalized deep neural network.

Authors:  Cunjie Lin; Nan Qiao; Wenli Zhang; Yang Li; Shuangge Ma
Journal:  Stat Comput       Date:  2022-09-15       Impact factor: 2.324

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.