A fast divide-and-conquer sparse Cox regression.

Yan Wang¹,², Chuan Hong³, Nathan Palmer³, Qian Di¹, Joel Schwartz¹, Isaac Kohane³, Tianxi Cai²,³.

Abstract

We propose a computationally and statistically efficient divide-and-conquer (DAC) algorithm to fit sparse Cox regression to massive datasets where the sample size $n_0$ is exceedingly large and the covariate dimension $p$ is not small but $n_0\gg p$. The proposed algorithm achieves computational efficiency through a one-step linear approximation followed by a least square approximation to the partial likelihood (PL). This sequence of linearizations enables us to maximize the PL with only a small subset and to perform penalized estimation via a fast approximation to the PL. The algorithm is applicable to the analysis of both time-independent and time-dependent survival data. Simulations suggest that the proposed DAC algorithm substantially outperforms the full sample-based estimators and the existing DAC algorithm in computational speed, while achieving statistical efficiency similar to that of the full sample-based estimators. The proposed algorithm was applied to extraordinarily large survival datasets for the prediction of heart failure-specific readmission within 30 days among Medicare heart failure patients.
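
The abstract names three ingredients: a PL maximization on a small subset, a one-step linear approximation, and a least square approximation used for penalized estimation. The following is a minimal Python sketch of how such a pipeline could be arranged, not the authors' implementation. It assumes (these details are not in the abstract) that the data are split into equal blocks, that block-wise scores and information matrices are simply summed, that the penalty is an adaptive LASSO solved by coordinate descent, and that event times are untied. All names (`cox_score_info`, `solve`, `dac_sparse_cox`, `simulate`) and defaults (`n_blocks`, `lam`, `n_updates`) are hypothetical.

```python
import math
import random

def cox_score_info(rows, beta):
    """Score and observed information of the Cox log partial likelihood.

    rows: list of (time, event, x) tuples; tied event times are not corrected for.
    """
    p = len(beta)
    s0, s1 = 0.0, [0.0] * p
    s2 = [[0.0] * p for _ in range(p)]
    score = [0.0] * p
    info = [[0.0] * p for _ in range(p)]
    # Visit subjects from latest to earliest so the running sums span the risk set.
    for _, event, x in sorted(rows, key=lambda r: -r[0]):
        w = math.exp(sum(b * v for b, v in zip(beta, x)))
        s0 += w
        for j in range(p):
            s1[j] += w * x[j]
            for k in range(p):
                s2[j][k] += w * x[j] * x[k]
        if event:
            for j in range(p):
                score[j] += x[j] - s1[j] / s0
                for k in range(p):
                    info[j][k] += s2[j][k] / s0 - s1[j] * s1[k] / s0 ** 2
    return score, info

def solve(A, b):
    """Solve A z = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        piv = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[piv] = M[piv], M[c]
        for r in range(n):
            if r != c:
                f = M[r][c] / M[c][c]
                M[r] = [a - f * m for a, m in zip(M[r], M[c])]
    return [M[i][n] / M[i][i] for i in range(n)]

def dac_sparse_cox(rows, n_blocks=8, lam=0.01, n_updates=2):
    """Hypothetical divide-and-conquer sparse Cox fit (illustrative sketch only)."""
    n, p = len(rows), len(rows[0][2])
    blocks = [rows[i::n_blocks] for i in range(n_blocks)]

    def pooled(beta):
        # Sum the block-wise scores and information matrices.
        U, A = [0.0] * p, [[0.0] * p for _ in range(p)]
        for block in blocks:
            u, a = cox_score_info(block, beta)
            U = [s + t for s, t in zip(U, u)]
            A = [[s + t for s, t in zip(ra, rb)] for ra, rb in zip(A, a)]
        return U, A

    # Step 1: maximize the PL on a single block by Newton-Raphson.
    beta = [0.0] * p
    for _ in range(25):
        u, a = cox_score_info(blocks[0], beta)
        step = solve(a, u)
        beta = [b + s for b, s in zip(beta, step)]
        if max(abs(s) for s in step) < 1e-8:
            break
    # Step 2: one-step linear updates that use every block.
    for _ in range(n_updates):
        U, A = pooled(beta)
        beta = [b + s for b, s in zip(beta, solve(A, U))]
    # Step 3: least square approximation with adaptive-LASSO weights, i.e. minimize
    # 0.5 * (b - beta)' A (b - beta) + lam * sum_j |b_j| / |beta_j| by coordinate descent.
    _, A = pooled(beta)
    A = [[v / n for v in row] for row in A]
    b = beta[:]
    for _ in range(200):
        for j in range(p):
            cross = sum(A[j][k] * (b[k] - beta[k]) for k in range(p) if k != j)
            r = A[j][j] * beta[j] - cross
            thr = lam / abs(beta[j])
            b[j] = math.copysign(max(abs(r) - thr, 0.0), r) / A[j][j]
    return beta, b

def simulate(n, beta, seed=0):
    """Exponential survival times with independent exponential censoring."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        x = [rng.gauss(0, 1) for _ in beta]
        t = rng.expovariate(math.exp(sum(b * v for b, v in zip(beta, x))))
        c = rng.expovariate(0.3)
        rows.append((min(t, c), t <= c, x))
    return rows
```

Calling `dac_sparse_cox(simulate(4000, [0.8, -0.5, 0.0, 0.0, 0.0]))` returns the unpenalized pooled estimate and its sparse counterpart. The point of the design is that the iterative PL maximization touches only one block, every other block is visited a fixed number of times, and the penalized step works on a $p \times p$ quadratic form rather than on the raw data.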

Keywords: Cox proportional hazards model; Distributed learning; Divide-and-conquer; Least square approximation; Shrinkage estimation; Variable selection

Year: 2021 PMID: 31545341 PMCID: PMC8036003 DOI: 10.1093/biostatistics/kxz036

Source DB: PubMed Journal: Biostatistics ISSN: 1465-4644 Impact factor: 5.899

17 in total

1. L1 penalized estimation in the Cox proportional hazards model.

Authors: Jelle J Goeman
Journal: Biom J Date: 2010-02 Impact factor: 2.207

2. Association of Short-term Exposure to Air Pollution With Mortality in Older Adults.

Authors: Qian Di; Lingzhen Dai; Yun Wang; Antonella Zanobetti; Christine Choirat; Joel D Schwartz; Francesca Dominici
Journal: JAMA Date: 2017-12-26 Impact factor: 56.272

3. Prediction of hospital readmission for heart failure: development of a simple risk score based on administrative data.

Authors: E F Philbin; T G DiSalvo
Journal: J Am Coll Cardiol Date: 1999-05 Impact factor: 24.094

4. Coordinate descent algorithms for nonconvex penalized regression, with applications to biological feature selection.

Authors: Patrick Breheny; Jian Huang
Journal: Ann Appl Stat Date: 2011-01-01 Impact factor: 2.083

5. Coding algorithms for defining comorbidities in ICD-9-CM and ICD-10 administrative data.

Authors: Hude Quan; Vijaya Sundararajan; Patricia Halfon; Andrew Fong; Bernard Burnand; Jean-Christophe Luthi; L Duncan Saunders; Cynthia A Beck; Thomas E Feasby; William A Ghali
Journal: Med Care Date: 2005-11 Impact factor: 2.983

6. Trends in heart failure incidence and survival in a community-based population.

Authors: Véronique L Roger; Susan A Weston; Margaret M Redfield; Jens P Hellermann-Homan; Jill Killian; Barbara P Yawn; Steven J Jacobsen
Journal: JAMA Date: 2004-07-21 Impact factor: 56.272

7. Relation of heart failure hospitalization to exposure to fine particulate air pollution.

Authors: C Arden Pope; Dale G Renlund; Abdallah G Kfoury; Heidi T May; Benjamin D Horne
Journal: Am J Cardiol Date: 2008-08-27 Impact factor: 2.778

8. Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data.

Authors: Joshua C Denny; Lisa Bastarache; Marylyn D Ritchie; Robert J Carroll; Raquel Zink; Jonathan D Mosley; Julie R Field; Jill M Pulley; Andrea H Ramirez; Erica Bowton; Melissa A Basford; David S Carrell; Peggy L Peissig; Abel N Kho; Jennifer A Pacheco; Luke V Rasmussen; David R Crosslin; Paul K Crane; Jyotishman Pathak; Suzette J Bielinski; Sarah A Pendergrass; Hua Xu; Lucia A Hindorff; Rongling Li; Teri A Manolio; Christopher G Chute; Rex L Chisholm; Eric B Larson; Gail P Jarvik; Murray H Brilliant; Catherine A McCarty; Iftikhar J Kullo; Jonathan L Haines; Dana C Crawford; Daniel R Masys; Dan M Roden
Journal: Nat Biotechnol Date: 2013-12 Impact factor: 54.908

9. Generating survival times to simulate Cox proportional hazards models with time-varying covariates.

Authors: Peter C Austin
Journal: Stat Med Date: 2012-07-04 Impact factor: 2.373

10. Review and evaluation of penalised regression methods for risk prediction in low-dimensional data with few events. (Review)

Authors: Menelaos Pavlou; Gareth Ambler; Shaun Seaman; Maria De Iorio; Rumana Z Omar
Journal: Stat Med Date: 2015-10-29 Impact factor: 2.373

7 in total

1. Online Updating of Survival Analysis.

2. Scalable Algorithms for Large Competing Risks Data.

3. ODACH: a one-shot distributed algorithm for Cox model with heterogeneous multi-center data.

4. Distributed Simultaneous Inference in Generalized Linear Models via Confidence Distribution.

5. Sampling-based estimation for massive survival data with additive hazards model.

6. Accurate training of the Cox proportional hazards model on vertically-partitioned data while preserving privacy.

7. Default risk prediction and feature extraction using a penalized deep neural network.