Literature DB >> 33145780

Sampling-based estimation for massive survival data with additive hazards model.

Lulu Zuo1, Haixiang Zhang1, HaiYing Wang2, Lei Liu3.   

Abstract

For massive survival data, we propose a subsampling algorithm to efficiently approximate the estimates of regression parameters in the additive hazards model. We establish consistency and asymptotic normality of the subsample-based estimator given the full data. The optimal subsampling probabilities are obtained via minimizing asymptotic variance of the resulting estimator. The subsample-based procedure can largely reduce the computational cost compared with the full data method. In numerical simulations, our method has low bias and satisfactory coverage probabilities. We provide an illustrative example on the survival analysis of patients with lymphoma cancer from the Surveillance, Epidemiology, and End Results Program.
© 2020 John Wiley & Sons Ltd.

Entities:  

Keywords:  additive hazards model; big data; subsample-based estimator; subsampling probabilities; survival analysis

Mesh:

Year:  2020        PMID: 33145780      PMCID: PMC7775260          DOI: 10.1002/sim.8783

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  9 in total

1.  Regularized estimation in the accelerated failure time model with high-dimensional covariates.

Authors:  Jian Huang; Shuangge Ma; Huiliang Xie
Journal:  Biometrics       Date:  2006-09       Impact factor: 2.571

2.  An online updating approach for testing the proportional hazards assumption with streams of survival data.

Authors:  Yishu Xue; HaiYing Wang; Jun Yan; Elizabeth D Schifano
Journal:  Biometrics       Date:  2019-11-10       Impact factor: 2.571

3.  A PARTIALLY LINEAR FRAMEWORK FOR MASSIVE HETEROGENEOUS DATA.

Authors:  Tianqi Zhao; Guang Cheng; Han Liu
Journal:  Ann Stat       Date:  2016-07-07       Impact factor: 4.028

4.  A Massive Data Framework for M-Estimators with Cubic-Rate.

Authors:  Chengchun Shi; Wenbin Lu; Rui Song
Journal:  J Am Stat Assoc       Date:  2018-06-19       Impact factor: 5.033

5.  A linear regression model for the analysis of life times.

Authors:  O O Aalen
Journal:  Stat Med       Date:  1989-08       Impact factor: 2.373

6.  DISTRIBUTED TESTING AND ESTIMATION UNDER SPARSE HIGH DIMENSIONAL MODELS.

Authors:  Heather Battey; Jianqing Fan; Han Liu; Junwei Lu; Ziwei Zhu
Journal:  Ann Stat       Date:  2018-05-03       Impact factor: 4.028

7.  Optimal Subsampling for Large Sample Logistic Regression.

Authors:  HaiYing Wang; Rong Zhu; Ping Ma
Journal:  J Am Stat Assoc       Date:  2018-06-06       Impact factor: 5.033

8.  A fast divide-and-conquer sparse Cox regression.

Authors:  Yan Wang; Chuan Hong; Nathan Palmer; Qian Di; Joel Schwartz; Isaac Kohane; Tianxi Cai
Journal:  Biostatistics       Date:  2021-04-10       Impact factor: 5.899

9.  Disparities by race, age, and sex in the improvement of survival for lymphoma: Findings from a population-based study.

Authors:  Fahad Mukhtar; Paolo Boffetta; Bashir Dabo; Jong Y Park; Chi T D Tran; Thuan V Tran; Huong Thi-Thanh Tran; Madison Whitney; Harvey A Risch; Linh C Le; Wei Zheng; Xiao-Ou Shu; Hung N Luu
Journal:  PLoS One       Date:  2018-07-11       Impact factor: 3.240

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.