Literature DB >> 30739966

A Massive Data Framework for M-Estimators with Cubic-Rate.

Chengchun Shi1, Wenbin Lu1, Rui Song1.   

Abstract

The divide and conquer method is a common strategy for handling massive data. In this article, we study the divide and conquer method for cubic-rate estimators under the massive data framework. We develop a general theory for establishing the asymptotic distribution of the aggregated M-estimators using a weighted average with weights depending on the subgroup sample sizes. Under certain condition on the growing rate of the number of subgroups, the resulting aggregated estimators are shown to have faster convergence rate and asymptotic normal distribution, which are more tractable in both computation and inference than the original M-estimators based on pooled data. Our theory applies to a wide class of M-estimators with cube root convergence rate, including the location estimator, maximum score estimator and value search estimator. Empirical performance via simulations and a real data application also validate our theoretical findings.

Entities:  

Keywords:  Cubic rate asymptotics; M-estimators; divide and conquer; massive data

Year:  2018        PMID: 30739966      PMCID: PMC6364750          DOI: 10.1080/01621459.2017.1360779

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  2 in total

1.  A PARTIALLY LINEAR FRAMEWORK FOR MASSIVE HETEROGENEOUS DATA.

Authors:  Tianqi Zhao; Guang Cheng; Han Liu
Journal:  Ann Stat       Date:  2016-07-07       Impact factor: 4.028

2.  A robust method for estimating optimal treatment regimes.

Authors:  Baqun Zhang; Anastasios A Tsiatis; Eric B Laber; Marie Davidian
Journal:  Biometrics       Date:  2012-05-02       Impact factor: 2.571

  2 in total
  2 in total

1.  Sampling-based estimation for massive survival data with additive hazards model.

Authors:  Lulu Zuo; Haixiang Zhang; HaiYing Wang; Lei Liu
Journal:  Stat Med       Date:  2020-11-03       Impact factor: 2.373

2.  On restricted optimal treatment regime estimation for competing risks data.

Authors:  Jie Zhou; Jiajia Zhang; Wenbin Lu; Xiaoming Li
Journal:  Biostatistics       Date:  2021-04-10       Impact factor: 5.899

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.