Bag of little bootstraps for massive and distributed longitudinal data.

Xinkai Zhou, Jin J Zhou, Hua Zhou.

Abstract

Linear mixed models are widely used for analyzing longitudinal datasets, and inference for their variance component parameters relies on the bootstrap method. However, health systems and technology companies routinely generate longitudinal datasets so massive that the traditional bootstrap becomes infeasible. To solve this problem, we extend the highly scalable bag of little bootstraps method from independent data to longitudinal data and develop a highly efficient Julia package, MixedModelsBLB.jl. Simulation experiments and real data analysis demonstrate the favorable statistical performance and computational advantages of our method compared to the traditional bootstrap method. For statistical inference on variance components, our method achieves a 200-fold speedup at the scale of 1 million subjects (20 million total observations), and it is the only currently available tool that can handle more than 10 million subjects (200 million total observations) using desktop computers.
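
To make the idea concrete, here is a minimal Python sketch of the bag of little bootstraps in the independent-data setting that the abstract says the method was extended from. It is not the authors' longitudinal algorithm and not the MixedModelsBLB.jl API: the function name `blb_standard_error`, its parameters, and the choice of the sample mean as the estimator are all assumptions made for illustration. Each small subset of size about n^gamma is resampled up to the full sample size n through multinomial counts, so every resample touches only a handful of distinct data points.

```python
import math
import random
from collections import Counter


def blb_standard_error(data, n_subsets=5, n_resamples=50, gamma=0.6, seed=0):
    """Estimate the standard error of the sample mean with a BLB-style scheme.

    Hypothetical helper for illustration only (independent data, mean estimator).
    """
    rng = random.Random(seed)
    values = list(data)
    n = len(values)
    b = max(2, int(n ** gamma))  # subset size, much smaller than n
    if n_subsets * b > n:
        raise ValueError("not enough data for the requested disjoint subsets")
    rng.shuffle(values)

    subset_ses = []
    for j in range(n_subsets):
        subset = values[j * b:(j + 1) * b]  # disjoint subset of size b
        estimates = []
        for _ in range(n_resamples):
            # Multinomial counts: draw n indices from the b subset points,
            # so the resample has nominal size n but only b distinct values.
            counts = Counter(rng.choices(range(b), k=n))
            weighted_mean = sum(subset[i] * c for i, c in counts.items()) / n
            estimates.append(weighted_mean)
        center = sum(estimates) / n_resamples
        variance = sum((e - center) ** 2 for e in estimates) / (n_resamples - 1)
        subset_ses.append(math.sqrt(variance))  # quality measure for this subset

    # Average the per-subset quality measures to get the final estimate.
    return sum(subset_ses) / n_subsets


if __name__ == "__main__":
    demo_rng = random.Random(1)
    sample = [demo_rng.gauss(0.0, 1.0) for _ in range(2000)]
    print(blb_standard_error(sample))
```

Because each subset is processed independently, the outer loop could be distributed across workers, which is the property that makes this family of methods attractive for massive data. In a longitudinal setting, the fitted estimator would be a linear mixed model rather than a mean; that part is not reproduced here.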

Keywords:  EMR; bags of little bootstraps; big data; linear mixed models; longitudinal data; parallel and distributed computing

Year:  2021        PMID: 35656342      PMCID: PMC9159544          DOI: 10.1002/sam.11563

Source DB:  PubMed          Journal:  Stat Anal Data Min        ISSN: 1932-1864            Impact factor:   1.247


  4 in total

1.  Patterns of performance degradation and restoration during sleep restriction and subsequent recovery: a sleep dose-response study.

Authors:  Gregory Belenky; Nancy J Wesensten; David R Thorne; Maria L Thomas; Helen C Sing; Daniel P Redmond; Michael B Russo; Thomas J Balkin
Journal:  J Sleep Res       Date:  2003-03       Impact factor: 3.981

2.  Effect of intensive treatment of hyperglycaemia on microvascular outcomes in type 2 diabetes: an analysis of the ACCORD randomised trial.

Authors:  Faramarz Ismail-Beigi; Timothy Craven; Mary Ann Banerji; Jan Basile; Jorge Calles; Robert M Cohen; Robert Cuddihy; William C Cushman; Saul Genuth; Richard H Grimm; Bruce P Hamilton; Byron Hoogwerf; Diane Karl; Lois Katz; Armand Krikorian; Patrick O'Connor; Rodica Pop-Busui; Ulrich Schubart; Debra Simmons; Harris Taylor; Abraham Thomas; Daniel Weiss; Irene Hramiak
Journal:  Lancet       Date:  2010-06-30       Impact factor: 79.321

3.  WiSER: Robust and scalable estimation and inference of within-subject variances from intensive longitudinal data.

Authors:  Christopher A German; Janet S Sinsheimer; Jin Zhou; Hua Zhou
Journal:  Biometrics       Date:  2021-06-18       Impact factor: 2.571

4.  Insulin Dose and Cardiovascular Mortality in the ACCORD Trial.

Authors:  Elias S Siraj; Daniel J Rubin; Matthew C Riddle; Michael E Miller; Fang-Chi Hsu; Faramarz Ismail-Beigi; Shyh-Huei Chen; Walter T Ambrosius; Abraham Thomas; William Bestermann; John B Buse; Saul Genuth; Carol Joyce; Christopher S Kovacs; Patrick J O'Connor; Ronald J Sigal; Sol Solomon
Journal:  Diabetes Care       Date:  2015-10-13       Impact factor: 19.112