Optimal multiwave sampling for regression modeling in two-phase designs.

Tong Chen, Thomas Lumley.

Abstract

Two-phase designs measure additional variables on a subset of a cohort in which some variables have already been measured. The goal is to choose a subsample of individuals from the cohort and to analyse that subsample efficiently. It is therefore of interest to find an optimal design, one that gives the most efficient estimates of the regression parameters. In this article, we propose a multiwave sampling design that approximates the optimal design for design-based estimators. Influence functions are used to compute the optimal sampling allocations. We propose using informative priors on the regression parameters to derive the wave-1 sampling probabilities, because arbitrary prespecified sampling probabilities may be far from optimal and reduce design efficiency. The posterior distributions of the regression parameters from the current wave are then used as priors for the next wave. Generalized raking is used in the final statistical analysis. We show that two-wave sampling with reasonable informative priors yields highly efficient estimates of the parameter of interest and comes close to the underlying optimal design.
© 2020 John Wiley & Sons Ltd.
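
The abstract and keywords link influence functions to Neyman allocation. As a rough illustration only, and not the authors' implementation, the sketch below allocates a fixed phase-2 sample across strata in proportion to N_h * S_h, where N_h is the stratum size and S_h is the within-stratum standard deviation of the influence function for the parameter of interest. The function name, the example stratum sizes, and the standard deviations are all hypothetical; in a multiwave design the S_h values would have to be estimated from a prior or from an earlier wave.

```python
import math


def neyman_allocation(strata_sizes, strata_sds, n_total):
    """Split n_total across strata in proportion to N_h * S_h.

    strata_sizes: stratum population sizes N_h
    strata_sds:   within-stratum standard deviations S_h (assumed here to be
                  standard deviations of influence-function values)
    Note: this sketch does not cap an allocation at the stratum size.
    """
    products = [size * sd for size, sd in zip(strata_sizes, strata_sds)]
    total = sum(products)
    if total <= 0:
        raise ValueError("at least one stratum needs a positive N_h * S_h")

    # Ideal (fractional) allocation for each stratum.
    raw = [n_total * p / total for p in products]

    # Round down, then hand out the remaining units to the strata with the
    # largest fractional parts so the allocation sums to n_total.
    alloc = [math.floor(r) for r in raw]
    leftover = n_total - sum(alloc)
    by_remainder = sorted(
        range(len(raw)), key=lambda h: raw[h] - alloc[h], reverse=True
    )
    for h in by_remainder[:leftover]:
        alloc[h] += 1
    return alloc


# Hypothetical example: three strata, 200 phase-2 measurements in total.
example = neyman_allocation([1000, 500, 100], [1.0, 2.0, 5.0], 200)
print(example)
```

Small strata with highly variable influence functions receive a larger share than proportional sampling would give them, which is the usual motivation for this kind of allocation.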

Keywords:  Neyman allocation; design-based estimators; influence function; optimal design; prior

Year: 2020; PMID: 33016376; PMCID: PMC7902311; DOI: 10.1002/sim.8760

Source DB: PubMed; Journal: Stat Med; ISSN: 0277-6715; Impact factor: 2.373


  9 in total

1.  Using the whole cohort in the analysis of countermatched samples.

Authors:  C Rivera; T Lumley
Journal:  Biometrics       Date:  2015-09-22       Impact factor: 2.571

2.  Using the whole cohort in the analysis of case-cohort data.

Authors:  Norman E Breslow; Thomas Lumley; Christie M Ballantyne; Lloyd E Chambless; Michal Kulich
Journal:  Am J Epidemiol       Date:  2009-04-08       Impact factor: 4.897

3.  Connections between survey calibration estimators and semiparametric models for incomplete data.

Authors:  Thomas Lumley; Pamela A Shaw; James Y Dai
Journal:  Int Stat Rev       Date:  2011-08       Impact factor: 2.217

4.  Optimal sampling strategies for two-stage studies.

Authors:  M Reilly
Journal:  Am J Epidemiol       Date:  1996-01-01       Impact factor: 4.897

5.  Improved Horvitz-Thompson Estimation of Model Parameters from Two-phase Stratified Samples: Applications in Epidemiology.

Authors:  Norman E Breslow; Thomas Lumley; Christie M Ballantyne; Lloyd E Chambless; Michal Kulich
Journal:  Stat Biosci       Date:  2009-05-01

6.  Comparison between single-dose and divided-dose administration of dactinomycin and doxorubicin for patients with Wilms' tumor: a report from the National Wilms' Tumor Study Group.

Authors:  D M Green; N E Breslow; J B Beckwith; J Z Finklestein; P E Grundy; P R Thomas; T Kim; S J Shochat; G M Haase; M L Ritchey; P P Kelalis; G J D'Angio
Journal:  J Clin Oncol       Date:  1998-01       Impact factor: 44.544

7.  Treatment of Wilms' tumor. Results of the Third National Wilms' Tumor Study.

Authors:  G J D'Angio; N Breslow; J B Beckwith; A Evans; H Baum; A deLorimier; D Fernbach; E Hrabovsky; B Jones; P Kelalis
Journal:  Cancer       Date:  1989-07-15       Impact factor: 6.860

8.  Efficient Semiparametric Inference Under Two-Phase Sampling, With Applications to Genetic Association Studies.

Authors:  Ran Tao; Donglin Zeng; Dan-Yu Lin
Journal:  J Am Stat Assoc       Date:  2017-02-28       Impact factor: 5.033

9.  Adaptive sampling in two-phase designs: a biomarker study for progression in arthritis.

Authors:  Michael A McIsaac; Richard J Cook
Journal:  Stat Med       Date:  2015-05-07       Impact factor: 2.373

  4 in total

1.  Optimal sampling for design-based estimators of regression models.

Authors:  Tong Chen; Thomas Lumley
Journal:  Stat Med       Date:  2022-01-06       Impact factor: 2.373

2.  Two-Phase Sampling Designs for Data Validation in Settings with Covariate Measurement Error and Continuous Outcome.

Authors:  Gustavo Amorim; Ran Tao; Sarah Lotspeich; Pamela A Shaw; Thomas Lumley; Bryan E Shepherd
Journal:  J R Stat Soc Ser A Stat Soc       Date:  2021-04-15       Impact factor: 2.175

3.  Optimal allocation in stratified cluster-based outcome-dependent sampling designs.

Authors:  Sara Sauer; Bethany Hedt-Gauthier; Sebastien Haneuse
Journal:  Stat Med       Date:  2021-06-02       Impact factor: 2.497

4.  Efficient odds ratio estimation under two-phase sampling using error-prone data from a multi-national HIV research cohort.

Authors:  Sarah C Lotspeich; Bryan E Shepherd; Gustavo G C Amorim; Pamela A Shaw; Ran Tao
Journal:  Biometrics       Date:  2021-07-02       Impact factor: 2.571
