Literature DB >> 25220822

A highly efficient design strategy for regression with outcome pooling.

Emily M Mitchell1, Robert H Lyles, Amita K Manatunga, Neil J Perkins, Enrique F Schisterman.   

Abstract

The potential for research involving biospecimens can be hindered by the prohibitive cost of performing laboratory assays on individual samples. To mitigate this cost, strategies such as randomly selecting a portion of specimens for analysis or randomly pooling specimens prior to performing laboratory assays may be employed. These techniques, while effective in reducing cost, are often accompanied by a considerable loss of statistical efficiency. We propose a novel pooling strategy based on the k-means clustering algorithm to reduce laboratory costs while maintaining a high level of statistical efficiency when predictor variables are measured on all subjects, but the outcome of interest is assessed in pools. We perform simulations motivated by the BioCycle study to compare this k-means pooling strategy with current pooling and selection techniques under simple and multiple linear regression models. While all of the methods considered produce unbiased estimates and confidence intervals with appropriate coverage, pooling under k-means clustering provides the most precise estimates, closely approximating results from the full data and losing minimal precision as the total number of pools decreases. The benefits of k-means clustering evident in the simulation study are then applied to an analysis of the BioCycle dataset. In conclusion, when the number of lab tests is limited by budget, pooling specimens based on k-means clustering prior to performing lab assays can be an effective way to save money with minimal information loss in a regression setting.
Copyright © 2014 John Wiley & Sons, Ltd.

Entities:  

Keywords:  -means clustering; biomarkers; design; pooling; regression analysis

Mesh:

Year:  2014        PMID: 25220822      PMCID: PMC4225004          DOI: 10.1002/sim.6305

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  14 in total

1.  Logistic regression analysis of biomarker data subject to pooling and dichotomization.

Authors:  Z Zhang; A Liu; R H Lyles; B Mukherjee
Journal:  Stat Med       Date:  2011-09-23       Impact factor: 2.373

2.  Pooling biospecimens and limits of detection: effects on ROC curve analysis.

Authors:  Sunni L Mumford; Enrique F Schisterman; Albert Vexler; Aiyi Liu
Journal:  Biostatistics       Date:  2006-03-10       Impact factor: 5.899

3.  Analysis of multistage pooling studies of biological specimens for estimating disease incidence and prevalence.

Authors:  R Brookmeyer
Journal:  Biometrics       Date:  1999-06       Impact factor: 2.571

4.  Caffeinated beverage intake and reproductive hormones among premenopausal women in the BioCycle Study.

Authors:  Karen C Schliep; Enrique F Schisterman; Sunni L Mumford; Anna Z Pollack; Cuilin Zhang; Aijun Ye; Joseph B Stanford; Ahmad O Hammoud; Christina A Porucznik; Jean Wactawski-Wende
Journal:  Am J Clin Nutr       Date:  2012-01-11       Impact factor: 7.045

5.  Binary regression analysis with pooled exposure measurements: a regression calibration approach.

Authors:  Zhiwei Zhang; Paul S Albert
Journal:  Biometrics       Date:  2010-07-21       Impact factor: 2.571

6.  Regression for skewed biomarker outcomes subject to pooling.

Authors:  Emily M Mitchell; Robert H Lyles; Amita K Manatunga; Michelle Danaher; Neil J Perkins; Enrique F Schisterman
Journal:  Biometrics       Date:  2014-02-12       Impact factor: 2.571

7.  Group testing regression models with fixed and random effects.

Authors:  Peng Chen; Joshua M Tebbs; Christopher R Bilder
Journal:  Biometrics       Date:  2009-12       Impact factor: 2.571

8.  To pool or not to pool, from whether to when: applications of pooling to biospecimens subject to a limit of detection.

Authors:  Enrique F Schisterman; Albert Vexler
Journal:  Paediatr Perinat Epidemiol       Date:  2008-09       Impact factor: 3.980

9.  Pooling of sera for human immunodeficiency virus (HIV) testing: an economical method for use in developing countries.

Authors:  J C Emmanuel; M T Bassett; H J Smith; J A Jacobs
Journal:  J Clin Pathol       Date:  1988-05       Impact factor: 3.411

10.  Hybrid pooled-unpooled design for cost-efficient measurement of biomarkers.

Authors:  Enrique F Schisterman; Albert Vexler; Sunni L Mumford; Neil J Perkins
Journal:  Stat Med       Date:  2010-02-28       Impact factor: 2.373

View more
  7 in total

1.  Positing, fitting, and selecting regression models for pooled biomarker data.

Authors:  Emily M Mitchell; Robert H Lyles; Enrique F Schisterman
Journal:  Stat Med       Date:  2015-04-06       Impact factor: 2.373

2.  Semiparametric regression models for a right-skewed outcome subject to pooling.

Authors:  Emily M Mitchell; Robert H Lyles; Amita K Manatunga; Enrique F Schisterman
Journal:  Am J Epidemiol       Date:  2015-03-03       Impact factor: 4.897

3.  Logistic regression with a continuous exposure measured in pools and subject to errors.

Authors:  Dane R Van Domelen; Emily M Mitchell; Neil J Perkins; Enrique F Schisterman; Amita K Manatunga; Yijian Huang; Robert H Lyles
Journal:  Stat Med       Date:  2018-07-18       Impact factor: 2.373

4.  Estimating relative risk of a log-transformed exposure measured in pools.

Authors:  Emily M Mitchell; Torie C Plowden; Enrique F Schisterman
Journal:  Stat Med       Date:  2016-08-16       Impact factor: 2.373

5.  An efficient design strategy for logistic regression using outcome- and covariate-dependent pooling of biospecimens prior to assay.

Authors:  Robert H Lyles; Emily M Mitchell; Clarice R Weinberg; David M Umbach; Enrique F Schisterman
Journal:  Biometrics       Date:  2016-03-09       Impact factor: 2.571

6.  Semiparametric isotonic regression modelling and estimation for group testing data.

Authors:  Ao Yuan; Jin Piao; Jing Ning; Jing Qin
Journal:  Can J Stat       Date:  2020-10-28       Impact factor: 0.875

7.  A Discriminant Function Approach to Adjust for Processing and Measurement Error When a Biomarker is Assayed in Pooled Samples.

Authors:  Robert H Lyles; Dane Van Domelen; Emily M Mitchell; Enrique F Schisterman
Journal:  Int J Environ Res Public Health       Date:  2015-11-18       Impact factor: 3.390

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.