Literature DB >> 26970249

Limitations of empirical calibration of p-values using observational data.

Susan Gruber1, Eric Tchetgen Tchetgen2.   

Abstract

Controversy over non-reproducible published research reporting a statistically significant result has produced substantial discussion in the literature. p-value calibration is a recently proposed procedure for adjusting p-values to account for both random and systematic errors that address one aspect of this problem. The method's validity rests on the key assumption that bias in an effect estimate is drawn from a normal distribution whose mean and variance can be correctly estimated. We investigated the method's control of type I and type II error rates using simulated and real-world data. Under mild violations of underlying assumptions, control of the type I error rate can be conservative, while under more extreme departures, it can be anti-conservative. The extent to which the assumption is violated in real-world data analyses is unknown. Barriers to testing the plausibility of the assumption using historical data are discussed. Our studies of the type II error rate using simulated and real-world electronic health care data demonstrated that calibrating p-values can substantially increase the type II error rate. The use of calibrated p-values may reduce the number of false-positive results, but there will be a commensurate drop in the ability to detect a true safety or efficacy signal. While p-value calibration can sometimes offer advantages in controlling the type I error rate, its adoption for routine use in studies of real-world health care datasets is premature. Separate characterizations of random and systematic errors provide a richer context for evaluating uncertainty surrounding effect estimates.
Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.

Entities:  

Keywords:  OMOP; p-value; p-value calibration; pharmacovigilance; safety

Mesh:

Year:  2016        PMID: 26970249      PMCID: PMC5012943          DOI: 10.1002/sim.6936

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  11 in total

1.  Using control genes to correct for unwanted variation in microarray data.

Authors:  Johann A Gagnon-Bartsch; Terence P Speed
Journal:  Biostatistics       Date:  2011-11-17       Impact factor: 5.899

2.  Normalization of RNA-seq data using factor analysis of control genes or samples.

Authors:  Davide Risso; John Ngai; Terence P Speed; Sandrine Dudoit
Journal:  Nat Biotechnol       Date:  2014-08-24       Impact factor: 54.908

3.  Sensitivity analysis for causal inference under unmeasured confounding and measurement error problems.

Authors:  Iván Díaz; Mark J van der Laan
Journal:  Int J Biostat       Date:  2013-11-19       Impact factor: 0.968

4.  Discussion: An estimate of the science-wise false discovery rate and application to the top medical literature.

Authors:  Steven N Goodman
Journal:  Biostatistics       Date:  2013-09-25       Impact factor: 5.899

5.  An estimate of the science-wise false discovery rate and application to the top medical literature.

Authors:  Leah R Jager; Jeffrey T Leek
Journal:  Biostatistics       Date:  2013-09-25       Impact factor: 5.899

Review 6.  An investigation of the false discovery rate and the misinterpretation of p-values.

Authors:  David Colquhoun
Journal:  R Soc Open Sci       Date:  2014-11-19       Impact factor: 2.963

7.  Interpreting observational studies: why empirical calibration is needed to correct p-values.

Authors:  Martijn J Schuemie; Patrick B Ryan; William DuMouchel; Marc A Suchard; David Madigan
Journal:  Stat Med       Date:  2013-07-30       Impact factor: 2.373

8.  Why most published research findings are false.

Authors:  John P A Ioannidis
Journal:  PLoS Med       Date:  2005-08-30       Impact factor: 11.613

9.  The control outcome calibration approach for causal inference with unobserved confounding.

Authors:  Eric Tchetgen Tchetgen
Journal:  Am J Epidemiol       Date:  2013-12-20       Impact factor: 4.897

10.  The extent and consequences of p-hacking in science.

Authors:  Megan L Head; Luke Holman; Rob Lanfear; Andrew T Kahn; Michael D Jennions
Journal:  PLoS Biol       Date:  2015-03-13       Impact factor: 8.029

View more
  7 in total

1.  Inflation of type I error rates due to differential misclassification in EHR-derived outcomes: Empirical illustration using breast cancer recurrence.

Authors:  Yong Chen; Jianqiao Wang; Jessica Chubak; Rebecca A Hubbard
Journal:  Pharmacoepidemiol Drug Saf       Date:  2018-10-30       Impact factor: 2.890

2.  Robust empirical calibration of p-values using observational data.

Authors:  Martijn J Schuemie; George Hripcsak; Patrick B Ryan; David Madigan; Marc A Suchard
Journal:  Stat Med       Date:  2016-09-30       Impact factor: 2.373

3.  The effectiveness of an oral opioid rescue medication algorithm for postoperative pain management compared to PCIA : A cohort analysis.

Authors:  J Erlenwein; M I Emons; F Petzke; M Quintel; I Staboulidou; M Przemeck
Journal:  Anaesthesist       Date:  2020-07-02       Impact factor: 1.041

4.  Risk of aortic aneurysm and dissection following exposure to fluoroquinolones, common antibiotics, and febrile illness using a self-controlled case series study design: Retrospective analyses of three large healthcare databases in the US.

Authors:  Ajit A Londhe; Chantal E Holy; James Weaver; Sergio Fonseca; Angelina Villasis; Daniel Fife
Journal:  PLoS One       Date:  2021-08-16       Impact factor: 3.240

5.  Comparative effectiveness over time of the mRNA-1273 (Moderna) vaccine and the BNT162b2 (Pfizer-BioNTech) vaccine.

Authors:  Nazmul Islam; Natalie E Sheils; Megan S Jarvis; Kenneth Cohen
Journal:  Nat Commun       Date:  2022-05-02       Impact factor: 17.694

6.  Assessing the effectiveness of empirical calibration under different bias scenarios.

Authors:  Hon Hwang; Juan C Quiroz; Blanca Gallego
Journal:  BMC Med Res Methodol       Date:  2022-07-27       Impact factor: 4.612

7.  Risk of retinal detachment and exposure to fluoroquinolones, common antibiotics, and febrile illness using a self-controlled case series study design: Retrospective analyses of three large healthcare databases in the US.

Authors:  Ajit A Londhe; Chantal E Holy; James Weaver; Sergio Fonseca; Angelina Villasis-Keever; Daniel Fife
Journal:  PLoS One       Date:  2022-10-06       Impact factor: 3.752

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.