Literature DB >> 24101570

The frequentist implications of optional stopping on Bayesian hypothesis tests.

Adam N Sanborn1, Thomas T Hills.   

Abstract

Null hypothesis significance testing (NHST) is the most commonly used statistical methodology in psychology. The probability of achieving a value as extreme or more extreme than the statistic obtained from the data is evaluated, and if it is low enough, the null hypothesis is rejected. However, because common experimental practice often clashes with the assumptions underlying NHST, these calculated probabilities are often incorrect. Most commonly, experimenters use tests that assume that sample sizes are fixed in advance of data collection but then use the data to determine when to stop; in the limit, experimenters can use data monitoring to guarantee that the null hypothesis will be rejected. Bayesian hypothesis testing (BHT) provides a solution to these ills because the stopping rule used is irrelevant to the calculation of a Bayes factor. In addition, there are strong mathematical guarantees on the frequentist properties of BHT that are comforting for researchers concerned that stopping rules could influence the Bayes factors produced. Here, we show that these guaranteed bounds have limited scope and often do not apply in psychological research. Specifically, we quantitatively demonstrate the impact of optional stopping on the resulting Bayes factors in two common situations: (1) when the truth is a combination of the hypotheses, such as in a heterogeneous population, and (2) when a hypothesis is composite-taking multiple parameter values-such as the alternative hypothesis in a t-test. We found that, for these situations, while the Bayesian interpretation remains correct regardless of the stopping rule used, the choice of stopping rule can, in some situations, greatly increase the chance of experimenters finding evidence in the direction they desire. We suggest ways to control these frequentist implications of stopping rules on BHT.

Entities:  

Mesh:

Year:  2014        PMID: 24101570     DOI: 10.3758/s13423-013-0518-9

Source DB:  PubMed          Journal:  Psychon Bull Rev        ISSN: 1069-9384


  17 in total

1.  Measuring the prevalence of questionable research practices with incentives for truth telling.

Authors:  Leslie K John; George Loewenstein; Drazen Prelec
Journal:  Psychol Sci       Date:  2012-04-16

2.  Too good to be true: publication bias in two prominent studies from experimental psychology.

Authors:  Gregory Francis
Journal:  Psychon Bull Rev       Date:  2012-04

3.  Why psychologists must change the way they analyze their data: the case of psi: comment on Bem (2011).

Authors:  Eric-Jan Wagenmakers; Ruud Wetzels; Denny Borsboom; Han L J van der Maas
Journal:  J Pers Soc Psychol       Date:  2011-03

4.  Feeling the future: experimental evidence for anomalous retroactive influences on cognition and affect.

Authors:  Daryl J Bem
Journal:  J Pers Soc Psychol       Date:  2011-03

Review 5.  The importance of proving the null.

Authors:  C R Gallistel
Journal:  Psychol Rev       Date:  2009-04       Impact factor: 8.934

6.  Bayesian hypothesis testing for psychologists: a tutorial on the Savage-Dickey method.

Authors:  Eric-Jan Wagenmakers; Tom Lodewyckx; Himanshu Kuriyal; Raoul Grasman
Journal:  Cogn Psychol       Date:  2010-01-12       Impact factor: 3.468

7.  Bayes factor approaches for testing interval null hypotheses.

Authors:  Richard D Morey; Jeffrey N Rouder
Journal:  Psychol Methods       Date:  2011-07-25

8.  Source reliability and the conjunction fallacy.

Authors:  Andreas Jarvstad; Ulrike Hahn
Journal:  Cogn Sci       Date:  2011-03-07

9.  Inferences under time pressure: how opportunity costs affect strategy selection.

Authors:  Jörg Rieskamp; Ulrich Hoffrage
Journal:  Acta Psychol (Amst)       Date:  2007-07-20

10.  When decision heuristics and science collide.

Authors:  Erica C Yu; Amber M Sprenger; Rick P Thomas; Michael R Dougherty
Journal:  Psychon Bull Rev       Date:  2014-04
View more
  16 in total

1.  Bayesian data analysis for newcomers.

Authors:  John K Kruschke; Torrin M Liddell
Journal:  Psychon Bull Rev       Date:  2018-02

2.  Three Insights from a Bayesian Interpretation of the One-Sided P Value.

Authors:  Maarten Marsman; Eric-Jan Wagenmakers
Journal:  Educ Psychol Meas       Date:  2016-10-05       Impact factor: 2.821

3.  Thou Shalt Not Bear False Witness Against Null Hypothesis Significance Testing.

Authors:  Miguel A García-Pérez
Journal:  Educ Psychol Meas       Date:  2016-10-05       Impact factor: 2.821

4.  Moving Sport and Exercise Science Forward: A Call for the Adoption of More Transparent Research Practices.

Authors:  Aaron R Caldwell; Andrew D Vigotsky; Matthew S Tenan; Rémi Radel; David T Mellor; Andreas Kreutzer; Ian M Lahart; John P Mills; Matthieu P Boisgontier
Journal:  Sports Med       Date:  2020-03       Impact factor: 11.136

Review 5.  Why optional stopping can be a problem for Bayesians.

Authors:  Rianne de Heide; Peter D Grünwald
Journal:  Psychon Bull Rev       Date:  2021-06

6.  Reply to Rouder (2014): good frequentist properties raise confidence.

Authors:  Adam N Sanborn; Thomas T Hills; Michael R Dougherty; Rick P Thomas; Erica C Yu; Amber M Sprenger
Journal:  Psychon Bull Rev       Date:  2014-04

7.  Optional stopping: no problem for Bayesians.

Authors:  Jeffrey N Rouder
Journal:  Psychon Bull Rev       Date:  2014-04

8.  Is the call to abandon p-values the red herring of the replicability crisis?

Authors:  Victoria Savalei; Elizabeth Dunn
Journal:  Front Psychol       Date:  2015-03-06

9.  The pervasive avoidance of prospective statistical power: major consequences and practical solutions.

Authors:  Patrizio E Tressoldi; David Giofré
Journal:  Front Psychol       Date:  2015-05-28

10.  The Perils of Misspecified Priors and Optional Stopping in Multi-Armed Bandits.

Authors:  Markus Loecher
Journal:  Front Artif Intell       Date:  2021-07-09
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.