| Literature DB >> 25120503 |
Abstract
No scientific conclusion follows automatically from a statistically non-significant result, yet people routinely use non-significant results to guide conclusions about the status of theories (or the effectiveness of practices). To know whether a non-significant result counts against a theory, or if it just indicates data insensitivity, researchers must use one of: power, intervals (such as confidence or credibility intervals), or else an indicator of the relative evidence for one theory over another, such as a Bayes factor. I argue Bayes factors allow theory to be linked to data in a way that overcomes the weaknesses of the other approaches. Specifically, Bayes factors use the data themselves to determine their sensitivity in distinguishing theories (unlike power), and they make use of those aspects of a theory's predictions that are often easiest to specify (unlike power and intervals, which require specifying the minimal interesting value in order to address theory). Bayes factors provide a coherent approach to determining whether non-significant results support a null hypothesis over a theory, or whether the data are just insensitive. They allow accepting and rejecting the null hypothesis to be put on an equal footing. Concrete examples are provided to indicate the range of application of a simple online Bayes calculator, which reveal both the strengths and weaknesses of Bayes factors.Entities:
Keywords: Bayes factor; confidence interval; highest density region; null hypothesis; power; significance testing; statistical inference
Year: 2014 PMID: 25120503 PMCID: PMC4114196 DOI: 10.3389/fpsyg.2014.00781
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Bayes factors corresponding to the p-values shown in Figure .
| 0.081 | 0.034* | 0.74 | 0.034* | 0.09 | 0.817 | 0.028* | 0.001* | 0.056 | 0.031* | 0.279 | 0.024* | 0.083 | |
| Null | |||||||||||||
| Neither | 2.96 | 0.52 | 2.70 | 0.46 | 1.73 | 2.96 | |||||||
| Alternative | 4.88 | 4.88 | 4.40 | 1024.6 | 3.33 | 4.88 | 4.28 | ||||||
| 0.002* | 0.167 | 0.172 | 0.387 | 0.614 | 0.476 | 0.006* | 0.028* | 0.002* | 0.024* | 0.144 | 0.23 | ||
| Null | |||||||||||||
| Neither | 2.16 | 2.12 | 1.01 | 0.65 | 0.75 | 2.36 | 1.73 | ||||||
| Alternative | 49.86 | 28.00 | 4.28 | 49.86 | 5.60 |
(Imaginary) means for the effectiveness of a hypnotic suggestion to reduce the Stroop effect.
| Incongruent | Neutral | Congruent | |
|---|---|---|---|
| No Suggestion | 850 | 750 | 720 |
| Suggestion | 785 | 745 | 715 |
Comparing intervals and Bayes for interpreting a non-significant result.
| What does it tell you? | What do you need to link data to theory? | Amount of data needed to obtain evidence for the null? | What would be a useful stopping rule to guarantee sensitivity? | |
| Intervals | How precisely a parameter has been estimated; a reflection of data rather than theory. | A minimal value below which the theory is refuted. | Enough to make sure the width of the interval is less than that of the null region; considerable participant numbers will typically be needed in contrast to Bayes factors. | Interval width no more than null region width and interval either completely in or completely out of the null region. |
| Bayes factors | The strength of evidence the data provide for one theory over another; specific to the two theories contrasted. | A rough expected value or maximum value consistent with theory. | Bayes factors ensure maximum efficiency in use of participants, given a Bayes factor measures strength of evidence. | Bayes factor either greater than three or less than a third. |