Statistical significance and its critics: practicing damaging science, or damaging scientific practice?


Abstract

While the common procedure of statistical significance testing and its accompanying concept of p-values have long been surrounded by controversy, renewed concern has been triggered by the replication crisis in science. Many blame statistical significance tests themselves, and some regard them as sufficiently damaging to scientific practice as to warrant being abandoned. We take a contrary position, arguing that the central criticisms arise from misunderstanding and misusing the statistical tools, and that in fact the purported remedies themselves risk damaging science. We argue that banning the use of p-value thresholds in interpreting data does not diminish but rather exacerbates data-dredging and biasing selection effects. If an account cannot specify outcomes that will not be allowed to count as evidence for a claim (if all thresholds are abandoned), then there is no test of that claim. The contributions of this paper are: (1) to explain the rival statistical philosophies underlying the ongoing controversy; (2) to elucidate and reinterpret statistical significance tests, and explain how this reinterpretation ameliorates common misuses and misinterpretations; (3) to argue why recent recommendations to replace, abandon, or retire statistical significance undermine a central function of statistics in science: to test whether observed patterns in the data are genuine or due to background variability.


Keywords: Data-dredging; Error probabilities; Fisher; Neyman and Pearson; P-values; Statistical significance tests

Year: 2022 PMID: 35578622 PMCID: PMC9096069 DOI: 10.1007/s11229-022-03692-0

Source DB: PubMed Journal: Synthese ISSN: 0039-7857 Impact factor: 1.595

References

21 in total

1. Toward evidence-based medical statistics. 2: The Bayes factor.

Authors: S N Goodman
Journal: Ann Intern Med Date: 1999-06-15 Impact factor: 25.391

2. Two cheers for P-values?

Authors: S Senn
Journal: J Epidemiol Biostat Date: 2001

3. A comment on replication, p-values and evidence, S.N.Goodman, Statistics in Medicine 1992; 11:875-879.

Authors: Stephen Senn
Journal: Stat Med Date: 2002-08-30 Impact factor: 2.373

4. False-positive psychology: undisclosed flexibility in data collection and analysis allows presenting anything as significant.

Authors: Joseph P Simmons; Leif D Nelson; Uri Simonsohn
Journal: Psychol Sci Date: 2011-10-17

5. Revised standards for statistical evidence.

6. Absence of evidence is not evidence of absence.

7. Tests of Statistical Significance Made Sound.

8. P values are only an index to evidence: 20th- vs. 21st-century statistical science.

9. COMPare: a prospective cohort study correcting and monitoring 58 misreported trials in real time.

10. The statistics wars and intellectual conflicts of interest.