Literature DB >> 35953660

Researcher degrees of freedom in statistical software contribute to unreliable results: A comparison of nonparametric analyses conducted in SPSS, SAS, Stata, and R.

Cooper B Hodges1,2,3,4, Bryant M Stone5, Paula K Johnson1,6, James H Carter7, Chelsea K Sawyers8, Patricia R Roby9, Hannah M Lindsey1,2.   

Abstract

Researcher degrees of freedom can affect the results of hypothesis tests and consequently, the conclusions drawn from the data. Previous research has documented variability in accuracy, speed, and documentation of output across various statistical software packages. In the current investigation, we conducted Pearson's chi-square test of independence, Spearman's rank-ordered correlation, Kruskal-Wallis one-way analysis of variance, Wilcoxon Mann-Whitney U rank-sum tests, and Wilcoxon signed-rank tests, along with estimates of skewness and kurtosis, on large, medium, and small samples of real and simulated data in SPSS, SAS, Stata, and R and compared the results with those obtained through hand calculation using the raw computational formulas. Multiple inconsistencies were found in the results produced between statistical packages due to algorithmic variation, computational error, and statistical output. The most notable inconsistencies were due to algorithmic variations in the computation of Pearson's chi-square test conducted on 2 × 2 tables, where differences in p-values reported by different software packages ranged from .005 to .162, largely as a function of sample size. We discuss how such inconsistencies may influence the conclusions drawn from the results of statistical analyses depending on the statistical software used, and we urge researchers to analyze their data across multiple packages to check for inconsistencies and report details regarding the statistical procedure used for data analysis.
© 2022. The Psychonomic Society, Inc.

Entities:  

Keywords:  Nonparametric procedures; Reproducibility; Researcher degrees of freedom; Statistical conclusion validity; Statistical software

Year:  2022        PMID: 35953660     DOI: 10.3758/s13428-022-01932-2

Source DB:  PubMed          Journal:  Behav Res Methods        ISSN: 1554-351X


  14 in total

1.  Chi-squared and Fisher-Irwin tests of two-by-two tables with small sample recommendations.

Authors:  Ian Campbell
Journal:  Stat Med       Date:  2007-08-30       Impact factor: 2.373

2.  Inconsistencies between reported test statistics and p-values in two psychiatry journals.

Authors:  David Berle; Vladan Starcevic
Journal:  Int J Methods Psychiatr Res       Date:  2007       Impact factor: 4.035

3.  Recommended tests for association in 2 x 2 tables.

Authors:  Stian Lydersen; Morten W Fagerland; Petter Laake
Journal:  Stat Med       Date:  2009-03-30       Impact factor: 2.373

4.  Note on the sampling error of the difference between correlated proportions or percentages.

Authors:  Q McNEMAR
Journal:  Psychometrika       Date:  1947-06       Impact factor: 2.500

5.  Consistency errors in p-values reported in Spanish psychology journals.

Authors:  José Manuel Caperos; Antonio Pardo
Journal:  Psicothema       Date:  2013

6.  Asymptotic versus exact methods in the analysis of contingency tables: Evidence-based practical recommendations.

Authors:  Miguel A García-Pérez; Vicente Núñez-Antón
Journal:  Stat Methods Med Res       Date:  2020-02-05       Impact factor: 3.021

7.  Misuse of statistical test in three decades of psychotherapy research.

Authors:  R Dar; R C Serlin; H Omer
Journal:  J Consult Clin Psychol       Date:  1994-02

8.  Are assumptions of well-known statistical techniques checked, and why (not)?

Authors:  Rink Hoekstra; Henk A L Kiers; Addie Johnson
Journal:  Front Psychol       Date:  2012-05-14

9.  A manifesto for reproducible science.

Authors:  Marcus R Munafò; Brian A Nosek; Dorothy V M Bishop; Katherine S Button; Christopher D Chambers; Nathalie Percie du Sert; Uri Simonsohn; Eric-Jan Wagenmakers; Jennifer J Ware; John P A Ioannidis
Journal:  Nat Hum Behav       Date:  2017-01-10

10.  Statistical conclusion validity: some common threats and simple remedies.

Authors:  Miguel A García-Pérez
Journal:  Front Psychol       Date:  2012-08-29
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.