Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Permutation tests are robust and powerful at 0.5% and 5% significance levels.

Literature DB >> 34050436

Permutation tests are robust and powerful at 0.5% and 5% significance levels.

Kimihiro Noguchi¹, Frank Konietschke^2,3, Fernando Marmolejo-Ramos⁴, Markus Pauly⁵.

Abstract

Recent replication crisis has led to a number of ad hoc suggestions to decrease the chance of making false positive findings. Among them, Johnson (Proceedings of the National Academy of Sciences, 110, 19313-19317, 2013) and Benjamin et al. (Nature Human Behaviour, 2, 6-10 2018) recommend using the significance level of α = 0.005 (0.5%) as opposed to the conventional 0.05 (5%) level. Even though their suggestion is easy to implement, it is unclear whether or not the commonly used statistical tests are robust and/or powerful at such a small significance level. Therefore, the main aim of our study is to investigate the robustness and power curve behaviors of independent (unpaired) two-sample tests for metric and ordinal data at nominal significance levels of α = 0.005 and α = 0.05. Through an extensive simulation study, it is found that the permutation versions of the Welch t-test and the Brunner-Munzel test are particularly robust and powerful while the commonly used two-sample tests which utilize t-distribution tend to be either liberal or conservative, and have peculiar power curve behaviors under skewed distributions with variance heterogeneity.

Entities: Chemical

Keywords: Nonparametric tests; Permutation tests; Replication crisis; Reproducibility issue; Robust statistics; Statistical evidence; Statistical significance

Mesh：

Year: 2021 PMID： 34050436 DOI： 10.3758/s13428-021-01595-5

Source DB: PubMed Journal: Behav Res Methods ISSN： 1554-351X

20 in total

Review 1. On effect size.

Authors: Ken Kelley; Kristopher J Preacher
Journal: Psychol Methods Date: 2012-04-30

2. The Wilcoxon-Mann-Whitney test under scrutiny.

Authors: Morten W Fagerland; Leiv Sandvik
Journal: Stat Med Date: 2009-05-01 Impact factor: 2.373

3. Moving beyond P values: data analysis with estimation graphics.

Authors: Joses Ho; Tayfun Tumkaya; Sameer Aryal; Hyungwon Choi; Adam Claridge-Chang
Journal: Nat Methods Date: 2019-07 Impact factor: 28.547

4. Redefine statistical significance.

Authors: Daniel J Benjamin; James O Berger; Magnus Johannesson; Brian A Nosek; E-J Wagenmakers; Richard Berk; Kenneth A Bollen; Björn Brembs; Lawrence Brown; Colin Camerer; David Cesarini; Christopher D Chambers; Merlise Clyde; Thomas D Cook; Paul De Boeck; Zoltan Dienes; Anna Dreber; Kenny Easwaran; Charles Efferson; Ernst Fehr; Fiona Fidler; Andy P Field; Malcolm Forster; Edward I George; Richard Gonzalez; Steven Goodman; Edwin Green; Donald P Green; Anthony G Greenwald; Jarrod D Hadfield; Larry V Hedges; Leonhard Held; Teck Hua Ho; Herbert Hoijtink; Daniel J Hruschka; Kosuke Imai; Guido Imbens; John P A Ioannidis; Minjeong Jeon; James Holland Jones; Michael Kirchler; David Laibson; John List; Roderick Little; Arthur Lupia; Edouard Machery; Scott E Maxwell; Michael McCarthy; Don A Moore; Stephen L Morgan; Marcus Munafó; Shinichi Nakagawa; Brendan Nyhan; Timothy H Parker; Luis Pericchi; Marco Perugini; Jeff Rouder; Judith Rousseau; Victoria Savalei; Felix D Schönbrodt; Thomas Sellke; Betsy Sinclair; Dustin Tingley; Trisha Van Zandt; Simine Vazire; Duncan J Watts; Christopher Winship; Robert L Wolpert; Yu Xie; Cristobal Young; Jonathan Zinman; Valen E Johnson
Journal: Nat Hum Behav Date: 2018-01

Review 10. Calculating and reporting effect sizes to facilitate cumulative science: a practical primer for t-tests and ANOVAs.

Authors: Daniël Lakens
Journal: Front Psychol Date: 2013-11-26

1 in total

1. Advice on comparing two independent samples of circular data in biology.

Authors: Lukas Landler; Graeme D Ruxton; E Pascal Malkemper
Journal: Sci Rep Date: 2021-10-13 Impact factor: 4.379

1 in total

Permutation tests are robust and powerful at 0.5% and 5% significance levels.

Review 1. On effect size.

2. The Wilcoxon-Mann-Whitney test under scrutiny.

3. Moving beyond P values: data analysis with estimation graphics.

4. Redefine statistical significance.

5. 1,500 scientists lift the lid on reproducibility.

6. Four simple ways to increase power without increasing the sample size.

7. Sample size in four areas of psychological research.

8. Policy: NIH plans to enhance reproducibility.

9. Why most published research findings are false.

Review 10. Calculating and reporting effect sizes to facilitate cumulative science: a practical primer for t-tests and ANOVAs.

1. Advice on comparing two independent samples of circular data in biology.