| Literature DB >> 33443615 |
Salem Alawbathani1,2, Mehreen Batool1,3, Jan Fleckhaus1,4, Sarkawt Hamad1,5,6, Floyd Hassenrück1,7,8, Yanhong Hou1,9,10, Xia Li1,11, Jon Salmanton-García1,7,8, Sami Ullah1,11, Frederique Wieters1,12, Martin C Michel13.
Abstract
A poor understanding of statistical analysis has been proposed as a key reason for lack of replicability of many studies in experimental biomedicine. While several authors have demonstrated the fickleness of calculated p values based on simulations, we have experienced that such simulations are difficult to understand for many biomedical scientists and often do not lead to a sound understanding of the role of variability between random samples in statistical analysis. Therefore, we as trainees and trainers in a course of statistics for biomedical scientists have used real data from a large published study to develop a tool that allows scientists to directly experience the fickleness of p values. A tool based on a commonly used software package was developed that allows using random samples from real data. The tool is described and together with the underlying database is made available. The tool has been tested successfully in multiple other groups of biomedical scientists. It can also let trainees experience the impact of randomness, sample sizes and choice of specific statistical test on measured p values. We propose that live exercises based on real data will be more impactful in the training of biomedical scientists on statistical concepts.Entities:
Keywords: P value; Replicability; Statistical analysis; Teaching
Mesh:
Year: 2021 PMID: 33443615 PMCID: PMC8208928 DOI: 10.1007/s00210-020-02045-3
Source DB: PubMed Journal: Naunyn Schmiedebergs Arch Pharmacol ISSN: 0028-1298 Impact factor: 3.000
Fig. 1Flowchart of steps in the fickleness exercise (for details see the “Methods” section)
Fig. 2Twenty course participants had picked twice 10 random numbers and entered corresponding measured patient values as groups A and B into a Prism data table. For each sample means of groups A and B, their difference and the associated p value from an unpaired, two-tailed t test were calculated. As expected based on regression to the mean in the presence of a true null hypothesis, the mean difference was close to 0 (0.365 [95% confidence interval − 0.832; 1.562]). Each data point shows the values obtained by one participant