| Literature DB >> 22132184 |
Alex R Cook1, Shanice W L Teo.
Abstract
Simulation studies are often used to assess the frequency properties and optimality of statistical methods. They are typically reported in tables, which may contain hundreds of figures to be contrasted over multiple dimensions. To assess the degree to which these tables are fit for purpose, we performed a randomised cross-over experiment in which statisticians were asked to extract information from (i) such a table sourced from the literature and (ii) a graphical adaptation designed by the authors, and were timed and assessed for accuracy. We developed hierarchical models accounting for differences between individuals of different experience levels (under- and post-graduate), within experience levels, and between different table-graph pairs. In our experiment, information could be extracted quicker and, for less experienced participants, more accurately from graphical presentations than tabular displays. We also performed a literature review to assess the prevalence of hard-to-interpret design features in tables of simulation studies in three popular statistics journals, finding that many are presented innumerately. We recommend simulation studies be presented in graphical form.Entities:
Mesh:
Year: 2011 PMID: 22132184 PMCID: PMC3223202 DOI: 10.1371/journal.pone.0027974
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Details of the six tables taken from the literature used as stimuli in the cross-over experiment.
| Source table | Source ref. | Precision | Number of methods | Comparison dimensionality |
| 5.1 |
| 4 | 6 | 3 |
| 4 |
| 3 | 3 | 3 |
| 3 |
| 3 | 3 | 3 |
| 1 |
| 3 | 5 | 2 |
| 3 |
| 3 | 5 | 1 |
| 1 |
| 3 | 11 | 4 |
The other stimuli–graphical adaptations of the tables below, designed by the authors– appear in Supporting Information S1. The maximum precision in non-leading zero digits, the number of methods being compared, and the number of other dimensions (e.g. sample size, or effect size) along which comparisons could be made, are tabulated.
Figure 1Prevalence of undesirable design features in tabular displays in statistics journals.
All tables containing results of simulation studies in articles published in the Journal of the American Statistical Association (JASA), Annals of Statistics (AS) and Statistica Sinica (SS) during the year 2009 comprise the population reviewed. The proportion satisfying various criteria are marked: for the first three, the higher the proportion the better, while smaller tables with fewer significant figures (sig figs) are preferred. Overall proportions are indicated with a solid circle, within journal proportions by a hollow circle. For each criterion, if one journal did notably worse than the others, its proportion is labelled.
Figure 2Comparison of speed and accuracy at which information can be drawn from tables and figures in randomised cross-over experiment, by experience level.
Distributions in white panels account for parametric uncertainty and variability between individuals and table-graph pairs, and are estimated using standard kernel density estimation from MCMC samples. Distributions in grey panels are for an average individual, an average graph and account only for parameter uncertainty; posterior mean and 95% intervals are plotted. Left: timing in minutes per question. Right: probability of reporting a correct answer. Top: undergraduate statistics students, by medium. Middle: PhD candidates or faculty members, by medium. Bottom: difference between graph and table speed and accuracy of information extraction, by experience.