| Literature DB >> 30271944 |
Benjamin O Turner1, Erick J Paul2, Michael B Miller3, Aron K Barbey4,5,6,7,8,9.
Abstract
Despite a growing body of research suggesting that task-based functional magnetic resonance imaging (fMRI) studies often suffer from a lack of statistical power due to too-small samples, the proliferation of such underpowered studies continues unabated. Using large independent samples across eleven tasks, we demonstrate the impact of sample size on replicability, assessed at different levels of analysis relevant to fMRI researchers. We find that the degree of replicability for typical sample sizes is modest and that sample sizes much larger than typical (e.g., N = 100) produce results that fall well short of perfectly replicable. Thus, our results join the existing line of work advocating for larger sample sizes. Moreover, because we test sample sizes over a fairly large range and use intuitive metrics of replicability, our hope is that our results are more understandable and convincing to researchers who may have found previous results advocating for larger samples inaccessible.Entities:
Year: 2018 PMID: 30271944 PMCID: PMC6123695 DOI: 10.1038/s42003-018-0073-z
Source DB: PubMed Journal: Commun Biol ISSN: 2399-3642
Fig. 1Unthresholded voxel-level results. Replicability results for voxel-level (unthresholded) analyses, measured as the Pearson correlation between paired group maps. Average observed (±1 mean within-task standard deviation) shown in black (dark gray); average null (±1 standard deviation) shown in dashed medium gray (light gray). Also shown are individual task curves for each task (colors given in legend; refer to Table 1 for task abbreviations)
Dataset details
| Dataset | Task abbreviation | Full task name |
|---|---|---|
| UIUC | 3-back | 3-back |
| ObLoc | Object Location | |
| PPA | People Pieces Analogy | |
| SST | Set-Shifting Task | |
| HCP | Emot | Emotion Processing |
| Gamb | Incentive Processing | |
| Lang | Language Processing | |
| Motor | Motor | |
| Rel | Relational Processing | |
| Social | Social Cognition (ToM) | |
| WM | Working Memory |
Names and abbreviations of all tasks associated with each dataset
Fig. 2Conservatively thresholded voxel-level results. Replicability results for voxel-level (thresholded conservatively) analyses, measured as the Jaccard overlap between paired group maps—that is the ratio of the number of voxels in the intersection of the two thresholded maps to the number of voxels in the union of the two. Average observed (±1 mean within-task standard deviation) shown in black (dark gray); average null (±1 standard deviation) shown in dashed medium gray (light gray). Also shown are individual task curves for each task (colors given in legend; refer to Table 1 for task abbreviations). See also Supplementary Fig. 1
Fig. 3Conservatively thresholded cluster-level results. Replicability results for cluster-level (thresholded conservatively) analyses, measured as the Jaccard overlap between paired group maps—that is the ratio of the number of voxels in the intersection of the two thresholded maps to the number of voxels in the union of the two. Average observed (±1 mean within-task standard deviation) shown in black (dark gray); average null (±1 standard deviation) shown in dashed medium gray (light gray). Also shown are individual task curves for each task (colors given in legend; refer to Table 1 for task abbreviations). See also Supplementary Fig. 2
Fig. 4Conservatively thresholded peak-level results. Replicability results for suprathreshold peak-level (thresholded conservatively) analyses, measured as the peak hit rate—that is, the proportion of cluster peaks in one map in each pair that are suprathreshold in the other map. Average observed (±1 mean within-task standard deviation) shown in black (dark gray); average null (±1 standard deviation) shown in dashed medium gray (light gray). Also shown are individual task curves for each task (colors given in legend; refer to Table 1 for task abbreviations). See also Supplementary Fig. 3
Task thresholds
| Task(s) | Sample size | Liberal threshold | Conservative threshold |
|---|---|---|---|
| 3-back | 214 | ±3.50 | ±5.25 |
| ObLoc | 200 | ±3.39 | ±5.08 |
| PPA, SST | 279 | ±4.00 | ±6.00 |
| All HCP tasks | 463 | ±5.15 | ±7.73 |
Specific z thresholds used in cluster-based thresholding of full-sample analyses for use in proportion-based thresholding analysis