| Literature DB >> 30899012 |
Wiktor Olszowy1,2, John Aston3, Catarina Rua4, Guy B Williams4.
Abstract
Given the recent controversies in some neuroimaging statistical methods, we compare the most frequently used functional Magnetic Resonance Imaging (fMRI) analysis packages: AFNI, FSL and SPM, with regard to temporal autocorrelation modeling. This process, sometimes known as pre-whitening, is conducted in virtually all task fMRI studies. Here, we employ eleven datasets containing 980 scans corresponding to different fMRI protocols and subject populations. We found that autocorrelation modeling in AFNI, although imperfect, performed much better than the autocorrelation modeling of FSL and SPM. The presence of residual autocorrelated noise in FSL and SPM leads to heavily confounded first level results, particularly for low-frequency experimental designs. SPM's alternative pre-whitening method, FAST, performed better than SPM's default. The reliability of task fMRI studies could be improved with more accurate autocorrelation modeling. We recommend that fMRI analysis packages provide diagnostic plots to make users aware of any pre-whitening problems.Entities:
Mesh:
Year: 2019 PMID: 30899012 PMCID: PMC6428826 DOI: 10.1038/s41467-019-09230-w
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Power spectra of the GLM residuals in native space averaged across brain voxels and across subjects for the assumed boxcar design of 10 s of rest followed by 10 s of stimulus presentation (boxcar10). The dips at 0.05 Hz are due to the assumed design period being 20 s (10 s + 10 s). For some datasets, the dip is not seen as the assumed design frequency was not covered by any of the sampled frequencies. The frequencies on the x-axis go up to the Nyquist frequency, which is 0.5/repetition time. If after pre-whitening the residuals were white (as it is assumed), the power spectra would be flat. AFNI and SPM’s alternative method: FAST, led to best whitening performance (most flat spectra). For FSL and SPM, there was substantial autocorrelated noise left after pre-whitening, particularly at low frequencies
Fig. 2Spatial distribution of significant clusters in AFNI (left), FSL (middle), and SPM (right) for different assumed experimental designs. Scale refers to the percentage of subjects where significant activation was detected at the given voxel. The red boxes indicate the true designs (for task data). Resting state data were used as null data. Thus, low numbers of significant voxels were a desirable outcome, as it was suggesting high specificity. Task data with assumed wrong designs were used as null data too. Thus, large positive differences between the true design and the wrong designs were a desirable outcome. The clearest cut between the true and the wrong/dummy designs was obtained with AFNI’s noise model. FAST performed similarly to AFNI’s noise model (not shown)
Fig. 3Group results for four task datasets with assumed true designs. Summary statistic analyses and mixed effects analyses led to only negligibly different percentages of significant voxels
Overview of the employed datasets
| Study | Experiment | Place | Design | No. subjects | Field [T] | TR [s] | Voxel size [mm] | No. voxels | Time points |
|---|---|---|---|---|---|---|---|---|---|
| FCP | Resting state | Beijing | N/A | 198 | 3 | 2 | 3.1 × 3.1 × 3.6 | 64 × 64 × 33 | 225 |
| Resting state | Cambridge, US | N/A | 198 | 3 | 3 | 3 × 3 × 3 | 72 × 72 × 47 | 119 | |
| NKI | Resting state | Orangeburg, US | N/A | 30 | 3 | 1.4 | 2 × 2 × 2 | 112 × 112 × 64 | 404 |
| Resting state | Orangeburg, US | N/A | 30 | 3 | 0.645 | 3 × 3 × 3 | 74 × 74 × 40 | 900 | |
| CRIC | Resting state | Cambridge, UK | N/A | 73 | 3 | 2 | 3 × 3 × 3.8 | 64 × 64 × 32 | 300 |
| neuRosim | Resting state | (Simulated) | N/A | 100 | NA | 2 | 3.1 × 3.1 × 3.6 | 64 × 64 × 33 | 225 |
| NKI | Checkerboard | Orangeburg, US | 20 s off + 20 s on | 30 | 3 | 1.4 | 2 × 2 × 2 | 112 × 112 × 64 | 98 |
| Checkerboard | Orangeburg, US | 20 s off + 20 s on | 30 | 3 | 0.645 | 3 × 3 × 3 | 74 × 74 × 40 | 240 | |
| BMMR | Checkerboard | Magdeburg | 12 s off + 12 s on | 21 | 7 | 3 | 1 × 1 × 1 | 182 × 140 × 45 | 80 |
| CRIC | Checkerboard | Cambridge, UK | 16 s off + 16 s on | 70 | 3 | 2 | 3 × 3 × 3.8 | 64 × 64 × 32 | 160 |
| CamCAN | Sensorimotor | Cambridge, UK | Event-related | 200 | 3 | 1.97 | 3 × 3 × 4.44 | 64 × 64 × 32 | 261 |
For the enhanced NKI data, only scans from release 3 were used. Out of the 46 subjects in release 3, scans of 30 subjects were taken. For the rest, at least 1 scan was missing. For the BMMR data, there were 7 subjects at 3 sessions, resulting in 21 scans. For the CamCAN data, 200 subjects were considered only
FCP Functional Connectomes Project, NKI Nathan Kline Institute, BMMR Biomedical Magnetic Resonance, CRIC Cambridge Research into Impaired Consciousness, CamCAN Cambridge Centre for Ageing and Neuroscience
Fig. 4The employed analyses pipelines. For SPM, we investigated both the default noise model and the alternative noise model: FAST. The noise models used by AFNI, FSL, and SPM were the only relevant difference (marked in a red box)