| Literature DB >> 24381539 |
Kendrick N Kay1, Ariel Rokem2, Jonathan Winawer3, Robert F Dougherty4, Brian A Wandell2.
Abstract
In task-based functional magnetic resonance imaging (fMRI), researchers seek to measure fMRI signals related to a given task or condition. In many circumstances, measuring this signal of interest is limited by noise. In this study, we present GLMdenoise, a technique that improves signal-to-noise ratio (SNR) by entering noise regressors into a general linear model (GLM) analysis of fMRI data. The noise regressors are derived by conducting an initial model fit to determine voxels unrelated to the experimental paradigm, performing principal components analysis (PCA) on the time-series of these voxels, and using cross-validation to select the optimal number of principal components to use as noise regressors. Due to the use of data resampling, GLMdenoise requires and is best suited for datasets involving multiple runs (where conditions repeat across runs). We show that GLMdenoise consistently improves cross-validation accuracy of GLM estimates on a variety of event-related experimental datasets and is accompanied by substantial gains in SNR. To promote practical application of methods, we provide MATLAB code implementing GLMdenoise. Furthermore, to help compare GLMdenoise to other denoising methods, we present the Denoise Benchmark (DNB), a public database and architecture for evaluating denoising methods. The DNB consists of the datasets described in this paper, a code framework that enables automatic evaluation of a denoising method, and implementations of several denoising methods, including GLMdenoise, the use of motion parameters as noise regressors, ICA-based denoising, and RETROICOR/RVHRCOR. Using the DNB, we find that GLMdenoise performs best out of all of the denoising methods we tested.Entities:
Keywords: BOLD fMRI; ICA; RETROICOR; correlated noise; cross-validation; general linear model; physiological noise; signal-to-noise ratio
Year: 2013 PMID: 24381539 PMCID: PMC3865440 DOI: 10.3389/fnins.2013.00247
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 4.677
Summary of datasets.
| 1 | A | S1 | 69 | 3 | 1 | 5 | 3T1 | N8 | 2.5 | 1.323751 | 29.7 | 71 | 64 × 64 × 21 | 270 | 10 | No |
| 2 | A | S2 | 69 | 3 | 1 | 5 | 3T1 | N2 | 2.5 | 1.323751 | 29.7 | 71 | 64 × 64 × 21 | 270 | 10 | No |
| 3 | A | S3 | 69 | 3 | 1 | 5 | 3T1 | N2 | 2.5 | 1.323751 | 29.7 | 71 | 64 × 64 × 21 | 270 | 10 | No |
| 4 | B | S3 | 156 | 3 | 1 | 3 | CNI | N32 | 2.5 | 1.337702 | 28 | 68 | 64 × 64 × 22 | 300 | 12 | No |
| 5 | B | S2 | 156 | 3 | 1 | 3 | CNI | N32 | 2.5 | 1.337702 | 28 | 68 | 64 × 64 × 22 | 300 | 12 | No |
| 6 | B | S4 | 156 | 3 | 1 | 3 | CNI | N32 | 2.5 | 1.337702 | 28 | 68 | 64 × 64 × 22 | 300 | 12 | No |
| 7 | C | S5 | 104 | 3.5 | 1 | 7 | CNI | N32 | 2 | 1.985626 | 31 | 77 | 80 × 80 × 26 | 140 | 14 | No |
| 8 | C | S4 | 104 | 3.5 | 1 | 7 | CNI | N32 | 2 | 1.985626 | 31 | 77 | 80 × 80 × 26 | 140 | 14 | No |
| 9 | D | S2 | 69 | 3 | 1 | 5 | CNI | N32 | 1.8 | 1.605242 | 35 | 73 | 70 × 70 × 20 | 225 | 10 | No |
| 10 | E | S4 | 35 | 3 | 1 | 10 | CNI | N32 | 2.5 | 1.337702 | 28 | 68 | 64 × 64 × 22 | 270 | 10 | No |
| 11 | F | S3 | 9 | 3 | 2 | 15 | CNI | N32 | 2.5 | 1.337702 | 28 | 68 | 64 × 64 × 22 | 150 | 15 | No |
| 12 | G | S6 | 69 | 1 | 2 | 6 | 3T2 | N8 | 2.5 | 1.323751 | 28.7 | 77 | 64 × 64 × 20 | 252 | 12 | No |
| 13 | G | S7 | 69 | 1 | 2 | 6 | 3T2 | N8 | 2.5 | 1.323751 | 28.7 | 77 | 64 × 64 × 20 | 252 | 12 | No |
| 14 | H | S2 | 20 | 3 | 1 | 4 | CNI | N32 | 2.5 | 1.337702 | 28 | 68 | 64 × 64 × 22 | 162 | 4 | Yes |
| 15 | H | S3 | 20 | 3 | 1 | 4 | CNI | N32 | 2.5 | 1.337702 | 28 | 68 | 64 × 64 × 22 | 162 | 4 | Yes |
| 16 | I | S8 | 9 | 3 | 2 | 4 | CNI | N32 | 2.5 | 1.337702 | 28 | 68 | 64 × 64 × 22 | 150 | 4 | Yes |
| 17 | I | S9 | 9 | 3 | 2 | 4 | CNI | N32 | 2.5 | 1.337702 | 28 | 68 | 64 × 64 × 22 | 150 | 4 | Yes |
| 18 | J | S4 | 81 | 5 | 1 | 6 | CNI | N16 | 2 | 2.006553 | 31 | 77 | 80 × 80 × 26 | 156 | 12 | Yes |
| 19 | J | S10 | 81 | 5 | 1 | 6 | CNI | N16 | 2 | 2.006553 | 31 | 77 | 80 × 80 × 26 | 156 | 12 | Yes |
| 20 | K | S11 | 20 | 3 | 1 | 4 | CNI | N32 | 2 | 2.006553 | 31 | 77 | 80 × 80 × 26 | 108 | 4 | Yes |
| 21 | K | S12 | 20 | 3 | 1 | 4 | CNI | N32 | 2 | 2.006553 | 31 | 77 | 80 × 80 × 26 | 108 | 4 | Yes |
3T1, Lucas Center at Stanford University, 3T GE Signa HDX scanner; 3T2, Lucas Center at Stanford University, 3T GE Signa MR750 scanner; CNI, Stanford Center for Neurobiological Imaging, 3T GE Signa MR750 scanner; N2, Nova quadrature RF coil; N8, Nova 8-channel RF coil; N16, Nova 16-channel visual array RF coil; N32, Nova 32-channel RF coil.
Additional notes: All datasets used a T2*-weighted, single-shot, gradient-echo pulse sequence, either spiral-trajectory (3T1, 3T2) or echo-planar imaging (CNI). Experiment I was identical to experiment F except that physiological data were collected and fewer runs were collected. Experiment K was identical to experiment H except for differences in pulse sequence parameters (voxel size, TR, TE, flip angle).
Details of the stimuli used in the experiments.
| A | High-contrast black-and-white noise patterns; 10 frames/s for 3 s; conditions vary with respect to the visual field location of the patterns |
| B | Band-pass filtered grayscale images; 3 frames/s for 3 s; conditions vary with respect to visual dimensions such as location, contrast, and orientation |
| C | Arrays of grayscale faces and hands; 2 frames/s for 3.5 s; conditions vary with respect to whether faces or hands composed the arrays and with respect to the spatial layout of the arrays |
| D | Color textures composed of letters of different colors and sizes; 3 frames/s for 3 s; conditions vary with respect to the visual field location of the textures |
| E | Band-pass filtered grayscale objects; 3 frames/s for 3 s; each condition involves flashed presentation of one distinct object |
| F | High-contrast black-and-white noise patterns; 3 frames/s for 3 s; conditions vary with respect to the type and visual field location of the patterns |
| G | Achromatic white noise; 10 frames/s for 1 s; conditions vary with respect to the visual field location of the noise |
| H | Same as E |
| I | Same as F |
| J | Grayscale faces; 4 frames/s for 5 s; conditions vary with respect to the visual field location of the faces |
| K | Same as E |
Figure 1Schematic of GLMdenoise. (A) Inputs and outputs. GLMdenoise takes as input a design matrix (where each column indicates the onsets of a given condition) and fMRI time-series, and returns as output an estimate of the hemodynamic response function (HRF) and BOLD response amplitudes (beta weights). (B) Fitting procedure. The procedure consists of selecting voxels that are unrelated to the experiment (cross-validated R2 less than 0%), performing principal components analysis (PCA) on the time-series of these voxels to derive noise regressors, and using cross-validation to determine the number of regressors to enter into the model.
Figure 2Details of the GLMdenoise fitting procedure. (A) HRF fitting. A canonical HRF representing the response to a brief stimulus (black curve) is convolved with the appropriate square-wave function to predict the response for the condition duration used in a given experiment (red curve). This is the initial seed for the HRF. Iterative linear fitting is then used to estimate the optimal HRF (blue curve). Results are shown for dataset 1 (curves are normalized to peak at one). (B) HRF estimates. Shown are HRF estimates obtained in different datasets. Color scheme same as in (C). (C) Selecting the number of noise regressors. Voxels passing a minimum threshold are identified (voxels with cross-validated R2 greater than 0% under any number of noise regressors), and median cross-validated R2 values are calculated. The minimum number of regressors necessary to achieve within 5% of the maximum performance is selected. The performance curves are generally U-shaped, indicating that noise regressors help but too many noise regressors hurt performance.
Figure 3The Denoise Benchmark (DNB). We designed an architecture that enables automatic evaluation of a candidate denoising method. (A) Cross-validation accuracy. Leave-one-run-out cross-validation is used to quantify the accuracy of the denoising method. In each iteration of this procedure, the denoising method is trained on all runs except one and is asked to predict the task-related signal in the left-out run. Predictions are aggregated across the left-out runs, and the accuracy of the predictions is quantified using coefficient of determination (R2). (B) Signal-to-noise ratio (SNR). Variability of beta weight estimates across the cross-validation iterations is used to estimate SNR. (C) Candidate denoising methods. Any denoising method that conforms to the prescribed application programming interface (API) can be evaluated in the DNB architecture. Note that the cross-validation used in the DNB is distinct from any internal resampling scheme that might be used by a denoising method (such as the cross-validation used within GLMdenoise).
Figure 4GLMdenoise improves accuracy and reliability of BOLD response estimates. Using the DNB, we compared the accuracy and reliability of GLMdenoise to that of an analysis involving no noise regressors (termed Standard GLM). (A) Comparison of R2 for an example dataset. Each dot indicates cross-validated R2 values for an individual voxel. (B) Summary of changes in R2. Voxels are binned according to the cross-validated R2 of Standard GLM (bin size 10%). For each bin with at least five voxels, we compute the increase in R2 provided by GLMdenoise and plot a line indicating the 95% range of results. GLMdenoise provides more accurate BOLD response estimates for nearly all voxels. (C) Comparison of SNR for an example dataset. Format same as (A), except that only voxels passing a minimum threshold are shown (voxels with cross-validated R2 greater than 0% for either model). (D) Summary of changes in SNR. Format same as (B), except that voxels are binned according to SNR (bin size 1). For each bin, we compute the median increase in SNR for each dataset and then the median of these values across datasets. The results are shown as thick black lines (for bins with contributions from at least two datasets). On average, GLMdenoise provides more reliable BOLD response estimates than Standard GLM.
Figure 6GLMdenoise outperforms other denoising methods. Using the DNB, we quantified the cross-validation accuracy of a variety of denoising methods on a large number of datasets. (A) Results for individual datasets. For each dataset, we summarize the performance of a method by plotting the median cross-validated R2 value obtained under that method. Error bars indicate 68% confidence intervals and were obtained via bootstrapping. (B) Overall results. To summarize performance across datasets, we normalize the pattern of results from each dataset such that Standard GLM corresponds to 0 and the best-performing method corresponds to 1. We then compute the mean of this pattern across datasets (error bars indicate standard error of the mean). As an alternative performance summary, we count the number of datasets for which a given method achieves the best or nearly the best performance (specifically, the number of datasets for which the median performance of a method either is the best or provides at least 95% of the performance improvement provided by the best method). The number of datasets (out of 21 total datasets) is indicated in the legend.
Figure 5Example activation maps. As an intuitive way to visualize SNR improvements, we show maps of t-values obtained using Standard GLM and maps obtained using GLMdenoise. Maps have been thresholded at t > 3 and are overlaid on the mean functional volume. (A) Activation map for dataset 3, slice 11, condition 31. The green arrow indicates an activated region that exhibits substantial increases in t-values when using GLMdenoise. The blue arrow indicates a region that exhibits activation under GLMdenoise but not under Standard GLM. (B) Activation map for dataset 7, slice 11, condition 24. Format same as (A).