| Literature DB >> 34519385 |
Sarah M Weinstein1, Simon N Vandekar2, Azeez Adebimpe3,4, Tinashe M Tapera3,4, Timothy Robert-Fitzgerald1, Ruben C Gur4,5, Raquel E Gur4,5,6, Armin Raznahan7, Theodore D Satterthwaite1,3,4,8, Aaron F Alexander-Bloch5,6, Russell T Shinohara1,8.
Abstract
Many key findings in neuroimaging studies involve similarities between brain maps, but statistical methods used to measure these findings have varied. Current state-of-the-art methods involve comparing observed group-level brain maps (after averaging intensities at each image location across multiple subjects) against spatial null models of these group-level maps. However, these methods typically make strong and potentially unrealistic statistical assumptions, such as covariance stationarity. To address these issues, in this article we propose using subject-level data and a classical permutation testing framework to test and assess similarities between brain maps. Our method is comparable to traditional permutation tests in that it involves randomly permuting subjects to generate a null distribution of intermodal correspondence statistics, which we compare to an observed statistic to estimate a p-value. We apply and compare our method in simulated and real neuroimaging data from the Philadelphia Neurodevelopmental Cohort. We show that our method performs well for detecting relationships between modalities known to be strongly related (cortical thickness and sulcal depth), and it is conservative when an association would not be expected (cortical thickness and activation on the n-back working memory task). Notably, our method is the most flexible and reliable for localizing intermodal relationships within subregions of the brain and allows for generalizable statistical inference.Entities:
Keywords: covariance stationarity; hypothesis; intermodal correspondence; permutation testing; testing
Mesh:
Year: 2021 PMID: 34519385 PMCID: PMC8519855 DOI: 10.1002/hbm.25577
Source DB: PubMed Journal: Hum Brain Mapp ISSN: 1065-9471 Impact factor: 5.038
FIGURE 1Example left hemisphere sulcal depth, cortical thickness, and n‐back maps used for (a) the spin test and BrainSMASH and (b) the SPICE test. Brain map visualizations were generated using the R packages fsbrain (Schäfer, 2020b), freesurferformats (Schäfer, 2020a), and gifti (Muschelli, 2018). (c) Null hypotheses for previous methods and our proposed method for testing intermodal correspondence. Of the three methods, the SPICE test is the only one to incorporate subject‐level data
FIGURE 2Illustration of simple permutation‐based intermodal correspondence (SPICE) testing procedure. A null distribution for , the average within‐subject correspondence statistic, is calculated by randomly permuting the maps times and re‐estimating in each permuted version of the data. A p‐value is estimated as in (4)
FIGURE 3Illustration of bi‐modal image simulation. Subject‐level images are derived from average cortical thickness () and sulcal depth () data from the Philadelphia Neurodevelopmental Cohort. For illustrative purposes, the example above shows as mean sulcal depth, but we also consider a simulation setting using mean n‐back as the second population‐level map. Subject‐level images for subjects are simulated as and , where and the elements of and are normally distributed, with mean and variance parameters specified in Section 2.2.2. The null hypothesis is true when the variance of , is equal to 0. Otherwise, we expect to reject , with test power varying according to other parameter values
FIGURE 4Power and type I error of the simple permutation‐based intermodal correspondence (SPICE) test based on 5,000 simulations. Each point is the rate of rejecting (based on ) from 5,000 simulations of of the data (as shown in Figure 3) with unique combinations of parameters: sample size (), subject‐level variance (, ranging from 0.0 to 3.0 in increments of 0.15), and vertex‐level variance (, either 0.5, 1.5, 3.0, or 6.0). (a) Simulation setting 1: using mean cortical thickness and sulcal depth (from a subset of 789 participants in the Philadelphia Neurodevelopmental Cohort) as population‐level maps and , respectively (Corr(). (b) Simulation setting 2: using mean cortical thickness and n‐back as population‐level maps and , respectively (Corr()
Unadjusted p‐values from tests of intermodal correspondence in the left and right hemispheres using the SPICE, BrainSMASH, and spin methods
| Age | 8–9 | 10–11 | 12–13 | 14–15 | 16–17 | 18–19 | 20–21 | All subjects |
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| |
| (a) Sulcal depth vs. cortical thickness | ||||||||
| SPICE | ||||||||
| Left | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 |
| Right | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 |
| BrainSMASH | ||||||||
| Left | 0.007 | 0.001 | <0.001 | <0.001 | <0.001 | <0.001 | <0.001 | <0.001 |
| Right | 0.034 | 0.004 | 0.005 | 0.002 | <0.001 | 0.001 | 0.001 | 0.002 |
| Spin | ||||||||
| Left | <0.001 | <0.001 | <0.001 | <0.001 | <0.001 | <0.001 | <0.001 | <0.001 |
| Right | <0.001 | <0.001 | <0.001 | <0.001 | <0.001 | <0.001 | <0.001 | <0.001 |
| (b) | ||||||||
| SPICE | ||||||||
| Left | 0.899 | 0.842 | 0.281 | 0.958 | 0.796 | 0.345 | 0.829 | 0.838 |
| Right | 0.222 | 0.879 | 0.784 | 0.513 | 0.752 | 0.254 | 0.468 | 0.470 |
| BrainSMASH | ||||||||
| Left | 0.726 | 0.974 | 0.907 | 0.932 | 0.798 | 0.771 | 0.948 | 0.914 |
| Right | 0.344 | 0.464 | 0.795 | 0.786 | 0.936 | 0.878 | 0.868 | 0.831 |
| Spin | ||||||||
| Left | 0.987 | 0.851 | 0.760 | 0.894 | 0.575 | 0.572 | 0.835 | 0.730 |
| Right | 0.609 | 0.711 | 0.994 | 0.964 | 0.742 | 0.655 | 0.922 | 0.947 |
Note: Null hypotheses for each method are summarized in Figure 1. Additionally, Figure S2 shows the null distributions and observed test statistics used to estimate each p‐value above.
FIGURE 5Unadjusted p‐values from tests of intermodal correspondence within seven functional networks described by Yeo et al. (2011) in the left hemisphere for different age groups. We consider p < 0.007 to provide evidence against the null hypotheses (defined in Figure 1), after using a Bonferroni correction for comparisons across seven age groups