| Literature DB >> 33974213 |
Abstract
We are now in a time of readily available brain imaging data. Not only are researchers now sharing data more than ever before, but additionally large-scale data collecting initiatives are underway with the vision that many future researchers will use the data for secondary analyses. Here I provide an overview of available datasets and some example use cases. Example use cases include examining individual differences, more robust findings, reproducibility-both in public input data and availability as a replication sample, and methods development. I further discuss a variety of considerations associated with using existing data and the opportunities associated with large datasets. Suggestions for further readings on general neuroimaging and topic-specific discussions are also provided.Entities:
Keywords: Brain imaging; Connectome; Functional connectivity; Naturalistic neuroimaging; Sample size; Secondary data; fMRI
Mesh:
Year: 2021 PMID: 33974213 PMCID: PMC8111663 DOI: 10.1007/s12021-021-09519-6
Source DB: PubMed Journal: Neuroinformatics ISSN: 1539-2791
Fig. 1Trade-offs between number of participants and amount of data per participant. Note that some datasets have increased in size since the generation of this figure (e.g., IBC has more data per participant now); some datasets are not featured in the current review, e.g., VIM-1. Reprinted from Naselaris et al. (2021)
Overview of large open-access neuroimaging datasets
| Accessibility | |||||
|---|---|---|---|---|---|
| Short name | Full name | N | Comments | Score | Reference |
| HCP | Human Connectome Project (Young Adult) | 1200 | ages 22-35, multimodal, genetics, task fMRI, behaviour, 7 T, partial test-retest (2, N = 45) | 2,3,4 | Van Essen et al. ( |
| GSP | Brain Genomics Superstruct Project | 1570 | ages 18-35, resting state, behaviour, partial test-retest (2, N = 69) | 2,3 | Holmes et al. ( |
| AOMIC | Amsterdam Open MRI Collection | 1370 | ages 19-26, multimodal, task fMRI, behaviour | 1 | Snoek et al. ( |
| CoRR | Consortium for Reliability and Reproducibility | 1629 | resting state, test-retest (2+), multi-site (18), global | 1 | Zuo et al. ( |
| 7T-TRT | – | 22 | ages 21-30, multimodal, resting state, test-retest, behaviour, subset of CoRR (MPG1) | 1 | Gorgolewski et al. ( |
| CCBD | Center for Cognition and Brain Disorders | 30 | ages 20-30, resting state, test-retest (10), subset of CoRR (HNU1) | 1 | Chen et al. ( |
| – | Test-Retest Reliability of Brain Volume Measurements | 3 | ages 26-31, test-retest (20) | 1 | Maclaren et al. ( |
| SLIM | Southwest University Longitudinal Imaging Multimodal | 595 | ages 17-27, resting state, test-retest, China | 1 | Liu et al. ( |
| NARPS | Neuroimaging Analysis Replication and Prediction Study | 119 | ages 18-37, task fMRI | 1 | Botvinik-Nezer et al. ( |
| NKI-RS/eNKI | Enhanced Nathan Kline Institute – Rockland Sample | > 1000 | on-going (current: 1334), ages 6-85, community sample, multimodal, task fMRI, behaviour | 1,4 | Nooner et al. ( |
| IXI | Information eXtraction from Images | 581 | ages 20-86, multi-site (3), multimodal | 1 | Keihaninejad et al. ( |
| Kirby-21/MMRR | Multi-Modal MRI Reproducibility Resource | 21 | ages 22-61, multimodal, test-retest (2) | 1 | Landman et al. ( |
| DLBS | Dallas Life Brain Study | 315 | ages 20-89, multimodal, task fMRI, behaviour, genetics | 1,4 | Kennedy et al. ( |
| ICBM | International Consortium for Brain Mapping | 853 | ages 18-80, multi-site (3) | 3 | Mazziotta et al. ( |
| SALD | Southwest University Adult Lifespan Dataset | 494 | ages 19-80, resting state, China | 1 | Wei et al. ( |
| LEMON | MPI Leipzig Study for Mind-Body-Emotion Interactions | 228 | ages 20-35 and 59-77, multimodal, resting state, behaviour | 1 | Babayan et al. ( |
| N&C | MPI Leipzig Neuroanatomy & Connectivity | 321 | ages 20-75, multimodal, resting state, behaviour | 1 | Mendes et al. ( |
| CC-359 | Calgary-Campinas-359 | 359 | ages 29-80, multi-site (6) | 1 | Souza et al. ( |
| CHBMP | Cuban Human Brain Mapping Project | 203 | ages 18-68, multimodal, behaviour | 3 | Valdes-Sosa et al. ( |
| YaleLowres | Yale Low-Resolution Controls | 100 | ages 18-66, resting state | 1 | Scheinost et al. ( |
| YaleHires | Yale High-Resolution Controls | 120 | ages 18-58, resting state | 1 | Finn et al. ( |
| YaleTRT | Yale Test-Retest | 12 | ages 27-56, resting state, test-retest (4) | 1 | Noble et al. ( |
| MOUS | Mother Of Unification Studies | 204 | ages 18-33, multimodal, task fMRI | 2 | Schoffelen et al. ( |
| NNdb | Naturalistic Neuroimaging Database | 86 | ages 18-58, movie watching, behaviour | 1 | (Aliko et al. |
| Narratives | – | 345 | ages 18-53, story listening | 1 | Nastase et al. ( |
| HCP-A | Lifespan Human Connectome Project in Aging | > 1200 | on-going (current: 725), ages 36-100, multi-site (4), multimodal, task fMRI, behaviour | 4 | Bookheimer et al. ( |
| OMEGA | Open MEG Archive | 220 | ages 21-75, multimodal, task fMRI | 3 | Niso et al. ( |
| CamCAN | Cambridge Center for Ageing and Neuroscience | 700 | ages 18-87, community sample, multimodal, task fMRI, movie watching, genetics | 3 | Taylor et al. ( |
| BLSA | Baltimore Longitudinal Study of Aging | > 1000 | on-going (current: 889; long.), ages 50-90, community sample, longitudinal, task fMRI, behaviour, genetics | 4 | Ferrucci ( |
| BASE-II | Berlin Aging Study II | 2,200 | on-going (long.), ages 20-35 and 60-80, multimodal, behaviour, genetics | 4 | Bertram et al. ( |
| UKBB | UK Biobank Imaging study | 100,000 | on-going (current: 43k; long.), ages 35-80, community sample, longitudinal, multimodal, task fMRI, behaviour | 4 | Miller et al. ( |
| LBC1936 | Lothian Birth Cohort 1936 | 1091 | on-going (long.), ages 70-82, longitudinal, behaviour | 4 | Taylor et al. ( |
| Allen | Allen Human Brain Atlas | 8 | post-mortem, ages 24-57, multimodal, histology | 1 | Hawrylycz et al. ( |
| BigBrain | – | 1 | post-mortem, age 65, histology | 1 | Amunts et al. ( |
| – | Theory of Mind development | 155 | ages 3-12 and 18-39, movie watching, behaviour | 1 | Richardson et al. ( |
| dHCP | Developing Human Connectome Project | > 1000 | on-going (current: 538), ages 20-45 weeks, multimodal, resting state | 2 | Hughes et al. ( |
| ABCD | Adolescent Brain Cognitive Development study | > 10,000 | on-going (current: 11,878; long.), ages 9-10 until 20, longitudinal, multi-site (21), multimodal, task fMRI | 4 | Casey et al. ( |
| HCP-D | Lifespan Human Connectome Project in Development | > 1350 | on-going (current: 655), ages 5-21, multimodal, multi-site (4), task fMRI | 4 | Somerville et al. ( |
| PING | Pediatric Imaging, Neurocognition, and Genetics | 1493 | ages 3-20, multi-site (10), multimodal, resting state, genetics | 3 | Jernigan et al. ( |
| PNC | Philadelphia Neurodevelopmental Cohort | 1445 | ages 8-21, multimodal, task fMRI, genetics | 4 | Satterthwaite et al. ( |
| HBN | Healthy Brain Network | > 10,000 | on-going (current: 3625), ages 5-21, multimodal, task fMRI, movie watching, behaviour | 4 | Alexander et al. ( |
| PedsMRI | NIH Pediatric MRI Data Repository | 500 | ages birth-4, longitudinal, multimodal, multi-site (6), behaviour | 4 | Brain Development Cooperative Group and Evans ( |
| IBIS | Infant Brain Imaging Study | > 900 | on-going (current: 503; long.), ages 6-24 months, longitudinal, multimodal, multi-site (5) | 4 | Hazlett et al. ( |
| Dev-CoG | Developmental Chronnecto-Genomics | > 200 | on-going (no data release yet), ages 9-14, multi-site (2), longitudinal, multimodal, task fMRI, behaviour, genetics | 4 | Stephen et al. ( |
| MyConnectome | – | 1 | aged 45, repeated scans (84, spanning 18 months), multimodal, task fMRI | 1 | Poldrack et al. ( |
| MSC | Midnight Scan Club | 10 | repeated scans, multimodal, task fMRI, behaviour | 1 | Gordon et al. ( |
| Kirby Weekly | Single-subject Resting state fMRI Reproducibility Resource | 1 | aged 40, repeated scans (158, spanning 3.5 years), resting state | 1 | Choe et al. ( |
| SIMON | Single Individual volunteering for Multiple Observations across Networks | 1 | aged 29-46, repeated scans (73, spanning 15 years), mutli-site (36), multimodal, task fMRI | 1 | Duchesne et al. ( |
| IBC | Individual Brain Charting | 12 | on-going (long.), ages 26-41, repeated scans, task fMRI, movie watching | 1 | Pinho et al. ( |
| Studyforrest | – | 20 | multimodal, task fMRI, movie watching | 1 | Hanke et al. ( |
| BOLD5000 | Brain, Object, Landscape Dataset | 4 | ages 24-27, repeated scans, multimodal, task fMRI | 1 | Chang et al. ( |
| NSD | Natural Scenes Dataset | 8 | ages 19-32, repeated scans, multimodal, task fMRI | 3 | Naselaris et al. ( |
| Sherlock | – | 16 | movie watching, behaviour | 1 | Chen et al. ( |
| T1 250 | – | 1 | T1 scans at various resolutions (up to 250 | 1 | Lusebrink et al. ( |
| MASSIVE | Multiple Acquisitions for Standardization of Structural Imaging Validation and Evaluation | 1 | 8000 dMRI volumes with various parameters, along with T1 and T2 scans | 1 | Froeling et al. ( |
| Doctor Who | – | 1 | age 27, task fMRI, movie watching | Seeliger et al. ( | |
| C-NeuroMod | Courtois Project on Neuronal Modelling | 6 | on-going (repeated scans), repeated scans, multimodal, task fMRI, movie watching | 4 | Bellec and Boyle ( |
| OASIS-1 | Open Access Series of Imaging Studies 1: Cross-sectional MRI Data in Young, Middle Aged, Nondemented and Demented Older Adults | 416 | ages 18-96, repeated scans (4) | 1 | Marcus et al. ( |
| OASIS-2 | Longitudinal MRI Data in Nondemented and Demented Older Adults | 150 | ages 60-96, repeated scans (4), longitudinal | 1 | Marcus et al. ( |
| OASIS-3 | Longitudinal Neuroimaging, Clinical, and Cognitive Dataset for Normal Aging and Alzheimer’s Disease | 1098 | ages 42-95, longitudinal, multimodal, resting state, genetics | 3 | LaMontagne et al. ( |
| ADNI | Alzheimer’s Disease Neuroimaging Initiative | > 3500 | on-going (current: 3228; long.), ages 55-90, longitudinal, multimodal, resting state, multi-site (57), genetics | 3 | Jack et al. ( |
| AIBL | Australian Imaging, Biomarker & Lifestyle | 863 | ages 55-96, longitudinal, multimodal, genetics, borrowed methods from ADNI | 3 | Ellis et al. ( |
| PREVENT-AD | Pre-symptomatic Evaluation of Experimental or Novel Treatments for Alzheimer Disease | 425 | ages 55-85, test-retest, longitudinal, multimodal, task fMRI, behaviour, genetics | 3 | Orban et al. ( |
| HABS | Harvard Aging Brain Study | 290 | on-going (long.), ages 62-90, longitudinal, multimodal, task fMRI, behaviour, genetics | 4 | Dagley et al. ( |
| MAP | Rush Memory and Aging Project | > 2100 | on-going (current: 936 with MRI; long.), ages 65+, longitudinal, behaviour, genetics, histology | 4 | Bennett et al. ( |
| COMPASS-ND | Comprehensive Assessment of Neurodegeneration and Dementia | 2300 | on-going (current: 1132), ages 50-90, multi-site (30), longitudinal, behaviour | 4 | Chertkow et al. ( |
| ABIDE | Autism Brain Imaging Data Exchange | 1112 | autism, ages 7-64, multi-site (17), resting state | 1 | Di Martino et al. ( |
| ABIDE2 | Autism Brain Imaging Data Exchange II | 2156 | autism, ages 5-64, mutli-site (19), multimodal, resting state | 1 | Martino et al. ( |
| ADHD-200 | ADHD-200 Global Competition | 973 | ADHD, ages 7-27, multi-site (8), resting state | 1 | ADHD-200 Consortium ( |
| SchizConnect | – | 1400 | schizophrenia, ages 14-67, multimodal, multi-site, task fMRI | 3 | Wang et al. ( |
| COBRE | Centers of Biomedical Research Excellence | 146 | schizophrenia, ages 18-65, resting state, behaviour, subset of SchizConnect | 1 | Çetin et al. ( |
| MCIC | MIND Clinical Imaging Consortium | 331 | schizophrenia, ages 18-60, multi-site (4), multimodal, task fMRI, behaviour, genetics, subset of SchizConnect | 3,4 | Gollub et al. ( |
| NUSDAST | Northwestern University Schizophrenia Data and Software Tool | 451 | schizophrenia, ages 17-67, multimodal, behaviour, genetics, subset of SchizConnect | 3 | Wang et al. ( |
| CANDI | Child and Adolescent NeuroDevelopment Initiative | 103 | schizophrenia, bipolar disorder, ages 4-17 | 1 | Frazier et al. ( |
| LA5c | UCLA Consortium for Neuropsychiatric Phenomics | 273 | schizophrenia, bipolar disorder, ADHD, ages 21-50, multimodal, task fMRI | 1 | Poldrack et al. ( |
| PPMI | Parkinson Progression Marker Initiative | 600 | Parkinson’s disease, ages 30-87, multimodal, multi-site (21) | 3 | Marek et al. ( |
| QPN | Québec Parkinson Network | > 2000 | Parkinson’s disease, on-going (current:1070), ages 33-94, multimodal | 4 | Gan-Or et al. ( |
| TRACK-HD | Track Huntington’s Disease | 366 | Huntington’s disease, ages 18-65, multi-site (4), multimodal, behaviour, genetics | 4 | Tabrizi et al. ( |
| SRPBS-MD | Strategic Research Program for Brain Sciences - Multi-Disorder | 805 | several patient groups, mutli-site (9) | 3 | Yamashita et al. ( |
| PAIN | Pain and Interoception Imaging Network | 973 | pain-related conditions, multi-site, resting state, behaviour | 4 | Labus et al. ( |
| MNI Open iEEG | – | 106 | intracranial EEG | 3 | Frauscher et al. ( |
| RAM | Restoring Active Memory | 251 | intracranial EEG | 3 | Weidemann et al. ( |
| PRIME | PRIMatE Data Exchange | > 100 | primate, on-going (current: 227), multi-site, task fMRI | 2,4 | Milham et al. ( |
| MNDD | UNC-Wisconsin Rhesus Macaque Neurodevelopment Database | 34 | macaque, longitudinal, multimodal | 1 | Young et al. ( |
| – | Awake Rat rsfMRI Database | 90 | rat, resting-state | 1 | Liu et al. ( |
| – | Mouse rest multicentre | 255 | mouse, resting-state, multi-site (17) | 1 | Grandjean et al. ( |
Projects that are “on-going” have the target N listed, though the current available sample size (as of February 2021) or if the on-going nature is related to the collection of subsequent longitudinal timepoints. Numbers after “multi-site” indicate the number of sites. “resting state” indicates that resting-state fMRI data is available (but not task), “task fMRI” indicates that task, and often resting-state, fMRI data are available; “multimodal” indicates that other imaging modalities are available beyond T1-weighted and fMRI data; “behaviour” indicates that a substantial amount of non-fMRI behavioural measures; “genetics” indicates that at least some genetics data was also acquired and is shared. Sample sizes for the dementia and other clinical samples include all individuals (i.e., including matched healthy controls). See main text for a detailed description of the accessibility score
Fig. 2Robust resting-state activity patterns. a First gradient within the default-mode network, adapted from Margulies et al. (2016). b Regions associated with global signal intensity, adapted from Li et al. (2019)
Fig. 3Inter- and intra-individual differences in functional connectivity from highly-sampled individuals. a Inter-individual variability across 10 individuals, reprinted from Gordon et al. (2017). b Intra-individual variability (related to fasting/caffeination), reprinted from Poldrack et al. (2015). Distinct colours denote each functional network. Arrows highlight specific regions of inter-individual variability
Fig. 4Examples of infrequent morphological features examined in large datasets. a Typical olfactory bulbs and no apparent bulbs in monozygotic twins, shown on a T2-weighted coronal image, adapted from Weiss et al. (2020). b Typical hippocampus and incomplete hippocampal inversion, shown on a T1-weighted coronal image, adapted from Caciagli et al. (2019). c Single and double cingulate sulcus, shown on the medial view of a reconstructed cortical surface, adapted from Cachia et al. (2016).
Fig. 5Overview of white-matter tracts. Reprinted from Thiebaut de Schotten et al. (2015)
Fig. 6Performance of hyperalignment in comparison to conventional anatomical alignment. a Classification performance from a six-category animal localiser. wsMVPC denotes within-subject multivariate pattern classification; bsMVPC denotes between-subject. b Between-subject MVPC performance of movie-watching data, as a function of amount of data used in the hyperalignment. c Illustration of the method. Reprinted from Guntupalli et al. (2016)
Fig. 7Examples of MRI artifacts in T1 volumes present in the ABIDE dataset. a Head motion artifacts, with increasing magnitude of motion left to right. Volumes comparable to images 1 and 2 would be suitable for further analysis, but those rated as 3 through 5 have too much head motion to be useable. While most of the ABIDE data is of reasonable quality, it is large dataset and includes participants with autism spectrum disorder as well as children, both factors known to be associated with increased head motion (Pardoe et al. 2016; Engelhardt et al. 2017; Greene et al. 2018). b Ghosting artifacts, visible as overlapping images. The example on the left is only visible in the background with a constrained intensity range, but still results in distortions in the image. The image on the right shows a clear duplicate contour of the back of the head. c Blood flow artifact, creating a horizontal band of distortion, here affecting temporal lobe imaging. d Spike noise artifact, resulting in inconsistent signal intensity. e Coil failure artifact, resulting in a regional distortion around the affected coil. Participant IDs are included below each image to allow for the further examination of the original 3D volumes. Artifact MRIs were identified with the aid of MRIQC (Esteban et al. 2017). Pre-computed results are available from https://mriqc.s3.amazonaws.com/abide/T1w_group.html
Fig. 8Overview of cortical parcellation approaches instantiated in FreeSurfer. Parcellations are shown on inflated and pial surfaces and an oblique coronal slice, reconstructed from an MRI of a young adult. Updated from Madan and Kensinger (2018) to include Collantoni et al. (2020) and more clearly show parcellation boundaries on the inflated surface; visualisations produced based on previously described methods (Madan and Kensinger 2016; Klein and Tourville 2012; Destrieux et al. 2010; Scholtens et al. 2018; Fan et al. 2016; Hagmann et al. 2008)
Fig. 9Reported prediction accuracy as a function of sample size for studies in different meta-analyses. Reprinted from Varoquaux (2018). Copyright 2018, Elsevier
Fig. 10Correlations between head-motion during rest and movie-watching fMRI scans with age and body-mass index (BMI). Head motion axes are log-10 scaled to better show inter-individual variability. Reprinted from Madan (2018)
Fig. 11Age-related differences in brain morphology–characterised using cortical thickness, gyrification index, and fractal dimensionality–across the entire cortical gray matter (‘ribbon’) and for each lobe. Adapted from Madan and Kensinger (2016, 2017a)