| Literature DB >> 35036958 |
Corey Horien1,2, Kangjoo Lee3, Margaret L Westwater3, Stephanie Noble3, Link Tejavibulya1, Teimur Kayani3, R Todd Constable1,3,4, Dustin Scheinost1,3,5,6.
Abstract
Large, publicly available neuroimaging datasets are becoming increasingly common, but their use presents challenges because of insufficient knowledge of the tool options for data processing and proper data organization. Here, we describe a protocol to lessen these barriers. We describe the steps for the search and download of the open-source dataset. We detail the steps for proper data management and practical guidelines for data analysis. Finally, we give instructions for data and result sharing on public repositories and preprint services. For complete details on the use and execution of this profile, please refer to Horien et al. (2021).Entities:
Keywords: Behavior; Bioinformatics; Cognitive Neuroscience; Neuroscience
Mesh:
Year: 2022 PMID: 35036958 PMCID: PMC8749295 DOI: 10.1016/j.xpro.2021.101077
Source DB: PubMed Journal: STAR Protoc ISSN: 2666-1667
Figure 1An overview of open-source datasets and open repositories
(A) For each dataset listed in the leftmost column, sample size is indicated, along with the type of data included (‘Data modalities’). ‘Data level’ refers to the level of preprocessing: white circle, raw data; gray circle, some level of preprocessed data; black, processed data (for example, statistical maps, connectivity matrices, etc.).
(B) For each open repository (i.e., a collection of open datasets) listed in the leftmost column, an estimate of the number of open datasets is listed. Datasets of interest are highlighted (‘Featured large datasets’). Sample sizes and the number of open datasets are current as of September 2021. Users are encouraged to visit the website associated with each dataset before use, as sample sizes, access conditions, etc. may change. Figure adapted with permission from (Horien et al., 2021).
Figure 2Steps to download data from OpenNeuro
Figure 3BIDS format
(A) Each participant has folders containing raw anatomical and functional data.
(B) Data generated during the course of the study are stored in a derivatives folder.
A sampling of online data repositories available for sharing different levels of data
| Data level | Available repositories | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Balsa | COINS | INDI | Loris | NDA | NeuroVault | NITRC-IR | Omega | Open neuro | |
| Preproc. data | Y | Y | Y | Y | Y | Y | |||
| Derived, statistical parametric data | Y | Y | Y | Y | Y | ||||
| Access | |||||||||
Table adapted with permission from (Horien et al., 2021).
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Yale Resting State fMRI/Pupillometry: Arousal Study | ||
| Amazon Web Services (AWS) | Amazon Web Services | |
| Bash | Free Software Foundation, Inc. | |
| MATLAB (ver R2021b) | MathWorks | |
| Python (ver 3.10) | Python Software Foundation | |
| R (ver 3.6.2) | The R Foundation | |
| Brain Imaging Data Structure (BIDS) | ||
| FMRIB Software Library (FSL; ver 6.0) | ||
| Statistical Parametric Mapping (ver SPM12) | ||
| Analysis of Functional NeuroImages (AFNI; ver AFNI_21.3.05) | ||
| Advanced Normalization Tools (ANTS) | ||
| fMRIPrep | ||
| Jupyter | Project Jupyter | |
| GitHub | GitHub, Inc. | |