| Literature DB >> 32090157 |
Pradyumna Lanka1,2, D Rangaprakash1,3,4, Sai Sheshan Roy Gotoor1, Michael N Dretsch5,6,7, Jeffrey S Katz1,7,8,9, Thomas S Denney1,7,8,9, Gopikrishna Deshpande1,7,8,10,9,11,12,13.
Abstract
Resting-state functional Magnetic Resonance Imaging (rs-fMRI) has been extensively used for diagnostic classification because it does not require task compliance and is easier to pool data from multiple imaging sites, thereby increasing the sample size. A MATLAB-based toolbox called Machine Learning in NeuroImaging (MALINI) for feature extraction and disease classification is presented. The MALINI toolbox extracts functional and effective connectivity features from preprocessed rs-fMRI data and performs classification between healthy and disease groups using any of 18 popular and widely used machine learning algorithms that are based on diverse principles. A consensus classifier combining the power of multiple classifiers is also presented. The utility of the toolbox is illustrated by accompanying data consisting of resting-state functional connectivity features from healthy controls and subjects with various brain-based disorders: autism spectrum disorder from autism brain imaging data exchange (ABIDE), Alzheimer's disease and mild cognitive impairment from Alzheimer's disease neuroimaging initiative (ADNI), attention deficit hyperactivity disorder from ADHD-200, and post-traumatic stress disorder and post-concussion syndrome acquired in-house. Results of classification performed on the above datasets can be obtained from the main article titled "Supervised machine learning for diagnostic classification from large-scale neuroimaging datasets" [1]. The data was divided into homogeneous and heterogeneous splits, such that 80% could be used for training, model building and cross-validation, while the remaining 20% of the data could be used as a hold-out independent test data for replication of the classification performance, to ensure the robustness of the classifiers to population variance in image acquisition site and age of the sample.Entities:
Keywords: ADHD; Alzheimer's disease; Autism; Diagnostic classification; Functional connectivity; PTSD; Resting-state functional MRI; Supervised machine learning
Year: 2020 PMID: 32090157 PMCID: PMC7025186 DOI: 10.1016/j.dib.2020.105213
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Number of subjects in each subgroup after splitting the entire data for the following datasets — (A) autism brain imaging data exchange (ABIDE), (B) attention deficit hyperactivity disorder-200 (ADHD-200), (C) post-traumatic stress disorder (PTSD), and (D) Alzheimer's disease neuroimaging initiative (ADNI), into training/validation and hold-out test data. PCS: post-concussion syndrome; ADHD –I (inattentive); ADHD–H (hyperactive/impulsive); ADHD–C (combined).
Fig. 2A snapshot of the GUI for the proposed MALINI toolbox.
Specifications Table
| Subject area | |
| More specific subject area | |
| Type of data | |
| How data was acquired | |
| Data format | |
| Experimental factors | |
| Experimental features | |
| Data source location | |
| Data accessibility | |
| Related research article | P. Lanka, D. Rangaprakash, M.N. Dretsch, J.S. Katz, T.S. Denney Jr., G. Deshpande, Supervised machine learning for diagnostic classification from large-scale neuroimaging datasets, Brain Imaging and Behavior (2019) in press, |
Our data can be used for replication, benchmarking and testing the performance of various machine learning algorithms to classify neurological diseases. These datasets would provide machine learning enthusiasts, who are not from the field of neuroimaging, an opportunity to explore disease classification without any prior knowledge or experience in either neuroscience or neuroimaging. However, researchers from outside the field of neuroimaging must familiarize themselves with the nuances, caveats, and limitations of the application of machine learning to questions of clinical diagnosis using neuroimaging-derived features before using these methods [ The MALINI toolbox provides a one-stop solution for extracting the BOLD time series from regions in the CC200 template [ The toolbox we include has 18 different machine learning algorithms embedded within it which can be used for disease classification. Further, consensus classification, by combining inferences from multiple classifiers, is also available. The code in the toolbox can be modified to include other machine learning classifiers as well. |