| Literature DB >> 31827430 |
Taban Eslami1,2, Vahid Mirjalili3, Alvis Fong1, Angela R Laird4, Fahad Saeed2.
Abstract
Heterogeneous mental disorders such as Autism Spectrum Disorder (ASD) are notoriously difficult to diagnose, especially in children. The current psychiatric diagnostic process is based purely on the behavioral observation of symptomology (DSM-5/ICD-10) and may be prone to misdiagnosis. In order to move the field toward more quantitative diagnosis, we need advanced and scalable machine learning infrastructure that will allow us to identify reliable biomarkers of mental health disorders. In this paper, we propose a framework called ASD-DiagNet for classifying subjects with ASD from healthy subjects by using only fMRI data. We designed and implemented a joint learning procedure using an autoencoder and a single layer perceptron (SLP) which results in improved quality of extracted features and optimized parameters for the model. Further, we designed and implemented a data augmentation strategy, based on linear interpolation on available feature vectors, that allows us to produce synthetic datasets needed for training of machine learning models. The proposed approach is evaluated on a public dataset provided by Autism Brain Imaging Data Exchange including 1, 035 subjects coming from 17 different brain imaging centers. Our machine learning model outperforms other state of the art methods from 10 imaging centers with increase in classification accuracy up to 28% with maximum accuracy of 82%. The machine learning technique presented in this paper, in addition to yielding better quality, gives enormous advantages in terms of execution time (40 min vs. 7 h on other methods). The implemented code is available as GPL license on GitHub portal of our lab (https://github.com/pcdslab/ASD-DiagNet).Entities:
Keywords: ABIDE; ASD; SLP; autoencoder; classification; data augmentation; fMRI
Year: 2019 PMID: 31827430 PMCID: PMC6890833 DOI: 10.3389/fninf.2019.00070
Source DB: PubMed Journal: Front Neuroinform ISSN: 1662-5196 Impact factor: 4.081
Class membership information of ABIDE-I dataset for each individual site.
| ASD | 19 | 14 | 20 | 29 | 24 | 75 | 12 | 19 | 29 | 15 | 14 | 19 | 22 | 54 | 66 | 46 | 28 |
| Healthy control | 18 | 13 | 28 | 34 | 28 | 100 | 14 | 15 | 27 | 15 | 22 | 20 | 25 | 44 | 74 | 25 | 28 |
| Male count | 29 | 21 | 36 | 55 | 48 | 139 | 26 | 29 | 48 | 30 | 29 | 31 | 47 | 86 | 113 | 71 | 40 |
| Female count | 8 | 6 | 12 | 8 | 4 | 36 | 0 | 5 | 8 | 0 | 7 | 8 | 0 | 12 | 27 | 0 | 16 |
| Average age | 27 | 26 | 10 | 18 | 25 | 15 | 10 | 16 | 18 | 34 | 14 | 9 | 16 | 13 | 14 | 22 | 12 |
Figure 1Structure of an autoencoder consisting of an encoder that receives the input data and encodes it into a lower dimensional representation at the bottleneck layer, and a decoder that reconstructs the original input from the bottleneck layer.
Figure 2Generating new artificial data: Step (1) Selecting a sample (p). Step (2) Find k-nearest neighbors of p from the same class, and pick one random neighbor (q). Step (3) Generate new sample p′ using p and q by linear interpolation.
Figure 3Workflow of ASD-DiagNet: (A) Pairwise Pearson's correlations for each subject in the training set is computed. The average of all correlation arrays is computed and the position of 1/4 largest and 1/4 smallest values in the average array is considered as a mask. Masked correlation array of each sample is considered as its feature vectors. (B) A set of artificial samples is generated using the feature vectors of training samples. (C) Autoencoder and SLP are jointly trained by adding up their training loss in each iteration. (D) For a test subject, the features are extracted using the mask generated in part A, followed by passing the features through the encoder part of the autoencoder, and finally predicting its label using the trained SLP.
Classification performance using 10-fold cross-validation on the whole dataset; Note that our proposed approach, ASD-DiagNet (with data augmentation) achieves the highest accuracy among other methods.
| ASD-DiagNet | 68.3 | 72.2 | |
| ASD-DiagNet (no aug.) | 69.4 | 69.6 | 69.2 |
| SVM | 68.3 | 64.4 | 72 |
| Random forest | 66.3 | 60.8 | 71.4 |
| Heinsfeld et al., | 65.4 | 61 | 69.3 |
Bold values show the highest accuracy among all methods.
Figure 4ROC curves of different methods for classification of whole dataset using CC-200 parcellation.
Classification accuracy using 5-fold cross-validation on individual data centers using our proposed method, ASD-DiagNet (with and without data augmentation), compared with other methods.
| Caltech | 52.8 | 49.9 | 52.3 | 46.9 | |
| CMU | 67.4 | 45.3 | 66.6 | 62.4 | |
| KKI | 68.6 | 58.2 | 66.4 | 66.6 | |
| Leuven | 57 | 51.8 | 59.8 | 59.8 | |
| MaxMun | 48.6 | 51.4 | 53.8 | 49.2 | |
| NYU | 68 | 65.1 | 64.5 | 61.8 | |
| 71.9 | 74 | 79.4 | 54.3 | ||
| Olin | 58.8 | 44 | 59.5 | 52.2 | |
| Pitt | 65.9 | 59.8 | 66.3 | 59.9 | |
| SBL | 51.6 | 47.5 | 46.6 | 48.3 | |
| SDSU | 63 | 61.3 | 58.7 | 62.7 | |
| Stanford | 53 | 48.5 | 51.4 | 62.1 | |
| Trinity | 54.1 | 51.2 | 53.1 | 54.5 | |
| UCLA | 70.3 | 57.7 | 72.1 | 69.3 | |
| USM | 68.2 | 65.1 | 62 | 58 | |
| UM | 63.8 | 57.6 | 64.2 | 64.8 | |
| Yale | 61.7 | 53 | 61.6 | 55.3 | |
| Average | 60.7 | 56.1 | 62.6 | 58.6 |
Bold and color values corresponds to highest accuracy achieved among all datasets.
Classification accuracy using 10-fold cross-validation on the whole dataset based on AAL atlas.
| ASD-DiagNet | 63.4 | 71.5 | |
| ASD-DiagNet (no aug.) | 64.5 | 60.9 | 68 |
| SVM | 63.9 | 70.9 | |
| Random forest | 65 | 56.8 | 72.7 |
| Heinsfeld et al., | 63.3 | 58.6 | 67.8 |
Bold values show the highest accuracy among all methods.
Classification accuracy using 10-fold cross-validation on the whole dataset based on TT atlas.
| ASD-DiagNet | 65.3 | 63.4 | 66.9 |
| ASD-DiagNet (no aug.) | 65.2 | 61.1 | 69 |
| SVM | 61.6 | 71 | |
| Random forest | 65.1 | 60.3 | 69.7 |
| Heinsfeld et al., | 63.2 | 59.8 | 66.4 |
Bold values show the highest accuracy among all methods.
Figure 5ROC curves of different methods for classification of whole dataset using AAL parcellation.
Figure 6ROC curves of different methods for classification of whole dataset using TT parcellation.
Classification accuracy using 10-fold cross-validation on the subjects below the age of 15.
| ASD-DiagNet | 66.7 | 69.4 | |
| ASD-DiagNet (no aug.) | 66.9 | 59.2 | 74.3 |
| SVM | 66.9 | 64.5 | 69.2 |
| Random forest | 64.3 | 57.4 | 70.8 |
| Heinsfeld et al., | 65.2 | 62.1 | 68.3 |
Figure 7ROC curves of different methods for classification of subjects below the age of 15 using CC-200 parcellation.
Algorithm 1: Computing weight vector for EROS (Yang and Shahabi, 2004)
| Each column vector |
| 1: for |
| 2: |
| 3: |
| 4: |
| 5: |
| 6: |
| 7: |
| 8: |
| 9: |
Algorithm 2: Data augmentation using EROS similarity measure
| 1: |
| 2: Find 5 nearest neighbors to |
| 3: |
| 4: |
| 5: |
| 6: |