| Literature DB >> 30828295 |
Milan N Parikh1, Hailong Li1, Lili He1,2.
Abstract
Autism spectrum disorder (ASD) is a developmental disorder, affecting about 1% of the global population. Currently, the only clinical method for diagnosing ASD are standardized ASD tests which require prolonged diagnostic time and increased medical costs. Our objective was to explore the predictive power of personal characteristic data (PCD) from a large well-characterized dataset to improve upon prior diagnostic models of ASD. We extracted six personal characteristics (age, sex, handedness, and three individual measures of IQ) from 851 subjects in the Autism Brain Imaging Data Exchange (ABIDE) database. ABIDE is an international collaborative project that collected data from a large number of ASD patients and typical non-ASD controls from 17 research and clinical institutes. We employed this publicly available database to test nine supervised machine learning models. We implemented a cross-validation strategy to train and test those machine learning models for classification between typical non-ASD controls and ASD patients. We assessed classification performance using accuracy, sensitivity, specificity, and area under the receiver operating characteristic curve (AUC). Of the nine models we tested using six personal characteristics, the neural network model performed the best with a mean AUC (SD) of 0.646 (0.005), followed by k-nearest neighbor with a mean AUC (SD) of 0.641 (0.004). This study established an optimal ASD classification performance with PCD as features. With additional discriminative features (e.g., neuroimaging), machine learning models may ultimately enable automated clinical diagnosis of autism.Entities:
Keywords: autism spectrum disorder; biostatistics; diagnosis; machine learning; support vector machine
Year: 2019 PMID: 30828295 PMCID: PMC6384273 DOI: 10.3389/fncom.2019.00009
Source DB: PubMed Journal: Front Comput Neurosci ISSN: 1662-5188 Impact factor: 2.380
Demographic information for our sub-sample of the Autism Brain Imaging Data Exchange (ABIDE) Database.
| Group | ASD ( | Control ( | |
|---|---|---|---|
| Age | 16.8 ± 7.7 | 16.7 ± 6.9 | 0.858 |
| Full-Scale IQ | 105.2 ± 16.8 | 110.9 ± 12.6 | <0.001 |
| Verbal IQ | 104.4 ± 17.8 | 111.3 ± 13.3 | <0.001 |
| Performance IQ | 105.0 ± 17.2 | 108.2 ± 13.3 | 0.003 |
| Sex (%) | 0.017 | ||
| Male | 88 | 82 | |
| Female | 12 | 18 | |
| Handedness (%) | 0.018 | ||
| Left | 13 | 6 | |
| Right | 85 | 92 | |
| Ambidextrous | 2 | 1 |
All data are mean ± SD unless otherwise specified.
Accuracy, sensitivity, specificity, and area under the receiver operating characteristic curve (AUC) values for each machine learning model.
| Accuracy (%) | Sensitivity (%) | Specificity (%) | AUC | |
|---|---|---|---|---|
| Decision tree | 54.7 ± 1.5 | 53.3 ± 2.0 | 54.9 ± 1.7 | 0.562 ± 0.015 |
| Majority model | 61.9 ± 0.8 | 55.4 ± 1.1 | 69.2 ± 1.3 | 0.568 ± 0.009 |
| Random forest | 57.2 ± 0.8 | 54.4 ± 1.2 | 60.4 ± 1.1 | 0.615 ± 0.007 |
| SVM (linear) | 61.4 ± 0.5 | 57.1 ± 0.6 | 66.7 ± 0.8 | 0.622 ± 0.002 |
| SVM (non-linear) | 61.9 ± 0.4 | 52.3 ± 1.5 | 71.6 ± 1.1 | 0.623 ± 0.005 |
| Confidence model | 61.5 ± 0.9 | 49.1 ± 1.4 | 67.1 ± 1.0 | 0.633 ± 0.008 |
| Logistic regression | 59.1 ± 0.5 | 55.5 ± 0.6 | 62.6 ± 0.8 | 0.635 ± 0.001 |
| k-Nearest neighbor | 61.8 ± 0.6 | 46.6 ± 1.0 | 72.1 ± 0.8 | 0.641 ± 0.004 |
| Neural network | 62.0 ± 0.9 | 53.3 ± 1.3 | 71.2 ± 1.9 | 0.646 ± 0.005 |
All data are mean ± SD; SVM, Support Vector Machine.