| Literature DB >> 27747592 |
Cao Xiao1, Jesse Bledsoe2, Shouyi Wang3, Wanpracha Art Chaovalitwongse2, Sonya Mehta2, Margaret Semrud-Clikeman4, Thomas Grabowski2.
Abstract
Today, diagnosis of attention deficit hyperactivity disorder (ADHD) still primarily relies on a series of subjective evaluations that highly rely on a doctor's experiences and intuitions from diagnostic interviews and observed behavior measures. An accurate and objective diagnosis of ADHD is still a challenge and leaves much to be desired. Many children and adults are inappropriately labeled with ADHD conditions, whereas many are left undiagnosed and untreated. Recent advances in neuroimaging studies have enabled us to search for both structural (e.g., cortical thickness, brain volume) and functional (functional connectivity) abnormalities that can potentially be used as new biomarkers of ADHD. However, structural and functional characteristics of neuroimaging data, especially magnetic resonance imaging (MRI), usually generate a large number of features. With a limited sample size, traditional machine learning techniques can be problematic to discover the true characteristic features of ADHD due to the significant issues of overfitting, computational burden, and interpretability of the model. There is an urgent need of efficient approaches to identify meaningful discriminative variables from a higher dimensional feature space when sample size is small compared with the number of features. To tackle this problem, this paper proposes a novel integrated feature ranking and selection framework that utilizes normalized brain cortical thickness features extracted from MRI data to discriminate ADHD subjects against healthy controls. The proposed framework combines information theoretic criteria and the least absolute shrinkage and selection operator (Lasso) method into a two-step feature selection process which is capable of selecting a sparse model while preserving the most informative features. The experimental results showed that the proposed framework generated the highest/comparable ADHD prediction accuracy compared with the state-of-the-art feature selection approaches with minimum number of features in the final model. The selected regions of interest in our model were consistent with recent brain-behavior studies of ADHD development, and thus confirmed the validity of the selected features by the proposed approach.Entities:
Year: 2016 PMID: 27747592 PMCID: PMC4999568 DOI: 10.1007/s40708-016-0047-1
Source DB: PubMed Journal: Brain Inform ISSN: 2198-4026
Fig. 1Flowchart of Integrated Feature Ranking and Selection Model
Performance comparison on simulated dataset
| Method | Validation accuracy | Training accuracy | Features selected |
|---|---|---|---|
| Our Model | 0.92 | 0.94 | 5 |
| LR + lasso | 0.86 | 0.97 | 8 |
Fig. 2Best prediction error using LR + lasso (green curve as training error, red curve as testing error, dashed line cuts at min testing error)
Fig. 3Best prediction error using our framework (green curve as training error, red curve as testing error, dashed line cuts at min testing error)
Comparison of testing results (leave-one-out cross-validation)
| Selected features | Testing accuracy | Training accuracy | Sensitivity | Specificity | Selection method |
|---|---|---|---|---|---|
| 4 | 0.81 | 0.87 | 0.81 | 0.80 | Proposed method |
| 5 | 0.76 | 0.78 | 0.75 | 0.80 | MRMR [ |
| 7 | 0.66 | 0.76 | 0.66 | 0.67 | Pudil’s floating search [ |
| 14 | 0.70 | 0.74 | 0.72 | 0.67 | PCA |
| 5 | 0.74 | 0.75 | 0.81 | 0.60 | MIM [ |
| 5 | 0.70 | 0.76 | 0.69 | 0.73 | MIFS [ |
| 5 | 0.72 | 0.78 | 0.72 | 0.73 | JMI [ |
| 5 | 0.74 | 0.76 | 0.75 | 0.73 | CMIM [ |