| Literature DB >> 22969709 |
Ani Eloyan1, John Muschelli, Mary Beth Nebel, Han Liu, Fang Han, Tuo Zhao, Anita D Barber, Suresh Joel, James J Pekar, Stewart H Mostofsky, Brian Caffo.
Abstract
Successful automated diagnoses of attention deficit hyperactive disorder (ADHD) using imaging and functional biomarkers would have fundamental consequences on the public health impact of the disease. In this work, we show results on the predictability of ADHD using imaging biomarkers and discuss the scientific and diagnostic impacts of the research. We created a prediction model using the landmark ADHD 200 data set focusing on resting state functional connectivity (rs-fc) and structural brain imaging. We predicted ADHD status and subtype, obtained by behavioral examination, using imaging data, intelligence quotients and other covariates. The novel contributions of this manuscript include a thorough exploration of prediction and image feature extraction methodology on this form of data, including the use of singular value decompositions (SVDs), CUR decompositions, random forest, gradient boosting, bagging, voxel-based morphometry, and support vector machines as well as important insights into the value, and potentially lack thereof, of imaging biomarkers of disease. The key results include the CUR-based decomposition of the rs-fc-fMRI along with gradient boosting and the prediction algorithm based on a motor network parcellation and random forest algorithm. We conjecture that the CUR decomposition is largely diagnosing common population directions of head motion. Of note, a byproduct of this research is a potential automated method for detecting subtle in-scanner motion. The final prediction algorithm, a weighted combination of several algorithms, had an external test set specificity of 94% with sensitivity of 21%. The most promising imaging biomarker was a correlation graph from a motor network parcellation. In summary, we have undertaken a large-scale statistical exploratory prediction exercise on the unique ADHD 200 data set. The exercise produced several potential leads for future scientific exploration of the neurological basis of ADHD.Entities:
Keywords: gradient boosting; random forest; singular value decomposition; voxel-based morphometry
Year: 2012 PMID: 22969709 PMCID: PMC3431009 DOI: 10.3389/fnsys.2012.00061
Source DB: PubMed Journal: Front Syst Neurosci ISSN: 1662-5137
Figure 1Motor cortex parcellation.
Figure 2264 seed voxels.
Figure 3Demographic information.
Figure 4Dot plot of composite intelligence quotients (average of all available IQ measurements per subject) by data contributing site color coded by disease subtype for the internal training set and internal test set.
Overview of final prediction methods used by each subteam.
| 1 | All IQ, age, gender, handedness, site | NITRC | Motor network parcellation, random forest random forest for prediction. |
| 2 | All IQ, age, gender, handedness, site | NITRC | Feature extraction, clustering, LDA, multi-class SVM. |
| 3 | Composite IQ, age, gender, handedness, site | NITRC | CUR decomposition feature extraction, gradient boosting. |
| 4 | Composite IQ, age, gender, handedness, site | NITRC NB Athena | 264 seed voxels, motion parameters, PCA, machine learning algorithms. |
Composite IQ uses the average of all available IQs. All IQ suggests the use of all available IQ measurements. NITRC for image processing implies the use of the 1000 Functional Connectome processing scripts. NB refers to the NeuroBureau pipelines.
Basic demographics by site.
| Control | 50 | 47 | 0 | 65 | 32 | 38 | 37 | 91 | 100 |
| Comb. | 17 | 12 | 0 | 17 | 25 | 29 | 20 | 0 | 0 |
| Hyper./Imp. | 1 | 0 | 0 | 1 | 8 | 1 | 2 | 0 | 0 |
| Inatt. | 11 | 20 | 0 | 5 | 1 | 17 | 11 | 0 | 0 |
| Withheld | 20 | 21 | 100 | 12 | 34 | 16 | 30 | 9 | 0 |
| Female | 38 | 29 | 65 | 40 | 41 | 35 | 46 | 46 | 46 |
| Male | 62 | 71 | 35 | 60 | 59 | 65 | 54 | 54 | 54 |
| % QC Fail | 22 | 1 | 4 | 6 | 12 | 34 | 28 | 32 | 72 |
| Min | 7.09 | 8.08 | 8.50 | 8.02 | 11.05 | 7.17 | 7.17 | 10.11 | 7.09 |
| Median | 11.42 | 11.75 | 14.83 | 10.10 | 17.78 | 11.11 | 8.75 | 14.87 | 10.35 |
| Mean | 12.43 | 11.70 | 14.54 | 10.22 | 17.64 | 11.45 | 9.10 | 15.08 | 11.47 |
| Max | 26.31 | 17.33 | 17.87 | 12.99 | 26.31 | 17.96 | 12.50 | 20.45 | 21.83 |
| Sd | 3.33 | 1.96 | 2.54 | 1.34 | 3.05 | 2.91 | 1.25 | 2.78 | 3.88 |
Acronyms are: Comb., ADHD combined type; Hyper./Imp., ADHD hyperactive impulsive; Inatt., ADHD inattentive; % QC fail, percentage where any imaging quality control flag is listed as failing.
Figure 5Voxels chosen by CUR decomposition.
Figure 6Dot plot of composite intelligence quotients (average of all available IQ measurements per subject) by data contributing site color coded by disease subtype.
Average fMRI resting state correlations between motor network M1 parcels across subjects classified by disease status subtypes.
| Mean | 0.115 | 0.344 | 0.183 | 0.277 | −0.002 | 0.272 | 0.146 | 0.450 | 0.229 | 0.187 |
| SD | 0.206 | 0.184 | 0.204 | 0.191 | 0.189 | 0.207 | 0.201 | 0.182 | 0.205 | 0.187 |
| Mean | 0.134 | 0.349 | 0.192 | 0.284 | −0.007 | 0.279 | 0.168 | 0.456 | 0.241 | 0.179 |
| SD | 0.207 | 0.182 | 0.203 | 0.188 | 0.183 | 0.205 | 0.200 | 0.174 | 0.201 | 0.189 |
| Mean | 0.084 | 0.349 | 0.192 | 0.281 | 0.008 | 0.289 | 0.084 | 0.469 | 0.210 | 0.171 |
| SD | 0.209 | 0.173 | 0.198 | 0.185 | 0.196 | 0.201 | 0.194 | 0.174 | 0.201 | 0.198 |
| Mean | 0.103 | 0.317 | 0.185 | 0.249 | −0.015 | 0.266 | 0.120 | 0.449 | 0.239 | 0.175 |
| SD | 0.210 | 0.183 | 0.191 | 0.203 | 0.187 | 0.213 | 0.199 | 0.187 | 0.201 | 0.148 |
| Model 1 | 0.023 | 0.237 | 0.942 | 0.212 | 0.555 | 0.655 | 0.000 | 0.613 | 0.235 | 0.884 |
| Model 2 | 0.440 | 0.276 | 0.801 | 0.241 | 0.526 | 0.621 | 0.012 | 0.625 | 0.705 | 0.925 |
| Model 3 | 0.418 | 0.110 | 0.883 | 0.657 | 0.472 | 0.921 | 0.057 | 0.485 | 0.280 | 0.701 |
AL, anterior lateral; DL, dorsolateral; DM, dorsomedial; PL, posterior lateral; VL, ventrolateral. P-values correspond to likelihood ratio tests of multinomial models of the resting state correlation. Model 1 included no covariates, model 2 included gender, age, handedness and IQ, model 3 included model 2 variables plus an indicator for data collecting site.
Figure 7Plot of correlations between the dorsomedial and dorsolateral M1 parcels by disease subtype. A reference line is drawn at zero while the inter-subject means (small horizontal line) and 95% confidence intervals (small vertical lines) are given to the left of each group.