Literature DB >> 24123508

Pattern classification of response inhibition in ADHD: toward the development of neurobiological markers for ADHD.

Heledd Hart1, Kaylita Chantiluke, Ana I Cubillo, Anna B Smith, Andrew Simmons, Michael J Brammer, Andre F Marquand, Katya Rubia.   

Abstract

The diagnosis of Attention Deficit Hyperactivity Disorder (ADHD) is based on subjective measures despite evidence for multisystemic structural and functional deficits. ADHD patients have consistent neurofunctional deficits in motor response inhibition. The aim of this study was to apply pattern classification to task-based functional magnetic resonance imaging (fMRI) of inhibition, to accurately predict the diagnostic status of ADHD. Thirty adolescent ADHD and thirty age-matched healthy boys underwent fMRI while performing a Stop task. fMRI data were analyzed with Gaussian process classifiers (GPC), a machine learning approach, to predict individual ADHD diagnosis based on task-based activation patterns. Traditional univariate case-control analyses were also performed to replicate previous findings in a relatively large dataset. The pattern of brain activation correctly classified up to 90% of patients and 63% of controls, achieving an overall classification accuracy of 77%. The regions of the discriminative network most predictive of controls included later developing lateral prefrontal, striatal, and temporo-parietal areas that mediate inhibition, while regions most predictive of ADHD were in earlier developing ventromedial fronto-limbic regions, which furthermore correlated with symptom severity. Univariate analysis showed reduced activation in ADHD in bilateral ventrolateral prefrontal, striatal, and temporo-parietal regions that overlapped with areas predictive of controls, suggesting the latter are dysfunctional areas in ADHD. We show that significant individual classification of ADHD patients of 77% can be achieved using whole brain pattern analysis of task-based fMRI inhibition data, suggesting that multivariate pattern recognition analyses of inhibition networks can provide objective diagnostic neuroimaging biomarkers of ADHD.
Copyright © 2013 The Authors. Human Brain Mapping published by Wiley Periodicals, Inc.

Entities:  

Keywords:  ADHD; Gaussian process classifier; biomarker; diagnosis; functional magnetic resonance imaging; inhibition

Mesh:

Substances:

Year:  2013        PMID: 24123508      PMCID: PMC4190683          DOI: 10.1002/hbm.22386

Source DB:  PubMed          Journal:  Hum Brain Mapp        ISSN: 1065-9471            Impact factor:   5.038


INTRODUCTION

Attention Deficit Hyperactivity Disorder (ADHD) is the most commonly diagnosed child psychiatric disorder, defined by age‐inappropriate problems with inattention, impulsivity, and hyperactivity [American Psychiatric Association, 2000]. One of the most consistent deficits is in motor response inhibition during the Stop task [Alderson et al., 2007; Rubia et al., 2007a; Willcutt et al., 2005], underpinned by functional magnetic resonance imaging (fMRI) findings of reduced activation in key inhibition areas of ventrolateral prefrontal cortex (VLPFC), supplementary motor area (SMA) and caudate, as well as temporo‐parietal regions [Cubillo et al., 2010, 2012; Hart et al., 2013; Passarotti et al., 2010; Pliszka et al., 2006; Rubia, 2011; Rubia et al., 1999, 2005, 2008, 2010, 2011a, 2013; Smith et al., 2006]. Despite the fact that ADHD is a neurodevelopmental disorder with consistent evidence for brain structure and function deficits [Cubillo et al., 2012; Nakao et al., 2011; Hart et al., 2012, 2013; Rubia, 2011; Valera et al., 2007], currently ADHD is diagnosed solely on the basis of subjective clinical and rating measures, which are often unreliable, leading to diagnostic variability between clinicians, cultures and countries [Polanczyk et al., 2007]. Sensitivity of classification of ADHD children with clinical measures based on Diagnostic and Statistical Manual (DSM)‐IV criteria has been shown to be 70–90% [Weiler et al., 2000], thus misdiagnoses are around 10–30%. It is thus highly desirable to develop additional and more reliable diagnostic methods for ADHD patients that rely on objectively measurable neuroimaging data. Attempts to find objective neuroimaging biomarkers for ADHD, however, have been limited by the fact that in traditional univariate group statistical analyses, subjects in both groups tend to overlap in measures that show group differences and effect sizes have been relatively small [Valera et al., 2007], which has made it difficult to make diagnostic predictions at the level of individual subjects. In contrast, multivariate pattern analyses (MVPA) for imaging data take into account interactions between regions (i.e., brain structure or function patterns) and can make predictions (e.g., of class membership) for individual subjects as opposed to group‐level inferences. These methods have been shown to provide sensitive and specific diagnostic indicators for individual patients with other pathologies such as autism, depression, and Alzheimer's disease [Ecker et al., 2010; Marquand et al., 2008; Orru et al., 2012]. To date, few imaging studies have used multivariate pattern recognition analyses techniques to classify ADHD patients. A recent competition to apply multivariate methods on a multicenter resting state functional and anatomical imaging dataset of 285 children and adolescents with ADHD and 491 healthy controls (ADHD‐200 Consortium; http://fcon_1000.projects.nitrc.org/indi/adhd200/) was met by a range of classification approaches including random forests, gradient boosting, multi‐kernel learning and support vector machines [Cheng et al., 2012; Colby et al., 2012; Dai et al., 2012; Eloyan et al., 2012]. Accuracies derived by internal cross‐validation ranged from 55–78%, although the accuracies reported on an external test dataset for which diagnostic labels were withheld were substantially lower [61% for the winning team (Eloyan et al., 2012)]. This difference was attributed to a lack of standardization between sites, leading to multiple confounds including missing data, site‐specific differences in behavioural measurements, imaging acquisition, processing, and protocols, scanner quality and other unmeasured confounding and mediating variables. Furthermore, the competition dataset was highly unbalanced, with more control subjects than ADHD patients (63% and 37% respectively) and balanced accuracy measures, caclulated as the mean of sensitivity and specificity, that accommodate this imbalance [Broderson et al., 2012] are consistently lower than the figures reported (e.g., 57.5% for the winning team). In addition, the competition scoring rewarded specificity more than sensitivity so that all teams reported high specificity, but poor sensitivity (21% for the winning team). Also, none of the studies used probabilistic classification models such as Gaussian Process Classifiers (GPCs). GPCs are kernel classifiers used in machine learning, similar to support vector machines (SVMs), which have good performance for fMRI [Marquand et al., 2010]. Their main advantage over alternative methods is that they provide estimates of predictive uncertainty and can accommodate unbalanced diagnostic settings or variations in disease prevalence, which are crucial for clinical applications [Hahn et al., 2012]. The only study that used GPC in structural imaging data in ADHD showed that it is possible to accurately classify 29 ADHD patients relative to controls and patients with autism with an accuracy of over 79% based on structural MRI data [Lim et al., 2013]. Lastly, to our knowledge, no study in ADHD has used multivariate classification methods to task‐related functional imaging data. The aim of the present study was therefore to test the hypothesis that GPCs of task‐based fMRI data during the tracking Stop task in 30 boys with and 30 healthy boys can identify distributed neurofunctional patterns that will provide accurate diagnostic predictors of ADHD. The tracking Stop task is individually adjusted for accuracy and involves the inhibition of a pre‐potent motor response to a Stop signal that infrequently and unexpectedly follows a go signal. We considered the functional neuroanatomy of the Stop task to be an excellent candidate for a diagnostic biomarker for ADHD since (1) deficits in motor response inhibition in the Stop task are one of the most consistent findings in ADHD [Alderson et al., 2007; Rubia et al., 2007a; Willcutt et al., 2005] and (2) fMRI studies using the Stop task consistently report reduced activation in ADHD patients relative to healthy controls in key areas of motor response inhibition such as right VLPFC, SMA, caudate and thalamus [Cubillo et al., 2010; Hart et al., 2013; Pliszka et al., 2006; Rubia et al., 1999, 2005, 2008, 2010, 2011a]. A secondary aim was to use traditional univariate analyses to replicate previous findings of reduced function in inhibitory regions of VLPFC and the basal ganglia in a relatively large cohort of 30 ADHD patients, given that previous studies were typically conducted in less than 20 subjects [Cubillo et al., 2010; 2013; Passarotti et al., 2010; Pliszka et al., 2006; Rubia et al., 1999, 2008, 2010, 2011a,b, 2005] and to test whether discriminating activation patterns from the MVPA overlapped with activations identified in conventional univariate analysis group comparisons.

METHODS AND MATERIALS

Participants

Sixty right‐handed boys aged between 10–17 years participated. Thirty boys were recruited from clinics with a clinical diagnosis of ADHD, hyperactive‐impulsive/inattentive combined subtype, as assessed by an experienced child psychiatrist using the standardized Maudsley diagnostic interview [Goldberg and Murray, 2002] that assesses ADHD according to DSM‐IV‐TR criteria [American Psychiatric Association, 1994]. ADHD boys scored above clinical threshold for hyperactive‐impulsive/inattentive symptoms on the Strengths and Difficulties Questionnaire for parents (SDQ) [Goodman et al., 2000], the Conners' Parent Rating Scale (CPRS‐R) [Conners et al., 1998], and below clinical threshold on the Social Communication Questionnaire (SCQ) [Rutter et al., 2003]. The majority of the ADHD boys were medication naive (N = 20). The remainder were taking methylphenidate but discontinued treatment for a 48hr washout period prior to scanning (N = 9) or had taken methylphenidate in the past but had discontinued treatment for over a year prior to scanning (N = 1). Thirty healthy control boys were recruited through advertisement in the same geographical area. They scored below clinical threshold on the SDQ, SCQ and CPRS‐R. Exclusion criteria for all participants were IQ<70 on the Wechsler Abbreviated Scale of Intelligence (WASI) [Wechsler, 1999], history of substance abuse or neurological deficits, presence of other psychiatric disorder (except for conduct/oppositional defiant disorder in the ADHD group, N = 2), learning disability, reading, speech or language disorder (see Table 1 for demographic and clinical measures).
Table 1

Demographic and clinical data for 30 boys with ADHD and 30 healthy control boys

Demographic/clinical measuresControls Mean (SD)ADHD Mean (SD)
Age (years, month)14.1 (2.5)13.9 (2)
IQ109 (12)92 (11)
Conners Parent Rating Scale45 (4)79 (8)
SDQ Hyperactive‐impulsive/inattentive behaviours1.9 (2)8.6 (1.5)
Social Communication Questionnaire1.0 (1)9 (4.6)

SDQ, strength and difficulty questionnaire.

Demographic and clinical data for 30 boys with ADHD and 30 healthy control boys SDQ, strength and difficulty questionnaire. One‐way analyses of variance (ANOVAs) showed no significant group differences for age (F(1,58)=0.63; P = 0.43), but for IQ ((F(1,58)=30.379; P < 0.001) which is typical in this population [Kuntsi et al., 2004; Polderman et al., 2006]. Although ANCOVA is commonly conducted in case‐control studies, this is statistically illegitimate when the covariate is typically associated with the disorder and when, as in this study, groups were not randomly selected. It then becomes meaningless to “adjust” the group effects for differences in the covariate, and ANCOVA cannot be used to control group assignment independent of the covariate as it would alter the group effect in potentially problematic ways, leading to spurious results [Dennis et al., 2009; Miller and Chapman, 2001]. Therefore, all analyses were conducted without IQ as a covariate. However, to assess the potential impact of IQ on the classification findings, GPC predictive probabilities were correlated with IQ within each group. Originally 34 ADHD boys were scanned, but 3 ADHD boys had to be excluded due to high motion and 1 ADHD boy was excluded due to extreme outlying task performance. Participants received £50 per scanning session. Parental and child informed consent/assent and approval from the local Ethical Committee were obtained.

Stop Task

Participants practiced the 9‐min mixed‐trial, event‐related tracking fMRI Stop task, which measures the ability to suppress an already triggered motor response [Cubillo et al., 2013; Rubia et al., 2003, 2005, 2007b, 2008, 2010, 2013], once prior to scanning. The basic go trials are choice reaction time trials with a mean ITI of 1.8 s, where participants have to respond to go arrows (80% of trials, 236 trials, 1 s duration) pointing either right or left with a right or left button response with the right/left thumb. In 20% of trials (60 trials), the go‐signals are followed by stop‐signals ∼250 ms later and participants have to inhibit their motor responses. A tracking algorithm changes the time interval between go‐signal and stop‐signal onsets in steps of 50 ms according to each subject's performance on previous trials based on the average percentage of inhibition over previous stop trials, recalculated after each stop trial, resulting in 50% successful and 50% unsuccessful inhibition trials [Cubillo et al., 2013; Rubia et al., 2003, 2005, 2007b, 2008, 2010, 2013]. The dependent task variables is the Stop Signal Reaction Time (SSRT), calculated by subtracting the mean stop‐signal delay (SSD: average time between go‐ and stop‐signal, at which the subject inhibited 50% of stop trials) from the mean reaction time (MRT) to go trials, i.e. MRT‐SSD [Logan et al., 1997]. Measures of the Go process of the task are the MRT to go trials and intra‐subject standard deviation of MRT (SD of MRT) and premature responses (defined as responses made 200 ms before stimulus appearance).

fMRI Data Acquisition

Gradient‐echo echoplanar MR imaging (EPI) data were acquired on a GE Signa 3T Horizon HDx system (General Electric, Milwaukee, WI) at the Centre for Neuroimaging Sciences, Institute of Psychiatry, King's College London, UK. A semi‐automated quality control procedure ensured consistent image quality [Simmons, 1999]. The body coil was used for RF transmission and an 8 channel headcoil for RF reception. In each of 28 noncontiguous planes parallel to the anterior‐posterior commissure, 296 T2*‐weighted MR images depicting BOLD (Blood Oxygen Level Dependent) contrast covering the whole brain were acquired with TE = 30 ms, TR = 1.8 s, flip angle = 75°, in‐plane resolution = 3 mm, slice thickness = 5.5 mm (including slice‐skip = 0.5 mm). A high‐resolution gradient echo planar image was also acquired in the inter‐commissural plane, with TE = 30 ms, TR = 1.8 s, flip angle =9 0°, 43 slices, slice thickness = 3.0 mm, slice skip = 0.3 mm, 1.875 mm in‐plane voxel size (matrix size 128x128), providing complete brain coverage.

Univariate fMRI Data Analysis

The software package XBAM was used for univariate analysis (http://www.brainmap.co.uk) [Brammer et al., 1997]. fMRI data were first processed to minimise motion related artifacts [Bullmore et al., 1999]. A 3D volume consisting of the average intensity at each voxel over the whole experiment was calculated and used as a template. The 3D image volume at each time point was then realigned to this template by computing the combination of rotations (around the x y and z axes) and translations (in x y and z) that maximized the correlation between the image intensities of the volume in question and the template (rigid body registration). Following realignment, data were then smoothed using a Gaussian filter (FWHM 2.354 * in‐plane fMRI voxel size mm) to improve the signal to noise characteristics of the images [Bullmore et al., 1999]. After preprocessing, time series analysis for each subject was based on a wavelet‐based data resampling method for fMRI data [Bullmore et al., 1999, 2001]. At the individual subject level, a standard general linear modelling (GLM) approach was used to obtain estimates of the response size (beta) to the Stop task condition (successful stop trials) against an implicit baseline (go trials). After first‐level analysis, the individual statistical maps were normalised into Talairach standard space [Bullmore et al., 2001]. A group brain activation map was then produced for the contrast successful stop–go trials and hypothesis testing was carried out at the cluster level. The detection of activated voxels is extended from voxel to cluster level using a two‐pass method [Bullmore et al., 1999]. We first used a voxel‐level threshold of p < 0.05 to give maximum sensitivity and to avoid type II errors. 3D clusters were then built by joining together adjacent significant voxels. Cluster mass (rather than a cluster extent) threshold was used as a second‐pass cluster statistic, to minimize discrimination against possible small, strongly responding foci of activation [Bullmore et al., 1999]. The cluster‐level threshold was then computed in such a way as to ensure that the final expected number of type I error cluster was less than one per whole brain. For univariate between‐group comparisons an ANOVA was carried out comparing controls with ADHD adolescents.

Pattern Recognition Analysis

GPCs were used to classify ADHD patients from controls on the basis of whole‐brain individual beta maps/GLM coefficients for the Stop task obtained from the univariate analysis, i.e., for the contrast of successful Stop – Go trials. GPC models are Bayesian extensions of logistic regression that aim to learn statistical properties of a set of “training” data that enables accurate prediction of the label of unseen (“test”) data using the rules of probability. For theoretical background and implementation details see [Marquand et al., 2010; Rasmussen, 2006]. The main advantage GPCs hold over alternative methods such as support vector machines is that they provide probabilistic class predictions, thereby accurately quantifying the predictive confidence assigned to each data point. In this work, the GPC modelled the probability of each scan being assigned to the ADHD class (equivalent to 1 minus the probability of each scan being assigned to the control class). Classifier performance was evaluated using: (i) a receiver operating characteristic (ROC) curve, which plots the classifier's true positive rate (sensitivity) against its false positive rate (1‐specificity) as the decision threshold is varied, (ii) the area under the ROC curve (AUC), which summarises classifier performance across all decision thresholds, and (iii) classification accuracy, which describes the proportion of correct predictions based at a particular decision threshold. The GPC classifier was trained with leave‐one‐out cross‐validation using the PROBID software package (http://www.brainmap.co.uk/PROBID). For each cross‐validation iteration, the data were partitioned into training and test sets, excluding a different participant from each group each time. This process was repeated leaving each participant out once, allowing the sensitivity and specificity to be computed respectively as the number of ADHD and control examples correctly classified over all trials. Statistical significance of the classifier was determined by permutation testing, as described previously [Marquand et al., 2010; Rasmussen, 2006]. Briefly, permutation testing was performed by repeatedly retraining the classifier after permuting the class labels (1000 permutations). A P‐value was computed by counting the number of permutations for which the permuted accuracy was equal or greater than the true (nonpermuted) accuracy, then dividing by 1000. The classifier was trained on a set of whole brain images with only nonbrain tissue masked out. At the given image resolution, this resulted in 21,658 features. To examine the discriminative value of different brain regions an unthresholded GPC weight map which shows the relative contribution of each voxel to the classifier decision was generated. In addition, two‐tailed Pearson correlation analyses were carried out between GPC predictive probability and ADHD symptom severity using the CPRS ADHD T‐scores and SDQ hyperactive‐impulsive/inattention scores.

RESULTS

Task Performance

The probability of inhibition (PI) did not differ between groups showing that the tracking algorithm worked as expected (t = 0.8; df = 58; P = 0.4). A multivariate ANOVA between controls and ADHD patients showed a significant group effect (F (df = 6, 53) = 3, P < 0.01), due to significantly increased intra‐subject variance to go trials (P < 0.02), and premature responses to go trials (defined as responses made 200 ms before stimulus onset) (P < 0.05) in ADHD relative to control boys. However, groups did not differ in the inhibitory measure of the task, the SSRT (Table 2).
Table 2

Performance data for 30 boys with ADHD and 30 healthy control boys

Performance variableControls mean (SD)ADHD mean (SD)
PI (%)50.7 (2.8)50.1 (3.4)
SSRT (ms)168.0 (102.4)131.2 (93.3)
SSD (ms)427.4 (139.1)472.2 (81.0)
MRT go trials (ms)586.8 (114.6)587.5 (96.6)
SD go trials (ms)161.6 (50.6)189.3 (36.0)
Premature responses (%)1.2 (1.7)3.2 (5.2)

PI, probability of inhibition; SSRT, stop signal reaction time; SSD, average stop signal delay, i.e. time between go and stop signal; MRT, mean reaction time; SD, intra‐subject standard deviation.

Performance data for 30 boys with ADHD and 30 healthy control boys PI, probability of inhibition; SSRT, stop signal reaction time; SSD, average stop signal delay, i.e. time between go and stop signal; MRT, mean reaction time; SD, intra‐subject standard deviation.

Brain Activation

Motion

MANOVAs showed no significant group effects in the three dimensional extent of maximum rotation and translation movement parameters for the x, y, and z axes (F(3,56)=2.6; P = n.s.) or in the three‐dimensional Euclidean space (F (1,59) = 0.02, P = n.s.).

Gaussian process analysis

The ROC curve showed that GPC discriminated between ADHD and control subjects above chance across all decision thresholds (Fig. 1A). Accordingly, the AUC for this classifier was 0.81, well above the 0.50 level that would be predicted by chance. At the default decision threshold (i.e., defined by thresholding the probabilistic predictions at 0.5, where a value > 0.5 was deemed to be an ADHD patient), diagnostic classification of patients reached a sensitivity of 90% and specificity of 63%, leading to a diagnostic accuracy of 77% (P < 0.001) (Fig. 1B). The positive predictive value (PPV) was 71.05% and the negative predictive value (NPP) was 86.36%. Notably, only 3 ADHD patients were misclassified as healthy controls. The probabilistic predictions derived from the classifier are shown in Figure 1B along with the decision threshold.
Figure 1

A: Receiver operating characteristic (ROC) curve. Each point on the ROC curve describes the performance of the classifier at a particular decision threshold. Points on the lower left‐hand side of the ROC graph are “conservative,” requiring strong evidence for a positive classification and therefore have low true and false positive rates (TPR/FPR). Points on the upper right‐hand side of the graph are “liberal,” making positive classifications with weak evidence and therefore have a high TPR but also a high FPR. A classifier is more accurate than another if it is closer to the point (1,0), which reflects perfect classification. Chance level is indicated by the dotted diagonal line. B: Classification accuracies for GPC predictions for classifying individuals into either the ADHD or the control group. The x‐axis describes the probability with which each subject is predicted to be an ADHD patient (equal to 1‐ the probability of being a control). The dotted line indicates the decision threshold.

A: Receiver operating characteristic (ROC) curve. Each point on the ROC curve describes the performance of the classifier at a particular decision threshold. Points on the lower left‐hand side of the ROC graph are “conservative,” requiring strong evidence for a positive classification and therefore have low true and false positive rates (TPR/FPR). Points on the upper right‐hand side of the graph are “liberal,” making positive classifications with weak evidence and therefore have a high TPR but also a high FPR. A classifier is more accurate than another if it is closer to the point (1,0), which reflects perfect classification. Chance level is indicated by the dotted diagonal line. B: Classification accuracies for GPC predictions for classifying individuals into either the ADHD or the control group. The x‐axis describes the probability with which each subject is predicted to be an ADHD patient (equal to 1‐ the probability of being a control). The dotted line indicates the decision threshold. The discriminating global activation pattern was mapped for the Stop task unthresholded (Fig. 2A). Although the discriminating activation pattern is multivariate and therefore encompasses the whole brain, the regions with the highest weight vector coefficients contributing to the delineation between ADHD and healthy boys corresponded to a distributed network of brain regions involved in response inhibition including VLPFC, dorsolateral prefrontal cortex (DLPFC), basal ganglia, thalamus, cingulate, cerebellum and parietal, temporal, and occipital cortices. This discriminating pattern included clusters with magnitude (positive) weights predictive of healthy controls in bilateral VLPFC, DLPFC, and rostromedial frontal cortex, as well as ACC, SMA, and premotor cortices, the basal ganglia (caudate, putamen, and globus pallidus), thalamus, inferior parietal lobes, predominantly right superior temporal areas, posterior cingulate cortex (PCC) and precuneus, parahippocampal gyri, right hippocampus, superior lateral cerebellar regions and cerebellar vermis and some medial and lateral occipital areas. The discriminating patterns with high magnitude (negative) weights predictive of ADHD patients included predominantly ventromedial frontal cortex and ACC, with small areas in DLPFC, left precentral gyrus, anterior and posterior insula, thalamus, predominantly left superior temporal areas, in mid‐cingulate, precuneus, bilateral amygdala, hippocampus, inferior caudate tail, occipital areas, and more inferior cerebellar regions than the ones predictive of controls.
Figure 2

A: Unthresholded GPC weights overlaid on an anatomical template. The color code shows the relative weight of each voxel for the decision boundary (red/yellow scales: higher weights for ADHD boys and blue scales: higher weights for healthy control boys). B: The univariate ANOVA group comparison map at P < 0.05 for voxel and P < 0.01 for clusters, showing brain areas that are decreased in activation in ADHD relative to healthy control boys in blue and brain areas that are increased in activation in ADHD relative to healthy control boys in red/yellow. [Color figure can be viewed in the online issue, which is available at http://wileyonlinelibrary.com.]

A: Unthresholded GPC weights overlaid on an anatomical template. The color code shows the relative weight of each voxel for the decision boundary (red/yellow scales: higher weights for ADHD boys and blue scales: higher weights for healthy control boys). B: The univariate ANOVA group comparison map at P < 0.05 for voxel and P < 0.01 for clusters, showing brain areas that are decreased in activation in ADHD relative to healthy control boys in blue and brain areas that are increased in activation in ADHD relative to healthy control boys in red/yellow. [Color figure can be viewed in the online issue, which is available at http://wileyonlinelibrary.com.] Pearson correlation analyses between GPC predictive probabilities and ADHD symptom severity across all subjects were significant for both CPRS‐R ADHD t‐scores (r = 0.4; P < 0.001) and SDQ hyperactivity scores (r = 0.4; P < 0.005). To test whether IQ impacted upon the GPC predictive probabilities, we correlated betwen the two measures within each group and across all participants. We found no significant correlations with IQ within each group. However, there was a significant negative correlation between GPC predictive values and IQ across all participants (r= −0.3; P < 0.014). Given that a predictive value of > 0.5 was deemed to be classifying ADHD patients, while a value of < 0.5 was deemed to classify controls (see Fig. 1B), the findings suggest that the higher the predictive value for ADHD, the lower the IQ which is in line with the finding of lower IQ values in ADHD relative to controls. There was no correlation across all participants between GPC predictive values and the extent of maximum rotation and translation movement parameters for x, y, and z axes in the three‐dimensional Euclidian space. Given that the brain function patterns that classified ADHD patients were in earlier developing regions, we tested whether the GPC classifier was correlated with age across all subjects. In fact there was a trend for a significant negative correlation with age (r = −0.25; P < 0.057). For comparative purposes, to test whether the imaging‐based GPC classification was comparable to a classification based on the clinical behavioral scores, we also conducted a discriminant analysis from group sizes using leave‐one‐out classification and applying cross‐validation to test for sensitivity and specificity of the clinical behavioral scores (i.e. the CPRS and the SDQ for hyperactive/impulsive and inattentive behaviors). The sensitivity (classifying patients) and specificity (classifying controls) were relatively high for both measures: for the CPRS, the sensitivity was 96.3%, the specificity was 100%, and the overall accuracy was 98%; for the SDQ, the sensitivity was 93.3%, the specificity was 100% and the overall accuracy was 96.7%. However, given that groups were selected based on these clinical questionnaires, higher accuracy was to be expected than from the imaging‐based GPC measures.

Univariate analysis results

ANOVA showed that ADHD relative to control boys had reduced activation in two large bilateral clusters in VLPFC reaching subcortically into insula, head of caudate and putamen and in the right hemisphere into superior and middle temporal lobes (Table 3, Fig. 2B). In addition, they had reduced activation in a cluster comprising globus pallidus, nucleus accumbens, putamen, and caudate, in a cluster comprising left superior/middle/inferior temporal and inferior parietal lobe, in a cluster in right fusiform gyrus/inferior temporal lobe and in bilateral precuneus and posterior cingulate. ADHD boys relative to controls had increased activation in a cluster in left cerebellum which extended into bilateral lingual gyrus, cuneus, and PCC (Table 3, Fig. 2B).
Table 3

Results of the univariate analysis, showing brain regions that are (a) underactivated in ADHD boys relative to healthy controls and (b) overactivated in ADHD boys relative to controls

Cluster SizeTalairach coordinates P valueBrodman Area (BA)Brain region
x y z
(a) Controls > ADHD
1752825−130.00183347/11/45/38/21R VLPFC/superior & middle temporal lobe/insula/caudate head/putamen
164−3625−90.00155847/11/45L VLPFC/insula/caudate head/putamen
75−314−190.009029/B globus pallidus/nucleus accumbens/ putamen/caudate
106−43−4060.00251222/21/39/41/40L superior/middle/inferior temporal/ inferior parietal lobe
13139−59−130.00148720/37R fusiform gyrus/inferior temporal lobe
81−50−44260.00650640L inferior parietal lobe
91−7−74390.0070727/31L precuneus/posterior cingulate
12221−59260.0021227/31R precuneus/posterior cingulate
(b) ADHD > Controls
615−25−66−60.00009418/19/30L Cerebellum, lingual gyrus, posterior cingulate
Results of the univariate analysis, showing brain regions that are (a) underactivated in ADHD boys relative to healthy controls and (b) overactivated in ADHD boys relative to controls To test whether VLPFC areas that differed between groups were correlated with stop signal reaction time (SSRT), we extracted the BOLD response in these areas and correlated these with SSRT. In controls, the activation in right VLPFC cluster was significantly negatively correlated with SSRT (r = −0.4, P < 0.02). To test whether the cerebellar activation was a compensation for the reduced fronto‐striatal activation, as previously observed during attention tasks [Christakou et al., 2013; Cubillo et al., 2012; Rubia et al., 2009], we tested for correlations within ADHD patients between this cluster and left and right VLPFC and the basal ganglia clusters. The (increased) cerebellar activation cluster in fact correlated negatively with the (decreased) activation cluster in the basal ganglia (r = −0.4, P < 0.03). To test whether IQ had an impact on the group difference findings, statistical BOLD response in each cluster that differed between groups was extracted for each subject and correlated with IQ in each group. No correlations with IQ were observed.

DISCUSSION

We demonstrate that it is possible to classify individual ADHD patients based on their functional neuroanatomy pattern of motor response inhibition, at an accuracy of 77%, with 90% sensitivity to correctly identify ADHD boys and 63% specificity to classify controls. Furthermore, the component regions of the discriminative pattern most predictive of controls were in later developing typical inhibition areas of lateral VLPFC and DLPFC, SMA, ACC, striatal, temporo‐parietal and superior cerebellar regions, whilst regions most predictive of ADHD were in earlier developing mostly ventromedial fronto‐limbic areas such as ventromedial frontal cortex, ACC, insula, amygdala, hippocampus, and inferior cerebellum. In addition, the GPC predictive probabilities for ADHD patients were correlated with ADHD severity scores, reinforcing their diagnostic utility. The univariate group difference analysis showed that ADHD patients relative to controls had reduced activation in bilateral VLPFC, caudate and temporo‐parietal regions, which mostly overlapped with the areas that were predictive of classifying controls, suggesting that these areas that classify controls are dysfunctional areas in ADHD. The overall classification accuracy findings of 77% are promising and, if replicated, suggest that it may be possible in the future to use machine learning based pattern recognition analyses to aid in the diagnostic classification of ADHD with a more objective and reliable measure such as a short task‐based fMRI scan. The classification accuracy of 77% observed in this study is larger than that obtained in other machine learning studies of resting state functional and anatomical data in ADHD [Cheng et al., 2012; Dai et al., 2012, Colby et al., 2012; Eloyan et al., 2012]. They demonstrate for the first time that individuals with ADHD can be differentiated from controls on the basis of their discriminative task‐based neurofunctional activation patterns during inhibitory performance. As GPC is a multivariate technique and considers inter‐regional correlations, discriminative brain networks should be interpreted cautiously as a spatially distributed pattern rather than permitting inference on the constituent regions. GPC identified a distributed network predictive of controls in later developing lateral and medial fronto‐striatal and parieto‐temporal regions that are crucial for motor response inhibition [Aron and Poldrack, 2006; Cai et al., 2012; Chambers et al., 2006, 2009; Juan and Muggleton, 2012; Rubia et al., 2003, 2007b, 2013]. Structurally [Sowell et al., 1999, 2004] and functionally during inhibition tasks (Adleman et al., 2002; Bunge et al., 2002; Rubia et al., 2000; Rubia et al., 2007b; Rubia et al., 2006, Rubia et al., 2013) [for review see (Rubia, 2013)], these lateral prefrontal, striatal and parietal brain regions develop later than the ventromedial prefrontal, limbic (i.e., hippocampus, amygdala) and paralimbic areas (insula) that were predictive of ADHD patients. Our finding of high magnitude weights predictive of controls in later developing lateral fronto‐striato‐parietal regions and for ADHD in earlier developing ventromedial fronto‐limbic regions hence suggest that the ADHD discrimination networks are reflective of more immature activation patterns, while the control discrimination patterns are reflective of a more mature activation pattern for Stop task performance. This was also confirmed in the trend‐wise correlation between age and the GPC probabilistic classification pattern. This would be in line with the notion that ADHD is a disorder of developmental delay, as demonstrated in univariate structural longitudinal analyses that showed a 2–5 year delay in ADHD patients in the maturation of cortical thickness and surface morphology, most prominently in DLPFC and VLPFC, superior temporal and inferior parietal brain regions [Rubia, 2007; Shaw et al., 2007, 2012]. The significant correlation between the GPC predictive probabilities and ADHD symptom severity suggests that the most severely impaired ADHD patients are more confidently predicted by their immature fronto‐limbic activation patterns than the less severely impaired ones, which further reinforces the potential diagnostic value of GPC pattern recognition analyses combined with fMRI for ADHD. The univariate ANOVA showed that ADHD boys had a significant reduction in the activation of left and right VLPFC, reaching into caudate and superior temporal lobes, as well as in parieto‐temporal regions, replicating previous evidence for consistent fronto‐striatal as well as temporo‐parietal underactivations in ADHD patients during Stop task performance [Cubillo et al., 2010, 2013; Passarotti et al., 2010; Pliszka et al., 2006; Rubia et al., 1999, 2005, 2011b, 2008, 2010], as also confirmed in a recent meta‐analysis [Hart et al., 2013]. While our findings replicate previous work, this study is, to our knowledge, the largest fMRI ADHD dataset on the Stop task, with previous fMRI studies including less than 20 ADHD patients and controls [Cubillo et al., 2010; 2013; Passarotti et al., 2010; Pliszka et al., 2006; Rubia et al., 1999, 2005, 2011b, 2010, 2008]. Interestingly, the key region for inhibitory control, the right VLPFC [Aron et al., 2003; Aron and Poldrack, 2006; Chambers et al., 2006; Rubia et al., 2001, 2003, 2007b, 2013], that was reduced in ADHD patients relative to controls, was also correlated with the main inhibitory task variable, the SSRT, in controls, even though patients were not impaired in this measure. Patients, however, had a more premature and variable response to go trials. The lack of inhibitory performance deficits despite brain function deficits replicates previous fMRI findings in patients with ADHD [Cubillo et al., 2010, 2013; Pliszka et al., 2006; Rubia et al., 1999, 2005, 2011b, 2008] and is likely related to the fact that fMRI stop tasks generally have less behavioral sensitivity than neuropsychological task versions due to the target separation and hence larger predictability of stop trials. Also, neuropsychological studies are typically conducted in children, while we studied an older age group of adolescescents who may have grown out of their performance deficits. It is also possible that the enhanced inferior cerebellar and visual cortex activation in ADHD patients relative to controls may have compensated for the reduced fronto‐striatal‐activations, sparing inhibitory performance deficits. This hypothesis was supported by the negative correlation between cerebellar and striatal activation in ADHD. Multivariate pattern recognition analyses reveal discriminating activation patterns without necessarily defining the direction of activation. However, the coanalysis of univariate analyses in this study showed that several of the brain areas that were reduced in activation in ADHD relative to control boys in the univariate analyses, in particular left and right VLPFC, the basal ganglia and inferior parietal lobes, overlapped with regions that showed higher weights for discriminating the control group from the ADHD group in the multivariate analysis (see Fig. 2A,B), suggesting that they reflect brain areas that are dysfunctional (i.e., underactivated) in ADHD children. The fMRI stop task‐based classifier was significantly better than inhibitory performance to discriminate between groups, given that performance did not differ between patients and controls. However, while the task‐based fMRI classification accuracy of 77% was relatively higher than previous attempts to use MVPRA combined with resting state fMRI data, which achieved only a maximum classification accuracy of 61% for the best classification [Eloyan et al., 2012], and was comparable to our accuracy of 79% using GPC analyses combined with brain structure data [Lim et al., 2013], the classification accuracy is nevertheless not high enough to be used in isolation, even if replicated. The discriminant analysis based on clinical measures was substantially higher, in particular for specificity, which was 100% relative to only 77% for the fMRI based GPC classification, while the sensitivity measures for the ADHD classification were relatively comparable, with 93.3% and 96.3% for the clinical measures, and 90% for the fMRI‐ based GPC analysis. However, given that disorders were selected based on these clinical measures, the higher classification accuracy based on these measures was to be expected as clinical diagnostic measures were used as labels for the fMRI‐based classifier. Studies that tested for the sensitivity of classification of ADHD children with clinical measures based on Diagnostic and Statistical Manual (DSM)‐IV criteria have shown that this is between 70–90% [Weiler et al., 2000], thus misdiagnoses are around 10–30%. Task‐based fMRI classification, if replicated across centers and scanners, may therefore have its use as an adjunct to clinical diagnostic measures, in particular for difficult to diagnose cases. However, for its potential use as aid for clinical diagnosis, future studies will have to show disorder‐specificity of fMRI based individual classifications relative to other childhood disorders, as differential diagnosis is typically more challenging than differentiation from controls. For structural MRI classifications we have shown relatively high disorder‐specific accuracy relative to children with autism of over 80% [Lim et al., 2013]. A strength of the study is the relatively large sample, in particular for the univariate analysis. A limitation is the potential confound of IQ differences between cases and controls. However, GPC probabilistic predictors did not correlate with IQ in either group and hence are unlikely to have affected the findings. The univariate analysis results did not correlate with IQ in either group. Furthermore, Stop task performance has been shown to be independent of IQ [Friedman et al., 2006]. IQ correlated with GPC predictive values across both groups, which is, however to be expected, given that both groups differed in IQ and in predictive GPC values (the latter, by definition). Another limitation is that we only included males and the combined hyperactive‐impulsive/inattentive subtype of ADHD which limits the generalizability of the diagnostic classification patterns. Also, future diagnostic utility of pattern classification analyses will rely on its ability to classify ADHD subgroups as to determine disorder‐specificity of classification patterns. In summary, to our knowledge this is the first study to use multivariate machine‐learning techniques to predict ADHD diagnosis based on task‐based fMRI imaging data showing a classification accuracy of 77%. Whilst classification algorithms are not meant as a substitute for clinical assessment and diagnosis, they may be a useful objective, automated, and reliable complementary diagnostic tool that could reduce variability in clinical practice and, ultimately, may help to improve diagnostic accuracy or to revise clinical diagnosis through biomarker classification of uncertain diagnostic cases.
  64 in total

Review 1.  Misunderstanding analysis of covariance.

Authors:  G A Miller; J P Chapman
Journal:  J Abnorm Psychol       Date:  2001-02

2.  Functional frontalisation with age: mapping neurodevelopmental trajectories with fMRI.

Authors:  K Rubia; S Overmeyer; E Taylor; M Brammer; S C Williams; A Simmons; C Andrew; E T Bullmore
Journal:  Neurosci Biobehav Rev       Date:  2000-01       Impact factor: 8.989

3.  A longitudinal twin study on IQ, executive functioning, and attention problems during childhood and early adolescence.

Authors:  Tinca J C Polderman; M Florencia Gosso; Danielle Posthuma; Toos C E M Van Beijsterveldt; Peter Heutink; Frank C Verhulst; Dorret I Boomsma
Journal:  Acta Neurol Belg       Date:  2006-12       Impact factor: 2.396

4.  Neuro-anatomic evidence for the maturational delay hypothesis of ADHD.

Authors:  Katya Rubia
Journal:  Proc Natl Acad Sci U S A       Date:  2007-12-06       Impact factor: 11.205

5.  A review of fronto-striatal and fronto-cortical brain abnormalities in children and adults with Attention Deficit Hyperactivity Disorder (ADHD) and new evidence for dysfunction in adults with ADHD during motivation and attention.

Authors:  Ana Cubillo; Rozmin Halari; Anna Smith; Eric Taylor; Katya Rubia
Journal:  Cortex       Date:  2011-04-27       Impact factor: 4.027

6.  Hypofrontality in attention deficit hyperactivity disorder during higher-order motor control: a study with functional MRI.

Authors:  K Rubia; S Overmeyer; E Taylor; M Brammer; S C Williams; A Simmons; E T Bullmore
Journal:  Am J Psychiatry       Date:  1999-06       Impact factor: 18.112

7.  Attention-deficit/hyperactivity disorder is characterized by a delay in cortical maturation.

Authors:  P Shaw; K Eckstrand; W Sharp; J Blumenthal; J P Lerch; D Greenstein; L Clasen; A Evans; J Giedd; J L Rapoport
Journal:  Proc Natl Acad Sci U S A       Date:  2007-11-16       Impact factor: 11.205

8.  Dissociated functional brain abnormalities of inhibition in boys with pure conduct disorder and in boys with pure attention deficit hyperactivity disorder.

Authors:  Katya Rubia; Rozmin Halari; Anna B Smith; Majeed Mohammed; Steven Scott; Vincent Giampietro; Eric Taylor; Michael J Brammer
Journal:  Am J Psychiatry       Date:  2008-04-15       Impact factor: 18.112

9.  Executive "brake failure" following deactivation of human frontal lobe.

Authors:  Christopher D Chambers; Mark A Bellgrove; Mark G Stokes; Tracy R Henderson; Hugh Garavan; Ian H Robertson; Adam P Morris; Jason B Mattingley
Journal:  J Cogn Neurosci       Date:  2006-03       Impact factor: 3.225

10.  Automated diagnoses of attention deficit hyperactive disorder using magnetic resonance imaging.

Authors:  Ani Eloyan; John Muschelli; Mary Beth Nebel; Han Liu; Fang Han; Tuo Zhao; Anita D Barber; Suresh Joel; James J Pekar; Stewart H Mostofsky; Brian Caffo
Journal:  Front Syst Neurosci       Date:  2012-08-30
View more
  43 in total

1.  DAT1 Gene Methylation as an Epigenetic Biomarker in Attention Deficit Hyperactivity Disorder: A Commentary.

Authors:  Erika Tonelli; Esterina Pascale; Miriam Troianiello; Claudio D'Addario; Walter Adriani
Journal:  Front Genet       Date:  2020-05-11       Impact factor: 4.599

Review 2.  Machine learning studies on major brain diseases: 5-year trends of 2014-2018.

Authors:  Koji Sakai; Kei Yamada
Journal:  Jpn J Radiol       Date:  2018-11-29       Impact factor: 2.374

3.  Differential Recruitment of Brain Regions During Response Inhibition in Children Prenatally Exposed to Alcohol.

Authors:  Vikas N Kodali; Joseph L Jacobson; Nadine M Lindinger; Neil C Dodge; Christopher D Molteno; Ernesta M Meintjes; Sandra W Jacobson
Journal:  Alcohol Clin Exp Res       Date:  2017-01-11       Impact factor: 3.455

4.  Neurobiological measures to classify ADHD: a critical appraisal.

Authors:  Nanda Rommelse; Patrick de Zeeuw
Journal:  Eur Child Adolesc Psychiatry       Date:  2014-05       Impact factor: 4.785

5.  Classification Accuracy of Neuroimaging Biomarkers in Attention-Deficit/Hyperactivity Disorder: Effects of Sample Size and Circular Analysis.

Authors:  Alfredo A Pulini; Wesley T Kerr; Sandra K Loo; Agatha Lenartowicz
Journal:  Biol Psychiatry Cogn Neurosci Neuroimaging       Date:  2018-06-27

6.  Cognitive Modeling Informs Interpretation of Go/No-Go Task-Related Neural Activations and Their Links to Externalizing Psychopathology.

Authors:  Alexander Weigard; Mary Soules; Bailey Ferris; Robert A Zucker; Chandra Sripada; Mary Heitzeg
Journal:  Biol Psychiatry Cogn Neurosci Neuroimaging       Date:  2019-12-10

Review 7.  The Mechanism of Cortico-Striato-Thalamo-Cortical Neurocircuitry in Response Inhibition and Emotional Responding in Attention Deficit Hyperactivity Disorder with Comorbid Disruptive Behavior Disorder.

Authors:  Yuncheng Zhu; Xixi Jiang; Weidong Ji
Journal:  Neurosci Bull       Date:  2018-03-05       Impact factor: 5.203

8.  Classification of cocaine-dependent participants with dynamic functional connectivity from functional magnetic resonance imaging data.

Authors:  Unal Sakoglu; Mutlu Mete; John Esquivel; Katya Rubia; Richard Briggs; Bryon Adinoff
Journal:  J Neurosci Res       Date:  2019-04-07       Impact factor: 4.164

Review 9.  Psychoradiology: The Frontier of Neuroimaging in Psychiatry.

Authors:  Su Lui; Xiaohong Joe Zhou; John A Sweeney; Qiyong Gong
Journal:  Radiology       Date:  2016-11       Impact factor: 11.105

10.  Aberrant Cross-Brain Network Interaction in Children With Attention-Deficit/Hyperactivity Disorder and Its Relation to Attention Deficits: A Multisite and Cross-Site Replication Study.

Authors:  Weidong Cai; Tianwen Chen; Luca Szegletes; Kaustubh Supekar; Vinod Menon
Journal:  Biol Psychiatry       Date:  2015-11-02       Impact factor: 13.382

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.