| Literature DB >> 28443027 |
Hin K Wong1, Paul A Tiffin2, Michael J Chappell3, Thomas E Nichols1, Patrick R Welsh4, Orla M Doyle5, Boryana C Lopez-Kolkovska1, Sarah K Inglis6, David Coghill7, Yuan Shen8, Peter Tiño8.
Abstract
Attention-Deficit Hyperactive Disorder (ADHD) is one of the most common mental health disorders amongst school-aged children with an estimated prevalence of 5% in the global population (American Psychiatric Association, 2013). Stimulants, particularly methylphenidate (MPH), are the first-line option in the treatment of ADHD (Reeves and Schweitzer, 2004; Dopheide and Pliszka, 2009) and are prescribed to an increasing number of children and adolescents in the US and the UK every year (Safer et al., 1996; McCarthy et al., 2009), though recent studies suggest that this is tailing off, e.g., Holden et al. (2013). Around 70% of children demonstrate a clinically significant treatment response to stimulant medication (Spencer et al., 1996; Schachter et al., 2001; Swanson et al., 2001; Barbaresi et al., 2006). However, it is unclear which patient characteristics may moderate treatment effectiveness. As such, most existing research has focused on investigating univariate or multivariate correlations between a set of patient characteristics and the treatment outcome, with respect to dosage of one or several types of medication. The results of such studies are often contradictory and inconclusive due to a combination of small sample sizes, low-quality data, or a lack of available information on covariates. In this paper, feature extraction techniques such as latent trait analysis were applied to reduce the dimension of on a large dataset of patient characteristics, including the responses to symptom-based questionnaires, developmental health factors, demographic variables such as age and gender, and socioeconomic factors such as parental income. We introduce a Bayesian modeling approach in a "learning in the model space" framework that combines existing knowledge in the literature on factors that may potentially affect treatment response, with constraints imposed by a treatment response model. The model is personalized such that the variability among subjects is accounted for by a set of subject-specific parameters. For remission classification, this approach compares favorably with conventional methods such as support vector machines and mixed effect models on a range of performance measures. For instance, the proposed approach achieved an area under receiver operator characteristic curve of 82-84%, compared to 75-77% obtained from conventional regression or machine learning ("learning in the data space") methods.Entities:
Keywords: Bayesian inference; attention-deficit hyperactivity disorder; machine learning; methylphenidate; mixed effects model; personalized medicine; prognosis; treatment response
Year: 2017 PMID: 28443027 PMCID: PMC5387107 DOI: 10.3389/fphys.2017.00199
Source DB: PubMed Journal: Front Physiol ISSN: 1664-042X Impact factor: 4.566
Figure 1High level causal factor model of treatment response prediction in ADHD.
Figure 2Reduced causal factor model of treatment response prediction in ADHD.
Figure 3Boxplot of combined equivalent (in IR-MPHunits) daily dosages of medications taken for all patients vs. appointment number. Red horizontal lines: median; boxes: interquartile range; whiskers: 95% confidence intervals; red crosses: outliers.
Figure 4Boxplots of symptom scores across all patients vs. appointment number. (A) Inattentiveness, (B) hyperactivity. Red horizontal lines: median; boxes: interquartile range; whiskers: 95% confidence intervals; red crosses: outliers.
Figure 5Examples of Bayesian linear regression on continuous inattentiveness (INA) and hyperactivity (HYP) symptom scores with the Bayesian linear regression training dataset (. (A) Subject #11, (B) Subject #74.
Figure 6Examples of continuous hyperactivity symptom score prediction with the validation set. (A) Subject #6. (B) Subject #148.
Figure 7Rms prediction (validation) error averaged across all subjects vs. appointment number.
Figure 8Receiver operator characteristic (ROC) plots of inattentiveness prediction using virtual patient profile constructed by Methods 1 and 2 (A,B) in the learning in model space approach; crosses: no critical value adjustment (based on point estimates); squares: best performing critical values on training set; circles: best performing critical values on the validation set; AUC: Area under the ROC curve AIC: appointment-independent classifier; BRC: retrospective Bayesian linear regression classifier; BUC: incremental Bayesian learning/update classifier.
Sensitivity, specificity, accuracy, and AUC of the remission classifier with critical values adjusted with respect to uncertainties in the predicted symptom scores.
| CFM | 93 | 327 | 141 | 501 | 82 | 314 | 121 | 448 | |
| 67 | 660 | 19 | 486 | 57 | 694 | 18 | 560 | ||
| 140 | 482 | 140 | 483 | 125 | 480 | 125 | 483 | ||
| 20 | 505 | 20 | 504 | 14 | 528 | 14 | 525 | ||
| 117 | 338 | 123 | 280 | 97 | 307 | 109 | 253 | ||
| 43 | 649 | 37 | 707 | 42 | 701 | 30 | 755 | ||
| SEN | 58.1% | 88.1% | 59.0% | 87.1% | |||||
| 87.5% | 87.5% | 89.9% | 89.9% | ||||||
| 73.1% | 76.9% | 69.8% | 78.4% | ||||||
| SPC | 66.9% | 49.2% | 68.9% | 55.6% | |||||
| 51.2% | 51.1% | 52.4% | 52.1% | ||||||
| 65.8% | 71.6% | 69.5% | 74.9% | ||||||
| BAC | 62.5% | 68.7% | 63.9% | 71.3% | |||||
| 69.3% | 69.3% | 71.2% | 71.0% | ||||||
| 69.4% | 74.3% | 69.7% | 76.7% | ||||||
| PPV | 22.1% | 22.0% | 20.7% | 21.3% | |||||
| 22.5% | 22.5% | 20.7% | 20.6% | ||||||
| 25.7% | 30.5% | 24.0% | 30.1% | ||||||
| NPV | 90.8% | 96.2% | 92.4% | 96.9% | |||||
| 96.2% | 96.2% | 97.4% | 97.4% | ||||||
| 93.8% | 95.0% | 94.4% | 96.2% | ||||||
| AUC | 69.0% | 72.0% | 68.0% | 73.8% | |||||
| 81.2% | 80.9% | 83.6% | 83.3% | ||||||
| 77.1% | 82.3% | 76.7% | 84.4% | ||||||
CFM: confusion matrix.
SEN: sensitivity.
SPC: specificity.
BAC: balanced accuracy.
PPV: positive predictive value.
NPV: negative predictive value.
AUC: area under ROC curve.
AIC: appointment-independent classifier.
BRC: retrospective Bayesian linear regression classifier.
BUC: incremental Bayesian learning/update classifier.
Shaded values represent best performance amongst the compared methods.
Sensitivity, specificity, accuracy, and AUC of the remission classifier with critical values adjusted with respect to uncertainties in the predicted symptom scores.
| Sensitivity | 70.0% | 70.6% | 67.6% | 66.9% | |
| 68.1% | 67.5% | 67.6% | 66.9% | ||
| 76.9% | 33.8% | 76.9% | 69.1% | ||
| 43.1% | 43.2% | 71.3% | 48.2% | ||
| 60.0% | 58.9% | ||||
| 53.1% | 56.0% | ||||
| Specificity | 61.9% | 62.3% | 67.8% | 66.6% | |
| 67.9% | 69.1% | 67.8% | 71.3% | ||
| 55.9% | 77.6% | 62.1% | 62.1% | ||
| 59.0% | 18.4% | 50.7% | 50.6% | ||
| 73.3% | 73.8% | ||||
| 81.4% | 81.4% | ||||
| Balanced accuracy | 66.0% | 66.5% | 67.7% | 66.7% | |
| 68.0% | 68.3% | 67.7% | 69.1% | ||
| 66.4% | 55.7% | 69.5% | 65.6% | ||
| 51.1% | 44.8% | 46.9% | 49.4% | ||
| 66.6% | 66.4% | ||||
| 67.2% | 70.6% | ||||
| Positive predictive value | 23.0% | 23.3% | 22.4% | 21.6% | |
| 25.6% | 26.2% | 22.4% | 24.3% | ||
| 22.0% | 19.6% | 21.9% | 20.1% | ||
| 15.6% | 12.4% | 10.8% | 11.9% | ||
| 26.7% | 23.7% | ||||
| 31.6% | 29.0% | ||||
| Negative predictive value | 92.7% | 92.9% | 93.8% | 93.9% | |
| 92.9% | 92.9% | 93.8% | 93.4% | ||
| 93.7% | 87.8% | 95.1% | 93.6% | ||
| 86.5% | 79.8% | 86.6% | 87.63% | ||
| 91.8% | 92.9% | ||||
| 91.5% | 94.1% | ||||
| Area under ROC curve | 71% | 69% | 73% | 71% | |
| 75% | 71% | 73% | 70% | ||
| 71% | 60% | 76% | 71% | ||
| 49% | 41% | 46% | 48% | ||
| 74.8% | 77.5% | ||||
| 75.8% | 77.2% | ||||
dsSVC: down-sampled support vector machine classifier;
dsGPC: down-sampled Gaussian processes classifier;
rwSVC: regularization-weighted support vector machine classifier;
rwGPC: regularization-weighted support Gaussian processes classifier;
taMEC: threshold-adjusted mixed effects classifier;
.
Shaded values represent best performance amongst the compared methods.
Figure 9Receiver operator characteristic (ROC) plot of inattentiveness prediction using mixed effects models; crosses: no threshold adjustment (based on point estimates); squares: best performing threshold setting on training set; circles: best performing threshold setting on the validation set; AUC: Area under the ROC curve; .
| 0.98 | 0.84 | 0.97 | 0.85 | |
| 0.82 | 0.82 | 0.84 | 0.84 | |
| 0.99 | 0.73 | 1.01 | 0.75 | |
AIR: appointment-independent Bayesian linear prediction.
BRR: retrospective Bayesian linear regression.
BUR: incremental Bayesian learning/update linear regression.
| 0.73 | 0.74 | 0.76 | 0.81 | |
| 0.72 | 0.77 | 0.76 | 0.84 | |
| 0.82 | 0.83 | |||
SVR: support vector machine regression.
GPR: Gaussian processes regression.
MER: mixed effects regression.