| Literature DB >> 31012492 |
Arjun P Athreya1,2, Drew Neavin2, Tania Carrillo-Roa3, Michelle Skime4, Joanna Biernacka4, Mark A Frye4, A John Rush5,6,7, Liewei Wang2, Elisabeth B Binder3,8, Ravishankar K Iyer1, Richard M Weinshilboum2, William V Bobo9.
Abstract
We set out to determine whether machine learning-based algorithms that included functionally validated pharmacogenomic biomarkers joined with clinical measures could predict selective serotonin reuptake inhibitor (SSRI) remission/response in patients with major depressive disorder (MDD). We studied 1,030 white outpatients with MDD treated with citalopram/escitalopram in the Mayo Clinic Pharmacogenomics Research Network Antidepressant Medication Pharmacogenomic Study (PGRN-AMPS; n = 398), Sequenced Treatment Alternatives to Relieve Depression (STAR*D; n = 467), and International SSRI Pharmacogenomics Consortium (ISPC; n = 165) trials. A genomewide association study for PGRN-AMPS plasma metabolites associated with SSRI response (serotonin) and baseline MDD severity (kynurenine) identified single nucleotide polymorphisms (SNPs) in DEFB1, ERICH3, AHR, and TSPAN5 that we tested as predictors. Supervised machine-learning methods trained using SNPs and total baseline depression scores predicted remission and response at 8 weeks with area under the receiver operating curve (AUC) > 0.7 (P < 0.04) in PGRN-AMPS patients, with comparable prediction accuracies > 69% (P ≤ 0.07) in STAR*D and ISPC. These results demonstrate that machine learning can achieve accurate and, importantly, replicable prediction of SSRI therapy response using total baseline depression severity combined with pharmacogenomic biomarkers.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31012492 PMCID: PMC6739122 DOI: 10.1002/cpt.1482
Source DB: PubMed Journal: Clin Pharmacol Ther ISSN: 0009-9236 Impact factor: 6.875
Figure 1The two‐stage analysis workflow. Our analysis workflow proceeded in two stages. In stage 1, we identified depressive symptom severity clusters in the Mayo Clinic Pharmacogenomics Research Network Antidepressant Medication Pharmacogenomic Study (PGRN‐AMPS) dataset, separately for men and women, using a data‐driven approach (stage 1A); we then validated those clusters using data from Sequenced Treatment Alternatives to Relieve Depression (STAR*D) and International SSRI Pharmacogenomics Consortium (ISPC; stage 1B). Factors that differentiated the validated depressive symptom clusters were identified in stage 1C. In stage 2, predictive models were developed using PGRN‐AMPS data and were externally validated using STAR*D and ISPC data. HDRS, Hamilton Depression Rating Scale; QIDS‐C, Quick Inventory of Depressive Symptomatology; SMOTE, Synthetic Minority Oversampling Technique.
Figure 2Probability density functions (PDFs) of depression severity scores. Baseline Quick Inventory of Depressive Symptomatology (QIDS‐C) symptom severity scores in men (a), and the estimated components of the PDF using an expectation‐maximization algorithm (b). [Colour figure can be viewed at wileyonlinelibrary.com]
Figure 3Depressive symptom–based clusters identified by data‐driven unsupervised learning using Gaussian mixture models. Probability densities of symptom severity in clusters at baseline, 4 weeks, and 8 weeks of the Mayo Clinic Pharmacogenomics Research Network Antidepressant Medication Pharmacogenomic Study trial for both the Quick Inventory of Depressive Symptomatology (QIDS‐C) (a) and Hamilton Depression Rating Scale (HDRS) (b) scales. Probability densities are proportional to the fraction of patients with the associated symptom severity scores.
Prediction performance of random forests using baseline depression severity and functionally validated SNPs of , , , and
| Rating scale | Trial | Training data | Gender | Accuracy (%) | 95% CI in training cross‐validation | NIR |
| Sensitivity | Specificity | PPV | NPV | AUC in training cross‐validation | Top 3 predictors in cross‐validation |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Response | |||||||||||||
| QIDS‐C | PGRN‐AMPS | 10‐fold cross‐validation | Men | 73 | (63, 82) | 0.66 | 0.01 | 0.7 | 0.78 | 0.86 | 0.57 | 0.85 | DEFB1_2, Baseline severity, DEFB1_1 |
| Women | 74 | (65, 83) | 0.63 | 0.0003 | 0.71 | 0.8 | 0.85 | 0.62 | 0.7 | TSPAN5, DEFB1_1, Baseline severity | |||
| STAR*D | PGRN‐AMPS | Men | 69 | NA | 0.68 | 0.06 | 0.67 | 0.72 | 0.82 | 0.51 | NA | NA | |
| Women | 66 | NA | 0.68 | 0.0007 | 0.68 | 0.63 | 0.78 | 0.52 | NA | NA | |||
| HDRS | PGRN‐AMPS | 10‐fold cross‐validation | Men | 86 | (81, 94) | 0.68 | 5.70E‐04 | 0.9 | 0.85 | 0.93 | 0.79 | 0.88 | TSPAN5, DEFB1_1, DEFB1_2 |
| Women | 88 | (78, 93) | 0.7 | 7.30E‐09 | 0.9 | 0.82 | 0.91 | 0.7 | 0.9 | DEFB1_1, Baseline severity, DEFB1_2 | |||
| ISPC | PGRN‐AMPS | Men | 77 | NA | 0.69 | 0.05 | 0.8 | 0.71 | 0.85 | 0.62 | NA | NA | |
| Women | 75 | NA | 0.65 | 0.01 | 0.78 | 0.68 | 0.82 | 0.63 | NA | NA | |||
| Remission | |||||||||||||
| QIDS‐C | PGRN‐AMPS | 10‐fold cross‐validation | Men | 78 | (69, 86) | 0.63 | 5.40E‐08 | 0.81 | 0.75 | 0.84 | 0.69 | 0.86 | Baseline severity, DEFB1_1, DEFB1_2 |
| Women | 69 | (60, 80) | 0.62 | 0.0001 | 0.6 | 0.83 | 0.84 | 0.59 | 0.75 | Baseline severity, DEFB1_2, DEFB1_1 | |||
| STAR*D | PGRN‐AMPS | Men | 75 | NA | 0.55 | 0.008 | 0.79 | 0.69 | 0.75 | 0.72 | NA | NA | |
| Women | 66 | NA | 0.5 | 0.001 | 0.59 | 0.72 | 0.67 | 0.63 | NA | NA | |||
| HDRS | PGRN‐AMPS | 10‐fold cross‐validation | Men | 86 | (75, 90) | 0.55 | 0.0001 | 0.9 | 0.84 | 0.87 | 0.87 | 0.84 | Baseline severity, DEFB1_2, DEFB1_1 |
| Women | 83 | (75, 90) | 0.51 | 0.03 | 0.87 | 0.8 | 0.82 | 0.85 | 0.9 | Baseline severity, DEFB1_2, DEFB1_1 | |||
| ISPC | PGRN‐AMPS | Men | 76 | NA | 0.56 | 0.04 | 0.8 | 0.71 | 0.77 | 0.73 | NA | NA | |
| Women | 74 | NA | 0.52 | 0.07 | 0.76 | 0.72 | 0.74 | 0.73 | NA | NA | |||
Results of 10‐fold internal cross‐validation using PGRN‐AMPS patients’ data are reported in the light‐blue blocks. Results from external validation in STAR*D (QIDS‐C) and ISPC (HDRS) of the prediction model trained using PGRN‐AMPS patients’ data are reported in the light‐orange blocks.
AUC, area under the receiver operating curve; CI, confidence interval; HDRS, Hamilton Depression Rating Scale; ISPC, International SSRI Pharmacogenomics Consortium; NA, not applicable; NIR, null information rate; NPV, negative predictive value; PGRN‐AMPS, Mayo Clinic Pharmacogenomics Research Network Antidepressant Medication Pharmacogenomic Study; PPV, positive predictive value; QIDS‐C, Quick Inventory of Depressive Symptomatology; SNP, single nucleotide polymorphism; STAR*D, Sequenced Treatment Alternatives to Relieve Depression.
Figure 4Importance of variables for predicting clinical outcomes measured using Hamilton Depression Rating Scale (HDRS). SNPs, single nucleotide polymorphisms.