| Literature DB >> 34620827 |
Jeremiah B Joyce1, Caroline W Grant2, Duan Liu2, Siamak MahmoudianDehkordi3, Rima Kaddurah-Daouk3, Michelle Skime1, Joanna Biernacka4, Mark A Frye1, Taryn Mayes5, Thomas Carmody6, Paul E Croarkin1, Liewei Wang2, Richard Weinshilboum2, William V Bobo7, Madhukar H Trivedi5, Arjun P Athreya8.
Abstract
Combination antidepressant pharmacotherapies are frequently used to treat major depressive disorder (MDD). However, there is no evidence that machine learning approaches combining multi-omics measures (e.g., genomics and plasma metabolomics) can achieve clinically meaningful predictions of outcomes to combination pharmacotherapy. This study examined data from 264 MDD outpatients treated with citalopram or escitalopram in the Mayo Clinic Pharmacogenomics Research Network Antidepressant Medication Pharmacogenomic Study (PGRN-AMPS) and 111 MDD outpatients treated with combination pharmacotherapies in the Combined Medications to Enhance Outcomes of Antidepressant Therapy (CO-MED) study to predict response to combination antidepressant therapies. To assess whether metabolomics with functionally validated single-nucleotide polymorphisms (SNPs) improves predictability over metabolomics alone, models were trained/tested with and without SNPs. Models trained with PGRN-AMPS' and CO-MED's escitalopram/citalopram patients predicted response in CO-MED's combination pharmacotherapy patients with accuracies of 76.6% (p < 0.01; AUC: 0.85) without and 77.5% (p < 0.01; AUC: 0.86) with SNPs. Then, models trained solely with PGRN-AMPS' escitalopram/citalopram patients predicted response in CO-MED's combination pharmacotherapy patients with accuracies of 75.3% (p < 0.05; AUC: 0.84) without and 77.5% (p < 0.01; AUC: 0.86) with SNPs, demonstrating cross-trial replication of predictions. Plasma hydroxylated sphingomyelins were prominent predictors of treatment outcomes. To explore the relationship between SNPs and hydroxylated sphingomyelins, we conducted multi-omics integration network analysis. Sphingomyelins clustered with SNPs and metabolites related to monoamine neurotransmission, suggesting a potential functional relationship. These results suggest that integrating specific metabolites and SNPs achieves accurate predictions of treatment response across classes of antidepressants. Finally, these results motivate functional investigation into how sphingomyelins might influence MDD pathophysiology, antidepressant response, or both.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34620827 PMCID: PMC8497535 DOI: 10.1038/s41398-021-01632-z
Source DB: PubMed Journal: Transl Psychiatry ISSN: 2158-3188 Impact factor: 7.989
Fig. 1Conceptual overview of model development and evaluation.
PGRN-AMPS (Pharmacogenomic Research Network Antidepressant Medication Pharmacogenomic Study) and CO-MED (Combining Medications to Enhance Outcomes of Antidepressant Therapy) participants were partitioned into training/testing groups based upon treatment allocation.
Clinical and sociodemographic features of training and testing cohorts.
| Training and testing cohort | ||||
|---|---|---|---|---|
| Training Set | Testing Set | Training Set | Testing Set | |
| PGRN-AMPS patients ( | 264 | 0 | 245 | 0 |
| CO-MED patients ( | 34 | 77 | 32 | 71 |
| Sex [% female] | 66.1% | 71.4% | 65.3% | 70.4% |
| Age [mean (SD)] | 40.6 (13.3)* | 43.3 (11.5)* | 41.0 (13.3) | 43.4 (11.6) |
| Years of education [mean (SD)] | 14.7 (2.5)* | 13.9 (2.4)* | 14.8 (2.5)* | 13.9 (2.4)* |
| Race [% White] | 93.6%* | 77.9%* | 96.0%* | 77.5%* |
| Race [% Black or African American] | 3.4%* | 16.9%* | 2.2%* | 16.9%* |
| Race [% Other] | 3.0% | 5.2% | 1.8% | 5.6% |
| Ethnicity [% Hispanic] | 2.0%†* | 20.8%* | 1.4%†* | 21.1%* |
| Depression onset < age 18 years [%] | 43.6% | 39.0% | 41.5% | 39.4% |
| Prior suicide attempts [ | 46 (15.4%) | 7 (9.1%) | 40 (16.3%) | 6 (8.4%) |
| Antidepressants ( | Citalopram (112) Escitalopram (152) Escitalopram + Placebo (34) | Venlafaxine + Mirtazapine (42) Escitalopram + Bupropion (35) | Citalopram (99) Escitalopram (146) Escitalopram + Placebo (32) | Venlafaxine + Mirtazapine (38) Escitalopram + Bupropion (33) |
| QIDS-C at baseline [mean (SD)] | 15.1 (3.4) | 15.5 (3.8) | 15.1 (3.3) | 15.3 (3.7) |
| QIDS-C response at week 4 | 47.0% | 40.2% | 47.7% | 42.2% |
| QIDS-C remission at week 4 | 26.2% | 26.0% | 26.7% | 28.2% |
| QIDS-C response at week 8 | 67.8% | 62.3% | 69.0% | 63.3% |
| QIDS-C remission at week 8 | 47.0% | 42.9% | 48.4% | 43.7% |
*Significantly different (p < 0.05) between training and testing sets according to Mann-Whitney U or chi-square tests. †Ethnicity characterization in PGRN-AMPS is based off data from 205 out of 264 patients, per availability of data.
Fig. 2Comparison of test-set accuracies for metabolomic and multi-omics models, with variable importance.
Dashed line: null information rate (NIR). The NIR represents the response rate of 63.4% at 8 weeks of therapy. This serves as a benchmark to assess the significance of prediction accuracy.
Metrics of prediction performance.
| Prediction model metrics | ||||
|---|---|---|---|---|
| 2A: Models trained with PGRN-AMPS escitalopram, PGRN-AMPS citalopram, and CO-MED escitalopram + placebo patients. | ||||
| Model Set 1 (Metabolomic) | Model Set 2 (Multi-omics) | |||
| XGBoost | Penalized regression | XGBoost | Penalized regression | |
| Testing-set metrics | ||||
| AUC | 0.76 | 0.85 | 0.83 | 0.86 |
| Accuracy | 0.727 | 0.766 | 0.761 | 0.775 |
| NIR | 0.62 | 0.62 | 0.63 | 0.63 |
| | 0.053 | 0.0045 | 0.017 | 0.0067 |
| Sensitivity | 0.75 | 0.69 | 0.69 | 0.71 |
| Specificity | 0.69 | 0.90 | 0.88 | 0.88 |
| Training-set metrics (in cross-validation) | ||||
| AUC | 0.69 | 0.69 | 0.68 | 0.72 |
| Accuracy | 0.64 | 0.66 | 0.67 | 0.65 |
| NIR | 0.68 | 0.68 | 0.69 | 0.69 |
| Testing-set metrics | ||||
| AUC | 0.75 | 0.84 | 0.74 | 0.86 |
| Accuracy | 0.753 | 0.753 | 0.732 | 0.775 |
| | 0.026 | 0.026 | 0.085 | 0.0067 |
| Sensitivity | 0.65 | 0.73 | 0.80 | 0.71 |
| Specificity | 0.93 | 0.79 | 0.62 | 0.88 |
| Training-set metrics (in cross-validation) | ||||
| AUC | 0.68 | 0.68 | 0.72 | 0.72 |
| Accuracy | 0.64 | 0.65 | 0.67 | 0.68 |
| NIR | 0.69 | 0.69 | 0.70 | 0.70 |
For the penalized regression models, cartesian grid search was used to tune penalty and mixture hyperparameters, with 20 evenly spaced penalty values ranging from 1e − 6 to 10 and mixture values of 0, 0.05, 0.2, 0.4, 0.6, 0.8, and 1. For the XGBoost models, we tuned the number of trees, tree depth, minimum number of data points to split a node, learning rate, loss function reduction, and sample size using Bayesian optimization and a stopping criterion of no improvement over 30 iterations.
Fig. 3Multi-omics integration network analysis.
A Multi-omics integration network. Each metabolite is labeled with a number. Metabolite names corresponding to these numbers can be found in Supplementary Table 6. Correlation values between metabolites and SNPs can be found in Supplementary Table 7. B Labeled and enlarged community-2 containing sphingomyelins. Represented genotypes are: TSPAN5 (rs10516436), ERICH3 (rs696692), AHR (rs17137566), DEFB1_1 (rs5743467), DEFB1_2 (rs2741130), DEFB1_3 (rs2702877).