| Literature DB >> 32432711 |
Michael C Hughes1, Melanie F Pradier2, Andrew Slavin Ross2, Thomas H McCoy3,4, Roy H Perlis3,4, Finale Doshi-Velez2.
Abstract
Importance: In the absence of readily assessed and clinically validated predictors of treatment response, pharmacologic management of major depressive disorder often relies on trial and error. Objective: To assess a model using electronic health records to identify predictors of treatment response in patients with major depressive disorder. Design, Setting, and Participants: This retrospective cohort study included data from 81 630 adults with a coded diagnosis of major depressive disorder from 2 academic medical centers in Boston, Massachusetts, including outpatient primary and specialty care clinics from December 1, 1997, to December 31, 2017. Data were analyzed from January 1, 2018, to March 15, 2020. Exposures: Treatment with at least 1 of 11 standard antidepressants. Main Outcomes and Measures: Stable treatment response, intended as a proxy for treatment effectiveness, defined as continued prescription of an antidepressant for 90 days. Supervised topic models were used to extract 10 interpretable covariates from coded clinical data for stability prediction. With use of data from 1 hospital system (site A), generalized linear models and ensembles of decision trees were trained to predict stability outcomes from topic features that summarize patient history. Held-out patients from site A and individuals from a second hospital system (site B) were evaluated.Entities:
Year: 2020 PMID: 32432711 PMCID: PMC7240354 DOI: 10.1001/jamanetworkopen.2020.5308
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Figure 1. Comparison of General and Drug-Specific Stability Prediction for Proposed and Baseline Covariates
Comparison of discriminative ability, as measured by area under the receiver operating characteristic curve (AUC), for general and drug-specific prediction models. A, Models compared use of the proposed 10-dimensional topics covariates with a logistic regression predictor. B, Models compared use of the baseline high-dimensional demographics and words covariates with an ensemble of 512 extremely randomized decision trees. For each of the 11 target antidepressants, an AUC score was obtained for a given model by considering predictions from that model on the subset of the site A test set that included all known outcomes associated with that drug (ignoring data from patients who were never given that drug). To indicate uncertainty in reported AUC values, the evaluation was repeated across 5000 bootstrap samples of each test set and reported error bars indicating 95% CIs for the AUC across these bootstrap samples.
Figure 2. Assessment of Generalization From Site A to Site B for Both General and Drug-Specific Stability Prediction
Side-by-side comparison of discriminative ability on the site A and site B testing sets, as measured by area under the receiver operating characteristic curve (AUC), for general and drug-specific prediction models. A, Models use the proposed 10-dimensional topics covariates with a logistic regression predictor. B, Models use the baseline high-dimensional demographics and words covariates with an ensemble of 512 extremely randomized decision trees. For each of the 11 target antidepressants, an AUC score was obtained for a given model by considering predictions from that model on the subset of the site A test set that included all known outcomes associated with that drug (ignoring data from patients who were never given that drug). To indicate uncertainty in reported AUC values, the evaluation was repeated across 5000 bootstrap samples of each test set and reported error bars indicating 95% CIs for the AUC across these bootstrap samples.
Visualization of Representative Topics From Proposed Supervised Topic Model
| Probability | Type | ID | Word |
|---|---|---|---|
| 0.033 | 99213 | Office visit >15 min | |
| 0.024 | 99214 | Office visit >25 min | |
| 0.016 | 99211 | Office visit >5 min | |
| 0.015 | 08527 | Complete blood count tests | |
| 0.010 | 82565 | Creatinine blood test | |
| 0.010 | 78900 | Abdominal pain | |
| 0.010 | 85025 | Complete blood count tests | |
| 0.010 | 71020 | Radiologic examination of chest | |
| 0.009 | 84520 | Urea nitrogen laboratory test | |
| 0.009 | Prescription | 42347 | Bupropion prescription |
| 0.009 | 311 | Depressive disorder | |
| 0.098 | 97110 | Physical therapy | |
| 0.052 | 7245 | Back ache | |
| 0.037 | 97140 | Manual therapy | |
| 0.033 | Prescription | 7804 | Oxycodone treatment |
| 0.032 | 7242 | Lumbago | |
| 0.019 | Prescription | 214182 | Acetaminophen or hydrocodone |
| 0.018 | 7231 | Cervicalgia | |
| 0.015 | 71941 | Shoulder pain | |
| 0.012 | Prescription | 25480 | Gabapentin |
| 0.012 | 71947 | Ankle or foot pain | |
| 0.012 | 71596 | Osteoarthritis of lower leg | |
| 0.034 | 87591 | Test for gonorrhea | |
| 0.033 | 87491 | Test for chlamydia | |
| 0.020 | 87070 | Bacterial culture | |
| 0.017 | 81025 | Urine pregnancy test | |
| 0.017 | 84702 | hCG test | |
| 0.016 | 87086 | Bacterial culture from urine | |
| 0.015 | V762 | Cervical screening | |
| 0.015 | 462 | Acute pharyngitis | |
| 0.015 | V222 | Incidental pregnancy | |
| 0.015 | 76856 | Pelvic ultrasonography | |
| 0.015 | 6259 | Female genital concern | |
| 0.048 | v761 | Mammogram | |
| 0.038 | Prescription | 10582 | Levothyroxine |
| 0.032 | 2449 | Hypothyroidism | |
| 0.025 | 2724 | Hyperlipidemia | |
| 0.021 | 76092 | Mammogram | |
| 0.020 | v762 | Cervical screening | |
| 0.019 | 6272 | Menopause | |
| 0.017 | 73300 | Osteoporosis | |
| 0.013 | v103 | Breast cancer history | |
| 0.012 | 6961 | Psoriasis | |
| 0.012 | 78079 | Malaise and fatigue | |
| 0.138 | 29630 | Major depressive disorder | |
| 0.125 | 90806 | Psychotherapy | |
| 0.098 | 90862 | Pharmacologic management | |
| 0.034 | 30000 | Anxiety | |
| 0.032 | Prescription | 2598 | Clonazepam |
| 0.023 | 29650 | Bipolar disorder | |
| 0.023 | 30490 | Drug dependency | |
| 0.022 | 90870 | Electroconvulsive therapy | |
| 0.018 | 30981 | Posttraumatic stress disorder | |
| 0.018 | 2967 | Bipolar disorder | |
| 0.018 | 90807 | Psychotherapy | |
| 0.012 | 6961 | Psoriasis | |
| 0.012 | 78079 | Malaise and fatigue | |
| 0.048 | 99213 | Office visit >15 min | |
| 0.037 | 99214 | Office visit >25 min | |
| 0.029 | 99211 | Office visit >5 min | |
| 0.021 | 36415 | Blood samples obtained for laboratory test | |
| 0.016 | 85027 | Complete blood count | |
| 0.016 | v700 | Routine examination | |
| 0.016 | Prescription | 7646 | Omeprazole treatment |
| 0.013 | 80061 | Lipid panel | |
| 0.011 | 90658 | Influenza vaccination | |
| 0.011 | 99215 | Office visit >40 min | |
| 0.011 | 80053 | Metabolomic tests | |
Abbreviations: CPT, Current Procedural Terminology; ICD, International Classification of Diseases; hCG, human chorionic gonadotropin; LR, logistic regression.
Six learned topics from our proposed Latent Dirichlet Allocation topic model trained to predict general stability that were selected as representative of the 10 total topics learned by the model. Code words with high probability in the same topic were likely to cooccur together in a patient’s record explained by that topic. The top 10 most probable codes are shown. Each topic is labeled with a clinician annotated title (provided post hoc by R.H.P.) and the topic’s index order within the original model. Learned LR coefficients were rounded to the nearest 0.1 for the task of predicting general stability. Large positive coefficients suggest that a patient whose history uses more of that topic will be more stable.
Each topic is defined by a learned distribution over 9256 possible diagnostic (ICD), procedural (CPT), and medication-related code words.