| Literature DB >> 31740657 |
Yantao Ma1,2,3,4, Jun Ji5,6,7,8, Yun Huang1,2,3,4, Huimin Gao1,2,3,4, Zhiying Li1,2,3,4, Wentian Dong1,2,3,4, Shuzhe Zhou1,2,3,4, Yue Zhu1,2,3,4, Weimin Dang1,2,3,4, Tianhang Zhou1,2,3,4, Haiqing Yu6,9, Bin Yu6, Yuefeng Long6, Long Liu6, Gary Sachs10, Xin Yu11,12,13,14.
Abstract
Bipolar disorder (BPD) is often confused with major depression, and current diagnostic questionnaires are subjective and time intensive. The aim of this study was to develop a new Bipolar Diagnosis Checklist in Chinese (BDCC) by using machine learning to shorten the Affective Disorder Evaluation scale (ADE) based on an analysis of registered Chinese multisite cohort data. In order to evaluate the importance of each item of the ADE, a case-control study of 360 bipolar disorder (BPD) patients, 255 major depressive disorder (MDD) patients and 228 healthy (no psychiatric diagnosis) controls (HCs) was conducted, spanning 9 Chinese health facilities participating in the Comprehensive Assessment and Follow-up Descriptive Study on Bipolar Disorder (CAFÉ-BD). The BDCC was formed by selected items from the ADE according to their importance as calculated by a random forest machine learning algorithm. Five classical machine learning algorithms, namely, a random forest algorithm, support vector regression (SVR), the least absolute shrinkage and selection operator (LASSO), linear discriminant analysis (LDA) and logistic regression, were used to retrospectively analyze the aforementioned cohort data to shorten the ADE. Regarding the area under the receiver operating characteristic (ROC) curve (AUC), the BDCC had high AUCs of 0.948, 0.921, and 0.923 for the diagnosis of MDD, BPD, and HC, respectively, despite containing only 15% (17/113) of the items from the ADE. Traditional scales can be shortened using machine learning analysis. By shortening the ADE using a random forest algorithm, we generated the BDCC, which can be more easily applied in clinical practice to effectively enhance both BPD and MDD diagnosis.Entities:
Mesh:
Year: 2019 PMID: 31740657 PMCID: PMC6861254 DOI: 10.1038/s41398-019-0638-8
Source DB: PubMed Journal: Transl Psychiatry ISSN: 2158-3188 Impact factor: 6.222
The top 17 features ranked by mRMR.
| Rank | BDCC Items | mRMR |
|---|---|---|
| 1 | Over the past 2 weeks, how many days have you had any severe abnormal mood elevation? | 0.229 |
| 2 | Other features of past episodes of depression: sudden onset? | 0.037 |
| 3 | Dysthymia: depressed more days than not for >2 years? | 0.037 |
| 4 | Over the past 2 weeks, how many days have you had lowered interest in most activities or found that you could not enjoy even pleasurable activities most of the day? | 0.031 |
| 5 | How old were you when you were first treated for depression? | 0.031 |
| 6 | Rate associated symptoms for the past week: guilt | 0.010 |
| 7 | Rate associated symptoms for the past week: life not worth living (LNWL) | 0.007 |
| 8 | Rate associated symptoms for the past week: flight of ideas (FOI)/racing thoughts | 0.005 |
| 9 | Past psychiatric history: suicide attempt | 0.005 |
| 10 | Over the past year, how many days have you had any abnormal anxiety? | 0.002 |
| 11 | Past depression: other features of past episodes of depression: anger attacks | 0.002 |
| 12 | Over the past 2 weeks, how many days have you had any abnormal severe irritability? | 0.000 |
| 13 | Abnormal mood elevation (lifetime): during the most severe episode identified above, were there any times when your mood was euphoric? | −0.001 |
| 14 | Over the past 2 weeks, how many days have you been depressed most of the day? | −0.003 |
| 15 | Rate associated symptoms for the past week: sleep anhedonia | −0.004 |
| 16 | Rate associated symptoms for the past week: psychomotor agitation (PMA) | −0.005 |
| 17 | Past depression: other features of past episodes of depression: feelings of worthlessness | −0.005 |
LNWL life not worth living, FOI flight of ideas, PMA psychomotor agitation
Fig. 1Forward feature selection results.
a MDD diagnosis, b BPD diagnosis, and c HC diagnosis.
Precision of the machine learning algorithms and the BDCC.
| Random forest | SVR | LASSO | LDA | Logistic regression | BDCC | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AUC | Features used | AUC | Features used | AUC | Features used | AUC | Features used | AUC | Features used | AUC | Features used | |
| MDD | 0.973 | 74/113 | 0.943 | 56/113 | 0.964 | 50/113 | 0.963 | 54/113 | 0.960 | 34/113 | 0.948 | 17/113 |
| BPD | 0.959 | 91/113 | 0.933 | 56/113 | 0.943 | 105/113 | 0.943 | 99/113 | 0.936 | 34/113 | 0.921 | 17/113 |
| HC | 0.927 | 111/113 | 0.905 | 91/113 | 0.918 | 21/113 | 0.923 | 99/113 | 0.925 | 18/113 | 0.923 | 17/113 |
BDCC Bipolar Diagnosis Checklist in Chinese, SVR support vector regression, LASSO least absolute shrinkage and selection operator, LDA linear discriminant analysis, AUC area under curve, MDD major depressive disorder, BPD bipolar disorder, HCs healthy controls
Fig. 2ROC curves of 10-fold cross-validation subsamples of the best random forest performance.
a MDD diagnosis, b BPD diagnosis, and c HC diagnosis.
Fig. 3ROC curves of 10-fold cross-validation subsamples of the BDCC.
a MDD diagnosis, b BPD diagnosis, and c HC diagnosis.