| Literature DB >> 29888086 |
Yaoyun Zhang1, Hee-Jin Li1, Jingqi Wang1, Trevor Cohen1, Kirk Roberts1, Hua Xu1.
Abstract
Mental health is increasingly recognized an important topic in healthcare. Information concerning psychiatric symptoms is critical for the timely diagnosis of mental disorders, as well as for the personalization of interventions. However, the diversity and sparsity of psychiatric symptoms make it challenging for conventional natural language processing techniques to automatically extract such information from clinical text. To address this problem, this study takes the initiative to use and adapt word embeddings from four source domains - intensive care, biomedical literature, Wikipedia and Psychiatric Forum - to recognize symptoms in the target domain of psychiatry. We investigated four different approaches including 1) only using word embeddings of the source domain, 2) directly combining data of the source and target to generate word embeddings, 3) assigning different weights to word embeddings, and 4) retraining the word embedding model of the source domain using a corpus of the target domain. To the best of our knowledge, this is the first work of adapting multiple word embeddings of external domains to improve psychiatric symptom recognition in clinical text. Experimental results showed that the last two approaches outperformed the baseline methods, indicating the effectiveness of our new strategies to leverage embeddings from other domains.Entities:
Year: 2018 PMID: 29888086 PMCID: PMC5961810
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
Figure 1.An example paragraph from psychiatric notes with symptoms. The psychiatric symptoms are highlighted in italic.
Figure 2.Study design of adapting word embeddings of multiple source domains to psychiatric notes for psychiatric symptom recognition using the deep learning method.
Corpus statistics for MIMIC, MEDLINE, Psychiatric Forum, Wikipedia and psychiatric notes.
| Corpus | Size of data | #Unique tokens | #Coverage of target tokens | |
|---|---|---|---|---|
| Source | MIMIC III | 1.95G | 257,472 | 3,029 (73.1%) |
| MEDLINE | 1.3G | 251,080 | 3,947(95.2%) | |
| Psychiatric Forum | 78.5M | 36,336 | 3,264 (78.8%) | |
| Wikipedia | 10.4G | 2,448,552 | 4,063 (98.0%) | |
| Target | Psychiatric Notes | 2.4M | 4,144 | # |
Results for RNN-based psychiatric symptom recognition using word embeddings from multiple domains (%).
| Corpus | Method | P | R | F-measure |
|---|---|---|---|---|
| None | Randomize | 68.93 | 63.06 | 65.87 |
| Psychiatric Notes | Target_only | 70.82 | 64.05 | 67.26 |
| Psychiatric Forum | Source_only | 67.85 | 67.42 | 67.63 |
| Source+target | 69.34 | 65.52 | 67.38 | |
| Weighted_concatenate | 72.01 | 64.05 | 67.80 | |
| Retrain_source | 71.60 | 64.84 | ||
| Mimic | Source_only | 69.66 | 65.16 | 67.34 |
| Source+target | 67.86 | 66.71 | 67.28 | |
| Weighted_concatenate | 69.52 | 66.90 | 68.19 | |
| Retrain_source | 71.25 | 67.87 | ||
| Wikipedia | Source_only | 69.99 | 69.04 | |
| Source+target | 69.80 | 68.82 | 69.30 | |
| Weighted_concatenate | 71.42 | 66.86 | 69.07 | |
| Retrain_source | 69.50 | 66.19 | 67.80 | |
| MEDLINE | Source_only | 72.18 | 63.85 | 67.76 |
| Source+target | 71.82 | 64.05 | 67.71 | |
| Weighted_concatenate | 68.95 | 67.34 | 68.14 | |
| Retrain_source | 73.62 | 64.81 |
Performance comparison between the Deep learning based algorithm and the CRF algorithm for psychiatric symptom recognition (%).
Error analysis of deep learning based psychiatric symptom recognition. False positive/negative errors are highlighted in italic.
| Error type | Example |
|---|---|
| False positive | |
| Term taken out of context | She felt very well from a mood and anxiety stand point prior to pregnancy |
| Non-specific symptomatology | it is in context of needing to work long hours and sometimes can be associated with impulsive incidents in the remote past but no |
| Non-psychiatric symptomatology | Patient is |
| False negative | |
| Complex syntactic structure | notices light sensitivity, emotional numbness / |
| Abbreviations | Denies history of symptoms of AH / VH / TH / |
| Rare symptom pattern | At heaviest use was |
| Telegraphic writing | Brother |
Examples of partial matched psychiatric symptoms.
| Symptom annotation | Partially matched symptom |
| Suicidal thoughts | Periods of suicidal thoughts |
| etoh or drug use | drug use |
| Complicated Grief; pain medication abuse | Complicated Grief and pain medication abuse |
| subjective sense of “ brain fogginess | sense of “ brain fogginess |
| had anxiety all my life | anxiety |