| Literature DB >> 33859596 |
Usman Ahmed1, Suresh Kumar Mukhiya1, Gautam Srivastava2,3, Yngve Lamo1, Jerry Chun-Wei Lin1.
Abstract
With the increasing prevalence of Internet usage, Internet-Delivered Psychological Treatment (IDPT) has become a valuable tool to develop improved treatments of mental disorders. IDPT becomes complicated and labor intensive because of overlapping emotion in mental health. To create a usable learning application for IDPT requires diverse labeled datasets containing an adequate set of linguistic properties to extract word representations and segmentations of emotions. In medical applications, it is challenging to successfully refine such datasets since emotion-aware labeling is time consuming. Other known issues include vocabulary sizes per class, data source, method of creation, and baseline for the human performance level. This paper focuses on the application of personalized mental health interventions using Natural Language Processing (NLP) and attention-based in-depth entropy active learning. The objective of this research is to increase the trainable instances using a semantic clustering mechanism. For this purpose, we propose a method based on synonym expansion by semantic vectors. Semantic vectors based on semantic information derived from the context in which it appears are clustered. The resulting similarity metrics help to select the subset of unlabeled text by using semantic information. The proposed method separates unlabeled text and includes it in the next active learning mechanism cycle. Our method updates model training by using the new training points. The cycle continues until it reaches an optimal solution, and it converts all the unlabeled text into the training set. Our in-depth experimental results show that the synonym expansion semantic vectors help enhance training accuracy while not harming the results. The bidirectional Long Short-Term Memory (LSTM) architecture with an attention mechanism achieved 0.85 Receiver Operating Characteristic (ROC curve) on the blind test set. The learned embedding is then used to visualize the activated word's contribution to each symptom and find the psychiatrist's qualitative agreement. Our method improves the detection rate of depression symptoms from online forum text using the unlabeled forum texts.Entities:
Keywords: NLP; adaptive treatments; internet-delivered interventions; text clustering; word sense identification
Year: 2021 PMID: 33859596 PMCID: PMC8042786 DOI: 10.3389/fpsyg.2021.642347
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1A workflow architecture for estimating PHQ-9 symptoms from the patient-authored texts. PQ, psychometric questionnaire; NLP, natural language processing.
Figure 2A flow of training and adaption using attention based domain adoption. The visualization and symptoms similarity is suggested for the psychiatrist for note-making and suggesting patient remedy for certain symptoms.
PHQ-9 questionnaire and seed terms for each symptoms.
| S1 | Little interest or pleasure in doing things | Interest |
| S2 | Felling down depressed or hopeless | Feeling, depressed, hopeless |
| S3 | Trouble falling or staying asleep or sleeping too much | Sleep, asleep |
| S4 | Feeling tired or having little energy | Tired, energy |
| S5 | Poor appetite or over eating | Appetite, overeating |
| S6 | Feeling bad about yourself or that you are a failure or have let yourself or your family down | Failure, family |
| S7 | Trouble concentrating on things such as reading the newspaper or watching television | Concentration, reading, watching |
| S8 | Moving or speaking so slowly that other people could have noticed or the opposite being or restless that you have been moving around a lot more than usual | Moving, speaking, restless |
| S9 | Thoughts that you would be better off dead or of hurt yourself | Dead, hurt, suicide |
The statistical summary of the training and testing set.
| Corpus size (Number of posts collected) | 15,044 |
| Number of sentences | 133,524 |
| Average sentences per post | 8.87 |
| Average words per post | 232 |
| Training set size (Number of posts) | 14,944 |
| Testing set size (Number of posts) | 100 |
A snippets of dataset used.
| It is too much to handle. The depression and anxiety. Tried so many ways to get better including varying cocktails of meds but I feel so hopeless. Last semester and I think I'm going to fail. | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Having a very bad day today. Haven't even got dressed yet might not bother at all today. Don't really know why I keep going. Feel so very very sad and…… | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 |
| Hi all I'm after a bit of advice. I think my partner is depressed and I told him he needs to go to the doctors. He works away Monday to Friday and is stressed out at work working as a lorry driver he does long hours (70+ a week)… | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
The mean ROC–Area Under the ROC Curve values of training and testing set.
| Baseline | 0.89 | 0.81 |
| LSTM | 0.65 | 0.38 |
| Bidirectional LSTM | 0.91 | 0.8 |
| Bidirectional_LSTM_Attention |
The bold value represents the highest ROC-AUC value.
Figure 3The feed-forward neural network baseline model.
Figure 4The performance of Long Short-Term Memory (LSTM) model.
Figure 5The performance of bidirectional Long Short-Term Memory (LSTM) model.
Figure 6The performance of bidirectional Long Short-Term Memory (LSTM) with attention.
Figure 7An example patient-authored text and visualization of depression symptoms extracted by our approach.