| Literature DB >> 36059763 |
Yongheng Liu1,2, Yajing Shen1, Zhiyong Cai1.
Abstract
For a long time, it takes a lot of time and energy for psychological workers to classify the psychological problems of college students. In order to quickly and efficiently understand the common psychological problems of college students in the region for real-time analysis in the post-epidemic era, 2,000 college students' psychological problems were selected as research data in the community question section of the "Su Xin" application, a psychological self-help and mutual aid platform for college students in Jiangsu Province. First, word segmentation, removal of stop words, establishment of word vectors, etc. were used for the preprocessing of research data. Secondly, it was divided into 9 common psychological problems by LDA clustering analysis, which also combined with previous researches. Thirdly, the text information was processed into word vectors and transferred to the Attention-Based Bidirectional Long Short-Term Memory Networks (AB-LSTM). The experimental results showed that the proposed model has a higher test accuracy of 78% compared with other models.Entities:
Keywords: AB-LSTM; machine learning; natural language processing; psychological problem categories; text cluster analysis
Year: 2022 PMID: 36059763 PMCID: PMC9430022 DOI: 10.3389/fpsyg.2022.975493
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
FIGURE 1Attention-based bidirectional long short-term memory networks.
Text content classification algorithm based on AB-LSTM.
| 1: |
| 2: Preprocess the dataset (text data vectorization, data division). |
| 3: Enter the training set and randomly initialize the LTSM model parameters: |
| 4: Build the embedding layer |
| 5: Build the SpatialDropout1D layer |
| 6: Build the LSTM layer |
| 7: Attention (name = “attention_weight”) |
| 8: Build a fully connected layer |
| 10: |
FIGURE 2The distribution of psychological problems of college students.
FIGURE 3(A) Model loss diagram. (B) Model accuracy chart.
FIGURE 4Confusion matrix diagram.
Accuracy of different classification algorithms.
| Label | MultinomialNB | SVC | AB-LSTM |
| Interpersonal relationship | 0.52 | 0.62 |
|
| Emotional stress | 0.43 | 0.78 |
|
| Academic stress | 0.56 | 0.56 |
|
| Negative life events |
| 0.58 | 0.67 |
| Consult the teacher | 0.8 |
| 0.60 |
| Family relationship |
| 0.8 | 0.83 |
| Mental disease | 0.5 | 0.5 |
|
| Romantic relationship | 0 |
| 0.81 |
| Personal growth |
| 0 | 0.83 |
Bold is the highest value.
Score table of different classification algorithms.
| Classification model | Precision | Recall | F1 score |
| MultinomialNB | 0.56 | 0.53 | 0.54 |
| SVC | 0.66 | 0.64 | 0.65 |
| AB-LSTM |
|
|
|
Bold is the highest value.