| Literature DB >> 35465357 |
K Nimmi1, B Janet1, A Kalai Selvan2, N Sivakumaran3.
Abstract
The COVID-19 precautions, lockdown, and quarantine implemented throughout the epidemic resulted in a worldwide economic disaster. People are facing unprecedented levels of intense threat, necessitating professional, systematic psychiatric intervention and assistance. New psychological services must be established as quickly as possible to support the mental healthcare needs of people in this pandemic condition. This study examines the contents of calls landed in the emergency response support system (ERSS) during the pandemic. Furthermore, a combined analysis of Twitter patterns connected to emergency services could be valuable in assisting people in this pandemic crisis and understanding and supporting people's emotions. The proposed Average Voting Ensemble Deep Learning model (AVEDL Model) is based on the Average Voting technique. The AVEDL Model is utilized to classify emotion based on COVID-19 associated emergency response support system calls (transcribed) along with tweets. Pre-trained transformer-based models BERT, DistilBERT, and RoBERTa are combined to build the AVEDL Model, which achieves the best results. The AVEDL Model is trained and tested for emotion detection using the COVID-19 labeled tweets and call content of the emergency response support system. This is the first deep learning ensemble model using COVID-19 emotion analysis to the best of our knowledge. The AVEDL Model outperforms standard deep learning and machine learning models by attaining an accuracy of 86.46 percent and Macro-average F1-score of 85.20 percent.Entities:
Keywords: BERT; COVID-19; Deep learning; DistilBERT; Emergency response support system (ERSS); Emotion detection.; Ensemble model; Health emergency; RoBERTa
Year: 2022 PMID: 35465357 PMCID: PMC9014641 DOI: 10.1016/j.asoc.2022.108842
Source DB: PubMed Journal: Appl Soft Comput ISSN: 1568-4946 Impact factor: 8.263
Fig. 1Statistics of calls received by ERSS-Kerala from February 2019 to July 2021.
Details of combined ERSS-tweet dataset.
| Emotion | Class | Count |
|---|---|---|
| No_specific_emotion | 0 | 19 581 |
| Anticipation | 1 | 19 935 |
| Fear | 2 | 23 138 |
| Anger | 3 | 19 962 |
| Sadness | 4 | 21 099 |
Fig. 2Emotions categories dataset count and distribution.
Hyper-parameters tuning.
| Sl no | Model / Hyper-parameters | DistilBERT | BERT | RoBERTa |
|---|---|---|---|---|
| 1 | Pre-trained model | DistilBERT-base-uncased | Bert-base-uncased | Roberta-base |
| 2 | Learning rate | 1e−5, 5e−5, 3e−5, 4e−5, 2e−5 | 1e−5, 5e−5, 3e−5, 4e−5, 2e−5 | 1e−5, 5e−5, 3e−5, 4e−5, 2e−5 |
| 3 | Activation | Softmax | Softmax | Softmax |
| 4 | Batch size | 8, 16, 32 | 8, 16, 32 | 8, 16, 32 |
| 5 | Number of epochs trained | 10,5 | 10,5 | 10 |
| 6 | Maximum sequence length | 125 | 125 | 125 |
| 7 | Dropout | 0.2 | 0.2 | 0.2 |
| 8 | Hidden size | 768 | 768 | 768 |
| 9 | Optimizer | AdamW, Adam | AdamW, Adam | AdamW, Adam |
Fig. 3Overview of Proposed AVEDL model.
System requirements.
| Integrated Development Environment (IDE) | Jupyter notebook |
| Processor | Tesla K80 |
| RAM | 25GB |
| Model name | Intel(R) Xeon(R) |
| Programming space allocated | 160 GB |
| Programming Language | Python (version 3.6.5) |
| Packages | Numpy, nltk, Pandas, and scikit-learn |
Performance evaluation metrics of BERT.
| Batch size | Learning rate | Macro precision | Macro recall | Macro F1-score | Accuracy | Time |
|---|---|---|---|---|---|---|
| 32 | 1e−5 | 82.72% | 82.70% | 82.69% | 84.11% | 1229 s |
| 2e−5 | 83.71% | 83.31% | 83.41% | 84.82% | ||
| 3e−5 | 82.74% | 82.01% | 82.14% | 83.77% | ||
| 4e−5 | 82.79% | 82.24% | 82.20% | 83.78% | ||
| 5e−5 | 82.78% | 82.39% | 82.54% | 83.91% | ||
| 16 | 1345 s | |||||
| 2e−5 | 82.60% | 82.29% | 82.43% | 83.87% | ||
| 3e−5 | 82.73% | 82.22% | 82.38% | 83.79% | ||
| 4e−5 | 82.66% | 82.09% | 82.27% | 83.75% | ||
| 5e−5 | 82.74% | 82.01% | 82.14% | 83.77% | ||
| 8 | 1e−5 | 82.92% | 82.58% | 82.67% | 84.16% | 1642 s |
| 2e−5 | 81.89% | 81.35% | 81.51% | 81.35% | ||
| 3e−5 | 81.92% | 81.59% | 81.69% | 81.59% | ||
| 4e−5 | 81.35% | 80.66% | 80.87% | 80.66% | ||
| 5e−5 | 83.47% | 83.34% | 83.29% | 84.69% | ||
Performance evaluation metrics of DistilBERT.
| Batch size | Learning rate | Macro precision | Macro recall | Macro F1-score | Accuracy | Time |
|---|---|---|---|---|---|---|
| 32 | 1e−5 | 82.60% | 82.16% | 82.32% | 83.78% | 619 s |
| 2e−5 | 83.13% | 82.95% | 82.96% | 84.29% | ||
| 3e−5 | 79.61% | 79.58% | 79.55% | 81.30% | ||
| 4e−5 | 80.05% | 79.52% | 79.70% | 81.37% | ||
| 5e−5 | 80.22% | 79.70% | 79.90% | 81.56% | ||
| 16 | 1e−5 | 83.62% | 83.35% | 83.41% | 691 s | |
| 3e−5 | 79.61% | 79.58% | 79.55% | 81.30% | ||
| 4e−5 | 81.86% | 81.66% | 81.64% | 83.31% | ||
| 5e−5 | 82.27% | 81.92% | 81.90% | 83.36% | ||
| 8 | 1e−5 | 82.08% | 82.66% | 81.81% | 83.34% | 572 s |
| 2e−5 | 83.30% | 82.83% | 82.92% | 84.24% | ||
| 3e−5 | 81.94% | 81.88% | 81.86% | 83.28% | ||
| 4e−5 | 81.12% | 80.70% | 80.88% | 82.44% | ||
| 5e−5 | 80.13% | 79.59% | 79.77% | 81.36% | ||
Performance evaluation metrics of RoBERTa.
| Batch size | Learning rate | Macro precision | Macro recall | Macro F1-score | Accuracy | Time |
|---|---|---|---|---|---|---|
| 32 | 1e−5 | 83.05% | 82.54% | 82.74% | 84.20% | 1256 s |
| 2e−5 | 83.58% | 83.18% | 83.33% | 84.69% | ||
| 3e−5 | 82.60% | 82.29% | 82.43% | 83.87% | ||
| 4e−5 | 81.34% | 81.02% | 81.13% | 82.59% | ||
| 5e−5 | 82.05% | 81.98% | 81.98% | 83.49% | ||
| 16 | 1e−5 | 83.06% | 82.96% | 82.96% | 82.96% | 1437 s |
| 2e−5 | 82.74% | 82.35% | 82.47% | 82.35% | ||
| 3e−5 | 81.94% | 81.65% | 81.73% | 81.65% | ||
| 4e−5 | 83.74% | 83.26% | 83.44% | 84.83% | ||
| 5e−5 | 81.59% | 81.09% | 81.21% | 81.09% | ||
| 8 | 1e−5 | 83.22% | 82.88% | 82.98% | 82.88% | 1428 s |
| 3e−5 | 83.15% | 82.67% | 82.88% | 84.33% | ||
| 4e−5 | 82.84% | 82.38% | 82.57% | 84.06% | ||
| 5e−5 | 81.64% | 81.41% | 81.45% | 83.00% | ||
Detailed performance evaluation metrics of BERT showing best results.
| Class | Precision | Recall | F1-score |
|---|---|---|---|
| 0 | 78.45% | 79.56% | 79.00% |
| 1 | 85.14% | 75.78% | 80.19% |
| 2 | 95.53% | 94.93% | 95.23% |
| 3 | 75.30% | 83.09% | 79.00% |
| 4 | 85.21% | 83.45% | 84.32% |
Detailed performance evaluation metrics of DistilBERT showing best result.
| Class | Precision | Recall | F1-score |
|---|---|---|---|
| 0 | 82.78% | 73.63% | 77.94% |
| 1 | 79.67% | 80.98% | 80.32% |
| 2 | 95.59% | 94.66% | 95.13% |
| 3 | 77.31% | 82.38% | 79.76% |
| 4 | 82.78% | 85.11% | 83.93% |
Detailed Performance evaluation metrics of RoBERTa showing best result.
| Class | Precision | Recall | F1-score |
|---|---|---|---|
| 0 | 82.98% | 75.83% | 79.25% |
| 1 | 81.44% | 78.04% | 79.70% |
| 2 | 94.90% | 95.35% | 95.13% |
| 3 | 76.54% | 83.26% | 79.76% |
| 4 | 83.94% | 84.68% | 84.31% |
Fig. 4BERT ROC curves for emotion classification.
Fig. 5DistilBERT ROC curves for emotion classification.
Fig. 6ROC curves for emotion classifications using RoBERTa.
Performance evaluation metrics of Proposed AVEDL model.
| Model | Recall | Precision | Accuracy | F1-score |
|---|---|---|---|---|
| Proposed AVEDL model |
Detailed performance evaluation metrics of AVEDL Model showing best result.
| Class | Precision | Recall | F1-score |
|---|---|---|---|
| 0 | 82.79% | 78.30% | 80.49% |
| 1 | 85.71% | 79.09% | 82.27% |
| 2 | 96.54% | 95.34% | 95.94% |
| 3 | 77.23% | 85.10% | 80.98% |
| 4 | 85.17% | 87.53% | 86.33% |
Wrongly classified sentence using BERT model.
| Sl.no. | Text | Actual class | Predicted class |
|---|---|---|---|
| 1 | Sunil calling to inform that an migrant labor is having fever need immediate medical aid need ambulance shift hospital loction xxxx do something fast | Fear | Anger |
| 2 | During covid there is no geographic identification for containment zones but we wanted to serve people there so what all things are to be done we are not afraid of anything | Anger | No specific emotion |
| 3 | Know you would bounce back soon and give it back to the wuhan virus get well soon | No specific emotion | Anticipation |
| 4 | Resister complainant regarding auto radhika taking vegetables along persons without permission from police in lockdown time location xxxx | Sadness | No specific emotion |
Misclassified sentence using DistilBERT model.
| Sl.no. | Text | Actual class | Predicted class |
| 1 | There is always a mismatch in their numbers negative reports are sent later focus is on what is posted | Anger | Sadness |
| 2 | Please look department of posts at kolkata offices specially head post offices and sub post offices too no precaution about corona | Sadness | Anger |
| 3 | What ever may be the religion of yours if you are covid positive or suspected and you are hiding from people | No specific emotion | Fear |
| 4 | The may have been covid positive but the cause of death covid or due to some other reason people are spreading rumors | Fear | Sadness |
| 5 | caa corona applied appeared | Anticipation | No specific emotion |
The RoBERTa model incorrectly predicted sentences.
| Sl.no. | Text | Actual class | Predicted class |
|---|---|---|---|
| 1 | Before covid also more people died of diabetes cancer road accidents and malaria Covid is like fire | Sadness | Fear |
| 2 | Crowd infront of hotel xxxx covid will spread fast if no action taken take some action | Anger | Fear |
| 3 | Sujatha sheeba wife of subramanian staying lonely need adequate food not having enough food since last week due to lockdown location xxxx nearby am much worried because of lockdown | Anger | Sadness |
| 4 | He may have been covid positive but the cause of death covid or due to some other reason please try to understand | Fear | Sadness |
Comparing AVEDL model results with state-of-art models.
| Model | Macro Recall | Macro Precision | Accuracy | F1-score |
|---|---|---|---|---|
| SVM | 58.32% | 58.33% | 58.41% | 57.11% |
| SVM(tf-idf) | 69.30% | 63.72% | 66.22% | 65.34% |
| Random Forest(tf-idf) | 57.65% | 42.02% | 42.35% | 47.49% |
| BERT | 83.92% | 83.36% | 84.91% | 83.55% |
| RoBERTa | 84.87% | 84.19% | 85.69% | 84.47% |
| DistilBERT | 83.62% | 83.35% | 84.86% | 83.41% |
| Proposed AVEDL Model | 85.07% | 85.49% | 86.46% | 85.20% |
Fig. 7Comparing the results of AVEDL Model with existing models based on Macro Precision and Macro Recall.
Fig. 8Comparing the results of AVEDL Model with existing models based on Accuracy and Macro F1-score.
Fig. 9Flow chart of emotion detection on ERSS-Kerala data.
Performance evaluation metrics of AVEDL model on ERSS dataset.
| Macro Precision | Macro Recall | Macro F1-score | Accuracy |
|---|---|---|---|
| 83.15% | 82.67% | 82.88% | 84.33% |