| Literature DB >> 35890994 |
Mirosław Płaza1, Sławomir Trusz2, Justyna Kęczkowska1, Ewa Boksa3, Sebastian Sadowski4, Zbigniew Koruba5.
Abstract
Over the past few years, virtual assistant solutions used in Contact Center systems are gaining popularity. One of the main tasks of the virtual assistant is to recognize the intentions of the customer. It is important to note that quite often the actual intention expressed in a conversation is also directly influenced by the emotions that accompany that conversation. Unfortunately, scientific literature has not identified what specific types of emotions in Contact Center applications are relevant to the activities they perform. Therefore, the main objective of this work was to develop an Emotion Classification for Machine Detection of Affect-Tinged Conversational Contents dedicated directly to the Contact Center industry. In the conducted study, Contact Center voice and text channels were considered, taking into account the following families of emotions: anger, fear, happiness, sadness vs. affective neutrality of the statements. The obtained results confirmed the usefulness of the proposed classification-for the voice channel, the highest efficiency was obtained using the Convolutional Neural Network (accuracy, 67.5%; precision, 80.3; F1-Score, 74.5%), while for the text channel, the Support Vector Machine algorithm proved to be the most efficient (accuracy, 65.9%; precision, 58.5; F1-Score, 61.7%).Entities:
Keywords: call/contact center; chatbot; emotions recognition; virtual assistant; voicebot
Mesh:
Year: 2022 PMID: 35890994 PMCID: PMC9321989 DOI: 10.3390/s22145311
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Selected systems for automatic recognition of affective states.
| System | Affective State | Parameters | Technical Solutions | References |
|---|---|---|---|---|
| An analysis of the emotions | Anger, neutrality, disgust, sadness, fear, happiness, surprise | Facial expressions, | Sensor electrodermal activity (EDA), face | [ |
| Dynamic facial expression | Neutrality, disgust, | Facial expressions | Convolutional Neural Network (CNN) | [ |
| Detection of emotions based | Lividness, boredom, | Body posture | Algorithm on C++, a sensor that analyzes | [ |
| Detection of emotions | Happiness, sadness, | Gestures and body | A sensor that analyzes posture, | [ |
| Moodies for voice-based | Disgust, happiness, | Sound | An app that detects the emotion | [ |
| Techniques for recognizing emotions from voice | Anger, happiness, sadness and neutral state | Sound | Deep neural networks, hybrid CNN | [ |
| A prototype system for detecting emotions in a text based on social media posts | Anger, anticipation, | Text | Long Short Term Memory (LSTM) networks | [ |
| A model for emotion | Happiness, sadness, | ECG | Spiker-Shield Heart and Brain sensor, Extra Tree Classification, ADA Boost Classification with SVM, Python Scikit API | [ |
| Development of an emotion | Sadness, fear | ECG 1, GSR 2, BVP 3, pulse, respiration | A system with five physiological signal | [ |
| Emotion Recognition Using Heart Rate Data from a Smart Bracelet | Happiness, sadness | Pulse | A smart bracelet (Algoband F8), | [ |
1 ECG, electrocardiogram; 2 GSR, galvanic skin responses; 3 BVP, blood volume pulse.
Frequency analysis of the emotional coloring of utterance fragments in Sample 1 (voice channel).
| Affective State | Descriptive Statistics | Family of Related Emotions | |
|---|---|---|---|
| N(%) 1 | L(%) 2 | ||
|
| 937 (32) | 1 h 19 min 7 s (5.74) | irritation, impatience, negative surprise, disappointment, bitterness, anger, irony, sarcasm, rage |
|
| |||
|
| 166 (5) | 36 min 58 s (2.68) | uncertainty, fear, worry, confusion, anxiety, panic |
|
| |||
|
| 196 (7) | 3 min 16 s (0.24) | interest, satisfaction, positive surprise, excitement, gratitude, hope, happiness, amusement |
|
| |||
|
| 1491 (51) | 14 h 36 min 46 s | not applicable |
|
| |||
|
| 145 (5) | 7 min 18 s (0.53) | resignation, bitterness, helplessness, regret, melancholy |
|
| |||
1 N(%), total number of fragments and their percentage in the sample; 2 L(%), total duration of fragments and their percentage in the sample.
Frequency analysis of the emotional coloring of utterance fragments in Sample 1 (text channel).
| Affective State | Descriptive Statistics | Family of Related Emotions | |
|---|---|---|---|
| N (%) 1 | L (%) 2 | ||
|
| 312 (4.15) | 9910 (5.67) | irritation, impatience, negative surprise, disappointment, bitterness, anger, irony, sarcasm, rage |
|
| |||
|
| 102 (1.35) | 3659 (2.09) | uncertainty, fear, worry, confusion, anxiety, panic |
|
| |||
|
| 761 (10.12) | 21,521 (12.32) | interest, satisfaction, positive surprise, excitement, gratitude, hope, happiness, amusement |
|
| |||
|
| 2269 (30.19) | 90,817 (52.00) | not applicable |
|
| |||
|
| 305 (4.05) | 8741 (5.00) | resignation, bitterness, helplessness, regret, melancholy |
|
| |||
1 N (%), total number of fragments and their percentage in the sample; 2 L (%), total duration of fragments and their percentage in the sample.
Effectiveness of automatic detection of affective states in voice channels.
| No | Classifier Type | Voice Channel | |||||
|---|---|---|---|---|---|---|---|
|
| SD 1 |
| SD 1 |
| SD 1 | ||
| [%] | [%] | [%] | [%] | [%] | [%] | ||
| 1. | CNN | 72.9 | 2.58 | 80.7 | 0.89 | 75.0 | 1.58 |
| 2. | kNN | 70.0 | 1.00 | 67.8 | 1.64 | 67.2 | 1.64 |
| 3. | SVM | 69.2 | 3.11 | 69.4 | 5.81 | 63.4 | 3.84 |
1 SD—Standard deviation.
Effectiveness of automatic detection of affective states in text channels.
| No | Classifier Type | Text Channel | |||||
|---|---|---|---|---|---|---|---|
|
| SD 1 |
| SD 1 |
| SD 1 | ||
| [%] | [%] | [%] | [%] | [%] | [%] | ||
| 1. | ANN | 63.9 | 2.45 | 64.0 | 2.49 | 64.4 | 2.57 |
| 2. | DT | 54.4 | 2.23 | 58.1 | 4.34 | 55.6 | 3.14 |
| 3. | kNN | 55.5 | 2.14 | 62.1 | 4.98 | 56.1 | 3.45 |
| 4. | RFC | 53.4 | 1.58 | 66.0 | 2.81 | 55.8 | 2.78 |
| 5. | SVM | 49.2 | 1.54 | 57.7 | 4.57 | 49.1 | 4.18 |
1 SD—Standard deviation.
Results of verification of the machine emotion detection process.
| No | Classifier Type | Accuracy [%] | Precision [%] | F1-Score [%] |
|---|---|---|---|---|
| VOICE CHANNEL | ||||
| 1. | CNN | 67.5 | 80.3 | 74.5 |
| 2. | kNN | 52.7 | 67.6 | 57.5 |
| 3. | SVM | 62.4 | 63.9 | 62.2 |
| TEXT CHANNEL | ||||
| 1. | ANN | 55.8 | 62.4 | 58.4 |
| 2. | DT | 49.6 | 58.9 | 53.4 |
| 3. | kNN | 55.3 | 57.2 | 55.7 |
| 4. | RFC | 56.5 | 59.2 | 57.0 |
| 5. | SVM | 65.9 | 58.5 | 61.7 |
Figure 1How virtual assistants function taking into account the recognized emotions.