| Literature DB >> 26633967 |
Shuangping Gong1, Yonghui Dai2, Jun Ji3, Jinzhao Wang4, Hai Sun4.
Abstract
Customer complaint has been the important feedback for modern enterprises to improve their product and service quality as well as the customer's loyalty. As one of the commonly used manners in customer complaint, telephone communication carries rich emotional information of speeches, which provides valuable resources for perceiving the customer's satisfaction and studying the complaint handling skills. This paper studies the characteristics of telephone complaint speeches and proposes an analysis method based on affective computing technology, which can recognize the dynamic changes of customer emotions from the conversations between the service staff and the customer. The recognition process includes speaker recognition, emotional feature parameter extraction, and dynamic emotion recognition. Experimental results show that this method is effective and can reach high recognition rates of happy and angry states. It has been successfully applied to the operation quality and service administration in telecom and Internet service company.Entities:
Mesh:
Year: 2015 PMID: 26633967 PMCID: PMC4655047 DOI: 10.1155/2015/506905
Source DB: PubMed Journal: Comput Intell Neurosci
Classifications of basic emotions.
| Scholars | Classifications of basic emotions |
|---|---|
| Arnold | Anger, aversion, courage, dejection, desire, despair, fear, hate, hope, love, and sadness |
|
| |
| Ekman, Friesen, and Ellsworth | Anger, disgust, fear, joy, sadness, and surprise |
|
| |
| Frijda | Desire, happiness, interest, surprise, wonder, and sorrow |
|
| |
| Gray | Rage and terror, anxiety, and joy |
|
| |
| Izard | Anger, contempt, disgust, distress, fear, guilt, interest, joy, shame, and surprise |
|
| |
| James | Fear, grief, love, and rage |
|
| |
| McDougall | Anger, disgust, elation, fear, subjection, tender-emotion, and wonder |
|
| |
| Mowrer | Pain, pleasure |
|
| |
| Oatley and Johnson-laird | Anger, disgust, anxiety, happiness, and sadness |
|
| |
| Panksepp | Expectancy, fear, rage, and panic |
|
| |
| Plutchik | Acceptance, anger, anticipation, disgust, joy, fear, sadness, and surprise |
|
| |
| Tomkins | Anger, interest, contempt, disgust, distress, fear, joy, shame, and surprise |
|
| |
| Watson | Fear, love, and rage |
|
| |
| Weiner and Graham | Happiness, sadness |
Figure 1Continuous form of emotions in different dimensions.
Figure 2Extraction process of MFCC coefficients.
Figure 3The simplified structure of neurons.
Someone's speech characteristics.
| Speech characteristics | Someone's emotional states | ||
|---|---|---|---|
| Calmness | Discontent | Anger | |
| Mean-intensity ( | 43.82 | 60.59 | 76.82 |
| Maximum pitch (Hz) | 315.59 | 408.13 | 532.11 |
| Min pitch (Hz) | 148.61 | 122.05 | 180.69 |
| Mean pitch (Hz) | 207.82 | 257.33 | 267.91 |
| Pitch range (Hz) | 107.77 | 286.08 | 321.42 |
Figure 4Speaker identification algorithm based on cost-sensitive learning technology.
Figure 5The process of speech emotion recognition.
Part of the “.wav” files of telephone complaints.
| Recorders of telephone complaints | ||
|---|---|---|
| (1) Anger 01-broadband fault.wav | (4) Discontent 01-broadband fault.wav | (7) Calmness 01-broadband fault.wav |
|
| ||
| (2) Anger 02-improper charges.wav | (5) Discontent 02-improper charges.wav | (8) Calmness 02-improper charges.wav |
|
| ||
| (3) Anger 03-harassing messages.wav | (6) Discontent 03-harassing messages.wav | (9) Calmness 03-harassing messages.wav |
Figure 6The characteristic value of three emotional states.
Figure 7Framing and Hamming windowing of speech file.
Figure 8Fundamental frequency of the sample voice.
12-order MFCC parameters.
| 12-order MFCC coefficients | ||||||
|---|---|---|---|---|---|---|
| 1 | 11.9380 | 13.8424 | 14.5935 | 12.0397 | ⋯ | 16.7550 |
| 2 | −3.6532 | −1.2277 | −0.5424 | −0.9713 | 5.7762 | |
| 3 | −1.2243 | −0.3909 | 1.0603 | 0.4253 | 3.1929 | |
| 4 | 0.0357 | 1.8983 | 0.4281 | −0.6828 | 7.2475 | |
| 5 | 0.3830 | 0.7901 | 0.2184 | −0.4252 | 0.3533 | |
| 6 | −0.6169 | −0.2753 | −0.0506 | −0.3335 | −0.3407 | |
| 7 | 1.3436 | 0.6748 | −1.2993 | −2.5699 | 1.1169 | |
| 8 | 1.6690 | 3.4504 | 4.5318 | 4.7105 | 1.6278 | |
| 9 | −1.6734 | 1.3769 | 3.5090 | 4.7951 | −1.5150 | |
| 10 | −0.3022 | −0.6043 | −0.8377 | −0.3965 | −0.4561 | |
| 11 | −0.9944 | −1.0738 | 0.5673 | 1.1786 | 2.0980 | |
| 12 | 0.2397 | 0.3336 | 0.0626 | −0.1852 | 0.4053 | |
Figure 912-order MFCC parameters extracted from a sample of telephone speech.
Average recognition rate of emotions.
| Recognition methods | Recognition rate | |||
|---|---|---|---|---|
| Calmness | Discontent | Angry | Average | |
| BPNN | 81.22% | 60.46% | 80.92% | 74.20% |
| SVM (12-order MFCC) | 84.50% | 61.40% | 83.27% | 76.39% |
| SVM (combined 12-order MFCC and short energy) | 88.60% | 61.83% | 89.80% | 80.08% |