| Literature DB >> 26712757 |
Aitor Álvarez1, Basilio Sierra2, Andoni Arruti3, Juan-Miguel López-Gil4, Nestor Garay-Vitoria5.
Abstract
In this paper, a new supervised classification paradigm, called classifier subset selection for stacked generalization (CSS stacking), is presented to deal with speech emotion recognition. The new approach consists of an improvement of a bi-level multi-classifier system known as stacking generalization by means of an integration of an estimation of distribution algorithm (EDA) in the first layer to select the optimal subset from the standard base classifiers. The good performance of the proposed new paradigm was demonstrated over different configurations and datasets. First, several CSS stacking classifiers were constructed on the RekEmozio dataset, using some specific standard base classifiers and a total of 123 spectral, quality and prosodic features computed using in-house feature extraction algorithms. These initial CSS stacking classifiers were compared to other multi-classifier systems and the employed standard classifiers built on the same set of speech features. Then, new CSS stacking classifiers were built on RekEmozio using a different set of both acoustic parameters (extended version of the Geneva Minimalistic Acoustic Parameter Set (eGeMAPS)) and standard classifiers and employing the best meta-classifier of the initial experiments. The performance of these two CSS stacking classifiers was evaluated and compared. Finally, the new paradigm was tested on the well-known Berlin Emotional Speech database. We compared the performance of single, standard stacking and CSS stacking systems using the same parametrization of the second phase. All of the classifications were performed at the categorical level, including the six primary emotions plus the neutral one.Entities:
Keywords: affective computing; machine learning; speech emotion recognition
Mesh:
Year: 2015 PMID: 26712757 PMCID: PMC4732054 DOI: 10.3390/s16010021
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Human recognition accuracy percentages for utterances as a function of language and emotions (taken from [41]).
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| 75% | 51% | 78% | 71% | 66% | 52% | 80% | |
| 77% | 52% | 68% | 74% | 59% | 51% | 77% |
Figure 1Stacked generalization schemata.
Figure 2Classifier subset selection stacked generalization.
Figure 3The combinations of base classifiers as the estimation of distribution algorithm (EDA) individuals.
First phase. Accuracy percentages for each person using single classifiers. Mean and SD rows denote the average and standard deviation for each classifier considering all of the actors. BN, Bayesian network; NBT, naive Bayes tree; OneR, one rule; RIPPER, repeated incremental pruning to produce error reduction; RandomF, random forest.
| BN | C4.5 | KNN | KStar | NBT | NB | OneR | RIPPER | RandomF | SVM | |
|---|---|---|---|---|---|---|---|---|---|---|
| 69.90% | 64.08% | 65.05% | 54.37% | 55.34% | 61.17% | 53.40% | 48.54% | 64.08% | ||
| 58.25% | 45.63% | 49.51% | 36.89% | 44.66% | 39.81% | 34.95% | 46.60% | 53.40% | ||
| 46.60% | 45.63% | 49.51% | 36.89% | 44.66% | 39.81% | 34.95% | 46.60% | 34.95% | ||
| 43.69% | 39.81% | 35.92% | 43.69% | 34.95% | 45.63% | 33.01% | 54.37% | |||
| 52.43% | 42.72% | 41.75% | 36.89% | 54.37% | 49.51% | 42.72% | 48.54% | 52.43% | ||
| 53.40% | 48.54% | 46.60% | 38.83% | 56.31% | 38.83% | 43.69% | 46.60% | 63.11% | ||
| 42.72% | 33.98% | 32.04% | 25.24% | 41.75% | 37.86% | 38.83% | 38.83% | 45.63% | ||
| 18.18% | 29.09% | 29.09% | 20.91% | 29.09% | 19.09% | 12.73% | 12.73% | 26.36% | ||
| 44.55% | 43.64% | 37.27% | 33.64% | 40.00% | 38.18% | 30.00% | 36.36% | 43.64% | ||
| 54.55% | 43.64% | 50.91% | 26.36% | 52.73% | 55.45% | 30.00% | 41.82% | 58.18% | ||
| 56.36% | 42.73% | 37.27% | 28.18% | 50.00% | 43.64% | 38.18% | 42.73% | 55.45% | 54.55% | |
| 44.55% | 32.73% | 37.27% | 27.27% | 31.82% | 27.27% | 38.18% | 39.09% | 37.27% | 45.45% | |
| 44.55% | 40.91% | 33.64% | 35.45% | 44.55% | 27.27% | 45.45% | 42.73% | 50.91% | ||
| 51.82% | 37.27% | 31.82% | 54.55% | 35.45% | 40.00% | 45.45% | 60.00% | 56.36% | ||
| 51.82% | 53.64% | 53.64% | 37.27% | 52.73% | 40.91% | 40.00% | 57.27% | |||
| 58.18% | 48.18% | 47.27% | 36.36% | 50.00% | 40.91% | 43.64% | 54.55% | 53.64% | ||
| 50.91% | 46.36% | 40.91% | 23.64% | 45.45% | 37.27% | 40.00% | 40.91% | 50.91% | ||
| 51.22% | 44.53% | 42.87% | 33.29% | 46.57% | 39.26% | 38.37% | 42.49% | 51.45% | ||
| 10.98 | 7.91 | 8.83 | 7.57 | 7.69 | 9.72 | 8.52 | 9.51 | 10.13 | 8.69 |
First phase. Accuracy percentages for each person using stacking and bagging and boosting multi-classifiers. Mean and SD rows denote the average and standard deviation for each standard multi-classifier considering all of the actors.
| BN | C4.5 | KNN | KStar | NBT | NB | OneR | RIPPER | RandomF | SVM | Bagging | Boosting | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 64.08% | 59.22% | 72.82% | 57.28% | 67.96% | 61.17% | 43.69% | 62.14% | 66.99% | 65.05% | 34.95% | ||
| 47.57% | 54.37% | 42.72% | 36.89% | 47.57% | 36.89% | 49.51% | 49.51% | 52.43% | 57.28% | 30.10% | ||
| 49.51% | 44.66% | 48.54% | 41.75% | 46.60% | 45.63% | 46.60% | 52.43% | 49.51% | 49.51% | 35.92% | ||
| 49.51% | 48.54% | 41.75% | 33.98% | 53.40% | 45.63% | 39.81% | 42.72% | 50.49% | 44.66% | 33.01% | ||
| 51.46% | 45.63% | 48.54% | 32.04% | 50.49% | 41.75% | 52.43% | 54.37% | 32.04% | ||||
| 56.31% | 53.40% | 42.72% | 54.37% | 55.34% | 54.37% | 55.34% | 52.43% | 32.04% | ||||
| 46.60% | 33.98% | 41.75% | 39.81% | 46.60% | 44.66% | 37.86% | 41.75% | 46.60% | 40.78% | 37.86% | ||
| 23.64% | 30.91% | 24.55% | 29.09% | 25.45% | 19.09% | 20.91% | 23.64% | 24.55% | 30.91% | 17.27% | ||
| 45.45% | 41.82% | 47.27% | 36.36% | 49.09% | 45.45% | 41.82% | 36.36% | 46.36% | 30.91% | |||
| 55.45% | 55.45% | 53.64% | 40.91% | 56.36% | 48.18% | 37.27% | 52.73% | 55.45% | 54.55% | 35.45% | ||
| 49.09% | 48.18% | 43.64% | 39.09% | 49.09% | 40.91% | 39.09% | 50.00% | 55.45% | 52.73% | 35.45% | ||
| 39.09% | 35.45% | 36.36% | 20.91% | 38.18% | 48.18% | 30.91% | 35.45% | 35.45% | 43.64% | 35.45% | ||
| 39.09% | 41.82% | 34.55% | 30.91% | 42.73% | 43.64% | 40.91% | 40.91% | 40.91% | 44.55% | 34.55% | ||
| 50.00% | 56.36% | 57.27% | 40.91% | 63.64% | 62.73% | 37.27% | 56.36% | 44.55% | 34.55% | |||
| 44.55% | 57.27% | 50.91% | 40.91% | 57.27% | 52.73% | 37.27% | 50.91% | 56.36% | 59.09% | 36.36% | ||
| 55.45% | 45.45% | 49.09% | 40.91% | 50.00% | 49.09% | 40.00% | 48.18% | 54.55% | 48.18% | 32.73% | ||
| 41.82% | 44.55% | 38.18% | 38.18% | 46.36% | 44.55% | 36.36% | 43.64% | 43.64% | 43.64% | 30.91% | ||
| 48.22% | 46.63% | 46.55% | 37.54% | 51.24% | 48.41% | 38.88% | 46.50% | 50.73% | 50.16% | 32.91% | ||
| 7.44 | 9.05 | 9.33 | 7.53 | 9.04 | 8.08 | 6.82 | 8.96 | 9.86 | 9.86 | 7.53 | 4.32 |
First phase. Accuracy percentages for each person applying CSS stacking with the EDA classification method. Mean and SD rows denote the average and standard deviation for each classifier working as meta classifiers and considering all of the actors.
| BN | C4.5 | KNN | KStar | NBT | NB | OneR | RIPPER | RandomF | SVM | |
|---|---|---|---|---|---|---|---|---|---|---|
| 71.84% | 70.87% | 72.82% | 68.93% | 70.87% | 44.66% | 68.93% | 71.84% | |||
| 66.02% | 70.87% | 72.82% | 60.19% | 70.87% | 36.89% | 68.93% | 71.84% | |||
| 57.28% | 70.87% | 72.82% | 65.05% | 70.87% | 45.63% | 68.93% | 71.84% | |||
| 55.34% | 60.19% | 51.46% | 53.40% | 51.46% | 39.81% | 51.46% | 59.22% | 61.17% | ||
| 55.34% | 63.11% | 57.28% | 49.51% | 51.46% | 62.14% | 42.72% | 58.25% | 62.14% | ||
| 64.08% | 59.22% | 58.25% | 50.49% | 54.37% | 57.28% | 60.19% | 66.02% | 59.22% | ||
| 51.46% | 47.57% | 51.46% | 41.75% | 39.81% | 51.46% | 37.86% | 47.57% | 50.49% | ||
| 30.91% | 34.55% | 35.45% | 30.00% | 34.55% | 26.36% | 33.64% | 33.64% | 32.73% | ||
| 48.18% | 50.91% | 48.18% | 47.27% | 46.36% | 45.45% | 42.73% | 47.27% | 50.00% | ||
| 58.18% | 60.91% | 58.18% | 56.36% | 45.45% | 60.91% | 37.27% | 60.91% | 60.91% | ||
| 53.64% | 53.64% | 54.55% | 54.55% | 46.36% | 55.45% | 41.82% | 50.91% | 56.36% | ||
| 34.55% | 49.09% | 45.45% | 42.73% | 32.73% | 49.09% | 40.91% | 41.82% | 49.09% | ||
| 47.27% | 51.82% | 54.55% | 52.73% | 52.73% | 40.91% | 52.73% | 50.91% | |||
| 50.91% | 62.73% | 64.55% | 59.09% | 55.45% | 64.55% | 38.18% | 64.55% | 63.64% | ||
| 58.18% | 62.73% | 59.09% | 59.09% | 50.00% | 60.91% | 37.27% | 59.09% | 59.09% | ||
| 55.45% | 50.91% | 59.09% | 53.64% | 44.55% | 60.00% | 40.91% | 52.73% | 56.36% | ||
| 48.18% | 50.91% | 52.73% | 45.45% | 45.45% | 49.09% | 37.27% | 44.55% | 48.18% | ||
| 53.34% | 58.00% | 57.05% | 54.36% | 50.01% | 57.38% | 40.50% | 54.90% | 57.74% | ||
| 9.57 | 9.34 | 9.73 | 10.87 | 8.40 | 9.26 | 5.75 | 9.63 | 9.55 | 9.37 |
First phase. Best accuracy per person by using each classification method. Improvements comparing the best accuracy from multi-classifiers (bagging, boosting and stacking) against single classifiers are presented in the Differences_1 column. In addition, the improvements between the CSS stacking with EDA and the best accuracy from both single and standard multi-classifiers are shown in the Differences_2 column. Mean and SD rows denote the average and standard deviation for each classification method and the type of differences considering all of the actors. Differences are expressed in percentage points.
| Single | Bagging | Boosting | Stacking | Differences_1 | CSS Stacking | Differences_2 | |
|---|---|---|---|---|---|---|---|
| 65.05% | 34.95% | 0.00 | 0.00 | ||||
| 66.02% | 57.28% | 30.10% | 60.19% | −5.83 | +7.77 | ||
| 53.40% | 49.51% | 35.92% | 58.25% | +4.85 | +15.54 | ||
| 59.22% | 55.34% | 33.01% | 53.40% | −3.88 | +2.92 | ||
| 54.37% | 32.04% | 55.34% | −13.59 | 65.05% | −3.88 | ||
| 52.43% | 32.04% | 59.22% | −7.77 | 0.00 | |||
| 50.49% | 48.54% | 37.86% | 46.60% | −1.95 | +1.94 | ||
| 30.91% | 17.27% | 31.82% | −10.00 | 35.45% | −6.37 | ||
| 52.73% | 50.91% | 30.91% | 50.91% | −1.82 | +1.82 | ||
| 54.55% | 35.45% | 60.91% | −3.64 | 61.82% | −2.73 | ||
| 56.36% | 58.18% | 35.45% | 55.45% | +1.82 | +1.82 | ||
| 45.45% | 47.27% | 35.45% | 48.18% | +2.73 | +3.64 | ||
| 44.55% | 34.55% | 46.36% | −15.45 | 59.09% | −2.73 | ||
| 64.55% | 44.55% | 34.55% | 63.64% | −0.91 | +1.82 | ||
| 62.73% | 60.91% | 36.36% | 59.09% | −1.82 | +0.91 | ||
| 59.09% | 56.36% | 32.73% | 55.45% | −2.73 | +1.82 | ||
| 53.64% | 43.64% | 30.91% | 50.00% | −3.64 | +0.91 | ||
| 58.92% | 51.43% | 32.91% | 54.62% | −3.74 | +1.48 | ||
| 8.30% | 7.75% | 4.45% | 8.77% | 5.28 | 9.33% | 4.70 |
Figure 4First phase. Best accuracies per person considering single, multi-classifiers and CSS stacking with EDA classification methods.
First phase. Accuracies and improvements per person in percentage points comparing stacking and CSS stacking with EDA classification methods using SVM as the meta classifier. Mean and SD rows denote the average and standard deviation for each classification method and improvements considering all of the actors.
| Stacking | CSS Stacking | Improvements | |
|---|---|---|---|
| 73.79% | 73.79% | 0.00 | |
| 52.43% | 73.79% | +21.36 | |
| 49.51% | 73.79% | +24.27 | |
| 44.66% | 61.17% | +16.50 | |
| 55.34% | 65.05% | +9.71 | |
| 55.34% | 59.22% | +3.88 | |
| 40.78% | 52.43% | +11.65 | |
| 24.55% | 32.73% | +8.18 | |
| 46.36% | 54.55% | +8.18 | |
| 55.45% | 60.91% | +5.45 | |
| 52.73% | 60.00% | +7.27 | |
| 43.64% | 51.82% | +8.18 | |
| 40.91% | 59.09% | +18.18 | |
| 60.00% | 66.36% | +6.36 | |
| 59.09% | 63.64% | +4.55 | |
| 48.18% | 60.91% | +12.73 | |
| 50.00% | 54.55% | +4,55 | |
| 50.16% | 60.22% | +10.06 | |
| 10.14 | 9.64 | 6.42 |
First phase. p-values of the pair-wise comparison between CSS stacking and the other multi-classifiers.
| Hypothesis | Adjusted |
|---|---|
Second phase. Accuracy percentages per actor for the CSS stacking classifier systems of the second phase (CSS stacking 2nd_Phase) and the comparison with the CSS stacking classifiers of the first phase (CSS stacking 1st_Phase). Mean and SD rows denote the average and standard deviation for each classifier for all of the actors.
| CSS Stacking 2nd_Phase | CSS Stacking 1st_Phase | Differences | |
|---|---|---|---|
| 85.06% | 73.79% | +11.27 | |
| 69.48% | 73.79% | −4.31 | |
| 75.32% | 73.79% | +1.53 | |
| 70.78% | 61.17% | +9.61 | |
| 77.27% | 65.05% | +12.22 | |
| 64.29% | 59.22% | +5.07 | |
| 47.4% | 52.43% | −5.03 | |
| 49.35% | 32.73% | +16.62 | |
| 46.01% | 54.55% | −8.54 | |
| 73.38% | 60.91% | +12.47 | |
| 66.88% | 60.00% | +6.88 | |
| 72.08% | 51.82% | +20.26 | |
| 79.87% | 59.09% | +20.78 | |
| 61.69% | 66.36% | −4.67 | |
| 59.09% | 63.64% | −4.55 | |
| 46.1% | 60.91% | −14.81 | |
| 57.14% | 54.55% | +2.59 | |
| 64.78% | 60.22% | +4.56 | |
| 12.34 | 9.94 | 10.46 |
Third phase. Accuracy percentages per actor for the best three classifiers of each system built on the Berlin Emotional Speech database (Emo-DB). Mean and SD rows represent the average and standard deviation considering all of the actors.
| Single | Standard Stacking | CSS Stacking | |||||||
|---|---|---|---|---|---|---|---|---|---|
| MLP | RandomF | SVM | MLP | RandomF | SVM | MLP | RandomF | SVM | |
| 79.59% | 73.46% | 77.55% | 63.26% | 71.42% | 61.22% | 79.59% | 79.59% | ||
| 94.82% | 87.93% | 86.20% | 79.31% | 89.65% | 72.41% | 93.10% | 94.83% | ||
| 74.41% | 62.79% | 67.44% | 62.79% | 67.44% | 62.79% | 74.42% | 74.42% | ||
| 84.21% | 84.21% | 81.57% | 68.42% | 71.05% | 68.42% | 84.21% | 86.84% | ||
| 63.63% | 72.72% | 56.36% | 65.45% | 54.54% | 67.27% | 72.73% | 78.18% | ||
| 77.14% | 74.28% | 80.00% | 71.42% | 68.57% | 68.57% | ||||
| 78.68% | 75.40% | 72.13% | 67.21% | 70.49% | 65.57% | 77.05% | 78.69% | ||
| 78.26% | 75.36% | 78.26% | 73.91% | 76.81% | 78.26% | 82.61% | 85.51% | ||
| 67.85% | 66.07% | 69.64% | 71.42% | 64.28% | 76.79% | 75.00% | 75.00% | ||
| 74.64% | 83.09% | 76.05% | 73.23% | 71.83% | 76.05% | 83.10% | 80.28% | ||
| 77.32% | 77.87% | 75.80% | 68.55% | 72.41% | 67.21% | 80.63% | 81.32% | ||
| 8.52 | 7.17 | 6.28 | 6.56 | 6.76 | 7.13 | 7.41 | 6.57 | 6.33 | |
P1 actor confusion matrix from the RekEmozio dataset in the second phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 19 | 2 | 0 | 0 | 0 | 1 | 0 |
| Fear | 0 | 20 | 0 | 0 | 0 | 1 | 1 |
| Joy | 0 | 0 | 18 | 3 | 1 | 0 | 0 |
| Anger | 1 | 0 | 2 | 17 | 1 | 1 | 0 |
| Surprise | 0 | 0 | 3 | 2 | 17 | 0 | 0 |
| Disgust | 0 | 0 | 0 | 2 | 0 | 20 | 0 |
| Neutral | 1 | 0 | 0 | 0 | 0 | 1 | 20 |
P2 actor confusion matrix from the RekEmozio dataset in the second phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 16 | 2 | 0 | 0 | 0 | 1 | 3 |
| Fear | 1 | 20 | 0 | 0 | 0 | 1 | 0 |
| Joy | 0 | 0 | 17 | 2 | 3 | 0 | 0 |
| Anger | 0 | 0 | 3 | 19 | 0 | 0 | 0 |
| Surprise | 0 | 2 | 7 | 0 | 13 | 0 | 0 |
| Disgust | 2 | 1 | 2 | 0 | 3 | 5 | 9 |
| Neutral | 0 | 1 | 0 | 0 | 1 | 8 | 12 |
P3 actor confusion matrix from the RekEmozio dataset in the second phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 18 | 2 | 0 | 0 | 0 | 1 | 1 |
| Fear | 1 | 18 | 0 | 0 | 2 | 1 | 0 |
| Joy | 0 | 0 | 10 | 8 | 4 | 0 | 0 |
| Anger | 0 | 0 | 8 | 14 | 0 | 0 | 0 |
| Surprise | 0 | 1 | 2 | 2 | 16 | 1 | 0 |
| Disgust | 1 | 2 | 0 | 0 | 0 | 16 | 3 |
| Neutral | 3 | 0 | 0 | 0 | 0 | 1 | 18 |
P4 actor confusion matrix from the RekEmozio dataset in the second phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 21 | 0 | 0 | 0 | 0 | 0 | 1 |
| Fear | 0 | 16 | 5 | 0 | 0 | 1 | 0 |
| Joy | 0 | 4 | 12 | 4 | 2 | 0 | 0 |
| Anger | 0 | 1 | 4 | 13 | 2 | 0 | 2 |
| Surprise | 0 | 2 | 2 | 1 | 14 | 2 | 1 |
| Disgust | 0 | 2 | 1 | 2 | 0 | 15 | 2 |
| Neutral | 2 | 0 | 0 | 1 | 0 | 1 | 18 |
P5 actor confusion matrix from the RekEmozio dataset in the second phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 21 | 0 | 0 | 0 | 0 | 0 | 1 |
| Fear | 0 | 15 | 2 | 2 | 1 | 1 | 1 |
| Joy | 0 | 1 | 13 | 5 | 1 | 2 | 0 |
| Anger | 0 | 2 | 2 | 17 | 1 | 0 | 0 |
| Surprise | 0 | 0 | 1 | 5 | 16 | 0 | 0 |
| Disgust | 1 | 2 | 2 | 0 | 0 | 17 | 0 |
| Neutral | 1 | 0 | 0 | 0 | 0 | 1 | 20 |
P6 actor confusion matrix from the RekEmozio dataset in the second phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 20 | 0 | 1 | 0 | 1 | 0 | 0 |
| Fear | 0 | 16 | 0 | 1 | 3 | 2 | 0 |
| Joy | 2 | 0 | 16 | 1 | 0 | 1 | 2 |
| Anger | 2 | 5 | 3 | 8 | 3 | 1 | 0 |
| Surprise | 0 | 6 | 0 | 0 | 14 | 2 | 0 |
| Disgust | 3 | 2 | 2 | 3 | 0 | 9 | 3 |
| Neutral | 5 | 0 | 3 | 0 | 1 | 0 | 13 |
P7 actor confusion matrix from the RekEmozio dataset in the second phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 10 | 3 | 0 | 0 | 3 | 4 | 2 |
| Fear | 4 | 8 | 0 | 3 | 3 | 4 | 0 |
| Joy | 1 | 5 | 10 | 1 | 2 | 1 | 2 |
| Anger | 1 | 2 | 7 | 8 | 1 | 2 | 1 |
| Surprise | 2 | 4 | 1 | 0 | 12 | 2 | 1 |
| Disgust | 6 | 0 | 0 | 2 | 3 | 11 | 0 |
| Neutral | 4 | 1 | 1 | 2 | 0 | 0 | 14 |
P8 actor confusion matrix from the RekEmozio dataset in the second phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 12 | 5 | 0 | 0 | 0 | 2 | 3 |
| Fear | 5 | 8 | 1 | 0 | 0 | 6 | 2 |
| Joy | 1 | 1 | 10 | 5 | 4 | 0 | 1 |
| Anger | 0 | 1 | 5 | 11 | 5 | 0 | 0 |
| Surprise | 0 | 1 | 8 | 4 | 9 | 0 | 0 |
| Disgust | 3 | 7 | 0 | 0 | 0 | 11 | 1 |
| Neutral | 2 | 3 | 0 | 0 | 0 | 2 | 15 |
P9 actor confusion matrix from the RekEmozio dataset in the second phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 12 | 5 | 1 | 0 | 0 | 4 | 0 |
| Fear | 5 | 5 | 1 | 0 | 0 | 10 | 1 |
| Joy | 0 | 3 | 6 | 7 | 4 | 1 | 1 |
| Anger | 0 | 1 | 4 | 7 | 8 | 0 | 2 |
| Surprise | 1 | 1 | 4 | 5 | 11 | 0 | 0 |
| Disgust | 2 | 8 | 1 | 0 | 0 | 11 | 0 |
| Neutral | 0 | 2 | 0 | 2 | 0 | 2 | 16 |
P10 actor confusion matrix from the RekEmozio dataset in the second phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 19 | 0 | 0 | 0 | 0 | 1 | 2 |
| Fear | 0 | 16 | 1 | 0 | 1 | 4 | 0 |
| Joy | 0 | 1 | 12 | 4 | 1 | 4 | 0 |
| Anger | 0 | 1 | 3 | 16 | 0 | 2 | 0 |
| Surprise | 1 | 1 | 1 | 0 | 19 | 0 | 0 |
| Disgust | 0 | 4 | 3 | 4 | 1 | 10 | 0 |
| Neutral | 0 | 0 | 0 | 0 | 0 | 1 | 21 |
P11 actor confusion matrix from the RekEmozio dataset in the second phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 20 | 0 | 0 | 1 | 0 | 0 | 1 |
| Fear | 0 | 14 | 3 | 1 | 2 | 2 | 0 |
| Joy | 0 | 2 | 14 | 4 | 2 | 0 | 0 |
| Anger | 2 | 0 | 4 | 13 | 1 | 2 | 0 |
| Surprise | 0 | 3 | 3 | 0 | 11 | 4 | 1 |
| Disgust | 0 | 1 | 2 | 5 | 2 | 12 | 0 |
| Neutral | 2 | 0 | 0 | 0 | 0 | 1 | 19 |
P12 actor confusion matrix from the RekEmozio dataset in the second phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 20 | 0 | 0 | 0 | 0 | 1 | 1 |
| Fear | 1 | 13 | 1 | 0 | 0 | 6 | 1 |
| Joy | 0 | 1 | 18 | 2 | 1 | 0 | 0 |
| Anger | 0 | 1 | 2 | 16 | 2 | 1 | 0 |
| Surprise | 0 | 0 | 2 | 3 | 17 | 0 | 0 |
| Disgust | 2 | 3 | 0 | 1 | 1 | 11 | 4 |
| Neutral | 1 | 0 | 0 | 0 | 1 | 4 | 16 |
P13 actor confusion matrix from the RekEmozio dataset in the second phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 20 | 1 | 0 | 0 | 0 | 1 | 0 |
| Fear | 2 | 17 | 1 | 1 | 1 | 0 | 0 |
| Joy | 0 | 1 | 18 | 1 | 0 | 0 | 2 |
| Anger | 0 | 0 | 1 | 20 | 0 | 0 | 1 |
| Surprise | 0 | 3 | 1 | 1 | 17 | 0 | 0 |
| Disgust | 1 | 0 | 1 | 0 | 1 | 15 | 4 |
| Neutral | 0 | 1 | 1 | 0 | 1 | 3 | 16 |
P14 actor confusion matrix from the RekEmozio dataset in the second phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 7 | 1 | 0 | 0 | 0 | 7 | 7 |
| Fear | 2 | 14 | 1 | 0 | 2 | 3 | 0 |
| Joy | 0 | 1 | 14 | 2 | 5 | 0 | 0 |
| Anger | 0 | 0 | 2 | 17 | 2 | 0 | 1 |
| Surprise | 0 | 2 | 10 | 3 | 7 | 0 | 0 |
| Disgust | 5 | 2 | 0 | 0 | 0 | 14 | 1 |
| Neutral | 2 | 0 | 0 | 0 | 0 | 1 | 19 |
P15 actor confusion matrix from the RekEmozio dataset in the second phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 10 | 0 | 0 | 0 | 0 | 5 | 7 |
| Fear | 3 | 14 | 1 | 1 | 1 | 1 | 1 |
| Joy | 0 | 0 | 16 | 3 | 3 | 0 | 0 |
| Anger | 0 | 1 | 4 | 8 | 6 | 2 | 1 |
| Surprise | 0 | 0 | 5 | 3 | 13 | 1 | 0 |
| Disgust | 4 | 4 | 0 | 0 | 0 | 13 | 1 |
| Neutral | 4 | 0 | 0 | 0 | 0 | 1 | 17 |
P16 actor confusion matrix from the RekEmozio dataset in the second phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 16 | 0 | 0 | 0 | 0 | 4 | 2 |
| Fear | 0 | 9 | 5 | 4 | 2 | 0 | 2 |
| Joy | 1 | 3 | 7 | 3 | 4 | 3 | 1 |
| Anger | 2 | 4 | 3 | 8 | 4 | 1 | 0 |
| Surprise | 0 | 4 | 2 | 3 | 13 | 0 | 0 |
| Disgust | 4 | 0 | 0 | 0 | 2 | 12 | 4 |
| Neutral | 1 | 3 | 2 | 0 | 0 | 3 | 13 |
P17 actor confusion matrix from the RekEmozio dataset in the second phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 7 | 1 | 0 | 0 | 0 | 8 | 6 |
| Fear | 1 | 9 | 2 | 3 | 4 | 3 | 0 |
| Joy | 0 | 2 | 7 | 9 | 3 | 1 | 0 |
| Anger | 0 | 6 | 7 | 4 | 5 | 0 | 0 |
| Surprise | 0 | 2 | 3 | 2 | 10 | 5 | 0 |
| Disgust | 3 | 3 | 0 | 1 | 5 | 9 | 1 |
| Neutral | 8 | 0 | 1 | 0 | 0 | 2 | 11 |
A1 actor confusion matrix from the Emo-DB in the third phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 13 | 0 | 0 | 1 | 0 | 0 | 0 |
| Fear | 0 | 2 | 0 | 0 | 0 | 0 | 3 |
| Joy | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| Anger | 3 | 0 | 0 | 1 | 0 | 0 | 0 |
| Surprise | 0 | 0 | 0 | 0 | 7 | 0 | 0 |
| Disgust | 0 | 0 | 0 | 0 | 0 | 7 | 0 |
| Neutral | 0 | 2 | 0 | 0 | 0 | 0 | 9 |
A2 actor confusion matrix from the Emo-DB in the third phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 11 | 0 | 0 | 0 | 1 | 0 | 0 |
| Fear | 0 | 10 | 0 | 0 | 0 | 0 | 0 |
| Joy | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Anger | 0 | 0 | 0 | 6 | 0 | 0 | 0 |
| Surprise | 0 | 0 | 0 | 0 | 11 | 0 | 0 |
| Disgust | 0 | 0 | 0 | 0 | 0 | 9 | 0 |
| Neutral | 0 | 1 | 0 | 0 | 0 | 0 | 9 |
A3 actor confusion matrix from the Emo-DB in the third phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 12 | 0 | 1 | 0 | 0 | 0 | 0 |
| Fear | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| Joy | 1 | 0 | 7 | 0 | 0 | 0 | 0 |
| Anger | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| Surprise | 2 | 0 | 0 | 0 | 2 | 0 | 0 |
| Disgust | 0 | 0 | 0 | 0 | 0 | 4 | 0 |
| Neutral | 0 | 1 | 0 | 0 | 0 | 0 | 8 |
A4 actor confusion matrix from the Emo-DB in the third phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 10 | 0 | 0 | 0 | 0 | 0 | 0 |
| Fear | 0 | 7 | 0 | 0 | 0 | 1 | 0 |
| Joy | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
| Anger | 0 | 0 | 0 | 7 | 1 | 0 | 0 |
| Surprise | 0 | 0 | 0 | 0 | 4 | 0 | 0 |
| Disgust | 0 | 0 | 0 | 0 | 0 | 3 | 0 |
| Neutral | 0 | 2 | 0 | 0 | 0 | 0 | 2 |
A5 actor confusion matrix from the Emo-DB in the third phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 8 | 0 | 0 | 1 | 2 | 0 | 0 |
| Fear | 0 | 4 | 0 | 0 | 0 | 3 | 1 |
| Joy | 0 | 0 | 0 | 2 | 0 | 0 | 0 |
| Anger | 0 | 0 | 0 | 10 | 0 | 0 | 0 |
| Surprise | 2 | 0 | 0 | 1 | 5 | 0 | 0 |
| Disgust | 0 | 0 | 0 | 0 | 0 | 7 | 0 |
| Neutral | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
A6 actor confusion matrix from the Emo-DB in the third phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 12 | 0 | 0 | 0 | 0 | 0 | 0 |
| Fear | 0 | 3 | 0 | 0 | 2 | 0 | 0 |
| Joy | 0 | 0 | 1 | 1 | 0 | 0 | 0 |
| Anger | 0 | 1 | 0 | 5 | 0 | 0 | 0 |
| Surprise | 1 | 1 | 0 | 0 | 0 | 0 | 0 |
| Disgust | 0 | 0 | 0 | 0 | 0 | 4 | 0 |
| Neutral | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
A7 actor confusion matrix from the Emo-DB in the third phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 11 | 0 | 0 | 0 | 1 | 0 | 0 |
| Fear | 0 | 9 | 0 | 0 | 0 | 0 | 1 |
| Joy | 0 | 1 | 6 | 0 | 0 | 0 | 1 |
| Anger | 1 | 0 | 1 | 5 | 0 | 0 | 0 |
| Surprise | 1 | 0 | 0 | 0 | 9 | 0 | 0 |
| Disgust | 0 | 0 | 0 | 0 | 0 | 5 | 0 |
| Neutral | 0 | 6 | 0 | 0 | 0 | 0 | 3 |
A8 actor confusion matrix from the Emo-DB in the third phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 16 | 0 | 0 | 0 | 0 | 0 | 0 |
| Fear | 0 | 7 | 0 | 0 | 0 | 0 | 1 |
| Joy | 0 | 0 | 7 | 1 | 0 | 0 | 0 |
| Anger | 0 | 0 | 1 | 10 | 1 | 0 | 0 |
| Surprise | 6 | 0 | 0 | 0 | 2 | 0 | 0 |
| Disgust | 0 | 0 | 0 | 0 | 0 | 10 | 0 |
| Neutral | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
A9 actor confusion matrix from the Emo-DB in the third phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 11 | 0 | 0 | 1 | 1 | 0 | 0 |
| Fear | 0 | 7 | 0 | 0 | 0 | 1 | 1 |
| Joy | 0 | 0 | 4 | 0 | 0 | 0 | 1 |
| Anger | 2 | 0 | 0 | 6 | 0 | 0 | 0 |
| Surprise | 2 | 0 | 0 | 2 | 2 | 0 | 0 |
| Disgust | 0 | 0 | 0 | 0 | 0 | 4 | 0 |
| Neutral | 0 | 2 | 0 | 0 | 0 | 1 | 8 |
A10 actor confusion matrix from the Emo-DB in the third phase.
| Sadness | Fear | Joy | Anger | Surprise | Disgust | Neutral | |
|---|---|---|---|---|---|---|---|
| Sadness | 12 | 0 | 0 | 1 | 1 | 0 | 0 |
| Fear | 0 | 12 | 1 | 0 | 0 | 0 | 1 |
| Joy | 0 | 1 | 10 | 0 | 0 | 0 | 0 |
| Anger | 1 | 0 | 1 | 4 | 1 | 0 | 0 |
| Surprise | 1 | 0 | 0 | 0 | 10 | 0 | 0 |
| Disgust | 0 | 0 | 0 | 0 | 0 | 9 | 0 |
| Neutral | 0 | 2 | 0 | 0 | 0 | 0 | 3 |