| Literature DB >> 22163713 |
Xiaoming Zhao1, Shiqing Zhang.
Abstract
Facial expression recognition is an interesting and challenging subject. Considering the nonlinear manifold structure of facial images, a new kernel-based manifold learning method, called kernel discriminant isometric mapping (KDIsomap), is proposed. KDIsomap aims to nonlinearly extract the discriminant information by maximizing the interclass scatter while minimizing the intraclass scatter in a reproducing kernel Hilbert space. KDIsomap is used to perform nonlinear dimensionality reduction on the extracted local binary patterns (LBP) facial features, and produce low-dimensional discrimimant embedded data representations with striking performance improvement on facial expression recognition tasks. The nearest neighbor classifier with the Euclidean metric is used for facial expression classification. Facial expression recognition experiments are performed on two popular facial expression databases, i.e., the JAFFE database and the Cohn-Kanade database. Experimental results indicate that KDIsomap obtains the best accuracy of 81.59% on the JAFFE database, and 94.88% on the Cohn-Kanade database. KDIsomap outperforms the other used methods such as principal component analysis (PCA), linear discriminant analysis (LDA), kernel principal component analysis (KPCA), kernel linear discriminant analysis (KLDA) as well as kernel isometric mapping (KIsomap).Entities:
Keywords: dimensionality reduction; facial expression recognition; isometric mapping; kernel; local binary patterns
Mesh:
Year: 2011 PMID: 22163713 PMCID: PMC3231257 DOI: 10.3390/s111009573
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1.An example of basic LBP operator.
Figure 2.An example of the extended LBP with different (P, R).
Figure 3.Examples of facial expression images from the JAFFE database.
Figure 4.Examples of facial expression images from the Cohn-Kanade database.
Figure 5.(a) Two eyes location of an original image from the Cohn-Kanade database. (b) The final cropped image of 110 × 150 pixels.
Figure 6.The basic system structure for facial expression recognition experiments using dimensionality reduction methods.
Figure 7.Performance comparisons of different methods on the JAFFE database.
The best accuracy (%) of different methods on the JAFFE database.
| Dimension | 20 | 6 | 40 | 6 | 70 | 20 |
| Accuracy | 78.09 ± 4.2 | 80.81 ± 3.6 | 78.47 ± 4.0 | 80.93 ± 3.9 | 69.52 ± 4.7 | 81.59 ± 3.5 |
Confusion matrix of 7-class facial expression recognition results obtained by KDIsomap on the JAFFE database.
| 0 | 3.58 | 0 | 3.32 | 0 | 3.00 | ||
| 0 | 3.12 | 0 | 0 | 0 | 3.34 | ||
| 6.45 | 3.21 | 0 | 3.29 | 9.68 | 15.49 | ||
| 0 | 3.13 | 3.54 | 0 | 6.66 | 0 | ||
| 7.42 | 0 | 3.68 | 0 | 7.42 | 0 | ||
| 0 | 0 | 12.48 | 6.25 | 3.13 | 0 | ||
| 0 | 0 | 17.23 | 3.45 | 0 | 0 | ||
Figure 8.Performance comparisons of different methods on the Cohn-Kanade database.
The best accuracy (%) of different methods on the Cohn-Kanade database.
| Dimension | 55 | 6 | 60 | 6 | 40 | 30 |
| Accuracy | 92.43 ± 3.3 | 90.18 ± 3.0 | 92.59 ± 3.6 | 93.32 ± 3.0 | 75.81 ± 4.2 | 94.88 ± 3.1 |
Confusion matrix of 7-class facial expression recognition results obtained by KDIsomap on the Cohn-Kanade database.
| 0 | 0.96 | 0 | 0 | 1.44 | 0 | ||
| 0.31 | 0.28 | 0 | 1.97 | 0.30 | 1.61 | ||
| 2.15 | 1.02 | 0 | 5.76 | 0 | 1.23 | ||
| 0.24 | 0.24 | 1.99 | 0 | 0 | 0.35 | ||
| 0 | 1.16 | 1.28 | 3.00 | 0.35 | 0 | ||
| 0 | 0 | 0 | 0.38 | 0 | 0 | ||
| 2.12 | 1.79 | 3.27 | 0.44 | 1.79 | 0.44 | ||
Computational and memory complexity of different dimensionality reduction methods.
| Computational complexity | ||||||
| Memory complexity |