| Literature DB >> 22737035 |
Shiqing Zhang1, Xiaoming Zhao, Bicheng Lei.
Abstract
Recently, compressive sensing (CS) has attracted increasing attention in the areas of signal processing, computer vision and pattern recognition. In this paper, a new method based on the CS theory is presented for robust facial expression recognition. The CS theory is used to construct a sparse representation classifier (SRC). The effectiveness and robustness of the SRC method is investigated on clean and occluded facial expression images. Three typical facial features, i.e., the raw pixels, Gabor wavelets representation and local binary patterns (LBP), are extracted to evaluate the performance of the SRC method. Compared with the nearest neighbor (NN), linear support vector machines (SVM) and the nearest subspace (NS), experimental results on the popular Cohn-Kanade facial expression database demonstrate that the SRC method obtains better performance and stronger robustness to corruption and occlusion on robust facial expression recognition tasks.Entities:
Keywords: Gabor wavelets representation; compressive sensing; corruption and occlusion; facial expression recognition; local binary patterns; sparse representation
Year: 2012 PMID: 22737035 PMCID: PMC3376615 DOI: 10.3390/s120303747
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1.(a) The real part of the Gabor wavelet kernels at five scales and eight orientations; (b) The magnitude of the Gabor wavelet kernels at five scales.
Figure 2.The process of LBP features extraction
Figure 3.Examples of facial expression images from the Cohn-Kanade database.
Recognition results (%) of different methods with the raw pixels and LBP features.
| Raw pixels | 92.29±1.9 | 93.80±2.1 | 92.74±1.9 | 94.76±1.7 |
| LBP | 96.22±4.6 | 95.24±4.2 | 95.71±5.8 | 97.14±3.9 |
Figure 4.Recognition results of different methods with reduced dimension of Gabor wavelets representation.
Best results (%) of different methods with reduced dimension of Gabor wavelets representation.
| Dimension | 60 | 80 | 60 | 50 |
| Accuracy | 97.14±3.7 | 96.17±4.0 | 96.94±4.3 | 98.10±3.8 |
Confusion matrix of recognition results of SRC with the raw pixels.
| Anger | 10 | 0 | 0 | 0 | 0 | 0 | |
| Joy | 0 | 0 | 0 | 0 | 0 | 0 | |
| Sadness | 0 | 0 | 0 | 10 | 0 | 0 | |
| Surprise | 0 | 0 | 0 | 0 | 0 | 0 | |
| Disgust | 0 | 0 | 0 | 0 | 0 | 0 | |
| Fear | 0 | 0 | 0 | 0 | 0 | 0 | |
| Neutral | 0 | 0 | 6.67 | 0 | 3.33 | 6.67 |
Confusion matrix of recognition results of SRC with 50 reduced Gabor wavelets representation.
| Anger | 0 | 0 | 0 | 0 | 0 | 0 | |
| Joy | 0 | 0 | 0 | 0 | 0 | 0 | |
| Sadness | 0 | 0 | 0 | 0 | 0 | 0 | |
| Surprise | 0 | 0 | 0 | 0 | 0 | 3.33 | |
| Disgust | 10 | 0 | 0 | 0 | 0 | 0 | |
| Fear | 0 | 0 | 0 | 0 | 0 | 0 | |
| Neutral | 0 | 0 | 0 | 0 | 0 | 0 |
Figure 5.A corrupted image example (a) Original image of 640 × 490 pixels; (b) Resized image of 32 × 32 pixels; (c) 50% corrupted image.
Figure 6.Recognition accuracy under different percentage corrupted.
Figure 7.An occluded image example (a) Baboon; (b) Original image of 640 × 490 pixels; (c) Resized image of 32 × 32 pixels; (d) 30% occluded image.
Figure 8.Recognition accuracy under different percentage occluded.
Confusion matrix of recognition results of SRC with LBP features.
| Anger | 0 | 0 | 0 | 0 | 0 | 10 | |
| Joy | 0 | 0 | 0 | 0 | 0 | 0 | |
| Sadness | 3.33 | 0 | 0 | 0 | 0 | 6.67 | |
| Surprise | 0 | 0 | 0 | 0 | 0 | 0 | |
| Disgust | 0 | 0 | 0 | 0 | 0 | 0 | |
| Fear | 0 | 0 | 0 | 0 | 0 | 0 | |
| Neutral | 0 | 0 | 0 | 0 | 0 | 0 |