| Literature DB >> 31791234 |
Olalekan Agbolade1, Azree Nazri2, Razali Yaakob3, Abdul Azim Ghani4, Yoke Kqueen Cheah5.
Abstract
BACKGROUND: Expression in H-sapiens plays a remarkable role when it comes to social communication. The identification of this expression by human beings is relatively easy and accurate. However, achieving the same result in 3D by machine remains a challenge in computer vision. This is due to the current challenges facing facial data acquisition in 3D; such as lack of homology and complex mathematical analysis for facial point digitization. This study proposes facial expression recognition in human with the application of Multi-points Warping for 3D facial landmark by building a template mesh as a reference object. This template mesh is thereby applied to each of the target mesh on Stirling/ESRC and Bosphorus datasets. The semi-landmarks are allowed to slide along tangents to the curves and surfaces until the bending energy between a template and a target form is minimal and localization error is assessed using Procrustes ANOVA. By using Principal Component Analysis (PCA) for feature selection, classification is done using Linear Discriminant Analysis (LDA). RESULT: The localization error is validated on the two datasets with superior performance over the state-of-the-art methods and variation in the expression is visualized using Principal Components (PCs). The deformations show various expression regions in the faces. The results indicate that Sad expression has the lowest recognition accuracy on both datasets. The classifier achieved a recognition accuracy of 99.58 and 99.32% on Stirling/ESRC and Bosphorus, respectively.Entities:
Keywords: 3D faces; Automatic facial landmark; Facial expression recognition; LDA; Multi-point warping; PCA
Mesh:
Year: 2019 PMID: 31791234 PMCID: PMC6889223 DOI: 10.1186/s12859-019-3153-2
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Architecture of the proposed method
Procrustes ANOVAs for facial shape on Stirling and Bosphorus datasets
| Effect | SS | MS | DF | F | P |
|---|---|---|---|---|---|
| Stirling | |||||
| Expression | 0.178563 | 2.39E-05 | 7465 | 8.36 | <.0001 |
| Individual | 1.02071 | 2.86E-06 | 356,827 | 1.86 | <.0001 |
| Error | 0.041283 | 1.54E-06 | 26,874 | ||
| Bosphorus | |||||
| Expression | 0.308295 | 4.13E-05 | 7465 | 16.4 | <.0001 |
| Individual | 0.65404 | 2.52E-06 | 259,782 | 0.72 | 1 |
| Error | 0.09455 | 3.52E-06 | 26,874 | ||
Fig. 2Visualisation of the expression group. PC deformation and the percentage variance of selected principal components, showing only the first principal component which accounted for the largest variation on both datasets
Fig. 3Scatterplot of Expression group. Separability and distribution of expression group using scatter plot. a Stirling dataset. b Bosphorus dataset
Confusion matrix for six group facial expression recognition on Stirling dataset
| % | Ang | Dis | Fea | Sad | Hap | Sur |
|---|---|---|---|---|---|---|
| Ang | 100 | 0 | 0 | 0 | 0 | 0 |
| Dis | 0 | 100 | 0 | 0 | 0 | 0 |
| Fea | 0 | 0 | 100 | 0 | 0 | 0 |
| Sad | 0 | 0 | 2.44 | 97.56 | 0 | 0 |
| Hap | 0 | 0 | 0 | 0 | 100 | 0 |
| Sur | 0 | 0 | 0 | 0 | 0 | 100 |
Confusion matrix for six group facial expression recognition in Bosphorus dataset
| % | Ang | Dis | Fea | Hap | Sad | Sur |
|---|---|---|---|---|---|---|
| Ang | 100 | 0 | 0 | 0 | 0 | 0 |
| Dis | 0 | 100 | 0 | 0 | 0 | 0 |
| Fea | 0 | 0 | 100 | 0 | 0 | 0 |
| Hap | 0 | 0 | 0 | 100 | 0 | 0 |
| Sad | 0 | 0 | 0 | 0 | 95.45 | 4.55 |
| Sur | 0 | 0 | 0 | 0 | 0 | 100 |
Performance metrics reports for facial expression on Stirling and Bosphorus dataset
| Stirling Dataset | Bosphorus Dataset | |||||
|---|---|---|---|---|---|---|
| Exp | Precision | Sensitivity | Specificity | Precision | Sensitivity | Specificity |
| Ang | 1 | 1 | 1 | 1 | 1 | 1 |
| Dis | 1 | 1 | 1 | 1 | 1 | 1 |
| Fea | 1 | 0.975 | 1 | 1 | 1 | 1 |
| Sad | 0.976 | 1 | 0.995 | 0.954 | 1 | 0.992 |
| Hap | 1 | 1 | 1 | 1 | 1 | 1 |
| Sur | 1 | 1 | 1 | 1 | 0.96 | 1 |
| Avg/Total | 0.997 | 0.996 | 0.999 | 0.992 | 0.993 | 0.999 |
Comparison of mean localization error with state-of-the-art method on Bosphorus datasets
| Author | Method | Landmark | Mean error (mm) |
|---|---|---|---|
| [ | CLM-ICA-GGD | 33 | 2.71 |
| [ | Geometric Descriptor | 13 | 4.75 |
| This work | Multi-points warping | 500 | 0.094 |
Comparison of classification rates with state-of-the-art method on Stirling and Bosphorus datasets
| Author | Method | Dataset | Classifier | Accuracy (%) |
|---|---|---|---|---|
| [ | Local shape descriptor | Bosphorus | RANSAC | 93.40 |
| [ | KMTS | Bosphorus | TPWCRC | 98.90 |
| [ | Face augmentation technique | Bosphorus | CNN | 99.20 |
| [ | Geometric framework | Bosphorus | – | 87.06 |
| [ | Extended SIFT-like matching | Bosphorus | – | 98.80 |
| [ | 3D-PSFD | – | LDA | 83.60 |
| [ | Differential Evolution based optimization | Bosphorus | SVM | 84.00 |
| [ | Extended LBP | Bosphorus | SVM | 76.98 |
| [ | Covariance matrices of descriptors | Bosphorus | SVM | 86.17 |
| This work | Multi-points warping | Stirling/ESRC | LDA | 99.58 |
| Bosphorus | LDA | 99.32 |
Comparison of classification rates by each expression with state-of-the-art method on Bosphorus datasets
| Author | Hap (%) | Fea (%) | Dis (%) | Ang (%) | Sad (%) | Sur (%) | Overall (%) |
|---|---|---|---|---|---|---|---|
| [ | 97.50 | 86.25 | 90.00 | 82.50 | 67.50 | 83.75 | 84.10 |
| [ | 93.00 | 81.00 | 85.25 | 86.25 | 79.75 | 90.50 | 86.17 |
| This work | 100 | 100 | 100 | 100 | 95.45 | 100 | 99.32 |
Fig. 4A three-dimensional mesh template with the location of the prominent point at the center of the face for pose-invariant correction. The 16 fixed anatomical landmarks are shown in red color. The blue color on the Pronasale indicates the point where the semi-landmarks begin the sliding process
Anchor anatomical points and descriptions
| No | Anchor Landmarks | 3D Notation | Description |
|---|---|---|---|
| 1 | Endocanthion left | enl | Left most medial point of the palpebral fissure, at the inner commissure of the eye |
| 2 | Exocanthion left | exl | Left most lateral point of the palpebral fissure, at the outer commissure of the eye |
| 3 | Exocanthion right | exr | Right most lateral point of the palpebral fissure, at the outer commissure of the eye |
| 4 | Endocanthion right | enr | Right most medial point of the palpebral fissure, at the inner commissure of the eye |
| 5 | Sellion | se | Deepest midline point of the nasofronal angle |
| 6 | Pronasale | pr | The most anteriorly protruded point of the apex nasi |
| 7 | subnasale | su | Median point at the junction between the lower border of the nasal septum and the philtrum area |
| 8 | Alare left | all | Left most lateral point on the nasal ala |
| 9 | Alare right | alr | Right most lateral point on the nasal ala |
| 10 | Cheilion left | chl | Left outer corners of the mouth where the outer edges of the upper and lower vermilions meet |
| 11 | Cheilion right | chr | Right outer corners of the mouth where the outer edges of the upper and lower vermilions meet |
| 12 | Labiale superius | ls | Midpoint of the vermilion border of the upper lip |
| 13 | Labiale inferius | li | Midpoint of the vermilion border of the lower lip |
| 14 | Gnathion | gn | Median point halfway between pogonion and menton |
| 15 | Obelion left | obl | Left median point where the sagittal suture intersects with a transverse line connecting parietal foramina |
| 16 | Obelion right | obr | Right median point where the sagittal suture intersects with a transverse line connecting parietal foramina |
Fig. 5A three-dimensional mesh template with 500 landmarks for reference model. Showing 16 fixed anatomical points and 484 semi-landmarks with 1.5 mm radius. a Frontal skewed view. b Profile view
Fig. 6Partially warped 500 sliding point on target facial surface. a Angry. b Disgust. c Fear. d Sad. e Surprise. f Happy
Fig. 7Complete and homologous 500 warped points on target mesh. a Angry. b Disgust. c Fear. d Sad. e Surprise. f Happy