| Literature DB >> 36071434 |
Amira Gaber1, Mona F Taher2, Manal Abdel Wahed2, Nevin Mohieldin Shalaby3, Sarah Gaber4.
Abstract
Facial paralysis (FP) is an inability to move facial muscles voluntarily, affecting daily activities. There is a need for quantitative assessment and severity level classification of FP to evaluate the condition. None of the available tools are widely accepted. A comprehensive FP evaluation system has been developed by the authors. The system extracts real-time facial animation units (FAUs) using the Kinect V2 sensor and includes both FP assessment and classification. This paper describes the development and testing of the FP classification phase. A dataset of 375 records from 13 unilateral FP patients and 1650 records from 50 control subjects was compiled. Artificial Intelligence and Machine Learning methods are used to classify seven FP categories: the normal case and three severity levels: mild, moderate, and severe for the left and right sides. For better prediction results (Accuracy = 96.8%, Sensitivity = 88.9% and Specificity = 99%), an ensemble learning classifier was developed rather than one weak classifier. The ensemble approach based on SVMs was proposed for the high-dimensional data to gather the advantages of stacking and bagging. To address the problem of an imbalanced dataset, a hybrid strategy combining three separate techniques was used. Model robustness and stability was evaluated using fivefold cross-validation. The results showed that the classifier is robust, stable and performs well for different train and test samples. The study demonstrates that FAUs acquired by the Kinect sensor can be used in classifying FP. The developed FP assessment and classification system provides a detailed quantitative report and has significant advantages over existing grading scales.Entities:
Keywords: Ensemble classification; Facial animation units; Facial paralysis; Grading; Kinect; Machine learning
Mesh:
Year: 2022 PMID: 36071434 PMCID: PMC9449956 DOI: 10.1186/s12938-022-01036-0
Source DB: PubMed Journal: Biomed Eng Online ISSN: 1475-925X Impact factor: 3.903
Comparison of recent FP classification systems
| References | Objective | Facial movements | Ground truth | Tools | Dataset | Performance | Limitations |
|---|---|---|---|---|---|---|---|
| Chaoqun Jiang et al. 2020 [ | FP classification (6 FP grades) | HB | LSCI scanners K-NN SVM NN | RGB images blood flow images 80 unilateral FP patients | Accuracy NN 96.77% K-NN 67.74% SVM 86.77% | ||
| Xin Liu et al. 2020 [ | FP classification (3 severity levels) | Rest Open mouth Closure the eyes lightly Elevation of eyebrows Pursing lips etc. | HB | PHCNN-LSTM | YouTube Facial Palsy Database Extended CohnKanade Database | Accuracy PHCNN-LSTM 0.9481% | Few public FP databases available Lack of various facial expressions in the datasets |
| Jocelyn Barbosa et al. 2019 [ | Health classification (normal/patient) FP classification (PP/CP) | Rest Raising of eyebrows Screwing-up of nose Smiling with showing of teeth | RLR RF SVM DT NB Hybrid | 440 2D images 60 normal subjects 40 PP patients 10 CP patients | Sensitivity RLR 85.9% RF 92.3% SVM 72.5% DT 90.2% NB 79.9% | No evaluation of FP degree No classification of facial paralysis grade Small dataset | |
| Anping Song et al. 2018 [ | FP classification (7 categories) | Rest Eye closed Eyebrows raised Cheeks puffed Grinning Nose Wrinkled Whistling | FNGS2.0 | IDFNP (Inception v3 CNN + DeepID CNN) | 2D images 860 FP patients | Accuracy 97.5% | |
| Muhammad Sajid et al. 2018 [ | FP classification (5 grades) | HB | CNNs GAN | 2D images 2000 Patients | Accuracy 92.60% | ||
| Banita and Tanwar. 2018 [ | Evaluation of FP 3 categories for patient (can be cured, cannot be cured, may or may not be cured) | HB | Fuzzy logic | 3D images 82 patients | |||
| Ting Wang et al. 2015 [ | FP classification (6 grades) | Raise eyebrows Close eyes Screw up nose Plump cheeks Open mouth | HB | FPASMs SVM (RBF Kernel) | 62 FP patients single-side and both-side | ||
| Anguraj and Padma 2015 [ | Classifying the severity of facial paralysis (normal–mild–moderate–severe) | Closing of eye Raising of eyebrows Opening of mouth Screwing of nose | SPSA FFBPN | 9 images (2D and grayscale) | Accuracy 94% Sensitivity 90% | 2D grayscale images Small number of images |
CNNs: Convolutional Neural Networks, HB: House–Brackmann, LSCI: laser speckle contrast imaging, K-NN: K-nearest neighbor, SVM: Support Vector Machine, NN: Neural Network, PHCNN: Parallel Hierarchy Convolutional Neural Network, LSTM: Long Short-Term Memory, FNGS2.0: Facial Nerve Grading System 2.0, IDFNP: Inception-Deep Facial Nerve Paralysis, GAN: Generative Adversarial Network, FPASMs: Facial Paralysis Active Shape Models, RF: Random Forest, RLR: Regularized Logistic Regression, DT: Decision Tree, NB: Naïve Bayes, SPSA: Salient Point Selection Algorithm, FFBPN: Feed Forward Back Propagation Neural Network
Maximum cross-validation accuracy and its corresponding best values of C and gamma for the five SVM classifiers (without data augmentation)
| Classifier | #1 Smiling | #2 Closing eyes | #3 Raising eyebrows | #4 Blowing cheeks | #5 Whistling |
|---|---|---|---|---|---|
| Accuracy % | 96 | 91 | 84 | 90 | 91 |
| C | 107 | 100 | 107 | 108 | 100 |
| Gamma | 1 | 10 | 10 | 0.1 | 10 |
The ranges of C and gamma are (10–3, 10–2 ……. 108) and (10–3, 10–2 ……. 103), respectively
Maximum cross-validation accuracy and its corresponding best values of hyperparameters for the five Random Forests classifiers (without data augmentation)
| Classifier | #1 Smiling | #2 Closing eyes | #3 Raising eyebrows | #4 Blowing cheeks | #5 Whistling |
|---|---|---|---|---|---|
| Accuracy % | 76 | 56 | 61 | 88 | 91 |
| max_depth | 6 | 8 | 8 | 8 | 10 |
| n_estimators | 100 | 30 | 100 | 50 | 30 |
The values of max_depth and n_estimators are (1, 2 …. 10) and (5, 10, 20, 30, 50,100), respectively
Fig. 1Variation of K-NN accuracies with changing the number of nearest neighbors parameter (from 1 to 9) in the five classifiers: a smiling, b closing eyes, c raising eyebrows, d blowing cheeks, and e whistling
Performance measure of the five individual SVM classifiers (with and without threshold change) and the ensemble-based classifier
| Classifier | #1 Smiling | #2 Closing eyes | #3 Raising eyebrows | #4 Blowing cheeks | #5 Whistling | Ensemble | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Threshold change? | No | Yes | No | Yes | No | Yes | No | Yes | No | Yes | Yes |
| Accuracy % | 93.6 | 95.2 | 90.4 | 92 | 87.2 | 92 | 90.4 | 93.6 | 77.6 | 87.2 | 96.8 |
| Precision % | 88 | 92 | 80 | 84 | 72 | 84 | 80 | 88 | 48 | 72 | 96 |
| Sensitivity % | 81.5 | 85.2 | 74.1 | 77.8 | 66.7 | 77.8 | 74.1 | 81.5 | 44.4 | 66.7 | 88.9 |
| F1-score % | 84.6 | 88.5 | 76.9 | 80.8 | 69.2 | 80.8 | 76.9 | 84.6 | 46.2 | 69.2 | 92.3 |
| Specificity % | 96.9 | 98 | 94.9 | 95.9 | 92.9 | 95.9 | 94.9 | 96.9 | 86.7 | 92.9 | 99 |
Performance measure for each individual category using the ensemble-based classifier
| Class | N | L_MI | R_MI | R_MO | R_S |
|---|---|---|---|---|---|
| Accuracy % | 88 | 92 | 100 | 96 | 92 |
| Sensitivity % | 100 | 71.3 | 100 | 87.5 | 100 |
| Specificity % | 84.2 | 100 | 100 | 100 | 100 |
Confusion Matrix for the ensemble classifier
| Predicted class | N | L_MI | R_MI | R_MO | R_S |
|---|---|---|---|---|---|
| Normal (N) | 0 | 0 | 0 | 0 | |
| Left Mild (L_MI) | 10 | 0 | 0 | 0 | |
| Right Mild (R_MI) | 0 | 0 | 0 | 0 | |
| Right Moderate (R_MO) | 5 | 0 | 0 | 0 | |
| Right Severe (R_S) | 0 | 0 | 0 | 0 |
The bold indicates the true predicted values
SVM models performances measures using fivefold CV (with and without data augmentation)
| Classifier | #1 Smiling | #2 Closing eyes | #3 Raising eyebrows | #4 Blowing cheeks | #5 Whistling | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Augmentation? | No | Yes | No | Yes | No | Yes | No | Yes | No | Yes |
| Accuracy % | 90 ± 7 | 98 ± 1 | 86 ± 4 | 98 ± 1 | 73 ± 8 | 94 ± 3 | 82 ± 8 | 93 ± 3 | 87 ± 5 | 96 ± 4 |
| Precision % | 90 ± 10 | 98 ± 2 | 81 ± 9 | 98 ± 1 | 71 ± 15 | 94 ± 4 | 87 ± 5 | 93 ± 3 | 90 ± 6 | 95 ± 4 |
| Sensitivity % | 89 ± 11 | 98 ± 2 | 78 ± 10 | 98 ± 1 | 70 ± 10 | 93 ± 4 | 84 ± 5 | 93 ± 4 | 87 ± 4 | 96 ± 3 |
| F1-score % | 88 ± 12 | 98 ± 2 | 77 ± 9 | 98 ± 1 | 68 ± 14 | 93 ± 4 | 84 ± 5 | 93 ± 4 | 85 ± 6 | 95 ± 4 |
Fig. 2Block diagram of Facial Paralysis Evaluation system
Characteristics of FP patients
| Patient # | Gender | Age (years) | Paralysis side | Duration of having FP (weeks) | Degree of paralysis | Type of paralysis |
|---|---|---|---|---|---|---|
| 1 | Female | 30 | Left | 8 | Mild | Chronic |
| 2 | Female | 32 | Left | 9 | Mild | Chronic |
| 3 | Female | 40 | Right | 10 | Moderate | Chronic |
| 4 | Female | 38 | Right | 11 | Moderate | Chronic |
| 5 | Male | 17 | Right | 3 | Moderate | Subacute |
| 6 | Male | 16 | Right | 4 | Moderate | Subacute |
| 7 | Male | 18 | Right | 3 | Mild | Subacute |
| 8 | Male | 13 | Left | 12 | Mild | Chronic |
| 9 | Female | 55 | Right | 10 | Moderate | Chronic |
| 10 | Female | 60 | Right | 2 | Severe | Acute |
| 11 | Male | 52 | Right | 1 | Severe | Acute |
| 12 | Female | 58 | Left | 2 | Mild | Acute |
| 13 | Female | 50 | Right | 2 | Severe | Acute |
Seven categories of FP classification and their frequencies in the dataset
| Category | Description | Frequency |
|---|---|---|
| N | Normal | 289 |
| L_MI | Left mild facial paralysis | 127 |
| L_MO | Left moderate facial paralysis | 0 |
| L_S | Left severe facial paralysis | 0 |
| R_MI | Right mild facial paralysis | 35 |
| R_MO | Right moderate facial paralysis | 177 |
| R_S | Right severe facial paralysis | 36 |
Fig. 3Detailed analysis of features in each stage of FP evaluation
Fig. 4Framework of the classifiers and the corresponding features from the grading and symmetry modules
Fig. 5Framework of Facial Paralysis Classification approach
Fig. 6Flowchart of the rule-based classifier procedure