| Literature DB >> 32673247 |
Bo Jin1, Yue Qu1, Liang Zhang2, Zhan Gao3.
Abstract
BACKGROUND: The number of patients with neurological diseases is currently increasing annually, which presents tremendous challenges for both patients and doctors. With the advent of advanced information technology, digital medical care is gradually changing the medical ecology. Numerous people are exploring new ways to receive a consultation, track their diseases, and receive rehabilitation training in more convenient and efficient ways. In this paper, we explore the use of facial expression recognition via artificial intelligence to diagnose a typical neurological system disease, Parkinson disease (PD).Entities:
Keywords: Parkinson disease; artificial intelligence; face landmarks; machine learning
Mesh:
Year: 2020 PMID: 32673247 PMCID: PMC7382014 DOI: 10.2196/18697
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Figure 1Smiles of a patient with Parkinson disease (left) and person without Parkinson disease (right).
Example data collected using the registration form to collect data via video of patients with Parkinson disease.
| Patient Number | Age (years) | Gender | Confirmed | Other neurological disease | Length of disease (month) | Date of collection |
| 1 | 60 | Male | Yes | No | 10 | 11/13/2017 |
| 2 | 55 | Male | Yes | No | 24 | 11/13/2017 |
| 3 | 60 | Male | Yes | No | 10 | 11/13/2017 |
| 4 | 63 | Female | Yes | No | 14 | 11/13/2017 |
Figure 2Face key points (n=106) returned by the Face++ interface.
Figure 3Relative face coordinate system, where the Cartesian, or absolute, coordinate system is represented by the black coordinate system, which was used to record the position of pixels in the image, and the non-Cartesian, or relative, coordinate system is represented by the blue coordinate system, which was used to record the relative position of key points on the face.
Figure 4Coordinate system conversion, where the absolute coordinates (m1,n1), (m2, n2), (a1, a2), and (b1, b2) can be converted to the relative coordinates (0,0), (x,y), (1,0), and (0,1), respectively.
Video data statistics.
| Data statistics | Video data |
| Creation date | 3/15/2018 |
| Number of patients with Parkinson disease | 33 |
| Number of people without Parkinson disease | 31 |
| Number of records | 176 |
| Number of features | 848 |
| Task | Classification |
Experimental results of common machine learning algorithms.
| Algorithm | Precision | Recall | F1 value |
| LRa | 0.98 | 0.98 | 0.98 |
| SVMb | 0.99 | 0.99 | 0.99 |
| DTc | 0.93 | 0.93 | 0.93 |
| RFd | 0.98 | 0.98 | 0.98 |
aLR: logistic regression.
bSVM: support vector machine.
cDT: decision tree.
dRF: random forest.
Number of points that reached significance for each feature type.
| Feature name | Number of key points that reached significance | |
|
| ||
|
| 83 | 69 |
|
| 56 | 45 |
| Cov(X, Y) | 97 | 87 |
| Cov(X_abs, Y_abs) | 13 | 12 |
| Jitter | 106 | 106 |
| Jitter_PPQ5 | 106 | 106 |
| Jitter_rap | 106 | 90 |
| Jitter_ddp | 106 | 106 |
| Total | 673 | 621 |
Figure 5The effects of least absolute shrinkage and selection operator (LASSO) feature compression on logistic regression (LR) and support vector machine (SVM) models.
Figure 6The key points that have a large influence on the classification result.
Experimental results of neural network models.
| Algorithm | Precision | Recall | F1 value |
| LSTMa | 0.86 | 0.66 | 0.75 |
| RNNb | 0.48 | 0.46 | 0.47 |
aLSTM: long short-term memory.
bRNN: recurrent neural network.
Comparison with a selection of prior work.
| Work | Target and result | Data | Feature | Technology |
| Bandini et al [ | Found PDa patients have lower average facial expression movement distance; facial expression recognition for PD | 17 PD patients, | Average distance of 49 facial key points in the facial expression movement | Face tracing, SVMb |
| Rajnoha et al [ | Identified PD hypomimia by analyzing static facial images; less accurate compared with video-recording processing method. | 50 PD patients, | 128 facial measures (embedding) by CNNc | Face detector-based (HOGd), CNN, traditional classifiers (eg, random forests, XGBoost) |
| PARKe framework by Langevin et al [ | PARK instructs and guides users through 6 motor tasks and 1 audio task selected from MDS-UPDRSf and records their performance by videos | 127 PD patients, | Facial features: facial action units (AUs); | OpenFace tool version 2, FFT |
| Our method | Proposed facial landmark features from videos to diagnose PD using facial expressions and achieved outstanding performance | 33 PD patients, | 848 facial expression amplitude features and tremor features of facial key points; | Face ++, traditional classifiers (LRh, SVM, DTi, RFj), LSTMk, LASSOl |
aPD: Parkinson disease.
bSVM: support vector machine.
cCNN: convolutional neural network.
dHOG: histogram of oriented gradients.
ePARK: Parkinson's Analysis with Remote Kinetic-tasks.
fMDS-UPDRS: Movement Disorder Society Unified Parkinson Disease Rating Scale.
gFFT: fast fourier transform.
hLR: logistic regression.
iDT: decision tree.
jRF: random forest.
kLSTM: long short-term memory.
lLASSO: least absolute shrinkage and selection operator.