| Literature DB >> 35214585 |
Juan Arturo Nolazco-Flores1, Marcos Faundez-Zanuy2, Oliver Alejandro Velázquez-Flores1, Carolina Del-Valle-Soto3, Gennaro Cordasco4, Anna Esposito4.
Abstract
In this research, we analyse data obtained from sensors when a user handwrites or draws on a tablet to detect whether the user is in a specific mood state. First, we calculated the features based on the temporal, kinematic, statistical, spectral and cepstral domains for the tablet pressure, the horizontal and vertical pen displacements and the azimuth of the pen's position. Next, we selected features using a principal component analysis (PCA) pipeline, followed by modified fast correlation-based filtering (mFCBF). PCA was used to calculate the orthogonal transformation of the features, and mFCBF was used to select the best PCA features. The EMOTHAW database was used for depression, anxiety and stress scale (DASS) assessment. The process involved the augmentation of the training data by first augmenting the mood states such that all the data were the same size. Then, 80% of the training data was randomly selected, and a small random Gaussian noise was added to the extracted features. Automated machine learning was employed to train and test more than ten plain and ensembled classifiers. For all three moods, we obtained 100% accuracy results when detecting two possible grades of mood severities using this architecture. The results obtained were superior to the results obtained by using state-of-the-art methods, which enabled us to define the three mood states and provide precise information to the clinical psychologist. The accuracy results obtained when detecting these three possible mood states using this architecture were 82.5%, 72.8% and 74.56% for depression, anxiety and stress, respectively.Entities:
Keywords: SVM; autoML; data augmentation; feature extraction; negative mood states recognition
Mesh:
Year: 2022 PMID: 35214585 PMCID: PMC8875261 DOI: 10.3390/s22041686
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Capture of sensor data from the tablet and pen when handwriting or drawing on a tablet. Sensor data are processed and sent to a clinical psychologist for analysis.
Figure 2Three classes before and after applying PCA. Three classes before and after applying PCA. In black are shown Iris-setosa’s observations; in green are shown Iris-versicolor’s observations; in yellow are shown Iris-virginica’s observations.
Interpretation of DASS scores [26]: binary labelling [15] and trinary labelling [this paper].
| Binary Labeling Used in [ | Trinary Labeling | Interpretation of DASS | Depression | Anxiety | Stress |
|---|---|---|---|---|---|
| Normal | Normal | Normal | 0–9 | 0–7 | 0–14 |
| Above normal | Mild | Mild | 10–13 | 8–9 | 15–18 |
| Above mild | Moderate | 14–20 | 10–14 | 19–25 | |
| Severe | 21–27 | 15–19 | 26–33 | ||
| Extremely severe | 28+ | 20+ | 34+ |
Task performed for each user.
| Tasks |
|---|
|
Drawing a copy of two overlapping pentagons |
|
Drawing a copy of a house |
|
Handwriting of four Italian words in capital letters |
|
Drawing circular loops with the left hand |
|
Drawing circular loops with the right hand |
|
Handwriting of an Italian sentence in cursive letters |
|
Drawing of a clock with twelve hours and hands |
Figure 3Examples of drawings for different tasks: (a) overlapping pentagons, (b) a house, (c) words, (d) circular loops drawn with the left hand, (e) circular loops drawn with the right hand, (f) a cursive sentence and (g) a clock.
Figure 4DASS score distribution in the EMOTHAW database. For binary labelling, the dark blue bars show normal scores, and the yellow and red bars show above-normal scores. For trinary labelling, the dark blue bars show normal scores, the yellow bars show mild scores and the red bars show above mild scores.
Figure 5Cross-tables showing the percentage of co-occurrence of mood states in the EMOTHAW database.
Figure 6Online time-series drawing for the pentagon-drawing task.
Figure 7System front-end that starts with the temporal features, kinematic features, statistical features, spectral-domain features and cepstral-domain features. These features are concatenated; then, they are orthogonalised using PCA. Finally, the features are selected using our mFCBF.
Temporal features for the user () for all tasks .
| Notation | Definition |
|---|---|
|
| Pen’s displacement at the sample |
|
| Trajectory taken during handwriting divided by the duration of writing |
|
| On-air pen duration |
|
| On-paper pen duration |
|
| Duration of the stroke |
|
| |
|
| Ratio of time the pen spent in air or on the tablet’s surface |
|
| Number of changes in the direction of the velocity vector |
|
| Number of changes in direction of the acceleration vector |
|
| |
|
|
Kinematic features for the user () for all tasks .
|
| Definition |
|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Statistical features for the user () for all tasks .
|
| Definition | |
|---|---|---|
|
|
| |
|
|
| |
|
|
| |
|
|
| |
|
|
| |
|
|
| |
|
|
| |
|
|
| |
|
|
| |
|
|
| |
|
|
| |
|
|
| |
|
|
| |
List of Machine Learning Models.
| Classification Algorithms |
|---|
| Deep neural network (DNN) |
| Distributed random forest (DRF) |
| Extremely randomised trees (ERT) |
| Generalised linear model (GLM) |
| Gradient boosting machine (GBM) |
| Naïve Bayes classifier (NBC) |
| Rulefit (RF) |
| Stacked ensembles (SE) |
| XGBoost (XGB) |
| Support vector machine (SVM) |
Parameters used when running H2O.
| Parameter | Value |
|---|---|
| max_runtime_secs | 200 |
| max_models | 15 |
| exclude_algos | GBM |
| seed | 1 |
| nfolds | 2 |
| stopping_metric | logloss |
Binary accuracy results for temporal features using SVM [16] and AutoML.
|
|
| |
|---|---|---|
| Depression | 71.47 | 80.70 |
| Anxiety | 58.53 | 71.93 |
| Stress | 61.24 | 66.67 |
Binary accuracy results for PCA, mFCBF and PCA-mFCBF feature selection methods using SVM and autoML.
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
| Depression | 74.01 | 79.82 | 74.01 | 88.60 | 87.40 | 92.10 |
| Anxiety | 62.20 | 71.05 | 72.44 | 81.58 | 83.46 | 85.96 |
| Stress | 57.48 | 68.42 | 70.07 | 81.58 | 85.03 | 88.59 |
Figure 8Selected features for (a) stress, (b) anxiety and (c) depression.
Binary accuracy results with and without adding kinematic and statistical features, PCA-mFBCF pipeline for features selection and using autoML.
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| Depression | 79.82 | 81.57 | 88.60 | 92.98 | 92.10 | 100.00 |
| Anxiety | 71.05 | 75.43 | 81.58 | 88.60 | 85.96 | 100.00 |
| Stress | 68.42 | 71.92 | 81.58 | 89.47 | 88.59 | 100.00 |
Trinary accuracy results for PCA, mFCBF and PCA–mFCBF pipeline for the FS methods and by using AutoML.
|
|
|
|
|
|
|---|---|---|---|---|
| Depression | 74.56 | 77.19 | 81.57 | 82.45 |
| Anxiety | 50.87 | 57.89 | 71.92 | 72.80 |
| Stress | 47.36 | 54.38 | 65.78 | 74.56 |