| Literature DB >> 31547220 |
Philip Schmidt1,2, Attila Reiss3, Robert Dürichen4, Kristof Van Laerhoven5.
Abstract
Affect recognition is an interdisciplinary research field bringing together researchers from natural and social sciences. Affect recognition research aims to detect the affective state of a person based on observables, with the goal to, for example, provide reasoning for the person's decision making or to support mental wellbeing (e.g., stress monitoring). Recently, beside of approaches based on audio, visual or text information, solutions relying on wearable sensors as observables, recording mainly physiological and inertial parameters, have received increasing attention. Wearable systems enable an ideal platform for long-term affect recognition applications due to their rich functionality and form factor, while providing valuable insights during everyday life through integrated sensors. However, existing literature surveys lack a comprehensive overview of state-of-the-art research in wearable-based affect recognition. Therefore, the aim of this paper is to provide a broad overview and in-depth understanding of the theoretical background, methods and best practices of wearable affect and stress recognition. Following a summary of different psychological models, we detail the influence of affective states on the human physiology and the sensors commonly employed to measure physiological changes. Then, we outline lab protocols eliciting affective states and provide guidelines for ground truth generation in field studies. We also describe the standard data processing chain and review common approaches related to the preprocessing, feature extraction and classification steps. By providing a comprehensive summary of the state-of-the-art and guidelines to various aspects, we would like to enable other researchers in the field to conduct and evaluate user studies and develop wearable systems.Entities:
Keywords: affect recognition; affective computing; data collection; machine learning; physiological feature; physiological signals; review; sensors; wearables
Mesh:
Year: 2019 PMID: 31547220 PMCID: PMC6806301 DOI: 10.3390/s19194079
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Exemplary dimensional model. (a) Schematic representation of the circumplex (valence-arousal) model. Adapted from Valenza et al. [36]; (b) Exemplary Self-Assessment Manikins [38], used to generate labels in the valence-arousal space. Adapted from Jirayucharoensak et al. [40].
Major functions of the sympathetic nervous system and parasympathetic nervous system.
| Sympathetic Nervous System (SNS) | Parasympathetic Nervous System (PNS) |
|---|---|
|
associated with ‘fight or flight’ |
associated with ‘rest and digest’ |
|
pupils dilate |
pupils constrict |
|
decreased salivation and digestion |
increased salivation and digestion |
|
increased heart and respiration rate |
decreased heart and respiration rate |
|
increased electrodermal activity | |
|
increased muscle activity | |
|
adrenalin and glucose release |
Four exemplary affective states and their physiological response [58]. Abbreviations: ↓ indicate a decrease, ↑ indicates an increase, indicate both increase and decrease (depending on the study), − indicates no change in the parameter under consideration, # represents number of.
| Anger | Sadness | Amusement | Happiness | |
|---|---|---|---|---|
| (Non-Crying) | ||||
|
| ||||
| HR | ↑ | ↓ |
| ↑ |
| HRV | ↓ | ↓ | ↑ | ↓ |
|
| ||||
| SCL | ↑ | ↓ | ↑ |
|
| # SCRs | ↑ | ↓ | ↑ | ↑ |
|
| ||||
| Respiration rate | ↑ | ↑ | ↑ | ↑ |
Sensor modalities and derived indicators used in the wearable-based AR. Abbreviations: heart rate (HR), heart rate variability (HRV).
| Physiological Signal Type | Derived Indicators | |
|---|---|---|
|
| Electroencephalogram | Electric potential changes of brain neurons |
| Electromyogram | Facial muscle activity (e.g., zygomaticus major) | |
| Electrooculography | Eye movements | |
| Photoplethysmogram (ear) | HR and HRV | |
|
| Electrocardiogram | HR and HRV |
| Electrodermal activity | Tonic and phasic component | |
| Electromyogram | Muscle activity | |
| Inertial sensor | Physical activity/body pose | |
| Respiratory inductive Plethys-mograph | Respiration rate and volume | |
| Body thermometer | Temperature | |
|
| Electrodermal activity meter | Tonic and phasic component |
| Blood Oxymeter | Blood oxygen saturation | |
| Blood pressure | Sphygmomanometer | |
| Inertial sensor | Physical activity | |
| Photoplethysmogram | HR and HRV | |
| Thermometer | Temperature | |
|
| Electrodermal activity | Tonic and phasic component |
| Inertial sensor | Physical activity | |
| Context | Sensors of a mobile phone (GPS, microphone, etc.) | Location, Sound, Activity, Interaction |
Affective states and sensor signals frequently employed in wearable-based AR. Table 9 provides further detail on algorithms, location and performance. Abbreviations: 3-axes acceleration (ACC), blood pressure (BP), (EEG), electromyogram (EMG), electrooculography (EOG), heart rate (HR), magnetoencephalogram (MEG), pupil diameter (PD), photoplethysmogram (PPG), respiration (RESP), skin-temperature (TEMP), arterial oxygen level (SpO2), low arousal/low valence (LALV), low arousal/high valence (LAHV), high arousal/low valence (HALV), high arousal/high valence (HAHV)
| Author | Affective States | Sensor Signals | |
|---|---|---|---|
| <2005 | Picard et al. [ | Neutral, anger, hate, grief, joy, platonic/romantic love, reverence | EDA, EMG, PPG, RESP |
| Haag et al. [ | Low/medium/high arousal and positive/negative valence | ECG, EDA, EMG, TEMP, PPG, RESP | |
| Lisetti and Nasoz [ | Sadness, anger, fear, surprise, frustration, amusement | ECG, EDA, TEMP | |
| 2005 | Liu et al. [ | Anxiety, boredom, engagement, frustration, anger | ECG, EDA, EMG |
| Wagner et al. [ | Joy, anger, pleasure, sadness | ECG, EDA, EMG, RESP | |
| Healey and Picard [ | Three stress levels | ECG, EDA, EMG, RESP | |
| 07 | Leon et al. [ | Neutral/positive/negative valence | EDA, HR, BP |
| 2008 | Zhai and Barreto [ | Relaxed and stressed | EDA, PD, PPG, TEMP |
| Kim et al. [ | Distinguish high/low stress group of individuals | PPG | |
| Kim and André [ | Four quadrants in valence-arousal space | ECG, EDA, EMG, RESP | |
| Katsis et al. [ | High/low stress, disappointment, euphoria | ECG, EDA, EMG, RESP | |
| 2009 | Calvo et al. [ | Neutral, anger, hate, grief, joy, platonic/romantic love, reverence | ECG, EMG |
| Chanel et al. [ | Positively/negatively excited, calm-neutral (in valence-arousal space) | BP, EEG, EDA, PPG, RESP | |
| Khalili and Moradi [ | Positively/negatively excited, calm (valence-arousal space) | BP, EEG, EDA, RESP, TEMP | |
| 2010 | Healey et al. [ | Points in valence arousal space. moods | ACC, EDA, HR, audio |
| 2011 | Plarre et al. [ | Baseline, different types of stress (social, cognitive and physical), perceived stress | ACC, ECG, EDA, RESP, TEMP, ambient temperature |
| Hernandez et al. [ | Detect stressful calls | EDA | |
| 2012 | Valenza et al. [ | Five classes of arousal and five valence levels | ECG, EDA, RESP |
| Hamdi et al. [ | Joy, sadness, disgust, anger, fear, surprise | ECG, EEG, EMG | |
| Agrafioti et al. [ | Neutral, gore, fear, disgust, excitement, erotica, game elicited mental arousal | ECG | |
| Koelstra et al. [ | Four quadrants in valence-arousal space | ECG, EDA, EEG, EMG, EOG, RESP, TEMP, facial video | |
| Soleymani et al. [ | Neutral, anxiety, amusement, sadness, joy, disgust, anger, surprise, fear | ECG, EDA, EEG, RESP, TEMP | |
| 2013 | Sano and Picard [ | Stress vs. neutral | ACC, EDA, phone usage |
| Martinez et al. [ | Relaxation, anxiety, excitement, fun | EDA, PPG | |
| 2014 | Valenza et al. [ | Four quadrants in valence-arousal space | ECG |
| Adams et al. [ | Stress vs. neutral (aroused vs. non-aroused) | EDA, audio | |
| 2015 | Hovsepian et al. [ | Stress vs. neutral | ECG, RESP |
| Abadi et al. [ | High/Low valence, arousal and dominance | ECG, EOG, EMG, near-infrared face video, MEG | |
| 2016 | Rubin et al. [ | Panic attack | ACC, ECG, RESP |
| Jaques et al. [ | Stress, happiness, health values | EDA, TEMP, ACC, phone usage | |
| Rathod et al. [ | Normal, happy, sad, fear, anger | EDA, PPG | |
| Zenonos et al. [ | Excited, happy, calm, tired, bored, sad, stressed, angry | ACC, ECG, PPG, TEMP | |
| Zhu et al. [ | Angle in valence arousal space | ACC, phone context | |
| Birjandtalab et al. [ | Relaxation, different types of stress (physical, emotional, cognitive) | ACC, EDA, TEMP, HR, SpO2 | |
| 2017 | Gjoreski et al. [ | Lab: no/low/high stress; | ACC, EDA, PPG, TEMP |
| Mozos et al. [ | Stress vs. neutral | ACC, EDA, PPG, audio | |
| Taylor et al. [ | Tomorrow’s mood, stress, health | ACC, EDA, context | |
| Girardi et al. [ | High vs. low valence and arousal | EEG, EDA, EMG | |
| 2018 | Schmidt et al. [ | Neutral, amusement, stress | Torso: ACC, ECG, EDA, EMG, RESP, TEMP; |
| Zhao et al. [ | LALV, LAHV, HALV, HAHV | EDA, PPG, TEMP | |
| Marín-Morales et al. [ | LALV, LAHV, HALV, HAHV | ECG, EEG | |
| Santamaria- Granados et al. [ | LALV, LAHV, HALV, HAHV | ECG, EDA | |
| 2019 | Heinisch et al. [ | High positive pleasure high arousal, high negative pleasure high arousal and neutral | EMG, PPG, TEMP |
| Hassan et al. [ | Happy, relaxed, disgust, sad and neutral | EDA, PPG, EMG (from DEAP) | |
| Kanjo et al. [ | Five valence classes | ACC, EDA, HR, TEMP, environmental, GPS | |
| Di Lascio et al. [ | Detect laughter episodes | ACC, EDA, PPG |
Questionnaires utilized in recent wearable-based AR field studies. Abbreviations: Number of Items (I), Big Five Inventory (BFI), Photo Affect Meter (PAM), Positive and Negative Affect Schedule (PANAS), PHQ-9, Pittsburgh Sleep Quality Index (PSQI), Perceived Stress Scale (PSS), Self-Assessment Manikins (SAM), Stress Response Inventory (SRI), Stait-Trait Anxiety Inventory (STAI).
| Questionnaires Employed | ||||
|---|---|---|---|---|
|
|
|
|
|
|
| Stress level | PSS: subject’s perception and awareness of stress | 10 | Cohen et al. [ | Sano and Picard [ |
| SRI: score severity of stress-related symptoms within time interval | 22 | Koh et al. [ | Kim et al. [ | |
| Depression level | PHQ-9: score DSM-IV manual | 9 | Kroenke et al. [ | Wang et al. [ |
| Loneliness level | UCLA loneliness scale: addressing loneliness and social isolation. | 20 | Russell [ | Wang et al. [ |
| Sleep behaviour and quality | PSQI: Providing information about sleep quality | 19 | Buysse et al. [ | Sano and Picard [ |
| Measure suc-cess areas | Flourishing scale: measure success, self-esteem, purpose and optimism | 8 | Diener et al. [ | Wang et al. [ |
| Personality traits | BFI: indicating personality traits | 44 | John and Srivastava [ | Taylor et al. [ |
|
| ||||
| Affect in Valence-arousal space | Mood Map: a translation of the circumplex model of emotion | 2 | Morris and Guilak [ | Healey et al. [ |
| SAM | 2 | Morris [ | Schmidt et al. [ | |
| Positive and negative affect | Shortened PANAS | 10 | Muaremi et al. [ | Muaremi et al. [ |
| Positive Affect of PANAS | PAM: choose one of 16 images, mapped to the valence-arousal space | 1 | Pollak et al. [ | Wang et al. [ |
| Subjective mood indicator | Smartphone app querying user’s mood | 8 | HealthyOffice app | Zenonos et al. [ |
| Stress level assessment | Adaptation of PSS for ambulatory setting | 5 | Hovsepian et al. [ | Hovsepian et al. [ |
| Log current Stress Level | 1 | Gjoreski et al. [ | Gjoreski et al. [ | |
| Severity of panic attack symptoms | Symptoms from the DSM-IV and Panic Disorder Severity Scale standard instrument | 15 | Shear et al. [ | Rubin et al. [ |
Questionnaires employed during recent field studies, focusing on the applied scheduling (Pre-, During or Post-study).
| Author | Employed Questionnaires and Their Scheduling | |
|---|---|---|
| Emotion | Healey et al. [ | |
| Rubin et al. [ | ||
| Jaques et al. [ | ||
| Stress | Hernandez et al. [ | |
| Muaremi et al. [ | ||
| Kim et al. [ | ||
| Sano and Picard [ | ||
| Adams et al. [ | ||
| Hovsepian et al. [ | ||
| Gjoreski et al. [ | ||
| Schmidt et al. [ | ||
| Mood | Wang et al. [ | |
| Sano et al. [ | ||
| Zenonos et al. [ |
Publicly available datasets relevant for wearable affect and stress recognition. Abbreviations: Number of subjects (Sub), Location (Loc), Lab (L), Field (F), Field with constraint (FC), Population (Pop) reported as mean age or as category, College Student (CS), Graduate Student (GS), 3-axes acceleration (ACC), electrocardiogram (ECG), electrodermal activity (EDA), electroencephalogram (EEG), electromyogram (EMG), electrooculography (EOG), magnetoencephalogram (MEG), respiration (RESP), arterial oxygen level (SpO2), skin-temperature (TEMP).
| Name | Labels | Pop. | Sub. | Loc. | Included Modalities | |
|---|---|---|---|---|---|---|
| Emotion (E) | Eight-Emotion [ | Neutral, anger, hate, grief, joy, platonic love,romantic love, reverence | GS | 1 | L | ECG, EDA, EMG, RESP |
| DEAP [ | Continuous scale of valence, arousal, liking, dominance, Discrete scale of familiarity | 26.9 | 32 | L | ECG, EDA, EEG, EMG, EOG, RESP, TEMP, face video (not all subjects) | |
| MAHNOB-HCI [ | Discrete scale of valence, arousal, dominance, predictability, Emotional keywords |
| 27 | L | ECG, EDA EEG, RESP, TEMP, face and body video, eye gaze tracker, audio | |
| DECAF [ | Discrete scale of valence, arousal, dominance |
| 30 | L | ECG, EMG, EOG, MEG, near-infrared face video | |
| ASCERTAIN [ | Discrete scale of valence, arousal, liking, engagement, familiarity, Big Five | 30 | 58 | L | ECG, EDA, EEG, facial activity data (facial landmark trajectories) | |
| USI_Laughs [ | Detect and distinguish laughter from other events |
| 34 | L | ACC, EDA, PPG, TEMP | |
| Stress (S) | Driver [ | Stress levels: low, medium, high | - | 24 | FC | ECG, EDA, EMG, RESP |
| Non-EEG [ | Four types of stress (physical, emotional, cognitive, none) | CS | 20 | L | ACC, EDA, HR, TEMP, SpO2 | |
| Distracted Driving [ | Driving being subject to no, emotional, cognitive, and sensorimotor distraction | Elder + Young | 68 | L | EDA, heart and respiration rate, facial expressions, eye tracking | |
| StudentLife [ | Sleep, activity, sociability, mental well-being, stress, academic performance | CS + GS | 48 | F | ACC, audio, context, GPS, smartphone usage | |
| E+S | WESAD [ | Three affective states: neutral, amusement, stress |
| 15 | L |
Features commonly extracted and applied in the wearable-based AR.
| Features | |
|---|---|
|
| |
|
| |
|
| |
|
| |
|
| |
|
|
Comparison of algorithms, validation methods, and accuracies of recent wearable-based AR studies. If not stated differently, scores are reported as (mean) accuracy. Abbreviations: Setting (Set.), Lab (L), Field (F), Field with constraint (FC), Validation (Val), cross-validation (CV), Leave-One-Out (LOO), leave-one-subject-out (LOSO), Leave-One-Trial-Out (LOTO), Arousal (AR), Valence (VA), Dominance (DO), Liking (LI), AdaBoost (AB), Analysis of Variance (ANOVA), Bayesian Network (BN), CNN, deep belief network (DBN), Gradient Boosting (GB), Gaussian Mixture Model (GMM), Hidden Markow Model (HMM), linear discriminant analysis (LDA), Linear Discriminant Function (LDF), Logistic Regression (LR), Naive Bayes (NB), NN, Passive Aggressive Classifier (PA), RF, Decision/Regression/Function Tree (DT/RT/FT), Ridge Regression (RR), Quadratic Discriminant Analysis (QDA).
| Author | Algorithm | Classes | Set. | Sub. | Val. | Accuracy | |
|---|---|---|---|---|---|---|---|
| <2005 | Picard et al. [ | kNN | 8 | L | 1 | LOO | 81% |
| Haag et al. [ | NN | contin. | L | 1 | 3-fold split | AR: <96%, VA: <90% | |
| Lisetti and Nasoz [ | kNN, LDA, NN | 6 | L | 14 | LOO | 72%; 75%; 84% | |
| 2005 | Liu et al. [ | BN, kNN, RT, SVM | 5 | L | 15 | LOO | 74%; 75%; 84%; 85% |
| Wagner et al. [ | kNN, LDF, NN | 4 | L | 1 | LOO | 81%; 80%; 81% | |
| Healey and Picard [ | LDF | 3 | FC | 24 | LOO | 97% | |
| 07 | Leon et al. [ | NN | 3 | L | 8+1 | LOSO | 71% |
| 2008 | Zhai and Barreto [ | DT, NB, SVM | Bin. | L | 32 | 20-fold CV | 88%; 79%; 90% |
| Kim et al. [ | LR | Bin. | FC | 53 | 5-fold CV | ∼63% | |
| Kim and André [ | LDA | 4 | L | 3 | LOO | sub. dependent/independent: 95%/70% | |
| Katsis et al. [ | SVM | 4 | L | 10 | 10-fold CV | 79% | |
| 2009 | Calvo et al. [ | BN, FT, LR, NB, NN, SVM | 8 | L | 3 | 10-fold CV | one subject: 37–98%, |
| Chanel et al. [ | LDA, QDA, SVM | 3/Bin. | L | 10 | LOSO | <50%; <47%; <50%, | |
| Khalili and Moradi [ | QDA | 3 | L | 5 | LOO | 66.66% | |
| 10 | Healey et al. [ | AB,DT, BN, NB | Bin. | F | 19 | 10-fold CV | None 2 |
| 2011 | Plarre et al. [ | AB, DT, SVM/ HMM | Bin. | L/F | 21/17 | 10-fold CV | 82%; 88%; 88%/0.71 3 |
| Hernandez et al. [ | SVM | Bin. | F | 9 | LOSO | 73% | |
| 2012 | Valenza et al. [ | QDA | 5 | L | 35 | 40-fold CV | >90% |
| Hamdi et al. [ | ANOVA | 6 | L | 16 | - | None 4 | |
| Agrafioti et al. [ | LDA | Bin. | L | 31 | LOO | Active/Pas AR: 78/52% | |
| Koelstra et al. [ | NB | Bin. | L | 32 | LOO | AR/VA/LI: 57%/63%/59% | |
| Soleymani et al. [ | SVM | 3 | L | 27 | LOSO | VA: 46%, AR: 46% | |
| 2013 | Sano and Picard [ | kNN, SVM | Bin. | F | 18 | 10-fold CV | <88% |
| Martinez et al. [ | CNN | 4 1 | L | 36 | 3-fold CV | learned features: <75%, | |
| 2014 | Valenza et al. [ | SVM | Bin. | L | 30 | LOO | VA: 79%, AR: 84% |
| Adams et al. [ | GMM | Bin. | F | 7 | - | 74% | |
| 2015 | Hovsepian et al. [ | SVM/BN | Bin. | L/F | 26/20 | LOSO | 92%/>40% |
| Abadi et al. [ | NB, SVM | Bin. | L | 30 | LOTO | VA/AR/DO: 50-60% | |
| 2016 | Rubin et al. [ | DT, GB, kNN, LR, PA, RF, RR, SVM | Bin. | F | 10 | 10-fold CV | Bin. panic: 73–97% |
| Jaques et al. [ | LR, NN,SVM | Bin. | F | 30 | 5-fold CV | <76%; <86%; <88% | |
| Rathod et al. [ | Rule-based | 6 | L | 6 | - | <87% | |
| Zenonos et al. [ | DT, kNN, RF | 5 | F | 4 | LOSO | 58%; 57%; 62% | |
| Zhu et al. [ | RR | 1 | F | 18 | LOSO |
| |
| Birjandtalab et al. [ | GMM | 4 | L | 20 | - | <85% | |
| 2017 | Gjoreski et al. [ | AB, BN, DT, kNN, RF, SVM | 3/Bin. | L/F | 21/5 | LOSO | <73%/<90% |
| Mozos et al. [ | AB, kNN, SVM | Bin. | L | 18 | CV | 94%; 93%; 87% | |
| Taylor et al. [ | Single/Multitask LR, NN, SVM | Bin. | F | 104 | Cust. 6 | Mood: <78%, Stress/Health<82% | |
| Girardi et al. [ | DT, NB, SVM | Bin. | L | 19 | LOSO |
| |
| 2018 | Schmidt et al. [ | AB, DT, kNN, LDA, RF | 3/Bin. | L | 15 | LOSO | <80%/<93% |
| Zhao et al. [ | NB, NN, RF, SVM | 4/Bin. | L | 15 | LOSO | 76% | |
| Marín-Morales et al. [ | SVM | Bin. | L | 60 | LOSO | Val<75%, AR<82% | |
| Santamaria- Granados et al. [ | CNN | Bin. | L | 40 | - | Val: 75%, AR:71% | |
| 2019 | Heinisch et al. [ | DT, kNN, RF | 3 | L | 18 | LOSO | <67% |
| Hassan et al. [ | DBN+SVM | 5 | L | 32 | 10-fold CV | 89.53% use DEAP | |
| Kanjo et al. [ | CNN+LSTM | 5 | FC | 34 | User 7 | <95% | |
| Di Lascio et al. [ | LR, RF, SVM | Bin. | L | 34 | LOSO | <81% |
1 Given as pairwise preferences. 2 DT overfit, other classifiers performed worse than random guessing. 3 Correlation between self-reported and output of model. 4 No significant differences could be found between the affective states. 5 Mean absolute error of mood angle in circumplex model. 6 80/20% split of the entire data+5-fold CV. 7 User specific models. Trained random on 70/30% splits with non-overlapping windows.