| Literature DB >> 35408149 |
Paweł Jemioło1, Dawid Storman2, Maria Mamica1, Mateusz Szymkowski1, Wioletta Żabicka3, Magdalena Wojtaszek-Główka3, Antoni Ligęza1.
Abstract
Our review aimed to assess the current state and quality of publicly available datasets used for automated affect and emotion recognition (AAER) with artificial intelligence (AI), and emphasising cardiovascular (CV) signals. The quality of such datasets is essential to create replicable systems for future work to grow. We investigated nine sources up to 31 August 2020, using a developed search strategy, including studies considering the use of AI in AAER based on CV signals. Two independent reviewers performed the screening of identified records, full-text assessment, data extraction, and credibility. All discrepancies were resolved by discussion. We descriptively synthesised the results and assessed their credibility. The protocol was registered on the Open Science Framework (OSF) platform. Eighteen records out of 195 were selected from 4649 records, focusing on datasets containing CV signals for AAER. Included papers analysed and shared data of 812 participants aged 17 to 47. Electrocardiography was the most explored signal (83.33% of datasets). Authors utilised video stimulation most frequently (52.38% of experiments). Despite these results, much information was not reported by researchers. The quality of the analysed papers was mainly low. Researchers in the field should concentrate more on methodology.Entities:
Keywords: affective computing; artificial intelligence; automated affect recognition; automated emotion recognition; cardiovascular; dataset; systematic review
Mesh:
Year: 2022 PMID: 35408149 PMCID: PMC9002643 DOI: 10.3390/s22072538
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) study flow diagram [63].
Characteristics of stimuli in 18 included studies (21 experiments).
| Variable (No. of Experiments Available for Calculations) | No (%) | |
|---|---|---|
| Type of stimuli (21) | ||
| Video (music, movie, ads) | 11 (52.38) | |
| Audio (music excerpts) | 4 (19.05) | |
| Game (FIFA 2016, Maze-Ball) | 2 (9.52) | |
| Virtual Reality (videos, scenes) | 2 (9.52) | |
| Self elicitation (actors) | 1 (4.76) | |
| Mixed (TSST 1, video and meditation in one experiment) | 1 (4.76) | |
| Length of stimuli [seconds] (17) | 304.60 (32–1200) | |
| No. of stimuli in dataset (20) | 27.70 (4–144) | |
| No. of elicited emotions [classes] (18) | 6.06 (3–23) | |
1 Trier Social Stress Test.
Characteristics of devices and signals in 18 included studies (21 experiments).
| Variable (No. of Datasets Available for Calculations) 1 | No (%) | |
|---|---|---|
| Used devices (16) | ||
| Shimmer 2R | 3 (18.75) | |
| BIOPAC MP150 | 3 (18.75) | |
| Biosemi ActiveTwo | 2 (12.50) | |
| NeXus-10 | 1 (6.25) | |
| ProComp Infiniti | 1 (6.25) | |
| BIOPAC BioNomadix | 1 (6.25) | |
| BItalino | 1 (6.25) | |
| RespiBAN Professional | 1 (6.25) | |
| B-Alert x10 | 1 (6.25) | |
| Empatica E4 | 1 (6.25) | |
| Polar H7 | 1 (6.25) | |
| Zephyr BioHarness | 1 (6.25) | |
| IOM Biofeedback | 1 (6.25) | |
| CV 2 signals recorded (18) | ||
| ECG 3 | 15 (83.33) | |
| HR 4 | 3 (16.67) | |
| BVP 5 | 3 (16.67) | |
| PPG 6 | 1 (5.56) | |
| Sampling frequency [Hz] (12) | 543.31 (32–2048) | |
| Length of baseline recording [seconds] (7) | 292.14 (5–1200) | |
1 some studies used more than one device or cardiovascular signal; 2 cardiovascular; 3 electrocardiogram; 4 heart rate; 5 blood volume pressure; 6 photoplethysmogram.
Characteristics of population in 18 included studies (21 experiments).
| Variable (No. of Experiments Available for Calculations) | No. (%) |
|---|---|
| Participating people (21) | 916 |
| Eligible people (20) | 812 |
| Age (18) | 23.8 (17–47) |
| Percentage of females (16) | 45.13 (0–86) |
| Ethnicity (4) | |
| Chinese | 2 (9.52) |
| European | 2 (9.52) |
Figure 2Risk of bias (RoB) in included studies.
Risk of Bias (RoB) among 18 included studies.
| Study | RoB Item 1 | Overall Quality 2 | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | Scenario 1 | Scenario 2 | |
| [ | NR | PY | PY | Y | Y | Y | Y | PN | Low | Low |
| [ | NR | PY | PY | Y | PN | Y | Y | NR | Low | Low |
| [ | NR | NR | NR | Y | PN | Y | Y | PN | Low | Low |
|
[ | NR | Y | PY | Y | Y | Y | Y | PY | Unclear | High |
| [ | NR | PY | PN | PY | PN | Y | Y | N | Low | Low |
| [ | NR | PN | N | PN | PN | N | N | PY | Low | Low |
| [ | NR | Y | PN | PY | Y | Y | Y | PN | Low | Low |
| [ | NR | Y | Y | PY | Y | Y | Y | PN | Low | Low |
| [ | NR | NR | NR | PN | PN | Y | Y | NR | Low | Low |
| [ | NR | NR | NR | NR | PN | Y | Y | N | Low | Low |
|
[ | NR | Y | Y | Y | Y | Y | Y | Y | Unclear | High |
| [ | NR | PY | PY | NR | Y | Y | Y | NR | Unclear | Unclear |
| [ | NR | Y | Y | NR | Y | Y | N | NR | Low | Low |
|
[ | NR | Y | Y | Y | Y | Y | PN | Y | Low | Low |
| [ | NR | Y | PN | NR | PN | Y | Y | NR | Low | Low |
| [ | NR | PY | PY | PY | Y | Y | Y | NR | Unclear | Unclear |
| [ | NR | PN | PY | NR | Y | Y | Y | NR | Low | Low |
| [ | NR | Y | NR | Y | Y | NR | N | NR | Low | Low |
1 Y, PY, NR, PN, N stands for yes, partial yes, not reported, partial no, no; 2 for more details, see Appendix C.
Excluded Studies.
| Study ID | Reason of Excluding |
|---|---|
| [ | Wrong study design |
| [ | Wrong study design |
| [ | Wrong study design |
| [ | Wrong study design |
| [ | Wrong study design |
| [ | Wrong study design |
| [ | Wrong study design |
| [ | Wrong study design |
| [ | Wrong study design |
| [ | Wrong study design |
| [ | Wrong study design |
| [ | Wrong study design |
| [ | Wrong study design |
| [ | Wrong study design |
| [ | Wrong study design |
| [ | Wrong study design |
| [ | Wrong population |
| [ | Wrong population |
| [ | Wrong population |
| [ | Wrong target |
| [ | Wrong target |
| [ | Wrong target |
| [ | Wrong target |
| [ | Wrong target |
| [ | Wrong target |
| [ | Wrong target |
| [ | Wrong index method |
| [ | Wrong index method |
| [ | Wrong type of data |
| [ | Wrong setting |
| [ | Wrong setting |
| [ | Wrong setting |
| [ | Wrong outcomes |
| [ | Wrong outcomes |
| [ | Wrong outcomes |
| [ | Wrong outcomes |
| [ | Wrong outcomes |
| [ | Wrong outcomes |
Risk of bias tool.
| Domain 1 | Review Authors’s | Criteria for Judgement | |
|---|---|---|---|
| Sample | 1. Was the sample | Yes/partial yes | The experiment was preceded by calculating the minimum sample |
| No/partial no | It is stated that the minimum sample size has not been calculated, or it | ||
| Not reported | No sufficient information is provided in this regard. | ||
| Sample | 2. Were eligibility | Yes/partial yes | The criteria for inclusion in the experiment are specified. |
| No/partial no | The criteria for inclusion in the experiment were used, however not | ||
| Not reported | No sufficient information is provided in this regard. | ||
| Participants | 3. Were all | Yes/partial yes | The criteria for inclusion and exclusion are relevant to the aim of the |
| No/partial no | The established criteria for inclusion and exclusion are irrelevant to | ||
| Not reported | No sufficient information is provided in this regard. | ||
| Measurement | 4. Was the | Yes/partial yes | The selection of stimuli is adequately justified in the context of eliciting |
| No/partial no | The selection of stimuli was carried out based on inadequate criteria. | ||
| Not reported | No sufficient information is provided in this regard. | ||
| Measurement | 5. Was the | Yes/partial yes | The assessment tool used for emotions measurement is described in |
| No/partial no | The assessment tool used for emotions measurement is not described, | ||
| Not reported | No sufficient information is provided in this regard. | ||
| Flow and | 6. Did all | Yes/partial yes | Emotions were measured in all participants, and the measurement |
| No/partial no | Not all participants had their emotions measured. | ||
| Not reported | No sufficient information is provided in this regard. | ||
| Flow and | 7. Did participants | Yes/partial yes | The same assessment standard was used in all participants who had |
| No/partial no | A different assessment standard was used in some of the participants | ||
| Not reported | No sufficient information is provided in this regard. | ||
| Control of | 8. Were the | Yes/partial yes | Adequate confounding factors were measured, and relevant |
| No/partial no | The control of confounding factors is not justified, or the measured | ||
| Not reported | No sufficient information is provided in regard to confounding factors. | ||
| High | All judgements are yes or partial yes. | ||
| Low | At least one judgement is no or partial no. | ||
| Unclear | All judgements are yes or partial yes with at least one not reported. | ||
1 the specific domain was based on an instrument provided in the reference.