| Literature DB >> 32911809 |
Ronglong Xiong1,2, Fanmeng Kong1,2, Xuehong Yang1,2, Guangyuan Liu1,2, Wanhui Wen1,2.
Abstract
The matching of cognitive load and working memory is the key for effective learning, and cognitive effort in the learning process has nervous responses which can be quantified in various physiological parameters. Therefore, it is meaningful to explore automatic cognitive load pattern recognition by using physiological measures. Firstly, this work extracted 33 commonly used physiological features to quantify autonomic and central nervous activities. Secondly, we selected a critical feature subset for cognitive load recognition by sequential backward selection and particle swarm optimization algorithms. Finally, pattern recognition models of cognitive load conditions were constructed by a performance comparison of several classifiers. We grouped the samples in an open dataset to form two binary classification problems: (1) cognitive load state vs. baseline state; (2) cognitive load mismatching state vs. cognitive load matching state. The decision tree classifier obtained 96.3% accuracy for the cognitive load vs. baseline classification, and the support vector machine obtained 97.2% accuracy for the cognitive load mismatching vs. cognitive load matching classification. The cognitive load and baseline states are distinguishable in the level of active state of mind and three activity features of the autonomic nervous system. The cognitive load mismatching and matching states are distinguishable in the level of active state of mind and two activity features of the autonomic nervous system.Entities:
Keywords: cognitive load; e-learning; nervous response; pattern recognition; physiological measures
Mesh:
Year: 2020 PMID: 32911809 PMCID: PMC7571025 DOI: 10.3390/s20185122
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Related work in cognitive load recognition.
| Study | # Subjects | # Features | # Categories | Classifier | Signals | Best Accuracy | Validation Approach |
|---|---|---|---|---|---|---|---|
| Hasanbasic [ | 10 | 12 | 3 | SVM | ECG, EDA | 91.00% | SD |
| Melillo [ | 42 | 3 | 2 | LDA | ECG | 90.00% | SI |
| Cheema [ | 30 | 5 | 2 | LS-SVM | PCG | 96.67% | SD |
| Wang [ | 10 | 32 | 2 | PCA, SVM | EEG | 97.14% | SD |
| Al-Shargie [ | 22 | 9 | 2 | SVM | EEG, fNIRS | 95.10% | SD |
| McDuff [ | 10 | 7 | 2 | Naïve Bayes | PPG (HR, HRV, BR) | 86.00% | SI |
| Ahn [ | 14 | 4 | 2 | SVM | ECG, EEG | 87.50% | SD |
| Xia [ | 22 | 4 | 2 | PCA, SVM | EEG, ECG | 79.54% | SD |
| Dimitrakopoulos [ | 28 | 23 | 2 | SVM | EEG | 86.00% | SD |
| Yu [ | 20 | 4 | 2 | ELM | ECG | 84.75% | SI |
| Wang [ | 160 | - | 2 | LFDM, XGBoost | ECG, PPG | 97.2% | - |
| Das Chakladar [ | 48 | 6 | 2 | BLSTM-LSTM | EEG | 86.33% | - |
| Barua [ | 66 | 42 | 2 | Random Forest | HRV, GSR, RESP | 78.00% | SD |
| Plechawska [ | 11 | 52 | 3 | KNN | EEG | 91.50% | SI |
| Fan [ | 20 | 5 | 3 | SVM, PCA | EEG, ECG | 80.00% | SI |
SD: subject-dependent. In this case, samples belonging to one subject appear both in the training dataset and in the validation dataset, usually causing over-optimistic accuracy of the classifier. SI: subject-independent, which means subjects and samples belonging to the validation dataset are totally new to the trained classifier. LS-SVM: least-square support vector machine; PCG: phonocardiography; PPG: photoplethysmography; ELM: extreme learning machine; LFDM: linear feature dependency modeling; XGBoost: eXtreme Gradient Boosting; BLSTM-LSTM: a combination of bidirectional long short-term memory (BLSTM) and long short-term memory (LSTM) networks. PCG: phonocardiography; PPG: photoplethysmography; EDA: electrodermal activity; fNIRS: functional near-infrared spectroscopy; HR: heart rate; BR: breathing rate; RESP: respiration. #: number.
Figure 1The flowchart of the data selection and exclusion.
Grouping rules.
| Group | # Subjects | # Male | # Female | Model Name | Physiological Signal |
|---|---|---|---|---|---|
| CL vs. BL | 27 vs. 27 | 8 vs. 7 | 19 vs. 20 | Model A | HRV |
| Model B | EEG | ||||
| Model C | HRV and EEG | ||||
| CLMM vs. CLM | 9 vs. 18 | 3 vs. 5 | 6 vs. 13 | Model D | HRV |
| Model E | EEG | ||||
| Model F | HRV and EEG |
Among the data of 29 subjects, the BL data of two subjects and the CL data of another two subjects were heavily distorted by noise and eliminated from the dataset. #: number.
HRV features of cognitive load.
| Indices | Description | Relation with ANS Activity |
|---|---|---|
| SDRR | Standard deviation of RR intervals | A measure of HRV in time domain [ |
| RMSSD | Square root of the mean squared differences of successive RR intervals | A measure of HRV at one adjacent beat scale, which reflects the vagal activity [ |
| Mean | Mean of RR intervals | A measure of the average level of ANS activity [ |
| Area | Summation of RR intervals | A measure of total amount of ANS activity in time domain. |
| MFD | Mean of the first differences of RR intervals | A measure of HRV at one adjacent beat scale, which reflects the average fluctuation of ANS activity [ |
| pNN20 | Proportion of differences between successive RR intervals longer than 20 ms | A measure of HRV in time domain, which reflects the fluctuation of ANS activity. |
| pNN10 | Proportion of differences between successive RR intervals longer than 10 ms | A measure of HRV in time domain, which reflects the fluctuation of ANS activity. |
| HRVC | Heart rate variation coefficient, calculated by the ratio of SD to Mean | A measure of normalized fluctuation of ANS activity. |
| VLF | The power of RR intervals between 0 Hz and 0.04 Hz | A measure of SNS activity [ |
| LF | The power of RR intervals between 0.04 Hz and 0.15 Hz | A measure of combined activities of SNS and PNS [ |
| HF | The power of RR intervals between 0.15 Hz and 0.4 Hz | A measure of PNS activity [ |
| TOTPWR | The power of RR intervals between 0 Hz and 0.4 Hz | A measure of total amount of ANS activity in frequency domain [ |
| HF/(LF+HF) | The ratio of HF/(LF+HF) | A measure of normalized PNS activity. |
| LF/(LF+HF) | The ratio of LF/(LF+HF) | A measure of normalized PNS+SNS activity [ |
| LF/HF | The ratio of LF/HF | A measure of the balance between SNS and ANS [ |
| Entropy | PeEn, ApEn, MFEn, SampEn | Measures of the complexity of RR interval series caused by competition between SNS and PNS [ |
| DFA (α1, α2, α1/α2) | Detrend fluctuation analysis | Measures of the fractal properties of RR interval series caused by competition between SNS and PNS [ |
| TFC | Total fluctuation coefficient | A measure of the fluctuation of ANS activity in scales 1~ |
| PP (SD1, SD2, SD1/SD2) | Poincaré Plot | Measures of short-term and long-term HRV, which reflects the fluctuation of ANS activity [ |
| RLHE | Range of the local Hurst exponents | A measure of the complexity of RR interval series, which is controlled by competition between SNS and PNS [ |
PeEn: permutation entropy; ApEn: approximate entropy; SampEn: sample entropy; MFEn: multiscale fuzzy measure entropy; PNS: parasympathetic nervous system; α1 and α2 were calculated with the small scale (4 ≤ n ≤ 16) and large scale (16 ≤ n ≤ 32), respectively.
EEG Features of Cognitive Load.
| EEG Index | Description | Relation with CNS Activity |
|---|---|---|
| DP | Delta band (1–4 Hz) power | A measure of unconscious mind [ |
| TP | Theta band (4.1–5.8 Hz) power | A measure of subconscious mind [ |
| AP | Alpha band (5.9–7.4 Hz) power | A measure of relaxed mental state [ |
| BP | Beta band (13–19.9 Hz) power | A measure of active state of mind [ |
| GP | Gamma band (20–25 Hz) power | A measure of hyper brain activity [ |
| WE | Wavelet entropy | A measure of energy distribution of EEG at different scales [ |
In this work, we chose DB4 as wavelet packet decomposition function with the scale of 7 to calculate the WE.
Settings of entropy parameters based on MANN–WHITNEY U test.
| Feature | Group | Mean ± SD | Embedding Dimension | Tolerance Threshold | Sig. | Description |
|---|---|---|---|---|---|---|
| ApEn | CL | 0.67 ± 0.16 | 0.002 | |||
| BL | 0.78 ± 0.14 | |||||
| CLMM | 0.62 ± 0.12 | 0.041 | ||||
| CLM | 0.47 ± 0.18 | |||||
| SampEn | CL | 1.17 ± 0.33 | 0.01 | |||
| BL | 1.39 ± 0.29 | |||||
| CLMM | 0.70 ± 0.16 | 0.03 | ||||
| CLM | 0.52 ± 0.21 | |||||
| PeEn | CL | 0.59 ± 0.03 | - | 0.009 | ||
| BL | 0.61 ± 0.02 | |||||
| CLMM | 0.97 ± 0.13 | - | 0.017 | |||
| CLM | 0.93 ± 0.40 | |||||
| MFEn | CL | 0.33 ± 0.18 | <0.001 | |||
| BL | 0.50 ± 0.13 | |||||
| CLMM | 1.25 ± 0.16 | 0.015 | ||||
| CLM | 1.02 ± 0.27 |
Sig.: significance of Mann–Whitney U test.
Figure 2The results of HRV feature selection for Models A and D. (a) SBS and PSO, and (b) SBS.
Figure 3The results of EEG feature selection for Models B and E. (a) SBS and PSO, and (b) SBS.
Figure 4The results of EEG+HRV feature selection for Models C and F. (a) SBS and PSO, and (b) SBS.
Critical feature subsets of models A–F and performance indices of the classifiers.
| Model | Classifier | Critical Feature Subset | Mfs | Tfs (min) | F1 | Prec. (%) | Sens. (%) | Spec. (%) | AUC | Acc. (%) |
|---|---|---|---|---|---|---|---|---|---|---|
| Model A | SVM_q | Area, LF, HF/(LF+HF) | SBS | 11.9 | 0.87 | 83.3 | 92.6 | 81.5 | 0.87 | 87.0 |
| SVM_r | RMSSD, LF, MFEn | SBS | 10.4 | 0.83 | 82.1 | 85.2 | 81.5 | 0.83 | 83.3 | |
| KNN | Area, LF, LF/HF | SBS | 9.6 | 0.86 | 81.3 | 96.3 | 77.8 | 0.87 | 87.0 | |
|
|
|
|
|
|
|
|
|
|
| |
| Model B | SVM_q | AP_Pz, BP_F7, BP_O2 | SBS | 29.3 | 0.72 | 71.4 | 74.1 | 70.4 | 0.72 | 72.2 |
| SVM_r | DP_F8, AP_Fp1, BP_Pz | SBS | 25.8 | 0.78 | 75.9 | 81.5 | 74.1 | 0.78 | 77.8 | |
| KNN | DP_T3, TP_F8, AP_O1 | PSO and SBS | 1455.3 | 0.72 | 73.1 | 70.4 | 74.1 | 0.72 | 72.2 | |
|
|
|
|
|
|
|
|
|
|
| |
| Model C | SVM_q | AP_O1, AP_A2A1, GP_O1, Mean | SBS | 167.8 | 0.93 | 92.6 | 92.6 | 92.6 | 0.93 | 92.6 |
| SVM_r | WE_P3, Area, LF, ApEn | SBS | 198.5 | 0.91 | 92.3 | 88.9 | 92.6 | 0.91 | 90.7 | |
| KNN | TP_O1, Mean, LF, ApEn | PSO and SBS | 762.9 | 0.90 | 86.7 | 96.3 | 85.2 | 0.91 | 90.7 | |
|
|
|
|
|
|
|
|
|
|
| |
| Model D | SVM_q | MFD, SampEn, MFEn | SBS | 7.9 | 0.88 | 100 | 77.8 | 100 | 0.89 | 88.9 |
|
|
|
|
|
|
|
|
|
|
| |
| KNN | ApEn, SD1, SD1/SD2 | PSO and SBS | 558.3 | 0.85 | 93.3 | 77.8 | 94.4 | 0.86 | 86.1 | |
|
|
|
|
|
|
|
|
|
|
| |
| Model E | SVM_q | DP_T4, AP_Pz, | PSO and SBS | 1207.0 | 0.80 | 86.7 | 72.2 | 88.9 | 0.81 | 80.6 |
| SVM_r | WE_F4, WE_F7 | SBS | 29.0 | 0.78 | 73.9 | 94.4 | 66.7 | 0.81 | 80.6 | |
|
|
|
|
|
|
|
|
|
|
| |
| DT | DP_T6, GP_T4 | SBS | 35.2 | 0.85 | 81.0 | 94.4 | 77.8 | 0.86 | 86.1 | |
| Model F |
|
|
|
|
|
|
|
|
|
|
| SVM_r | GP_Fz, MFD, SampEn, SD2 | SBS | 125.8 | 0.94 | 94.4 | 94.4 | 94.4 | 0.94 | 94.4 | |
| KNN | GP_T4, MFD, PeEn, TFC | SBS | 200.8 | 0.94 | 100 | 88.9 | 100 | 0.94 | 94.4 | |
| DT | AP_T4, LF, TFC, SD1/SD2 | SBS | 149.3 | 0.92 | 89.5 | 94.4 | 88.9 | 0.92 | 91.7 |
SVM_q: quadratic polynomial as kernel function of SVM. KNN: k-nearest neighbors using Euclidean distance weighting. SVM_rbf: rbf as kernel function of SVM. Fp2, F7, Pz, T3, C3, A2, A1, F4, T4, T5, F8, C4, O1, O2, F3, Fp1, Cz, P4 and T6 represent various channels of EEG signal. Naming rules for EEG features: feature name _ channel name. Mfs: Method of feature selection. Tfs: Time of feature selection.
The confusion matrices of Models C and F.
| Model | Classifier | Mfs | Classified as | CL | BL | CLMM | CLM |
|---|---|---|---|---|---|---|---|
| Model C | SVM_q | SBS | CL | 92.6% | 7.4% | - | - |
| BL | 7.4% | 92.6% | - | - | |||
| SVM_r | SBS | CL | 88.9% | 11.1% | - | - | |
| BL | 7.4% | 92.6% | - | - | |||
| KNN | SBS and PSO | CL | 96.3% | 3.7% | - | - | |
| BL | 14.8% | 85.2% | - | - | |||
|
|
|
|
|
| - | - | |
|
|
|
| - | - | |||
| Model F |
|
|
| - | - |
|
|
|
| - | - |
|
| |||
| SVM_r | SBS | CLMM | - | - | 94.4% | 5.6% | |
| CLM | - | - | 5.6% | 94.4% | |||
| KNN | SBS | CLMM | - | - | 100% | 0% | |
| CLM | - | - | 11.1% | 88.9% | |||
| DT | SBS | CLMM | - | - | 88.9% | 11.1% | |
| CLM | - | - | 5.6% | 94.4% |
The confusion matrix of validation using e-learning data.
| Classifier | Classified as | CL | BL |
|---|---|---|---|
| DT | CL | 55.0% | 45.0% |
| BL | 19.0% | 81.0% |