| Literature DB >> 32867055 |
William L Romine1, Noah L Schroeder2, Josephine Graft1, Fan Yang3, Reza Sadeghi4, Mahdieh Zabihimayvan5, Dipesh Kadariya3, Tanvi Banerjee3.
Abstract
Automated tracking of physical fitness has sparked a health revolution by allowing individuals to track their own physical activity and health in real time. This concept is beginning to be applied to tracking of cognitive load. It is well known that activity in the brain can be measured through changes in the body's physiology, but current real-time measures tend to be unimodal and invasive. We therefore propose the concept of a wearable educational fitness (EduFit) tracker. We use machine learning with physiological data to understand how to develop a wearable device that tracks cognitive load accurately in real time. In an initial study, we found that body temperature, skin conductance, and heart rate were able to distinguish between (i) a problem solving activity (high cognitive load), (ii) a leisure activity (moderate cognitive load), and (iii) daydreaming (low cognitive load) with high accuracy in the test dataset. In a second study, we found that these physiological features can be used to predict accurately user-reported mental focus in the test dataset, even when relatively small numbers of training data were used. We explain how these findings inform the development and implementation of a wearable device for temporal tracking and logging a user's learning activities and cognitive load.Entities:
Keywords: cognitive load; learning analytics; machine learning; studying; wearable sensor
Mesh:
Year: 2020 PMID: 32867055 PMCID: PMC7506959 DOI: 10.3390/s20174833
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Summary of the three tasks used in Study 1, with cognition state and hypothesized level of cognitive load.
| Cognition State | Cognitive Load | Associated Task |
|---|---|---|
| Deep Cognition | High | Playing Sudoku |
| Leisured Cognition | Moderate | Browsing the Internet |
| Daydreaming | Low | Sitting and Doing Nothing |
Descriptive statistics for each participant’s physiological measures across each activity in Study 1. N = number of measurements (1 measure per second), EDA = electrodermal activity, HR = heart rate, and TEMP = body temperature.
| EDA (μS) | HR (Beats/Minute) | TEMP (°C) | ||||||
|---|---|---|---|---|---|---|---|---|
| Person | Activity | N | Mean | SD | Mean | SD | Mean | SD |
| 1 | Sudoku | 1198 | 0.06 | 0.03 | 72.61 | 10.70 | 32.32 | 1.67 |
| Browsing Internet | 1273 | 0.04 | 0.02 | 81.37 | 18.26 | 31.96 | 1.92 | |
| Daydreaming | 1359 | 0.05 | 0.01 | 68.55 | 7.68 | 31.16 | 1.53 | |
| 2 | Sudoku | 9813 | 4.07 | 3.03 | 92.08 | 10.33 | 32.53 | 0.43 |
| Browsing Internet | 9074 | 3.27 | 2.19 | 92.24 | 12.32 | 32.52 | 0.38 | |
| Daydreaming | 8860 | 2.81 | 2.68 | 88.74 | 12.90 | 32.55 | 0.33 | |
| 3 | Sudoku | 4260 | 0.17 | 0.03 | 78.69 | 7.62 | 30.92 | 1.64 |
| Browsing Internet | 3900 | 0.19 | 0.05 | 73.42 | 8.55 | 30.92 | 1.58 | |
| Daydreaming | 3229 | 0.19 | 0.08 | 73.50 | 8.15 | 31.56 | 0.60 | |
| 4 | Sudoku | 2296 | 0.10 | 0.09 | 84.98 | 6.70 | 33.52 | 1.65 |
| Browsing Internet | 2562 | 0.14 | 0.08 | 80.96 | 3.87 | 32.54 | 0.99 | |
| Daydreaming | 1735 | 0.09 | 0.01 | 89.97 | 3.85 | 34.49 | 0.78 | |
| 5 | Sudoku | 2227 | 0.16 | 0.11 | 82.92 | 14.37 | 30.05 | 1.30 |
| Browsing Internet | 2169 | 0.11 | 0.06 | 74.94 | 8.83 | 33.01 | 0.62 | |
| Daydreaming | 1804 | 0.05 | 0.04 | 71.89 | 11.35 | 31.93 | 2.53 | |
| 6 | Sudoku | 2880 | 0.13 | 0.06 | 70.02 | 7.77 | 32.59 | 2.38 |
| Browsing Internet | 3600 | 0.14 | 0.05 | 68.92 | 8.64 | 32.70 | 1.94 | |
| Daydreaming | 3420 | 0.14 | 0.08 | 74.29 | 7.60 | 32.06 | 2.61 | |
| 7 | Sudoku | 10200 | 0.38 | 0.23 | 75.17 | 5.32 | 32.93 | 0.71 |
| Browsing Internet | 3840 | 0.39 | 0.25 | 80.10 | 16.35 | 32.65 | 0.42 | |
| Daydreaming | 3540 | 0.43 | 0.44 | 73.85 | 6.25 | 32.31 | 0.89 | |
Classification performance of the test dataset. Models include the constant (outcome bias), baseline models (k-nearest neighbors (KNN), logistic regression, naïve Bayes, decision tree), and black box models (support vector machine (SVM), neural network, AdaBoost, and random forest).
| Activity (Resting, Web, Sudoku) | Liberal (95% Training) | Conservative (10% Training) | ||||||
|---|---|---|---|---|---|---|---|---|
| Model | AUC | F1 | Precision | Recall | AUC | F1 | Precision | Recall |
| Constant | 0.49 | 0.22 | 0.16 | 0.40 | 0.50 | 0.22 | 0.16 | 0.39 |
| K-Nearest Neighbors (k = 3) | 0.97 | 0.93 | 0.93 | 0.93 | 0.89 | 0.84 | 0.84 | 0.84 |
| Logistic Regression | 0.52 | 0.23 | 0.44 | 0.41 | 0.36 | 0.28 | 0.41 | 0.41 |
| Naïve Bayes | 0.65 | 0.45 | 0.46 | 0.47 | 0.67 | 0.44 | 0.45 | 0.46 |
| Decision Tree (depth = 4) | 0.68 | 0.43 | 0.57 | 0.49 | 0.67 | 0.45 | 0.53 | 0.48 |
| Support Vector Machine | 0.50 | 0.31 | 0.33 | 0.32 | 0.44 | 0.28 | 0.31 | 0.29 |
| Neural Network | 0.79 | 0.61 | 0.61 | 0.61 | 0.33 | 0.45 | 0.47 | 0.47 |
| AdaBoost | 0.93 | 0.92 | 0.92 | 0.92 | 0.78 | 0.80 | 0.80 | 0.80 |
| Random Forest | 0.99 | 0.94 | 0.94 | 0.94 | 0.93 | 0.85 | 0.85 | 0.85 |
Descriptive statistics for each participant’s physiological measures across the activities in Study 2. N = number of measurements (1 measure per second), EDA = electrodermal activity, HR = heart rate, and TEMP = body temperature.
| EDA (μS) | HR (Beats/Minute) | TEMP (°C) | ||||||
|---|---|---|---|---|---|---|---|---|
| Person | Focused | N | Mean | SD | Mean | SD | Mean | SD |
| 1 | No | 60 | 0.19 | 0.04 | 98.18 | 15.36 | 31.84 | 0.08 |
| Yes | 514 | 0.22 | 0.02 | 74.74 | 7.05 | 32.29 | 0.13 | |
| 2 | No | 0 | ||||||
| Yes | 855 | 0.09 | 0.03 | 97.94 | 18.28 | 33.68 | 0.43 | |
| 3 | No | 117 | 0.07 | 0.01 | 80.29 | 4.37 | 30.85 | 0.06 |
| Yes | 880 | 0.09 | 0.01 | 75.10 | 5.03 | 30.48 | 0.18 | |
| 4 | No | 176 | 0.04 | 0.02 | 89.62 | 16.05 | 29.16 | 0.21 |
| Yes | 583 | 0.05 | 0.02 | 90.47 | 8.59 | 30.69 | 1.44 | |
| 5 | No | 0 | ||||||
| Yes | 475 | 0.20 | 0.07 | 79.36 | 9.74 | 32.33 | 0.15 | |
| 6 | No | 293 | 0.38 | 0.08 | 71.40 | 10.63 | 33.53 | 0.35 |
| Yes | 250 | 0.42 | 0.05 | 80.45 | 11.80 | 33.50 | 0.12 | |
| 7 | No | 653 | 0.11 | 0.03 | 96.29 | 19.11 | 33.80 | 0.27 |
| Yes | 238 | 0.06 | 0.02 | 101.41 | 13.67 | 33.38 | 0.58 | |
Binary logistic regression model for predicting a user’s reported state of focus based on EDA, body temperature, and heart rate. This model controls for variations in each participant’s intercepts and trends (χ2 = 4224.09, df = 31, p << 0.001).
| Variable | B | SE | χ2 (df = 1) | OR | |
|---|---|---|---|---|---|
| EDA | −1.424 | 0.114 | 157.288 | 0.000 | 0.241 |
| TEMP | −0.954 | 0.098 | 94.613 | 0.000 | 0.385 |
| HR | 0.332 | 0.111 | 8.941 | 0.003 | 1.394 |
Classification performance of the test dataset. Models include the constant (outcome bias), baseline models (k-nearest neighbors (KNN), logistic regression, naïve Bayes, decision tree), and black box models (support vector machine (SVM), neural network, AdaBoost, and random forest).
| Reported Focus (Yes/No) | Liberal (95% Training) | Conservative (10% Training) | ||||||
|---|---|---|---|---|---|---|---|---|
| Model | AUC | F1 | Precision | Recall | AUC | F1 | Precision | Recall |
| Constant | 0.46 | 0.64 | 0.56 | 0.75 | 0.50 | 0.64 | 0.56 | 0.75 |
| K-Nearest Neighbors (k = 3) | 0.87 | 0.80 | 0.80 | 0.80 | 0.81 | 0.79 | 0.79 | 0.79 |
| Logistic Regression | 0.54 | 0.64 | 0.56 | 0.75 | 0.52 | 0.64 | 0.69 | 0.75 |
| Naïve Bayes | 0.68 | 0.64 | 0.56 | 0.75 | 0.66 | 0.66 | 0.68 | 0.74 |
| Decision Tree (depth = 4) | 0.73 | 0.75 | 0.81 | 0.80 | 0.71 | 0.74 | 0.74 | 0.77 |
| Support Vector Machine | 0.51 | 0.61 | 0.61 | 0.61 | 0.53 | 0.64 | 0.62 | 0.67 |
| Neural Network | 0.54 | 0.64 | 0.56 | 0.75 | 0.52 | 0.64 | 0.68 | 0.75 |
| AdaBoost | 0.86 | 0.90 | 0.90 | 0.90 | 0.72 | 0.78 | 0.78 | 0.78 |
| Random Forest | 0.96 | 0.90 | 0.90 | 0.91 | 0.85 | 0.81 | 0.81 | 0.82 |
Figure 1Conceptual diagram of the EduFit system, including the user, the wearable sensor, the smartphone app, and a web server.