| Literature DB >> 33217938 |
Andree Vela1, Joanna Alvarado-Uribe1, Manuel Davila2, Neil Hernandez-Gress1, Hector G Ceballos1.
Abstract
The understanding of occupancy patterns has been identified as a key contributor to achieve improvements in energy efficiency in buildings since occupancy information can benefit different systems, such as HVAC (Heating, Ventilation, and Air Conditioners), lighting, security, and emergency. This has meant that in the past decade, researchers have focused on improving the precision of occupancy estimation in enclosed spaces. Although several works have been done, one of the less addressed issues, regarding occupancy research, has been the availability of data for contrasting experimental results. Therefore, the main contributions of this work are: (1) the generation of two robust datasets gathered in enclosed spaces (a fitness gym and a living room) labeled with occupancy levels, and (2) the evaluation of three Machine Learning algorithms using different temporal resolutions. The results show that the prediction of 3-4 occupancy levels using the temperature, humidity, and pressure values provides an accuracy of at least 97%.Entities:
Keywords: Internet of Things; Machine Learning; enclosed spaces; energy efficiency; environmental variables; occupancy estimation
Mesh:
Year: 2020 PMID: 33217938 PMCID: PMC7698753 DOI: 10.3390/s20226579
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Summary of the reviewed related work.
| Author | Year | Variables | Algo. | OR | SR | TR | BR | GT | Demo Scale | Dataset |
|---|---|---|---|---|---|---|---|---|---|---|
| Viani et al. [ | 2014 | temp., hum., CO | SVM | 2 | 3 | N/A | 0.82 acc. | N/A | 29 IoT nodes | own |
| Cali et al. [ | 2015 | CO | Mass balance | 2 | 3 | N/A | 0.79 custom. | man. | 8 d., 5 rooms | own |
| Fiebig et al. [ | 2017 | VOC, Network, Bt. K.F., calendar | MLP, kNN, DT, RF | 1 | 1 | N/A | 0.75 F1 | man. | 52 d., 2 bedroom apt. | own |
| Parise et al. [ | 2019 | temp., hum., CO | SVM | 1 | 3 | 10 s | 0.96 F1 | N/A | 2 weeks, 1 room | own |
| Adeogun et al. [ | 2019 | press, hum., CO | FNN | 2 | 3 | 5 min | 0.91 acc. | man. | 6 weeks, 2 off., 4 pers. | own |
| Zemouri et al. [ | 2019 | temp., hum. | LR, LDA, kNN, CART, NB, SVM, GB | 1 | 3 | 5 min | 0.83 acc. | video | 37 d., 1 off., 3 pers. | own |
| Huchuck et al. [ | 2019 | temp., hum., o. temp., PIR, others | LR, MM, HMM, RF, LSTM | 1 | 3 | 30 min | 0.75 acc. | No | 1 year, 100 tstat. | DYP [ |
| Chitu et al. [ | 2019 | CO | RF, ELM | 2 | 3 | 1 min | 69.17 acc. | algo. | 15 d., 4 rooms | USD-OD [ |
| Jiang et al. [ | 2020 | CO | Bayessian Filtering with ELM and IMM | 2 | 3 | 15 min | 0.77 acc. | video | 30 d., 1 off., 28 pers. | own |
| Kumar et al. [ | 2020 | temp., hum., hum. ratio, CO | ELM | 1 | 3 | 1 min | 0.99 acc. | photos | building office | UCI [ |
| Zhou et al. [ | 2020 | CO | gcForest | 2 | 2 | 1 min | 0.82 acc. | video | 20 d., 1 lab., 4 pers. | own |
| Yuan et al. [ | 2020 | IR array sensor, temp. | IHMM | 2 | 3 | N/A | 0.81 acc. | video | 6 IoT nodes, 1 lab. | own |
Columns: Algorithms (Algo.), Occupancy Resolution (OR), Space Resolution (SR), Time Resolution (TR), Best Result (BR), Ground Truth (GT). Variables: Temperature (temp.), humidity (hum.), CO2Accumulation (CO2), CO2Occupancy Profile (CO2Prof.), Windows State (windows), Doors State (doors), Network Connections (Network), Bluetooth Key Fobs (Bt. K.F.), Pressure (press), outdoor temperature (o. temp.), infrared(IR), Volatile Organic Compound (VOC). Algorithms: SVM (Support Vector Machine), MLP (Multilayer Perceptron), kNN (k-Nearest Neighbor), DT (Decision Tress), RF (Random Forest), FNN (Feedforward Neural Network), LR (Logistic Regression), LDA (Linear Discriminant Analysis), CART (Classification and Regression Trees), NB (Naive Bayes), GB (Gradient Boost), MM (Markov Model), HMM (Hidden Markov Model), LSTM (Long Short-Term Memory), ELM (Extreme Learning Machine), gcForest (Multigrain Cascade Forest), IMM (Inhomogeneous Markov Model), IHMM (Inhomogeneous Hidden Markov Model). Occupancy Resolution: (1) Detection, (2) Estimation. Space Resolution: (1) Building, (2) Floor, (3) Room. Best Result: ACC (Accuracy), F1 (F1 Score), Custom (in a period, percentage of time the algorithm was correct). Ground Truth: M (manual), V (video), P (photograph). Demo Scale: D (days), W (weeks), Y (years), off. (office), apt. (apartment), lab. (laboratory), pers. (person), tstat (thermostat).
Figure 1Electronic circuit used for data collection.
Figure 2Sketch of the living room. The locations of the environmental sensor, ceiling fan, and AC unit are shown. The monitored area is approximately 32 m.
Datasets before and after being balanced.
| Dataset | Before | After | ||||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| Living room 10 s | 4098 | 16,325 | 2830 | 547 | 16,353 | 16,325 | 16,330 | 16,329 |
| Living room 30 s | 1345 | 5450 | 956 | 188 | 5445 | 5450 | 5436 | 5452 |
| Living room 1 min | 673 | 2730 | 480 | 92 | 2720 | 2730 | 2720 | 2732 |
| Living room 5 min | 136 | 553 | 96 | 19 | 556 | 553 | 555 | 554 |
| Gym 10 s | N/A | 202 | 430 | 189 | N/A | 430 | 430 | 430 |
| Gym 30 s | N/A | 149 | 149 | 59 | N/A | 149 | 149 | 150 |
| Gym 1 min | N/A | 33 | 77 | 34 | N/A | 77 | 77 | 77 |
| Gym 5 min | N/A | 21 | 9 | 5 | N/A | 21 | 21 | 21 |
Figure 3Data distribution for (a) relative humidity, (b) temperature, and (c) pressure grouped by occupancy level and location. Where the occupancy levels are E = Empty, L = Low, M = Medium, and H = High.
Figure 43D scatter plot based on the (a) living room and (b) fitness gym data. The occupancy levels are indicated as follows: low (yellow), medium (orange), and high (red).
Figure 5Humidity (green), temperature (blue), and occupancy (red) timeline of the living room data. Time gaps are not shown and occupancy is encoded from zero (Empty) to three (High).
Parameters used for Features Selection and Resolution Selection experimentation.
| Location | Dataset | DT | kNN | SVM | |||
|---|---|---|---|---|---|---|---|
| Criterion | Max Depth | Algorithm | Neighbors | c | Gamma | ||
| Feature Selection | FULL | entropy | 10 | ball_tree | 1 | 10 | 1 |
| RFE | entropy | 6 | ball_tree | 1 | 1 | 1 | |
| KBEST | entropy | 10 | ball_tree | 1 | 1 | 1 | |
| MIN | entropy | 8 | ball_tree | 1 | 1 | 1 | |
| Feature Selection | FULL | gini | 12 | ball_tree | 1 | N/A | N/A |
| RFE | gini | 12 | ball_tree | 1 | N/A | N/A | |
| KBEST | gini | 12 | ball_tree | 1 | N/A | N/A | |
| MIN | gini | 12 | ball_tree | 1 | N/A | N/A | |
| Resolution Selection | 10 s | entropy | 8 | ball_tree | 1 | 1 | 1 |
| 10 avg. | entropy | 10 | ball_tree | 1 | 1 | 1 | |
| 30 s | entropy | 16 | ball_tree | 1 | 1 | 1 | |
| 30 avg. | entropy | 14 | ball_tree | 1 | 1 | 1 | |
| 1 min | entropy | 14 | ball_tree | 1 | 1 | 1 | |
| 1 avg. | gini | 16 | ball_tree | 1 | 1 | 1 | |
| Resolution Selection | 10 s | entropy | 22 | ball_tree | 1 | 100 | 10 |
| 10 avg. | entropy | 24 | ball_tree | 1 | 100 | 10 | |
| 30 s | entropy | 22 | ball_tree | 1 | 100 | 10 | |
| 30 avg. | entropy | 22 | ball_tree | 1 | 100 | 10 | |
| 1 min | entropy | 20 | ball_tree | 1 | 100 | 10 | |
| 1 avg. | gini | 20 | ball_tree | 1 | 100 | 10 | |
| 5 min | entropy | 18 | ball_tree | 1 | 100 | 10 | |
| 5 avg. | entropy | 12 | ball_tree | 1 | 100 | 10 | |
Subsets of features for the fitness gym dataset. An X indicates a selected feature.
| Humidity | Temperature | Pressure | |||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| |
| FULL | X | X | X | X | X | X | X | X | X |
| RFE | X | X | X | X | X | ||||
| KBEST | X | X | X | X | X | ||||
| MIN | X | X | X | ||||||
Feature selection results for the living room and the fitness gym. Individual scores are shown in green, and mean scores in yellow. A stronger color indicates a higher score.
| Living Room | Fitness Gym | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| |
| SVM | NA | NA | NA | NA | 0.9512 | 1.0 | 0.9951 | 1.0 | 0.9866 | |
| kNN | 0.7868 | 0.9231 | 0.9172 | 0.9848 | 0.9030 | 0.9512 | 1.0 | 0.9853 | 1.0 | 0.9841 |
| DT | 0.9441 | 0.9562 | 0.9586 | 0.9663 | 0.9563 | 0.9951 | 0.9951 | 0.9902 | 1.0 | 0.9951 |
| Mean | 0.8655 | 0.9397 | 0.9379 | 0.9756 | 0.9658 | 0.9984 | 0.9902 | 1.0 | ||
Resolution selection results for the fitness gym location. Individual scores are shown in green and mean scores in yellow. A stronger color indicates a higher score.
| 1 Sample | Averaged | |||||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| |
| SVM | 0.9951 | 1.0 | 1.0 | 0.9984 | 1.0 | 0.9857 | 1.0 | 0.9952 |
| kNN | 0.9951 | 1.0 | 1.0 | 0.9984 | 1.0 | 1.0 | 1.0 | 1.0 |
| DT | 1.0 | 1.0 | 0.9722 | 0.9907 | 0.9902 | 1.0 | 1.0 | 0.9967 |
| Mean | 0.9967 | 1.0 | 0.9907 | 0.9967 | 0.9952 | 1.0 | ||
Subsets of features for the living room dataset. An X indicates a selected feature.
| Humidity | Temperature | Pressure | |||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| |
| FULL | X | X | X | X | X | X | X | X | X |
| RFE | X | X | X | X | X | ||||
| KBEST | X | X | X | X | X | ||||
| MIN | X | X | X | ||||||
Resolution selection results for the living room location. Individual scores are shown in green and mean scores in yellow. A stronger color indicates a higher score.
| 1 Sample | Averaged | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| |
| SVM | 0.9716 | 0.9652 | 0.9517 | 0.9054 | 0.9485 | 0.9756 | 0.9657 | 0.9657 | 0.8955 | 0.9506 |
| kNN | 0.9848 | 0.9753 | 0.9577 | 0.9104 | 0.9571 | 0.9882 | 0.9768 | 0.9607 | 0.9452 | 0.9677 |
| DT | 0.9825 | 0.9712 | 0.9436 | 0.8358 | 0.9333 | 0.9831 | 0.9743 | 0.9476 | 0.9054 | 0.9526 |
| Mean | 0.9796 | 0.9706 | 0.9510 | 0.8839 | 0.9823 | 0.9723 | 0.9580 | 0.9154 | ||
Comparison of best results with results in the related works.
| Viani | Adeogun | Chitu | Jiang | Zhou | Fitness Gym | Living Room | |
|---|---|---|---|---|---|---|---|
| Accuracy | 0.82 | 0.91 | 0.69 | 0.77 | 0.82 | 1.0 | 0.98 |
| Variables | temp. | temp. | CO | CO | CO | temp. | temp. |