| Literature DB >> 32365603 |
Francesco Salamone1, Benedetta Barozzi1, Ludovico Danza1, Matteo Ghellere1, Italo Meroni1.
Abstract
Users' satisfaction in indoor spaces plays a key role in building design. In recent years, scientific research has focused more and more on the effects produced by the presence of greenery solutions in indoor environments. In this study, the Internet of Things (IoT) concept is used to define an effective solution to monitor indoor environmental parameters, along with the biometric data of users involved in an experimental campaign conducted in a Zero Energy Building laboratory where a living wall has been installed. The growing interest in the key theory of the IoT allows for the development of promising frameworks used to create datasets usually managed with Machine Learning (ML) approaches. Following this tendency, the dataset derived by the proposed infield research has been managed with different ML algorithms in order to identify the most suitable model and influential variables, among the environmental and biometric ones, that can be used to identify the plant configuration. The obtained results highlight how the eXtreme Gradient Boosting (XGBoost)-based model can obtain the best average accuracy score to predict the plant configuration considering both a selection of environmental parameters and biometric data as input values. Moreover, the XGBoost model has been used to identify the users with the highest accuracy considering a combination of picked biometric and environmental features. Finally, a new Green View Factor index has been introduced to characterize how greenery has an impact on the indoor space and it can be used to compare different studies where green elements have been used.Entities:
Keywords: IoT; living wall; machine learning; wearable
Mesh:
Year: 2020 PMID: 32365603 PMCID: PMC7249040 DOI: 10.3390/s20092523
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Main features of the reference literature and proposed study.
| Reference Study | Test Environment | Greenery | Environmental Data | Biometric | Data Processing |
|---|---|---|---|---|---|
| [ | Classroom | Real plants | Monitored | Not monitored | Statistical |
| [ | Offices | Real plants | Monitored | Not monitored | Statistical |
| [ | University | Living wall | Monitored | Not monitored | Statistical |
| [ | University | Active Living Wall | Monitored | Not monitored | Statistical |
| [ | University | Real plants and photos | Not monitored | Monitored | Statistical |
| [ | Experimental room | Real plants | Not monitored | Monitored | Statistical |
| [ | Hospital | Photos on monitor | Not monitored | Monitored | Statistical |
| [ | Hospital | Photos | Not monitored | Not monitored | Statistical |
| [ | Hospital | Real plants | Not monitored | Monitored | Statistical |
| [ | Office | Real plants | Not monitored | Monitored | Statistical |
| [ | School | Real plants | Not monitored | Monitored | Statistical |
| [ | Office | Photos | Not monitored | Monitored | Statistical |
| [ | School | Living wall | Not monitored | Not monitored | Statistical |
| [ | Office | Real plants and Virtual Reality | Not monitored | Monitored | Statistical |
| Current study | Office lab | Living wall | Monitored | Monitored | Machine Learning |
Figure 1LabZEB: (a) outdoor view; (b) indoor plan.
Figure 2A1 room: (a) spatial distribution of sensors; (b) photo of the room set up.
Characteristics of sensors installed in the test cell.
| Sensor ( | Position ( | Variable | U.M. | Measure Range | Accuracy |
|---|---|---|---|---|---|
| TRH | In front of P1 and P2 | Relative Humidity | [%] | 0 ÷ 100% | ±2 % |
| Air Temperature | [°C] | −40 ÷ +60 °C | ±0.1 °C | ||
| TRA | Near P1 and P2 | Radiant Temperature (derived) | [°C] | −40 ÷ +60 °C | ±0.1 °C |
| ANM | Left to P1 and P2 | Air Velocity | [m/s] | 0 ÷ 5 m/s | ±0.02 m/s |
| Air Temperature | [°C] | −20 ÷ +80 °C | ±0.3 °C | ||
| LXL | In front of P1 and P2 | Illuminance | [lx] | ||
| CO2 | Behind P1 and P2 | CO2 concentration | [ppm] | 0-5000 ppm | ±50 ppm |
| VOC | Behind P1 and P2 | Volatile Organic Compounds | [%] | 0 ÷ 100 % of VOC | ±20 % |
| PPG sensor | Smart wearable | Heat Recovery (HR) (derived) | [bpm] | - | - |
| EDA sensor | Smart wearable | EDA | [μS] | 0.01 ÷ 100 µS | - |
| Skin temperature sensor | Smart wearable | Tskin | [°C] | −40 ÷ +85 °C | - |
| 3-axes accelerometer | Smart wearable | Accelerations | [g] | ±2 g | - |
Questions of Web-based survey.
| Question ID | Questions | Answer Options |
|---|---|---|
| Q1 | How do you evaluate the performance of your work? | 1 (tiring) to 5 (untiring) |
| Q2 | How do you assess thermo-hygrometric wellness on average? | 1 (very unsatisfactory) to 5 (very satisfactory) |
| Q3 | How do you assess the air quality on average? | 1 (very unsatisfactory) to 5 (very satisfactory) |
| Q4 | How do you assess the lighting quality on average? | 1 (very unsatisfactory) to 5 (very satisfactory) |
Considered plant configurations and related setting.
| Plant Configuration | Setting |
|---|---|
| 1 | Living wall absent |
| 2 | Living wall absent |
| 3 | Living wall present |
| 4 | Living wall present |
Figure 3P1 and P2 Field of View: (a) P1 photo; (b) P2 photo; (c) P1 Black and white file for Green View Factor (GVF) calculation; (d) P2 Black and white file for GVF calculation; (e) P1 green area peripheral vision; (f) P2 green area peripheral vision.
Figure 4Starting dataset and related parameters: in yellow—null data, in blue—non-null data.
Figure 5Correlation matrix between environmental parameters.
Dataset attributes and description.
| ID | Label | Number | Type | Description |
|---|---|---|---|---|
| 0 | ID | 5692 | non-null int64 | ID progressive |
| 1 | Data&Time | 5692 | non-null datetime64 | Date and time |
| 2 | VA | 5692 | non-null float64 | Air velocity [m/s] measured close to the workstations P1 and P2 |
| 3 | TA | 5692 | non-null float64 | Air temperature [°C] measured by the anemometer close to the workstations P1 and P2 |
| 4 | AT | 5692 | non-null float64 | Air temperature [°C] measured by the thermo-hygrometer close to the workstations P1 and P2 |
| 5 | RH | 5692 | non-null float64 | Relative humidity [%] measured by the thermo-hygrometer close to the workstations P1 and P2 |
| 6 | TRA | 5692 | non-null float64 | Radiant temperature [°C] measured by the globe thermometer close to the workstations P1 and P2 |
| 7 | LX | 5692 | non-null float64 | Illuminance [lx] measured by the luxmeter close to the workstations P1 and P2 |
| 8 | CO2 | 5692 | non-null int64 | CO2 indoor concentration [ppm] close to the workstations P1 and P2 |
| 9 | VOC | 5692 | non-null int64 | VOCs [%] close to the workstations P1 and P2 |
| 10 | EDA | 5692 | non-null float64 | ElectroDermal Activity [μS] |
| 11 | AccelX | 5692 | non-null float64 | Acceleration along the X axis [g] |
| 12 | AccelY | 5692 | non-null float64 | Acceleration along the Y axis [g] |
| 13 | AccelZ | 5692 | non-null float64 | Acceleration along the Z axis [g] |
| 14 | Temp | 5692 | non-null float64 | Skin temperature [°C] |
| 15 | motion | 5692 | non-null float64 | Root mean squared 3 axis acceleration [ |
| 16 | HR | 5692 | non-null int64 | Heart rate [bpm] |
| 17 | Q1 | 5692 | non-null int64 | Question 1 (see |
| 18 | Q2 | 5692 | non-null int64 | Question 2 (see |
| 19 | Q3 | 5692 | non-null int64 | Question 3 (see |
| 20 | Q4 | 5692 | non-null int64 | Question 4 (see |
| 21 | IEQ_avg | 5692 | non-null float64 | IEQ as a weighted average of previous scores |
| 22 | User | 5692 | non-null float64 | Number of the user |
| 23 | P1/P2 | 5692 | non-null object | P1/P2 workstation |
| 24 | M/A | 5692 | non-null object | Morning/Afternoon |
| 25 | Plant_Config. | 5692 | non-null int64 | Configuration as reported in |
Figure 6Feature importance—environmental data for plant configuration identification.
Average accuracy—selected environmental data for plant configuration identification. With *: average accuracy defined considering the tuning of hyperparameters.
| Algorithm | Average Accuracy | Standard Deviation | ||
|---|---|---|---|---|
| 25 = | 25 = | 25 = | 25 = | |
| LR | 0.988* | 0.988* | 0.006* | 0.005* |
| LDA | 0.969 | 0.634 | 0.008 | 0.028 |
| KNN | 0.982* | 0.994* | 0.005 | 0.005* |
| CART | 0.997 | 0.974* | 0.002 | 0.008* |
| ETC | 0.993 | 0.875 | 0.006 | 0.015 |
| NB | 0.984 | 0.621 | 0.006 | 0.027 |
| SVM | 0.977* | 0.986* | 0.008* | 0.007* |
| RF | 0.998* | 0.994* | 0.002* | 0.005* |
| XGBoost | 0.998* | 0.995* | 0.002* | 0.004* |
Hyperparameters tuning range.
| Algorithms | Hyperparameters | Range |
|---|---|---|
| LR | Solver | [‘newton-cg’, ‘lbfgs’, ‘liblinear’] |
| Penalty | [‘l1′, ‘l2′, ‘elasticnet’, ‘none’] | |
| C_value | [100, 10, 1.0, 0.1, 0.01] | |
| KNN | Leaf_size | range(1,10,2) |
| n_neighbors | range(1,30,5) | |
| p_value | [ | |
| CART | Max_depth | range(1,50,4) |
| Min_samples_leaf | [i/10.0 for i in range(1,6)] | |
| Max_features | [i/10.0 for i in range(1,11)] | |
| ETC | Max_depth | range(1,50,4) |
| Min_samples_leaf | [i/10.0 for i in range(1,6)] | |
| Max_features | [i/10.0 for i in range(1,11)] | |
| SVM | Kernel | [‘poly’, ‘rbf’, ‘sigmoid’] |
| C_value | [50, 10, 1.0, 0.1, 0.01] | |
| RF | n_estimators | range(1,22,2) |
| XGBoost | Max_depth | range(3,10,2) |
| Min_child_weight | range(1,6,2) | |
| Gamma | [i/10.0 for i in range(0,5)] |
Validation—selected environmental data for plant configuration identification.
| Plant Config. | Precision | Recall | F1-Score | Support | |
|---|---|---|---|---|---|
| RF, 25 = | 1 | 0.98 | 0.87 | 0.92 | 119 |
| 2 | 0.95 | 0.99 | 0.97 | 290 | |
| 3 | 1.00 | 1.00 | 1.00 | 330 | |
| 4 | 1.00 | 1.00 | 1.00 | 400 | |
| XGBoost, 25 = | 1 | 1.00 | 1.00 | 1.00 | 119 |
| 2 | 0.99 | 0.99 | 1.00 | 290 | |
| 3 | 1.00 | 0.99 | 1.00 | 330 | |
| 4 | 0.98 | 1.00 | 1.00 | 400 | |
| 1 | 1.00 | 0.95 | 0.97 | 119 | |
| XGBoost, 25 = | 2 | 0.98 | 1.00 | 0.99 | 290 |
| 3 | 1.00 | 1.00 | 1.00 | 330 | |
| 4 | 0.99 | 1.00 | 0.99 | 400 |
Figure 7Correlation matrix between biometric parameters.
Figure 8Feature importance—biometric parameters for plant configuration identification.
Average accuracy—selected biometric data for plant configuration identification. With *: average accuracy defined considering the tuning of hyperparameters.
| Algorithm | Avgerage Accuracy | Standard Deviation | ||
|---|---|---|---|---|
| 25 = | 25 = | 25 = | 25 = | |
| LR | 0.500* | 0.500* | 0.015* | 0.015* |
| LDA | 0.363 | 0.363 | 0.024 | 0.023 |
| KNN | 0.871* | 0.742* | 0.011 | 0.019* |
| CART | 0.855 | 0.626 | 0.011 | 0.016 |
| ETC | 0.848 | 0.632 | 0.024 | 0.024 |
| NB | 0.397 | 0.347 | 0.015 | 0.024 |
| SVM | 0.559* | 0.495* | 0.015* | 0.014* |
| RF | 0.891* | 0.687* | 0.011 | 0.020* |
| XGBoost | 0.902* | 0.743* | 0.014 | 0.016* |
Validation—selected biometric data for plant configuration identification.
| Plant Config. | Precision | Recall | F1-Score | Support | |
|---|---|---|---|---|---|
| XGBoost, 25 = | 1 | 0.94 | 0.92 | 0.93 | 119 |
| 2 | 0.82 | 0.84 | 0.83 | 290 | |
| 3 | 0.92 | 0.91 | 0.91 | 330 | |
| 4 | 0.85 | 0.85 | 0.85 | 400 | |
| XGBoost, 25 = | 1 | 0.89 | 0.92 | 0.91 | 119 |
| 2 | 0.70 | 0.68 | 0.69 | 290 | |
| 3 | 0.74 | 0.70 | 0.72 | 330 | |
| 4 | 0.70 | 0.73 | 0.71 | 400 |
Figure 9Feature importance—environmental and biometric data for user identification.
Average accuracy—selected environmental and biometric data for user identification. With *: average accuracy defined considering the tuning of hyperparameters.
| Algorithm | Average Accuracy | Standard Deviation |
|---|---|---|
| LR | 0.364* | 0.018 |
| LDA | 0.217 | 0.021 |
| KNN | 0.963* | 0.006* |
| CART | 0.816 | 0.009 |
| ETC | 0.788 | 0.013 |
| NB | 0.321 | 0.030 |
| SVM | 0.251* | 0.015* |
| RF | 0.957* | 0.009* |
| XGBoost | 0.959* | 0.008* |
Validation—selected environmental and biometric data for user identification.
| User | Precision | Recall | F1-Score | Support | |
|---|---|---|---|---|---|
| XGBoost, 22 = | 1 | 0.99 | 0.97 | 0.98 | 106 |
| 2 | 0.94 | 0.98 | 0.96 | 138 | |
| 3 | 0.92 | 0.95 | 0.93 | 83 | |
| 4 | 0.91 | 0.95 | 0.93 | 63 | |
| 5 | 0.96 | 0.93 | 0.94 | 137 | |
| 6 | 0.95 | 0.94 | 0.94 | 160 | |
| 7 | 0.98 | 0.91 | 0.94 | 159 | |
| 8 | 0.95 | 0.97 | 0.96 | 162 | |
| 9 | 0.93 | 0.96 | 0.95 | 131 |
Average accuracy—selected environmental and biometric data for plant configuration identification. With *: average accuracy defined considering the tuning of hyperparameters.
| Algorithm | Average Accuracy | Standard Deviation | ||
|---|---|---|---|---|
| 25 = | 25 = | 25 = | 25 = | |
| LR | 0.977* | 0.952* | 0.005* | 0.008* |
| LDA | 0.964 | 0.937 | 0.008 | 0.008 |
| KNN | 0.998* | 0.986* | 0.002* | 0.005* |
| CART | 0.984 | 0.983 | 0.008 | 0.007 |
| ETC | 0.978 | 0.977 | 0.005 | 0.007 |
| NB | 0.605 | 0.933 | 0.026 | 0.008 |
| SVM | 0.961* | 0.967* | 0.009 | 0.007* |
| RF | 0.994* | 0.986* | 0.003 | 0.004* |
| XGBoost | 0.998* | 0.986* | 0.002 | 0.004* |
Validation—selected environmental and biometric data for plant configuration identification.
| Plant Config. | Precision | Recall | F1-Score | Support | |
|---|---|---|---|---|---|
| XGBoost, 25 = | 1 | 1.00 | 1.00 | 1.00 | 119 |
| 2 | 1.00 | 1.00 | 1.00 | 290 | |
| 3 | 1.00 | 1.00 | 1.00 | 330 | |
| 4 | 1.00 | 1.00 | 1.00 | 400 | |
| XGBoost, 25 = | 1 | 0.98 | 1.00 | 0.99 | 119 |
| 2 | 1.00 | 0.99 | 1.00 | 290 | |
| 3 | 0.98 | 0.99 | 0.98 | 330 | |
| 4 | 1.00 | 0.99 | 0.99 | 400 | |
| 1 | 0.98 | 1.00 | 0.99 | 119 | |
| RF, 25 = | 2 | 1.00 | 0.99 | 1.00 | 290 |
| 3 | 0.98 | 0.98 | 0.98 | 330 | |
| 4 | 0.99 | 0.98 | 0.99 | 400 |
Figure 10SHapley Additive exPlanations (SHAP) value plot.