| Literature DB >> 34640986 |
Abdulah Jeza Aljohani1, Junaid Shuja2, Waleed Alasmary3, Abdulaziz Alashaikh4.
Abstract
COVID-19 tracing applications have been launched in several countries to track and control the spread of viruses. Such applications utilize Bluetooth Low Energy (BLE) transmissions, which are short range and can be used to determine infected and susceptible persons near an infected person. The COVID-19 risk estimation depends on an epidemic model for the virus behavior and Machine Learning (ML) model to classify the risk based on time series distance of the nodes that may be infected. The BLE technology enabled smartphones continuously transmit beacons and the distance is inferred from the received signal strength indicators (RSSI). The educational activities have shifted to online teaching modes due to the contagious nature of COVID-19. The government policy makers decide on education mode (online, hybrid, or physical) with little technological insight on actual risk estimates. In this study, we analyze BLE technology to debate the COVID-19 risks in university block and indoor class environments. We utilize a sigmoid based epidemic model with varying thresholds of distance to label contact data with high risk or low risk based on features such as contact duration. Further, we train multiple ML classifiers to classify a person into high risk or low risk based on labeled data of RSSI and distance. We analyze the accuracy of the ML classifiers in terms of F-score, receiver operating characteristic (ROC) curve, and confusion matrix. Lastly, we debate future research directions and limitations of this study. We complement the study with open source code so that it can be validated and further investigated.Entities:
Keywords: BLE; COVID-19; classification; epidemic model; machine learning
Mesh:
Year: 2021 PMID: 34640986 PMCID: PMC8513035 DOI: 10.3390/s21196667
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1BLE based contact tracing application operation.
Figure 2The research methodology for ML based risk prediction on BLE data.
Figure 3Percentage of high and low risk persons for varying thresholds: linear.
Figure 4Percentage of high and low risk persons on varying thresholds: sigmoid.
Figure 5Comparison of ML classifiers for in terms of ROC.
Comparison of ROC.
| Datasets | ML Model | ||||
|---|---|---|---|---|---|
| LR | SVM | LDA | DT | KNN | |
| Indoor | 0.62 | 0.61 | 0.62 | 0.58 | 0.61 |
| Outdoor | 0.91 | 0.87 | 0.90 | 0.82 | 0.91 |
Figure 6Comparison of ML classifiers in terms of F-score.
Comparison of Best F-scores.
| Datasets | ML Model (Classifier Threshold) | ||||
|---|---|---|---|---|---|
| LR | SVM | LDA | DT | KNN | |
| Indoor | 0.68 (0.3) | 0.69 (0.3) | 0.69 (0.4) | 0.57 (0.5) | 0.67 (0.1) |
| Outdoor | 0.86 (0.6) | 0.86 (0.8) | 0.85 (0.4) | 0.84 (0.1) | 0.88 (0.4) |
Figure 7Comparison of ML classifiers for indoor dataset (confusion matrix).
Figure 8Comparison of ML classifiers for outdoor dataset (confusion matrix).