| Literature DB >> 31533735 |
Bin Ye1,2, Kangping Liu3, Siting Cao2, Padmaja Sankaridurg4,5, Wayne Li4, Mengli Luan2, Bo Zhang2, Jianfeng Zhu2, Haidong Zou1,2, Xun Xu1,2, Xiangui He6,7,8.
Abstract
BACKGROUND: Wearable smart watches provide large amount of real-time data on the environmental state of the users and are useful to determine risk factors for onset and progression of myopia. We aim to evaluate the efficacy of machine learning algorithm in differentiating indoor and outdoor locations as collected by use of smart watches.Entities:
Keywords: Machine learning algorithm; Myopia intervention; Outdoor time; Smart watch
Year: 2019 PMID: 31533735 PMCID: PMC6751881 DOI: 10.1186/s12967-019-2057-2
Source DB: PubMed Journal: J Transl Med ISSN: 1479-5876 Impact factor: 5.531
Fig. 1Flowchart of study design. In step 1, two datasets were collected. In step 2, each dataset was split into a training set to build a predicting model and a testing set to test the model. In step 3, two models were built and used to predict 2 testing groups
Common classification type deep learning algorithms to determine positional accuracy
| Machine learning algorithms | Accuracy % (N) |
|---|---|
| Gaussian process | 78.4% (17,949/22,886) |
| Ensemble | 79.7% (18,242/22,886) |
| Neural network | 80.2% (18,361/22,886) |
| Discriminant analysis | 83.8% (19,183/22,886) |
| Naive Bayes | 87.4% (20,006/22,886) |
| Random forest | 90.9% (20,805/22,886) |
| SVM | 97.1% (22,229/22,886) |
| Total | 85.4% (136,775/160,202) |
All of the pairwise comparisons between these seven methods show significantly different (p < 0.001), except that between accuracy of neural network algorithm and average accuracy of these algorithms (p = 0.165)
Fig. 2The luminance and UV of indoor and outdoor in dataset A and dataset B
Fig. 3a The ROC curves of SVM (model A) and univariate threshold segmentation method for identifying indoor/outdoor locations of Testing group A. b The ROC curves of SVM (model A) and univariate threshold segmentation method for identifying indoor/outdoor locations of Testing group B. c The ROC curves of SVM (model B) and univariate threshold segmentation method for identifying indoor/outdoor locations of Testing group B
Location of the testing group A predicted by Model A, the dataset B predicted by Model A and the testing group B predicted by Model B
| Model | Data sets | Real location | Predicted | Total | |
|---|---|---|---|---|---|
| Outdoor | Indoor | ||||
| A | A | Outdoor | 15,502 | 59a | 15,561 |
| Indoor | 43 | 7282 | 7325 | ||
| Total | 15,545 | 7341 | 22,886 | ||
| A | B | Outdoor | 9799 | 3788b | 13,587 |
| Indoor | 291 | 9661 | 9952 | ||
| Total | 10,090 | 13,449 | 23,539 | ||
| B | B | Outdoor | 4386 | 495c | 4881 |
| Indoor | 39 | 2142 | 2181 | ||
| Total | 4425 | 2637 | 7062 | ||
a59 outdoor locations were mistaken as indoors, and 43 indoor locations were mistaken as outdoors Kappa = 0.990, p < 0.001
b3788 outdoor locations were mistaken as indoors, and 291 indoor locations were mistaken as outdoors. Kappa = 0.692, p < 0.001
c495 outdoor locations were mistaken as indoors, and 39 indoor locations were mistaken as outdoors. Kappa = 0.821, p < 0.001
Machine learning algorithm compared with other published methods
| Author | Method | Cut point | AUC | Sensitivity | Specificity | Youden Index |
|---|---|---|---|---|---|---|
| Tandon [ | Luminance segmentation method | 110 lx | 0.82 | 74% | 86% | 0.60 |
| Tandon [ | GPS segmentation method | 250 SNRa | 0.89 | 82% | 88% | 0.70 |
| Flynn [ | Luminance segmentation method | 240 lx | 0.96 | 92% | 90% | 0.82 |
| Dharani [ | Luminance segmentation method | 1000 lx | – | – | – | – |
| In our study | SVM Machine learning algorithm | – | 0.99 | 99% | 99% | 0.99 |
aSignal-to-noise ratio (SNR); Area under the ROC curve (AUC)