| Literature DB >> 33739298 |
Benjamin Lam1, Michael Catt2, Sophie Cassidy3, Jaume Bacardit1, Philip Darke1, Sam Butterfield1, Ossama Alshabrawy4, Michael Trenell5, Paolo Missier1.
Abstract
BACKGROUND: Between 2013 and 2015, the UK Biobank collected accelerometer traces from 103,712 volunteers aged between 40 and 69 years using wrist-worn triaxial accelerometers for 1 week. This data set has been used in the past to verify that individuals with chronic diseases exhibit reduced activity levels compared with healthy populations. However, the data set is likely to be noisy, as the devices were allocated to participants without a set of inclusion criteria, and the traces reflect free-living conditions.Entities:
Keywords: accelerometry; digital biomarkers; digital phenotyping; digital technology; machine learning; mobile phone; physical activity; type 2 diabetes
Year: 2021 PMID: 33739298 PMCID: PMC8080299 DOI: 10.2196/23364
Source DB: PubMed Journal: JMIR Diabetes ISSN: 2371-4379
Figure 1Training set selection criteria for type 2 diabetes–negative and type 2 diabetes–positive individuals. EHR: electronic health record; QC: quality control.
Number of participants in each subpopulation according to activity impairment severity score.
| Impairment score | Total participants, N | Participants with adequate wear time, n (%) |
| Norm-0 | 11,019 | 8463 (76.80) |
| Norm-2 | 3355 | 1666 (49.66) |
Figure 2Hierarchy of physical activity representations.
Figure 3Feature matrix for physical activity bout representation space.
Sociodemographic, lifestyle, and anthropometric characteristics selected from the UK Biobank baseline assessment for comparison with high-level activity bout features space.
| Sociodemographic, lifestyle, and anthropometry characteristic | Description |
| Sex | Male or female (approximately 50:50 ratio) |
| Age at the assessment center | Recruits at baseline were aged between 40 and 69 years |
| Ethnic group | Predominantly White British, with some participants identifying as Black, Asian and Minority Ethnic groups |
| Alcohol drinking status | Participant reports if they were alcohol drinkers in the past, were currently drinking alcohol, or never had drunk alcohol |
| Smoking status | Participants report if they had smoked in the past, were currently smoking, or had never smoked |
| Body fat percentage | Percentage of fat in total body mass (a better indicator for obesity than BMI) |
| Waist circumference | Measurement taken around the abdomen at the level of the umbilicus (belly button) |
| Sleep duration | Self-reported average duration of sleep in a day |
| Time spent watching television | Self-reported average time spent watching television per day |
| Townsend index | Metric for material deprivation within a population |
| Duration of walking activity | Self-reported average duration of time spent walking in a day |
| Duration of vigorous activity | Self-reported average duration of time spent performing vigorous activities during the day |
| Duration of moderate activity | Self-reported average duration of time spent performing moderate activity during the day |
Figure 4Histogram for daily average percentage times spent asleep.
Figure 6Histogram for daily average number of sleep bouts.
Classification results measured using area under the receiver operating characteristic curve scores, showing the effect of choice of type 2 diabetes–negatives, Norm-0 (no physical activity impairment) versus Norm-2 (severe physical activity impairment). The values in the cells represent area under the receiver operating characteristic curve scores.
| Predictive model | High-level activity-bout features | Sociodemographic and lifestyle | High-level activity bout features+sociodemographic and lifestyle | |||
| Norm-0 | Norm-2 | Norm-0 | Norm-2 | Norm-0 | Norm-2 | |
| Random forest | 0.80 | 0.68 | 0.83 | 0.78 | 0.86 | 0.77 |
| Logistic regression | 0.79 | 0.70 | 0.83 | 0.78 | 0.86 | 0.78 |
| Extreme gradient boosting | 0.78 | 0.66 | 0.80 | 0.74 | 0.85 | 0.75 |
Classification results measured using F1, showing the effect of choice of type 2 diabetes-negatives, Norm-0 (no physical activity impairment) versus Norm-2 (severe physical activity impairment). The values in the cells represent F1 scores.
| Predictive model and T2Da status | High-level activity bout features | Sociodemographic and lifestyle | High-level activity bout features+sociodemographic and lifestyle | ||||
| Norm-0 | Norm-2 | Norm-0 | Norm-2 | Norm-0 | Norm-2 | ||
|
| |||||||
| T2D-positive | 0.65 | 0.70 | 0.65 | 0.77 | 0.73 | 0.77 | |
| T2D-negative | 0.78 | 0.54 | 0.78 | 0.63 | 0.81 | 0.63 | |
|
| |||||||
| T2D-positive | 0.66 | 0.72 | 0.69 | 0.77 | 0.74 | 0.77 | |
| T2D-negative | 0.77 | 0.54 | 0.79 | 0.65 | 0.82 | 0.65 | |
|
| |||||||
| T2D-positive | 0.66 | 0.68 | 0.67 | 0.74 | 0.73 | 0.76 | |
| T2D- negative | 0.77 | 0.52 | 0.76 | 0.62 | 0.80 | 0.63 | |
aT2D: type 2 diabetes.
Figure 7Receiver operating characteristic curve and area under the receiver operating characteristic curve for type 2 diabetes vs Norm-0: High-level activity bout features & sociodemographic and lifestyle features combined. AUC: area under the receiver operating characteristic curve; ROC: receiver operating characteristic curve; T2D: type 2 diabetes.
Figure 12Receiver operating characteristic curve and area under the receiver operating characteristic curve for type 2 diabetes vs Norm-2: Sociodemographic and lifestyle features only. AUC: area under the receiver operating characteristic curve; ROC: receiver operating characteristic curve; T2D: type 2 diabetes.