| Literature DB >> 33783362 |
Jessica Chia Liu1, Jack Goetz1, Srijan Sen2,3, Ambuj Tewari1.
Abstract
BACKGROUND: The use of wearables facilitates data collection at a previously unobtainable scale, enabling the construction of complex predictive models with the potential to improve health. However, the highly personal nature of these data requires strong privacy protection against data breaches and the use of data in a way that users do not intend. One method to protect user privacy while taking advantage of sharing data across users is federated learning, a technique that allows a machine learning model to be trained using data from all users while only storing a user's data on that user's device. By keeping data on users' devices, federated learning protects users' private data from data leaks and breaches on the researcher's central server and provides users with more control over how and when their data are used. However, there are few rigorous studies on the effectiveness of federated learning in the mobile health (mHealth) domain.Entities:
Keywords: data protection; machine learning; mobile health; privacy; wearable electronic devices
Year: 2021 PMID: 33783362 PMCID: PMC8044739 DOI: 10.2196/23728
Source DB: PubMed Journal: JMIR Mhealth Uhealth ISSN: 2291-5222 Impact factor: 4.773
Figure 1Diagram comparing central and federated learning workflows. Color abstractly represents the private information content at different locations, with red, blue, and green colors representing private information for different user devices. When training on a central server, user data are uploaded onto the server once. In federated learning, model parameters are updated on the user device, producing updates that contain less private information than the data themselves. The updates from many users are then aggregated, further mixing the contribution from each individual user.
Figure 2Pseudocode for federated learning algorithm 1.
Demographic characteristics of the participants in the study (N=15).
| Characteristic | Value, mean (SD) |
| Age (years) | 27.5 (2.4) |
| Height (cm) | 177.6 (6.7) |
| Weight (kg) | 73.1 (10.3) |
Summary statistics of a subset of features.
| Feature | Participants, first quartile | Participants, median | Participants, mean (SD) | Participants, third quartile |
| Mean EDAa | 2.0 | 3.7 | 4.6 (3.4) | 6.3 |
| Mean ECGb | 7.2E−04 | 1.1E−03 | 1.1E−03 (7.8E−04) | 1.5E−03 |
| Mean EMGc | −3.5E−03 | −3.0E−03 | −3.0E−03 (9.2E−04) | −2.5E−03 |
| Mean respiration | −0.02 | 0.05 | 5.4E−02 (2.0E−01) | 0.13 |
| Mean temperature | 34 | 34 | 34 (1.3) | 35 |
| Mean ACC_Xd | 0.73 | 0.86 | 8.0E−01 (1.3E−01) | 0.90 |
| Mean ACC_Ye | −0.06 | −0.02 | −3.1E−02 (1.0E−01) | 0.02 |
| Mean ACC_Zf | −0.54 | −0.31 | −3.5E−01 (2.6E−01) | −0.17 |
aEDA: electrodermal activity.
bECG: electrocardiogram.
cEMG: electromyogram.
dACC_X: 3-axis acceleration (x-axis).
eACC_Y: 3-axis acceleration (y-axis).
fACC_Z: 3-axis acceleration (z-axis).
Figure 3Comparison between the architecture of nonpersonalized and personalized versions of our models. The green blocks represent trained model parameters, and the blue blocks represent the input covariates. Note the user ID is removed from the covariates once it is used to attach the correct parameters, which represent that user’s embedding.
Median and mean accuracy of each model over 15 tests.
| Model | Accuracy, median | Accuracy, mean (SD) |
| Personalized server | 0.929 | 0.932 (0.019) |
| Server | 0.897 | 0.888 (0.028) |
| Personalized federated | 0.929 | 0.928 (0.018) |
| Federated | 0.853 | 0.859 (0.021) |
| Individual | 0.899 | 0.902 (0.021) |