| Literature DB >> 36015747 |
Qiang Duan1, Shijing Hu2, Ruijun Deng2, Zhihui Lu2,3.
Abstract
Federated learning (FL) and split learning (SL) are two emerging collaborative learning methods that may greatly facilitate ubiquitous intelligence in the Internet of Things (IoT). Federated learning enables machine learning (ML) models locally trained using private data to be aggregated into a global model. Split learning allows different portions of an ML model to be collaboratively trained on different workers in a learning framework. Federated learning and split learning, each have unique advantages and respective limitations, may complement each other toward ubiquitous intelligence in IoT. Therefore, the combination of federated learning and split learning recently became an active research area attracting extensive interest. In this article, we review the latest developments in federated learning and split learning and present a survey on the state-of-the-art technologies for combining these two learning methods in an edge computing-based IoT environment. We also identify some open problems and discuss possible directions for future research in this area with the hope of arousing the research community's interest in this emerging field.Entities:
Keywords: edge computing; federated learning; internet of things; split learning; ubiquitous intelligence
Mesh:
Year: 2022 PMID: 36015747 PMCID: PMC9414384 DOI: 10.3390/s22165983
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1The framework architecture of federated learning.
A summary of the main challenges to federated learning in IoT.
| Challenges | Brief Descriptions |
|---|---|
| Statistical Heterogeneity | imbalanced and non-iid data distributions |
| System Heterogeneity | heterogeneous implementations with diverse system capabilities |
| Communication Overheads | data transmissions for exchanging models |
| Constrained Resources | resource-constrained IoT devices |
| System Scalability | a large number of IoT devices involved in FL |
| System Dynamism | time-varying resource availability in IoT |
| Privacy & Security | privacy/security vulnerabilities of FL in IoT |
The main challenges to FL in IoT and representative technical strategies for addressing them.
| FL Algorithms | Model Aggregation | Client Selection | Communication Control | Privacy/Security Protection | |
|---|---|---|---|---|---|
| Data Heterogeneity | + | + | + | ||
| System Heterogeneity | + | + | + | ||
| Communication Overheads | + | ||||
| Constrained Resources | + | + | |||
| System Scalability | + | + | |||
| System Dynamism | + | + | |||
| Privacy & Security | + |
Figure 2Framework architecture for multi-client split learning.
Figure 3Configurations for split learning frameworks: (a) basic configuration, (b) extended configuration, (c) U-shape configuration, (d) vertical configuration.
SL and FL communication overheads per client and entire system.
| Methods | Communication Cost per Client | Total Communication Cost |
|---|---|---|
| SL with synchronization |
|
|
| SL without synchronization |
|
|
| Federated learning |
|
|
Figure 4State diagram for controlling client-server communications in [57].
Figure 5Generalized SplitFed (SFLG) learning framework architecture.
Key Features of representative Hybrid SL-FL Frameworks.
| SL-FL Frameworks | Federated Parallel Training | Server Deployment |
|---|---|---|
| SplitFed [ | on both client and server | centralized on a single server |
| SplitFedv2 [ | only on client | centralized on a single server |
| SplitFedv3 [ | only on client (alternative mini-batch on server) | centralized on a single server |
| CPSL [ | parallel-first-then-sequential | centralized on a single server |
| HSFL [ | mixed sequential SL and FL | centralized on a single server |
| FedSL [ | on both client and server | distributed to multi-servers |
| SFLG [ | on both client and server | hierarchically distributed |
Figure 6Hybrid SL-FL framework with vertical configuration.
Comparison of information exposure in FL, SL, and hybrid SL-FL.
| Information Revealed | Raw Data | Model Parameters | Intermediate Representation |
|---|---|---|---|
| FL | No | Yes | No |
| SL | No | No | Yes |
| hybrid SL-FL | No | Yes | Yes |
Privacy attacks to split learning and representative protection schemes.
| Data Inference [ | Model Inversion [ | Label Leakage [ | |
|---|---|---|---|
| SplitGuard [ | + | ||
| correlation-based defense [ | + | + | |
| noise-based defense [ | + | + | |
| binarization-based defense [ | + | + | |
| client-based protection [ | + | + | |
| resistance transfer scheme [ | + | ||
| random perturbation technique [ | + | ||
| gradient perturbation technique [ | + | ||
| activations and labels mixing [ | + |