| Literature DB >> 36160367 |
Attia Qammar1, Ahmad Karim2, Huansheng Ning1, Jianguo Ding3.
Abstract
Federated learning (FL) is a promising framework for distributed machine learning that trains models without sharing local data while protecting privacy. FL exploits the concept of collaborative learning and builds privacy-preserving models. Nevertheless, the integral features of FL are fraught with problems, such as the disclosure of private information, the unreliability of uploading model parameters to the server, the communication cost, etc. Blockchain, as a decentralized technology, is able to improve the performance of FL without requiring a centralized server and also solves the above problems. In this paper, a systematic literature review on the integration of Blockchain in federated learning was considered with the analysis of the existing FL problems that can be compensated. Through carefully screening, most relevant studies are included and research questions cover the potential security and privacy attacks in traditional federated learning that can be solved by blockchain as well as the characteristics of Blockchain-based FL. In addition, the latest Blockchain-based approaches to federated learning have been studied in-depth in terms of security and privacy, records and rewards, and verification and accountability. Furthermore, open issues related to the combination of Blockchain and FL are discussed. Finally, future research directions for the robust development of Blockchain-based FL systems are proposed.Entities:
Keywords: Blockchain; Blockchain-based FL; Federated learning; Privacy; Security; Systematic literature review
Year: 2022 PMID: 36160367 PMCID: PMC9483378 DOI: 10.1007/s10462-022-10271-9
Source DB: PubMed Journal: Artif Intell Rev ISSN: 0269-2821 Impact factor: 9.588
Fig. 1PRISMA flow diagram of the systematic review phases.
Adapted from (Moher et al. 2009)
Fig. 2Publications in heterogeneous databases
Inclusion and exclusion criteria with justification
| Criteria | Justification | |
|---|---|---|
| Inclusion | Studies published online in years 2016 to June 2022. | The fundamental research on this topic has been revealed in the papers published in recent year |
| Studies based on the integration of blockchain and federated learning | Have promising research status in academia and industry | |
| Papers that address the mechanism of blockchain-based federated learning in context of (1) security and privacy, (2) record and reward and, (3) verification and accountability approaches as it leads to a secure FL system | Have an auspicious research status in academia and industry | |
| Exclusion | Papers that were not written in English | No ability to examine non-English language papers |
| Duplicate material from a similar study | Novel research papers were considered and repetitive information was removed | |
| Short research papers of less than 4 pages | These studies did not provide much knowledge, therefore excluded from our research |
Fig. 3Federated learning architecture
Fig. 4Attacks to federated learning architecture
Blockchain-based federated learning characteristics over conventional FL
| Characteristics | Federated learning | Blockchain-based federated learning |
|---|---|---|
| Decentralization | Traditional FL systems have centralized servers that can be compromised by a malicious user and insecure | It has multiple decentralized servers that can store model updates in an irritability resistance nature and hard against a single point of failure attack |
| Traceability | FL does not record the history of model updates, it only stores the latest model. So accountability and audit of participants are impossible | Blockchain-based FL keeps the history of all blocks linked into a chain. The participants cannot deny the authorship of model updates. |
| Immutability | It is highly possible to temper historical model training updates by the malicious server which makes it difficult to detect | Tempering of records in blockchain-based federated learning is detectable and blocked by the server. Each block contains a unique hash value to make it permanent and unalterable |
| Incentives | The quality of local model updates are directly proportional to the global model accuracy, FL system does not have a reward mechanism to encourage participants to take part in the model training process | Participants are attracted through rewards or incentives mechanisms, in that way they contributed with quality data model updates, resulting in an accurate global model |
| Integrity and reliability | In federated learning, the model training process is coordinated by a single central server. The data could be corrupted by the malicious participant or a server | All blocks are connected cryptographically, in case of data alteration they can be detected easily. Blockchain proves as an inherently secure and reliable technology |
| Trust | A federated learning system does not provide any consensus algorithm or design an agreement for model training | Blockchain-based federated learning makes use of a consensus algorithm to establish trust between parties. The participants who agree to the contract are allowed to participate in training rounds |
Fig. 5Blockchain-based federated learning architecture
Fig. 6One-epoch workflow of blockchain-based federated learning system
Comparative analysis of blockchain deployment frameworks
| Blockchain framework | Category | Consensus algorithm | Smart contract language | Hosted by | Cryptocurrency | Level of support for FL | Related studies |
|---|---|---|---|---|---|---|---|
| Ethereum | Public | PoW | Solidity | Ethereum developers | Ether (ETH) and Bitcoin (BTC) | High | Buterin ( |
| Hyperledger fabric | Private | PBFT | GoLang, Java | Linux Foundation | None | High | Androulaki et al. ( |
| EOS.IO | Public and consortium | DPoS | C, C++ | Block.One | EOS | Moderate | Huang et al. ( |
| FISCO BCOS | Consortium | PBFT, Raft | Solidity, C++ | Webank | None | Moderate | Li et al. ( |
| Corda | Consortium | PBFT, Raft | Kotlin, Java | R3 Consortium | None | Moderate | Brown ( |
Fig. 7State-of-the-art: blockchain-based federated learning approaches
Blockchain-based federated learning security and privacy approaches
| Approaches | Major contribution | Blockchain type | Block structure | Block storage | Consensus algorithm | Blockchain tool |
|---|---|---|---|---|---|---|
| BytoChain (Li et al. | Byzantine resistant consensus Proof of Accuracy (PoA) Detected the random and reverse poisoning, overfitting poisoning, DoS, and free-riding attacks | Private | Merkle Tree | – | PoA | – |
| Chainsfl (Yuan et al. | Raft and DAG-based blockchain consensus algorithm Synchronous and asynchronous learning combined to dismiss the drag down of stragglers | Private | Merkle Tree | Off chain | Raft and DAG | Hyperlegdger fabric |
| BLADE-FL (Ma | Prevented from the Single point of failure (SPoF) attack Misbehaved and lazy participants are recognized | Public | – | – | PoW | – |
| BFEL (Kang et al. | Proof of Verifying (PoV) consensus algorithm to filter out poisoning updates A gradient compression scheme with PoV | Public and Consortium | Merkle Tree | – | PoV, DPoS, and PBFT | EOS.IO |
| (Short et al. | Based on the accuracy improvement, model updates are evaluated Traceability function of blockchain for the detection of malicious users | Private | – | – | – | Hyperlegdger fabric |
| BFLC (Li et al. | Committee consensus algorithm to reduce model poisoning attacks Storage optimization, scalability of BFLC, and incentives | Consortium | – | On-chain | Committee | FISCO |
| (Kumar et al. | Differential privacy (DP) and homomorphic encryption (HE) to improve the security in FL Incentive scheme | Public | – | Off-chain | – | Ethereum |
| Biscotti (Shayan et al. | Prevent Sybil and poisoning attacks using VRF and PoF, and multi-krum, respectively Implemented the secret sharing scheme for secure model aggregation | Private | Merkle Tree | Off-chain | PoF | Hyperlegdger fabric |
| Fed-BC (Wu et al. | Fully decentralized system avoids SPoF attack and privacy leakage | Private | – | Off-chain | – | Hyperlegdger fabric |
Blockchain-based federated learning record and reward approache
| Approaches | Major contribution | Blockchain type | Block structure | Block storage | Consensus algorithm | Blockchain tool |
|---|---|---|---|---|---|---|
| FedCoin (Liu et al. | PoSap consensus protocol for fair payment distribution between clients Record of all payments | Public | Merkle Tree | – | PoSap | – |
| (Martinez et al. | Class-Sampled Validation-Error Scheme (CSVES) for rewarding and validating the model updates Record model training updates | Private | – | Off-chain | – | EOS |
| (Kang et al. | Reputation metric to measure the fairness of model updates Workers reputation is calculated and managed Encouraged the high reputation workers with effective incentives | Consortium | – | On-chain | PBFT | Corda V4.0 |
| (Behera et al. | Record contributions of clients through smart contract and then rewarded A decentralized communication scheme for FL | Consortium blockchain setup | Merkle Tree | Off-chain | – | Ethereum |
| FL-MAB (Batool et al. | Measured the relative contribution of every client by Shapley value, and allocate rewards accordingly | Public | – | Off-chain | – | Ethereum |
Blockchain-based federated learning verification and accountable approaches
| Approaches | Major contribution | Blockchain type | Block structure | Block storage | Consensus algorithm | Blockchain tool |
|---|---|---|---|---|---|---|
| Vfchain (Peng et al. | A VFChain to verify and audit the updates Aggregated models and proofs recorded by committee selection | Private | Dual Skip Chain | – | – | Hyperlegdger fabric |
| BC-based PPFL (Awan et al. | An accountable method to record local and global model updates Tracking of data flows in FL system provides the trust and verification | Private | – | Off-chain | PoW, PoS | Hyperlegdger fabric |
| BlockFLA (Desai et al. | Through accountability protects against adversarial attacks Discouraged the backdoor attacks and applied the transparency | Hybrid | – | Off-chain | PoW, PBFT | Hyperlegdger fabric, Ethereum |
| (Lo et al. | A trustworthy system to enable accountability in FL. For auditing purposes track the local model and global model. To improve the fairness of data and models a weighted fair training was introduced | Parity consortium blockchain | – | Off-chain | Proof-of-Authority (PoA) | Galaxy FL framework (Ethereum) |
| Blockflow (Vaikkunth Mugunthan | A unique accountability mechanism for model contribution Resultant auditing scores reflect the quality of the honest and malicious clients | Public | – | Off-chain | – | Ethereum |