| Literature DB >> 33847779 |
Margarita Kirienko1,2, Martina Sollini3,4, Gaia Ninatti2, Daniele Loiacono5, Edoardo Giacomello5, Noemi Gozzi6, Francesco Amigoni5, Luca Mainardi5, Pier Luca Lanzi5, Arturo Chiti2,6.
Abstract
PURPOSE: The present scoping review aims to assess the non-inferiority of distributed learning over centrally and locally trained machine learning (ML) models in medical applications.Entities:
Keywords: Clinical trial; Distributed learning; Ethics; Federated learning; Machine learning; Privacy
Mesh:
Year: 2021 PMID: 33847779 PMCID: PMC8041944 DOI: 10.1007/s00259-021-05339-7
Source DB: PubMed Journal: Eur J Nucl Med Mol Imaging ISSN: 1619-7070 Impact factor: 9.236
Fig. 1Distributed learning framework
Fig. 2Schematic overview of the most popular distributed learning methods. Differences among methods are summarized according to two general design principles: (i) how the model parameters are displaced over the network of nodes, local, local + shared, and shared (y-axis), and (ii) how nodes interact and which type of data they exchange, no, partial, and complete exchange (x-axis)
Fig. 3Distributed learning methods
Fig. 4Study selection
Fig. 5Summary of the topics covered by selected studies. (ADR, adverse drug reaction)
Studies using distributed learning models for risk prediction
| Reference | Data, type | Aim | Distributed network | Machine learning model | Distributed learning method | Distributed vs centralized learning | Distributed vs localized learning |
|---|---|---|---|---|---|---|---|
| [ | Clinical data | Prediction of risk of hospitalisations for cardiac events | Simulated distribution among 5 and 10 nodes | SVM | Federated learning | Federated learning performs close to centralized learning | NA |
| [ | Clinical data | Prediction of foetal loss risk | Simulated distribution among 10 nodes | Logistic regression | Federated learning | Federated learning performs close to centralized learning | Federated learning outperforms localized learning models |
| [ | Genomic data | Prediction of risk of developing ankylosing spondylitis | Real distribution among 3 nodes | PCA | Federated learning | Federated learning performs close to centralized learning | NA |
| [ | Clinical data | Prediction of readmission for heart failure within 30 days | Simulated distribution among 50 and 200 nodes | Cox proportional hazards model | Ensembling | Ensembling performs close to centralized learning | NA |
NA, not assessed; SVM, support vector machine
Studies using distributed learning models for diagnosis
| Reference | Data, type | Aim | Distributed network | Machine learning model | Distributed learning method | Distributed vs. centralized learning | Distributed vs. localized learning |
|---|---|---|---|---|---|---|---|
| [ | Images (digital FNA images) | Detection of breast cancer | Simulated distribution among many different numbers of nodes | ANN, SVM, RF | Ensembling | Ensembling performs close to centralized learning | Ensembling outperforms localized learning models |
| [ | Clinical data | Detection of MCI | Real distribution among 43 nodes | ANN, SVM, RF | Ensembling | Centralized learning outperforms ensembling | Ensembling outperforms localized learning models |
| [ | Clinical data | Detection of diabetes | Simulated distribution among many different numbers of nodes | ANN, SVM, RF | Ensembling | Ensembling performs close to centralized learning | Ensembling outperforms localized learning models |
| [ | Clinical data | Detection of heart disease | Simulated distribution among many different numbers of nodes | ANN, SVM, RF | Ensembling | Ensembling performs close to centralized learning | Ensembling outperforms localized learning models |
| [ | Image-derived data (radiomic features from head and neck CT scans) | Prediction of HPV status in head and neck cancer | Real distribution among 6 nodes | Logistic regression | Federated learning | Federated learning performs close to Centralized Learning | NA |
| [ | (Retinal fundus images) | Diabetic retinopathy detection | Simulated distribution among 4 nodes | CNN | Federated learning + ensembling | Federated learning performs close to centralized Learning | NA |
| [ | Images (mammograms) | Breast cancer detection | Simulated distribution among 4 nodes | CNN | Federated learning + ensembling | Federated learning performs close to centralized learning | NA |
| [ | Clinical data (ICU data) | Disease detection | Distribution among 2 nodes based on the data provisioning system | Autoencoder | Federated learning | NA | Federated learning outperforms localized learning models |
| [ | Clinical data (ICU data) | Disease detection | Simulated distribution among 2 and 3 nodes | Hashing | Federated learning | Federated learning performs close to centralized learning | Federated learning outperforms localized learning models |
| [ | Images (brain MRI) | Brain tumour segmentation | Simulated distribution among 4, 8,16, and 32 nodes | CNN | Federated learning, CIIL, IIL | Federated learning performs close to centralized learning | NA |
| [ | Images (brain MRI) | Classification of neurodegenerative diseases | Real distribution among 4 nodes | PCA | Federated learning | Federated learning performs close to centralized learning | NA |
| [ | Images (head CT scans) | Haemorrhage segmentation | Real distribution among 4 nodes (only 2 nodes used for training) | CNN | Federated learning | NA | Federated learning outperforms localized learning models |
| [ | Images (head CT scans) | Haemorrhage segmentation | Real distribution among 2 nodes | CNN | Federated learning | NA | Federated learning outperforms localized learning models |
| [ | Images (retinal fundus images) | Diabetic retinopathy detection | Simulated distribution among 4 nodes | CNN | Federated learning | Federated learning performs close to centralized learning | NA |
| [ | Images (chest X-rays) | Thoracic disease classification | Simulated distribution among 4 nodes | CNN | Federated learning | Federated learning performs close to centralized learning | NA |
| [ | Images (brain fMRI) | Autism spectrum disorder detection | Real distribution among 4 nodes | ANN | Federated learning + ensembling | NA | Federated learning outperforms localized learning models |
| [ | Images (chest CT scans) | Pneumonia classification | Real distribution among 4 nodes | CNN | Federated learning | Federated learning performs close to centralized learning (except on viral pneumonia other than COVID-19) | Federated learning outperforms localized learning models |
| [ | Images (digital FNA images) | Breast lesion classification | Simulated distribution among 5 nodes | Semi-supervised ANN + ELM | Federated learning | Centralized learning outperforms federated learning | Federated learning outperforms localized learning models |
| [ | Images (brain MRI) | Schizophrenia detection | Real distribution among 4 nodes | SVM | Ensembling | Ensembling performs close to centralized learning | Ensembling outperforms localized learning models |
| [ | Images (digital FNA images) | Breast lesion classification | Simulated distribution among 10 nodes | ANN | Federated learning | Federated learning performs close to centralized learning | NA |
ANN, artificial neural network; CIIL, cyclic institutional incremental learning; CNN, convolutional neural network; CT, computed tomography; ELM, extreme learning machine; FNA, fine-needle aspiration; HPV, human papilloma virus; ICU, intensive care unit; IIL, institutional incremental learning; MCI, mild cognitive impairment; MRI, magnetic resonance imaging; NA, not assessed; PCA, principal component analysis; RF, random forest; SVM, support vector machine
Studies using distributed learning models for prognosis
| Reference | Data, type | Aim | Distributed network | Machine learning model | Distributed learning method | Distributed vs. centralized learning | Distributed vs. localized learning |
|---|---|---|---|---|---|---|---|
| [ | Clinical data | Post-treatment 2-year survival prediction | Real distribution among 3 nodes | SVM with ADMM | Federated learning | Federated learning performs close to centralized learning | NA |
| [ | Clinical data | Post-treatment 2-year survival prediction | Real distribution among 8 nodes | Bayesian network | Federated learning | NA | Localized learning models sometimes outperform federated learning |
| [ | Image-derived data (radiomic features from head and neck CT scans) | Post-treatment 2-year survival prediction | Real distribution among 6 nodes | Logistic regression | Federated learning | Federated learning performs close to centralized learning | NA |
| [ | Clinical data | Prediction of hospital LoS | Simulated distribution among 3, 6, and 12 nodes | Linear regression | Federated learning | Federated learning performs close to centralized learning | NA |
| [ | Clinical data | Prediction of mortality | Real distribution among 50 nodes | Autoencoder, K-means, and nearest neighbour | Federated learning + ensembling | Federated learning performs close to centralized learning | NA |
| [ | Clinical data | Prediction of ICU LoS | Real distribution among 50 nodes | Autoencoder, K-means, and nearest neighbour | Federated learning + ensembling | Federated learning performs close to centralized learning | NA |
ADMM, alternative direction method of multipliers; CT, computed tomography; ICU, intensive care unit; LoS, length of stay; NA, not assessed; SVM, support vector machine
Studies using distributed learning models for adverse drug reaction/side effect prediction
| Reference | Data, type | Aim | Distributed network | Machine learning model | Distributed learning method | Distributed vs. centralized learning | Distributed vs. localized learning |
|---|---|---|---|---|---|---|---|
| [ | Clinical data | Prediction of post-radiotherapy dyspnoea | Real distribution among 5 nodes | Bayesian network | Federated learning | Federated learning performs close to centralized learning | Federated learning performs close to localized learning |
| [ | Clinical data | Prediction of post-radiotherapy dyspnoea | Real distribution among 5 nodes | SVM with ADMM | Federated learning | Federated learning performs close to centralized learning | NA |
| [ | Clinical data | Prediction of opioid chronic use | Simulated distribution among 10 nodes | SVM, single-layer perceptron, logistic regression (all of them trained with SGD) | Federated learning | Federated learning performs close to centralized learning | Federated learning outperforms localized learning models |
| [ | Clinical data | Prediction of antipsychotics side effects | Simulated distribution among 10 nodes | SVM, single-layer perceptron, logistic regression (all of them trained with SGD) | Federated learning | Federated learning performs close to centralized learning | Federated learning outperforms localized learning models |
ADMM, alternative direction method of multipliers; NA, not assessed; SGD, stochastic gradient descent; SVM, support vector machine