Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Distributed learning: Developing a predictive model based on data from multiple hospitals without data leaving the hospital

Literature DB >> 28029405

Distributed learning: Developing a predictive model based on data from multiple hospitals without data leaving the hospital - A real life proof of concept.

Arthur Jochems¹, Timo M Deist², Johan van Soest², Michael Eble³, Paul Bulens⁴, Philippe Coucke⁵, Wim Dries⁶, Philippe Lambin², Andre Dekker⁷.

Abstract

PURPOSE: One of the major hurdles in enabling personalized medicine is obtaining sufficient patient data to feed into predictive models. Combining data originating from multiple hospitals is difficult because of ethical, legal, political, and administrative barriers associated with data sharing. In order to avoid these issues, a distributed learning approach can be used. Distributed learning is defined as learning from data without the data leaving the hospital. PATIENTS AND METHODS: Clinical data from 287 lung cancer patients, treated with curative intent with chemoradiation (CRT) or radiotherapy (RT) alone were collected from and stored in 5 different medical institutes (123 patients at MAASTRO (Netherlands, Dutch), 24 at Jessa (Belgium, Dutch), 34 at Liege (Belgium, Dutch and French), 48 at Aachen (Germany, German) and 58 at Eindhoven (Netherlands, Dutch)). A Bayesian network model is adapted for distributed learning (watch the animation: http://youtu.be/nQpqMIuHyOk). The model predicts dyspnea, which is a common side effect after radiotherapy treatment of lung cancer.
RESULTS: We show that it is possible to use the distributed learning approach to train a Bayesian network model on patient data originating from multiple hospitals without these data leaving the individual hospital. The AUC of the model is 0.61 (95%CI, 0.51-0.70) on a 5-fold cross-validation and ranges from 0.59 to 0.71 on external validation sets.
CONCLUSION: Distributed learning can allow the learning of predictive models on data originating from multiple hospitals while avoiding many of the data sharing barriers. Furthermore, the distributed learning approach can be used to extract and employ knowledge from routine patient data from multiple hospitals while being compliant to the various national and European privacy laws.

Entities: Disease Species

Keywords: Bayesian networks; Distributed learning; Dyspnea; Machine learning; Privacy preserving data-mining

Mesh：

Year: 2016 PMID： 28029405 DOI： 10.1016/j.radonc.2016.10.002

Source DB: PubMed Journal: Radiother Oncol ISSN： 0167-8140 Impact factor: 6.280

Keyword Cloud
Cited

30 in total

1. AI-based applications in hybrid imaging: how to build smart and truly multi-parametric decision models for radiomics.

Authors: Isabella Castiglioni; Francesca Gallivanone; Paolo Soda; Michele Avanzo; Joseph Stancanello; Marco Aiello; Matteo Interlenghi; Marco Salvatore
Journal: Eur J Nucl Med Mol Imaging Date: 2019-07-11 Impact factor: 9.236

2. The effect of imputing missing clinical attribute values on training lung cancer survival prediction model performance.

Authors: Mohamed S Barakat; Matthew Field; Aditya Ghose; David Stirling; Lois Holloway; Shalini Vinod; Andre Dekker; David Thwaites
Journal: Health Inf Sci Syst Date: 2017-12-06

3. Radiation Therapy Outcomes Models in the Era of Radiomics and Radiogenomics: Uncertainties and Validation.

Authors: Issam El Naqa; Gaurav Pandey; Hugo Aerts; Jen-Tzung Chien; Christian Nicolaj Andreassen; Andrzej Niemierko; Randall K Ten Haken
Journal: Int J Radiat Oncol Biol Phys Date: 2018-10-18 Impact factor: 7.038

Review 4. Genomics, bio specimens, and other biological data: Current status and future directions.

Authors: Barry S Rosenstein; Arvind Rao; Jean M Moran; Daniel E Spratt; Marc S Mendonca; Bissan Al-Lazikani; Charles S Mayo; Corey Speers
Journal: Med Phys Date: 2018-09-18 Impact factor: 4.071

5. Developing and Validating a Survival Prediction Model for NSCLC Patients Through Distributed Learning Across 3 Countries.

Authors: Arthur Jochems; Timo M Deist; Issam El Naqa; Marc Kessler; Chuck Mayo; Jackson Reeves; Shruti Jolly; Martha Matuszak; Randall Ten Haken; Johan van Soest; Cary Oberije; Corinne Faivre-Finn; Gareth Price; Dirk de Ruysscher; Philippe Lambin; Andre Dekker
Journal: Int J Radiat Oncol Biol Phys Date: 2017-04-24 Impact factor: 7.038

6. Data collection of patient outcomes: one institution's experience.

Authors: Thomas J Whitaker; Charles S Mayo; Daniel J Ma; Michael G Haddock; Robert C Miller; Kimberly S Corbin; Michelle Neben-Wittich; James L Leenstra; Nadia N Laack; Mirek Fatyga; Steven E Schild; Carlos E Vargas; Katherine S Tzou; Austin R Hadley; Steven J Buskirk; Robert L Foote
Journal: J Radiat Res Date: 2018-03-01 Impact factor: 2.724

7. Infrastructure and distributed learning methodology for privacy-preserving multi-centric rapid learning health care: euroCAT.

Authors: Timo M Deist; A Jochems; Johan van Soest; Georgi Nalbantov; Cary Oberije; Seán Walsh; Michael Eble; Paul Bulens; Philippe Coucke; Wim Dries; Andre Dekker; Philippe Lambin
Journal: Clin Transl Radiat Oncol Date: 2017-05-19

Review 8. Requirements and reliability of AI in the medical context.

Authors: Yoganand Balagurunathan; Ross Mitchell; Issam El Naqa
Journal: Phys Med Date: 2021-03-13 Impact factor: 2.685

Review 9. Machine learning applications in radiation oncology.

Authors: Matthew Field; Nicholas Hardcastle; Michael Jameson; Noel Aherne; Lois Holloway
Journal: Phys Imaging Radiat Oncol Date: 2021-06-24

10. A situational awareness Bayesian network approach for accurate and credible personalized adaptive radiotherapy outcomes prediction in lung cancer patients.

Authors: Yi Luo; Shruti Jolly; David Palma; Theodore S Lawrence; Huan-Hsin Tseng; Gilmer Valdes; Daniel McShan; Randall K Ten Haken; Issam Ei Naqa
Journal: Phys Med Date: 2021-06-04 Impact factor: 3.119