Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 The reliability of a deep learning model in clinical out-of-distribution MRI data: A multicohort study.

Literature DB >> 33007638

The reliability of a deep learning model in clinical out-of-distribution MRI data: A multicohort study.

Gustav Mårtensson¹, Daniel Ferreira², Tobias Granberg³, Lena Cavallin³, Ketil Oppedal⁴, Alessandro Padovani⁵, Irena Rektorova⁶, Laura Bonanni⁷, Matteo Pardini⁸, Milica G Kramberger⁹, John-Paul Taylor¹⁰, Jakub Hort¹¹, Jón Snædal¹², Jaime Kulisevsky¹³, Frederic Blanc¹⁴, Angelo Antonini¹⁵, Patrizia Mecocci¹⁶, Bruno Vellas¹⁷, Magda Tsolaki¹⁸, Iwona Kłoszewska¹⁹, Hilkka Soininen²⁰, Simon Lovestone²¹, Andrew Simmons²², Dag Aarsland²³, Eric Westman²⁴.

Abstract

Deep learning (DL) methods have in recent years yielded impressive results in medical imaging, with the potential to function as clinical aid to radiologists. However, DL models in medical imaging are often trained on public research cohorts with images acquired with a single scanner or with strict protocol harmonization, which is not representative of a clinical setting. The aim of this study was to investigate how well a DL model performs in unseen clinical datasets-collected with different scanners, protocols and disease populations-and whether more heterogeneous training data improves generalization. In total, 3117 MRI scans of brains from multiple dementia research cohorts and memory clinics, that had been visually rated by a neuroradiologist according to Scheltens' scale of medial temporal atrophy (MTA), were included in this study. By training multiple versions of a convolutional neural network on different subsets of this data to predict MTA ratings, we assessed the impact of including images from a wider distribution during training had on performance in external memory clinic data. Our results showed that our model generalized well to datasets acquired with similar protocols as the training data, but substantially worse in clinical cohorts with visibly different tissue contrasts in the images. This implies that future DL studies investigating performance in out-of-distribution (OOD) MRI data need to assess multiple external cohorts for reliable results. Further, by including data from a wider range of scanners and protocols the performance improved in OOD data, which suggests that more heterogeneous training data makes the model generalize better. To conclude, this is the most comprehensive study to date investigating the domain shift in deep learning on MRI data, and we advocate rigorous evaluation of DL models on clinical data prior to being certified for deployment.

Entities: Disease

Keywords: Clinical application; Deep learning; Domain shift; Neuroimaging

Mesh：

Year: 2020 PMID： 33007638 DOI： 10.1016/j.media.2020.101714

Source DB: PubMed Journal: Med Image Anal ISSN： 1361-8415 Impact factor: 8.545

Keyword Cloud
Cited

12 in total

1. Multi-Institutional Validation of a Mammography-Based Breast Cancer Risk Model.

Authors: Adam Yala; Peter G Mikhael; Fredrik Strand; Gigin Lin; Siddharth Satuluru; Thomas Kim; Imon Banerjee; Judy Gichoya; Hari Trivedi; Constance D Lehman; Kevin Hughes; David J Sheedy; Lisa M Matthis; Bipin Karunakaran; Karen E Hegarty; Silvia Sabino; Thiago B Silva; Maria C Evangelista; Renato F Caron; Bruno Souza; Edmundo C Mauad; Tal Patalon; Sharon Handelman-Gotlib; Michal Guindy; Regina Barzilay
Journal: J Clin Oncol Date: 2021-11-12 Impact factor: 50.717

2. Data harmonisation for information fusion in digital healthcare: A state-of-the-art systematic review, meta-analysis and future research directions.

Authors: Yang Nan; Javier Del Ser; Simon Walsh; Carola Schönlieb; Michael Roberts; Ian Selby; Kit Howard; John Owen; Jon Neville; Julien Guiot; Benoit Ernst; Ana Pastor; Angel Alberich-Bayarri; Marion I Menzel; Sean Walsh; Wim Vos; Nina Flerin; Jean-Paul Charbonnier; Eva van Rikxoort; Avishek Chatterjee; Henry Woodruff; Philippe Lambin; Leonor Cerdá-Alberich; Luis Martí-Bonmatí; Francisco Herrera; Guang Yang
Journal: Inf Fusion Date: 2022-06 Impact factor: 17.564

Review 3. Plant Genotype to Phenotype Prediction Using Machine Learning.

Authors: Monica F Danilevicz; Mitchell Gill; Robyn Anderson; Jacqueline Batley; Mohammed Bennamoun; Philipp E Bayer; David Edwards
Journal: Front Genet Date: 2022-05-18 Impact factor: 4.772

4. Comparison of Two-Dimensional- and Three-Dimensional-Based U-Net Architectures for Brain Tissue Classification in One-Dimensional Brain CT.

Authors: Meera Srikrishna; Rolf A Heckemann; Joana B Pereira; Giovanni Volpe; Anna Zettergren; Silke Kern; Eric Westman; Ingmar Skoog; Michael Schöll
Journal: Front Comput Neurosci Date: 2022-01-10 Impact factor: 2.380

5. A study of generalization and compatibility performance of 3D U-Net segmentation on multiple heterogeneous liver CT datasets.

Authors: Baochun He; Dalong Yin; Xiaoxia Chen; Huoling Luo; Deqiang Xiao; Mu He; Guisheng Wang; Chihua Fang; Lianxin Liu; Fucang Jia
Journal: BMC Med Imaging Date: 2021-11-24 Impact factor: 1.930

6. Evaluating the progress of deep learning for visual relational concepts.

Authors: Sebastian Stabinger; David Peer; Justus Piater; Antonio Rodríguez-Sánchez
Journal: J Vis Date: 2021-10-05 Impact factor: 2.240

7. A Contrast Augmentation Approach to Improve Multi-Scanner Generalization in MRI.

Authors: Maria Ines Meyer; Ezequiel de la Rosa; Nuno Pedrosa de Barros; Roberto Paolella; Koen Van Leemput; Diana M Sima
Journal: Front Neurosci Date: 2021-08-31 Impact factor: 4.677

8. Classifying the Acquisition Sequence for Brain MRIs Using Neural Networks on Single Slices.

Authors: Norbert Braeker; Cornelia Schmitz; Natalie Wagner; Badrudin J Stanicki; Christina Schröder; Felix Ehret; Christoph Fürweger; Daniel R Zwahlen; Robert Förster; Alexander Muacevic; Paul Windisch
Journal: Cureus Date: 2022-02-21

9. AutoMorph: Automated Retinal Vascular Morphology Quantification Via a Deep Learning Pipeline.

Authors: Yukun Zhou; Siegfried K Wagner; Mark A Chia; An Zhao; Peter Woodward-Court; Moucheng Xu; Robbert Struyven; Daniel C Alexander; Pearse A Keane
Journal: Transl Vis Sci Technol Date: 2022-07-08 Impact factor: 3.048

10. Machine Learning for Health: Algorithm Auditing & Quality Control.

Authors: Luis Oala; Andrew G Murchison; Pradeep Balachandran; Shruti Choudhary; Jana Fehr; Alixandro Werneck Leite; Peter G Goldschmidt; Christian Johner; Elora D M Schörverth; Rose Nakasi; Martin Meyer; Federico Cabitza; Pat Baird; Carolin Prabhu; Eva Weicken; Xiaoxuan Liu; Markus Wenzel; Steffen Vogler; Darlington Akogo; Shada Alsalamah; Emre Kazim; Adriano Koshiyama; Sven Piechottka; Sheena Macpherson; Ian Shadforth; Regina Geierhofer; Christian Matek; Joachim Krois; Bruno Sanguinetti; Matthew Arentz; Pavol Bielik; Saul Calderon-Ramirez; Auss Abbood; Nicolas Langer; Stefan Haufe; Ferath Kherif; Sameer Pujari; Wojciech Samek; Thomas Wiegand
Journal: J Med Syst Date: 2021-11-02 Impact factor: 4.920