Literature DB >> 33007638

The reliability of a deep learning model in clinical out-of-distribution MRI data: A multicohort study.

Gustav Mårtensson1, Daniel Ferreira2, Tobias Granberg3, Lena Cavallin3, Ketil Oppedal4, Alessandro Padovani5, Irena Rektorova6, Laura Bonanni7, Matteo Pardini8, Milica G Kramberger9, John-Paul Taylor10, Jakub Hort11, Jón Snædal12, Jaime Kulisevsky13, Frederic Blanc14, Angelo Antonini15, Patrizia Mecocci16, Bruno Vellas17, Magda Tsolaki18, Iwona Kłoszewska19, Hilkka Soininen20, Simon Lovestone21, Andrew Simmons22, Dag Aarsland23, Eric Westman24.   

Abstract

Deep learning (DL) methods have in recent years yielded impressive results in medical imaging, with the potential to function as clinical aid to radiologists. However, DL models in medical imaging are often trained on public research cohorts with images acquired with a single scanner or with strict protocol harmonization, which is not representative of a clinical setting. The aim of this study was to investigate how well a DL model performs in unseen clinical datasets-collected with different scanners, protocols and disease populations-and whether more heterogeneous training data improves generalization. In total, 3117 MRI scans of brains from multiple dementia research cohorts and memory clinics, that had been visually rated by a neuroradiologist according to Scheltens' scale of medial temporal atrophy (MTA), were included in this study. By training multiple versions of a convolutional neural network on different subsets of this data to predict MTA ratings, we assessed the impact of including images from a wider distribution during training had on performance in external memory clinic data. Our results showed that our model generalized well to datasets acquired with similar protocols as the training data, but substantially worse in clinical cohorts with visibly different tissue contrasts in the images. This implies that future DL studies investigating performance in out-of-distribution (OOD) MRI data need to assess multiple external cohorts for reliable results. Further, by including data from a wider range of scanners and protocols the performance improved in OOD data, which suggests that more heterogeneous training data makes the model generalize better. To conclude, this is the most comprehensive study to date investigating the domain shift in deep learning on MRI data, and we advocate rigorous evaluation of DL models on clinical data prior to being certified for deployment.
Copyright © 2020 The Author(s). Published by Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Clinical application; Deep learning; Domain shift; Neuroimaging

Mesh:

Year:  2020        PMID: 33007638     DOI: 10.1016/j.media.2020.101714

Source DB:  PubMed          Journal:  Med Image Anal        ISSN: 1361-8415            Impact factor:   8.545


  12 in total

1.  Multi-Institutional Validation of a Mammography-Based Breast Cancer Risk Model.

Authors:  Adam Yala; Peter G Mikhael; Fredrik Strand; Gigin Lin; Siddharth Satuluru; Thomas Kim; Imon Banerjee; Judy Gichoya; Hari Trivedi; Constance D Lehman; Kevin Hughes; David J Sheedy; Lisa M Matthis; Bipin Karunakaran; Karen E Hegarty; Silvia Sabino; Thiago B Silva; Maria C Evangelista; Renato F Caron; Bruno Souza; Edmundo C Mauad; Tal Patalon; Sharon Handelman-Gotlib; Michal Guindy; Regina Barzilay
Journal:  J Clin Oncol       Date:  2021-11-12       Impact factor: 50.717

2.  Data harmonisation for information fusion in digital healthcare: A state-of-the-art systematic review, meta-analysis and future research directions.

Authors:  Yang Nan; Javier Del Ser; Simon Walsh; Carola Schönlieb; Michael Roberts; Ian Selby; Kit Howard; John Owen; Jon Neville; Julien Guiot; Benoit Ernst; Ana Pastor; Angel Alberich-Bayarri; Marion I Menzel; Sean Walsh; Wim Vos; Nina Flerin; Jean-Paul Charbonnier; Eva van Rikxoort; Avishek Chatterjee; Henry Woodruff; Philippe Lambin; Leonor Cerdá-Alberich; Luis Martí-Bonmatí; Francisco Herrera; Guang Yang
Journal:  Inf Fusion       Date:  2022-06       Impact factor: 17.564

Review 3.  Plant Genotype to Phenotype Prediction Using Machine Learning.

Authors:  Monica F Danilevicz; Mitchell Gill; Robyn Anderson; Jacqueline Batley; Mohammed Bennamoun; Philipp E Bayer; David Edwards
Journal:  Front Genet       Date:  2022-05-18       Impact factor: 4.772

4.  Comparison of Two-Dimensional- and Three-Dimensional-Based U-Net Architectures for Brain Tissue Classification in One-Dimensional Brain CT.

Authors:  Meera Srikrishna; Rolf A Heckemann; Joana B Pereira; Giovanni Volpe; Anna Zettergren; Silke Kern; Eric Westman; Ingmar Skoog; Michael Schöll
Journal:  Front Comput Neurosci       Date:  2022-01-10       Impact factor: 2.380

5.  A study of generalization and compatibility performance of 3D U-Net segmentation on multiple heterogeneous liver CT datasets.

Authors:  Baochun He; Dalong Yin; Xiaoxia Chen; Huoling Luo; Deqiang Xiao; Mu He; Guisheng Wang; Chihua Fang; Lianxin Liu; Fucang Jia
Journal:  BMC Med Imaging       Date:  2021-11-24       Impact factor: 1.930

6.  Evaluating the progress of deep learning for visual relational concepts.

Authors:  Sebastian Stabinger; David Peer; Justus Piater; Antonio Rodríguez-Sánchez
Journal:  J Vis       Date:  2021-10-05       Impact factor: 2.240

7.  A Contrast Augmentation Approach to Improve Multi-Scanner Generalization in MRI.

Authors:  Maria Ines Meyer; Ezequiel de la Rosa; Nuno Pedrosa de Barros; Roberto Paolella; Koen Van Leemput; Diana M Sima
Journal:  Front Neurosci       Date:  2021-08-31       Impact factor: 4.677

8.  Classifying the Acquisition Sequence for Brain MRIs Using Neural Networks on Single Slices.

Authors:  Norbert Braeker; Cornelia Schmitz; Natalie Wagner; Badrudin J Stanicki; Christina Schröder; Felix Ehret; Christoph Fürweger; Daniel R Zwahlen; Robert Förster; Alexander Muacevic; Paul Windisch
Journal:  Cureus       Date:  2022-02-21

9.  AutoMorph: Automated Retinal Vascular Morphology Quantification Via a Deep Learning Pipeline.

Authors:  Yukun Zhou; Siegfried K Wagner; Mark A Chia; An Zhao; Peter Woodward-Court; Moucheng Xu; Robbert Struyven; Daniel C Alexander; Pearse A Keane
Journal:  Transl Vis Sci Technol       Date:  2022-07-08       Impact factor: 3.048

10.  Machine Learning for Health: Algorithm Auditing & Quality Control.

Authors:  Luis Oala; Andrew G Murchison; Pradeep Balachandran; Shruti Choudhary; Jana Fehr; Alixandro Werneck Leite; Peter G Goldschmidt; Christian Johner; Elora D M Schörverth; Rose Nakasi; Martin Meyer; Federico Cabitza; Pat Baird; Carolin Prabhu; Eva Weicken; Xiaoxuan Liu; Markus Wenzel; Steffen Vogler; Darlington Akogo; Shada Alsalamah; Emre Kazim; Adriano Koshiyama; Sven Piechottka; Sheena Macpherson; Ian Shadforth; Regina Geierhofer; Christian Matek; Joachim Krois; Bruno Sanguinetti; Matthew Arentz; Pavol Bielik; Saul Calderon-Ramirez; Auss Abbood; Nicolas Langer; Stefan Haufe; Ferath Kherif; Sameer Pujari; Wojciech Samek; Thomas Wiegand
Journal:  J Med Syst       Date:  2021-11-02       Impact factor: 4.920

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.