Literature DB >> 33528370

Hidden Variables in Deep Learning Digital Pathology and Their Potential to Cause Batch Effects: Prediction Model Study.

Max Schmitt1, Roman Christoph Maron1, Achim Hekler1, Albrecht Stenzinger2, Axel Hauschild3, Michael Weichenthal3, Markus Tiemann4, Dieter Krahl5, Heinz Kutzner6, Jochen Sven Utikal7,8, Sebastian Haferkamp9, Jakob Nikolas Kather10, Frederick Klauschen11, Eva Krieghoff-Henning1, Stefan Fröhling12, Christof von Kalle13, Titus Josef Brinker1.   

Abstract

BACKGROUND: An increasing number of studies within digital pathology show the potential of artificial intelligence (AI) to diagnose cancer using histological whole slide images, which requires large and diverse data sets. While diversification may result in more generalizable AI-based systems, it can also introduce hidden variables. If neural networks are able to distinguish/learn hidden variables, these variables can introduce batch effects that compromise the accuracy of classification systems.
OBJECTIVE: The objective of the study was to analyze the learnability of an exemplary selection of hidden variables (patient age, slide preparation date, slide origin, and scanner type) that are commonly found in whole slide image data sets in digital pathology and could create batch effects.
METHODS: We trained four separate convolutional neural networks (CNNs) to learn four variables using a data set of digitized whole slide melanoma images from five different institutes. For robustness, each CNN training and evaluation run was repeated multiple times, and a variable was only considered learnable if the lower bound of the 95% confidence interval of its mean balanced accuracy was above 50.0%.
RESULTS: A mean balanced accuracy above 50.0% was achieved for all four tasks, even when considering the lower bound of the 95% confidence interval. Performance between tasks showed wide variation, ranging from 56.1% (slide preparation date) to 100% (slide origin).
CONCLUSIONS: Because all of the analyzed hidden variables are learnable, they have the potential to create batch effects in dermatopathology data sets, which negatively affect AI-based classification systems. Practitioners should be aware of these and similar pitfalls when developing and evaluating such systems and address these and potentially other batch effect variables in their data sets through sufficient data set stratification. ©Max Schmitt, Roman Christoph Maron, Achim Hekler, Albrecht Stenzinger, Axel Hauschild, Michael Weichenthal, Markus Tiemann, Dieter Krahl, Heinz Kutzner, Jochen Sven Utikal, Sebastian Haferkamp, Jakob Nikolas Kather, Frederick Klauschen, Eva Krieghoff-Henning, Stefan Fröhling, Christof von Kalle, Titus Josef Brinker. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 02.02.2021.

Entities:  

Keywords:  artifacts; artificial intelligence; clinical pathology; convolutional neural networks; deep learning; digital pathology; machine learning; neural networks; pathology; pitfalls

Year:  2021        PMID: 33528370      PMCID: PMC7886613          DOI: 10.2196/23436

Source DB:  PubMed          Journal:  J Med Internet Res        ISSN: 1438-8871            Impact factor:   5.428


  23 in total

Review 1.  A study about color normalization methods for histopathology images.

Authors:  Santanu Roy; Alok Kumar Jain; Shyam Lal; Jyoti Kini
Journal:  Micron       Date:  2018-08-01       Impact factor: 2.251

2.  Diagnostic Assessment of Deep Learning Algorithms for Detection of Lymph Node Metastases in Women With Breast Cancer.

Authors:  Babak Ehteshami Bejnordi; Mitko Veta; Paul Johannes van Diest; Bram van Ginneken; Nico Karssemeijer; Geert Litjens; Jeroen A W M van der Laak; Meyke Hermsen; Quirine F Manson; Maschenka Balkenhol; Oscar Geessink; Nikolaos Stathonikos; Marcory Crf van Dijk; Peter Bult; Francisco Beca; Andrew H Beck; Dayong Wang; Aditya Khosla; Rishab Gargeya; Humayun Irshad; Aoxiao Zhong; Qi Dou; Quanzheng Li; Hao Chen; Huang-Jing Lin; Pheng-Ann Heng; Christian Haß; Elia Bruni; Quincy Wong; Ugur Halici; Mustafa Ümit Öner; Rengul Cetin-Atalay; Matt Berseth; Vitali Khvatkov; Alexei Vylegzhanin; Oren Kraus; Muhammad Shaban; Nasir Rajpoot; Ruqayya Awan; Korsuk Sirinukunwattana; Talha Qaiser; Yee-Wah Tsang; David Tellez; Jonas Annuscheit; Peter Hufnagl; Mira Valkonen; Kimmo Kartasalo; Leena Latonen; Pekka Ruusuvuori; Kaisa Liimatainen; Shadi Albarqouni; Bharti Mungal; Ami George; Stefanie Demirci; Nassir Navab; Seiryo Watanabe; Shigeto Seno; Yoichi Takenaka; Hideo Matsuda; Hady Ahmady Phoulady; Vassili Kovalev; Alexander Kalinovsky; Vitali Liauchuk; Gloria Bueno; M Milagro Fernandez-Carrobles; Ismael Serrano; Oscar Deniz; Daniel Racoceanu; Rui Venâncio
Journal:  JAMA       Date:  2017-12-12       Impact factor: 56.272

Review 3.  Multiscale integration of -omic, imaging, and clinical data in biomedical informatics.

Authors:  John H Phan; Chang F Quo; Chihwen Cheng; May Dongmei Wang
Journal:  IEEE Rev Biomed Eng       Date:  2012

4.  Automatic batch-invariant color segmentation of histological cancer images.

Authors:  Sonal Kothari; John H Phan; Richard A Moffitt; Todd H Stokes; Shelby E Hassberger; Qaiser Chaudry; Andrew N Young; May D Wang
Journal:  Proc IEEE Int Symp Biomed Imaging       Date:  2011 Mar-Apr

5.  Removing batch effects from histopathological images for enhanced cancer diagnosis.

Authors:  Sonal Kothari; John H Phan; Todd H Stokes; Adeboye O Osunkoya; Andrew N Young; May D Wang
Journal:  IEEE J Biomed Health Inform       Date:  2014-05       Impact factor: 5.772

6.  Stain Normalization using Sparse AutoEncoders (StaNoSA): Application to digital pathology.

Authors:  Andrew Janowczyk; Ajay Basavanhally; Anant Madabhushi
Journal:  Comput Med Imaging Graph       Date:  2016-05-16       Impact factor: 4.790

Review 7.  Artificial intelligence in digital pathology - new tools for diagnosis and precision oncology.

Authors:  Kaustav Bera; Kurt A Schalper; David L Rimm; Vamsidhar Velcheti; Anant Madabhushi
Journal:  Nat Rev Clin Oncol       Date:  2019-08-09       Impact factor: 66.675

8.  Accurate and reproducible invasive breast cancer detection in whole-slide images: A Deep Learning approach for quantifying tumor extent.

Authors:  Angel Cruz-Roa; Hannah Gilmore; Ajay Basavanhally; Michael Feldman; Shridar Ganesan; Natalie N C Shih; John Tomaszewski; Fabio A González; Anant Madabhushi
Journal:  Sci Rep       Date:  2017-04-18       Impact factor: 4.379

9.  Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study.

Authors:  John R Zech; Marcus A Badgeley; Manway Liu; Anthony B Costa; Joseph J Titano; Eric Karl Oermann
Journal:  PLoS Med       Date:  2018-11-06       Impact factor: 11.069

10.  Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning.

Authors:  Nicolas Coudray; Paolo Santiago Ocampo; Theodore Sakellaropoulos; Navneet Narula; Matija Snuderl; David Fenyö; Andre L Moreira; Narges Razavian; Aristotelis Tsirigos
Journal:  Nat Med       Date:  2018-09-17       Impact factor: 53.440

View more
  7 in total

Review 1.  Digital pathology and artificial intelligence in translational medicine and clinical practice.

Authors:  Vipul Baxi; Robin Edwards; Michael Montalto; Saurabh Saha
Journal:  Mod Pathol       Date:  2021-10-05       Impact factor: 7.842

2.  Improving Diagnosis Through Digital Pathology: Proof-of-Concept Implementation Using Smart Contracts and Decentralized File Storage.

Authors:  Hemang Subramanian; Susmitha Subramanian
Journal:  J Med Internet Res       Date:  2022-03-28       Impact factor: 5.428

3.  Validating Automatic Concept-Based Explanations for AI-Based Digital Histopathology.

Authors:  Daniel Sauter; Georg Lodde; Felix Nensa; Dirk Schadendorf; Elisabeth Livingstone; Markus Kukuk
Journal:  Sensors (Basel)       Date:  2022-07-18       Impact factor: 3.847

Review 4.  Computational pathology in ovarian cancer.

Authors:  Sandra Orsulic; Joshi John; Ann E Walts; Arkadiusz Gertych
Journal:  Front Oncol       Date:  2022-07-29       Impact factor: 5.738

Review 5.  Developing image analysis methods for digital pathology.

Authors:  Peter Bankhead
Journal:  J Pathol       Date:  2022-05-23       Impact factor: 9.883

6.  Computational tumor stroma reaction evaluation led to novel prognosis-associated fibrosis and molecular signature discoveries in high-grade serous ovarian carcinoma.

Authors:  Jun Jiang; Burak Tekin; Lin Yuan; Sebastian Armasu; Stacey J Winham; Ellen L Goode; Hongfang Liu; Yajue Huang; Ruifeng Guo; Chen Wang
Journal:  Front Med (Lausanne)       Date:  2022-09-07

7.  Quality control stress test for deep learning-based diagnostic model in digital pathology.

Authors:  Birgid Schömig-Markiefka; Alexey Pryalukhin; Wolfgang Hulla; Andrey Bychkov; Junya Fukuoka; Anant Madabhushi; Viktor Achter; Lech Nieroda; Reinhard Büttner; Alexander Quaas; Yuri Tolkach
Journal:  Mod Pathol       Date:  2021-06-24       Impact factor: 7.842

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.