Literature DB >> 35445341

Finding a Suitable Class Distribution for Building Histological Images Datasets Used in Deep Model Training-The Case of Cancer Detection.

Ismat Ara Reshma1, Camille Franchet2, Margot Gaspard2, Radu Tudor Ionescu3, Josiane Mothe4, Sylvain Cussat-Blanc4,5, Hervé Luga4, Pierre Brousset2,6,7.   

Abstract

The class distribution of a training dataset is an important factor which influences the performance of a deep learning-based system. Understanding the optimal class distribution is therefore crucial when building a new training set which may be costly to annotate. This is the case for histological images used in cancer diagnosis where image annotation requires domain experts. In this paper, we tackle the problem of finding the optimal class distribution of a training set to be able to train an optimal model that detects cancer in histological images. We formulate several hypotheses which are then tested in scores of experiments with hundreds of trials. The experiments have been designed to account for both segmentation and classification frameworks with various class distributions in the training set, such as natural, balanced, over-represented cancer, and over-represented non-cancer. In the case of cancer detection, the experiments show several important results: (a) the natural class distribution produces more accurate results than the artificially generated balanced distribution; (b) the over-representation of non-cancer/negative classes (healthy tissue and/or background classes) compared to cancer/positive classes reduces the number of samples which are falsely predicted as cancer (false positive); (c) the least expensive to annotate non-ROI (non-region-of-interest) data can be useful in compensating for the performance loss in the system due to a shortage of expensive to annotate ROI data; (d) the multi-label examples are more useful than the single-label ones to train a segmentation model; and (e) when the classification model is tuned with a balanced validation set, it is less affected than the segmentation model by the class distribution of the training set.
© 2022. The Author(s) under exclusive licence to Society for Imaging Informatics in Medicine.

Entities:  

Keywords:  Class distribution analysis; Class-biased training; Computer-aided diagnosis; Deep learning; Histological image; Image segmentation and classification; Medical information retrieval

Mesh:

Year:  2022        PMID: 35445341      PMCID: PMC9582112          DOI: 10.1007/s10278-022-00618-7

Source DB:  PubMed          Journal:  J Digit Imaging        ISSN: 0897-1889            Impact factor:   4.903


  19 in total

Review 1.  Deep learning.

Authors:  Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal:  Nature       Date:  2015-05-28       Impact factor: 49.962

2.  Deep Convolutional Neural Networks for Image Classification: A Comprehensive Review.

Authors:  Waseem Rawat; Zenghui Wang
Journal:  Neural Comput       Date:  2017-06-09       Impact factor: 2.026

3.  Detection of concealed cars in complex cargo X-ray imagery using Deep Learning.

Authors:  Nicolas Jaccard; Thomas W Rogers; Edward J Morton; Lewis D Griffin
Journal:  J Xray Sci Technol       Date:  2017       Impact factor: 1.535

4.  What Do Different Evaluation Metrics Tell Us About Saliency Models?

Authors:  Zoya Bylinskii; Tilke Judd; Aude Oliva; Antonio Torralba; Fredo Durand
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2018-03-13       Impact factor: 6.226

Review 5.  Histopathological image analysis: a review.

Authors:  Metin N Gurcan; Laura E Boucheron; Ali Can; Anant Madabhushi; Nasir M Rajpoot; B Yener
Journal:  IEEE Rev Biomed Eng       Date:  2009-10-30

6.  Dense Steerable Filter CNNs for Exploiting Rotational Symmetry in Histology Images.

Authors:  Simon Graham; David Epstein; Nasir Rajpoot
Journal:  IEEE Trans Med Imaging       Date:  2020-11-30       Impact factor: 10.048

7.  Cost-Sensitive Learning of Deep Feature Representations From Imbalanced Data.

Authors:  Salman H Khan; Munawar Hayat; Mohammed Bennamoun; Ferdous A Sohel; Roberto Togneri
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2017-08-17       Impact factor: 10.451

Review 8.  Artificial intelligence in digital pathology - new tools for diagnosis and precision oncology.

Authors:  Kaustav Bera; Kurt A Schalper; David L Rimm; Vamsidhar Velcheti; Anant Madabhushi
Journal:  Nat Rev Clin Oncol       Date:  2019-08-09       Impact factor: 66.675

Review 9.  Standard deviation and standard error of the mean.

Authors:  Dong Kyu Lee; Junyong In; Sangseok Lee
Journal:  Korean J Anesthesiol       Date:  2015-05-28

Review 10.  Machine Learning Methods for Histopathological Image Analysis.

Authors:  Daisuke Komura; Shumpei Ishikawa
Journal:  Comput Struct Biotechnol J       Date:  2018-02-09       Impact factor: 7.271

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.