| Literature DB >> 33565124 |
Jeremias Krause1, Heike I Grabsch2,3, Matthias Kloor4, Michael Jendrusch4, Amelie Echle1, Roman David Buelow5, Peter Boor5, Tom Luedde6, Titus J Brinker7, Christian Trautwein1, Alexander T Pearson8, Philip Quirke3, Josien Jenniskens9, Kelly Offermans9, Piet A van den Brandt9, Jakob Nikolas Kather1,3,10.
Abstract
Deep learning can detect microsatellite instability (MSI) from routine histology images in colorectal cancer (CRC). However, ethical and legal barriers impede sharing of images and genetic data, hampering development of new algorithms for detection of MSI and other biomarkers. We hypothesized that histology images synthesized by conditional generative adversarial networks (CGANs) retain information about genetic alterations. To test this, we developed a 'histology CGAN' which was trained on 256 patients (training cohort 1) and 1457 patients (training cohort 2). The CGAN synthesized 10 000 synthetic MSI and non-MSI images which contained a range of tissue types and were deemed realistic by trained observers in a blinded study. Subsequently, we trained a deep learning detector of MSI on real or synthetic images and evaluated the performance of MSI detection in a held-out set of 142 patients. When trained on real images from training cohort 1, this system achieved an area under the receiver operating curve (AUROC) of 0.742 [0.681, 0.854]. Training on the larger cohort 2 only marginally improved the AUROC to 0.757 [0.707, 0.869]. Training on purely synthetic data resulted in an AUROC of 0.743 [0.658, 0.801]. Training on both real and synthetic data further increased AUROC to 0.777 [0.715, 0.821]. We conclude that synthetic histology images retain information reflecting underlying genetic alterations in colorectal cancer. Using synthetic instead of real images to train deep learning systems yields non-inferior classifiers. This approach can be used to create large shareable data sets or to augment small data sets with rare molecular features.Entities:
Keywords: artificial intelligence; colorectal cancer; deep learning; digital pathology; generative adversarial network; generative model; machine learning; microsatellite instability
Mesh:
Year: 2021 PMID: 33565124 DOI: 10.1002/path.5638
Source DB: PubMed Journal: J Pathol ISSN: 0022-3417 Impact factor: 7.996