| Literature DB >> 31788084 |
Paola Sena1, Rita Fioresi2, Francesco Faglioni3, Lorena Losi4, Giovanni Faglioni5, Luca Roncucci6.
Abstract
Trained pathologists base colorectal cancer identification on the visual interpretation of microscope images. However, image labeling is not always straightforward and this repetitive task is prone to mistakes due to human distraction. Significant efforts are underway to develop informative tools to assist pathologists and decrease the burden and frequency of errors. The present study proposes a deep learning approach to recognize four different stages of cancerous tissue development, including normal mucosa, early preneoplastic lesion, adenoma and cancer. A dataset of human colon tissue images collected and labeled over a 10-year period by a team of pathologists was partitioned into three sets. These were used to train, validate and test the neural network, comprising several convolutional and a few linear layers. The approach used in the present study is 'direct'; it labels raw images and bypasses the segmentation step. An overall accuracy of >95% was achieved, with the majority of mislabeling referring to a near category. Tests on an external dataset with a different resolution yielded accuracies >80%. The present study demonstrated that the neural network, when properly trained, can provide fast, accurate and reproducible labeling for colon cancer images, with the potential to significantly improve the quality and speed of medical diagnoses. Copyright: © Sena et al.Entities:
Keywords: colorectal cancer cells; deep learning; medical imaging
Year: 2019 PMID: 31788084 PMCID: PMC6865164 DOI: 10.3892/ol.2019.10928
Source DB: PubMed Journal: Oncol Lett ISSN: 1792-1074 Impact factor: 2.967
Figure 1.Neural network comprising four modules, each consisting of Convolution (3×3 kernels with 1pixel padding), BatchNorm2d, RELU and Maxpool, followed by a flattening operator, and 3 linear modules consisting of Linear, RELU and BatchNorm1d; RELU, rectified linear unit.
Small dataset and large dataset.
| A, Small dataset | |||||
|---|---|---|---|---|---|
| Small Dataset | Normal mucosa | Preneoplastic lesion | Adenoma | Colon cancer | Total |
| Training | 404 | 407 | 685 | 534 | 2,012 |
| Validation | 50 | 51 | 86 | 64 | 251 |
| Test | 50 | 51 | 85 | 64 | 250 |
| Training | 1,616 | 1,628 | 2,736 | 2,068 | 8,048 |
| Validation | 200 | 204 | 344 | 256 | 1,004 |
| Test | 200 | 204 | 340 | 256 | 1,000 |
Figure 2.Logarithmic plot of loss-of-function during typical optimization.
Test accuracy after short and long training.
| Training | Exact match large dataset (%) | Nearest match large dataset (%) | Exact match small dataset (%) | Nearest match small dataset (%) |
|---|---|---|---|---|
| Short | 93.79±0.76 | 99.85±0.11 | 92.92±0.86 | 99.20±0.25 |
| Long | 95.28±0.19 | 99.90±0.00 | 93.08±0.57 | 99.20±0.00 |
Accuracies for the test set of different optimization and evaluation conditions are expressed as the percentage of correct results ± standard deviation.
Figure 3.Histopathological sections of hematoxylin and eosin stained colon tissues that are representative of colorectal carcinogenesis. (A) Normal mucosa exhibits benign glands consisting of a regular circular lumen in cross section. (B) Pre-neoplastic lesion (aberrant crypt foci) consists of larger glands with enlarged epithelial nuclei, often stratified and crowded. (C) Adenoma is characterized by ovoid enlarged nuclei, vesicular dispersed chromatin and occasional mitoses. Sections exhibit an uneven distribution of goblet cells within crypts, luminal serration, budding, branching, crowding and fusion of glands. (D) In carcinoma, architectural changes increase with evolution and progression of malignancy. Luminal serration, budding, branching, crowding and fusion of glands are presented. Scale bar, 210 µm.
Confusion matrix for long training and large dataset. Rows correspond to predictions, columns to true labels.
| True→ Predicted↓ | M | P | A | K | Precision False positive |
|---|---|---|---|---|---|
| M | 0.3 | 0.0 | 0.0 | 98.2 1.8 | |
| P | 1.9 | 0.8 | 0.0 | 87.3 12.7 | |
| A | 0.0 | 1.5 | 0.0 | 95.6 4.4 | |
| K | 0.0 | 0.1 | 0.8 | 99.3 0.7 | |
| Recall False negative | 90.7 9.3 | 90.3 9.7 | 97.4 2.6 | 100.0 0.0 |
Rows correspond to predictions, columns to true labels. Each box contains the percentage with respect to all 1,006 test cases. The diagonal (bold) data contains correct predictions, all other data contains mislabeling. Bottom right box indicates the overall precision end error. M, normal mucosa; P, preneoplastic; K, cancer; A, adenoma.