| Literature DB >> 34257363 |
Sara P Oliveira1,2, Pedro C Neto3,4, João Fraga5, Diana Montezuma5,6,7, Ana Monteiro5, João Monteiro5, Liliana Ribeiro5, Sofia Gonçalves5, Isabel M Pinto5, Jaime S Cardoso3,4.
Abstract
Most oncological cases can be detected by imaging techniques, but diagnosis is based on pathological assessment of tissue samples. In recent years, the pathology field has evolved to a digital era where tissue samples are digitised and evaluated on screen. As a result, digital pathology opened up many research opportunities, allowing the development of more advanced image processing techniques, as well as artificial intelligence (AI) methodologies. Nevertheless, despite colorectal cancer (CRC) being the second deadliest cancer type worldwide, with increasing incidence rates, the application of AI for CRC diagnosis, particularly on whole-slide images (WSI), is still a young field. In this review, we analyse some relevant works published on this particular task and highlight the limitations that hinder the application of these works in clinical practice. We also empirically investigate the feasibility of using weakly annotated datasets to support the development of computer-aided diagnosis systems for CRC from WSI. Our study underscores the need for large datasets in this field and the use of an appropriate learning methodology to gain the most benefit from partially annotated datasets. The CRC WSI dataset used in this study, containing 1,133 colorectal biopsy and polypectomy samples, is available upon reasonable request.Entities:
Mesh:
Year: 2021 PMID: 34257363 PMCID: PMC8277780 DOI: 10.1038/s41598-021-93746-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Digital pathology workflow, from collecting the biopsy sample to the WSI visualisation.
Figure 2Normal colonic mucosa and dysplastic progression.
Colorectal low- and high-grade dysplasia characterisation.
| Low-grade dysplasia | High-grade dysplasia | |
|---|---|---|
| Extension | – | Changes must involve more than two glands (except in tiny biopsies of polyps) |
| Low power magnification | Lack of architectural complexity suggests low-grade dysplasia throughout | Alterations have to be enough to be identified atlow power: complex architectural abnormalities, epithelium looks thick, blue, disorganised and “dirty” |
| Cytology/architecture | Does not combine cytological high-grade dysplasia with architectural high-grade features | Needs to combine high-grade cytological and high-grade architectural alterations |
| Architectural features* | Gland crowding, showing parallel disposition, with no complexity (no back-to-back or cribriforming); Global architecture may vary from tubular to villous | Complex glandular crowding and irregularity; Prominent budding; Cribriform appearance and back-to-back glands; Prominent intra-luminal papillary tufting |
| Cytological features** | Nucleus are enlarged and hyperchromatic, many times cigar-shaped; Nucleus maintain basal orientation (only up to the lower half of the height of the epithelium, although in some cases we can see glands with full-thickness nuclear stratification - this is not HGD if the archite- ture is bland); There is no loss of cell polarity or pleomorphism; No atypical mitosis; Maintained cytological maturation (mucin) | Noticeably enlarged nuclei, often with a dispersed chromatin pattern and evident nucleoli; Loss of cell polarity or nuclear stratification to the extent that the nuclei are distributed within all 1/3 of the height of the epithelium; Atypical mitoses; Prominent apoptosis/necrosis, giving the lesion a “dirty” appearance; Lack of cytological maturation (loss of mucin) |
*Architectural features: gland morphology and placement; **Cytological features: cell level characteristics.
Literature overview on colorectal whole slide image diagnosis.
| Author | Year | Task | Dataset | Description | Results |
|---|---|---|---|---|---|
| Kalkan et al.[ | 2012 | CRC detection (normal vs cancer) | 120 H&E slides (tile annotations) | 1024 × 1024 px tiles; k-NN classifier + Logistic-linear classifier | Acc.: 87.69%; AUC: 0.90 |
| Korbar et al.[ | 2017 | Polyp classification (6-class): normal, hyperplastic, sessile serrated, traditional serrated, tubular and tubulovillous/villous | 697 H&E slides (annotated) | 811 × 984 px ROIs (mean size); ResNet-152 + argmax of tile class frequency | Acc.: 93%; Precision: 89.7%; Recall: 88.3%; F1-score: 88.8% |
| Yoshida et al.[ | 2017 | CRC classification (4-class): unclassifiable, non-neoplastic, adenoma and CA | 1068 H&E slides (w/ labelled tissue sections) | Tissue sections crop + cytological atypia analysis + structural atypia analysis + overall classification | FNR (CA): 9.3%; FNR (adenoma): 0%; FPR: 27.1% |
| Iizuka et al.[ | 2020 | CRC classification (3-class): non-neoplastic, AD and ADC | 4536 H&E slides (annotated) + 547 H&E slides from TCGA-COAD collection | 512 × 512 px tiles at 20×; Inception-v3 + RNN | AUC: 0.962 (ADC), 0.993 (AD); AUC (TCGA-COAD subset): 0.982 (ADC) |
| Song et al.[ | 2020 | Colorectal adenoma detection (normal vs adenoma) | 411 H&E slides (annotated) + external set: 168 H&E slides | 640 × 640 px tiles at 10×; Modified DeepLab-v2 + 15th largest pixel probability | AUC: 0.92; Acc. (external set): >90% |
| Wei et al.[ | 2020 | Polyp classification (5-class): Normal, hyperplastic, tubular, tubulovillous/villous, sessile serrated | 508 H&E slides (annotated) + external set: 238 H&E slides | 224 × 224 px tiles at 40×; ResNet models ensemble + hierarchical classification | Acc.: 93.5%; Acc. (external set): 87% |
| Xu et al.[ | 2020 | CRC detection (normal vs cancer) | 307 H&E slides (annotated) + 50 H&E slides (external set) | 768 × 768 px tiles; Inception-v3 + tiles tumour probability thresholding | Acc.: >93%; Acc. (external set): >87% |
CRC: Colorectal Cancer; AD: Adenoma; CA: Carcinoma; ADC: Adenocarcinoma; H&E: Haemotoxylin & Eosin; px: pixels; k-NN: k Nearest Neighbours; ROI: Region of Interest CNN: Convolutional Neu- ral Network; SVM: Support Vector Machine; MLP: Multi-Layer Perceptron; MIL: Multiple Instance Learning; Acc.: Accuracy; AUC: Area Under the ROC Curve; FNR/FPR: False Negative/Positive Rate
CRC dataset class definition.
| Algorithm data classes | Pathological diagnosis |
|---|---|
| Non-neoplastic | Normal CR mucosa, non-specific inflammation, hyperplasia |
| Low-grade lesion | Low-grade conventional adenoma |
| High-grade lesion | High-grade conventional adenoma and invasive adenocarcinoma |
Figure 3Example of a whole-slide (a) from the CRC dataset. Manual segmentations (b) include regions annotated as non-neoplastic (white), low-grade lesions (blue), high-grade lesions (pink), linfocytes (green) and fulguration (yellow).
Figure 4Slide classes distribution on CRC dataset.
Comparison between the number of tiles extracted from the PCa slides and the CRC slides.
| Dataset | # Slides | # Tiles | Mean # Tiles per slide |
|---|---|---|---|
| PCa | 9825 | 253,291 | 25.78 |
| CRC all | 1133 | 1,322,596 | 1167.34 |
| CRC annotated | 100 | 211,235 | 2112.35 |
Figure 5Proposed workflow for colorectal cancer diagnosis on whole-slide images.
Evolution of the model performance when trained on subsets of the PCa dataset with different sizes, keeping the test set size constant (n=1,477).
| # Train slides | # Train tiles | QWK score | Accuracy (%) |
|---|---|---|---|
| 80 | 1919 | 0.497 | 32.36 |
| 160 | 3851 | 0.586 | 37.71 |
| 500 | 12,714 | 0.628 | 41.28 |
| 1000 | 25,757 | 0.692 | 47.66 |
| 2500 | 64,697 | 0.738 | 50.03 |
| 5000 | 129,734 | 0.771 | 58.43 |
| 8348 | 215,116 |
Bold values indicate best results
Performance comparison of the model trained on a subset of the PCa dataset, when evaluated on test sets with different sizes.
| Dataset | # Train slides | # Test slides | # Train tiles | # Test tiles | QWK score |
|---|---|---|---|---|---|
| PCa | 80 | 1,477 | 1,919 | 38,175 | 0.497 |
| PCa | 80 | 20 | 1,919 | 579 |
Bold value indicates best results
Performance of the model on the different experiments on the CRC dataset.
| Dataset | Pre-train | QWK score | Accuracy (%) | Convergence Time (Epoch) |
|---|---|---|---|---|
| CRC annotated (n=100) | No | 0.583 | 75.00 | 6.5 h (13) |
| CRC All (n=1,133) | No | 0.795 | 84.17 | 2 days and 19 h (27) |
| CRC All (n=1,133) | Yes | 4 days (40) |
Bold values indicate best results
Figure 6Performance evaluated on CRC dataset.