| Literature DB >> 33828898 |
Sudhir Sornapudi1, R Joe Stanley1, William V Stoecker2, Rodney Long3, Zhiyun Xue3, Rosemary Zuna4, Shellaine R Frazier5, Sameer Antani3.
Abstract
BACKGROUND: Cervical cancer is one of the deadliest cancers affecting women globally. Cervical intraepithelial neoplasia (CIN) assessment using histopathological examination of cervical biopsy slides is subject to interobserver variability. Automated processing of digitized histopathology slides has the potential for more accurate classification for CIN grades from normal to increasing grades of pre-malignancy: CIN1, CIN2, and CIN3.Entities:
Keywords: Attention networks; cervical cancer; cervical intraepithelial neoplasia; classification; convolutional neural networks; digital pathology; fusion-based classification; histology; recurrent neural networks
Year: 2020 PMID: 33828898 PMCID: PMC8020842 DOI: 10.4103/jpi.jpi_50_20
Source DB: PubMed Journal: J Pathol Inform
Figure 1Sections of epithelium region with increasing cervical intraepithelial neoplasia severity (from [b-d]) showing delayed maturation with an increase in immature atypical cells from bottom to top. The sections can be categorized as (a) Normal, (b) CIN1, (c) CIN2, and (d) CIN3. In these images, left to right corresponds to bottom to top of the epithelium
Figure 2Overview of DeepCIN model
Figure 3Localized vertical segment generation from an epithelial image
Figure 4Segment-level sequence generator network with two-stage encoder structures
Segment-level sequence generator model architecture
| Layers | Configurations | Size | |
|---|---|---|---|
| Stage I | Input | - | 3×64×704 |
| Transition layer 0 | 64×32×352 | ||
| Dense block 1 | 256×16×176 | ||
| Transition layer 1 | 128×8×88 | ||
| Dense block 2 | 512×8×88 | ||
| Transition layer 2 | 256×4×44 | ||
| Dense block 3 | 1024×4×44 | ||
| Pooling | 1024×1×44 | ||
| Stage II | BLSTM+ | 512×44 | |
| BLSTM+ NN | 512×44 | ||
| Output | - | 4×1 |
k, s, p, mp, ap, and nh, are kernel, stride size, padding size, max pooling, average pooling, and number of hidden layers, respectively. “BLSTM” and “NN” stands for bidirectional LSTM and single-layer neural network, respectively. : Bidirectional Long-Short-Term Memory, NN: Neural network, LSTM: Long-Short-Term Memory
Figure 5Attention-based fusion network for epithelial image-level classification. The input sequences are fed to GRU cells. ѲDenote a two-layer neural network with hyperbolic tangent and softmax activation functions, respectively to generate attentional weights. ѲDenotes a single layer NN with softmax activation function that produces the classification output
Class label distribution from 453 epithelial images
| Class | Count(%) | |
|---|---|---|
| Epithelial images | Segments | |
| Normal | 244(53.8) | 6836 (57.7) |
| CIN1 | 57 (12.6) | 1433(12.1) |
| CIN2 | 79(17.5) | 2039(17.2) |
| CIN3 | 73 (16.1) | 1546 (13.0) |
| Total | 453(100.0) | 11,854(100.0) |
CIN: Cervical intraepithelial neoplasia
Ablation study on segment widths
| Segment width | F1 | ACC | AUC | AP | MCC | ||
|---|---|---|---|---|---|---|---|
| 32 | 82.9 | 82.3 | 81.2 | 82.3 | 93.5 | 85.3 | 72.3 |
| 64* | 88.6 | 88.5 | 88.0 | 88.5 | 96.5 | 91.5 | 82.0 |
| 128 | 85.3 | 85.6 | 84.9 | 85.6 | 95.9 | 89.8 | 77.1 |
P: Precision, R: Recall, F1: F1-score, AP: Average precision, MCC: Matthews correlation coefficient, AUC: Area under Receiver Operating Characteristic curve, ACC: Classification accuracy, *Indicates the best performing model
Ablation study on stage I encoder models
| Stage I encoder | F1 | ACC | AUC | AP | MCC | ||
|---|---|---|---|---|---|---|---|
| DesnseNet-121* | 88.6 | 88.5 | 88.0 | 88.5 | 96.5 | 91.5 | 82.0 |
| ResNet-101 | 87.1 | 86.9 | 86.4 | 86.9 | 95.0 | 88.9 | 79.6 |
| Inception-v3 | 85.5 | 85.4 | 85.1 | 85.4 | 94.8 | 87.8 | 77.1 |
P: Precision, R: Recall, F1: F1-score, AP: Average precision, MCC: Matthews correlation coefficient, AUC: Area under Receiver Operating Characteristic curve, ACC: Classification accuracy
Ablation study on stage II encoder models
| Stage II encoder | F1 | ACC | AUC | AP | MCC | ||
|---|---|---|---|---|---|---|---|
| BLSTM* | 88.6 | 88.5 | 88.0 | 88.5 | 96.5 | 91.5 | 82.0 |
| BLSTM+attention | 87.9 | 87.6 | 87.7 | 87.6 | 95.2 | 88.9 | 80.1 |
| FC | 85.3 | 85.0 | 84.2 | 85.0 | 94.7 | 87.4 | 76.3 |
BLSTM: Bidirectional Long-Short-Term Memory, P: Precision, R: Recall, F1: F1-score, AP: Average precision, MCC: Matthews correlation coefficient, AUC: Area under Receiver Operating Characteristic curve, ACC: Classification accuracy, FC: Fully-connected layer, * indicates the best performing model
Ablation study on fusion techniques
| Fusion | F1 | ACC | AUC | AP | MCC | ||
|---|---|---|---|---|---|---|---|
| GRU | 86.3 | 86.1 | 85.6 | 86.1 | 96.3 | 90.4 | 78.0 |
| GRU+attention* | 88.6 | 88.5 | 88.0 | 88.5 | 96.5 | 91.5 | 82.0 |
| Max vote | 87.6 | 87.2 | 87.0 | 87.2 | - | - | 79.9 |
| Avg vote | 88.0 | 87.6 | 87.4 | 87.6 | - | - | 80.6 |
GRU: Gated recurrent unit
Comparison with state-of-the-art models
| Model | F1 | ACC | AUC | AP | MCC | ||
|---|---|---|---|---|---|---|---|
| Guo | 67.5 | 73.3 | 69.4 | 73.4 | - | - | 56.5 |
| AlMubarak | 66.1 | 75.6 | 70.4 | 75.5 | 90.9 | 78.1 | 60.3 |
| Ours* | 88.6 | 88.5 | 88.0 | 88.5 | 96.5 | 91.5 | 82.0 |
P: Precision, R: Recall, F1: F1-score, AP: Average precision, MCC: Matthews correlation coefficient, AUC: Area under receiver operating characteristic curve, ACC: Classification accuracy
Figure 6Results of DeepCIN. From top to bottom, each column presents original image, localized vertical regions, contribution of segments within an image toward the image-level CIN classification (represented as probability distribution over the segments [attentional weights], the dotted lines indicate mean value and segments above the mean value, highlighted in green, are contributing the most), and corresponding ground truth and prediction labels, respectively
Figure 7Sankey diagram – based on the combined test results from the fivefold cross-validation. The height of each bar is proportional to the number of samples corresponding to each class
Fivefold cross-validation results with different scoring schemes
| Scoring scheme | F1 | ACC | AUC | AP | MCC | κ | ||
|---|---|---|---|---|---|---|---|---|
| Exact class label | 88.6 | 88.5 | 88.0 | 88.5 | 96.5 | 91.5 | 82.0 | 81.5 |
| CIN versus Normal | 94.6 | 94.1 | 94.0 | 94.1 | 93.8 | 97.7 | 88.5 | 87.9 |
| CIN3-CIN2 versus CIN1-normal | 96.8 | 96.7 | 96.7 | 96.7 | 96.0 | 98.9 | 92.7 | 92.5 |
| CIN3 versus CIN2-CIN1-normal | 96.2 | 96.0 | 96.0 | 96.0 | 88.4 | 98.3 | 85.3 | 84.8 |
| Off-by-one | - | - | - | 98.9 | - | - | - | - |
P: Precision, R: Recall, F1: F1-score, AP: Average precision, MCC: Matthews correlation coefficient, AUC: Area under receiver operating characteristic curve, ACC: Classification accuracy
Cervical intraepithelial neoplasia classification results on 224 image-set
| Scoring scheme | F1 | ACC | AUC | AP | MCC | κ | ||
|---|---|---|---|---|---|---|---|---|
| Exact class label | 90.2 | 88.4 | 88.2 | 88.4 | 98.0 | 93.1 | 80.5 | 80.0 |
| CIN versus normal | 97.3 | 97.3 | 97.3 | 97.3 | 97.2 | 99.7 | 94.4 | 94.4 |
| CIN3-CIN2 versus CIN1-Normal | 95.7 | 95.6 | 95.5 | 95.5 | 94.0 | 99.1 | 90.3 | 90.0 |
| CIN3 versus CIN2-CIN1-Normal | 93.0 | 92.4 | 91.5 | 92.4 | 78.2 | 97.0 | 71.9 | 68.1 |
| Off-by-one | - | - | - | 98.2 | - | - | - | - |
P: Precision, R: Recall, F1: F1-score, AP: Average precision, MCC: Matthews correlation coefficient, AUC: Area under receiver operating characteristic curve, ACC: Classification accuracy