| Literature DB >> 30755214 |
Wasswa William1, Andrew Ware2, Annabella Habinka Basaza-Ejiri3, Johnes Obungoloch4.
Abstract
BACKGROUND: Cervical cancer is preventable if effective screening measures are in place. Pap-smear is the commonest technique used for early screening and diagnosis of cervical cancer. However, the manual analysis of the pap-smears is error prone due to human mistake, moreover, the process is tedious and time-consuming. Hence, it is beneficial to develop a computer-assisted diagnosis tool to make the pap-smear test more accurate and reliable. This paper describes the development of a tool for automated diagnosis and classification of cervical cancer from pap-smear images.Entities:
Keywords: Cervical cancer; Fuzzy C-means; Pap-smear
Mesh:
Year: 2019 PMID: 30755214 PMCID: PMC6373062 DOI: 10.1186/s12938-019-0634-5
Source DB: PubMed Journal: Biomed Eng Online ISSN: 1475-925X Impact factor: 2.819
Some of the available techniques in the literature for automated/semi-automated detection of cervical cancer from pap-smear images
| Author | Paper | Datasets | Features | Pre-processing | Segmentation | Classification | Results |
|---|---|---|---|---|---|---|---|
| Su et al. [ | Automatic detection of cervical cancer cells by a two-level cascade classification system | Liquid-based cytology slides | 20 Morphological and 8 texture features | Histogram equalization and Median filter | Adaptive threshold | C4.5 and Logical Regression classifiers | Recognition rates of 95.6% achieved |
| Sharma et al. [ | Classification of clinical dataset of cervical cancer using KNN | Single cells data sets from Fortis Hospital, India | 7 morphological features | Min–max and edge detection | K-nearest neighbour | Accuracy of 82.9% with fivefold cross-validation | |
| Kumar et al. [ | Detection and classification of cancer from microscopic biopsy images using clinically significant features | Histology image dataset (histology DS2828) | 125 Nucleus and cytoplasm morphologic features | Contrast limited adaptive histogram equalization | K-means segmentation algorithm | K-NN, fuzzy KNN, SVM and random forest-based classifiers | Accuracy, specificity and sensitivity of 92%, 94% and 81% |
| Chankong et al. [ | Automatic cervical cell segmentation and classification in Pap smears | Herlev dataset | Morphological features | Median filter | Patch-based fuzzy C-means and FCM | Fuzzy C-means | Accuracies of 93.78% and 99.27% for 7 and 2-class classifications |
| Talukdar et al. [ | Fuzzy clustering based image segmentation of pap smear images of cervical cancer cell using FCM algorithm | Colour image | Morphometric, densitometry, colorimetric and textural feature | Adaptive histogram equalization with Otsu’s method | Chaos theory corresponding to R, G and B value | Pixel-level classification and shape analysis | Preserves the colour of the images and data loss is minimal |
| Sreedevi et al. [ | Pap smear image-based detection of cervical cancer, | Herlev dataset | Nucleus features | Colour conversions and contrast enhancement | Iterative thresholding method | Based on the area of the nucleus | A sensitivity of 100% and specificity of 90% achieved |
| Ampazis et al. [ | Pap-smear classification using efficient second-order neural network | Herlev University Hospital | 20 morphological features | Contrast enhancement | Neural networks | LMAM and OLMAM algorithms | Classification accuracy of 98.86% was obtained |
Fig. 1The approach to achieve cervical cancer detection from pap-smear images
Some of the characteristics of the cervical cells from the training dataset (N = nucleus, C = cytoplasm)
| Cell type | Cancer class | Image | N area | C area | N/C ratio | N bright | C bright | N perimeter | C perimeter |
|---|---|---|---|---|---|---|---|---|---|
| Normal cells | Superficial squamous |
| 631 (±) (206) | 61,487 (±) (23,780) | 0.01 (±) (0.01) | 66 (±) (17) | 134 (±) (23) | 88 (±) (15) | 1034 (±) (221) |
| Intermediate squamous |
| 1315 (±) (390) | 44,961 (±) (15,345) | 0.03 (±) (0.01) | 67 (±) (19) | 131 (±) (22) | 130 (±) (19) | 894 (±) (166) | |
| Columnar epithelial |
| 1591 (±) (699) | 3290 (±) (1829) | 0.35 (±) (0.10) | 94 (±) (25) | 138 (±) (36) | 153 (±) (35) | 323 (±) (103) | |
| Abnormal cells | Mild squamous |
| 4690 (±) (1901) | 15,459 (±) (10,539) | 0.27 (±) (0.10) | 98 (±) (17) | 142 (±) (19) | 257 (±) (55) | 589 (±) (203) |
| Moderate squamous |
| 3873 (±) (1651) | 7288 (±) (5207) | 0.38 (±) (0.12) | 92 (±) (15) | 135 (±) (18) | 231 (±) (49) | 443 (±) (141) | |
| Severe squamous |
| 2949 (±) (1474) | 3415 (±) (2276) | 0.49 (±) (0.14) | 94 (±) (22) | 143 (±) (29) | 208 (±) (52) | 323 (±) (95) | |
| Carcinoma in situ |
| 2986 (±) (1474) | 2115 (±) (1490) | 0.60 (±) (0.13) | 97 (±) (18) | 142 (±) (22) | 215 (±) (48) | 28 (±) (67) |
Fig. 2Generation of the feature vector from the training images
Fig. 3Three-phase sequential elimination approach for debris rejection
Extracted features from the pap-smear images
| Nucleus | Cytoplasm | ||
|---|---|---|---|
| 1 | 16 | ||
| 2 | 17 | ||
| 3 | 18 | ||
| 4 | 19 | ||
| 5 | 20 | ||
| 6 | 21 | ||
| 7 | 22 | ||
| 8 | 23 | ||
| 9 | 24 | ||
| 10 | 25 | ||
| 11 | 26 | ||
| 12 | 27 | ||
| 13 | 28 | ||
| 14 | 29 | ||
| 15 | Nucleus entropy: The entropy of gray values of the nucleus region |
Fig. 4The fuzzy C-means is wrapped into a black box from which an estimated error is obtained
Tool evaluation criteria
| Diagnostic efficacy | Evaluation metrics |
|---|---|
| Technical efficacy | How well the tool extracts features used for classification? These included nucleus and cytoplasm areas, perimeters etc. |
| Diagnostic accuracy efficacy | Classification accuracy, sensitivity, specificity, false positive rate and false negative rate |
Fig. 5PAT graphical user interface
Nucleus and cytoplasm segmentation using the proposed method
Comparison of the extracted features from a normal superficial cell by CHAMP and PAT
| Features | CHAMP | PAT | %|Error| |
|---|---|---|---|
| Nucleus area | 562.38 µm2 | 563.64 µm2 | 0.22 |
| Cytoplasm area | 69,395.88 µm2 | 69,430.30 µm2 | 0.05 |
| Nucleus brightness | 66.00 | 66.14 | 0.21 |
| Nucleus to cytoplasm ratio | 0.00810 | 0.00811 | 0.17 |
Fig. 6Boxplot for the percentage error in the extracted features
Comparison of the extracted features from a normal superficial cell by a cytopathologist and PAT
| Superficial cell feature | Evaluation | %|Error| | |
|---|---|---|---|
| Cytopathologist | PAT | ||
| Nucleus area | 1328 µm2 | 1331.67 µm2 | 0.27 |
| Cytoplasm area | 44,991 µm2 | 45,001.85 µm2 | 0.02 |
| Nucleus brightness | 67 (light) | 67.32 | 0.41 |
| Nucleus to cytoplasm ratio | 0.02951 (Small) | 0.02959 | 0.25 |
Comparison of the extracted features from an abnormal superficial cell by a cytopathologist and PAT
| Superficial cells features | Evaluation | %|Error| | |
|---|---|---|---|
| Cytopathologist | PAT | ||
| Nucleus area | 3996 µm2 | 4006.67 µm2 | 0.26 |
| Cytoplasm area | 7188 µm2 | 7191.40 µm2 | 0.04 |
| Nucleus brightness | 97 (very dark) | 97.31 | 0.31 |
| Nucleus to cytoplasm ratio | 0.555 (very large) | 0.5571 | 0.21 |
Fig. 7Boxplot for the percentage error in the extracted features from 50 normal pap-smear slides (first boxplot) and 50 abnormal pap-smear slides (second boxplot)
Cervical cancer classification results from single cells
| Abnormal | Normal | ||
|---|---|---|---|
| False negative | 4 | True negative | 154 |
| True positive | 555 | False positive | 4 |
| Total | 559 | Total | 158 |
Fig. 8ROC curve for the classifier performance on Dataset 1
Cervical cancer classification results from single cells
| Abnormal | Normal | ||
|---|---|---|---|
| False negative | 3 | True negative | 137 |
| True positive | 153 | False positive | 4 |
| Total | 156 | Total | 141 |
Cervical cancer classification results from pap-smear cells
| Abnormal slides | Normal slides | ||
|---|---|---|---|
| False negative | 0 | True negative | 27 |
| True positive | 30 | False positive | 3 |
| Total | 30 | Total | 30 |
Comparison of the developed classifier’s performance with methods in [62–64]
| Method | Method | Sensitivity (%) | Specificity (%) | Accuracy (%) |
|---|---|---|---|---|
| Zhang et al. [ | Deep convolutional networks | 98.2 | 98.3 | 98.3 |
| Bora et al. [ | Ensemble classifier | 99.0 | 89.7 | 96.5 |
| Marinakis et al. [ | Genetic algorithm | 98.5 | 92.1 | 96.8 |
| Proposed Tool (PAT) | Enhanced Fuzzy C-means | 99.28 | 97.47 | 98.88 |