| Literature DB >> 31618951 |
Teresa Conceição1, Cristiana Braga2, Luís Rosado3, Maria João M Vasconcelos4.
Abstract
Cervical cancer is the one of the most common cancers in women worldwide, affecting around 570,000 new patients each year. Although there have been great improvements over the years, current screening procedures can still suffer from long and tedious workflows and ambiguities. The increasing interest in the development of computer-aided solutions for cervical cancer screening is to aid with these common practical difficulties, which are especially frequent in the low-income countries where most deaths caused by cervical cancer occur. In this review, an overview of the disease and its current screening procedures is firstly introduced. Furthermore, an in-depth analysis of the most relevant computational methods available on the literature for cervical cells analysis is presented. Particularly, this work focuses on topics related to automated quality assessment, segmentation and classification, including an extensive literature review and respective critical discussion. Since the major goal of this timely review is to support the development of new automated tools that can facilitate cervical screening procedures, this work also provides some considerations regarding the next generation of computer-aided diagnosis systems and future research directions.Entities:
Keywords: cervical cancer; classification; computer-aided diagnosis; machine learning; pap-smear; screening; segmentation
Mesh:
Year: 2019 PMID: 31618951 PMCID: PMC6834130 DOI: 10.3390/ijms20205114
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Summary information about the main classification systems for cervical cancer.
| Classification System | Author | Grading Criteria | Reporting Purpose | Clinical Purpose |
|---|---|---|---|---|
| The Bethesda System (TBS) [ | United States National Cancer Institute (NCI) | For cervical cytological report (results of microscopic examination of a smear) | Depending on the cells’ extent of abnormality | Screening (test for detecting early changes of the cells of the cervix) |
| Cervical Intraepithelial Neoplasia (CIN) [ | Richart R.M. | For histological report (results of microscopic examination of tissue samples) | According to the thickness of the abnormal epithelium | Diagnosis (medical test to aid in the diagnosis or detection of cervical cancer) |
| TNM [ | Union for International Cancer Control (UICC) | To document prognostic factors: tumour’s size (T), affected lymph nodes (N) and distant metastases (M) | Based either on clinical description or pathological classification | Staging and tumour risk assessment |
| FIGO [ | International Federation of Gynaecology and Obstetrics (FIGO) | To determine the extent of the cervical invasion | Based on clinical examination | Staging and tumour risk assessment |
Figure 1Satisfactory (a) and unsatisfactory (b) LBC preparations. From: Nayar, R.; Wilbur, D. The Bethesda System for Reporting Cervical Cytology: Definitions, Criteria, and Explanatory Notes, 3rd ed.; Springer International Publishing, 2015 [26] and reproduced with permission of Springer.
Figure 2Atypical squamous cells on liquid-based cytology (LBC). From: Nayar, R.; Wilbur, D. The Bethesda System for Reporting Cervical Cytology: Definitions, Criteria, and Explanatory Notes, 3rd ed.; Springer International Publishing, 2015 [26] and reproduced with permission of Springer.
Figure 3Atypical glandular cells on LBC. From: Nayar, R.; Wilbur, D. The Bethesda System for Reporting Cervical Cytology: Definitions, Criteria, and Explanatory Notes, 3rd ed.; Springer International Publishing, 2015 [26] and reproduced with permission of Springer.
Public datasets summary. Seg. (Segmentation); Class. (Classification).
| Dataset | Year | Type | No Images | Purpose | Description |
|---|---|---|---|---|---|
| Herlev [ | 2005 | Image | 917 | Seg. Class. | Single-cell images with segmentation ground-truth. Classification divided in seven classes ( |
| ISBI14 [ | 2014 | Image | 16 EDF + 945 Synthetic | Seg. | Extended depth field (EDF) [ |
| ISBI15 [ | 2015 | Image | 17 EDF (each with 20 FOVs) | Seg. | EDF images containing cells with different overlapping degrees and respective fields of view (FOVs). Nuclei and cytoplasm segmentation ( |
| CERVIX93 [ | 2018 | Image | 93 EDF (each with 20 FOVs) | Seg. Class. | Similiar to ISBI15 images. Classification divided in seven classes ( |
| Risk-Factors [ | 2017 | Text | - | Class. | Patient’s information and medical history. Target variables: required diagnosis tests (Hinselmann, Schiller, Cytology and Biopsy). It can be used to infer the patient’s likelihood of having cervical cancer. |
Figure 4Sample images and corresponding classification of Herlev and CERVIX93 datasets.
Figure 5Sample images and corresponding segmentation masks of ISBI14 and ISBI15 datasets.
Summary table with highlighted works on cervical cells segmentation. When more than one dataset was used, performance is given only on the public datasets for comparison purposes. Extension works presented by the same author/group are in the same row, with the performance being given for the best case. Acc (Accuracy), Prec (Precision), Rec (Recall), Sp (Specificity), Nuc (Nuclei), Cyt (Cytoplasm), DSC (Dice similarity coefficient) (same as ZSI-Zijdenbos similarity index).
| Paper/Authors | Segmentation Technique | Cells Overlap | Datasets | Performance |
|---|---|---|---|---|
|
| ||||
| Plissiti et al. (2011, 2011) [ | Watershed computation + Refinement based on shape prior. Artifact removal by distance-dependent rule and pixel classification (Fuzzy C-means (FCM), support vector machine (SVM)). | No | Private | FCM: Rec: 90.6% Sp: 75.3%. SVM: Rec: 69.9% Sp: 92.0% |
| Gençtav et al. 2012 [ | Multi-scale watershed + Hierarchical unsupervised segmentation tree + Final binary classifier within cell regions | Yes (clumps and nuclei only) | Herlev, Private | (Herlev): Acc: 97%; Prec: 88%. Rec: 93%; DSC: 0.89 |
| Tareef et al. 2018 [ | Multi-pass watershed + Ellipse fitting | Yes | ISBI 2014, ISBI 2015 | (ISBI 2014): Nuc DSC: 0.925; Rec: 95.0%; Prec: 90.6%. (ISBI 2015): Cyt DSC: 0.851 |
|
| ||||
| Bamford et al. 1998 [ | Viterbi search-based dual active contour | No | Private | Acc: 99.6% |
| Li et al. 2012 [ | K-means clustering + Edge computation map by Radiating GVF | No | Herlev | DSC: 0.954 |
| Plissiti et al. 2012 [ | Snake driven by adaptative physical model | Overl. Nuclei | Private | Hausdorf distance: 19.91 |
|
| ||||
| Lu et al. (2015, 2013) [ | Unsupervised Gaussian mixture models (GMM) + Maximally stable extremal regions (MSER) + Level set with elliptical shape | Yes | ISBI 2014 | Nuc Prec:94.2%; Rec:91.2%; DSC:0.921. Cyt DSC: 0.88 |
| Nosrati and Hamarneh 2015 [ | Random forest (RF) classifier + Level Set with elliptical, 2014, and/or star shape prior, 2015, and Voronoi energy based, 2015 | Yes | ISBI 2014 | Nuc Prec: 90.1%; Rec:91.6%; DSC:0.900. Cyt DSC: 0.871 |
|
| ||||
| Ushizima et al. 2015, 3 pages [ | Graph-based region growing + Voronoi Diagram | Yes | ISBI 2014, ISBI 2015 | (ISBI 2014): Nuc Rec: 87.1%; Prec: 96.8%; DSC: 0.914. Cyt DSC:0.872. (ISBI 2015): Cyt DSC: 0.875 |
| Zhang et al. (2014, 2014) [ | Multi-way graph cut globally on the a* channel for background/cell segmentation + Local adaptative graph-cut (LAGC) for nucleus delineation. | Only touching nuclei | Private | Nuc Prec: 85%; Rec: 90%; Cyt Acc: 93%; DSC: 0.93 |
| Phoulady et al. (2015, 2016, 2017) [ | Iterative thresholding + GMM Expectation-Maximization (EM) + Grid approach with distance metric from multi-focal images | Yes | ISBI 2014, ISBI 2015 | (ISBI 2014): Nuc Prec: 96.1%; Rec: 93.3%. Cyt DSC: 0.901. (ISBI 2015): Cyt DSC: 0.869 |
|
| ||||
| Tareef et al. 2014 [ | Linear kernel SVM classifier on superpixels followed by edge enchancement and adaptative thresholding techniques | Yes | ISBI 2014 | Nuc Prec: 94.3%; Rec: 92.0%; DSC: 0.926. Cyt: DSC 0.914 |
| Zhao et al. 2016 [ | Markov random field (MRF) classifier with a Gap-search algorithm + Automatic labeling map | No | Herlev, Private | (Herlev) Nuc DSC: 0.93. Cyt DSC: 0.82 |
| Tareef et al. 2017 [ | SVM classification + Shape based-guided Level Set based on Sparse Coding for overlapping cytoplasm | Yes | ISBI 2014 | Nuc Prec: 95%; Rec: 93%; DSC: 0.93. Cyt DSC: 0.89 |
|
| ||||
| Song et al. (2014, 2017) [ | Multi-scale CNN feature extraction with spatial pyramids + neural network (NN). Refinement: Graph partitioning + Unsupervised Clustering (2015). Dynamic multi-template shape model (2017). | Only touching nuclei (2015). Yes (2017) | Private, ISBI 2015 | (ISBI 2015): Nuc DSC: 0.93. Cyt DSC: 0.91 |
| Gautam et al. (2018, 2018) [ | CNN with selective pre-processing based on nucleus size and chromatin pattern + post-processing morphological filtering. | No | Herlev | Prec: 89%; Rec: 91%; DSC:0.90 |
| Tareef et al. 2017 [ | CNN patch-based for cellular components classification. Cytoplasm estimation by Voronoi Diagram + Level Set with Shape prior | Yes | ISBI 2014 | Nuc Prec: 94%; Rec:95%; DSC:0.94.Cyt DSC:0.897 |
Summary table with highlighted works on cervical cell classification. When more than one dataset was used, performance is given only on the public datasets for comparison purposes. Extension works presented by the same author/group are in the same line. In this case, performance is given for the best case, which is the most recent work. Acc (Accuracy), Prec (Precision), Rec (Recall), Sp (Specificity), H-mean (Harmonic mean of Sensitivity and Specificity), CCR (Correct Classification Rate), Rs (Spearman rank-order correlation coefficient), k (Cohen’s kappa coefficient), kw (weighted kappa coefficient), RMSE (Root Mean Square Error), OE (Overall Error).
| Paper/Authors | Classification Technique | Datasets | Classes | Performance |
|---|---|---|---|---|
|
| ||||
| Chen et al. 2014 [ | SVM and Fisher linear discriminant classifiers with feature selection filter and wrapper methods. Best: SVM with Recursive Feature Addition (RFA) | Private | 2 | Acc 98.8%; Rec 91.4%; Sp 99.9%; |
| Mariarputham et al. 2014 [ | NN and SVM with different kernels + Feature set (FS). Best: Linear Kernel SVM | Herlev | 2, 7 class | Acc: Norm. 96.91%; Interm. 93.89%; Col. 92.35%; Mild 92.33%; Mod. 96.62%; Sev. 92.10%; CIS. 91.72% |
| Zhao et al. 2016 [ | Block image partitioning and segmentation. Feature extraction on non-background blocks followed by classification through a radial basis function-SVM. | Private | 2-class | Acc 98.98%; Rec 95.0%; Sp 99.33% |
|
| ||||
| Mat-Isa et al. 2008 [ | Cascade Hybrid Multilayer Perceptron (H | Private | 3 class | Acc 97.50%; Rec 96.67%; Sp 100% |
| Chankong et al. 2014 [ | Extensive comparison of five classifiers and FS. Best: three layer Backpropagation ANN with nine features | Herlev, Private (ERUDIT, LCH) | 2, 4, 7 class | (Herlev) 2-class: Acc 99.27%; Rec 99.85%; Sp 96.53%. 7-class: Acc 93.78%; Rec 98.96%; Sp 96.69%; |
| Zhang et al. 2014 [ | Artifact classifier + four Iterative Abnormality MLP classifiers | HELBC (Private) | 2 class | CCR 94.3%; Rec 88.1%; Sp 100% |
|
| ||||
| Marinakis et al. (2006, 2008, 2009) [ | K-NN with FS: Tabu Search (2006), Particle Swarm (2008) and Genetic Algorithm (2009) | Herlev, Private | 2, 7 class | (Herlev) 2-class: RMSE 0.1796; OE 3.164%. 7-class: RMSE 0.895; OE 4.253% |
| Gençtav et al. 2012 [ | Hierarchical clustering tree + optimal leaf ordering that maximizes similarly of adjacent leaves and ranks cells’ abnormality. | Herlev, Hacettepe (Private) | 7 class | (Herlev) Rs 0.848; k 0.848; kw 0.848 |
| Plissiti et al. 2012 [ | Fuzzy C-means and Spectral Clustering based on nuclei features only | Herlev | 2, 7 class | FCM H-mean: 90.58%; SClust H-mean: 88.77% |
|
| ||||
| Bora et al. 2017 [ | Ensemble of LSSVM, MLP and RF weighted by majority voting. Single cell and smear level classification | Herlev, Private | 2, 3 class | (Herlev) 2-class: Acc 96.51%; Rec 98.96%; Sp 89.67%. 3-class: Acc 91.71%; Rec 89.41%; Sp 94.84%; |
| Gómez et al. 2017 [ | Comparison of several algorithms. Best: Bagging + MultilayerPerceptron and AdaBoostM1 + LMT | Herlev | 2-class | Acc 95.74% |
|
| ||||
| Zhang et al. 2017 [ | Nuclei centered patched-based CNN through Transfer Learning | Herlev, HEMLBC (Private) | 2-class: | Acc 98.3%; Rec 98.2%; Sp 98.3%; H-mean 98.3%; |
| Jith et al. 2018 [ | CNN based on fine tuned AlexNet | Herlev, Aindra (Private) | 2-class | Acc 99.6% |
| Gautam et al. 2018 [ | Two patch-based CNNs with selective pre-processing + pre-trained AlexNet classification or Hierarchical Decision Tree with CNN on each leaf | Herlev, Aindra (Private) | 2, 7-class | 2-class Acc: 99.3%. 7-class Acc: 93.75% |
| Lin et al. 2019 [ | Concatenate nucleus centered RGB images patches with cytoplasm and nucleus masks as a five-channel input to several pre-trained CNN | Herlev | 2,7-class | 2-class: Acc 94.5%; Rec 97.4%; Sp 90.4%. 7-class: Acc 64.5% |
Summary of commonly used image features for cervical cell classification. Some of the features represent more high-level concepts, for its measures and respective extraction we refer to some of its implementations [32,54,61,73,74,140,142,143]. N/C (nucleus/cytoplasm; GLCM (grey-level co-occurrence matrix); SDNRL (standard deviation of the normalized radial length). LBP (local binary pattern). * These characteristics are extracted for both nucleus and cytoplasm.
| Shape | Chromatin | Texture | Other |
|---|---|---|---|
| Area * | Brightness * | Multi-nucleus cells | Fourier descriptor |
| Roundness * | Mean Grey Level | GLCM measures | Nucleus distribution |
| Longest Diameter * | Intensity Disparity | Optical Density | Nucleus Position |
| Eccentricity | Minima * | Uniformity | Graph-based (contextual) |
| Major Axis length | Maxima * | Entropy | |
| Minor Axis Length | Average Color | Smoothness | |
| Perimeter * | Boundary intensity | Neighborhood Intensity Disparity | |
| Elongation * | Smoothness | LBP mean value | |
| Convexity | Variance | Coarseness | |
| SDNRL | |||
| N/C ratio |