| Literature DB >> 31359913 |
Parikshit Sanyal1, Sanghita Barui1, Prabal Deb2, Harish Chander Sharma3.
Abstract
CONTEXT: Cervical cancer is the second most common cancer in women. The liquid based cervical cytology (LBCC) is a useful tool of choice for screening cervical cancer. AIMS: To train a convolutional neural network (CNN) to identify abnormal foci from LBCC smears. SETTINGS ANDEntities:
Keywords: Artificial intelligence; cervical cytology; liquid based smears; neural network; screening
Year: 2019 PMID: 31359913 PMCID: PMC6592125 DOI: 10.4103/JOC.JOC_201_18
Source DB: PubMed Journal: J Cytol ISSN: 0970-9371 Impact factor: 1.000
Distribution of LBCC images by diagnostic category as per Bethesda System 2014 (n=551)
| Category | “Normal” | “Abnormal” | ||
|---|---|---|---|---|
| Normal cellular elements only | Shift in flora suggestive of bacterial vaginosis | HSIL | LSIL | |
| Number of cases (smears) | 22 | 06 | 04 | 04 |
| Number of microphotographs | 392 | 70 | 58 | 31 |
Splitting the image data in training, testing and evaluation categories (n=2816)
| Abnormal | Normal | Total | |
|---|---|---|---|
| Training | 410 | 410 | 820 |
| Testing | 200 | 200 | 400 |
| Evaluation | 206 | 1390 | 1596 |
| Total | 816 | 2000 | 2816 |
Figure 1Architecture of the Convolutional Neural Network
Figure 2Training of the CNN (Model A)
Figure 3Accuracy and loss function of two models plotted against epochs of training
Results on concurrent testing dataset (n=400)
| Original | ||||
|---|---|---|---|---|
| Model A | Model B | |||
| Normal | Abnormal | Normal | Abnormal | |
| Labeled by CNN | ||||
| Normal | 187 | 18 | 186 | 7 |
| Abnormal | 13 | 182 | 14 | 193 |
| Total | 200 | 200 | 200 | 200 |
Figure 4Accuracy and loss function of two models plotted against epochs of training
Figure 5The CNN predicting labels for the Evaluation set (Model A)
Results on evaluation dataset (n=1596) by Model A
| True label | Total | ||
|---|---|---|---|
| Normal | Abnormal | ||
| Label by CNN | |||
| Normal | 1221 True negative (TN) | 31 False negative (FN) | 1252 |
| Abnormal | 169 False positive (FP) | 175 True positive (TP) | 344 |
| Total | 1390 | 206 | 1596 |
| Sensitivity | TP/(TP + FN) | 175/206 | 84.95% |
| Specificity | TN/(TN + FP) | 1221/1390 | 87.84% |
| Positive predictive value | TP/(TP + FP) | 175/344 | 50% |
| Negative predictive value | TN/(TN + FN) | 1221/1252 | 97.54% |
Results on evaluation dataset (n=1596) by Model B
| True label | Total | ||
|---|---|---|---|
| Normal | Abnormal | ||
| Label by CNN | |||
| Normal | 1123 True negative (TN) | 10 False negative (FN) | 1133 |
| Abnormal | 267 False positive (FP) | 196 True positive (TP) | 463 |
| Total | 1390 | 206 | 1596 |
| Sensitivity | TP/(TP + FN) | 196/206 | 95.10% |
| Specificity | TN/(TN + FP) | 1123/1390 | 80.80% |
| Positive predictive value | TP/(TP + FP) | 196/463 | 42.30% |
| Negative predictive value | TN/(TN + FN) | 1123/1133 | 99.10% |
Results on evaluation dataset by combinatorial analysis of two models
| True label | Total | ||
|---|---|---|---|
| Normal | Abnormal | ||
| Combined label by two models | |||
| Normal | 1110 True negative (TN) | 9 False negative (FN) | 1119 |
| Abnormal | 280 False positive (FP) | 197 True positive (TP) | 477 |
| Total | 1390 | 206 | 1596 |
| Sensitivity | TP/(TP + FN) | 197/206 | 95.63% |
| Specificity | TN/(TN + FP) | 1110/1390 | 79.85% |
| Positive predictive value | TP/(TP + FP) | 196/477 | 41.29% |
| Negative predictive value | TN/(TN + FN) | 1123/1133 | 99.19% |
Figure 6Intermediate layers of the network, producing a final label
Figure 7Foci containing neutrophils falsely labeled as “abnormal” by the CNN