| Literature DB >> 29682411 |
Sivaramakrishnan Rajaraman1, Sameer K Antani1, Mahdieh Poostchi1, Kamolrat Silamut2, Md A Hossain3, Richard J Maude2,4,5, Stefan Jaeger1, George R Thoma1.
Abstract
Malaria is a blood disease caused by the Plasmodium parasites transmitted through the bite of female Anopheles mosquito. Microscopists commonly examine thick and thin blood smears to diagnose disease and compute parasitemia. However, their accuracy depends on smear quality and expertise in classifying and counting parasitized and uninfected cells. Such an examination could be arduous for large-scale diagnoses resulting in poor quality. State-of-the-art image-analysis based computer-aided diagnosis (CADx) methods using machine learning (ML) techniques, applied to microscopic images of the smears using hand-engineered features demand expertise in analyzing morphological, textural, and positional variations of the region of interest (ROI). In contrast, Convolutional Neural Networks (CNN), a class of deep learning (DL) models promise highly scalable and superior results with end-to-end feature extraction and classification. Automated malaria screening using DL techniques could, therefore, serve as an effective diagnostic aid. In this study, we evaluate the performance of pre-trained CNN based DL models as feature extractors toward classifying parasitized and uninfected cells to aid in improved disease screening. We experimentally determine the optimal model layers for feature extraction from the underlying data. Statistical validation of the results demonstrates the use of pre-trained CNNs as a promising tool for feature extraction for this purpose.Entities:
Keywords: Blood smear; Computer-aided diagnosis; Convolutional Neural Networks; Deep Learning; Feature extraction; Machine Learning; Malaria; Pre-trained models; Screening
Year: 2018 PMID: 29682411 PMCID: PMC5907772 DOI: 10.7717/peerj.4568
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Data for cross-validation studies.
| Folds | Parasitized | Uninfected |
|---|---|---|
| 1 | 2,756 | 2,757 |
| 2 | 2,758 | 2,758 |
| 3 | 2,776 | 2,762 |
| 4 | 2,832 | 2,760 |
| 5 | 2,657 | 2,742 |
Figure 1Architecture of the customized model.
Figure 2RBC detection and segmentation using level sets.
(A) Input image. (B) Initial cell detection using LoG. (C) Final RBC segmentation mask. (D) Segmentation results superimposed on the original image.
Performance metrics.
| Models | Accuracy | AUC | Sensitivity | Specificity | F1-score | MCC |
|---|---|---|---|---|---|---|
| AlexNet | 0.937 ± 0.012 | 0.981 ± 0.007 | 0.940 ± 0.017 | 0.933 ± 0.034 | 0.937 ± 0.011 | 0.872 ± 0.024 |
| VGG-16 | 0.945 ± 0.015 | 0.981 ± 0.007 | 0.939 ± 0.022 | 0.951 ± 0.019 | 0.945 ± 0.016 | 0.887 ± 0.030 |
| ResNet-50 | ||||||
| Xception | 0.890 ± 0.107 | 0.948 ± 0.062 | 0.931 ± 0.039 | 0.835 ± 0.218 | 0.895 ± 0.100 | 0.772 ± 0.233 |
| DenseNet-121 | 0.931 ± 0.018 | 0.976 ± 0.023 | 0.942 ± 0.023 | 0.926 ± 0.032 | 0.931 ± 0.017 | 0.894 ± 0.036 |
| Customized | 0.940 ± 0.010 | 0.979 ± 0.009 | 0.931 ± 0.026 | 0.951 ± 0.030 | 0.941 ± 0.010 | 0.880 ± 0.020 |
Notes.
Bold text indicate the performance measures of the best-performing model/s.
Candidate layers giving the best performance.
| Model | Optimal layer |
|---|---|
| AlexNet | fc6 |
| VGG-16 | block5_conv2 |
| ResNet-50 | res5c_branch2c |
| Xception | block14_sepconv1 |
| DenseNet-121 | Conv5_16_x2 |
Performance metrics achieved with feature extraction from optimal layers.
| Models | Accuracy | AUC | Sensitivity | Specificity | F1-score | MCC |
|---|---|---|---|---|---|---|
| AlexNet | 0.944 ± 0.010 | 0.983 ± 0.006 | 0.947 ± 0.016 | 0.941 ± 0.025 | 0.944 ± 0.010 | 0.886 ± 0.020 |
| VGG-16 | 0.949 ± 0.020 | 0.969 ± 0.016 | 0.916 ± 0.017 | |||
| ResNet-50 | 0.947 ± 0.015 | |||||
| Xception | 0.915 ± 0.005 | 0.965 ± 0.019 | 0.925 ± 0.039 | 0.907 ± 0.120 | 0.918 ± 0.042 | 0.836 ± 0.088 |
| DenseNet-121 | 0.952 ± 0.022 | 0.944 ± 0.048 | 0.953 ± 0.020 | 0.902 ± 0.041 | ||
| Customized | 0.927 ± 0.026 | 0.978 ± 0.012 | 0.905 ± 0.074 | 0.951 ± 0.031 | 0.928 ± 0.041 | 0.884 ± 0.002 |
Notes.
Bold text indicate the performance measures of the best-performing model/s.
Consolidated results of Kruskal–Wallis H and post-hoc tests.
| Metric | Kruskal-Wallis H summary | Mean ranks | Post-hoc | |
|---|---|---|---|---|
| Accuracy | AlexNet | 11.20 | Xception & ResNet-50 ( | |
| VGG-16 | 22.30 | |||
| Xception | 7.20 | |||
| DenseNet-121 | 19.60 | |||
| Customized | 9.70 | |||
| AUC | AlexNet | 13.00 | Xception & ResNet-50 ( | |
| VGG-16 | 21.70 | |||
| ResNet-50 | 21.50 | |||
| Xception | 4.50 | |||
| Customized | 9.40 | |||
| Sensitivity | AlexNet | 16.20 | – | |
| VGG-16 | 17.30 | |||
| ResNet-50 | 15.80 | |||
| Xception | 11.40 | |||
| Customized | 10.50 | |||
| Specificity | AlexNet | 9.80 | – | |
| VGG-16 | 20.70 | |||
| Xception | 13.30 | |||
| DenseNet-121 | 14.10 | |||
| Customized | 13.80 | |||
| F1-score | AlexNet | 11.70 | Xception & ResNet-50 ( | |
| VGG-16 | 22.20 | |||
| Xception | 6.90 | |||
| DenseNet-121 | 19.50 | |||
| Customized | 10.10 | |||
| MCC | AlexNet | 11.30 | Xception & ResNet-50 ( | |
| VGG-16 | 22.30 | |||
| Xception | 7.60 | |||
| DenseNet-121 | 19.40 | |||
| Customized | 9.80 | |||
Notes.
Bold text indicate the performance measures of the best-performing model/s.
Comparison with the state-of-the-art literature.
| Method | Accuracy | Sensitivity | Specificity | AUC | F1-score | MCC |
|---|---|---|---|---|---|---|
| Proposed model (cell level ) | ||||||
| Proposed model (patient level ) | 0.959 | 0.947 | 0.972 | 0.991 | 0.959 | 0.917 |
| 0.977 | 0.971 | 0.985 | – | – | 0.731 | |
| 0.963 | 0.976 | 0.959 | – | |||
| 0.981 | – | – | – | |||
| 0.973 | 0.969 | 0.977 | – | |||
| 0.840 | 0.689 | – | ||||
| 0.730 | 0.850 | – | – |
Notes.
Bold text indicate the performance measures of the best-performing model/s.