| Literature DB >> 31179181 |
Sivaramakrishnan Rajaraman1, Stefan Jaeger1, Sameer K Antani1.
Abstract
BACKGROUND: Malaria is a life-threatening disease caused by Plasmodium parasites that infect the red blood cells (RBCs). Manual identification and counting of parasitized cells in microscopic thick/thin-film blood examination remains the common, but burdensome method for disease diagnosis. Its diagnostic accuracy is adversely impacted by inter/intra-observer variability, particularly in large-scale screening under resource-constrained settings.Entities:
Keywords: Blood smear; Computer-aided diagnosis; Convolutional neural networks; Deep learning; Ensemble; Machine learning; Malaria; Red blood cells; Screening; Statistical analysis
Year: 2019 PMID: 31179181 PMCID: PMC6544011 DOI: 10.7717/peerj.6977
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Process flow diagram for optimizing the hyperparameters of the custom CNN model.
The model architecture is instantiated and evaluated within the parameter search boundaries. The process is repeated until an acceptable model is found.
Search ranges for the hyperparameters of the custom CNN model.
The following parameters are optimized: (A) Dropout in the convolutional layer; (B) Dropout in the dense layer; (C) Optimizer; (D) Activation function; and (E) Number of neurons in the dense layer.
| Parameters | Search range |
|---|---|
| Convolutional dropout | [0.25, 0.5] |
| Dense dropout | [0.25, 0.5] |
| Optimizer | [SGD, Adam] |
| Activation | [ReLU, eLU] |
| Dense neurons | [256, 512] |
Figure 2The custom architecture of pretrained models used in this study.
The pretrained CNNs are instantiated with their convolutional layer weights, truncated at their deepest convolutional layer, and added with a GAP and dense layer.
Figure 3Process flow diagram depicting the construction of the model averaging ensemble.
The averaging ensemble averages the prediction probabilities from the individual models.
Optimal hyperparameter values obtained with Talos optimization for the custom CNN model.
The custom model is trained and optimized with the hyperparameter values obtained through Talos optimization to categorize the cell images to their respective classes.
| Parameters | Optimal values |
|---|---|
| Convolutional dropout | 0.25 |
| Dense dropout | 0.5 |
| Optimizer | Adam |
| Activation | ReLU |
| Dense neurons | 256 |
Performance metrics of individual models and model ensemble.
The performance of the models are evaluated with metrics including accuracy, AUC, MSE, precision, F-score, and MCC.
| Model | Accuracy | AUC | MSE | Precision | F-score | MCC |
|---|---|---|---|---|---|---|
| Custom CNN | 99.09 ± 0.08 | 99.3 ± 0.6 | 00.9 ± 0.1 | 99.56 ± 0.1 | 99.08 ± 0.1 | 98.18 ± 0.1 |
| VGG-19 | ||||||
| SqueezeNet | 98.66 ± 0.1 | 98.85 ± 0.3 | 1.41 ± 0.2 | 99.44 ± 0.1 | 98.64 ± 0.1 | 97.32 ± 0.1 |
| InceptionResNet-V2 | 98.79 ± 0.1 | 99.2 ± 0.9 | 1.88 ± 0.9 | 99.56 ± 0.2 | 98.77 ± 0.1 | 97.59 ± 0.2 |
| All-Ensemble | 99.11 ± 0.1 | 98.94 ± 0.3 | 0.78 ± 0.1 | 99.67 ± 0.1 | 99.1 ± 0.1 | 98.21 ± 0.2 |
Notes.
Bold text indicates the performance measures of the best-performing model/s.
Combining different models to determine the optimum ensemble.
Several combinations of models are created and their prediction probabilities are averaged in an attempt to find the best performing ensemble toward the current task.
| Combination index | Models |
|---|---|
| A | [Custom CNN, VGG-19] |
| B | [Custom CNN, SqueezeNet] |
| C | [Custom CNN, InceptionResNet-V2] |
| D | [VGG-19, SqueezeNet] |
| E | [VGG-19,InceptionResNet-V2] |
| F | [SqueezeNet, InceptionResNet-V2] |
| G | [Custom CNN,VGG-19, SqueezeNet] |
| H | [Custom CNN, VGG-19, InceptionResNet-V2] |
| I | [VGG-19, SqueezeNet,InceptionResNet-V2] |
Performance metrics achieved with different combinations of model ensembles.
The performance of the different combination of model ensembles is evaluated with metrics including accuracy, AUC, MSE, precision, F-score, and MCC.
| Combination index | Accuracy | AUC | MSE | Precision | F-score | MCC |
|---|---|---|---|---|---|---|
| A | 99.34 ± 0.1 | 99.07 ± 0.5 | 0.71 ± 0.1 | 99.76 ± 0.1 | 99.32 ± 0.1 | 98.65 ± 0.2 |
| B | 98.98 ± 0.1 | 99.76 ± 0.1 | 1.07 ± 0.1 | 99.43 ± 0.1 | 98.96 ± 0.1 | 97.95 ± 0.2 |
| C | 98.72 ± 0.8 | 98.64 ± 1.1 | 1.88 ± 0.6 | 99.56 ± 0.1 | 99.07 ± 0.1 | 98.15 ± 0.2 |
| D | ||||||
| E | 99.16 ± 0.1 | 99.18 ± 0.2 | 0.83 ± 0.1 | 99.73 ± 0.1 | 99.15 ± 0.1 | 98.31 ± 0.2 |
| F | 98.73 ± 0.1 | 99.2 ± 0.6 | 1.65 ± 0.4 | 99.63 ± 0.2 | 99.08 ± 0.1 | 98.18 ± 0.2 |
| G | 99.21 ± 0.1 | 98.98 ± 0.2 | 0.81 ± 0.1 | 99.64 ± 0.1 | 99.2 ± 0.1 | 98.42 ± 0.1 |
| H | 99.22 ± 0.1 | 99.89 ± 0.1 | 0.82 ± 0.1 | 99.75 ± 0.1 | 99.21 ± 0.1 | 98.44 ± 0.2 |
| I | 99.13 ± 0.1 | 99.67 ± 0.1 | 0.99 ± 0.1 | 99.75 ± 0.1 | 99.12 ± 0.1 | 98.26 ± 0.2 |
Notes.
Bold text indicates the performance measures of the best-performing model/s.
Consolidated results of Shapiro–Wilk, Levene, one-way ANOVA and Tukey post-hoc analyses.
The value p > 0.05 (95% CI) for Shapiro-Wilk and Levene’s test signified that the assumptions of data normality and homogeneity of variances are not violated. Hence, one-way ANOVA analysis is performed to explore the presence/absence of a statistically significant difference in the mean values of the performance metrics for the models.
| Accuracy | 0.342 | 0.316 | 37.151 | (M1, M2, M3) | |
| AUC | 0.416 | 0.438 | 8.321 | (M2, M3) | |
| MSE | 0.862 | 0.851 | 11.288 | (M1, M2) & (M2, M3) | |
| Precision | 0.52 | 0.294 | 5.841 | (M2, M3) | |
| F-score | 0.599 | 0.73 | 34.799 | (M1, M2) & (M1, M3) | |
| MCC | 0.63 | 0.697 | 35.062 | (M1, M2, M3) | |
Figure 4Snapshot of the web application interface.
The web application is placed into the static web directory and the web server is initiated to browse through the Malaria Cell Analyzer App. The user submits a cell image and the model embedded into the browser gives the predictions.
Comparison of the results obtained with the proposed ensemble and the state-of-the-art literature.
The ensemble model constructed with VGG-19 and SqueezeNet outperformed the other models and the state-of-the-art toward classifying the parasitized and uninfected cells to aid in improved disease screening.
| Method | Accuracy | AUC | MSE | Precision | F-score | MCC |
|---|---|---|---|---|---|---|
| Proposed Ensemble (patient level) | ||||||
| 95.9 | 99.1 | – | – | 95.9 | 91.7 | |
| 97.7 | – | – | – | – | 73.1 | |
| 96.3 | – | – | – | – | – | |
| 98.1 | – | – | – | – | – | |
| 97.3 | – | – | – | – | – | |
| 84.0 | – | – | – | – | – | |
| 73.0 | – | – | – | – | – |
Notes.
Bold text indicates the performance measures of the best-performing model/s.