| Literature DB >> 34122785 |
Christian Tchito Tchapga1, Thomas Attia Mih1, Aurelle Tchagna Kouanou1,2, Theophile Fozin Fonzin2,3, Platini Kuetche Fogang4, Brice Anicet Mezatio2, Daniel Tchiotsop5.
Abstract
In modern-day medicine, medical imaging has undergone immense advancements and can capture several biomedical images from patients. In the wake of this, to assist medical specialists, these images can be used and trained in an intelligent system in order to aid the determination of the different diseases that can be identified from analyzing these images. Classification plays an important role in this regard; it enhances the grouping of these images into categories of diseases and optimizes the next step of a computer-aided diagnosis system. The concept of classification in machine learning deals with the problem of identifying to which set of categories a new population belongs. When category membership is known, the classification is done on the basis of a training set of data containing observations. The goal of this paper is to perform a survey of classification algorithms for biomedical images. The paper then describes how these algorithms can be applied to a big data architecture by using the Spark framework. This paper further proposes the classification workflow based on the observed optimal algorithms, Support Vector Machine and Deep Learning as drawn from the literature. The algorithm for the feature extraction step during the classification process is presented and can be customized in all other steps of the proposed classification workflow.Entities:
Mesh:
Year: 2021 PMID: 34122785 PMCID: PMC8191587 DOI: 10.1155/2021/9998819
Source DB: PubMed Journal: J Healthc Eng ISSN: 2040-2295 Impact factor: 2.682
Comparison of classification methods in biomedical image based on the literature [32, 46].
| Decision trees | Neural networks | Naïve bayes | KNN | SVM | Rule-learning | |
|---|---|---|---|---|---|---|
| Accuracy | ∗∗ | ∗∗∗ | ∗ | ∗∗ | ∗∗∗∗ | ∗∗ |
| Speed of classification | ∗∗∗∗ | ∗∗∗∗ | ∗∗∗∗ | ∗ | ∗∗∗∗ | ∗∗∗∗ |
| Tolerance to redundant attributes | ∗∗ | ∗∗ | ∗ | ∗∗ | ∗∗∗ | ∗∗ |
| Speed of learning | ∗∗∗ | ∗ | ∗∗∗∗ | ∗∗∗∗ | ∗ | ∗∗ |
| Tolerance to missing values | ∗∗∗ | ∗ | ∗∗∗∗ | ∗ | ∗∗ | ∗∗ |
| Tolerance to highly interdependent attributes | ∗∗ | ∗∗∗ | ∗ | ∗ | ∗∗∗ | ∗∗ |
| Dealing with discrete/binary/continues attributes | ∗∗∗∗ | ∗∗∗ (not discrete) | ∗∗∗ (not continuous) | ∗∗∗ (not directly discrete) | ∗∗ (Not discrete) | ∗∗∗ (not directly discrete) |
| Tolerance to noise | ∗∗ | ∗∗ | ∗∗∗ | ∗ | ∗∗ | ∗ |
| Dealing with a danger of overfitting | ∗∗ | ∗ | ∗∗∗ | ∗∗∗ | ∗∗ | ∗∗ |
| Attempts for incremental learning | ∗∗ | ∗∗∗ | ∗∗∗∗ | ∗∗∗∗ | ∗∗ | ∗ |
∗∗∗∗Very good. ∗∗∗Good. ∗∗Fairly Good. ∗Bad.
Figure 1Classification system workflow for training and testing processes.
Figure 2Convolutional neural network (CNN) architecture for biomedical image classification.
Figure 3Job execution Apache Spark in four clusters: one master and three slaves.
Figure 4Importing images in Spark DataFrame.
Algorithm 1Feature extraction process.
Algorithm 2Prediction process.
Some ML methods and application comparison.
| Authors | Deep learning methods | Machine learning method | Big data technologies | Applications |
|---|---|---|---|---|
| Luo et al. [ | No | No | Yes (Hadoop) | Healthcare |
| Tchagna Kouanou et al. [ | No | No | Yes (Spark and Hadoop) | Biomedical images |
| Manogaran and Lopez [ | No | Yes | Yes | Healthcare |
| Thrall et al. [ | No | Yes | No | Radiology |
| Fujiyoshi et al. [ | Yes | No | No | Image recognition |
| Tchagna Kouanou et al. [ | No | Yes (K-Means- unsupervised learning) | No | Biomedical image compression |
| Tchagna Kouanou et al. [ | No | Yes (K-Means- unsupervised learning) | Yes (Hadoop) | Biomedical image compression |
| Tchagna Kouanou et al. [ | No | Yes (K-Means- unsupervised learning) | No | Image compression |
| Alla Takam et al. [ | Yes (CNN) | No | Yes (Spark) | Biomedical image |
| Chowdhary and Acharjya [ | No | Yes (fuzzy C-means) | No | Feature extraction and segmentation |
| Bhattacharya et al. [ | Yes | No | No | Biomedical image |
| Chowdhary et al. [ | Yes | No | No | Biomedical images (breast cancer classification) |
| Wang et al. [ | Yes (CNN, hierarchical loss) | No | No | Biomedical images (breast cancer classification) |