| Literature DB >> 35909844 |
Atta-Ur Rahman1, Muhammad Umar Nasir2, Mohammed Gollapalli3, Suleiman Ali Alsaif4, Ahmad S Almadhor5, Shahid Mehmood2, Muhammad Adnan Khan6, Amir Mosavi7,8,9.
Abstract
A genetic disorder is a serious disease that affects a large number of individuals around the world. There are various types of genetic illnesses, however, we focus on mitochondrial and multifactorial genetic disorders for prediction. Genetic illness is caused by a number of factors, including a defective maternal or paternal gene, excessive abortions, a lack of blood cells, and low white blood cell count. For premature or teenage life development, early detection of genetic diseases is crucial. Although it is difficult to forecast genetic disorders ahead of time, this prediction is very critical since a person's life progress depends on it. Machine learning algorithms are used to diagnose genetic disorders with high accuracy utilizing datasets collected and constructed from a large number of patient medical reports. A lot of studies have been conducted recently employing genome sequencing for illness detection, but fewer studies have been presented using patient medical history. The accuracy of existing studies that use a patient's history is restricted. The internet of medical things (IoMT) based proposed model for genetic disease prediction in this article uses two separate machine learning algorithms: support vector machine (SVM) and K-Nearest Neighbor (KNN). Experimental results show that SVM has outperformed the KNN and existing prediction methods in terms of accuracy. SVM achieved an accuracy of 94.99% and 86.6% for training and testing, respectively.Entities:
Mesh:
Year: 2022 PMID: 35909844 PMCID: PMC9334098 DOI: 10.1155/2022/2650742
Source DB: PubMed Journal: Comput Intell Neurosci
Constraints and comparisons of previous studies.
| Study | Model | Used dataset | Accuracy (%) | Constraint | IoMT |
|---|---|---|---|---|---|
| Asif et al. [ | RF, SVM | miRNA (feature) | 79 | Handcrafted features, imbalance data | No |
| Alshamlan et al. [ | GBC algorithm | SRBCT (feature) | 81 | Handcrafted features, imbalance classes, imbalance gene sequence | No |
| KhaderKhader et al. [ | BA, SVM | Gene seq (feature) | 80.5 | Imbalance gene classes | No |
Figure 1IoMT-based proposed model for the prediction of genetic disorder.
Simulation parameters of the proposed model of KNN and SVM.
| Algorithm | Neighbors | NS method | Distance | Standardize |
|
| ||||
| KNN | 5 | Exhaustive | Minkowski | True |
|
| ||||
| SVM | Kernel function | Polynomial order | Kernel scale | Standardize |
| Polynomial | 3 | Auto | True | |
Training confusion metrics of the proposed model of KNN and SVM.
| Total instances (8595) | 1 | 2 |
|---|---|---|
| SVM | ||
| 1 | 6922 | 191 |
| 2 | 825 | 657 |
|
| ||
| KNN | ||
| 1 | 6959 | 154 |
| 2 | 277 | 1205 |
Testing confusion metrics of the proposed model of KNN and SVM.
| Total instances (3684) | 1 | 2 |
|---|---|---|
| SVM | ||
| 1 | 2931 | 169 |
| 2 | 322 | 262 |
|
| ||
| KNN | ||
| 1 | 3023 | 77 |
| 2 | 469 | 115 |
Figure 2Mean square error of support vector machine.
Performance of SVM and KNN models.
| Instances (12280) | SVM | KNN | ||
|---|---|---|---|---|
| Training (%) (8596 instances) | Testing (%) (3684 instances) | Training (%) (8596 instances) | Testing (%) (3684 instances) | |
| Accuracy | 94.99 | 86.6 | 88.3 | 85.1 |
| Miss-classification rate | 5.01 | 13.4 | 11.7 | 14.9 |
| Precision | 96.17 | 90.10 | 89.35 | 86.56 |
| Sensitivity | 97.83 | 94.54 | 97.31 | 97.51 |
| F1-score | 96.98 | 92.26 | 93.15 | 91.7 |
Comparative analysis with previous studies.
| Study | Model | Dataset | Accuracy (%) | IoMT |
|---|---|---|---|---|
| Asif et al. [ | RF, SVM | miRNA (feature) | 79 | No |
| Alshamlan et al. [ | GBC algorithm | SRBCT (feature) | 81 | No |
| KhaderKhader et al. [ | BA, SVM | Gene seq (feature) | 80.5 | No |
|
|
|
|
|
|