| Literature DB >> 31825992 |
Akram Vasighizaker1, Alok Sharma2,3,4,5,6, Abdollah Dehzangi7.
Abstract
Disease causing gene identification is considered as an important step towards drug design and drug discovery. In disease gene identification and classification, the main aim is to identify disease genes while identifying non-disease genes are of less or no significant. Hence, this task can be defined as a one-class classification problem. Existing machine learning methods typically take into consideration known disease genes as positive training set and unknown genes as negative samples to build a binary-class classification model. Here we propose a new One-class Classification Support Vector Machines (OCSVM) method to precisely classify candidate disease genes. Our aim is to build a model that concentrate its focus on detecting known disease-causing gene to increase sensitivity and precision. We investigate the impact of our proposed model using a benchmark consisting of the gene expression dataset for Acute Myeloid Leukemia (AML) cancer. Compared with the traditional methods, our experimental result shows the superiority of our proposed method in terms of precision, recall, and F-measure to detect disease causing genes for AML. OCSVM codes and our extracted AML benchmark are publicly available at: https://github.com/imandehzangi/OCSVM.Entities:
Mesh:
Year: 2019 PMID: 31825992 PMCID: PMC6905554 DOI: 10.1371/journal.pone.0226115
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1The general architecture of one-class classification of SVM, OCSVM (Left) and SVDD (Right).
Fig 2The results of OCSVM Method with linear and RBF kernel.
The results of OCSVM with linear and RBF kernel.
| Kernel | Precision | Recall | F-measure |
|---|---|---|---|
| 95.70 | 95.70 | 95.70 | |
The comparison among methods (precision, recall, and F-measure).
| Method | Precision | Recall | F-measure |
|---|---|---|---|
| 82.21 | 82.54 | 82.36 | |
| 93.70 | 82.33 | 87.64 | |
| 90.66 | 88.57 | 89.60 | |
| 92.92 | 88.41 | 90.61 | |
Fig 3The comparison between OCSVM and other methods for precision.
Fig 5The comparison between OCSVM and other methods for F-Measure.