Literature DB >> 26209022

SFM: A novel sequence-based fusion method for disease genes identification and prioritization.

Abdulaziz Yousef1, Nasrollah Moghadam Charkari2.   

Abstract

The identification of disease genes from human genome is of great importance to improve diagnosis and treatment of disease. Several machine learning methods have been introduced to identify disease genes. However, these methods mostly differ in the prior knowledge used to construct the feature vector for each instance (gene), the ways of selecting negative data (non-disease genes) where there is no investigational approach to find them and the classification methods used to make the final decision. In this work, a novel Sequence-based fusion method (SFM) is proposed to identify disease genes. In this regard, unlike existing methods, instead of using a noisy and incomplete prior-knowledge, the amino acid sequence of the proteins which is universal data has been carried out to present the genes (proteins) into four different feature vectors. To select more likely negative data from candidate genes, the intersection set of four negative sets which are generated using distance approach is considered. Then, Decision Tree (C4.5) has been applied as a fusion method to combine the results of four independent state-of the-art predictors based on support vector machine (SVM) algorithm, and to make the final decision. The experimental results of the proposed method have been evaluated by some standard measures. The results indicate the precision, recall and F-measure of 82.6%, 85.6% and 84, respectively. These results confirm the efficiency and validity of the proposed method.
Copyright © 2015 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Classification; Disease gene; Fusion method; Physicochemical properties of amino acid; Protein

Mesh:

Substances:

Year:  2015        PMID: 26209022     DOI: 10.1016/j.jtbi.2015.07.010

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  3 in total

1.  GLADIATOR: a global approach for elucidating disease modules.

Authors:  Yael Silberberg; Martin Kupiec; Roded Sharan
Journal:  Genome Med       Date:  2017-05-26       Impact factor: 11.117

2.  A semantic relationship mining method among disorders, genes, and drugs from different biomedical datasets.

Authors:  Li Zhang; Jiamei Hu; Qianzhi Xu; Fang Li; Guozheng Rao; Cui Tao
Journal:  BMC Med Inform Decis Mak       Date:  2020-12-14       Impact factor: 2.796

3.  A novel one-class classification approach to accurately predict disease-gene association in acute myeloid leukemia cancer.

Authors:  Akram Vasighizaker; Alok Sharma; Abdollah Dehzangi
Journal:  PLoS One       Date:  2019-12-11       Impact factor: 3.240

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.