Literature DB >> 26357091

Predicting Protein Function Using Multiple Kernels.

Guoxian Yu, Huzefa Rangwala, Carlotta Domeniconi, Guoji Zhang, Zili Zhang.   

Abstract

High-throughput experimental techniques provide a wide variety of heterogeneous proteomic data sources. To exploit the information spread across multiple sources for protein function prediction, these data sources are transformed into kernels and then integrated into a composite kernel. Several methods first optimize the weights on these kernels to produce a composite kernel, and then train a classifier on the composite kernel. As such, these approaches result in an optimal composite kernel, but not necessarily in an optimal classifier. On the other hand, some approaches optimize the loss of binary classifiers and learn weights for the different kernels iteratively. For multi-class or multi-label data, these methods have to solve the problem of optimizing weights on these kernels for each of the labels, which are computationally expensive and ignore the correlation among labels. In this paper, we propose a method called Predicting Protein Function using Multiple Kernels (ProMK). ProMK iteratively optimizes the phases of learning optimal weights and reduces the empirical loss of multi-label classifier for each of the labels simultaneously. ProMK can integrate kernels selectively and downgrade the weights on noisy kernels. We investigate the performance of ProMK on several publicly available protein function prediction benchmarks and synthetic datasets. We show that the proposed approach performs better than previously proposed protein function prediction approaches that integrate multiple data sources and multi-label multiple kernel learning methods. The codes of our proposed method are available at https://sites.google.com/site/guoxian85/promk.

Mesh:

Substances:

Year:  2015        PMID: 26357091     DOI: 10.1109/TCBB.2014.2351821

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  6 in total

1.  A Multi-Label Supervised Topic Model Conditioned on Arbitrary Features for Gene Function Prediction.

Authors:  Lin Liu; Lin Tang; Xin Jin; Wei Zhou
Journal:  Genes (Basel)       Date:  2019-01-17       Impact factor: 4.096

2.  Interspecies gene function prediction using semantic similarity.

Authors:  Guoxian Yu; Wei Luo; Guangyuan Fu; Jun Wang
Journal:  BMC Syst Biol       Date:  2016-12-23

3.  A multi-network integration approach for measuring disease similarity based on ncRNA regulation and heterogeneous information.

Authors:  Ningyi Zhang; Tianyi Zang
Journal:  BMC Bioinformatics       Date:  2022-03-07       Impact factor: 3.169

4.  Predicting antifreeze proteins with weighted generalized dipeptide composition and multi-regression feature selection ensemble.

Authors:  Shunfang Wang; Lin Deng; Xinnan Xia; Zicheng Cao; Yu Fei
Journal:  BMC Bioinformatics       Date:  2021-06-23       Impact factor: 3.169

5.  An efficient method for protein function annotation based on multilayer protein networks.

Authors:  Bihai Zhao; Sai Hu; Xueyong Li; Fan Zhang; Qinglong Tian; Wenyin Ni
Journal:  Hum Genomics       Date:  2016-09-27       Impact factor: 4.639

6.  deepNF: deep network fusion for protein function prediction.

Authors:  Vladimir Gligorijevic; Meet Barot; Richard Bonneau
Journal:  Bioinformatics       Date:  2018-11-15       Impact factor: 6.937

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.