Literature DB >> 28247893

Protein subcellular localization prediction using multiple kernel learning based support vector machine.

Md Al Mehedi Hasan1, Shamim Ahmad1, Md Khademul Islam Molla1.   

Abstract

Predicting the subcellular locations of proteins can provide useful hints that reveal their functions, increase our understanding of the mechanisms of some diseases, and finally aid in the development of novel drugs. As the number of newly discovered proteins has been growing exponentially, which in turns, makes the subcellular localization prediction by purely laboratory tests prohibitively laborious and expensive. In this context, to tackle the challenges, computational methods are being developed as an alternative choice to aid biologists in selecting target proteins and designing related experiments. However, the success of protein subcellular localization prediction is still a complicated and challenging issue, particularly, when query proteins have multi-label characteristics, i.e., if they exist simultaneously in more than one subcellular location or if they move between two or more different subcellular locations. To date, to address this problem, several types of subcellular localization prediction methods with different levels of accuracy have been proposed. The support vector machine (SVM) has been employed to provide potential solutions to the protein subcellular localization prediction problem. However, the practicability of an SVM is affected by the challenges of selecting an appropriate kernel and selecting the parameters of the selected kernel. To address this difficulty, in this study, we aimed to develop an efficient multi-label protein subcellular localization prediction system, named as MKLoc, by introducing multiple kernel learning (MKL) based SVM. We evaluated MKLoc using a combined dataset containing 5447 single-localized proteins (originally published as part of the Höglund dataset) and 3056 multi-localized proteins (originally published as part of the DBMLoc set). Note that this dataset was used by Briesemeister et al. in their extensive comparison of multi-localization prediction systems. Finally, our experimental results indicate that MKLoc not only achieves higher accuracy than a single kernel based SVM system but also shows significantly better results than those obtained from other top systems (MDLoc, BNCs, YLoc+). Moreover, MKLoc requires less computation time to tune and train the system than that required for BNCs and single kernel based SVM.

Mesh:

Substances:

Year:  2017        PMID: 28247893     DOI: 10.1039/c6mb00860g

Source DB:  PubMed          Journal:  Mol Biosyst        ISSN: 1742-2051


  10 in total

1.  A Systematic Evaluation of Supervised Machine Learning Algorithms for Cell Phenotype Classification Using Single-Cell RNA Sequencing Data.

Authors:  Xiaowen Cao; Li Xing; Elham Majd; Hua He; Junhua Gu; Xuekui Zhang
Journal:  Front Genet       Date:  2022-02-23       Impact factor: 4.599

2.  Machine and Deep Learning for Prediction of Subcellular Localization.

Authors:  Gaofeng Pan; Chao Sun; Zijun Liao; Jijun Tang
Journal:  Methods Mol Biol       Date:  2021

3.  Protein Subcellular Localization with Gaussian Kernel Discriminant Analysis and Its Kernel Parameter Selection.

Authors:  Shunfang Wang; Bing Nie; Kun Yue; Yu Fei; Wenjia Li; Dongshu Xu
Journal:  Int J Mol Sci       Date:  2017-12-15       Impact factor: 5.923

4.  Prediction of subcellular location of apoptosis proteins by incorporating PsePSSM and DCCA coefficient based on LFDA dimensionality reduction.

Authors:  Bin Yu; Shan Li; Wenying Qiu; Minghui Wang; Junwei Du; Yusen Zhang; Xing Chen
Journal:  BMC Genomics       Date:  2018-06-19       Impact factor: 3.969

Review 5.  Computational methods for protein localization prediction.

Authors:  Yuexu Jiang; Duolin Wang; Weiwei Wang; Dong Xu
Journal:  Comput Struct Biotechnol J       Date:  2021-10-19       Impact factor: 7.271

6.  Computational identification of multiple lysine PTM sites by analyzing the instance hardness and feature importance.

Authors:  Sabit Ahmed; Afrida Rahman; Md Al Mehedi Hasan; Shamim Ahmad; S M Shovan
Journal:  Sci Rep       Date:  2021-09-23       Impact factor: 4.379

7.  Comprehensive Analysis of the SBP Family in Blueberry and Their Regulatory Mechanism Controlling Chlorophyll Accumulation.

Authors:  Xin Xie; Shaokang Yue; Baosheng Shi; Hongxue Li; Yuhai Cui; Jingying Wang; Pengjie Yang; Shuchun Li; Xuyan Li; Shaomin Bian
Journal:  Front Plant Sci       Date:  2021-07-01       Impact factor: 5.753

8.  ksrMKL: a novel method for identification of kinase-substrate relationships using multiple kernel learning.

Authors:  Minghui Wang; Tao Wang; Ao Li
Journal:  PeerJ       Date:  2017-12-20       Impact factor: 2.984

9.  Protein subnuclear localization based on a new effective representation and intelligent kernel linear discriminant analysis by dichotomous greedy genetic algorithm.

Authors:  Shunfang Wang; Yaoting Yue
Journal:  PLoS One       Date:  2018-04-12       Impact factor: 3.240

10.  Consistent prediction of GO protein localization.

Authors:  Flavio E Spetale; Debora Arce; Flavia Krsticevic; Pilar Bulacio; Elizabeth Tapia
Journal:  Sci Rep       Date:  2018-05-17       Impact factor: 4.379

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.