Literature DB >> 27299433

TargetCrys: protein crystallization prediction by fusing multi-view features with two-layered SVM.

Jun Hu1, Ke Han1, Yang Li1, Jing-Yu Yang1, Hong-Bin Shen2, Dong-Jun Yu3.   

Abstract

The accurate prediction of whether a protein will crystallize plays a crucial role in improving the success rate of protein crystallization projects. A common critical problem in the development of machine-learning-based protein crystallization predictors is how to effectively utilize protein features extracted from different views. In this study, we aimed to improve the efficiency of fusing multi-view protein features by proposing a new two-layered SVM (2L-SVM) which switches the feature-level fusion problem to a decision-level fusion problem: the SVMs in the 1st layer of the 2L-SVM are trained on each of the multi-view feature sets; then, the outputs of the 1st layer SVMs, which are the "intermediate" decisions made based on the respective feature sets, are further ensembled by a 2nd layer SVM. Based on the proposed 2L-SVM, we implemented a sequence-based protein crystallization predictor called TargetCrys. Experimental results on several benchmark datasets demonstrated the efficacy of the proposed 2L-SVM for fusing multi-view features. We also compared TargetCrys with existing sequence-based protein crystallization predictors and demonstrated that the proposed TargetCrys outperformed most of the existing predictors and is competitive with the state-of-the-art predictors. The TargetCrys webserver and datasets used in this study are freely available for academic use at: http://csbio.njust.edu.cn/bioinf/TargetCrys .

Keywords:  Machine learning; Multi-view feature fusion; Protein crystallization prediction; Support vector machine

Mesh:

Year:  2016        PMID: 27299433     DOI: 10.1007/s00726-016-2274-4

Source DB:  PubMed          Journal:  Amino Acids        ISSN: 0939-4451            Impact factor:   3.520


  4 in total

Review 1.  Understanding pre-mRNA splicing through crystallography.

Authors:  Sara Espinosa; Lingdi Zhang; Xueni Li; Rui Zhao
Journal:  Methods       Date:  2017-05-12       Impact factor: 3.608

2.  Prediction of disease-associated nsSNPs by integrating multi-scale ResNet models with deep feature fusion.

Authors:  Fang Ge; Ying Zhang; Jian Xu; Arif Muhammad; Jiangning Song; Dong-Jun Yu
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

3.  fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization.

Authors:  Fanchi Meng; Chen Wang; Lukasz Kurgan
Journal:  BMC Bioinformatics       Date:  2018-01-03       Impact factor: 3.169

4.  TLCrys: Transfer Learning Based Method for Protein Crystallization Prediction.

Authors:  Chen Jin; Zhuangwei Shi; Chuanze Kang; Ken Lin; Han Zhang
Journal:  Int J Mol Sci       Date:  2022-01-16       Impact factor: 5.923

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.