Literature DB >> 30668845

Protein fold recognition based on multi-view modeling.

Ke Yan1, Xiaozhao Fang2, Yong Xu1, Bin Liu1,3.   

Abstract

MOTIVATION: Protein fold recognition has attracted increasing attention because it is critical for studies of the 3D structures of proteins and drug design. Researchers have been extensively studying this important task, and several features with high discriminative power have been proposed. However, the development of methods that efficiently combine these features to improve the predictive performance remains a challenging problem.
RESULTS: In this study, we proposed two algorithms: MV-fold and MT-fold. MV-fold is a new computational predictor based on the multi-view learning model for fold recognition. Different features of proteins were treated as different views of proteins, including the evolutionary information, secondary structure information and physicochemical properties. These different views constituted the latent space. The ε-dragging technique was employed to enlarge the margins between different protein folds, improving the predictive performance of MV-fold. Then, MV-fold was combined with two template-based methods: HHblits and HMMER. The ensemble method is called MT-fold incorporating the advantages of both discriminative methods and template-based methods. Experimental results on five widely used benchmark datasets (DD, RDD, EDD, TG and LE) showed that the proposed methods outperformed some state-of-the-art methods in this field, indicating that MV-fold and MT-fold are useful computational tools for protein fold recognition and protein homology detection and would be efficient tools for protein sequence analysis. Finally, we constructed an update and rigorous benchmark dataset based on SCOPe (version 2.07) to fairly evaluate the performance of the proposed method, and our method achieved stable performance on this new dataset. This new benchmark dataset will become a widely used benchmark dataset to fairly evaluate the performance of different methods for fold recognition. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30668845     DOI: 10.1093/bioinformatics/btz040

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  11 in total

1.  RFPR-IDP: reduce the false positive rates for intrinsically disordered protein and region prediction by incorporating both fully ordered proteins and disordered proteins.

Authors:  Yumeng Liu; Xiaolong Wang; Bin Liu
Journal:  Brief Bioinform       Date:  2021-03-22       Impact factor: 11.622

2.  BioSeq-Analysis2.0: an updated platform for analyzing DNA, RNA and protein sequences at sequence level and residue level based on machine learning approaches.

Authors:  Bin Liu; Xin Gao; Hanyu Zhang
Journal:  Nucleic Acids Res       Date:  2019-11-18       Impact factor: 16.971

3.  Why can deep convolutional neural networks improve protein fold recognition? A visual explanation by interpretation.

Authors:  Yan Liu; Yi-Heng Zhu; Xiaoning Song; Jiangning Song; Dong-Jun Yu
Journal:  Brief Bioinform       Date:  2021-09-02       Impact factor: 11.622

4.  Improving protein fold recognition using triplet network and ensemble deep learning.

Authors:  Yan Liu; Ke Han; Yi-Heng Zhu; Ying Zhang; Long-Chen Shen; Jiangning Song; Dong-Jun Yu
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 13.994

5.  iPromoter-2L2.0: Identifying Promoters and Their Types by Combining Smoothing Cutting Window Algorithm and Sequence-Based Features.

Authors:  Bin Liu; Kai Li
Journal:  Mol Ther Nucleic Acids       Date:  2019-08-14       Impact factor: 8.886

6.  A Method for Identifying Vesicle Transport Proteins Based on LibSVM and MRMD.

Authors:  Zhiyu Tao; Yanjuan Li; Zhixia Teng; Yuming Zhao
Journal:  Comput Math Methods Med       Date:  2020-10-19       Impact factor: 2.238

7.  iT3SE-PX: Identification of Bacterial Type III Secreted Effectors Using PSSM Profiles and XGBoost Feature Selection.

Authors:  Chenchen Ding; Haitao Han; Qianyue Li; Xiaoxia Yang; Taigang Liu
Journal:  Comput Math Methods Med       Date:  2021-01-06       Impact factor: 2.238

Review 8.  Research on the Computational Prediction of Essential Genes.

Authors:  Yuxin Guo; Ying Ju; Dong Chen; Lihong Wang
Journal:  Front Cell Dev Biol       Date:  2021-12-06

9.  Network-based protein structural classification.

Authors:  Khalique Newaz; Mahboobeh Ghalehnovi; Arash Rahnama; Panos J Antsaklis; Tijana Milenković
Journal:  R Soc Open Sci       Date:  2020-06-03       Impact factor: 2.963

10.  PSBP-SVM: A Machine Learning-Based Computational Identifier for Predicting Polystyrene Binding Peptides.

Authors:  Chaolu Meng; Yang Hu; Ying Zhang; Fei Guo
Journal:  Front Bioeng Biotechnol       Date:  2020-03-31
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.