Literature DB >> 29990004

Protein Remote Homology Detection and Fold Recognition Based on Sequence-Order Frequency Matrix.

Bin Liu, Junjie Chen, Mingyue Guo, Xiaolong Wang.   

Abstract

Protein remote homology detection and fold recognition are two critical tasks for the studies of protein structures and functions. Currently, the profile-based methods achieve the state-of-the-art performance in these fields. However, the widely used sequence profiles, like position-specific frequency matrix (PSFM) and position-specific scoring matrix (PSSM), ignore the sequence-order effects along protein sequence. In this study, we have proposed a novel profile, called sequence-order frequency matrix (SOFM), to extract the sequence-order information of neighboring residues from multiple sequence alignment (MSA). Combined with two profile feature extraction approaches, top-n-grams and the Smith-Waterman algorithm, the SOFMs are applied to protein remote homology detection and fold recognition, and two predictors called SOFM-Top and SOFM-SW are proposed. Experimental results show that SOFM contains more information content than other profiles, and these two predictors outperform other state-of-the-art methods. It is anticipated that SOFM will become a very useful profile in the studies of protein structures and functions.

Mesh:

Substances:

Year:  2017        PMID: 29990004     DOI: 10.1109/TCBB.2017.2765331

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  3 in total

1.  Twenty years of bioinformatics research for protease-specific substrate and cleavage site prediction: a comprehensive revisit and benchmarking of existing methods.

Authors:  Fuyi Li; Yanan Wang; Chen Li; Tatiana T Marquez-Lago; André Leier; Neil D Rawlings; Gholamreza Haffari; Jerico Revote; Tatsuya Akutsu; Kuo-Chen Chou; Anthony W Purcell; Robert N Pike; Geoffrey I Webb; A Ian Smith; Trevor Lithgow; Roger J Daly; James C Whisstock; Jiangning Song
Journal:  Brief Bioinform       Date:  2019-11-27       Impact factor: 11.622

2.  RFPR-IDP: reduce the false positive rates for intrinsically disordered protein and region prediction by incorporating both fully ordered proteins and disordered proteins.

Authors:  Yumeng Liu; Xiaolong Wang; Bin Liu
Journal:  Brief Bioinform       Date:  2021-03-22       Impact factor: 11.622

3.  Why can deep convolutional neural networks improve protein fold recognition? A visual explanation by interpretation.

Authors:  Yan Liu; Yi-Heng Zhu; Xiaoning Song; Jiangning Song; Dong-Jun Yu
Journal:  Brief Bioinform       Date:  2021-09-02       Impact factor: 11.622

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.