An Integrated Framework for Functional Annotation of Protein Structural Domains.

Lei Deng, Zhigang Chen.   

Abstract

Structural domains are evolutionary and functional units of proteins and play a critical role in comparative and functional genomics. Reliable computational assignment of domain function is essential for understanding whole-protein function. However, functional annotations are conventionally assigned to full-length proteins rather than to individual structural domains. In this article, we present Structural Domain Annotation (SDA), a novel computational approach for predicting the functions of SCOP structural domains. SDA uses a Bayesian network to integrate heterogeneous information sources: structure-alignment-based protein-SCOP mapping features, InterPro2GO mapping information, PSSM profiles, and sequence neighborhood features. By applying SDA to annotate SCOP domains with Gene Ontology terms on a large scale, we obtained a database of SCOP domain to Gene Ontology mappings, which covers ~162,000 of the approximately 166,900 domains in SCOPe 2.03 (>97 percent) together with their predicted Gene Ontology functions. We benchmarked SDA on a single-domain protein dataset and on an independent dataset from different species. Comparative studies show that SDA significantly outperforms existing function prediction methods for structural domains in terms of coverage and maximum F-measure.
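
The abstract names maximum F-measure as an evaluation criterion without defining it. As a rough illustration only, the sketch below computes the threshold-swept maximum F-measure in the form commonly used for Gene Ontology function prediction benchmarks. It assumes that the paper uses a comparable definition, which the abstract does not confirm. The function name `f_max`, the domain identifiers, the GO term labels, and the scores are all hypothetical.

```python
from typing import Dict, Set


def f_max(predictions: Dict[str, Dict[str, float]],
          truth: Dict[str, Set[str]],
          steps: int = 100) -> float:
    """Maximum F-measure over score thresholds t = 1/steps, ..., 1.

    predictions: domain id -> {GO term: confidence score in [0, 1]}
    truth:       domain id -> non-empty set of true GO terms
    (Assumed definition; not taken from the paper itself.)
    """
    best = 0.0
    for i in range(1, steps + 1):
        t = i / steps
        precisions = []      # only domains with at least one prediction at t
        recall_sum = 0.0     # summed over all benchmark domains
        for domain, true_terms in truth.items():
            scored = predictions.get(domain, {})
            predicted = {term for term, s in scored.items() if s >= t}
            hits = len(predicted & true_terms)
            if predicted:
                precisions.append(hits / len(predicted))
            recall_sum += hits / len(true_terms)
        if not precisions:
            continue         # nothing predicted at this threshold
        precision = sum(precisions) / len(precisions)
        recall = recall_sum / len(truth)
        if precision + recall > 0:
            f = 2 * precision * recall / (precision + recall)
            best = max(best, f)
    return best


# Toy data (hypothetical identifiers and scores).
predictions = {
    "d1": {"GO:A": 0.9, "GO:B": 0.4},
    "d2": {"GO:C": 0.8},
}
truth = {
    "d1": {"GO:A"},
    "d2": {"GO:C", "GO:D"},
}
score = f_max(predictions, truth)  # about 0.857 for this toy data
```

Averaging precision only over domains that receive at least one prediction, while averaging recall over all domains, means a method that annotates few domains is penalized through recall; this is one reason coverage and maximum F-measure are typically reported together.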

Year:  2015        PMID: 26357331     DOI: 10.1109/TCBB.2015.2389213

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  8 in total

1.  TOPDOM: database of conservatively located domains and motifs in proteins.

Authors:  Julia Varga; László Dobson; Gábor E Tusnády
Journal:  Bioinformatics       Date:  2016-04-12       Impact factor: 6.937

2.  RicyerDB: A Database For Collecting Rice Yield-related Genes with Biological Analysis.

Authors:  Jing Jiang; Fei Xing; Xiangxiang Zeng; Quan Zou
Journal:  Int J Biol Sci       Date:  2018-05-22       Impact factor: 6.580

3.  MADOKA: an ultra-fast approach for large-scale protein structure similarity searching.

Authors:  Lei Deng; Guolun Zhong; Chenzhe Liu; Judong Luo; Hui Liu
Journal:  BMC Bioinformatics       Date:  2019-12-24       Impact factor: 3.169

4.  A boosting approach for prediction of protein-RNA binding residues.

Authors:  Yongjun Tang; Diwei Liu; Zixiang Wang; Ting Wen; Lei Deng
Journal:  BMC Bioinformatics       Date:  2017-12-01       Impact factor: 3.169

5.  RFAmyloid: A Web Server for Predicting Amyloid Proteins.

Authors:  Mengting Niu; Yanjuan Li; Chunyu Wang; Ke Han
Journal:  Int J Mol Sci       Date:  2018-07-16       Impact factor: 5.923

6.  Gene Ontology-based function prediction of long non-coding RNAs using bi-random walk.

Authors:  Jingpu Zhang; Shuai Zou; Lei Deng
Journal:  BMC Med Genomics       Date:  2018-11-20       Impact factor: 3.063

7.  SDADB: a functional annotation database of protein structural domains.

Authors:  Cheng Zeng; Weihua Zhan; Lei Deng
Journal:  Database (Oxford)       Date:  2018-01-01       Impact factor: 3.451

8.  Identifying Plant Pentatricopeptide Repeat Coding Gene/Protein Using Mixed Feature Extraction Methods.

Authors:  Kaiyang Qu; Leyi Wei; Jiantao Yu; Chunyu Wang
Journal:  Front Plant Sci       Date:  2019-01-10       Impact factor: 5.753
