Literature DB >> 26370280

Integrated protein function prediction by mining function associations, sequences, and protein-protein and gene-gene interaction networks.

Renzhi Cao1, Jianlin Cheng2.   

Abstract

MOTIVATIONS: Protein function prediction is an important and challenging problem in bioinformatics and computational biology. Functionally relevant biological information such as protein sequences, gene expression, and protein-protein interactions has been used mostly separately for protein function prediction. One of the major challenges is how to effectively integrate multiple sources of both traditional and new information such as spatial gene-gene interaction networks generated from chromosomal conformation data together to improve protein function prediction.
RESULTS: In this work, we developed three different probabilistic scores (MIS, SEQ, and NET score) to combine protein sequence, function associations, and protein-protein interaction and spatial gene-gene interaction networks for protein function prediction. The MIS score is mainly generated from homologous proteins found by PSI-BLAST search, and also association rules between Gene Ontology terms, which are learned by mining the Swiss-Prot database. The SEQ score is generated from protein sequences. The NET score is generated from protein-protein interaction and spatial gene-gene interaction networks. These three scores were combined in a new Statistical Multiple Integrative Scoring System (SMISS) to predict protein function. We tested SMISS on the data set of 2011 Critical Assessment of Function Annotation (CAFA). The method performed substantially better than three base-line methods and an advanced method based on protein profile-sequence comparison, profile-profile comparison, and domain co-occurrence networks according to the maximum F-measure.
Copyright © 2015 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Chromosome conformation capturing; Data integration; Protein function prediction; Protein–protein interaction network; Spatial gene–gene interaction network

Mesh:

Year:  2015        PMID: 26370280      PMCID: PMC4894840          DOI: 10.1016/j.ymeth.2015.09.011

Source DB:  PubMed          Journal:  Methods        ISSN: 1046-2023            Impact factor:   3.608


  45 in total

1.  Assessment of prediction accuracy of protein function from protein--protein interaction data.

Authors:  H Hishigaki; K Nakai; T Ono; A Tanigami; T Takagi
Journal:  Yeast       Date:  2001-04       Impact factor: 3.239

2.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

3.  Prediction of human protein function from post-translational modifications and localization features.

Authors:  L J Jensen; R Gupta; N Blom; D Devos; J Tamames; C Kesmir; H Nielsen; H H Staerfeldt; K Rapacki; C Workman; C A F Andersen; S Knudsen; A Krogh; A Valencia; S Brunak
Journal:  J Mol Biol       Date:  2002-06-21       Impact factor: 5.469

Review 4.  Automatic prediction of protein function.

Authors:  B Rost; J Liu; R Nair; K O Wrzeszczynski; Y Ofran
Journal:  Cell Mol Life Sci       Date:  2003-12       Impact factor: 9.261

5.  Automated Gene Ontology annotation for anonymous sequence data.

Authors:  Steffen Hennig; Detlef Groth; Hans Lehrach
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

6.  OntoBlast function: From sequence similarities directly to potential functional annotations by ontology terms.

Authors:  Günther Zehetner
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

7.  Predicting protein function from protein/protein interaction data: a probabilistic approach.

Authors:  Stanley Letovsky; Simon Kasif
Journal:  Bioinformatics       Date:  2003       Impact factor: 6.937

8.  Global protein function prediction from protein-protein interaction networks.

Authors:  Alexei Vazquez; Alessandro Flammini; Amos Maritan; Alessandro Vespignani
Journal:  Nat Biotechnol       Date:  2003-05-12       Impact factor: 54.908

9.  Prediction of protein function using protein-protein interaction data.

Authors:  Minghua Deng; Kui Zhang; Shipra Mehta; Ting Chen; Fengzhu Sun
Journal:  J Comput Biol       Date:  2003       Impact factor: 1.479

10.  A Bayesian framework for combining heterogeneous data sources for gene function prediction (in Saccharomyces cerevisiae).

Authors:  Olga G Troyanskaya; Kara Dolinski; Art B Owen; Russ B Altman; David Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  2003-06-25       Impact factor: 12.779

View more
  20 in total

1.  A Multi-Label Supervised Topic Model Conditioned on Arbitrary Features for Gene Function Prediction.

Authors:  Lin Liu; Lin Tang; Xin Jin; Wei Zhou
Journal:  Genes (Basel)       Date:  2019-01-17       Impact factor: 4.096

2.  FFPred 3: feature-based function prediction for all Gene Ontology domains.

Authors:  Domenico Cozzetto; Federico Minneci; Hannah Currant; David T Jones
Journal:  Sci Rep       Date:  2016-08-26       Impact factor: 4.379

3.  Protein functional annotation of simultaneously improved stability, accuracy and false discovery rate achieved by a sequence-based deep learning.

Authors:  Jiajun Hong; Yongchao Luo; Yang Zhang; Junbiao Ying; Weiwei Xue; Tian Xie; Lin Tao; Feng Zhu
Journal:  Brief Bioinform       Date:  2020-07-15       Impact factor: 11.622

4.  DNSS2: Improved ab initio protein secondary structure prediction using advanced deep learning architectures.

Authors:  Zhiye Guo; Jie Hou; Jianlin Cheng
Journal:  Proteins       Date:  2020-09-16

5.  Missing gene identification using functional coherence scores.

Authors:  Meghana Chitale; Ishita K Khan; Daisuke Kihara
Journal:  Sci Rep       Date:  2016-08-24       Impact factor: 4.379

6.  DeepQA: improving the estimation of single protein model quality with deep belief networks.

Authors:  Renzhi Cao; Debswapna Bhattacharya; Jie Hou; Jianlin Cheng
Journal:  BMC Bioinformatics       Date:  2016-12-05       Impact factor: 3.169

7.  An Algorithm of Association Rule Mining for Microbial Energy Prospection.

Authors:  Muhammad Shaheen; Muhammad Shahbaz
Journal:  Sci Rep       Date:  2017-04-10       Impact factor: 4.379

8.  DeepSF: deep convolutional neural network for mapping protein sequences to folds.

Authors:  Jie Hou; Badri Adhikari; Jianlin Cheng
Journal:  Bioinformatics       Date:  2018-04-15       Impact factor: 6.937

9.  Protein single-model quality assessment by feature-based probability density functions.

Authors:  Renzhi Cao; Jianlin Cheng
Journal:  Sci Rep       Date:  2016-04-04       Impact factor: 4.379

10.  PrESOgenesis: A two-layer multi-label predictor for identifying fertility-related proteins using support vector machine and pseudo amino acid composition approach.

Authors:  Mohammad Reza Bakhtiarizadeh; Maryam Rahimi; Abdollah Mohammadi-Sangcheshmeh; Vahid Shariati J; Seyed Alireza Salami
Journal:  Sci Rep       Date:  2018-06-13       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.