Literature DB >> 23418186

Prediction of contact matrix for protein-protein interaction.

Alvaro J González1, Li Liao, Cathy H Wu.   

Abstract

MOTIVATION: Prediction of protein-protein interaction has become an important part of systems biology in reverse engineering the biological networks for better understanding the molecular biology of the cell. Although significant progress has been made in terms of prediction accuracy, most computational methods only predict whether two proteins interact but not their interacting residues-the information that can be very valuable for understanding the interaction mechanisms and designing modulation of the interaction. In this work, we developed a computational method to predict the interacting residue pairs-contact matrix for interacting protein domains, whose rows and columns correspond to the residues in the two interacting domains respectively and whose values (1 or 0) indicate whether the corresponding residues (do or do not) interact.
RESULTS: Our method is based on supervised learning using support vector machines. For each domain involved in a given domain-domain interaction (DDI), an interaction profile hidden Markov model (ipHMM) is first built for the domain family, and then each residue position for a member domain sequence is represented as a 20-dimension vector of Fisher scores, characterizing how similar it is as compared with the family profile at that position. Each element of the contact matrix for a sequence pair is now represented by a feature vector from concatenating the vectors of the two corresponding residues, and the task is to predict the element value (1 or 0) from the feature vector. A support vector machine is trained for a given DDI, using either a consensus contact matrix or contact matrices for individual sequence pairs, and is tested by leave-one-out cross validation. The performance averaged over a set of 115 DDIs collected from the 3 DID database shows significant improvement (sensitivity up to 85%, and specificity up to 85%), as compared with a multiple sequence alignment-based method (sensitivity 57%, and specificity 78%) previously reported in the literature. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Mesh:

Year:  2013        PMID: 23418186      PMCID: PMC3624801          DOI: 10.1093/bioinformatics/btt076

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  21 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  A discriminative framework for detecting remote protein homologies.

Authors:  T Jaakkola; M Diekhans; D Haussler
Journal:  J Comput Biol       Date:  2000 Feb-Apr       Impact factor: 1.479

3.  Interrogating protein interaction networks through structural biology.

Authors:  Patrick Aloy; Robert B Russell
Journal:  Proc Natl Acad Sci U S A       Date:  2002-04-23       Impact factor: 11.205

4.  Principles of protein-protein recognition.

Authors:  C Chothia; J Janin
Journal:  Nature       Date:  1975-08-28       Impact factor: 49.962

5.  Arf, Arl, Arp and Sar proteins: a family of GTP-binding proteins with a structural device for 'front-back' communication.

Authors:  Sebastiano Pasqualato; Louis Renault; Jacqueline Cherfils
Journal:  EMBO Rep       Date:  2002-11       Impact factor: 8.807

Review 6.  Profile hidden Markov models.

Authors:  S R Eddy
Journal:  Bioinformatics       Date:  1998       Impact factor: 6.937

7.  Morphology of protein-protein interfaces.

Authors:  T A Larsen; A J Olson; D S Goodsell
Journal:  Structure       Date:  1998-04-15       Impact factor: 5.006

Review 8.  Principles of protein-protein interactions.

Authors:  S Jones; J M Thornton
Journal:  Proc Natl Acad Sci U S A       Date:  1996-01-09       Impact factor: 11.205

9.  Protein sequence alignments: a strategy for the hierarchical analysis of residue conservation.

Authors:  C D Livingstone; G J Barton
Journal:  Comput Appl Biosci       Date:  1993-12

10.  Protein 3D structure computed from evolutionary sequence variation.

Authors:  Debora S Marks; Lucy J Colwell; Robert Sheridan; Thomas A Hopf; Andrea Pagnani; Riccardo Zecchina; Chris Sander
Journal:  PLoS One       Date:  2011-12-07       Impact factor: 3.240

View more
  6 in total

1.  Forecasting residue-residue contact prediction accuracy.

Authors:  P P Wozniak; B M Konopka; J Xu; G Vriend; M Kotulska
Journal:  Bioinformatics       Date:  2017-11-01       Impact factor: 6.937

2.  Enhancing interacting residue prediction with integrated contact matrix prediction in protein-protein interaction.

Authors:  Tianchuan Du; Li Liao; Cathy H Wu
Journal:  EURASIP J Bioinform Syst Biol       Date:  2016-10-22

3.  Distance-based reconstruction of protein quaternary structures from inter-chain contacts.

Authors:  Elham Soltanikazemi; Farhan Quadir; Raj S Roy; Zhiye Guo; Jianlin Cheng
Journal:  Proteins       Date:  2021-11-02

4.  Improved multi-level protein-protein interaction prediction with semantic-based regularization.

Authors:  Claudio Saccà; Stefano Teso; Michelangelo Diligenti; Andrea Passerini
Journal:  BMC Bioinformatics       Date:  2014-04-12       Impact factor: 3.169

Review 5.  Applications of contact predictions to structural biology.

Authors:  Felix Simkovic; Sergey Ovchinnikov; David Baker; Daniel J Rigden
Journal:  IUCrJ       Date:  2017-04-18       Impact factor: 4.769

6.  Pattern to Knowledge: Deep Knowledge-Directed Machine Learning for Residue-Residue Interaction Prediction.

Authors:  Andrew K C Wong; Ho Yin Sze-To; Gary L Johanning
Journal:  Sci Rep       Date:  2018-10-04       Impact factor: 4.379

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.