Literature DB >> 20078397

Graphlet kernels for prediction of functional residues in protein structures.

Vladimir Vacic1, Lilia M Iakoucheva, Stefano Lonardi, Predrag Radivojac.   

Abstract

We introduce a novel graph-based kernel method for annotating functional residues in protein structures. A structure is first modeled as a protein contact graph, where nodes correspond to residues and edges connect spatially neighboring residues. Each vertex in the graph is then represented as a vector of counts of labeled non-isomorphic subgraphs (graphlets), centered on the vertex of interest. A similarity measure between two vertices is expressed as the inner product of their respective count vectors and is used in a supervised learning framework to classify protein residues. We evaluated our method on two function prediction problems: identification of catalytic residues in proteins, which is a well-studied problem suitable for benchmarking, and a much less explored problem of predicting phosphorylation sites in protein structures. The performance of the graphlet kernel approach was then compared against two alternative methods, a sequence-based predictor and our implementation of the FEATURE framework. On both tasks, the graphlet kernel performed favorably; however, the margin of difference was considerably higher on the problem of phosphorylation site prediction. While there is data that phosphorylation sites are preferentially positioned in intrinsically disordered regions, we provide evidence that for the sites that are located in structured regions, neither the surface accessibility alone nor the averaged measures calculated from the residue microenvironments utilized by FEATURE were sufficient to achieve high accuracy. The key benefit of the graphlet representation is its ability to capture neighborhood similarities in protein structures via enumerating the patterns of local connectivity in the corresponding labeled graphs.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20078397      PMCID: PMC2921594          DOI: 10.1089/cmb.2009.0029

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  90 in total

1.  Large-scale characterization of HeLa cell nuclear phosphoproteins.

Authors:  Sean A Beausoleil; Mark Jedrychowski; Daniel Schwartz; Joshua E Elias; Judit Villén; Jiaxu Li; Martin A Cohn; Lewis C Cantley; Steven P Gygi
Journal:  Proc Natl Acad Sci U S A       Date:  2004-08-09       Impact factor: 11.205

2.  Automated prediction of protein function and detection of functional sites from structure.

Authors:  Florencio Pazos; Michael J E Sternberg
Journal:  Proc Natl Acad Sci U S A       Date:  2004-09-29       Impact factor: 11.205

3.  Efficient detection of three-dimensional structural motifs in biological macromolecules by computer vision techniques.

Authors:  R Nussinov; H J Wolfson
Journal:  Proc Natl Acad Sci U S A       Date:  1991-12-01       Impact factor: 11.205

4.  Identification of tertiary structure resemblance in proteins using a maximal common subgraph isomorphism algorithm.

Authors:  H M Grindley; P J Artymiuk; D W Rice; P Willett
Journal:  J Mol Biol       Date:  1993-02-05       Impact factor: 5.469

5.  Protein structure comparison by alignment of distance matrices.

Authors:  L Holm; C Sander
Journal:  J Mol Biol       Date:  1993-09-05       Impact factor: 5.469

6.  Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features.

Authors:  W Kabsch; C Sander
Journal:  Biopolymers       Date:  1983-12       Impact factor: 2.505

7.  Enlarged representative set of protein structures.

Authors:  U Hobohm; C Sander
Journal:  Protein Sci       Date:  1994-03       Impact factor: 6.725

8.  Analysis and prediction of the location of catalytic residues in enzymes.

Authors:  M J Zvelebil; M J Sternberg
Journal:  Protein Eng       Date:  1988-07

9.  Kinase peptide specificity: improved determination and relevance to protein phosphorylation.

Authors:  Koichi Fujii; Guozhi Zhu; Yin Liu; John Hallam; Lin Chen; Juan Herrero; Stephen Shaw
Journal:  Proc Natl Acad Sci U S A       Date:  2004-09-08       Impact factor: 11.205

10.  The prediction and characterization of metal binding sites in proteins.

Authors:  D S Gregory; A C Martin; J C Cheetham; A R Rees
Journal:  Protein Eng       Date:  1993-01
View more
  13 in total

1.  Structure-based kernels for the prediction of catalytic residues and their involvement in human inherited disease.

Authors:  Fuxiao Xin; Steven Myers; Yong Fuga Li; David N Cooper; Sean D Mooney; Predrag Radivojac
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

2.  Network analysis of synonymous codon usage.

Authors:  Khalique Newaz; Gabriel Wright; Jacob Piland; Jun Li; Patricia L Clark; Scott J Emrich; Tijana Milenković
Journal:  Bioinformatics       Date:  2020-12-08       Impact factor: 6.937

3.  From homogeneous to heterogeneous network alignment via colored graphlets.

Authors:  Shawn Gu; John Johnson; Fazle E Faisal; Tijana Milenković
Journal:  Sci Rep       Date:  2018-08-21       Impact factor: 4.379

4.  Classification in biological networks with hypergraphlet kernels.

Authors:  Jose Lugo-Martinez; Daniel Zeiberg; Thomas Gaudelet; Noël Malod-Dognin; Natasa Przulj; Predrag Radivojac
Journal:  Bioinformatics       Date:  2021-05-17       Impact factor: 6.937

5.  Biological function derived from predicted structures in CASP11.

Authors:  Peter J Huwe; Qifang Xu; Maxim V Shapovalov; Vivek Modi; Mark D Andrake; Roland L Dunbrack
Journal:  Proteins       Date:  2016-06-15

6.  Graphlet signature-based scoring method to estimate protein-ligand binding affinity.

Authors:  Omkar Singh; Kunal Sawariya; Polamarasetty Aparoy
Journal:  R Soc Open Sci       Date:  2014-12-10       Impact factor: 2.963

7.  Mining the entire Protein DataBank for frequent spatially cohesive amino acid patterns.

Authors:  Pieter Meysman; Cheng Zhou; Boris Cule; Bart Goethals; Kris Laukens
Journal:  BioData Min       Date:  2015-01-31       Impact factor: 2.522

8.  Cutoff lensing: predicting catalytic sites in enzymes.

Authors:  Simon Aubailly; Francesco Piazza
Journal:  Sci Rep       Date:  2015-10-08       Impact factor: 4.379

9.  Systematically differentiating functions for alternatively spliced isoforms through integrating RNA-seq data.

Authors:  Ridvan Eksi; Hong-Dong Li; Rajasree Menon; Yuchen Wen; Gilbert S Omenn; Matthias Kretzler; Yuanfang Guan
Journal:  PLoS Comput Biol       Date:  2013-11-07       Impact factor: 4.475

10.  The Loss and Gain of Functional Amino Acid Residues Is a Common Mechanism Causing Human Inherited Disease.

Authors:  Jose Lugo-Martinez; Vikas Pejaver; Kymberleigh A Pagel; Shantanu Jain; Matthew Mort; David N Cooper; Sean D Mooney; Predrag Radivojac
Journal:  PLoS Comput Biol       Date:  2016-08-26       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.