Holger Dinkel1, Heinrich Sticht. 1. Abteilung für Bioinformatik, Institut für Biochemie, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany.
Abstract
MOTIVATION: Short linear peptide motifs mediate protein-protein interaction, cell compartment targeting and represent the sites of post-translational modification. The identification of functional motifs by conventional sequence searches, however, is hampered by the short length of the motifs resulting in a large number of hits of which only a small portion is functional. RESULTS: We have developed a procedure for the identification of functional motifs, which scores pattern conservation in homologous sequences by taking explicitly into account the sequence similarity to the query sequence. For a further improvement of this method, sequence filters have been optimized to mask those sequence regions containing little or no linear motifs. The performance of this approach was verified by measuring its ability to identify 576 experimentally validated motifs among a total of 15 563 instances in a set of 415 protein sequences. Compared to a random selection procedure, the joint application of sequence filters and the novel scoring scheme resulted in a 9-fold enrichment of validated functional motifs on the first rank. In addition, only half as many hits need to be investigated to recover 75% of the functional instances in our dataset. Therefore, this motif-scoring approach should be helpful to guide experiments because it allows focusing on those short linear peptide motifs that have a high probability to be functional.
MOTIVATION: Short linear peptide motifs mediate protein-protein interaction, cell compartment targeting and represent the sites of post-translational modification. The identification of functional motifs by conventional sequence searches, however, is hampered by the short length of the motifs resulting in a large number of hits of which only a small portion is functional. RESULTS: We have developed a procedure for the identification of functional motifs, which scores pattern conservation in homologous sequences by taking explicitly into account the sequence similarity to the query sequence. For a further improvement of this method, sequence filters have been optimized to mask those sequence regions containing little or no linear motifs. The performance of this approach was verified by measuring its ability to identify 576 experimentally validated motifs among a total of 15 563 instances in a set of 415 protein sequences. Compared to a random selection procedure, the joint application of sequence filters and the novel scoring scheme resulted in a 9-fold enrichment of validated functional motifs on the first rank. In addition, only half as many hits need to be investigated to recover 75% of the functional instances in our dataset. Therefore, this motif-scoring approach should be helpful to guide experiments because it allows focusing on those short linear peptide motifs that have a high probability to be functional.
Authors: Norman E Davey; Joanne L Cowan; Denis C Shields; Toby J Gibson; Mark J Coldwell; Richard J Edwards Journal: Nucleic Acids Res Date: 2012-09-12 Impact factor: 16.971
Authors: Holger Dinkel; Sushama Michael; Robert J Weatheritt; Norman E Davey; Kim Van Roey; Brigitte Altenberg; Grischa Toedt; Bora Uyar; Markus Seiler; Aidan Budd; Lisa Jödicke; Marcel A Dammert; Christian Schroeter; Maria Hammer; Tobias Schmidt; Peter Jehl; Caroline McGuigan; Magdalena Dymecka; Claudia Chica; Katja Luck; Allegra Via; Andrew Chatr-Aryamontri; Niall Haslam; Gleb Grebnev; Richard J Edwards; Michel O Steinmetz; Heike Meiselbach; Francesca Diella; Toby J Gibson Journal: Nucleic Acids Res Date: 2011-11-21 Impact factor: 16.971
Authors: Cathryn M Gould; Francesca Diella; Allegra Via; Pål Puntervoll; Christine Gemünd; Sophie Chabanis-Davidson; Sushama Michael; Ahmed Sayadi; Jan Christian Bryne; Claudia Chica; Markus Seiler; Norman E Davey; Niall Haslam; Robert J Weatheritt; Aidan Budd; Tim Hughes; Jakub Pas; Leszek Rychlewski; Gilles Travé; Rein Aasland; Manuela Helmer-Citterich; Rune Linding; Toby J Gibson Journal: Nucleic Acids Res Date: 2009-11-17 Impact factor: 16.971