MOTIVATION: Protein phosphorylation, driven by specific recognition of substrates by kinases and phosphatases, plays central roles in a variety of important cellular processes such as signaling and enzyme activation. Mass spectrometry enables the determination of phosphorylated peptides (and thereby proteins) in scenarios ranging from targeted in vitro studies to in vivo cell lysates under particular conditions. The characterization of commonalities among identified phosphopeptides provides insights into the specificities of the kinases involved in a study. Several algorithms have been developed to uncover linear motifs representing position-specific amino acid patterns in sets of phosphopeptides. To more fully capture the available information, reduce sensitivity to both parameter choices and natural experimental variation, and develop more precise characterizations of kinase specificities, it is necessary to determine all statistically significant motifs represented in a dataset. RESULTS: We have developed MMFPh (Maximal Motif Finder for Phosphoproteomics datasets), which extends the approach of the popular phosphorylation motif software Motif-X (Schwartz and Gygi, 2005) to identify all statistically significant motifs and return the maximal ones (those not subsumed by motifs with more fixed amino acids). In tests with both synthetic and experimental data, we show that MMFPh finds important motifs missed by the greedy approach of Motif-X, while also finding more motifs that are more characteristic of the dataset relative to the background proteome. Thus MMFPh is in some sense both more sensitive and more specific in characterizing the involved kinases. We also show that MMFPh compares favorably to other recent methods for finding phosphorylation motifs. Furthermore, MMFPh is less dependent on parameter choices. We support this powerful new approach with a web interface so that it may become a useful tool for studies of kinase specificity and phosphorylation site prediction. AVAILABILITY: A web server is at www.cs.dartmouth.edu/~cbk/.
MOTIVATION: Protein phosphorylation, driven by specific recognition of substrates by kinases and phosphatases, plays central roles in a variety of important cellular processes such as signaling and enzyme activation. Mass spectrometry enables the determination of phosphorylated peptides (and thereby proteins) in scenarios ranging from targeted in vitro studies to in vivo cell lysates under particular conditions. The characterization of commonalities among identified phosphopeptides provides insights into the specificities of the kinases involved in a study. Several algorithms have been developed to uncover linear motifs representing position-specific amino acid patterns in sets of phosphopeptides. To more fully capture the available information, reduce sensitivity to both parameter choices and natural experimental variation, and develop more precise characterizations of kinase specificities, it is necessary to determine all statistically significant motifs represented in a dataset. RESULTS: We have developed MMFPh (Maximal Motif Finder for Phosphoproteomics datasets), which extends the approach of the popular phosphorylation motif software Motif-X (Schwartz and Gygi, 2005) to identify all statistically significant motifs and return the maximal ones (those not subsumed by motifs with more fixed amino acids). In tests with both synthetic and experimental data, we show that MMFPh finds important motifs missed by the greedy approach of Motif-X, while also finding more motifs that are more characteristic of the dataset relative to the background proteome. Thus MMFPh is in some sense both more sensitive and more specific in characterizing the involved kinases. We also show that MMFPh compares favorably to other recent methods for finding phosphorylation motifs. Furthermore, MMFPh is less dependent on parameter choices. We support this powerful new approach with a web interface so that it may become a useful tool for studies of kinase specificity and phosphorylation site prediction. AVAILABILITY: A web server is at www.cs.dartmouth.edu/~cbk/.
Authors: Viveka Mayya; Deborah H Lundgren; Sun-Il Hwang; Karim Rezaul; Linfeng Wu; Jimmy K Eng; Vladimir Rodionov; David K Han Journal: Sci Signal Date: 2009-08-18 Impact factor: 8.192
Authors: Andrew M Waterhouse; James B Procter; David M A Martin; Michèle Clamp; Geoffrey J Barton Journal: Bioinformatics Date: 2009-01-16 Impact factor: 6.937
Authors: Martin Lee Miller; Lars Juhl Jensen; Francesca Diella; Claus Jørgensen; Michele Tinti; Lei Li; Marilyn Hsiung; Sirlester A Parker; Jennifer Bordeaux; Thomas Sicheritz-Ponten; Marina Olhovsky; Adrian Pasculescu; Jes Alexander; Stefan Knapp; Nikolaj Blom; Peer Bork; Shawn Li; Gianni Cesareni; Tony Pawson; Benjamin E Turk; Michael B Yaffe; Søren Brunak; Rune Linding Journal: Sci Signal Date: 2008-09-02 Impact factor: 8.192
Authors: Shuhei Matsuoka; Bryan A Ballif; Agata Smogorzewska; E Robert McDonald; Kristen E Hurov; Ji Luo; Corey E Bakalarski; Zhenming Zhao; Nicole Solimini; Yaniv Lerenthal; Yosef Shiloh; Steven P Gygi; Stephen J Elledge Journal: Science Date: 2007-05-25 Impact factor: 47.728
Authors: Rachael A McCloy; Benjamin L Parker; Samuel Rogers; Rima Chaudhuri; Velimir Gayevskiy; Nolan J Hoffman; Naveid Ali; D Neil Watkins; Roger J Daly; David E James; Thierry Lorca; Anna Castro; Andrew Burgess Journal: Mol Cell Proteomics Date: 2015-06-08 Impact factor: 5.911
Authors: Christian Posch; Martina Sanlorenzo; Igor Vujic; Juan A Oses-Prieto; Brian D Cholewa; Sarasa T Kim; Jeffrey Ma; Kevin Lai; Mitchell Zekhtser; Rosaura Esteve-Puig; Gary Green; Shreya Chand; Alma L Burlingame; Renate Panzer-Grümayer; Klemens Rappersberger; Susana Ortiz-Urda Journal: J Invest Dermatol Date: 2016-05-29 Impact factor: 8.551