Dariusz Plewczynski, Adrian Tkacz, Lucjan Stanisław Wyrwicz, Adam Godzik, Andrzej Kloczkowski, Leszek Rychlewski.
Abstract
Our algorithm predicts short linear functional motifs in proteins using only sequence information. Statistical models for these motifs are built from a database of short sequence fragments taken from proteins in the current release of the Swiss-Prot database. These segments are experimentally confirmed to carry a single-residue post-translational modification. The sensitivity of the classification for the various types of short linear motifs is around 70%. The query protein sequence is dissected into short overlapping fragments, and each fragment is represented as a vector. Each vector is then classified by a machine learning algorithm (Support Vector Machine) as potentially modifiable or not. The resulting list of plausible post-translational modification sites in the query protein is returned to the user. We also present a study of the human protein kinase C family as a biological application of our method.
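The fragment-and-classify pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the window width of 9, the one-hot encoding, and the names `fragments`, `one_hot`, `predict_sites`, and `toy_classifier` are all assumptions made for illustration, since the abstract does not specify the fragment length or the vector representation. The trained Support Vector Machine is replaced here by a placeholder callable.

```python
from typing import Callable, List, Tuple

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def fragments(sequence: str, width: int = 9) -> List[Tuple[int, str]]:
    """Cut the sequence into overlapping windows (step 1).

    Returns (1-based position of the central residue, fragment) pairs.
    The width of 9 is an assumed value, not taken from the paper.
    """
    half = width // 2
    return [
        (start + half + 1, sequence[start:start + width])
        for start in range(len(sequence) - width + 1)
    ]


def one_hot(fragment: str) -> List[int]:
    """Encode a fragment as a flat 0/1 vector, 20 slots per residue.

    Assumed encoding; non-standard residues (e.g. 'X') become all zeros.
    """
    vector: List[int] = []
    for residue in fragment:
        vector.extend(1 if residue == aa else 0 for aa in AMINO_ACIDS)
    return vector


def predict_sites(
    sequence: str,
    classify: Callable[[List[int]], bool],
    width: int = 9,
) -> List[int]:
    """Return central positions of fragments the classifier accepts."""
    return [
        position
        for position, fragment in fragments(sequence, width)
        if classify(one_hot(fragment))
    ]


def toy_classifier(vector: List[int]) -> bool:
    """Placeholder for a trained SVM: accepts windows centred on serine."""
    centre = 4  # central index of a 9-residue window
    return vector[centre * len(AMINO_ACIDS) + AMINO_ACIDS.index("S")] == 1


if __name__ == "__main__":
    # Arbitrary example sequence, used only to exercise the functions.
    print(predict_sites("MKTAYSAKQRQISFVKSHF", toy_classifier))
```

In practice `toy_classifier` would be replaced by the decision function of a model trained on experimentally confirmed modification sites. Note that this sketch skips residues too close to either terminus to sit at the centre of a full-width window.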
Year: 2005 PMID: 16341901 DOI: 10.1007/s00894-005-0070-2
Source DB: PubMed Journal: J Mol Model ISSN: 0948-5023 Impact factor: 1.810