| Literature DB >> 23093603 |
Niklaas Colaert1, Davy Maddelein, Francis Impens, Petra Van Damme, Kim Plasman, Kenny Helsens, Niels Hulstaert, Joël Vandekerckhove, Kris Gevaert, Lennart Martens.
Abstract
We here present The Online Protein Processing Resource (TOPPR; http://iomics.ugent.be/toppr/), an online database that contains thousands of published proteolytically processed sites in human and mouse proteins. These cleavage events were identified with COmbinded FRActional DIagonal Chromatography proteomics technologies, and the resulting database is provided with full data provenance. Indeed, TOPPR provides an interactive visual display of the actual fragmentation mass spectrum that led to each identification of a reported processed site, complete with fragment ion annotations and search engine scores. Apart from warehousing and disseminating these data in an intuitive manner, TOPPR also provides an online analysis platform, including methods to analyze protease specificity and substrate-centric analyses. Concretely, TOPPR supports three ways to retrieve data: (i) the retrieval of all substrates for one or more cellular stimuli or assays; (ii) a substrate search by UniProtKB/Swiss-Prot accession number, entry name or description; and (iii) a motif search that retrieves substrates matching a user-defined protease specificity profile. The analysis of the substrates is supported through the presence of a variety of annotations, including predicted secondary structure, known domains and experimentally obtained 3D structure where available. Across substrates, substrate orthologs and conserved sequence stretches can also be shown, with iceLogo visualization provided for the latter.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23093603 PMCID: PMC3531153 DOI: 10.1093/nar/gks998
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The substrate sequence view in TOPPR. This display provides an overview of the protein sequence and links to dynamic annotations (here secondary structure and domain annotation have been selected). A protein bar representation, below the sequence, represents the full length of the protein with processing events indicated; immediately underneath, the domain visualization is shown. Below this protein-centric sequence view, details are shown on the individual peptides that were found to represent the annotated cleavage sites. Each peptide sequence in turn links to a peptide-centric view, where motif analyses, mass spectrometry data and all matching proteins can be found. Note that the peptide is annotated with its start and end coordinates on the protein.