| Literature DB >> 18842634 |
Yoshinobu Igarashi1, Emily Heureux, Kutbuddin S Doctor, Priti Talwar, Svetlana Gramatikova, Kosi Gramatikoff, Ying Zhang, Michael Blinov, Salmaz S Ibragimova, Sarah Boyd, Boris Ratnikov, Piotr Cieplak, Adam Godzik, Jeffrey W Smith, Andrei L Osterman, Alexey M Eroshkin.
Abstract
The Proteolysis MAP (PMAP, http://www.proteolysis.org) is a user-friendly website intended to aid the scientific community in reasoning about proteolytic networks and pathways. PMAP is comprised of five databases, linked together in one environment. The foundation databases, ProteaseDB and SubstrateDB, are driven by an automated annotation pipeline that generates dynamic 'Molecule Pages', rich in molecular information. PMAP also contains two community annotated databases focused on function; CutDB has information on more than 5000 proteolytic events, and ProfileDB is dedicated to information of the substrate recognition specificity of proteases. Together, the content within these four databases will ultimately feed PathwayDB, which will be comprised of known pathways whose function can be dynamically modeled in a rule-based manner, and hypothetical pathways suggested by semi-automated culling of the literature. A Protease Toolkit is also available for the analysis of proteases and proteolysis. Here, we describe how the databases of PMAP can be used to foster understanding of proteolytic pathways, and equally as significant, to reason about proteolysis.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18842634 PMCID: PMC2686432 DOI: 10.1093/nar/gkn683
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Protease Molecule Page (top) and Substrate Molecule Page (bottom). The Protease Molecule Page shows recent news and literature of the protease (top left), known proteolytic events (top right), domain location and structure view (middle panel), as well as a cross annotation in other databases section (bottom). A graphics interface of SubstrateDB (bottom) is shown here displaying domains and experimental protease cut sites for the human amyloid β A4 protein. A total of 35 protease cut sites are filtered via selection boxes on the right. The display makes co-location of many features visible. The annotation displayed is linked back to sources (CutDB, ProteaseDB, UniProt, etc.) for more detailed annotation. The graphics display is synchronized with a text table of the same data, which can also be downloaded.
Figure 2.ProfileDB proteome-wide substrate search. (A) Initial page where the user selects options for proteome screening. (B) A results table generated from the proteome-wide screening with a detailed output for a particular protein. (C) Presentation of the cleavage site on the amino acid sequence with protein sequence features (secondary structure and disorder regions location).
Figure 3.Three types of information in PathwayDB. (A) Automated network reconstruction for one proteolytic events caused by ADAM17 peptidase and Notch1 preproprotein. Left panel: from CutDB front page, users search for the events. Middle panel: selection of events to reconstruct networks. Users can also define three parameters to show the network diagrams: (i) ID display mode; (ii) flexibility of connection, the nodes are connected within given threshold of sequence similarity; and (iii) steps of expanding, the edges are extended by given extension level. Right panel: the reconstructed pathway. The red arrows indicate the selected events. The black arrows indicate expanded events. The nonarrows edges are protein–protein interactions. (B) Proteolytic inferred networks. Left panel: PubMed-XML writer, a web service for extracting Medline abstracts, is used to delimit lexical queries, for example ‘complement AND proteolysis’. These abstracts are then assembled in XML format and submitted to Agilent Literature Search, acting as plug-in to Cytoscape (middle panel), which parses sentences containing ontologically known entities (such as gene and protein names). Cytoscape takes these gene/protein names and builds interactive networks as hyper-graphs. Right panel: hyper-graph generated for one ‘Network of the month’ entitled ‘Complement activation and regulation 01’. This network captured five proteases (nodes in yellow) that interplay with six substrates (nodes in green), four cofactors (nodes in light yellow), two inhibitors (nodes in red) and other binding proteins such as C4BP (white nodes). (C) Rule-based modeling of network function. Left panel: schematic representation of major events involved in coagulation. Middle panel: rules for coagulation cascade. Right panel: modeling of production/consumption profiles of individual components of the pathway using an in-house developed rule-based model for coagulation. In the plot, the x-axis denoted the time of simulations (100 iterations) and the y-axis is the molecular concentrations in nanomolar (0–100 000).