| Literature DB >> 27990265 |
Nirupama Benis1, Dirkjan Schokker2, Frank Kramer3, Mari A Smits4, Maria Suarez-Diez5.
Abstract
Biological pathways are increasingly available in the BioPAX format which uses an RDF model for data storage. One can retrieve the information in this data model in the scripting language R using the package rBiopaxParser, which converts the BioPAX format to one readable in R. It also has a function to build a regulatory network from the pathway information. Here we describe an extension of this function. The new function allows the user to build graphs of entire pathways, including regulated as well as non-regulated elements, and therefore provides a maximum of information. This function is available as part of the rBiopaxParser distribution from Bioconductor.Entities:
Keywords: BioPAX; R; pathways; rBiopaxParser
Year: 2016 PMID: 27990265 PMCID: PMC5130076 DOI: 10.12688/f1000research.9582.2
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. Hypothetical pathway.
This cartoon of a pathway shows examples of nodes and edges that could be encountered in a BioPAX database. The nodes are proteins, complexes or other physical entities and the edges are interactions between the nodes, that represent either interactions among proteins or protein modifications. The solid edges are those detected by the P2RG function and the solid and dashed edges are detected by the P2G function.
Figure 2. Interplay of classes in Reactome BioPAX.
This figure shows a network of the Interaction and PhysicalEntity classes that are a part of any pathway in Reactome v51 BioPAX level 3. Nodes are classes and the directed edges are links between them in the database. The green nodes are the Pathway and PathwayStep classes, the blue nodes are Interaction classes and orange nodes are PhysicalEntity classes.
Figure 3. Graphs of the pathway ‘Apoptosis induced DNA fragmentation’.
Both graphs were extracted from the same BioPAX file. A) Graph recovered using the new P2G function; B) Graph recovered using P2RG function. In both panels blue nodes are proteins or protein complexes, white nodes are non-protein entities. Black encircled nodes are found in both graphs and red encircled nodes are only detected with the new P2G function. Names of the nodes are in Table 2.
Numbers of nodes and edges.
The number of nodes and edges of ten different pathways (Reactome Categories) are indicated as obtained after application of P2RG and P2G on the same set of BioPAX RDF information.
| Reactome Categories | P2RG
| P2RG
| P2G
| P2G
|
|---|---|---|---|---|
| Binding and Uptake of Ligands
| 0 | 0 | 68 | 56 |
| Cell-Cell communication | 13 | 14 | 142 | 142 |
| Disease | 3,396 | 5,878 | 4,888 | 12,159 |
| Gene Expression | 652 | 900 | 1,110 | 2,450 |
| Immune System | 1,431 | 2,233 | 2,419 | 5,045 |
| Membrane Trafficking | 86 | 121 | 181 | 382 |
| Metabolism | 3,082 | 5,922 | 3,479 | 11,289 |
| Signaling Pathways | 2,069 | 3,274 | 3,430 | 7,131 |
| Steroid hormones | 72 | 147 | 81 | 333 |
| Transcription | 281 | 420 | 623 | 1,324 |
Node names and locations of the “Apoptosis induced DNA fragmentation” pathway.
The first column has the names of the nodes in the pathwayas depicted in Figure 3. The second column has the actual name of the node and the third column the cellular location of the node. All this information is represented as given in Reactome version 51. The nodes shown with a black outline in Figure 3 are shown here in bold font.
| Node | Name | Location |
|---|---|---|
| Protein8776 | DFFB | Cytosol |
| Protein8777 | DFFA | Cytosol |
| Complex4232 | DFFA : DFFB | Cytosol |
| Complex4233 | Importin alpha : Importin beta | Cytosol |
| Complex4234 | DFF : associated with Importin alpha : Importin beta | Cytosol |
| Complex4235 | DFF : associated with Importin alpha : Importin beta | Nucleoplasm |
| Complex4169 | Active CASP3 | Cytosol |
|
|
| Nucleoplasm |
|
|
| Nucleoplasm |
|
|
| Nucleoplasm |
|
|
| Nucleoplasm |
| Protein8779 | DFFB | Nucleoplasm |
| Protein8784 | DFFA fragment | Nucleoplasm |
| Protein8785 | DFFA fragment | Nucleoplasm |
| Protein8783 | DFFA fragment | Nucleoplasm |
| Complex4241 | DFFB homodimer | Nucleoplasm |
|
|
| Nucleoplasm |
| Complex2061 | Histone H1 bound chromatin DNA | Nucleoplasm |
|
|
| Nucleoplasm |
| Protein8786 | HMGB1/HMGB2 | Nucleoplasm |
| PhysicalEntity109 | DNA | Nucleoplasm |
|
|
| Nucleoplasm |
|
|
| Nucleoplasm |