| Literature DB >> 34128961 |
François Moreews1,2, Hugo Simon1, Anne Siegel1, Florence Gondret2, Emmanuelle Becker1.
Abstract
SUMMARY: PAX2GRAPHML is an open source Python library that allows to easily manipulate BioPAX source files as regulated reaction graphs described in .graphml format. The concept of regulated reactions, which allows connecting regulatory, signaling and metabolic levels, has been used. Biochemical reactions and regulatory interactions are homogeneously described by regulated reactions involving substrates, products, activators and inhibitors as elements. PAX2GRAPHML is highly flexible and allows generating graphs of regulated reactions from a single BioPAX source or by combining and filtering BioPAX sources. Supported by the graph exchange format .graphml, the large-scale graphs produced from one or more data sources can be further analyzed with PAX2GRAPHML or standard Python and R graph libraries.Entities:
Year: 2021 PMID: 34128961 PMCID: PMC8665752 DOI: 10.1093/bioinformatics/btab441
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Example of reaction graph manipulated by PAX2GRAPHML showing reactions and entities as nodes
BioPAX files transformation of datasources available in PC into regulated reaction graphs with PAX2GRAPHML
| Data sources | Nodes | Reaction nodes | Entity nodes | Edges | Substrate of | Product of | Activator of | Inhibitor of |
|---|---|---|---|---|---|---|---|---|
| PC | 175 262 | 85 750 | 89 512 | 639 945 | 84 496 | 95 743 | 52 009 | 407 697 |
| CTD | 44 639 | 19 814 | 24 825 | 98 993 | 18 538 | 19 073 | 35 077 | 26 305 |
| HumanCyc | 5733 | 1778 | 3955 | 11 890 | 3875 | 4455 | 3560 | 0 |
| INOH | 4315 | 2188 | 2127 | 7409 | 4247 | 3162 | 0 | 0 |
| Intact complex | 2187 | 563 | 1624 | 2869 | 2306 | 563 | 0 | 0 |
| KEGG | 3133 | 1560 | 1573 | 7041 | 3488 | 3553 | 0 | 0 |
| Mirtarbase | 32 727 | 15 064 | 17 663 | 395 703 | 0 | 15 064 | 0 | 380 639 |
| Panther | 3766 | 1662 | 2104 | 5263 | 2914 | 2165 | 142 | 42 |
| PID | 9403 | 4495 | 4908 | 14 827 | 6613 | 4544 | 3233 | 437 |
| Reactome | 31 718 | 11 404 | 20 314 | 46 541 | 27 210 | 15 358 | 3717 | 256 |
| Reconx | 6956 | 2821 | 4135 | 20 485 | 7722 | 7689 | 5074 | 0 |
| PC | 119 285 | 55 184 | 64 101 | 167 515 | 83 782 | 66 056 | 16 939 | 7 38 |
| PID and HumanCyc | 12 561 | 4518 | 8043 | 16 085 | 3575 | 5859 | 6221 | 430 |
| PID and HumanCyc and KEGG | 15 564 | 6079 | 9485 | 19 651 | 5332 | 7652 | 6237 | 430 |
| PID and HumanCyc and KEGG and Reactome | 38 585 | 10 398 | 28 187 | 43 899 | 16 087 | 17 371 | 9756 | 685 |
Note: Single datasources transformations were performed with the sub-package pax_import. Combination of several datasources was performed by filtering the PC* all sources graphml file with the sub-package extract. Nodes are either reactions or entities (proteins, small molecules, etc.). The numbers of regulated reactions computed by PAX2GRAPHML, together with the number of substrates, products, activators, inhibitors, are indicated. All these graphs can be directly downloaded from the PAX2GRAPHML website.
‘PC' version 12, September 2019.
KEGG, July 2011 (only human, hsa* files).