| Literature DB >> 18940869 |
Atanas Kamburov1, Christoph Wierling, Hans Lehrach, Ralf Herwig.
Abstract
ConsensusPathDB is a database system for the integration of human functional interactions. Current knowledge of these interactions is dispersed in more than 200 databases, each having a specific focus and data format. ConsensusPathDB currently integrates the content of 12 different interaction databases with heterogeneous foci comprising a total of 26,133 distinct physical entities and 74,289 distinct functional interactions (protein-protein interactions, biochemical reactions, gene regulatory interactions), and covering 1738 pathways. We describe the database schema and the methods used for data integration. Furthermore, we describe the functionality of the ConsensusPathDB web interface, where users can search and visualize interaction networks, upload, modify and expand networks in BioPAX, SBML or PSI-MI format, or carry out over-representation analysis with uploaded identifier lists with respect to substructures derived from the integrated interaction network. The ConsensusPathDB database is available at: http://cpdb.molgen.mpg.de.Entities:
Mesh:
Year: 2008 PMID: 18940869 PMCID: PMC2686562 DOI: 10.1093/nar/gkn698
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Database content and pairwise database overlaps in terms of matching physical entities and similar interactions
| Reactome | Kegg | Humancyc | Pid | Biocarta | Netpath | Intact | Dip | Mint | Hprd | Biogrid | Spike | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Database overlaps in terms of matching physical entities | ||||||||||||
| Reactome | 1037 | 893 | 560 | 796 | 241 | 1067 | 352 | 1217 | 2041 | 1704 | 1506 | |
| Kegg | 1037 | 1482 | 59 | 383 | 15 | 225 | 23 | 329 | 1297 | 540 | 508 | |
| Humancyc | 893 | 1482 | 253 | 688 | 110 | 553 | 138 | 701 | 1656 | 1065 | 947 | |
| Pid | 560 | 59 | 253 | 764 | 354 | 799 | 371 | 886 | 1357 | 1263 | 1027 | |
| Biocarta | 796 | 383 | 688 | 764 | 302 | 892 | 392 | 1011 | 1695 | 1477 | 1346 | |
| Netpath | 241 | 15 | 110 | 354 | 302 | 392 | 173 | 450 | 615 | 568 | 479 | |
| Intact | 1067 | 225 | 553 | 799 | 892 | 392 | 576 | 2660 | 3546 | 3016 | 3237 | |
| Dip | 352 | 23 | 138 | 371 | 392 | 173 | 576 | 637 | 865 | 828 | 707 | |
| Mint | 1217 | 329 | 701 | 886 | 1011 | 450 | 2660 | 637 | 4684 | 3939 | 3915 | |
| Hprd | 2041 | 1297 | 1656 | 1357 | 1695 | 615 | 3546 | 865 | 4684 | 7185 | 5973 | |
| Biogrid | 1704 | 540 | 1065 | 1263 | 1477 | 568 | 3016 | 828 | 3939 | 7185 | 5248 | |
| Spike | 1506 | 508 | 947 | 1027 | 1346 | 479 | 3237 | 707 | 3915 | 5973 | 5248 | |
| Database overlaps in terms of similar interactions | ||||||||||||
| Reactome | 262 | 122 | 101 | 80 | 34 | 97 | 31 | 51 | 304 | 212 | 118 | |
| Kegg | 262 | 213 | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Humancyc | 122 | 213 | 0 | 2 | 0 | 1 | 2 | 2 | 7 | 4 | 2 | |
| Pid | 101 | 0 | 0 | 264 | 96 | 63 | 46 | 73 | 333 | 243 | 186 | |
| Biocarta | 80 | 4 | 2 | 264 | 67 | 51 | 35 | 43 | 139 | 114 | 173 | |
| Netpath | 34 | 0 | 0 | 96 | 67 | 57 | 34 | 123 | 821 | 510 | 224 | |
| Intact | 97 | 0 | 1 | 63 | 51 | 57 | 312 | 2714 | 3216 | 1583 | 4080 | |
| Dip | 31 | 0 | 2 | 46 | 35 | 34 | 312 | 389 | 821 | 656 | 418 | |
| Mint | 51 | 0 | 2 | 73 | 43 | 123 | 2714 | 389 | 7211 | 4428 | 5676 | |
| Hprd | 304 | 0 | 7 | 333 | 139 | 821 | 3216 | 821 | 7211 | 19463 | 11351 | |
| Biogrid | 212 | 0 | 4 | 243 | 114 | 510 | 1583 | 656 | 4428 | 19463 | 10303 | |
| Spike | 118 | 0 | 2 | 186 | 173 | 224 | 4080 | 418 | 5676 | 11351 | 10303 | |
Current versions of the integrated databases are: Reactome 25, KEGG 47.0, HumanCyc 12.1, PID 2008_06_10, BioCarta 2008_01_08, NetPath downloaded on 6.7.2008, IntAct 2008-06-27, DIP 2008-01-13, MINT 2008-05-19, HPRD I_090107, BioGRID 2.0.42 and SPIKE downloaded on 6.7.2008.
Figure 1.Graphical summary of the main functions of the ConsensusPathDB web interface. The search for interactions of specific entities or pathways, search for interaction paths, overrepresentation analysis and model upload induce specific interaction subnetworks that can be visualized in the dynamic visualization environment and exported in BioPAX format for further computational analysis, or as images. Different colors of nodes and edges refer to different types of interactions or entities and different source databases, respectively. The background image shows a screenshot of the ConsensusPathDB start page.
Figure 2.Illustration of the mapping procedure. Two biochemical reactions that are identical according to their primary participants and the user-specified mapping criteria are mapped. The reaction A + B -> C + D, catalyzed by enzyme E1, originates from database X. A similar reaction, catalyzed by enzymes E1 and E2, originates from database Y. Since both reactions are identical according to the user-specified rules, the interactions are merged and visualized reflecting the source annotations.