| Literature DB >> 24135261 |
Christian Theil Have1, Lars Juhl Jensen.
Abstract
Entities:
Mesh:
Year: 2013 PMID: 24135261 PMCID: PMC3842757 DOI: 10.1093/bioinformatics/btt549
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Relational versus graph database representation of a small protein interaction network. In the relational database, the network is stored as an interactions table (left). By contrast a graph database directly stores interactions as pointers between protein nodes (right). Below, we show the queries to identify second-order interaction partners in SQL and Cypher, respectively
Query benchmark of a relational and a graph database
| Neighbor network | Best-scoring path | Shortest path | |
|---|---|---|---|
| PostgreSQL | 206.31 s | 1147.74 s | 976.22 s |
| Neo4j | 5.68 s | 1.17 s | 0.40 s |
| Speedup | 36× | 981× | 2441× |
Note: For each of three selected tasks, we ran the corresponding queries for randomly selected human proteins/protein pairs and report the average time. We used a Linux machine equipped with a 3GHz quad-core Intel Core i3 processor, 4 GB random access memory and a 250 GB 7200 rpm hard drive.
aNeighbor networks cannot be expressed as a single Cypher query. Instead we report the total time of all queries involved in solving this task. Similar speedup was observed for PostgreSQL when similarly decomposing the complex query into multiple simple queries.