| Literature DB >> 30458802 |
Antonio Messina1, Antonino Fiannaca1, Laura La Paglia1, Massimo La Rosa2, Alfonso Urso1.
Abstract
BACKGROUND: Several online databases provide a large amount of biomedical data of different biological entities. These resources are typically stored in systems implementing their own data model, user interface and query language. On the other hand, in many bioinformatics scenarios there is often the need to use more than one resource. The availability of a single bioinformatics platform that integrates many biological resources and services is, for those reasons a fundamental issue. DESCRIPTION: Here, we present BioGraph, a web application that allows to query, visualize and analyze biological data belonging to several online available sources. BioGraph is built upon our previously developed graph database called BioGraphDB, that integrates and stores heterogeneous biological resources and make them available by means of a common structure and a unique query language. BioGraph implements state-of-the-art technologies and provides pre-compiled bioinformatics scenarios, as well as the possibility to perform custom queries and obtaining an interactive and dynamic visualization of results.Entities:
Keywords: BioGraphDB; Bioinformatics databases; Graph databases; Integrated databases; miRNA
Mesh:
Substances:
Year: 2018 PMID: 30458802 PMCID: PMC6245492 DOI: 10.1186/s12918-018-0616-4
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Fig. 1BioGraphDB scheme. The graph data model of BioGraphDB consists of a set of nodes/vertices classes matching the biological entities provided by the used data sources. Relationships between entities are modeled as relations, or edges
Associations between graph entities and biological information
|
|
| # |
|
|
|---|---|---|---|---|
| Nodes | Gene | 59839 | Genes | NCBI entrez genes |
| Go | 43969 | Functional annotations | Gene ontology | |
| Protein | 20193 | Proteins | UniProtKB | |
| Pathway | 1920 | Pathways | Reactome | |
| MiRNA | 28645 | miRNA precursors | miRBase | |
| MiRNAmature | 38558 | miRNA matures | miRBase | |
| MiRNAsnp | 236 | miRNA SNPs | miRNASNP | |
| Cancer | 107 | Cancers | mirCancer | |
| ProteinName | 219132 | Proteins accessions | UniProtKB | |
| GeneName | 115027 | Genes symbols | HGNC | |
| Interaction | 913285 | miRNA-target interactions | mirTarBase,miRanda | |
| Relations | ANNOTATES | 514528 | Links to annotated entities | Gene ontology |
| CONTAINS | 99979 | Links to entities in pathways | Reactome | |
| PRECURSOR_OF | 38558 | Precursors-matures relations | miRBase | |
| HAS_SNP | 236 | miRNAs-mutations relations | miRNASNP | |
| SYNONYM_OF | 115027 | Symbols-genes relations | HGNC | |
| REFERS_TO | 219132 | Accessions-proteins relations | UniProtKB | |
| INTERACTING_GENE | 913285 | Genes-interactions relations | mirTarBase,miRanda,miRNASNP | |
| INTERACTING_MIRNA | 657904 | miRNA-interactions relations | mirTarBase,miRanda | |
| INTERACTING_SNP | 255381 | SNPs-interactions relations | miRNASNP |
Fig. 2BioGraph architectural stack. The pictures gives an overview on the state-of-the-art technology behind BioGraph. Used tools are grouped into three levels. From the bottom to the top: the Graph Data level overlooked by Apache Tinkerpop, the Microservices level, and the Web Application level. Protocols and data formats of inter-level communications are also highlighted
Fig. 3The Templates tab. Templates are simple predefined queries given as examples of how an user can traverse BioGraphDB. The queries are customizable and grouped by category
Fig. 4The Scenarios tab. The four proposed scenarios are examples of how complex Gremlin queries can help in the analysis of non-trivial bioinformatics problems
Fig. 5The Gremlin Workbench. It is basically composed of four panes: 1) the Gremlin query pane, 2) Tree View, 3) Graph view-port, 4) the Details pane
Fig. 6Case study scenario. The functional analysis of miRNAs in breast cancer can be done starting from the first scenario in the Scenarios tab and personalizing the values of parameters
Fig. 7Custom Gremlin query. Gremlin Workbench allows the user to manually type and execute any query he wants
Fig. 8Results for the case study scenario. The user can immediately study the results given in graphical and tree form. He can also (1) export them in several formats or (2) perform some data analysis, when applicable
Fig. 9GO analysis Analysis of data through p-value calculation. As the figure shows, it is possible to deeply investigate a specific field of the interrogation. In the shown case study figure, it is possible to investigate about a specific annotation linked to a particular gene target
Technical and technological features of BioGraph and other considered integrated systems
|
|
|
|
|
|
|---|---|---|---|---|
| ncRNA-DB | Graph | OrientDB | Yes | OrientDB OSQL |
| JBioWH | Relational | MySQL | Yes | SQL, JBioWH API |
| mirWalk | Relational | MySQL | No | SQL |
| ARN | Relational | MS SQL Server | No | SQL |
| SSER | Relational | MySQL | No | SQL |
| Bio4j | Graph | Titan | Yes | Anguillos |
| HumanMine | Object oriented | PostgreSQL | Yes | SQL |
| BioGraph | Graph | Neo4j | Yes | Gremlin |
Content type and functional features of BioGraph and other considered integrated systems
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| ncRNA-DB | ncRNAs, RNAs, genes, diseases | Yes | Yes | Yes | No | No |
| JBioWH | Genes, proteins, proteins clusters, proteins domains, chromosomes, enzymes, ppi, pathways, reactions, drugs, taxonomies, functional annotations | Desktop client | No | Yes | No | Yes |
| mirWalk | Genes, miRNAs, functional annotations, miRNA-target interactions | Yes | No | No | Yes | No |
| ARN | Genes, miRNAs, regulations of adipogenesis | Yes | Yes | No | Yes | No |
| SSER | Essential reactions | Yes | No | No | No | No |
| Bio4j | Proteins, taxonomy, functional annotations, enzymes | Command line | No | Yes | No | Yes |
| HumanMine | Genes, proteins, protein domains, protein localizations, pathways, genes expressions, functional annotations, diseases, phenotypes, molecular interactions, genetic interactions | Yes | No | Yes | Yes | Yes |
| BioGraph | Genes, proteins, miRNAs, pathways, functional annotations, miRNA-target interactions, miRNA-cancer relations, miRNA-SNP relations | Yes | Yes | Yes | Yes | Yes |