| Literature DB >> 25540777 |
Vincenzo Bonnici1, Francesco Russo2, Nicola Bombieri1, Alfredo Pulvirenti3, Rosalba Giugno3.
Abstract
Research attention has been powered to understand the functional roles of non-coding RNAs (ncRNAs). Many studies have demonstrated their deregulation in cancer and other human disorders. ncRNAs are also present in extracellular human body fluids such as serum and plasma, giving them a great potential as non-invasive biomarkers. However, non-coding RNAs have been relatively recently discovered and a comprehensive database including all of them is still missing. Reconstructing and visualizing the network of ncRNAs interactions are important steps to understand their regulatory mechanism in complex systems. This work presents ncRNA-DB, a NoSQL database that integrates ncRNAs data interactions from a large number of well established on-line repositories. The interactions involve RNA, DNA, proteins, and diseases. ncRNA-DB is available at http://ncrnadb.scienze.univr.it/ncrnadb/. It is equipped with three interfaces: web based, command-line, and a Cytoscape app called ncINetView. By accessing only one resource, users can search for ncRNAs and their interactions, build a network annotated with all known ncRNAs and associated diseases, and use all visual and mining features available in Cytoscape.Entities:
Keywords: cytoscape; gene expression; lncRNAs; microRNAs; networks; non-coding RNAs
Year: 2014 PMID: 25540777 PMCID: PMC4261811 DOI: 10.3389/fbioe.2014.00069
Source DB: PubMed Journal: Front Bioeng Biotechnol ISSN: 2296-4185
The number of imported elements from external resources and how many among them are present at least in another datasource.
| DataSource | Number of entities | Shared |
|---|---|---|
| CIRC2TRAITS | 83,432 | 326 |
| HMDD.2 | 8,040 | 282 |
| LNCRNADISEASE | 1,505 | 244 |
| MIRANDOLA.1.6 2246 | 98 | |
| NPINTER.2.0 | 138,328 | 440 |
| MIRTARBASE | 40,532 | 218 |
| STARBASE.V2.0 | 31,463 | 8 |
This representation of shared notation is dictated by the fact that the number of elements shared in three or more datasources is approximately close to 0.
Figure 1The database schema. The picture reports all the stored information together with their associations.
The total number of aliases associated with the imported elements from external resources and how many among them are present at least in another datasource.
| DataSource | Number of aliases | Shared |
|---|---|---|
| HGNC | 436,361 | 19,368 |
| NONCODE.V4 | 327,099 | 5,671 |
| LNCRNADB | 218 | 115 |
| CIRC2TRAITS | 16,730 | 1,076 |
| HMDD.2 | 1,376 | 1,376 |
| LNCRNADISEASE | 1,366 | 285 |
| MIRANDOLA.1.6 | 1,231 | 1,231 |
| NPINTER.2.0 | 7,678 | 4,857 |
| MIRTARBASE | 62,207 | 12,998 |
| STARBASE.V2.0 | 5,298 | 3,747 |
For each biological entity type we report the number of entries present in ncRNA-DB.
| Entity | Total | In relation |
|---|---|---|
| ncRNA | 193,440 | 25,463 |
| RNA | 4,962 | 4,962 |
| Gene | 19,271 | 12,265 |
| Disease | 1,330 | 735 |
| Others | 6,700 | 5,517 |
We report also the number of entities having relations with some other entities (details are given in Table .
The number of ncRNAs interacting with other ncRNA-DB biological entities.
| Relation | Total |
|---|---|
| ncRNA-ncRNA | 77,982 |
| ncRNA-RNA | 36,369 |
| ncRNA-gene | 52,611 |
| ncRNA-disease | 16,662 |
| ncRNA-others | 132,663 |
Figure 2We report the main architecture of our system. The resource is integrated through an import procedure and stored into OrientDB. All the data are exposed using three different user interfaces: (i) the ncINetView; Cytoscape app; (ii) the web app; and (iii) the command-line interface. All the data can be also queried using languages APIs and third party applications. ncRNA-DB is designed to be a server for client applications, thanks to the features offered by OrientDB.
Figure 3(A) The picture reports the retrieved Biogrid network (with the E2F6 query) with four nodes (E2F6, EZH1, EZH2, and ARAF) selected by the user; (B) the first panel of ncINetView allows the annotation of selected nodes with all ncRNA neighbors and all their interactions with actors of the network present in ncRNA-DB; (C) the resulted annotated network; (D) an extracted circuit by the user involving the long non-coding RNA HOTAIR, the selected genes and the hsa-mir-148p-3b. The presence of more than one edge connecting nodes is motivated by reporting in the table panel for each interaction specific information such as the support sentence.
Figure 4The add nodes panel of ncINetView. (A) The user performs a query specifying: diabetes mellitus, hypertension, and myocardial infarction. All elements found in ncRNA-DB are reported in the text area with the associated aliases. A check box is used to include the elements in the network generated by clicking in Import. User selects in Network among those present in the cytoscape network panel, which network must be annotated; together with the name of table columns containing the aliases and the type of each node (the last is optional). (B) The corresponding network view is generated.
Figure 5The search section of the web interface of ncRNA-DB.