| Literature DB >> 35489073 |
Zhitao Mao1,2, Ruoyu Wang1,2, Haoran Li1,2, Yixin Huang3, Qiang Zhang4, Xiaoping Liao1,2, Hongwu Ma1,2.
Abstract
Cellular regulation is inherently complex, and one particular cellular function is often controlled by a cascade of different types of regulatory interactions. For example, the activity of a transcription factor (TF), which regulates the expression level of downstream genes through transcriptional regulation, can be regulated by small molecules through compound-protein interactions. To identify such complex regulatory cascades, traditional relational databases require ineffective additional operations and are computationally expensive. In contrast, graph databases are purposefully developed to execute such deep searches efficiently. Here, we present ERMer (E. coli Regulation Miner), the first cloud platform for mining the regulatory landscape of Escherichia coli based on graph databases. Combining the AWS Neptune graph database, AWS lambda function, and G6 graph visualization engine enables quick search and visualization of complex regulatory cascades/patterns. Users can also interactively navigate the E. coli regulatory landscape through ERMer. Furthermore, a Q&A module is included to showcase the power of graph databases in answering complex biological questions through simple queries. The backend graph model can be easily extended as new data become available. In addition, the framework implemented in ERMer can be easily migrated to other applications or organisms. ERMer is available at https://ermer.biodesign.ac.cn/.Entities:
Year: 2022 PMID: 35489073 PMCID: PMC9252789 DOI: 10.1093/nar/gkac288
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 19.160
Figure 1.The graph database schema and architecture of ERMer.
Various types of regulatory interactions in ERMer
| Classification | Number | Sources |
|---|---|---|
| Chemical protein interaction (CPI) | 7067 | BRENDA, RegulonDB, and STITCH |
| Transcriptional factor regulation (TFGI) | 4734 | RegulonDB |
| Sigma factor regulation (SFGI) | 2352 | RegulonDB |
| sRNA regulation (sRGI) | 145 | RegulonDB |
| Protein–protein interaction (PPI) | 9102 | STRING |
| Reaction metabolite interaction (RMI) | 3163 | iML1515 |
| Metabolite reaction interaction (MRI) | 3096 | iML1515 |
| Pathway reaction interaction (PRI) | 2375 | iML1515 |
| Gene reaction interaction (GRI) | 4297 | iML1515 |
Figure 2.Interactive search. (A) The first order neighbors of glycine; (B) Interactive search setting and the corresponding gremlin query; (C) four times ‘Interactive search’ find a path between Glycine and gcvT.
Figure 3.Regulatory cascades from glycine to gcvT using graph databases.
Figure 4.Retrieval of key TFs regulating both pathways.