| Literature DB >> 34745229 |
John P Thomas1,2,3, Dezso Modos1,2, Tamas Korcsmaros1,2, Johanne Brooks-Warburton4,5.
Abstract
Inflammatory bowel disease (IBD) is a chronic immune-mediated condition arising due to complex interactions between multiple genetic and environmental factors. Despite recent advances, the pathogenesis of the condition is not fully understood and patients still experience suboptimal clinical outcomes. Over the past few years, investigators are increasingly capturing multi-omics data from patient cohorts to better characterise the disease. However, reaching clinically translatable endpoints from these complex multi-omics datasets is an arduous task. Network biology, a branch of systems biology that utilises mathematical graph theory to represent, integrate and analyse biological data through networks, will be key to addressing this challenge. In this narrative review, we provide an overview of various types of network biology approaches that have been utilised in IBD including protein-protein interaction networks, metabolic networks, gene regulatory networks and gene co-expression networks. We also include examples of multi-layered networks that have combined various network types to gain deeper insights into IBD pathogenesis. Finally, we discuss the need to incorporate other data sources including metabolomic, histopathological, and high-quality clinical meta-data. Together with more robust network data integration and analysis frameworks, such efforts have the potential to realise the key goal of precision medicine in IBD.Entities:
Keywords: gene coexpression network; gene regulatory network; inflammatory bowel disease; metabolic network; multilayered network; network biology; precision medicine; protein-protein interaction network
Year: 2021 PMID: 34745229 PMCID: PMC8566351 DOI: 10.3389/fgene.2021.760501
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Characteristics of various network types discussed in this review and their main advantages and disadvantages.
| Network type | Node | Edge | Required information to build the network | Pros | Cons |
|---|---|---|---|---|---|
| Protein-protein interaction networks | Proteins | Physical interactions | Measurement of the actual protein interactions e.g. using yeast two-hybrid, affinity purification mass spectrometry or small-scale binding experiments | Many different resources, based on physical interactions ensuring larger coverage | Highly incomplete, biases in network generating methods |
| Metabolic networks | Metabolites | Enzymes, reactions | Measured reactions of the enzymes | Most complete network type, good for systematic modelling | Need to decide what parameter to optimise |
| Gene regulatory networks | Transcription factors, promoters, enhancers, and target genes | Regulatory interaction | Measurement or modelling of the regulatory interactions e.g. using ChIP-seq, yeast one-hybrid, or through inference from transcriptomics | Various network building approaches to build large coverage and make it research question specific | Highly variable and state-specific, cannot infer feedback loops from transcriptomics only |
| Gene co-expression networks | Genes | Similarity between the expression of two genes | Gene expression measurement | Needs only transcriptomic data | Correlation does not always equal causation |
FIGURE 1Basic network biological nomenclature and concepts. Hubs are nodes with a high number of interactions (edges). Modules are regions of the network where the nodes interact with members of the region more than with non-members. Bottlenecks are nodes which are connecting two.
Network resources relevant to IBD research.
| Name | Description | Website | Latest version (year) |
|---|---|---|---|
| STRING | Large PPI database with various sources and confidence scores. It contains text mining data and also other databases. It has both directed and undirected interactions |
| 11.5 (2021) |
| BioGRID | Genetic and protein interactions from both high and low throughput experiments |
| 4.4.201 (2021) |
| BioPlex | Large affinity-purification mass spectrometry based database. It contains undirected PPI data |
| 3.0 (2021) |
| HAPPI-2 | Large database collection of PPI data with confidence scores |
| HAPPI 2.0 (2017) |
| IntAct | Large PPI database collection. Mostly undirected interactions | https:// | 4.2.18 (2021) |
| Reactome | Large reaction-centric PPI database, concentrating on signalling with well-developed toolsets. It has directed interactions |
| 77 (2021) |
| WikiPathways | Community curated database of signalling pathways. It has varying coverage | https:// | September 2021 (2021) |
| SignaLink | Multi-layered database of signalling pathways with a manually curated core extended by regulatory data, external datasets and predictions |
| 3.0 (2021) |
| Signor | Manually curated signalling network |
| 2.0 (2020) |
| CellPhoneDB | Network database containing directed intercellular ligand-receptor interactions (i.e. a type of PPI network database) |
| 2.1.7 (2021) |
| Ramilowski et al. | Directed intercellular ligand-receptor interaction (PPI) network database developed by the FANTOM5 team |
| (2015) |
| DoRothEA | Transcription factor (TF)-target gene (i.e. GRN) database with varying confidence levels and an easy-to-use application programming interface (API) |
| 1.5.0 (2021) |
| TRRUST | Manually curated transcription factor (TF)-target gene (i.e. GRN) database |
| 2 (2017) |
| HuRI | References interactome of human binary protein-protein interactions captured using high throughput yeast two-hybrid assays |
| April 2020 (2020) |
| ConsensusPathDB | A meta-database of binary and complex protein-protein, genetic, metabolic, signaling, gene regulatory and drug-target interactions, as well as biochemical pathways, originating from over 30 publicly available resources |
| Release 35 (2021) |
| OmniPath | One-stop solution of intracellular and intercellular interactions. It contains almost all the above mentioned databases and has a programmatically accessible application programming interface (API) both in R and |
| 2.0 (2021) |
FIGURE 2Various methods for generating PPI networks in IBD. (A) Known IBD-associated genes can be mapped to a PPI network and the nearby genes in the network can be associated with IBD as well (guilt by association) (B) Mapping a transcriptome to the PPI network can elucidate disease-specific modules in the network (C) Single-cell RNA-seq data combined with intercellular (ligand-receptor) communication networks can show how various cells are interacting with each other in disease or healthy states. For b) the data from (Olsen et al., 2009) was used. For c) the uniform manifold approximation and projection (UMAP) plot (a nonlinear dimensionality reduction technique for visualising high-dimensional data) was generated using data from Lukassen et al., 2020 (Lukassen et al., 2020).
FIGURE 3Flux balance analysis - the basics of metabolic network modelling. For metabolic networks the initial step involves collecting the metabolic reactions that form the network. These reactions are represented by a stoichiometric matrix where each reaction is represented by the nodes and metabolites by the edges. The aim of flux balance analysis is to find the optimal vector (flux) that yields the maximum output for a given metabolite or metabolites (Z) through these reactions. For illustration, the glucose metabolism was used from Köenig et al., 2012 (König et al., 2012).
FIGURE 4Bayesian network construction for gene regulatory networks. From the high dimension of gene expression data, the correlations between genes can be calculated. These correlations can be modelled as conditional probabilities and, using the Bayes theorem, a casual gene regulatory network can be constructed.
FIGURE 5Gene co-expression network analysis. Calculating a similarity between the genes from expression data can be used as an adjacency matrix in a co-expression network. The similarity function depends on the used method but after that the most similar parts of the network can be denoted as modules.
FIGURE 6Future perspectives of using network biology and network based modeling in IBD research. From the large amount of omics datasets (genomics, transcriptomics, metabolomics, metagenomics), various interaction networks can be used to develop sophisticated network models, ideally in a multi-layered fashion. Adding granularity with patient metadata from large databases can help to validate these models and will result in better understanding of IBD pathogenesis, novel/personalised therapeutic strategies, and clinical decision-driving signatures.