| Literature DB >> 32180795 |
Noemi Di Nanni1,2, Matteo Bersanelli3,4, Luciano Milanesi1, Ettore Mosca1.
Abstract
The development of integrative methods is one of the main challenges in bioinformatics. Network-based methods for the analysis of multiple gene-centered datasets take into account known and/or inferred relations between genes. In the last decades, the mathematical machinery of network diffusion-also referred to as network propagation-has been exploited in several network-based pipelines, thanks to its ability of amplifying association between genes that lie in network proximity. Indeed, network diffusion provides a quantitative estimation of network proximity between genes associated with one or more different data types, from simple binary vectors to real vectors. Therefore, this powerful data transformation method has also been increasingly used in integrative analyses of multiple collections of biological scores and/or one or more interaction networks. We present an overview of the state of the art of bioinformatics pipelines that use network diffusion processes for the integrative analysis of omics data. We discuss the fundamental ways in which network diffusion is exploited, open issues and potential developments in the field. Current trends suggest that network diffusion is a tool of broad utility in omics data analysis. It is reasonable to think that it will continue to be used and further refined as new data types arise (e.g. single cell datasets) and the identification of system-level patterns will be considered more and more important in omics data analysis.Entities:
Keywords: biological networks; integrative analysis; network-diffusion; omics data; precision medicine
Year: 2020 PMID: 32180795 PMCID: PMC7057719 DOI: 10.3389/fgene.2020.00106
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Classification of integrative methods. Criteria: (A) Goals; (B) Input data; (C) Network Diffusion (ND) model: Random Walk (RW), Random Walk with Restart (RWR), Insulated Heat Diffusion (IHD), Diffusion Kernel (DK); (D) Molecular network; (E) ND usage.
Network diffusion based methods for the integrative analyses of multiple biological layers.
| Method | Input | Integration Level | ND | Network | Goal | Language and Availability | URLs | |
|---|---|---|---|---|---|---|---|---|
| Type | Application | |||||||
| Dmfind | gene mutations | single omics | RWR | ND-first | A priori | module detection | R package for download |
|
| EMDN | DNA methylation, gene expression | multiple omics | RWR | ND-first | Inferred | module detection | R package for download |
|
| EPU | gene expression, PPI, gene ontology, gene-phenotype association data and phenotype similarity network | multiple networks | RWR | ND-first | Mixed | gene prioritization | – | – |
| GeneMANIA | co-expression, PPI, genetic interaction, co-localization, shared protein domains | multiple networks | RWR | ND-first | A priori | function prediciton | Web server |
|
| Mashup | PPI | multiple networks | RWR | ND-first | A priori | function prediciton | Matlab code for download |
|
| M – module | gene mutation, gene expression | multiple omics | RWR | ND-first | Inferred | module detection | R package for download |
|
| mND | gene mutation, gene expression | single omics, multiple omics | RWR | ND-first | A priori | gene prioritization | R package for download |
|
| NetBag | gene expression | single omics | RWR | ND-first | A priori | disease subtyping | – | – |
| NetICS | aberration events, gene expression | multiple omics | IHM | ND-first | A priori | gene prioritization | Matlab code for download |
|
| NBS | gene mutations | single omics | RWR | ND-first | A priori | disease subtyping | Matlab code for download |
|
| NBS2 | gene mutations | single omics | RWR | ND-first | Mixed | disease subtyping | Phyton package for download |
|
| RegNet | CNV, gene expression | multiple omics | RW | ND-after | Inferred | gene prioritization | R package for download |
|
|
| gene mutations, gene expression | multiple omics | RWR | ND-first | A priori | gene prioritization | – | – |
|
| gene mutations, gene expression | multiple omics | RW | ND-first | Mixed | gene prioritization | – | – |
| SRF | gene mutations, gene expression | multiple omics | RWR | ND-first | A priori | disease subtyping | Java code for download |
|
| SNF | DNA methylation, gene expression | multiple omics | DK | ND-during | Inferred | survival prediction, disease subtyping | R and Matlab code for downloads |
|
| stSVM | gene expression (mRNA, miRNA) | multiple omics | DK | ND-after | A priori | gene prioritization, survival prediction | R package for download |
|
| TieDie | gene mutations, gene expression | multiple omics | IHM | ND-first | A priori | module detection | Python and Matlab code for downloads |
|
| WSNF | gene expression (mRNA, miRNA) | multiple omics | DK | ND-during | Inferred | survival prediction, disease subtyping | R package for download |
|
RWR, random walk with restart; DK, diffusion kernel; IHM, insulated heat model; RW, random walk; CNV, copy number variations; PPI, Protein-Protein interaction network.
Figure 2Ways in which ND enters the integrative analysis pipelines.
Figure 3Network diffusion methods for the integrative analyses of multiple biological layers. GP, gene prioritization; MD, module detection; FP, function prediction; DS, disease subtyping; SP, survival prediction. We classified methods according to their main use described by the respective authors.