| Literature DB >> 36087049 |
Jack Kelly1, Carlo Berzuini1, Bernard Keavney2,3, Maciej Tomaszewski2,4, Hui Guo1.
Abstract
BACKGROUND: With the increasing availability and size of multi-omics datasets, investigating the casual relationships between molecular phenotypes has become an important aspect of exploring underlying biology andgenetics. There are an increasing number of methodlogies that have been developed and applied to moleular networks to investigate these causal interactions.Entities:
Keywords: Bayesian networks; causal inference; causal molecular network; mendelian randomisation; omics
Mesh:
Year: 2022 PMID: 36087049 PMCID: PMC9544222 DOI: 10.1002/mgg3.2055
Source DB: PubMed Journal: Mol Genet Genomic Med ISSN: 2324-9269 Impact factor: 2.473
FIGURE 1(a) an example of an undirected network, (b) a directed network and (c) a mixed network. Mixed networks have both directed and undirected edges.
FIGURE 2(a) Schematic representation of MR. MR infers the causal effect of an exposure (phenotype) on the outcome using instrumental variables (IVs). (b) Causal Bayesian networks connect nodes via directed edges determined by conditional independence, which is present when the relationship between two nodes is independent conditioning on all other nodes in the graph. (c) Schematic representation of the PC algorithm. The true causal graph is shown in (b). The PC algorithm initially begins with an undirected fully connected graph (i) and uses data to create a skeleton graph with undirected edges. In this case, the X1 − X2 edge is removed because X1 is independent of X2 (ii) and the edges between X1 − X4 are removed as the nodes are independent given X3 . The same is true for the X2 − X4 edge (iii). Then v‐structures are identified (iv) and final edges oriented (v) (Le et al., 2019).
Summary of the discovery methods for analysis of causal molecular networks including the software available
| Methodologies | Data source required | Advantages | Disadvantages | Software available |
|---|---|---|---|---|
|
| ||||
| Mendelian randomisation (MR) | GWAS, omics |
Only requires summary statistics, fast to run Estimates causal effect size |
Data must meet certain (possibly untestable) assumptions Incapable of modelling complex relationships |
MendelianRandomization (R package) (Yavorska & Burgess, TwoSampleMR (R package) (Hemani et al., MR‐Base (Hemani et al., |
| Bayesian MR | GWAS, omics |
Flexibility of modelling complex data structure (overlapping samples, horizontal pleiotropy, interactions, multiple exposures) Estimates causal effect size |
Data must meet certain (possibly untestable) assumptions Computationally intensive, applicable to small‐to‐medium causal networks | |
|
| ||||
| Bayesian networks (BN) | omics |
Can generate larger causal networks Causal edges probabilities are given Estimates causal effect size |
Computationally intensive limiting network size |
Bnlearn (R and Python package) (Scutari, BayesNetty (Howey et al., |
| PC algorithm (Spirtes et al., | omics |
Relatively fast compared to other BNs |
Although faster than alternatives, computationally challenging when run on very large datasets Causal effect size is not inferred |
Bnlearn (R and Python package) (Scutari, Pcalg (R package) (Hauser & Bühlmann, |
|
| ||||
| Bayesian and MR (findr (Wang et al., | GWAS, omics |
Undirected network construction followed edge directions inferred using MR |
Still computationally intensive and applications have been on subsets of omics data Causal effect size is not inferred |
findr (R package) (Wang et al., MRPC (R package) (Badsha & Fu, |
| Genome Granularity DAG (GDAG) (Yazdani et al., | GWAS, omics |
Undirected network construction followed edge directions inferred using MR |
Still computationally intensive and applications have been on subsets of omics data Causal effect size is not inferred | |
| Causal Graphical Analysis Using GEnetics (cGAUGE) (Amar et al., | GWAS, omics |
Approach has greater power and lower false discovery rate than BNs |
Computationally intensive Causal effect size is not inferred Greater power than BN, reduced power in presence of horizontal pleiotropy |
cGUAGE (R code: |
|
| ||||
| Granger causality (Granger, | Time series omics |
Allows causal inference using time series omics data |
Time intervals between measurements needs to be enough for a noticeable change to take place Needs to be no confounders |
lmtest (R package) (Zeileis & Hothorn, statsmodels.tsa.stattools.grangercausalitytests (Python package) (Seabold & Perktold, |
| Optimal Causation Entropy (OCE) (Sun et al., | Time series omics |
Outperform Granger causality using time series omics data Can generate large scale causal networks |
Assumes stationarity which can be violated by confounders |
TIGRAMITE (Python package) (Runge et al., |