| Literature DB >> 17683609 |
Rainer Opgen-Rhein1, Korbinian Strimmer.
Abstract
BACKGROUND: The use of correlation networks is widespread in the analysis of gene expression and proteomics data, even though it is known that correlations not only confound direct and indirect associations but also provide no means to distinguish between cause and effect. For "causal" analysis typically the inference of a directed graphical model is required. However, this is rather difficult due to the curse of dimensionality.Entities:
Mesh:
Year: 2007 PMID: 17683609 PMCID: PMC1995222 DOI: 10.1186/1752-0509-1-37
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Formulas for computing partial variances and partial correlations
| Definition | True value | Estimate | |
| Covariance matrix: | cov( | ||
| Concentration matrix: | |||
| Variances: | var( | ||
| Partial variances | var( | ||
| Correlations: | corr( | ||
| Partial correlations: | corr( |
Index i runs from 1 to n (sample size), and indices k and l run from 1 to p (dimension). A tilde denotes a "partial" quantity.
Figure 1Correlation network inferred from the Arabidopsis thaliana data. The solid and dotted lines indicate positive and negative correlation coefficients, respectively, and the line intensity denotes their strength. The network displays the 150 edges with the largest absolute correlation. For annotation of the nodes in this graph see the electronic information contained in the R package "GeneNet" [40] and the original data paper [35].
Figure 2Distribution of log for the Arabidopsis thaliana data. The null distribution is depicted by the dashed line; it follows a normal distribution with zero mean and a standard deviation of 0.014. The solid line signifies the alternative distribution. The empirical distribution (indicated by the histogram) is composed of the null distribution (η0 = 0.8995) and of the alternative distribution (η= 0.1005).
Figure 3Partially causal network inferred from the Arabidopsis thaliana data by the method introduced in this paper – note the difference to the correlation network of Figure 1. The topology of the partially causal network is identical to that of a partial correlation graph (GGM, CIG). However, edges with significant directionality (as indicated by a factor that is significantly smaller or larger than one) are oriented.