| Literature DB >> 24381584 |
Hyonho Chun1, Min Chen2, Bing Li3, Hongyu Zhao4.
Abstract
It is challenging to identify meaningful gene networks because biological interactions are often condition-specific and confounded with external factors. It is necessary to integrate multiple sources of genomic data to facilitate network inference. For example, one can jointly model expression datasets measured from multiple tissues with molecular marker data in so-called genetical genomic studies. In this paper, we propose a joint conditional Gaussian graphical model (JCGGM) that aims for modeling biological processes based on multiple sources of data. This approach is able to integrate multiple sources of information by adopting conditional models combined with joint sparsity regularization. We apply our approach to a real dataset measuring gene expression in four tissues (kidney, liver, heart, and fat) from recombinant inbred rats. Our approach reveals that the liver tissue has the highest level of tissue-specific gene regulations among genes involved in insulin responsive facilitative sugar transporter mediated glucose transport pathway, followed by heart and fat tissues, and this finding can only be attained from our JCGGM approach.Entities:
Keywords: GGMs; Gaussian graphical models; conditional GGMs; gene networks; joint sparsity
Year: 2013 PMID: 24381584 PMCID: PMC3865369 DOI: 10.3389/fgene.2013.00294
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Illustration of conditional GGM: . When the marker effect is ignored, there are two edges in a graphical model: 1 ↔ 2 and 2 ↔ 3. After considering the marker effect, there is a single edge, represented with a solid line, in a conditional graphical model.
Results for Case 1.
| GGMs | 0.081 (0.002) | 0.755 (0.004) | 0.222 (0.004) | 0.518 (0.008) | 0.703 (0.002) |
| CGGMs | 0.946 (0.001) | 0.063 (0.002) | 0.999 (0.000) | 0.000 (0.000) | 5087.146 (135.93) |
| JGGM | 0.053 (0.002) | 0.560 (0.004) | 0.067 (0.002) | 0.524 (0.005) | 0.564 (0.002) |
| JCGGM | 0.114 (0.013) | 0.459 (0.007) | 0.134 (0.014) | 0.434 (0.008) | 2.517 (0.624) |
| GGMs | 0.051 (0.001) | 0.475 (0.003) | 0.144 (0.003) | 0.262 (0.004) | 0.577 (0.001) |
| CGGMs | 0.054 (0.001) | 0.335 (0.003) | 0.152 (0.003) | 0.164 (0.004) | 0.348 (0.002) |
| JGGM | 0.027 (0.002) | 0.383 (0.002) | 0.030 (0.001) | 0.346 (0.003) | 0.504 (0.001) |
| JCGGM | 0.020 (0.001) | 0.329 (0.002) | 0.021 (0.001) | 0.298 (0.003) | 0.263 (0.001) |
The performances of GGMs, CGGMs, JGGMs, and JCGGMs are compared with the comparison criteria explained in subsection 3.1. When the sample size is small, the separate CGGMs select many false positives, which can be alleviated with JCGGMs. Under the scenario which is favored to JGGM, the JCGGM performs as well as the JGGM in both small and large sample cases.
Results for Case 2.
| GGMs | 0.143 (0.003) | 0.685 (0.005) | 0.367 (0.006) | 0.359 (0.007) | 0.692 (0.002) |
| CGGMs | 0.945 (0.001) | 0.066 (0.002) | 1.000 (0.000) | 0.000 (0.000) | 5343.2 (142.343) |
| JGGM | 0.011 (0.005) | 0.907 (0.006) | 0.014 (0.005) | 0.890 (0.006) | 71.99 (71.27) |
| JCGGM | 0.112 (0.013) | 0.467 (0.008) | 0.133 (0.013) | 0.444 (0.008) | 2.992 (0.84) |
| GGMs | 0.161 (0.002) | 0.226 (0.002) | 0.365 (0.004) | 0.061 (0.002) | 0.471 (0.002) |
| CGGMs | 0.080 (0.001) | 0.228 (0.002) | 0.189 (0.003) | 0.060 (0.002) | 0.328 (0.002) |
| JGGM | 0.103 (0.001) | 0.164 (0.002) | 0.135 (0.002) | 0.132 (0.003) | 0.392 (0.001) |
| JCGGM | 0.023 (0.001) | 0.162 (0.003) | 0.024 (0.002) | 0.127 (0.003) | 0.234 (0.001) |
The performances of GGMs, CGGMs, JGGMs, and JCGGMs are compared with the comparison criteria explained in subsection 3.1. When the sample size is small, the separate CGGMs select many false positives, which can be alleviated with JCGGMs. Under the scenario which is favored to JCGGMs, the JCGGM performs the best in both small and large sample cases.
Figure 2ROC curves: the average ROC curves are presented. Throughout all scenarios, the JCGGM performs the best. (A) With no external variable and a small sample size, JGGM, and JCGGM perform well. (B) With no external variable and a large sample size, JCGGM performs the best, followed by CGGM and JGGM. These two performs similarly. (C) With external variables and a small sample size, only JCGGM performs well. (D) With external variables and a large sample size, JCGGM performs the best, followed by JGGM and CGGM.
Results from JGGM and JCGGM.
| JGGMs | Number of edges | 93 | 120 | 115 | 117 |
| % specific edges | 1.1 | 5.8 | 6 | 4.2 | |
| JCGGMs | Number of edges | 74 | 99 | 94 | 93 |
| % specific edges | 0 | 9.1 | 3.2 | 2.1 |
The JGGM and JCGGM are applied to the expression measurements of genes involved in insulin responsive facilitative sugar transporter mediated glucose transport pathway. The JGGM implies that liver, heart, and fat tissues have the similar level of tissue-specificity, whereas the JCGGM implies that the liver tissue has the highest level of tissue specificity. The result from JCGGM is more convincing due to the fact that the specialized enzyme activity of glycogen phosphorylase only occurs in liver tissue.
Figure 3Networks inferred from JCGGM: the liver network has the largest number of edges and the highest level of tissue-specificity. (A) The inferred gene regulation network of the kidney tissue is presented. (B) The inferred gene regulation network of the liver tissue is presented. (C) The inferred gene regulation network of the heart tissue is presented. (D) The inferred gene regulation network of the fat tissue is presented.