| Literature DB >> 20504374 |
Xianwen Ren1, Xiaobo Zhou, Ling-Yun Wu, Xiang-Sun Zhang.
Abstract
BACKGROUND: Biological systems process the genetic information and environmental signals through pathways. How to map the pathways systematically and efficiently from high-throughput genomic and proteomic data is a challenging open problem. Previous methods design different heuristics but do not describe explicitly the behaviours of the information flow.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20504374 PMCID: PMC2890502 DOI: 10.1186/1752-0509-4-72
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Figure 1A schematic diagram of the information-flow model with dissipation, saturation and direction for pathway inference. A. Simplified example of the pathway inferring problem. The paths composed of green nodes and edges are the inferred pathway whereas the blue nodes and edges are predicted to be not relevant. The edge thickness denotes the capacity limit of each edge. B. Constraints imposed on the edges and the nodes. B1. The source only sends information flows out. The total amount is given by I0. B2. The information flow dissipates on each edge, illustrated by the thickness of the edge. B3. There is a capacity limit on each edge. B4. The information only flows in the direction of the interactions. B5. The amount of the input flow should not be less than the amount of the output flow at each intermediate node. B6. The target only receives information flows in. The goal is to maximize the total information flow that the target receives.
Figure 2The yeast MAPK signalling pathways deposited in KEGG [24]. A. The pheromone-induced yeast MAPK pathway from Ste3 to Ste12. B. The yeast MAPK pathway induced by hypotonic shock from Mid2 to Rlm1. C. The yeast MAPK pathway induced by high osmolarity from Sln1 to Hog1. D. The starvation-induced yeast MAPK pathway from Ras2 to Ste12.
Figure 3The predicted yeast MAPK pathways induced by pheromones. A. The shorted path from Ste3 to Ste12 which is also the result of the Color Coding method with the path length as 5. B. The pathway predicted by the information flow method with dissipation, saturation and direction. C. The pathway predicted by the integer linear programming model[8]. D. The pathway predicted by the electric current method[11]. Red: the source or the target; green: proteins appeared in the yeast pheromone-induced MAPK pathway from Ste3 to Ste12 in KEGG[24]; Blue: false positive proteins.
Comparison of IFDSD, ILP and EC on the yeast MAPK pathways.
| Source | Target | Method | Connectivity | Intermediate | Precision | Recall |
|---|---|---|---|---|---|---|
| Ste3 | Ste12 | IFDSD | Yes | Yes | 0.67 | 0.71 |
| ILP | Depend on λ | Yes | 0.56 | 0.36 | ||
| EC | Depend on cutoff | No | 0.48 | 0.79 | ||
| Ras2 | Ste12 | IFDSD | Yes | Yes | 0.17 | 0.63 |
| ILP | Depend on λ | Yes | 0.16 | 0.38 | ||
| EC | Depend on cutoff | No | 0.16 | 0.75 | ||
| Mid2 | Rlm1 | IFDSD | Yes | Yes | 0.27 | 0.57 |
| ILP | Depend on λ | Yes | 0.18 | 0.57 | ||
| EC | Depend on cutoff | No | 0.29 | 0.71 | ||
| Sln1 | Hog1 | IFDSD | Yes | Yes | 0.60 | 1.00 |
| ILP | Depend on λ | Yes | 0.33 | 1.00 | ||
| EC | Depend on cutoff | No | 0.86 | 1.00 | ||
Four merits were compared among IFDSD, ILP and EC based on the yeast MAPK pathways. "Connectivity" and "intermediate" were about edges while precision and recall were about the nodes. Pathways predicted by IFDSD are always connected but the connectivity of the pathways predicted by ILP and EC depends on the parameters because they could filter the less-weighted edges. Since the nodes except the source and the target should transfer information from the source to the target, these "intermediate" nodes should have more than two edges linked to them. IFDSD and ILP always generate pathways satisfying this request whereas EC can not. Making sure the connectivity of the predicted pathways, the precision and recall were calculated by selecting the optimal parameters for each method on the yeast MAPK pathways.
Figure 4Pathways between Gpa1 to Prp39 predicted by our method. Two pathways are inferred. Pathway 1 is the pheromone signaling pathway identified by both Tu et al.'s method and the information flow model. Pathway 2 is identified only by the information flow model. It regulates the recovery from the pheromone arrest.
Functional enrichment analysis for the pathways from GPA1 to PRP39 identified by IFDSD.
| Pathways | GO term | Corrected | Pathway Frequency | Genome Frequency |
|---|---|---|---|---|
| Pathway 1: | Pheromone-dependent signal transduction during conjugation with cellular fusion | 1.0285e-8 | 4/4, 100% | 29/5819, 0.4% |
| Response to pheromone | 7.4146e-7 | 4/4, 100% | 101/5819, 1.7% | |
| Filamentous growth | 8.2752e-5 | 3/4, 75% | 105/5819, 1.8% | |
| Cell cycle arrest | 8.2752e-5 | 2/4, 50% | 12/5819, 0.2% | |
| Pathway 2 | Adaptation to pheromone during conjugation with cellular fusion | 4.2907e-6 | 3/4, 75% | 15/5819, 0.2% |
| Negative regulation of signal transduction | 2.5475e-5 | 3/4, 75% | 30/5819, 0.5% | |
| Re-entry into mitotic cell cycle after pheromone arrest | 4.1196e-5 | 2/4, 50% | 3/5819, 0.0% | |
| Negative regulation of cellular process | 1.2814e-4 | 4/4, 100% | 290/5819, 4.9% | |