| Literature DB >> 27716040 |
Junli Du1,2, Manlin Li1, Zhifa Yuan1, Mancai Guo1, Jiuzhou Song3, Xiaozhen Xie1, Yulin Chen4.
Abstract
BACKGROUND: The knowledge base-driven pathway analysis is becoming the first choice for many investigators, in that it not only can reduce the complexity of functional analysis by grouping thousands of genes into just several hundred pathways, but also can increase the explanatory power for the experiment by identifying active pathways in different conditions. However, current approaches are designed to analyze a biological system assuming that each pathway is independent of the other pathways.Entities:
Keywords: Bovine mammary; Coefficient of determination (CD); Decision coefficient (DC); Pathway analysis
Mesh:
Year: 2016 PMID: 27716040 PMCID: PMC5053338 DOI: 10.1186/s12859-016-1285-1
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1The construction principle of decision coefficient for a specified pathway. a One path-chain including one bi-directional arrow. b The decision coefficient of one specified pathway including the direct determination factor and the indirect determination factors from all other p − 1 pathways
The detailed subdivided result of decision coefficient
| Subcategory/the secondary pathways |
| ⋯ |
| ⋯ |
| ⋯ |
|
|---|---|---|---|---|---|---|---|
| Direct and indirect determination factor ( | ( | ⋯ | 2 | ⋯ | 2 | ⋯ | 2 |
| ⋮ | ⋱ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | |
| 2 | ⋯ | ( | ⋯ | 2 | ⋯ | 2 | |
| ⋮ | ⋮ | ⋮ | ⋱ | ⋮ | ⋮ | ⋮ | |
| 2 | ⋯ | 2 | ⋯ | ( | ⋯ | 2 | |
| ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋱ | ⋮ | |
| 2 | ⋯ | 2 | ⋯ | 2 | ⋯ | ( | |
| DC ( |
| ⋯ |
| ⋯ |
| ⋯ |
|
r (j, t = 1, 2, ⋯, p; j ≠ t) indicates the correlation coefficient x and x . Obviously, the data satisfy r = r and according to the decision analysis method. In order to distinguish between the direct and indirect determination factor clearly, the direct determination factor has been indicated in bold italics
Fig. 2The decision tree of KEGG pathways according to the decision percentage. The red sign ‘?%’ denotes the decision percentage of KEGG subcategory pathway to its corresponding category pathway. Similarly, the black sign ‘?%’ denotes the decision percentage of the secondary KEGG pathway to its corresponding subcategory pathway. In addition, the activated KEGG subcategory pathways were marked with red color, the inhibited KEGG subcategory pathways were marked with blue color. In the same way, the activated secondary KEGG pathways were marked with red circles; the inhibited secondary KEGG pathways were marked with blue circles
The decision analysis and path analysis results of simulated data
|
|
|
|
| 2 |
|
|
|
|
|---|---|---|---|---|---|---|---|---|
|
| 0.383 |
| −0.162 | −0.124 | 0.002 | 0.385 | −0.700 | 0.148 |
|
| 2.504 | 1.916 | ||||||
|
| −2.340 | −1.795 | ||||||
|
| −1.097 |
| 0.057 | −0.124 | 0.243 | −0.854 | −0.416 | 0.670 |
|
| −0.198 | 0.433 | ||||||
|
| 0.384 | −0.842 | ||||||
|
| −2.593 |
| −0.369 | 1.196 | 2.009 | −0.584 | −0.278 | −3.697 |
|
| −0.084 | 0.434 | ||||||
|
| 2.462 | −12.772 | ||||||
|
| 2.471 |
| −0.362 | −1.795 | −3.117 | −0.647 | 0.506 | −9.300 |
|
| −0.170 | −0.843 | ||||||
|
| −2.585 | −12.799 |
The percentage of direct and indirect CD in the total CD for selected KEGG pathway categories and subcategories
| KEGG pathway category and subcategory | Total CD | |
|---|---|---|
| direct CD | indirect CD | |
| 1. Metabolism | 0.179 | 0.821 |
| 1.1 Carbohydrate Metabolism | 0.168 | 0.832 |
| 1.2 Energy Metabolism | 0.616 | 0.384 |
| 1.3 Lipid Metabolism | 0.260 | 0.740 |
| 1.4 Nucleotide Metabolism | 0.538 | 0.462 |
| 1.5 Amino Acid Metabolism | 0.203 | 0.797 |
| 1.6 Metabolism of Other Amino Acids | 0.478 | 0.523 |
| 1.7 Glycan Biosynthesis and Metabolism | 0.238 | 0.762 |
| 1.8 Metabolism of Cofactors and Vitamins | 0.379 | 0.621 |
| 1.11 Xenobiotics Biodegradation and Metabolism | 0.453 | 0.547 |
| 3. Environmental Information Processing | 0.512 | 0.488 |
| 3.2 Signal Transduction | 0.139 | 0.861 |
| 3.3 Signaling Molecules and Interaction | 0.364 | 0.636 |
The comparison results of the most impacted pathways and impact direction under decision analysis model, KEGG-PATH and DIA method
| KEGG pathway Categories/Sub-categories | The concordance rate of impact direction | The concordance rate of the most impacted pathways | ||||
|---|---|---|---|---|---|---|
| Decision analysis | KEGG-PATH | DIA | KEGG-PATH | |||
| 1 | 1. Metabolism | For its sub-category pathways | 54.5 %(6/11) | 45.5 %(5/11) | 62.5 % (5/8) | 87.5 %(7/8) |
| 2 | 3. Environmental Information Processing | 100 %(3/3) | 33.3 %(1/3) | 0 (0/1) | 100 %(1/1) | |
| 3 | 1.1 Carbohydrate Metabolism | For its secondary pathways | 35.7 % (5/14) | 50 %(7/14) | 40 % (2/5) | 50 % (3/6) |
| 4 | 1.2 Energy Metabolism | 33.3 %(1/3) | 33.3 %(1/3) | 100 %(2/2) | 100 %(2/2) | |
| 5 | 1.3 Lipid Metabolism | 76.9 %(10/13) | 30.8 %(4/13) | 50 %(3/6) | 33.3 %(2/6) | |
| 6 | 1.4 Nucleotide Metabolism | 50 %(1/2) | 0 (0/2) | 100 %(2/2) | 100 %(2/2) | |
| 7 | 1.5 Amino Acid Metabolism | 54.5 %(6/11) | 72.7 %(8/11) | 50 %(2/4) | 60 %(3/5) | |
| 8 | 1.6 Metabolism of Other Amino Acids | 75 %(3/4) | 75 %(3/4) | 100 %(2/2) | 100 %(2/2) | |
| 9 | 1.7 Glycan Biosynthesis and Metabolism | 58.3 %(7/12) | 41.7 %(5/12) | 50 %(2/4) | 25 %(1/4) | |
| 10 | 1.8 Metabolism of Cofactors and Vitamins | 37.5 %(3/8) | 37.5 %(3/8) | 60 %(3/5) | 60 %(3/5) | |
| 11 | 1.11 Xenobiotics Biodegradation and Metabolism | 66.7 %(2/3) | 100 %(3/3) | 50 %(1/2) | 100 %(2/2) | |
| 12 | 3.2 Signal Transduction | 63.6 %(7/11) | 63.6 %(7/11) | 60 %(3/5) | 60 %(3/5) | |
| 13 | 3.3 Signaling Molecules and Interaction | 66.7 %(2/3) | 33.3 %(1/3) | 100 %(3/3) | 100 %(3/3) | |
For the ‘The concordance rate of impact direction’ column, the denominator of each fraction in the parentheses denotes the number of subcategory pathways and the secondary pathways from the front corresponding categories and sub-categories for two columns, and the numerator of each fraction for two columns denotes the number of pathways with the same impact direction under DIA and decision analysis, and under DIA and KEGG-PATH respectively. For the ‘The concordance rate of the most impacted pathways’ column, the denominator of each fraction in the parentheses denotes the number (a) of the most impacted pathways identified based on DC values in corresponding pathway categories and sub-categories for two columns, and the numerator of each fraction for two columns denotes the number of pathways which also appeared in top a pathways identified by DIA average impact values and by total effect from KEGG-PATH, respectively