| Literature DB >> 20187943 |
Jui-Hung Hung1, Troy W Whitfield, Tun-Hsiang Yang, Zhenjun Hu, Zhiping Weng, Charles DeLisi.
Abstract
One of the important challenges to post-genomic biology is relating observed phenotypic alterations to the underlying collective alterations in genes. Current inferential methods, however, invariably omit large bodies of information on the relationships between genes. We present a method that takes account of such information - expressed in terms of the topology of a correlation network - and we apply the method in the context of current procedures for gene set enrichment analysis.Entities:
Mesh:
Year: 2010 PMID: 20187943 PMCID: PMC2872883 DOI: 10.1186/gb-2010-11-2-r23
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Ten highest TIF genes in the colorectal cancer dataset
| Gene |
| KEGG annotation | GO annotation (evidence codea) | |
|---|---|---|---|---|
| 1.34 | 4.79 (2e-6) | Calcium signaling pathway | ||
| 1.33 | 1.90 (0.06) | Cytokine-cytokine receptor interaction | ||
| 1.32 | 5.82 (6e-9) | Calcium signaling pathway | ||
| 1.32 | 6.02 (2e-9) | Wnt signaling pathway | ||
| 1.32 | 5.80 (7e-9) | Cell adhesion molecules (CAMs) | ||
| 1.32 | 7.60 (3e-14) | Complement and coagulation cascades | ||
| 1.32 | 4.70 (3e-6) | Complement and coagulation cascades | ||
| 1.32 | 4.04 (5e-5) | Calcium signaling pathway | ||
| 1.32 | 5.94 (3e-09) | Fatty acid elongation in mitochondria | ||
| 1.30 | 3.69 (0.0002) | Glutamate metabolism |
aEvidence codes defined by GO: EXP (Inferred from Experiment), IDA (Inferred from Direct Assay), IEP (Inferred from Expression Pattern), IPI (Inferred from Physical Interaction), NAS (Non-traceable Author Statement), and TAS (Traceable Author Statement).
Pathways from the colon cancer dataset found exclusively by PWEA
| Pathway | Size | DE fractiona | Type | Possible relation to the cancer | Reference. |
|---|---|---|---|---|---|
| Arachidonic acid metabolism | 50 | 34% | Lipid metabolism | Inflammation | [ |
| Axon guidance | 126 | 20% | Development | Cell mobility and cell growth, related to MAPK signaling pathway | [ |
| Nicotinate and nicotinamide metabolism | 23 | 22% | Metabolism of cofactors and vitamins | Stimulate cell growth | [ |
| Drug metabolism - cytochrome P450 | 63 | 30% | Xenobiotics biodegradation and metabolism | Therapeutic target, related to prognosis | [ |
| Urea cycle and metabolism of amino groups | 28 | 39% | Amino acid metabolic | Nutrition intake | [ |
| Pyruvate metabolism | 41 | 37% | Carbohydrate metabolism | Nutrition intake | [ |
| Bile acid biosynthesis | 31 | 39% | Lipid metabolism | Lead to high concentration of bile acid | [ |
| Colorectal cancer | 84 | 15% | Disease | - | - |
| Long-term depression | 70 | 15% | Disease | Unknown | - |
| Amyotrophic lateral sclerosis | 54 | 15% | Disease | Inflammation and MAPK signaling pathway | - |
aDE fraction is the fraction of genes that show differential expression with P < 0.05 using a two-tailed t-test.
Figure 1Pathways adapted from KEGG. (a) Renal cell carcinoma. (b) MAPK signaling pathway. (c) Axon guidance. (d) Amyotrophic lateral sclerosis. (e) Fcε RI signaling pathway. (f) Gonadotropin-releasing hormone signaling pathway. (g) Jak-STAT signaling pathway. (h) Basal cell carcinoma. Red indicates an abnormality.
Figure 2. The regions circled in red and blue are clustered around the early stages of carcinoma, in accordance with the tissue origin being early stage.
Ten highest TIF genes in the small cell lung cancer dataset
| Gene |
| KEGG annotation | GO annotation (evidence codea) | |
|---|---|---|---|---|
| 1.33 | 3.87 (0.0001) | Lysine degradation | ||
| 1.33 | 5.60 (2e-8) | Biotin metabolism | ||
| 1.33 | 10.60 (3e-26) | Cell cycle | ||
| 1.33 | 5.31 (1e-7) | Pathways in cancer | ||
| 1.29 | 5.69 (1e-8) | MAPK signaling pathway | ||
| 1.29 | 11.07 (2e-28) | MAPK signaling pathway | ||
| 1.29 | 7.36 (2e-13) | B cell receptor signaling pathway | ||
| 1.29 | 12.69 (7e-37) | Phosphatidylinositol signaling system | ||
| 1.29 | 5.07 (4e-7) | Glycosphingolipid biosynthesis - lacto and neolacto series | ||
| 1.29 | 0.52 (0.60) | Bile acid biosynthesis |
aEvidence codes defined by GO: ND (No biological Data available), EXP (Inferred from Experiment), IC (Inferred by Curator), IDA (Inferred from Direct Assay), IEA (Inferred from Electronic Annotation), IEP (Inferred from Expression Pattern), IPI (Inferred from Physical Interaction), NAS (Non-traceable Author Statement), and TAS (Traceable Author Statement).
Pathways from the small cell lung cancer dataset found exclusively by PWEA
| Pathway | Size | DE fractiona | Type | Possible relation to the cancer | Reference |
|---|---|---|---|---|---|
| GnRH signaling pathway | 78 | 37% | Endocrine system | Negative autocrine regulator | [ |
| Complement and coagulation cascades | 56 | 54% | Immune system | Inflammation | [ |
| MAPK signaling pathway | 199 | 38% | Signal transduction | Cell growth | - |
| Fc epsilon RI signaling pathway | 63 | 44% | Immune system | Angiogenesis | [ |
| Apoptosis | 67 | 34% | Cell growth and death | Apoptosis | - |
| ABC transporters | 34 | 24% | Membrane transport | Drug resistance | [ |
| Jak-STAT signaling pathway | 93 | 37% | Signal transduction | Cell growth | [ |
| Drug metabolism - cytochrome P450 | 41 | 51% | Xenobiotics biodegradation and metabolism | Anticancer drugs topotecan and etoposide | [ |
| Drug metabolism - other enzymes | 28 | 46% | Xenobiotics biodegradation and metabolism | Anticancer drug irinotecan | [ |
| Histidine metabolism | 24 | 42% | Amino acid metabolism | Nutrition intake. | [ |
| Tryptophan metabolism | 36 | 39% | Amino acid metabolism | As above | [ |
| Phenylalanine metabolism | 13 | 54% | Amino acid metabolism | As above | [ |
| Fatty acid metabolism | 37 | 38% | Lipid metabolism | Apoptosis. | [ |
| Basal cell carcinoma | 36 | 17% | Disease | Proliferation invasion through hedgehog signaling pathway | - |
aDE fraction is the fraction of genes that show differential expression with P < 0.05 using a two-tailed t-test. DDC: enzymatic neuroendocrine markers L-DOPA decarboxylase.
Figure 3. The identification of genes associated with primary and metastatic stages is consistent with the tissue of origin being stage heterogeneous, and not purely primary.
Figure 4Algorithmic scheme of PWEA. In step 1, two different colors (yellow and orange) in the signature vector indicate two phenotypes (for example, normal and cancer). Blue rectangles in the gene list vector indicate genes in a particular pathway P. For a pathway k, the expression profiles are categorized into two groups: P(blue) and its complement, 'Not P' (cyan). In step 2 the TIF scores for genes in Pare calculated. In step 3, TIF scores of the genes in 'Not P' set is computed. In step 4, the maximum deviation (MD) between two cumulative distribution functions is computed. After calculating MD for each of n iterations of phenotype shuffling, the fraction of occurrences of shuffled MDs ≥ the original MD is the P-value of P. In step 5, after all pathways have been tested, FDR is used to correct for multiple testing. In step 6, results and a KEGG markup language topology file for visualization in visANT [68] are the final output. CDF, cumulative distribution function.