| Literature DB >> 19706733 |
Chunquan Li1, Xia Li, Yingbo Miao, Qianghu Wang, Wei Jiang, Chun Xu, Jing Li, Junwei Han, Fan Zhang, Binsheng Gong, Liangde Xu.
Abstract
With the development of high-throughput experimental techniques such as microarray, mass spectrometry and large-scale mutagenesis, there is an increasing need to automatically annotate gene sets and identify the involved pathways. Although many pathway analysis tools are developed, new tools are still needed to meet the requirements for flexible or advanced analysis purpose. Here, we developed an R-based software package (SubpathwayMiner) for flexible pathway identification. SubpathwayMiner facilitates sub-pathway identification of metabolic pathways by using pathway structure information. Additionally, SubpathwayMiner also provides more flexibility in annotating gene sets and identifying the involved pathways (entire pathways and sub-pathways): (i) SubpathwayMiner is able to provide the most up-to-date pathway analysis results for users; (ii) SubpathwayMiner supports multiple species ( approximately 100 eukaryotes, 714 bacteria and 52 Archaea) and different gene identifiers (Entrez Gene IDs, NCBI-gi IDs, UniProt IDs, PDB IDs, etc.) in the KEGG GENE database; (iii) the system is quite efficient in cooperating with other R-based tools in biology. SubpathwayMiner is freely available at http://cran.r-project.org/web/packages/SubpathwayMiner/.Entities:
Mesh:
Year: 2009 PMID: 19706733 PMCID: PMC2770656 DOI: 10.1093/nar/gkp667
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Schematic overview of SubpathwayMiner.
Figure 2.A visualized example of sub-pathway mining. (a) A metabolic pathway in KEGG, citrate cycle (TCA cycle). (b) The metabolic pathway is converted to the undirected graph using our pathway simplification programming (the function updateGraphs in SubpathwayMiner). (c) A 3-clique sub-pathway in which distance between any two enzymes is no >3. It is mined from the undirected graph that the pathway corresponds to (surrounded by a black line in Figure 2a and b).
Figure 3.Screenshots of visualization provided in SubpathwayMiner. (a) Display results using a data frame in R. Each row corresponds to information of the pathway that genes are annotated to. The first column contains pathway identifiers. Relevant pathways are listed in ascending order of p-values and multiple-comparison corrected q-values. (b) Visualize a sub-pathway as the undirected graph. Enzymes are colored red if the according enzyme is identified in the submitted sets of genes. (c) Visualize a pathway through linking to the KEGG website. On the pathway map, enzymes are colored red if the according enzyme is identified in the submitted set of genes.
The statistically significantly enriched sub-pathways identified by SubpathwayMiner for differentially expressed genes from lung cancer
| Entire pathway ID ( | Entire pathway name | Sub-pathway ID | Sub-pathway |
|---|---|---|---|
| Path:00350 (0.1037/49%) | Tyrosine metabolism | Path:00350_12 | 0.003248 |
| Path:00350_3 | 0.002156 | ||
| Path:00350_5 | 0.004418 | ||
| Path:00350_6 | 0.003799 | ||
| Path:00350_7 | 0.006378 | ||
| Path:00350_8 | 0.003975 | ||
| Path:00260 (0.01109/0) | Glycine, serine and threonine metabolism | Path:00260_9 | 0.002947 |
| Path:00564 (0.01057/88%) | Glycerophospholipid metabolism | Path:00564_1 | 0.006880 |
| Path:00564_2 | 0.007549 | ||
| Path:00010 (0.00012/27%) | Glycolysis/gluconeogenesis | Path:00010_2 | 0.004475 |
| Path:00010_3 | 0.0008654 | ||
| Path:00010_4 | 0.001629 | ||
| Path:00010_5 | 0.002797 | ||
| Path:00010_6 | 0.0004917 | ||
| Path:00010_7 | 0.003566 | ||
| Path:00220 (0.004746/59%) | Urea cycle and metabolism of amino groups | Path:00220_3 | 0.006607 |
| Path:00220_5 | 0.003975 | ||
| Path:00220_6 | 0.006607 | ||
| Path:00220_7 | 0.004745 | ||
| Path:00230 (0.000128/35%) | Purine metabolism | Path:00230_1 | 0.001154 |
| Path:00230_10 | 0.006450 | ||
| Path:00230_11 | 0.0008266 | ||
| Path:00230_2 | 0.009967 | ||
| Path:00230_4 | 0.001097 | ||
| Path:00230_6 | 0.001063 | ||
| Path:00230_8 | 0.006450 | ||
| Path:00230_9 | 0.0005034 | ||
| Path:00565 (0.0011/64%) | Ether lipid metabolism | Path:00565_2 | 0.004406 |
| Path:00565_3 | 0.005799 | ||
| Path:00565_4 | 0.003269 | ||
| Path:00590 (0.002233/37%) | Arachidonic acid metabolism | Path:00590_1 | 0.0009760 |
| Path:00590_2 | 0.002773 | ||
| Path:00590_3 | 0.002411 | ||
| Path:00590_4 | 0.003772 | ||
| Path:00480 (0.005111/0) | Glutathione metabolism | Path:00480_1 | 0.005110 |
| Path:00670 (0.009117/0) | One carbon pool by folate | Path:00670_1 | 0.009117 |
aThe average overlap between the significant sub-pathways found within each single pathway.
Figure 4.The tyrosine metabolism pathway where the differentially expressed genes of lung cancer were annotated. The enzymes identified in all genes of Homo sapiens were colored green. The enzymes identified in the submitted genes were colored red. The results show that these genes were mostly concentrated in local areas of the pathway such as the right-bottom part of the figure.