| Literature DB >> 29657291 |
Salvatore Alaimo1, Gioacchino Paolo Marceca2, Alfredo Ferro3, Alfredo Pulvirenti4.
Abstract
In the era of network medicine, pathway analysis methods play a central role in the prediction of phenotype from high throughput experiments. In this paper, we present a network-based systems biology approach capable of extracting disease-perturbed subpathways within pathway networks in connection with expression data taken from The Cancer Genome Atlas (TCGA). Our system extends pathways with missing regulatory elements, such as microRNAs, and their interactions with genes. The framework enables the extraction, visualization, and analysis of statistically significant disease-specific subpathways through an easy to use web interface. Our analysis shows that the methodology is able to fill the gap in current techniques, allowing a more comprehensive analysis of the phenomena underlying disease states.Entities:
Keywords: drugs; functional analysis.; microRNAs; pathway perturbation; subpathway analysis
Year: 2017 PMID: 29657291 PMCID: PMC5831934 DOI: 10.3390/ncrna3020020
Source DB: PubMed Journal: Noncoding RNA ISSN: 2311-553X
Top-20 pathways obtained for breast invasive carcinoma (BRCA) and colon adenocarcinoma (COAD) after subpathways extraction and enrichment performed by means of SubPathway ExtraCtor and enrICher (SPECifIC). All terms were first ranked by adjusted p-value and the top-20 significant results were taken for further analysis.
| BRCA | COAD | ||
|---|---|---|---|
| Pathway | Pathway | ||
| metabolism of xenobiotics by cytochrome p450 | 0 | metabolism of xenobiotics by cytochrome p450 | 0 |
| steroid hormone biosynthesis | 0 | drug metabolism cytochrome p450 | 0 |
| drug metabolism cytochrome p450 | 0 | chemical carcinogenesis | 0 |
| chemical carcinogenesis | 0 | steroid hormone biosynthesis | 0 |
| drug metabolism other enzymes | 0 | drug metabolism other enzymes | 0 |
| linoleic acid metabolism | 0 | linoleic acid metabolism | 0 |
| longevity regulating pathway | ppar signaling pathway | 0 | |
| egfr tyrosine kinase inhibitor resistance | phenylalanine metabolism | 0 | |
| endocrine resistance | estrogen signaling pathway | ||
| rap1 signaling pathway | chemokine signaling pathway | ||
| progesterone mediated oocyte maturation | erbb signaling pathway | ||
| hif 1 signaling pathway | phospholipase d signaling pathway | ||
| melanogenesis | neurotrophin signaling pathway | ||
| apoptosis | insulin signaling pathway | ||
| platinum drug resistance | egfr tyrosine kinase inhibitor resistance | ||
| phospholipase d signaling pathway | prolactin signaling pathway | ||
| mtor signaling pathway | oxytocin signaling pathway | ||
| ras signaling pathway | platelet activation | ||
| thyroid hormone signaling pathway | endocrine resistance | ||
| erbb signaling pathway | focal adhesion | ||
Metrics computed for the subpathways disease-specificity assessment of the two datasets in our case study. The table shows the number of substructures nodes, the number of significant nodes (), the number of disease genes, the number of significant disease genes (), the number of reachable pairs of disease genes within subpathways, the average distance between a disease gene and a substructure , and the average distance between disease genes contained within each substructure . The results are compared with a reference computed directly in Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways.
| # Nodes | # Disease Genes | |||||||
|---|---|---|---|---|---|---|---|---|
| † | ‡ | |||||||
| 1009 | 7121 | 30 | 104 | 283 | - | 7 | ||
| 466 | 466 | 15 | 15 | 6 | 3 | |||
| 101 | 214 | 9 | 14 | 6 | 1.89 | 3 | ||
| 142 | 722 | 4 | 8 | 1 | 2.09 | 2 | ||
| 34 | 34 | 0 | 0 | 0 | 2.48 | - | ||
| 1009 | 7121 | 11 | 81 | 490 | - | 9 | ||
| 486 | 486 | 9 | 9 | 6 | 4 | |||
| 59 | 173 | 3 | 8 | 4 | 2.04 | 3 | ||
| 158 | 248 | 4 | 7 | 9 | 2.20 | 2 | ||
| 6 | 6 | 0 | 0 | 0 | 2.97 | - | ||
Figure 1Comparison of the results obtained by SubPathway ExtraCtor and enrICher (SPECifIC), Subpathway- GM [24], Subpathway-GMir [27], and DEsubs [26] for the two datasets: breast invasive carcinoma (BRCA) (a) and colon adenocarcinoma (COAD) (b). The Venn diagrams have been obtained considering only pathways for which the reported p-value was significant ().
List of terms sources used for the enrichment phase. All sources are grouped by category. For each source, we report the name, the number of terms, and the number of enriched nodes.
| Category | Source | # Terms | # Nodes |
|---|---|---|---|
| DisGeNET [ | 7607 | 2978 | |
| GAD [ | 403 | 1519 | |
| KEGG [ | 1278 | 1234 | |
| OMIM [ | 89 | 518 | |
| Drugbank [ | 247 | 7 | |
| Drugbank [ | 797 | 180 | |
| Drugbank [ | 4815 | 1494 | |
| Drugbank [ | 560 | 18 | |
| KEGG [ | 3793 | 706 | |
| GO [ | 11,386 | 4850 | |
| GO [ | 1545 | 4852 | |
| GO [ | 4146 | 4832 | |
| KEGG [ | 310 | 4904 | |
List of cancer types extracted from TCGA with their codes, number of case and control samples, and subcategories
| Code | Cancer Type | Control Samples | Case Samples | Case Samples Categories |
|---|---|---|---|---|
| BLCA | Bladder Urothelial Carcinoma | 19 | 193 | Stage I, II, III, IV |
| BRCA | Breast invasive carcinoma | 86 | 642 | Stage I, II, III, IV, X |
| COAD | Colon adenocarcinoma | 8 | 389 | Stage I, II, III, IV |
| KICH | Kidney Chromophobe | 25 | 66 | Stage I, II, III, IV |
| KIRC | Kidney renal clear cell carcinoma | 71 | 224 | Stage I, II, III, IV |
| LUAD | Lung adenocarcinoma | 19 | 388 | Stage I, II, III, IV |
| LUSC | Lung squamous cell carcinoma | 37 | 247 | Stage I, II, III, IV |
| PRAD | Prostate adenocarcinoma | 50 | 191 | Category 6, 7, 8, 9, 10 |
| READ | Rectum adenocarcinoma | 3 | 150 | Stage I, II, III, IV |
| UCEC | Uterine Corpus Endometrial Carcinoma | 14 | 231 | Stage I, II, III, IV |
| All Samples | 332 | 2721 |
Figure 2Subpathway extraction analysis workflow. (a) After selecting a disease and a stage among those available into the drop-down box, (b) the user can optionally choose one or more Nodes of Interest (NoIs); (c) and, after modifying the optional parameters, the job can be submitted to our servers. After processing, (d) a table will be shown with a list of found substructures, giving the ability to perform the functional annotation.
Figure 3Results of the functional annotation analysis of a substructure. The results are shown in a table together with the graph representing the structure itself. In the table, we show the term identifier in the source database, the term description, the number of annotated nodes, the p-value, the adjusted p-value, and the source of the term. By clicking on any term identifier, the annotated nodes will be highlighted in the graph. The user is also able to download annotation results in a tab-separated file and the network in XGMML format.