| Literature DB >> 30854222 |
Cankut Çubuk1, Marta R Hidalgo2, Alicia Amadoz3, Kinza Rian1, Francisco Salavert4, Miguel A Pujana5, Francesca Mateo5, Carmen Herranz5, Jose Carbonell-Caballero6, Joaquín Dopazo7,8,9.
Abstract
In spite of the increasing availability of genomic and transcriptomic data, there is still a gap between the detection of perturbations in gene expression and the understanding of their contribution to the molecular mechanisms that ultimately account for the phenotype studied. Alterations in the metabolism are behind the initiation and progression of many diseases, including cancer. The wealth of available knowledge on metabolic processes can therefore be used to derive mechanistic models that link gene expression perturbations to changes in metabolic activity that provide relevant clues on molecular mechanisms of disease and drug modes of action (MoA). In particular, pathway modules, which recapitulate the main aspects of metabolism, are especially suitable for this type of modeling. We present Metabolizer, a web-based application that offers an intuitive, easy-to-use interactive interface to analyze differences in pathway metabolic module activities that can also be used for class prediction and in silico prediction of knock-out (KO) effects. Moreover, Metabolizer can automatically predict the optimal KO intervention for restoring a diseased phenotype. We provide different types of validations of some of the predictions made by Metabolizer. Metabolizer is a web tool that allows understanding molecular mechanisms of disease or the MoA of drugs within the context of the metabolism by using gene expression measurements. In addition, this tool automatically suggests potential therapeutic targets for individualized therapeutic interventions.Entities:
Mesh:
Year: 2019 PMID: 30854222 PMCID: PMC6397295 DOI: 10.1038/s41540-019-0087-2
Source DB: PubMed Journal: NPJ Syst Biol Appl ISSN: 2056-7189
TCGA samples used in this study
| Cancer type | Abbreviation | Tumor samples | Normal samples | Patients alive | Deceased patients |
|---|---|---|---|---|---|
| Breast invasive carcinoma | BRCA | 1057 | 113 | 900 | 146 |
| Kidney renal clear cell carcinoma | KIRC | 526 | 72 | 345 | 173 |
| Liver hepatocellular carcinoma | LIHC | 294 | 48 | 184 | 112 |
| Prostate adenocarcinoma | PRAD | 379 | 52 | 367 | 7 |
| Total | 2256 | 285 | 1796a | 438a |
aThe sum of these columns does not equal the total number of tumor samples plus normal samples because survival information was missing for some patients
Fig. 1Metabolizer graphic interface with a representation of the modules. On the right side there is a list of KEGG pathways with arrows up or down in case they contain modules with up or down activations, respectively. When the arrow is gray, the change in activity is not significant. Red up arrows indicates a significant increase in activity and blue down arrow a significant decrease of activity in the module. Below the pathway list, there is another list with the modules within the pathway with the same code for arrows
Fig. 2Classification performance obtained using module activities inferred with Metabolizer and CBM-based reaction activities for the prediction of BRCA subtypes. BRCA subtypes are defined on the bases of PAM50 gene activities and therefore, gene expression is taken as the gold standard classification performance
AUC values obtained for tumor types in Table 1, with the corresponding AUC values obtained when artificial classes are obtained by randomizing sample labels
| BRCA | BRCA random | LIHC | LIHC random | KIRC | KIRC random | PRAD | PRAD random | |
|---|---|---|---|---|---|---|---|---|
| Mean | 1.000 | 0.495 | 1.000 | 0.525 | 0.999 | 0.477 | 0.998 | 0.491 |
| Standard deviation | 0.000 | 0.190 | 0.000 | 0.251 | 0.002 | 0.216 | 0.006 | 0.208 |
| Median | 1.000 | 0.5025 | 1.000 | 0.533 | 1.000 | 0.464 | 1.000 | 0.469 |
| Median absolute deviation | 0.000 | 0.205 | 0.000 | 0.312 | 0.000 | 0.243 | 0.000 | 0.228 |
Number of modules found as differentially activated in the cancers listed in Table 1 by the different methods GSEA, SPIA, and Metabolizer
| Method | BRCA | LIHC | KIRC | PRAD | ||||
|---|---|---|---|---|---|---|---|---|
| Found | FP | Found | FP | Found | FP | Found | FP | |
| GSEA | 8 | 3.5/1.8 | 5 | 2.8/6.0 | 14 | 7.4/3.4 | 5 | 2.7/2.3 |
| SPIA | 2 | 0.07/0.05 | 1 | 0.3/0.1 | 2 | 0.1/0.07 | 1 | 1.1/0.1 |
| Metabolizer | 81 | 0.008/0.06 | 77 | 0.04/0.04 | 77 | 0.03/0.03 | 73 | 0.05/0.05 |
The number of false positives (FP) was calculated by comparing 1000 times two artificial sample sets by random sampling of normal tissues maintaining the proportions of the real comparison. That is 102 vs. 11 for BRCA, 41 vs. 7 in LIHC, 63 vs. 9 in KIRC and 46 vs. 6 in PRAD. The same procedure was repeated using cancer samples. In this case the proportions were 995 vs. 102 in BRCA, 253 vs. 41 in LIHC, 463 vs. 63 in KIRC and 333 vs. 46 in PRAD. The second column for each cancer type shows the average number of FPs obtained with normal samples/the same figure obtained from cancer samples
Probabilities of KIRC metabolic profiles being identified as normal cell metabolic profile after the KO of the gene
| Gene symbol | Entrez ID | Change in probability | ||
|---|---|---|---|---|
| HSD17B12 | 51144 | 0.348 | 0.092 | 0.256 |
| TECR | 9524 | 0.348 | 0.092 | 0.256 |
| SC5D | 6309 | 0.328 | 0.092 | 0.236 |
| EBP | 10682 | 0.328 | 0.092 | 0.236 |
| DHCR24 | 1718 | 0.328 | 0.092 | 0.236 |
| LSS | 4047 | 0.328 | 0.092 | 0.236 |
| TM7SF2 | 7108 | 0.328 | 0.092 | 0.236 |
| NSDHL | 50814 | 0.328 | 0.092 | 0.236 |
| CYP51A1 | 1595 | 0.328 | 0.092 | 0.236 |
| HSD17B7 | 51478 | 0.328 | 0.092 | 0.236 |
| DHCR7 | 1717 | 0.328 | 0.092 | 0.236 |
Fig. 3Essentiality (Demeter score) of genes predicted as optimal KOs with respect to the background distribution of essentiality values. Values below 0 indicate lower proliferation. From left to right and top to bottom: HSD17B12 and SC5D in cell line G401 (KIDNEY); TECR in cell line TUHR4TKB (KIDNEY) (this gene shows the same results in KMRC1 cell line of KIDNEY, data not shown); SC5D and EBP in SLR25 cell line (KIDNEY) (SC5D shows the same result in G401 cell line of SOFT_TISSUE, data not shown); DHCR24 in cell line HK2 (KIDNEY); LSS in cell line SLR23 (KIDNEY); NSDHL, DHCR7, and TECR in 769P cell line (KIDNEY); CYP51A1 in cell line SKRC20 (KIDNEY) (also less proliferative in SLR20 KIDNEY cell line, data not shown); HSD17B7 in cell line CAKI2 (KIDNEY); DHCR7 and EBP in cell line SLR26 (KIDNEY)
Probabilities of STAD metabolic profiles being identified as normal cell metabolic profile after the KO of the gene
| Gene symbol | Entrez ID | Change in probability | ||
|---|---|---|---|---|
|
| 1807 | 0.468 | 0.33 | 0.138 |
|
| 51733 | 0.468 | 0.33 | 0.138 |
|
| 2618 | 0.416 | 0.33 | 0.086 |
|
| 471 | 0.416 | 0.33 | 0.086 |
|
| 10606 | 0.416 | 0.33 | 0.086 |
|
| 10998 | 0.392 | 0.33 | 0.062 |
|
| 570 | 0.392 | 0.33 | 0.062 |
|
| 51144 | 0.368 | 0.33 | 0.038 |
|
| 9524 | 0.368 | 0.33 | 0.038 |
|
| 7915 | 0.354 | 0.33 | 0.024 |
|
| 18 | 0.354 | 0.33 | 0.024 |
Fig. 4Relative cell proliferation of line AGS (stomach gastric adenocarcinoma) upon UPB1 expression depletion by three different MISSION shRNAs or transduced with control vector pLKO.1. The asterisk indicates significant differences (Mann–Whitney test p-values < 0.01). The percentage of reduction of cell proliferation is also shown. The prediction of UPB1 essentiality made by Metabolizer was confirmed by a relatively more sensitive behavior
KEGG modules with activity significantly associated to patient survival in both KIRC and LIHC tumors. BRCA and PRAD did not show any significant result
| KIRC | LIHC | BRCA | PRAD | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| KEGG module ID | Final metabolite | Module name | Metabolic categories | FDR adj. | FDR adj. | FDR adj. | FDR adj. | ||||
| M00004 | Pentose phosphate cycle | Pentose phosphate pathway (Pentose phosphate cycle) | Carbohydrate | 0.04415 | 5.51536 | 0.01654 | 8.56769 | 0.95894 | 0.08936 | 1.00000 | 0.34950 |
| M00020 | Serine biosynthesis. glycerate-3P ⇒ serine | Amino acid | 0.04343 | 5.59083 | 0.01902 | 8.00418 | 0.68795 | 2.02105 | 1.00000 | 0.04439 | |
| M00029_1 | Fumarate | Urea cycle | Amino acid | 0.03695 | 5.97270 | 0.02789 | 7.10368 | 0.95894 | 0.12874 | 1.00000 | 1.58333 |
| M00029_2 | Urea | Urea cycle | Amino acid | 0.01324 | 8.08771 | 0.04608 | 5.78630 | 0.97489 | 0.01307 | 1.00000 | 1.50000 |
| M00032 | Acetoacetyl-CoA | Lysine degradation. lysine ⇒ saccharopine ⇒ acetoacetyl-CoA | Amino acid | 0.00002 | 22.4506 | 0.01902 | 7.86930 | 0.97489 | 0.06041 | 1.00000 | 0.09094 |
| M00036 | Acetoacetate | Leucine degradation. Leucine ⇒ acetoacetate acetyl-CoA | Amino acid | <10−6 | 27.1764 | 0.00593 | 11.7331 | 0.97489 | 0.01729 | 1.00000 | 0.01493 |
| M00050 | GTP | Guanine ribonucleotide biosynthesis. IMP ⇒ GDP.GTP | Nucleotide | 0.00002 | 22.0824 | 0.00222 | 14.8778 | 0.97489 | 0.01269 | 1.00000 | 0.03406 |
| M00051_1 | UMP | Uridine monophosphate biosynthesis. glutamine (PRPP) ⇒ UMP | Nucleotide | 0.01324 | 8.08552 | 0.00295 | 13.7996 | 0.77769 | 1.21228 | 1.00000 | 0.31132 |
| M00052 | CDP | Pyrimidine ribonucleotide biosynthesis. UMP ⇒ UDP/UTP.CDP/CTP | Nucleotide | <10−6 | 36.4353 | 0.01796 | 8.31921 | 0.95894 | 0.45008 | 1.00000 | 0.33263 |
| M00071 | Neolactotetraosylceramide | Glycosphingolipid biosynthesis. neolacto-series. LacCer ⇒ nLc4Cer | Carbohydrate | 0.00056 | 14.6064 | 0.01902 | 7.88317 | 0.95894 | 0.26129 | 1.00000 | 0.00638 |
| M00085_1 | Acyl-CoA | Fatty acid biosynthesis. Elongation. mitochondria | Lipid | 0.00056 | 14.6836 | 0.04608 | 5.75749 | 0.98608 | 0.00096 | 1.00000 | 0.62210 |
| M00087 | Tetradecanoyl-CoA | beta-Oxidation | Lipid | <10−6 | 37.0408 | 0.03214 | 6.63945 | 0.68795 | 1.76506 | 1.00000 | 0.02158 |
| M00131 | myo-Inositol | Inositol phosphate metabolism. Ins(1.3.4.5)P4 ⇒ Ins(1.3.4)P3 ⇒ myo-inositol | Lipid | <10−6 | 27.9721 | 0.00222 | 16.1345 | 0.97489 | 0.04363 | 1.00000 | 0.13613 |
| M00741 | Succinyl-CoA | Propanoyl-CoA metabolism. propanoyl-CoA ⇒ succinyl-CoA | Carbohydrate | <10−6 | 38.7000 | 0.00826 | 10.3022 | 0.52040 | 3.42237 | 1.00000 | 1.08333 |
Fig. 5Kaplan–Meier survival plots for the Guanine ribonucleotide biosynthesis module (left), the Beta-oxidation module (center) and the Leucine degradation module (right) in KIRC (upper row) and LIHC (lower row) tumors