| Literature DB >> 32637048 |
Jocelyn Gal1, Caroline Bailleux2, David Chardin3,4, Thierry Pourcher4, Julia Gilhodes5, Lun Jing4, Jean-Marie Guigonis4, Jean-Marc Ferrero2, Gerard Milano6, Baharia Mograbi7, Patrick Brest7, Yann Chateau1, Olivier Humbert3,4, Emmanuel Chamorey1.
Abstract
Genomics and transcriptomics have led to the widely-used molecular classification of breast cancer (BC). However, heterogeneous biological behaviors persist within breast cancer subtypes. Metabolomics is a rapidly-expanding field of study dedicated to cellular metabolisms affected by the environment. The aim of this study was to compare metabolomic signatures of BC obtained by 5 different unsupervised machine learning (ML) methods. Fifty-two consecutive patients with BC with an indication for adjuvant chemotherapy between 2013 and 2016 were retrospectively included. We performed metabolomic profiling of tumor resection samples using liquid chromatography-mass spectrometry. Here, four hundred and forty-nine identified metabolites were selected for further analysis. Clusters obtained using 5 unsupervised ML methods (PCA k-means, sparse k-means, spectral clustering, SIMLR and k-sparse) were compared in terms of clinical and biological characteristics. With an optimal partitioning parameter k = 3, the five methods identified three prognosis groups of patients (favorable, intermediate, unfavorable) with different clinical and biological profiles. SIMLR and K-sparse methods were the most effective techniques in terms of clustering. In-silico survival analysis revealed a significant difference for 5-year predicted OS between the 3 clusters. Further pathway analysis using the 449 selected metabolites showed significant differences in amino acid and glucose metabolism between BC histologic subtypes. Our results provide proof-of-concept for the use of unsupervised ML metabolomics enabling stratification and personalized management of BC patients. The design of novel computational methods incorporating ML and bioinformatics techniques should make available tools particularly suited to improving the outcome of cancer treatment and reducing cancer-related mortalities.Entities:
Keywords: Breast neoplasms; Computer simulation; Metabolomics; Unsupervised machine learning
Year: 2020 PMID: 32637048 PMCID: PMC7327012 DOI: 10.1016/j.csbj.2020.05.021
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Patients’ demographics and treatment characteristics.
| Clinical characteristic | No. of patients | % |
|---|---|---|
| Age (median min – max) | 63.2 (37–88) | |
| Histology type | ||
| Invasive ductal carcinoma | 48 | 92 |
| Invasive lobular carcinoma | 3 | 6 |
| Microinvasive carcinoma | 1 | 2 |
| Tumor stage | ||
| T1 | 21 | 40.5 |
| T2 | 24 | 46 |
| T3 | 7 | 13.5 |
| Axillary lymph node status | ||
| N0 | 28 | 54 |
| N+ | 24 | 46 |
| Metastasis | ||
| M0 | 50 | 96 |
| M1 | 2 | 4 |
| Histological grade | ||
| I | 5 | 10 |
| II | 22 | 43 |
| III | 24 | 47 |
| Hormonal receptors status | ||
| Negative | 25 | 48 |
| Positive | 27 | 52 |
| Her-2 status | ||
| Non-over-expressed | 40 | 74 |
| Over-expressed | 12 | 24 |
| Triple-negative status | ||
| No | 37 | 71 |
| Yes | 15 | 29 |
| Tumor phenotype | ||
| Her2 | 12 | 23 |
| Luminal | 25 | 48 |
| Triple-Negative | 15 | 29 |
| Adjuvant Chemotherapy | ||
| No | 13 | 25 |
| Yes | 39 | 75 |
| Adjuvant Radiotherapy | ||
| No | 9 | 17 |
| Yes | 43 | 83 |
| Adjuvant Hormonotherapy | ||
| No | 24 | 46 |
| Yes | 28 | 54 |
Oestrogen and/or progesterone.
Fig. 1Visualization of each cluster by clustering method using T-sne.
Fig. 2Silhouette value (SI) representation for each patient by clustering method.
Clinical comparison of 52 patients between clusters.
| Clinical characteristic | PCA-K-means | Spectral Clustering | Sparse K-means | SIMLR | K-Sparse | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| C1 (N = 21) | C2 (N = 10) | C3 (N = 21) | P-value | C2 (N = 19) | C1 (N = 12) | C3 (N = 21) | P-value | C1 (N = 24) | C2 (N = 8) | C3 (N = 20) | P-value | C1 (N = 17) | C2 (N = 12) | C3 (N = 23) | P-value | C1 (N = 19) | C2 (N = 12) | C3 (N = 21) | P-value | |
| Age a | 62.7 (15.2) | 64.8(16) | 62.9(15) | 0.93 | 64.8 (14.3) | 62.5 (16.5) | 62 (15.3) | 0.8 | 64.1(15) | 60.5 (17.2) | 63 (14.9) | 0.85 | 64.3 (14.1) | 64.9 (16.1) | 61.4 (15.6) | 0.755 | 64.8(14.3) | 62.5(16.5) | 62(15.3) | 0.827 |
| Histology type | 1 | 0.392 | 0.106 | 0.752 | 0.392 | |||||||||||||||
| Ductal carcinoma | 19(90.5) | 10(1 0 0) | 19(90.5) | 17(89.5) | 11(91.7) | 20(95.2) | 21(87.5) | 7(87.5) | 20(1 0 0) | 15(88.2) | 12(1 0 0) | 21(91.3) | 17(89.5) | 11(91.7) | 20(95.2) | |||||
| Lobular carcinoma | 2(9.5) | 0(0) | 1(4.8) | 2(10.5) | 1(8.3) | 0(0) | 3(12.5) | 0(0) | 0(0) | 2(11.8) | 0(0) | 1(4.3) | 2(10.5) | 1(8.3) | 0(0) | |||||
| Microinvasive carcinoma | 0(0) | 0(0) | 1(4.8) | 0(0) | 0(0) | 1(4.8) | 0(0) | 1(12.5) | 0(0) | 0(0) | 0(0) | 1(4.3) | 0(0) | 0(0) | 1(4.8) | |||||
| Tumor stage | 0.005 | 0.063 | ||||||||||||||||||
| T1 | 14(66.7) | 3(30) | 4(19) | 12(63.2) | 5(41.7) | 4(19) | 14(58.3) | 2(25) | 5(25) | 10(58.8) | 6(50) | 5(21.7) | 12(63.2) | 5(41.7) | 4(19) | |||||
| T2/T3 | 7(33.3) | 7(70) | 17(81) | 7(36.8) | 7(58.3) | 17(81) | 10(41.7) | 6(75) | 15(75) | 7(41.2) | 6(50) | 18(78.3) | 7(36.8) | 7(58.3) | 17(81) | |||||
| Axillary lymph node | 0.162 | 0.075 | 0.526 | 0.387 | 0.075 | |||||||||||||||
| N0 | 14(66.7) | 6(60) | 8(38.1) | 14(73.7) | 6(50) | 8(38.1) | 15(62.5) | 4(50) | 9(45) | 11(64.7) | 7(58.3) | 10(43.5) | 14(73.7) | 6(50) | 8(38.1) | |||||
| N+ | 7(33.3) | 4(40) | 13(61.9) | 5(26.3) | 6(50) | 13(61.9) | 9(37.5) | 4(50) | 11(55) | 6(35.3) | 5(41.7) | 13(56.5) | 5(26.3) | 6(50) | 13(61.9) | |||||
| Metastasis | 0.667 | 1 | 1 | 0.497 | 1 | |||||||||||||||
| M0 | 21(1 0 0) | 10(1 0 0) | 19(90.5) | 18(94.7) | 12(1 0 0) | 20(95.2) | 23(96) | 8(1 0 0) | 19(95) | 17(1 0 0) | 12(1 0 0) | 21(86.9) | 18(94.7) | 12(1 0 0) | 20(95.2) | |||||
| M1 | 0(40) | 0(0) | 2(9.5) | 1(5.3) | 0(0%) | 1(4.8) | 1(4) | 0(0%) | 1(5) | 0(0%) | 0(0%) | 2(13.1) | 1(5.3) | 0(0) | 1(50) | |||||
| Histological grade | 0.109 | |||||||||||||||||||
| I/II | 13(61.9) | 7(70) | 7(35) | 12(63.2) | 9(75) | 6(30) | 15(62.5) | 5(71.4) | 7(35) | 11(64.7) | 9(75) | 7(31.8) | 12(63.2) | 9(75) | 6(30) | |||||
| III | 8(38.1) | 3(30) | 13(75) | 7(36.8) | 3(25) | 14(70) | 9(37.5) | 2(28.6) | 13(65) | 6(35.3) | 3(25) | 15(68.2) | 7(36.8) | 3(25) | 14(70) | |||||
| Mitosis | 0.133 | |||||||||||||||||||
| 1 | 11(52.4) | 4(40) | 2(10) | 10 (52.6) | 5 (41.7) | 2 (10) | 11 (45.8) | 2 (28.6) | 4 (20) | 10 (58.8) | 5 (41.7) | 2 (9.1) | 10 (52.6) | 5 (41.7) | 2 (10) | |||||
| 2 | 3(14.3) | 4(40) | 7(35) | 3 (15.8) | 5 (41.7) | 6 (30) | 4 (16.7) | 4 (57.1) | 6 (30) | 2 (11.8) | 5 (41.7) | 7 (31.8) | 3 (15.8) | 5 (41.7) | 6 (30) | |||||
| 3 | 7(33.3) | 2(20) | 11(55) | 6 (31.6) | 2 (16.7) | 10 (60) | 9 (37.5) | 1 (14.3) | 10 (50) | 5 (29.4) | 2 (16.7) | 13 (59.1) | 6 (31.6) | 2 (16.7) | 12 (60) | |||||
| Ki67 a | 25(5,100) | 27.5(10,90) | 60(10,90) | 0.066 | 41.1 (30.6) | 33(22.6) | 58.8 (27.2) | 30 (19.2, 80) | 35 (23.8, 45) | 60 (28.8, 90) | 0.196 | 38 (31) | 32.8 (22.7) | 59.7 (25.9) | 41.1 (30.6) | 33 (22.6) | 58.8(27.2) | |||
| Tumour phenotype | ||||||||||||||||||||
| Her-2 over-expressed | 1(4.8) | 4(40) | 7(33.3) | 1(5.3) | 4(33.3) | 7(33.3) | 2(8.3) | 4(50) | 6(30) | 1(5.9) | 4(33.3) | 7(30.4) | 1(5.3) | 4(33.3) | 7(33.3) | |||||
| Luminal | 14(66.7) | 5(50) | 6(28.6) | 13(68.4) | 7(58.3) | 5(23.8) | 16(66.7) | 4(50) | 5(25) | 12(70.6) | 7(58.3) | 6(26.1) | 13(68.4) | 7(58.3) | 5(23.8) | |||||
| Triple-Negative | 6(28.6) | 1(10) | 8(38.1) | 5(26.3) | 1(8.3) | 9(42.9) | 6(25) | 0(0) | 9(45) | 4(23.5) | 1(8.3) | 10(43.5) | 5(26.3) | 1(8.3) | 9(42.9) | |||||
| Hormonal receptors status | 0.178 | 0.075 | 0.112 | 0.071 | 0.075 | |||||||||||||||
| Negative | 7(33.3) | 5(50) | 13(61.9) | 6(31.6) | 5(41.7) | 14(66.7) | 8(33.3) | 4(50) | 13(65) | 5(29.4) | 5(41.7) | 15(65.2) | 6(31.6) | 5(41.7) | 14(66.7) | |||||
| Positive | 14(66.7) | 5(50) | 7(38.1) | 13(68.4) | 7(58.3) | 7(33.3) | 16(66.7) | 4(50) | 7(35) | 12(70.6) | 7(58.3) | 8(34.8) | 13(68.4) | 7(58.3) | 7(33.3) | |||||
| Her-2 status | 0.061 | 0.115 | 0.061 | |||||||||||||||||
| Non-over-expressed | 20(95.2) | 6(60) | 13(66.7) | 18(94.7) | 8(66.7) | 14(66.7) | 22(91.7) | 4(50) | 14(70) | 16(94.1) | 8(66.7) | 16(69.6) | 18(94.7) | 6(66.7) | 14(66.7) | |||||
| Over-expressed | 1(4.8) | 5(40) | 6(33.3) | 1(5.3) | 4(33.3) | 7(33.3) | 2(8.3) | 4(50) | 6(30) | 1(5.9) | 4(33.3) | 7(30.4) | 1(5.3) | 4(33.3) | 7(33.3) | |||||
| Triple-Negative status | 0.272 | 0.104 | 0.051 | 0.087 | 0.104 | |||||||||||||||
| No | 15(71.4) | 9(90) | 13(61.9) | 14(73.7) | 11(91.7) | 12(57.1) | 18(75) | 8(1 0 0) | 11(55) | 13(76.5) | 11(91.7) | 13(56.5) | 14(73.7) | 11(91.7) | 12(57.1) | |||||
| Yes | 6(28.6) | 1(10) | 8(38.1) | 5(26.3) | 1(8.3) | 9(42.9) | 6(25) | 0(0) | 9(45) | 4(23.5) | 1(8.3) | 10(43.5) | 5(26.3) | 1(8.3) | 9(42.9) | |||||
| Luminal | ||||||||||||||||||||
| No | 7(33.3) | 5(50) | 15(71.4) | 6(31.6) | 5(41.7) | 16(76.2) | 8(33.3) | 4(50) | 15(75) | 5(29.4) | 5(41.7) | 17(73.9) | 6(31.6) | 5(41.7) | 16(76.2) | |||||
| Yes | 14(66.7) | 5(50) | 6(28.6) | 13(68.4) | 7(58.3) | 5(23.8) | 16(66.7) | 4(50) | 5(25) | 12(70.6) | 7(58.3) | 6(26.1) | 13(68.4) | 7(58.3) | 5(28.8) | |||||
| Adjuvant Chemotherapy | 0.52 | 0.423 | 0.459 | 0.459 | 0.423 | |||||||||||||||
| No | 7(33.3) | 3(30) | 4(19) | 7(36.8) | 2(16.7) | 4(19) | 6(25) | 2(25) | 5(25) | 6(35.3) | 3(25) | 4(17.4) | 7(36.8) | 2(16.7) | 4(19) | |||||
| Yes | 14(85.7) | 7(70) | 17(81) | 12(63.2) | 10(83.3) | 17(81) | 18(75) | 6(75) | 1575) | 11(64.7) | 9(75) | 19(82.6) | 12(63.2) | 10(83.3) | 17(81) | |||||
| Adjuvant Radiotherapy | 0.561 | 0.803 | 0.69 | 1 | 0.803 | |||||||||||||||
| No | 3(14.3) | 3(30) | 3(14.3) | 3(15.8) | 3(25) | 3(14.3) | 3(12.5) | 2(25) | 4(20) | 3(17.6) | 2(16.7) | 4(17.4) | 3(15.8) | 3(25) | 3(14.3) | |||||
| Yes | 18(85.7) | 7(70) | 18(85.7) | 16(84.2) | 9(75) | 18(85.7) | 21(87.5) | 6(75) | 16(80) | 14(82.4) | 10(83.3) | 19(82.6) | 16(84.2) | 9(75) | 18(85.7) | |||||
C1: cluster 1; C2: cluster 2; C3: cluster 3; a: mean (sd) or median (min, max).
Comparison of prediction for overall and specific survival between clusters at 5 and 10-year.
| Predict 5-year | Predict 10-year | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Overall Survival | Specific Survival | Overall Survival | Specific Survival | ||||||
| Methods | No. of patients | % [95% CI] | P-value | % [95% CI] | P-value | % [95% CI] | P-value | % [95% CI] | P-value |
| K-sparse | |||||||||
| Cluster 1 (n = 19) | 77% | 87% | 58% | 80% | |||||
| Cluster 2 (n = 12) | 71% | 81% | 53% | 75% | |||||
| Cluster 3 (n = 20) | 59% | 68% | 41% | 62% | |||||
| 0.1 | 0.241 | ||||||||
| Cluster 1 (n = 17) | 75% | 85% | 55% | 77% | |||||
| Cluster 2 (n = 12) | 72% | 83% | 55% | 79% | |||||
| Cluster 3 (n = 22) | 61% | 71% | 43% | 64% | |||||
| 0.203 | |||||||||
| Cluster 1 (n = 24) | 74% | 84% | 54% | 80% | |||||
| Cluster 2 (n = 7) | 72% | 83% | 56% | 75% | |||||
| Cluster 3 (n = 20) | 61% | 70% | 42% | 62% | |||||
| 0.077 | |||||||||
| Cluster 1 (n = 19) | 77% | 77% | 58% | 82% | |||||
| Cluster 2 (n = 12) | 71% | 71% | 52% | 75% | |||||
| Cluster 3 (n = 20) | 59% | 69% | 41% | 62% | |||||
| 0.055 | 0.085 | ||||||||
| Cluster 1 (n = 21) | 77% | 86% | 58% | 79% | |||||
| Cluster 2 (n = 10) | 69% | 80% | 52% | 77% | |||||
| Cluster 3 (n = 20) | 60% | 69% | 41% | 63% | |||||
Fig. 3Venn diagram of metabolic that were in common or unique to the five clustering methods.
Table indicating which metabolites are in each intersection or are unique to a certain list.
| Clustering Methods | Nbr | Metabolites | |
|---|---|---|---|
| 5 | K-Sparse | 2 | Creatine; |
| 4 | K-Sparse | 1 | Triethanolamine; |
| K-Sparse | 2 | ||
| K-Sparse | 2 | ||
| PCA K-means | 4 | Glutathione; Isoleucyl-Methionine; Humulinic acid A; Alnustone; | |
| 3 | K-Sparse | 1 | Hydroxyprolyl-Valine; |
| K-Sparse | 20 | Aminoadipic acid; Methylmalonic acid; 1b-Furanoeudesm-4(15)-en-1-ol acetate; Glycerophosphocholine; Lidocaine; Adenosine monophosphate; 2-Methyl-3-ketovaleric acid; Liqcoumarin; p-Cresol sulfate; 2-Methylbutyroylcarnitine; Methoxsalen; Citramalic acid; Hypoxanthine; | |
| SIMLR | 4 | 2,5-Dichloro-4-oxohex-2-enedioate; Histidinyl-Isoleucine; 3-(4-Methyl-3-pentenyl)thiophene; (−)-Epigallocatechin | |
| PCA K-means | 3 | ||
| 2 | K-Sparse | 3 | 5-Hydroxyisourate; Hexanoylcarnitine; |
| K-Sparse | 9 | Creatinine; Proline; betaine; Erythronic acid; Garcinia acid; Thiolutin; 4-Chloro-1H-indole-3-acetic acid; Niacinamide 3-Dehydroxycarnitine; Dihydrothymine; | |
| SIMLR | 21 | 5b-Cyprinol sulfate; 2′,4-Dihydroxy-4′,6′-dimethoxychalcone; Propenoylcarnitine; 5-Hydroxyindoleacetic acid; Phaseolic acid Lisuride; 2-Bromophenol; (alpha-D-mannosyl)7-beta-D-mannosyl-diacetylchitobiosyl- | |
| SIMLR | 1 | Phosphoric acid; | |
| PCA K-means | 4 | I(−); | |
| 1 | K-Sparse | 10 | Prolylhydroxyproline; Guanidoacetic acid; Histamine; PC-M6; |
| SIMLR | 14 | 3-Hydroxy-6,8-dimethoxy-7(11)-eremophilen-12,8-olide; Glycerol tripropanoate; Alanyl-Isoleucine; 1-(2,4,6-Trimethoxyphenyl)-1,3-butanedione; 1-Oxo-1H-2-benzopyran-3-carboxaldehyde; 1,3,11-Tridecatriene-5,7,9-triyne; N-Acetyl- | |
| Spectral clustering | 13 | 2,3-diketogulonate; 2,5-Furandicarboxylic acid; Pyrrolidine; Piperidine; Beta-Alanine; Aspartyl- | |
| Sparse K-means | 3 | Erinapyrone C; Ergothioneine; N-Methylethanolaminium phosphate | |
| PCA K-means | 4 | Dimethylglycine; Pipecolic acid; Methyl (9Z)-10′-oxo-6,10′-diapo-6-carotenoate; N-Desmethylvenlafaxine | |
List of significant relevant pathways identified by 5 methods.
| K-Sparse method | |||||||
|---|---|---|---|---|---|---|---|
| Clusters Comparaison | Interaction metabolite | Pathway Name | Total Cmpd | Match Status | Raw | -log(p) | Impact |
| C1 vs C3 | UDP – glucose | Starch and sucrose metabolism | 50 | 1 | 0,0107 | 4,5388 | 0,1390 |
| UDP – glucose | Amino sugar and nucleotide sugar metabolism | 88 | 1 | 0,0107 | 4,5388 | 0,0928 | |
| UDP - glucose; Glyceric acid | Glycerolipid metabolism | 32 | 2 | 0,0153 | 4,1831 | 0,0206 | |
| SIMLR method | |||||||
| Clusters Comparaison | Interaction metabolite | Pathway Name | Total Cmpd | Match Status | P Value | -log(p) | Impact |
| C1 VS C2 | Glutathione; Oxidized glutathione; Glycine; | Glutathione metabolism | 38 | 12 | 0 | 12,826 | 0,3628 |
| Ascorbic acid; Uridine diphosphate glucose; Pyruvic acid; D-Glucuronic acid 1-phosphate; Oxoglutaric acid; | Ascorbate and aldarate metabolism | 45 | 5 | 0 | 12,469 | 0,1383 | |
| Tryptophan metabolism | 79 | 8 | 0,0001 | 9,1233 | 0,2741 | ||
| 5′-Methylthioadenosine; N-Formyl- | Cysteine and methionine metabolism | 56 | 9 | 0,0008 | 7,1674 | 0,2509 | |
| Purine metabolism | 92 | 17 | 0,0011 | 6,8091 | 0,2048 | ||
| Glyoxylic acid; Oxoglutaric acid; N-Formyl- | Glyoxylate and dicarboxylate metabolism | 50 | 6 | 0,0027 | 5,9281 | 0,268 | |
| Arginine and proline metabolism | 77 | 19 | 0,0053 | 5,238 | 0,6514 | ||
| Oxoglutaric acid; Oxalosuccinic acid; Pyruvic acid; | Citrate cycle (TCA cycle) | 20 | 3 | 0,0075 | 4,8991 | 0,176 | |
| D-Xylose; Uridine diphosphate glucose; D-Glucuronic acid 1-phosphate; Pyruvic acid; | Pentose and glucuronate interconversions | 53 | 4 | 0,0076 | 4,8821 | 0,0394 | |
| 2-Hydroxyethanesulfonate; Pyruvic acid; 3-Sulfinoalanine; | Taurine and hypotaurine metabolism | 20 | 3 | 0,0154 | 4,1754 | 0,0324 | |
| Glyceric acid; Betaine; Guanidoacetic acid; Dimethylglycine; Glycine; Phosphoserine; | Glycine, serine and threonine metabolism | 48 | 13 | 0,018 | 4,0154 | 0,46986 | |
| Uridine diphosphate glucose; D-Glucuronic acid 1-phosphate; N-Acetyl-D-Glucosamine 6-Phosphate; Uridine diphosphate-N-acetylglucosamine; Cytidine monophosphate N-acetylneuraminic acid; D-Glucose; D-Xylose | Amino sugar and nucleotide sugar metabolism | 88 | 7 | 0,0187 | 3,9783 | 0,1417 | |
| Formiminoglutamic acid; | Histidine metabolism | 44 | 10 | 0,0412 | 3,1903 | 0,3705 | |
| Pyridoxamine; Oxoglutaric acid; 3-Hydroxy-2-methylpyridine-4,5-dicarboxylate; Pyruvic acid; | Vitamin B6 metabolism | 32 | 4 | 0,0412 | 3,1898 | 0,0773 | |
| C1 VS C3 | Formiminoglutamic acid; | Histidine metabolism | 44 | 10 | 0,0139 | 4,2752 | 0,3705 |
| Phenylpyruvic acid; | Phenylalanine, tyrosine and tryptophan biosynthesis | 27 | 5 | 0,0189 | 3,9687 | 0,099 | |
| Tryptophan metabolism | 79 | 8 | 0 | 16,409 | 0,2741 | ||
| C2 VS C3 | Glutathione; Oxidized glutathione; Glycine; | Glutathione metabolism | 38 | 12 | 0 | 16,133 | 0,3628 |
| Ascorbic acid; Uridine diphosphate glucose; Pyruvic acid; D-Glucuronic acid 1-phosphate; Oxoglutaric acid | Ascorbate and aldarate metabolism | 45 | 5 | 0 | 13,096 | 0,1383 | |
| 5′-Methylthioadenosine; N-Formyl- | Cysteine and methionine metabolism | 56 | 9 | 0,0001 | 9,8548 | 0,2509 | |
| Phenylpyruvic acid; | Phenylalanine, tyrosine and tryptophan biosynthesis | 27 | 5 | 0,0001 | 8,9814 | 0,099 | |
| Aminoacyl-tRNA biosynthesis | 75 | 14 | 0,0002 | 8,758 | 0,1127 | ||
| Glyoxylic acid; Oxoglutaric acid; N-Formyl- | Glyoxylate and dicarboxylate metabolism | 50 | 6 | 0,0004 | 7,7271 | 0,268 | |
| Purine metabolism | 92 | 17 | 0,0007 | 7,306 | 0,2048 | ||
| Malonic acid; Beta-Alanine; Spermine; Spermidine; Dihydrouracil; Pantothenic acid; Uracil; | beta-Alanine metabolism | 28 | 8 | 0,0012 | 6,7568 | 0,3577 | |
| Uridine 5′-monophosphate; | Pyrimidine metabolism | 60 | 13 | 0,0014 | 6,5817 | 0,2756 | |
| Pantothenic acid; Dihydrouracil; Beta-Alanine; Pyruvic acid; Adenosine 3′,5′-diphosphate; Uracil; | Pantothenate and CoA biosynthesis | 27 | 6 | 0,0023 | 6,0879 | 0,2736 | |
| Phenylalanine metabolism | 45 | 6 | 0,0072 | 4,9364 | 0,2468 | ||
| D-Glutamine and D-glutamate metabolism | 11 | 3 | 0,0124 | 4,39 | 0,139 | ||
| Arginine and proline metabolism | 77 | 19 | 0,0169 | 4,082 | 0,6514 | ||
| 2-Hydroxyethanesulfonate; Pyruvic acid; 3-Sulfinoalanine; | Taurine and hypotaurine metabolism | 20 | 3 | 0,0215 | 3,8411 | 0,0324 | |
| N-Acetyl- | Alanine, aspartate and glutamate metabolism | 24 | 7 | 0,0221 | 3,8108 | 0,4122 | |
| Pyridoxamine; Oxoglutaric acid; 3-Hydroxy-2-methylpyridine-4,5-dicarboxylate; Pyruvic acid; | Vitamin B6 metabolism | 32 | 4 | 0,0267 | 3,6235 | 0,0773 | |
| Oxoglutaric acid; Oxalosuccinic acid; Pyruvic acid | Citrate cycle (TCA cycle) | 20 | 3 | 0,0302 | 3,5015 | 0,176 | |
| Glyceric acid; Betaine; Guanidoacetic acid; Dimethylglycine; Glycine; Phosphoserine; | Glycine, serine and threonine metabolism | 48 | 13 | 0,0372 | 3,2914 | 0,4699 | |
| Uridine diphosphate glucose; Glycerol 3-phosphate; Glycerol; Glyceric acid; Galactosylglycerol; | Glycerolipid metabolism | 32 | 5 | 0,0427 | 3,1546 | 0,2162 | |
| D-Xylose; Uridine diphosphate glucose; D-Glucuronic acid 1-phosphate; Pyruvic acid; | Pentose and glucuronate interconversions | 53 | 4 | 0,0427 | 3,1536 | 0,0394 | |
| Sparse K-means method | |||||||
| Clusters Comparaison | Interaction metabolite | Total Cmpd | Match Status | Raw p | -log(p) | Impact | |
| C1 VS C2 | Cysteine and methionine metabolism | 56 | 2 | 0.007 | 4.9 | 0.0454 | |
| C1 VS C3 | Cysteine and methionine metabolism | 56 | 2 | 0.0020 | 6.2 | 0.00454 | |
| Spectral clustering method | |||||||
| Clusters Comparaison | Interaction metabolite | Pathway Name | Total Cmpd | Match Status | Raw p | -log(p) | Impact |
| C1 VS C3 | Iminoaspartic acid; Quinolinic acid; Niacinamide; Pyruvic acid; Propionic acid; | Nicotinate and nicotinamide metabolism | 44 | 5 | 0,0024 | 6,0206 | 0,0712 |
| Glyceric acid; Betaine; Guanidoacetic acid; Dimethylglycine; Glycine; Phosphoserine; | Glycine, serine and threonine metabolism | 48 | 13 | 0,0040 | 5,5100 | 0,4699 | |
| 5′-Methylthioadenosine; N-Formyl- | Cysteine and methionine metabolism | 56 | 9 | 0,0098 | 4,6232 | 0,2509 | |
| Formiminoglutamic acid; | Histidine metabolism | 44 | 10 | 0,0101 | 4,5961 | 0,3705 | |
| xoglutaric acid; Oxalosuccinic acid; Pyruvic acid; | Citrate cycle (TCA cycle) | 20 | 3 | 0,0171 | 4,0710 | 0,1760 | |
| Pyruvic acid; | Valine, leucine and isoleucine biosynthesis | 27 | 3 | 0,0178 | 4,0277 | 0,0350 | |
| D-Xylose; Uridine diphosphate glucose; D-Glucuronic acid 1-phosphate; Pyruvic acid; | Pentose and glucuronate interconversions | 53 | 4 | 0,0210 | 3,8609 | 0,0394 | |
| D-Glucose; Glyceric acid; Pyruvic acid; | Pentose phosphate pathway | 32 | 3 | 0,0232 | 3,7622 | 0,0218 | |
| Pyruvic acid; | Glycolysis or Gluconeogenesis | 31 | 3 | 0,0249 | 3,6928 | 0,0953 | |
| Pyruvic acid; | Pyruvate metabolism | 32 | 2 | 0,0274 | 3,5955 | 0,3201 | |
| Butanoate metabolism | 40 | 4 | 0,0283 | 3,5644 | 0,0852 | ||
| 2-Hydroxyethanesulfonate; Pyruvic acid; 3-Sulfinoalanine; | Taurine and hypotaurine metabolism | 20 | 3 | 0,0287 | 3,5525 | 0,0324 | |
| Glyoxylic acid; Oxoglutaric acid; N-Formyl- | Glyoxylate and dicarboxylate metabolism | 50 | 6 | 0,0303 | 3,4966 | 0,2680 | |
| Ascorbic acid; Uridine diphosphate glucose; Pyruvic acid; D-Glucuronic acid 1-phosphate; Oxoglutaric acid; | Ascorbate and aldarate metabolism | 45 | 5 | 0,0330 | 3,4104 | 0,1383 | |
| Epinephrine; Dopamine; | Tyrosine metabolism | 76 | 5 | 0,0385 | 3,2580 | 0,1750 | |
| N-Acetyl- | Alanine, aspartate and glutamate metabolism | 24 | 7 | 0,0390 | 3,2431 | 0,4122 | |
| Pyridoxamine; Oxoglutaric acid; 3-Hydroxy-2-methylpyridine-4,5-dicarboxylate; Pyruvic acid; | Vitamin B6 metabolism | 32 | 4 | 0,0447 | 3,1074 | 0,0773 | |
| PCA K-means method | |||||||
| Clusters Comparaison | Interaction metabolite | Pathway Name | Total Cmpd | Match Status | Raw p | -log(p) | Impact |
| C1 vs C3 | Iminoaspartic acid; Quinolinic acid; Niacinamide; Pyruvic acid; Propionic acid; | Nicotinate and nicotinamide metabolism | 44 | 5 | 0,003 | 5,9412 | 0,0712 |
| Oxoglutaric acid; Oxalosuccinic acid; Pyruvic acid; | Citrate cycle (TCA cycle) | 20 | 3 | 0,011 | 4,4865 | 0,1760 | |
| Epinephrine; Dopamine; | Tyrosine metabolism | 76 | 5 | 0,024 | 3,7311 | 0,1750 | |
| Pyruvic acid; | Pyruvate metabolism | 32 | 2 | 0,043 | 3,1507 | 0,3201 | |
| D-Xylose; Uridine diphosphate glucose; D-Glucuronic acid 1-phosphate; Pyruvic acid; | Pentose and glucuronate interconversions | 53 | 4 | 0,044 | 3,1214 | 0,0394 | |
| Pyruvic acid; | Valine, leucine and isoleucine biosynthesis | 27 | 3 | 0,045 | 3,1107 | 0,0350 | |
| Ascorbic acid; Uridine diphosphate glucose; Pyruvic acid; D-Glucuronic acid 1-phosphate; Oxoglutaric acid; | Ascorbate and aldarate metabolism | 45 | 5 | 0,045 | 3,0926 | 0,1383 | |
| Butanoate metabolism | 40 | 4 | 0,046 | 3,0843 | 0,0852 | ||
| D-Glucose; Glyceric acid; Pyruvic acid; | Pentose phosphate pathway | 32 | 3 | 0,046 | 3,0769 | 0,0218 | |
| N-Acetyl- | Alanine, aspartate and glutamate metabolism | 24 | 7 | 0,048 | 3,0446 | 0,4122 | |
Total cmpd is the total number of compounds in the pathway.
Hits is the actual matched number from the uploaded data.
Raw p is the original p-value calculated from the pathway analysis.
Impact is the pathway impact value calculated from pathway topology analysis.
Fig. 4Venn diagram of pathways that were in common or unique to the five clustering methods.
Fig. 5Boxplot of the 8 metabolites extracted from 5 ML methods.