| Literature DB >> 35546254 |
Brechtje Hoegen1, Juliet E Hampstead1, Udo F H Engelke2, Purva Kulkarni2, Ron A Wevers2, Han G Brunner1,3, Karlien L M Coene2, Christian Gilissen1.
Abstract
Untargeted metabolomics (UM) allows for the simultaneous measurement of hundreds of metabolites in a single analytical run. The sheer amount of data generated in UM hampers its use in patient diagnostics because manual interpretation of all features is not feasible. Here, we describe the application of a pathway-based metabolite set enrichment analysis method to prioritise relevant biological pathways in UM data. We validate our method on a set of 55 patients with a diagnosed inherited metabolic disorder (IMD) and show that it complements feature-based prioritisation of biomarkers by placing the features in a biological context. In addition, we find that by taking enriched pathways shared across different IMDs, we can identify common drugs and compounds that could otherwise obscure genuine disease biomarkers in an enrichment method. Finally, we demonstrate the potential of this method to identify novel candidate biomarkers for known IMDs. Our results show the added value of pathway-based interpretation of UM data in IMD diagnostics context.Entities:
Keywords: biochemical pathways; biomarkers; cystathionine ß-synthase; inborn errors of metabolism; inherited metabolic disorders; mass spectrometry; metabolite set enrichment analysis; next-generation metabolic screening; untargeted metabolomics
Mesh:
Substances:
Year: 2022 PMID: 35546254 PMCID: PMC9544878 DOI: 10.1002/jimd.12522
Source DB: PubMed Journal: J Inherit Metab Dis ISSN: 0141-8955 Impact factor: 4.750
Samples and biomarker counts by inherited metabolic disorder (IMD). Samples in our cohort (n = 62) are summarised by the diagnosed IMD (left), IMD OMIM ID. We also list the number of theoretical biomarkers per IMD used in our analysis, separated into all 102 known biomarkers (left) and 54 biomarkers associated with a Kyoto Encyclopedia of Genes and Genomes (KEGG) or Small Molecule Pathway Database (SMPDB) pathway (right). Grey lines indicate that biomarker pathways were enriched for all cohort samples of the IMD indicated with the exception of phenylketonuria, for which biomarker pathways were enriched in only 8 out of 9 samples
| Biomarkers | Samples | |||
|---|---|---|---|---|
| IMD | OMIM | Total | Pathway | |
| 3‐Ureidopropionase deficiency | 613161 | 4 | 4 | 1 |
| 3β‐Hydroxy‐∆5‐C27‐steroid dehydrogenase deficiency | 607765 | 5 | 1 | 1 |
| ACSF3 deficiency (CMAMMA) | 614265 | 2 | 2 | 3 |
| Adenylosuccinate lyase deficiency | 103050 | 2 | 1 | 1 |
| Alkaptonuria | 203500 | 1 | 1 | 1 |
| Aminoacylase I deficiency | 609924 | 10 | 1 | 2 |
| Cystathionine ß‐synthase deficiency | 236200 | 2 | 2 | 1 |
| Dimethylglycine dehydrogenase deficiency | 605850 | 1 | 1 | 1 |
| Glutamate formimino transferase deficiency | 229100 | 2 | 2 | 1 |
| Glutaric aciduria Type I | 231670 | 2 | 1 | 1 |
| Guanidinoacetate methyltransferase deficiency | 601240 | 3 | 3 | 1 |
| Histidinemia | 235800 | 2 | 2 | 1 |
| Hyperlysinemia, Type I | 238700 | 3 | 2 | 2 |
| Hyperprolinemia, Type II | 239510 | 4 | 3 | 2 |
| Maple syrup urine disease | 248600 | 7 | 5 | 2 |
| Methionine Adenosyltransferase I/III deficiency | 250850 | 3 | 1 | 2 |
| Molybdenum cofactor deficiency | 252150 | 5 | 5 | 2 |
| NANS deficiency | 610442 | 1 | 1 | 1 |
| Ornithine aminotransferase deficiency | 258870 | 3 | 2 | 1 |
| Phenylketonuria | 261600 | 5 | 3 | 9 |
| Pyridoxine‐dependent epilepsy | 266100 | 3 | 3 | 1 |
| UMP synthase deficiency | 258900 | 2 | 2 | 1 |
| Xanthinuria, Type II | 603592 | 6 | 6 | 2 |
| (Very‐)Long‐chain acyl‐CoA dehydrogenase deficiency | 201475 | 3 | 0 | 1 |
| 3‐Hydroxy‐3‐methylglutaryl‐CoA lyase deficiency | 246450 | 6 | 0 | 5 |
| 3‐Ketothiolase deficiency | 203750 | 4 | 0 | 1 |
| 3‐Methylcrotonyl‐CoA carboxylase deficiency | 210200 | 3 | 0 | 5 |
| Cerebrotendinous xanthomatosis | 213700 | 1 | 0 | 5 |
| Medium‐chain acyl‐CoA dehydrogenase deficiency | 201450 | 7 | 0 | 5 |
| Total | 102 | 54 | 62 | |
Some of these samples belong to the same patient, see Table S2 for more details.
FIGURE 1Metabolite set enrichment analysis (MSEA) method overview. Visualisation of our untargeted metabolomics workflow, including MSEA. Analysis up to aberrant feature detection was performed using an in‐house pipeline described previously by Coene et al. Adjustments made to the in‐house pipeline and the implementation of MSEA and clustering steps are described in more detail in the materials and methods
The biomarker pathways that were prioritised for each inherited metabolic disorder (IMD) in the data by our metabolite set enrichment analysis (MSEA) implementation. Bold biomarkers were not detected in all samples; Table S4A shows the exact samples in which the biomarker was or was not measured. Table S4B gives a more extensive list of all theoretically biomarker pathways available, including medication and disease pathways
| IEM | OMIM | Pathway name | Aberrant biomarkers | Enriched |
|---|---|---|---|---|
| 3‐Ureidopropionase deficiency | 613161 | Beta‐alanine metabolism | Ureidopropionic acid | 1/1 |
| Pyrimidine metabolism |
| 1/1 | ||
| Pyrimidine metabolism | Ureidopropionic acid | 1/1 | ||
| Beta‐alanine metabolism | Ureidopropionic acid | 1/1 | ||
| 3β‐Hydroxy‐∆5‐C27‐steroid dehydrogenase deficiency | 607765 | Bile acid biosynthesis | 3b,7a‐Dihydroxy‐5‐cholestenoic acid | 1/1 |
| Primary bile acid biosynthesis | 3b,7a‐Dihydroxy‐5‐cholestenoic acid | 1/1 | ||
| Cystathionine ß‐synthase deficiency | 236200 | Betaine metabolism |
| 1/1 |
| Methionine Metabolism |
| 1/1 | ||
| Cysteine and methionine metabolism |
| 1/1 | ||
| 2‐Oxocarboxylic acid metabolism |
| 1/1 | ||
| Histidinemia | 235800 | Histidine metabolism | Histidine; imidazole lactic acid | 1/1 |
| Hyperlysinemia, Type I | 238700 | Biotin metabolism | Lysine | 1/2 |
| Lysine degradation | Lysine; | 1/2 | ||
| tRNA charging: lysine | Lysine | 2/2 | ||
| Lysine biosynthesis | Lysine | 1/2 | ||
| Lysine degradation | Lysine; | 2/2 | ||
| Biotin metabolism | Lysine | 1/2 | ||
| Hyperprolinemia, Type II | 239510 | Arginine and proline metabolism | Proline; 1‐pyrroline‐2‐carboxylic acid | 1/2 |
| tRNA charging: proline | Proline | 2/2 | ||
| Arginine and proline metabolism | Proline; 1‐pyrroline‐2‐carboxylic acid; pyrrole‐2‐carboxylic acid | 2/2 | ||
| Biosynthesis of amino acids | Proline | 1/2 | ||
| ABC transporters | Proline | 1/2 | ||
| Protein digestion and absorption | Proline | 1/2 | ||
| Mineral absorption | Proline | 1/2 | ||
| Maple syrup urine disease | 248600 | Valine, leucine and isoleucine degradation | Leucine; isoleucine; ketoleucine; | 2/2 |
| Valine, leucine and isoleucine degradation | Leucine; isoleucine; ketoleucine; | 2/2 | ||
| Valine, leucine and isoleucine biosynthesis | Leucine; isoleucine; ketoleucine; | 2/2 | ||
| 2‐Oxocarboxylic acid metabolism | Leucine; isoleucine; ketoleucine; | 2/2 | ||
| Biosynthesis of amino acids | Leucine; isoleucine; ketoleucine; | 1/2 | ||
| Central carbon metabolism in cancer | Leucine; isoleucine | 1/2 | ||
| Methionine adenosyltransferase I/III deficiency | 250850 | Spermidine and spermine biosynthesis |
| 1/2 |
| tRNA charging: methionine |
| 2/2 | ||
| Phenylketonuria | 261600 | Phenylalanine and tyrosine metabolism | Phenylalanine | 2/9 |
| tRNA charging: phenylalanine | Phenylalanine | 8/9 | ||
| Phenylalanine metabolism | Phenylalanine; | 4/9 | ||
| Phenylalanine, tyrosine and tryptophan biosynthesis | Phenylalanine | 1/9 | ||
| Aminoacyl‐tRNA biosynthesis | Phenylalanine | 1/9 | ||
| Protein digestion and absorption | Phenylalanine | 1/9 | ||
| Mineral absorption | Phenylalanine | 1/9 | ||
| UMP synthase deficiency | 258900 | Pyrimidine metabolism | Orotic acid; dihydroorotic acid | 1/1 |
| Pyrimidine metabolism | Orotic acid; dihydroorotic acid | 1/1 | ||
| Xanthinuria, Type II | 603592 | Purine metabolism | Xanthosine; xanthine; | 1/2 |
| Purine metabolism | Xanthosine; xanthine; | 1/2 | ||
| Caffeine metabolism | Xanthosine; xanthine | 2/2 | ||
| Glutaric aciduria Type I | 231670 | Fatty acid degradation |
| 1/1 |
FIGURE 2Metabolite set enrichment analysis (MSEA) prioritises known inherited metabolic disorder (IMD) biomarkers. (A) The row index distribution of all biomarker‐associated features are shown ranked by feature intensity (yellow), MSEA pathway p‐value (purple) and MSEA cluster p‐value (blue; see Section 2). (B) An example patient (00057) with cystathionine ß‐synthase (CBS) deficiency is shown. The number of biomarker‐associated features in each category (significant features, yellow; significant features within an MSEA‐enriched pathway, purple; significant features within an MSEA cluster, blue) is indicated by a line. Along this line, biomarker‐associated features are shown; their position indicates where they rank in the feature distribution sorted as described in (A). (C) The same example patient (00057) with CBS deficiency is shown. Here the length of the line represents the number of pathways (purple) or clustered pathways (blue) and points represent the distribution of known biomarkers in these pathways ranked by pathway p‐value (purple) or cluster p‐value (blue, see Section 2). (D) A pathway depicting what metabolites we found to be aberrant in a CBS deficiency patient, who were in close proximity to CBS. Bold metabolites are known biomarkers on IEMbase, underlined metabolites are part of our own metabolite panel, red metabolites have aberrant features associated to them in our data and metabolites in italics are false positive hits in the pathway as the associated aberrant feature was incorrectly annotated, see Table S4C–F for more details.
Metabolite set enrichment analysis (MSEA) pathway enrichment can aid in the discovery of novel IMD biomarker metabolites. For each MSEA‐enriched pathway containing a known biomarker, we identified all other metabolites in the pathway associated with an enriched feature following correction for multiple testing (Bonferroni–Holm p < 0.05, right column). We removed known biomarkers from this list (Table 1). We report the remainder as putatively novel biomarkers for the indicated IMD pending analytical and functional evaluation and literature review
| Sample | Diagnosis | Enriched biomarker pathway | Putative novel metabolite biomarker |
|---|---|---|---|
| RadboudUMC_1 | 3‐Ureidopropionase deficiency | SMP0000007; SMP0000046; hsa00240; hsa00410 | Carnosine (HMDB0000033/C00386); 1,3‐diaminopropane (HMDB0000002/C00986); carbon dioxide (HMDB0001967); flavin mononucleotide (HMDB0001520); |
| RadboudUMC_2 | Cystathionine ß‐synthase deficiency | SMP0000033; hsa00270; hsa01210 |
|
| RadboudUMC_3 | Histidinemia | hsa00340 | Imidazole‐4‐acetaldehyde (C05130); imidazoleacetic acid riboside (C05131); 4‐imidazolone‐5‐propionic acid (C03680); gamma‐ |
| RadboudUMC_4 | Hyperlysinemia, Type I | hsa00300; | (2 |
| RadboudUMC_31 | Hyperlysinemia, Type I | hsa00310 |
|
| RadboudUMC_5 | Hyperprolinemia, Type II | SMP0000020; |
|
| RadboudUMC_32 | Hyperprolinemia, Type II | hsa00330 | 2‐Oxoarginine (C03771); ( |
| RadboudUMC_14 | 3β‐Hydroxy‐∆5‐C27‐steroid dehydrogenase deficiency | SMP0000035; hsa00120 | Cholic acid (HMDB0000619/C00695); 7alpha‐hydroxy‐3‐oxo‐4‐cholestenoate (HMDB0012458/C17337) |
| RadboudUMC_33 | Maple syrup urine disease | SMP0000032; hsa00280; hsa01210; hsa05230 |
|
| RadboudUMC_34 | Maple syrup urine disease | SMP0000032; hsa00280; hsa00290; hsa01230 | ( |
| RadboudUMC_46 | Phenylketonuria | hsa00360 | 3‐Hydroxyphenylacetic acid (C05593); vanillin (C00755); 3‐(3‐hydroxyphenyl)propanoic acid (C11457); 1‐phenyl‐1,2‐propanedione (C17268); ortho‐hydroxyphenylacetic acid (C05852); phenylacetylglycine (C05598); phenylacetic acid (C07086); |
| RadboudUMC_47 | Phenylketonuria | hsa00360 | 1‐Phenyl‐1,2‐propanedione (C17268); phenylacetic acid (C07086); |
| RadboudUMC_49 | Phenylketonuria |
| Phenylacetic acid (C07086); |
| RadboudUMC_50a | Phenylketonuria | hsa00360 | Phenylacetic acid (C07086); |
| RadboudUMC_53 | UMP synthase deficiency | SMP0000046; hsa00240 | Uracil (HMDB0000300/C00106); hydroxypropionic acid (C01013) |
| RadboudUMC_55 | Xanthinuria, Type II | SMP0000050; hsa00230; hsa00232 | SAICAR (HMDB0000797/C04823); ( |
FIGURE 3Biomarker‐containing pathways are relatively inherited metabolic disorder (IMD) specific. (A) The counts of biomarker‐containing and non‐biomarker‐containing pathways across 64 samples were plotted by the number of IMDs that share enrichment of each pathway. Biomarker‐containing pathways are significantly more IMD‐specific overall than non‐biomarker‐containing pathways (Wilcoxon p = 0.016). We indicate a set of pathways enriched by common non‐steroidal anti‐inflammatory drugs (NSAIDs) taken by a subset of our cohort with asterisk symbol (Table S5). If we remove these confounding pathways, biomarker‐containing pathways are not significantly more IMD specific (Wilcoxon p = 0.13)