| Literature DB >> 32316423 |
Ellen Kuang1, Matthew Marney2, Daniel Cuevas3, Robert A Edwards2,3,4, Erica M Forsberg1,2,3.
Abstract
Genomics-based metabolic models of microorganisms currently have no easy way of corroborating predicted biomass with the actual metabolites being produced. This study uses untargeted mass spectrometry-based metabolomics data to generate a list of accurate metabolite masses produced from the human commensal bacteria Citrobacter sedlakii grown in the presence of a simple glucose carbon source. A genomics-based flux balance metabolic model of this bacterium was previously generated using the bioinformatics tool PyFBA and phenotypic growth curve data. The high-resolution mass spectrometry data obtained through timed metabolic extractions were integrated with the predicted metabolic model through a program called MS_FBA. This program correlated untargeted metabolomics features from C. sedlakii with 218 of the 699 metabolites in the model using an exact mass match, with 51 metabolites further confirmed using predicted isotope ratios. Over 1400 metabolites were matched with additional metabolites in the ModelSEED database, indicating the need to incorporate more specific gene annotations into the predictive model through metabolomics-guided gap filling.Entities:
Keywords: bioinformatics; flux balance analysis; mass spectrometry; metabolomics; microbiome; multiomics
Year: 2020 PMID: 32316423 PMCID: PMC7240944 DOI: 10.3390/metabo10040156
Source DB: PubMed Journal: Metabolites ISSN: 2218-1989
Figure 1MS_FBA workflow illustrating the necessary input files and algorithmic processes involved in integrating metabolomics data with genome-based metabolic models. The algorithm requires two sets of data: one from mid (M), late (L) and stationary (S) phase cell culture extracts that have been analyzed using untargeted liquid chromatography mass spectrometry (LCMS) that have been converted to mzxml file format; the other is a metabolite list generated from flux balance analysis software, in this case we use PyFBA. The m/z features are searched within the metabolite list based on accurate mass and isotope ratios, then output as a ranked list and graphical visualization.
Figure 2A 3D graphical output from MS_FBA showing features with potential metabolite matches between significant features from reverse phase data and the PyFBA predicted metabolites list. There are two data points for every feature: the intensity difference between late-log phase and mid-log phase sample classes (red), and the intensity difference between the stationary phase and late-log phase sample classes (blue).
Figure 3Mass spectrum taken from late stationary phase sample L5, depicting the metabolite feature eluting at 1.9 min correlating to glutamic acid and [M+H]+ at 148.0596 m/z. Identified in the spectrum are the loss of carboxylic acid [M-COOH+H]+, the loss of water [M-H2O+H]+, the sodium adduct [M+Na]+, the dimer [2M+H]+, and the trimer [3M+H]+. The inset graph shows the production and decline of the feature identified as glutamic acid for mid, late, and stationary phases, with box and whisker plots from the five biological replicates.
Summary of metabolite features matched to reverse phase (RP) and hydrophilic interaction liquid chromatogprahy (HILIC) data using MS_FBA.
| Features Matched | RP | HILIC |
|---|---|---|
| XCMS features (pre-MS_FBA) | 1210 | 1560 |
| Matched to PyFBA | 135 | 118 |
| PyFBA isotope matches | 23 | 28 |
| Matched to Model SEED | 846 | 621 |
| Model SEED isotope matches | 107 | 90 |
| Unique annotated metabolites to PyFBA | 218 | |
| PyFBA compounds in search list | 699 | |
| Model SEED compounds in search list | 27,693 | |
Figure 4A summary of metabolite features MS_FBA-matched to the predicted metabolites generated from the PyFBA metabolic model with either accurate mass only (blue) or isotope ratio (red). The total column is the combination of both RP positive mode and HILIC negative runs, without excluding duplicate features between polarities.