| Literature DB >> 24155035 |
Roman Fischer1, Paul Bowness, Benedikt M Kessler.
Abstract
Proteomic research facilities and laboratories are facing increasing demands for the integration of biological data from multiple '-OMICS' approaches. The aim to fully understand biological processes requires the integrated study of genomes, proteomes and metabolomes. While genomic and proteomic workflows are different, the study of the metabolome overlaps significantly with the latter, both in instrumentation and methodology. However, chemical diversity complicates an easy and direct access to the metabolome by mass spectrometry (MS). The present review provides an introduction into metabolomics workflows from the viewpoint of proteomic researchers. We compare the physicochemical properties of proteins and peptides with metabolites/small molecules to establish principle differences between these analyte classes based on human data. We highlight the implications this may have on sample preparation, separation, ionisation, detection and data analysis. We argue that a typical proteomic workflow (nLC-MS) can be exploited for the detection of a number of aliphatic and aromatic metabolites, including fatty acids, lipids, prostaglandins, di/tripeptides, steroids and vitamins, thereby providing a straightforward entry point for metabolomics-based studies. Limitations and requirements are discussed as well as extensions to the LC-MS workflow to expand the range of detectable molecular classes without investing in dedicated instrumentation such as GC-MS, CE-MS or NMR.Entities:
Keywords: Integration; Liquid chromatography; Mass spectrometry; Metabolomics; Technology
Mesh:
Year: 2013 PMID: 24155035 PMCID: PMC4265265 DOI: 10.1002/pmic.201300192
Source DB: PubMed Journal: Proteomics ISSN: 1615-9853 Impact factor: 3.984
Figure 2Mass redundancy of biomolecules – a challenge for identification by MS. (A) All 64 092 molecular entries of the METLIN database were sorted based on their molecular masses and then categorised in mass bins of 200 Da (X-axis). The total number of compounds per mass bin (red line) and the number of different masses (blue line) are displayed, indicating an uneven distribution of compounds and those sharing identical masses across the mass range (insert: higher resolved plot for mass range 0–1200 Da). (B) All protein entries from the SwissProt (UniKProt, 21 March 2012 release, containing 35 956 unique proteins incl. isoforms) database were digested in silico with trypsin (Protein Digestion Simulator by Matthew Monroe, PNNL (USA)) yielding 1 501 402 protein fragments between 400 and 4000 Da, sorted based on their molecular masses and then categorised in mass bins of 200 Da (X-axis). The total number of peptides per mass bin (red line), the number of unique peptides (green line) and the number of different masses (blue line) are displayed, indicating an uneven distribution of total and peptides with different molecular weights across the mass range.
Accessibility of selected metabolite classes using reversed and HILIC stationary phases
| Compound class | C18 | HILIC |
|---|---|---|
| Acyl glycines | ||
| Amino acids | ||
| Amino alcohols | ||
| Bile acids | ||
| Biotin and derivatives | ||
| Carbohydrates | ||
| Carnitines | ||
| Catecholamines and derivatives | ||
| Cobalamin derivatives | ||
| Coenzyme A derivatives | ||
| Dicarboxylic acids | ||
| Fatty acids | ||
| Glucuronides | ||
| Glycerolipids | ||
| Hydroxy acids | ||
| Indoles and indole derivatives | ||
| Keto acids | ||
| Leukotrienes | ||
| Lipoamides and derivatives | ||
| Nucleosides | ||
| Nucleotides | ||
| Peptides | ||
| Phospholipids | ||
| Polyamines | ||
| Polyphenols | ||
| Porphyrins | ||
| Prostanoids | ||
| Pterins | ||
| Purines and purine derivatives | ||
| Pyridoxals and derivatives | ||
| Pyrimidines and pyrimidine derivatives | ||
| Retinoids | ||
| Sphingolipids | ||
| Steroids and steroid derivatives | ||
| Sugar phosphates | ||
| Tricarboxylic acids |
Common requirements for various aspects of proteomic and metabolomic sample analysis
| Proteomics | LC-MS based metabolomics | |
|---|---|---|
| MS instrumentation | Ion trap, Q-TOF, QqQ, hybrid, Orbitrap | TOF, Q-TOF, QqQ, single-quad, Orbitrap |
| Ionisation | ESI, MALDI | ESI, APCI |
| Detector | MCP, electron multiplier, Orbitrap | MCP, electron multiplier, Orbitrap |
| Polarity | Positive | Positive/negative |
| High resolution | Required | Optional |
| High mass accuracy (MS1) | As high as possible | 70 ppm or better |
| High scan speed | Required | Optional |
| High sensitivity | Required | Required/less critical |
| High dynamic range | Required | Required |
| MSn capability | Required | Optional (comparison to standards) |
| High mass accuracy (MSn) | Optional | Required |
| High resolution | Optional | Required |
| Chromatographic separation | Required | Required (screening) |
| Column chemistry | RP (HILIC) | RP, HILIC, others |
| Nano-flow | Required | Normal flow preferred |
| Injection volume | 0.5–10 μL | 1–100 μL |
| Long columns/gradients | Required | Optional |
| Low inter-day variability | Optional | Required |
| Software for analysis | Vendor-specific, commercial or free software | Vendor-specific and limited free software |
| MSMS analysis | Automated | Manual |
| Databases | Available for sequenced organisms | Incomplete |
| Identification of analyte | Required | Optional |
| Use of standards | Optional (absolute quantitation, SWATH) | Required |
Figure 1Conceptual differences between a typical proteomics workflow and possible metabolomics workflows. (A) A ‘shotgun’ proteomic discovery experiment will typically employ a pre-fractionation of the analyte pre- or post-proteolysis, followed by LC-MS/MS analysis. Identification of peptides/proteins is essential for both quantitation and interpretation. A metabolomic experiment requires a sample extraction compatible with the analytical workflow further downstream. A separation into hydrophilic and hydrophobic compounds (Supporting Information Fig. 2) can yield samples for HILIC and RP front-end separation. The quantitation of detected molecules builds the basis for further processing. Even without identification, a metabolomic footprint can be used for diagnostic purposes and differential analyses. (B) The NMR-based metabolite sample preparation and analysis is not limited towards compounds with physicochemical properties compatible with LC-MS. Minimal to no sample preparation is needed. However, NMR (as other powerful platforms for metabolomics such as GC-MS or TLC-GC-FID 115) is not a standard technique used in most proteomics laboratories and is considered less sensitive than MS.
Figure 3Effect of mass accuracy on measuring and identifying biomolecules. (A) Display of the number of different protein precursor masses present in the SwissProt (UniKProt) database that can be separated based on mass accuracy (ppm calculations are based on protein LST1, 1.419 kDa). The proteins are indicated in groups of 30 kDa mass bins (X-axis). (B) Display of the number of different small molecular compound masses present in the METLIN database that can be separated based on mass accuracy (ppm calculations are based on methane, 16.0313 Da). The compounds are indicated in groups of 100 Da mass bins (X-axis). (C) Display of the number of different peptide precursor masses derived from an in silico trypsin digestion of proteins present in the SwissProt (UniKProt) database that can be separated based on mass accuracy (ppm calculations are based on the unique peptide HNM (Q9C037–3), 400.1528814 Da). The peptides are indicated in groups of 200 Da mass bins (X-axis). A similar analysis was performed after in silico digestion of proteins with other proteolytic enzymes (Supporting Information Fig. 3 and Supporting Information Tables 1 and 2).