| Literature DB >> 20015393 |
Alvaro Cuadros-Inostroza1, Camila Caldana, Henning Redestig, Miyako Kusano, Jan Lisec, Hugo Peña-Cortés, Lothar Willmitzer, Matthew A Hannah.
Abstract
BACKGROUND: Metabolite profiling, the simultaneous quantification of multiple metabolites in an experiment, is becoming increasingly popular, particularly with the rise of systems-level biology. The workhorse in this field is gas-chromatography hyphenated with mass spectrometry (GC-MS). The high-throughput of this technology coupled with a demand for large experiments has led to data pre-processing, i.e. the quantification of metabolites across samples, becoming a major bottleneck. Existing software has several limitations, including restricted maximum sample size, systematic errors and low flexibility. However, the biggest limitation is that the resulting data usually require extensive hand-curation, which is subjective and can typically take several days to weeks.Entities:
Mesh:
Substances:
Year: 2009 PMID: 20015393 PMCID: PMC3087348 DOI: 10.1186/1471-2105-10-428
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1TargetSearch flow chart. TargetSearch pre-processing flow chart for the analysis of GC-MS data.
Figure 2TargetSearch grahical user interface. A simple GUI for TargetSearch. Here, the user can set all the parameters, import or manually edit the sample files, retention markers definition and library for and run the analysis in one go.
Figure 3Retention index markers report. Retention time of the first four retention index markers in the standard mixture experiment. Samples and retention times are represented in the x and y-axis, respectively. Different days of measurement are indicated by numbers 2, 3 and 4. A so-called day effect is observed between days 2, and 3 and 4, manifested as a retention time shift.
Summary of the standard mixture metabolite profiling.
| Name | Mass count | RI | Score | RI dev | Sample Count | Corr Coef |
|---|---|---|---|---|---|---|
| Glycolic acid | 20 | 1056.8 | 977 | 6.2 | 27 | 0.874 |
| Alanine | 19 | 1090.0 | 950 | 0.7 | 27 | 0.968 |
| Valine | 20 | 1213.5 | 960 | -4.7 | 27 | 0.982 |
| Leucine | 7 | 1267.9 | 995 | -1.8 | 27 | 0.988 |
| Isoleucine | 7 | 1289.7 | 993 | -1.1 | 27 | 0.990 |
| Proline | 20 | 1299.2 | 869 | -1.2 | 27 | 0.987 |
| Nicotinic acid | 13 | 1301.5 | 959 | -1.7 | 27 | 0.986 |
| Glycine | 15 | 1305.0 | 790 | -0.5 | 27 | 0.976 |
| Fumaric acid | 20 | 1346.1 | 880 | 0.1 | 27 | 0.982 |
| Serine | 21 | 1353.2 | 929 | 0.1 | 27 | 0.991 |
| Threonine | 21 | 1380.1 | 917 | -0.6 | 27 | 0.997 |
| Glutaric acid | 20 | 1402.7 | 961 | -1.7 | 27 | 0.947 |
| Alanine, beta- | 21 | 1429.8 | 929 | -5.0 | 27 | 0.994 |
| Homoserine | 20 | 1443.7 | 949 | -1.5 | 27 | 0.820 |
| Aspartic acid | 20 | 1510.0 | 915 | -0.2 | 27 | 0.983 |
| Methionine | 20 | 1519.2 | 922 | -4.7 | 27 | 0.985 |
| Butyric acid, 4-amino- | 19 | 1530.0 | 967 | -3.5 | 27 | 0.950 |
| Glutaric acid, 2-oxo- | 19 | 1567.0 | 955 | 1.2 | 27 | 0.951 |
| Phenylalanine | 21 | 1635.3 | 936 | -6.0 | 27 | 0.982 |
| Ribose | Arabinose | 20 | 20 | 1662.7 | 1662.7 | 915 | 942 | 4.2 | -4.7 | 27 | 0.829 |
| Suberic acid | 20 | 1695.0 | 867 | -0.3 | 27 | 0.985 |
| Aconitic acid | 19 | 1736.9 | 977 | 3.6 | 27 | 0.919 |
| Shikimic acid | 15 | 1791.4 | 975 | 1.1 | 27 | 0.986 |
| Citric acid | 17 | 1806.1 | 920 | -1.6 | 27 | 0.948 |
| Isocitric acid | 9 | 1804.6 | 909 | 0.0 | 27 | 0.828 |
| Arginine | 16 | 1817.8 | 826 | -3.4 | 27 | 0.993 |
| Quinic acid | 14 | 1845.0 | 882 | -1.8 | 27 | 0.896 |
| Fructose | 22 | 1854.3 | 922 | 1.9 | 27 | 0.867 |
| Mannose | 21 | 1869.1 | 908 | -0.1 | 27 | 0.689 |
| Lysine | 20 | 1912.0 | 856 | 0.3 | 27 | 0.993 |
| Histidine | 17 | 1914.1 | 443 | -2.5 | 27 | 0.983 |
| Galacturonic acid | 22 | 1926.8 | 948 | 0.7 | 27 | 0.783 |
| Tyrosine | 21 | 1934.0 | 955 | -1.4 | 27 | 0.988 |
| Sinapic acid | 20 | 2056.1 | 962 | -2.4 | 27 | 0.868 |
| Inositol, myo- | 21 | 2085.0 | 911 | -1.2 | 27 | 0.959 |
| Caffeic acid | 21 | 2133.0 | 957 | 0.2 | 27 | 0.913 |
| Phytol | 21 | 2169.7 | 954 | -0.6 | 27 | 0.745 |
| Tryptamine | 19 | 2244.2 | 864 | -14.8 | 27 | 0.838 |
| Fructose-6-phosphate | 12 | 2286.3 | 661 | 6.5 | 27 | 0.738 |
| Cystine | 13 | 2290.0 | 860 | 0.6 | 27 | 0.971 |
| Glucose-6-phosphate | 21 | 2301.9 | 958 | 5.2 | 27 | 0.866 |
| Maltose | 4 | 2723.7 | 716 | 3.4 | 27 | 0.741 |
| Trehalose | 16 | 2729.8 | 853 | 0.3 | 27 | 0.949 |
The summarised metabolite profile of the chemical standards (see Additional file 6 for full information). Name: metabolite name; Mass count: the number of correlating masses; RI: averaged RI; Score: spectrum similarity according to (1); RI dev; RI deviation from the expected RI; Sample Count: the number of samples in which the metabolite was found; Corr Coef: the correlation coefficient between metabolite input concentration and abundance estimation.
Figure 4Comparison between identified metabolite spectra and reference spectra. Comparison example between the averaged (blue) and reference spectra (red) of glycolic acid (right) and histidine (left). This type of comparisons can be used to assess the quality of individual identified peaks. In this example, histidine may need a closer inspection to elucidate what causes the difference.
Figure 5Comparison between identified metabolites by TargetSearch and manual curation. The venn diagrams show the number of metabolites considered to be either present or not present by TargetSearch in comparison to the manual curation. Left hand side: 96 metabolites identified by TargetSearch that were confirmed by manual curation. Right hand side: 31 metabolites not identified by TargetSearch and confirmed by manual curation. Both diagrams: 5 metabolites found by our software but not confirmed by manual curation, whilst 7 metabolites were not reported by TargetSearch but were present in the chromatograms.
Performance comparison of pre-processing analysis using Tagfinder, XCMS and TargetSearch.
| Tagfinder | XCMS | TargetSearch | |
|---|---|---|---|
| Peak extraction | 3442 | 3634 | 2779 |
| Peak import | 271 | - | - |
| RI correction | 255 | 29 | - |
| Profile building | 3050 | 1740 | 195 |
All parameters were set to default in order to obtain the maximum number of samples, but only 1025 samples were available. Times were compared using 200 samples. Before running this analysis, Tagfinder intensity threshold was set to 200 to allow the 200 samples to be imported. Time is expressed in seconds. Peak import is not necessary in XCMS and TargetSearch. RI correction is performed together with Peak extraction in TargetSearch.