| Literature DB >> 19229523 |
Sonja Peters1, Ewoud van Velzen, Hans-Gerd Janssen.
Abstract
In chromatographic profiling applications, peak alignment is often essential as most chromatographic systems exhibit small peak shifts over time. When using currently available alignment algorithms, there are several parameters that determine the outcome of the alignment process. Selecting the optimum set of parameters, however, is not straightforward, and the quality of an alignment result is at least partly determined by subjective decisions. Here, we demonstrate a new strategy to objectively determine the quality of an alignment result. This strategy makes use of a set of control samples that are analysed both spiked and non-spiked. With this set, not only the system and the method can be checked but also the quality of the peak alignment can be evaluated. The developed strategy was tested on a representative metabolomics data set using three software packages, namely Markerlynx, MZmine and MetAlign. The results indicate that the method was able to assess and define the quality of an alignment process without any subjective interference of the analyst, making the method a valuable contribution to the data handling process of chromatography-based metabolomics data.Entities:
Mesh:
Year: 2009 PMID: 19229523 PMCID: PMC2694917 DOI: 10.1007/s00216-009-2662-7
Source DB: PubMed Journal: Anal Bioanal Chem ISSN: 1618-2642 Impact factor: 4.142
Fig. 1Schematic overview of the strategy developed to determine the optimum parameter settings for peak alignment
Retention times, molecular weights and the five most dominant mass peaks of the 19 spiked compounds used in this study
| Retention time [min] | Compound | Molecular weight after TMS derivatisation | Main mass fragments | ||||
|---|---|---|---|---|---|---|---|
| 10.18 | 208.1 | 65 | 91 | 119 | 193 | 208 | |
| 11.23 | 3-Phenylpropionic acid | 222.1 | 75 | 91 | 104 | 207 | 222 |
| 12.12 | Mandelic acid | 296.1 | 73 | 147 | 163 | 179 | 253 |
| 12.61 | Salicylic acid | 282.1 | 73 | 91 | 135 | 147 | 267 |
| 13.25 | 220.1 | 75 | 103 | 131 | 161 | 205 | |
| 13.52 | 3-Hydroxybenzoic acid | 282.1 | 73 | 193 | 223 | 267 | 282 |
| 14.14 | 3-Hydroxyphenylacetic acid | 296.1 | 73 | 147 | 164 | 281 | 296 |
| 14.42 | 4-Hydroxybenzoic acid | 282.1 | 73 | 193 | 223 | 267 | 282 |
| 15.97 | 2,3-Dihydroxybenzoic acid | 370.1 | 73 | 75 | 147 | 193 | 355 |
| 16.27 | 2,6-Dihydroxybenzoic acid | 370.1 | 73 | 75 | 147 | 267 | 355 |
| 16.97 | 308.1 | 73 | 147 | 161 | 293 | 308 | |
| 17.05 | 2,4-Dihydroxybenzoic acid | 370.1 | 73 | 147 | 281 | 355 | 370 |
| 17.13 | 3,4-Dihydroxybenzoic acid | 370.1 | 73 | 193 | 281 | 355 | 370 |
| 17.2 | 3,5-Dihydroxybenzoic acid | 370.1 | 73 | 75 | 147 | 355 | 370 |
| 17.90 | 308.1 | 73 | 203 | 219 | 293 | 308 | |
| 18.72 | 308.1 | 73 | 219 | 249 | 293 | 308 | |
| 18.96 | 3,4,5-Trihydroxybenzoic acid | 458.1 | 73 | 75 | 281 | 443 | 458 |
| 19.25 | 2,4,6-Trihydroxybenzoic acid | 458.1 | 73 | 75 | 147 | 355 | 443 |
| 21.17 | Caffeic acid | 397.1 | 73 | 75 | 219 | 381 | 396 |
Fig. 2Reconstructed chromatograms obtained with two sets of parameter using Markerlynx: one set resulting in good peak alignment (a) and one resulting in poor alignment (c). The spiked (purple) and non-spiked (black) chromatograms are overlaid in a and c, whereas their respective residual chromatograms are shown in b and d. These representative settings are taken from sets 5 and 11 using Markerlynx for peak alignment (see Table 2 for details)
Sets of parameters applied in Markerlynx, MZmine and MetAlign and the number of detected peaks as well as number of markers and false positives found
| Settings | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Markerlynx | |||||||||||||
| Mass tolerance (abs) | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.1 | 1 | 0.1 | 0.1 | 0.1 | ||
| Noise elimination level | 1 | 6 | 20 | 6 | 6 | 6 | 6 | 6 | 6 | 6 | 6 | ||
| Masses per retention time | 50 | 50 | 50 | 5 | 100 | 50 | 50 | 50 | 50 | 50 | 50 | ||
| Minimum intensity (%) | 1 | 1 | 1 | 1 | 1 | 10 | 1 | 1 | 1 | 1 | 1 | ||
| Mass window (Da) | 0.05 | 0.05 | 0.05 | 0.05 | 0.05 | 0.05 | 0.05 | 0.05 | 0.5 | 1 | 0.5 | ||
| Retention time window | 0.2 | 0.2 | 0.2 | 0.2 | 0.2 | 0.2 | 0.2 | 0.2 | 0.2 | 0.2 | 1 | ||
| Number of peaks | 13,296 | 4,920 | 2,781 | 2,434 | 2,882 | 1,925 | 4,415 | 4,350 | 2,596 | 2,256 | 1,398 | ||
| Number of spikes found | 18.4 | 18 | 18.3 | 3.7 | 18.4 | 6.9 | 18.8 | 18.4 | 17.7 | 17.8 | 8.1 | ||
| Number of false positives | 58.6 | 70.9 | 55.7 | 91 | 58 | 93.1 | 47.1 | 58.3 | 82.3 | 82.2 | 91.9 | ||
| MZmine | |||||||||||||
| 0.25 | 0.25 | 0.25 | 1 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 | 0.25 | |
| Noise level | 4 | 16 | 0.5 | 4 | 4 | 4 | 4 | 50 | 4 | 4 | 4 | 4 | 4 |
| Tolerance for | 0.05 | 0.05 | 0.05 | 0.05 | 0.5 | 1 | 0.05 | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 |
| Tolerance for intensity variation (%) | 20 | 20 | 20 | 20 | 20 | 20 | 50 | 20 | 10 | 20 | 20 | 20 | 20 |
| Balance between | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 10 | 40 | 5 | 10 | 10 |
| 0.2 | 0.2 | 0.2 | 0.2 | 0.2 | 0.2 | 0.2 | 0.2 | 0.2 | 0.2 | 0.2 | 1 | 0.2 | |
| RT tolerance size (%) | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 10 |
| Number of variables | 7,450 | 4,171 | 8,251 | 7,450 | 7,830 | 8,535 | 7,450 | 1,551 | 6,089 | 6,083 | 6,091 | 5,868 | 5,127 |
| Number of spikes found | 19 | 19 | 19 | 19 | 19 | 19 | 19 | 18.3 | 19 | 19 | 19 | 19 | 18.7 |
| Number of false positives | 6 | 13.7 | 7.1 | 6 | 5 | 5.3 | 6 | 55.8 | 3.8 | 3.5 | 4 | 3.9 | 51.1 |
| MetAlign | |||||||||||||
| Peak slope factor | 1 | 5 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||||
| Peak threshold factor | 2 | 2 | 10 | 2 | 2 | 2 | 2 | 2 | 2 | ||||
| Regions | 100/200 | 100/200 | 100/200 | 10/20 | 10/20 | 10/20 | 10/20 | 10/20 | 10/20 | ||||
| Alignment | Rough | Rough | Rough | Rough | It. | It. | It. | It. | It. | ||||
| Maximum shift per 100 scans | – | – | – | – | 40 | 60 | 40 | 40 | 40 | ||||
| Minimum factor | – | – | – | – | 10/5 | 10/5 | 50/5 | 50/5 | 10/5 | ||||
| Minimum number of masses | – | – | – | – | 4/2 | 4/2 | 4/2 | 8/2 | 8/2 | ||||
| Number of variables | 19,268 | 5,879 | 6,172 | 27,913 | 33,336 | 33,343 | 33,315 | 33,308 | 33,291 | ||||
| Number of spikes found | 19 | 19 | 19 | 19 | 19 | 19 | 19 | 19 | 19 | ||||
| Number of false positives | 7.8 | 9 | 7.8 | 3.2 | 3.7 | 3.7 | 10.4 | 10.4 | 3.7 | ||||