| Literature DB >> 24957991 |
Trung Nghia Vu1, Kris Laukens2.
Abstract
One of the most significant challenges in the comparative analysis of Nuclear Magnetic Resonance (NMR) metabolome profiles is the occurrence of shifts between peaks across different spectra, for example caused by fluctuations in pH, temperature, instrument factors and ion content. Proper alignment of spectral peaks is therefore often a crucial preprocessing step prior to downstream quantitative analysis. Various alignment methods have been developed specifically for this purpose. Other methods were originally developed to align other data types (GC, LC, SELDI-MS, etc.), but can also be applied to NMR data. This review discusses the available methods, as well as related problems such as reference determination or the evaluation of alignment quality. We present a generic alignment framework that allows for comparison and classification of different alignment approaches according to their algorithmic principles, and we discuss their performance.Entities:
Year: 2013 PMID: 24957991 PMCID: PMC3901265 DOI: 10.3390/metabo3020259
Source DB: PubMed Journal: Metabolites ISSN: 2218-1989
Figure 1Hypothetical examples of how binning addresses peak shifts. (a) A good binning—each peak corresponding between the red and blue spectrum end up the same bin. (b) Multiple peaks end up in a single bin. (c) Incorrect matching of overlapping peaks—the first peak of the red spectrum falls in the same bin as the second peak of the blue spectrum. (d) Peak shifts across the boundaries of bins.
Figure 2Example of a spectral region before and after alignment using CluPA [8]. The example comes from the Wine dataset [9,10].
List of methods and their features.
| Short Name | Full Name | Reference | Technique | Target Function | Peak Picking? | Number of Parameters | Original Applied Data | Segment-Wise? | Pair-Wise? | Correction Method | Software |
|---|---|---|---|---|---|---|---|---|---|---|---|
| PLF | Partial Linear Fit | [ | Segmentation model by consecutive peaks distances less than window size D | Sum of squared differences in intensity | No | 2 (window | 1D NMR | Yes | Yes | Shift | NA |
| COW | Correlation Optimized Warping | [ | Dynamic programming | Pearson correlation coefficient | No | 2 ( | Chromatograpic data | yes | Yes | Insert and deletion | (1) |
| PAGA | Peak alignment by genetic algorithn | [ | Genetic Algorithm | Pearson correlation coefficient | No | 6 - Based on GA (normalize geometric ranking | 1D NMR | Yes | Yes | Shift & Insert and deletion | NA |
| PARS | Peak alignment using Reduced Set | [ | Breadth first search (BFS), Dynamic Programming (DP), complexity reduced dynamic programming (crDP) | Euclidean distances | Yes | 2 (search window size, mismatch weight) | 1D NMR, Gas Chromatography | No | Yes | Shift | (+) |
| DTW | Dynamic Time Warping | [ | Dynamic programming | Squared Euclidean distance | No | 2 ( | Chromatograpic data | No | Yes | Insert and deletion | (1) |
| PABS | Peak alignment by Beam search | [ | Beam search algorithm | Pearson correlation coefficient | No | 3 (ranges of segment number, sideway movement and interpolation) | 1D NMR | Yes | Yes | Shift & Insert and deletion | (+) |
| PAPCA (*) | Peak alignment by PCA | [ | Principle Component Analysis | CORREL | No | 1 (correlation threshold 0.8) | 1D NMR | No | No | Shift | (+) |
| PTW | Parametric Time Warping | [ | Global polynominal model | Root mean squared (RMS) | No | 1 (degree of polynomial warping function) | Chromatograpic data | No | Yes | Polynominal model | (2) |
| PAFFT | Peak alignment by FFT | [ | FFT + segmentation model by equal size segments | FFT cross-correlation | No | 2 (segment size: | Chromatograpic data | Yes | Yes | Shift | (3) |
| RAFFT | Recursive alignment by FFT | [ | FFT + Recursive segmentation model from global to local | FFT cross-correlation | No | 1 (max. allowable shift) | Chromatograpic data | Yes | Yes | Shift | (3) |
| SpecAlign | NA | [ | Sliding windows | Minimal matched peak distances | No | 1 (window size | Mass Spectrometry | No | Yes | Insert and deletion | (3) |
| FW | Fuzzy Warping | [ | Fuzzy logic for matching most intense peaks | Maximize fuzzy membership Gaussian function | Yes | 1 (the number of most intense peaks) | 1D NMR | No | Yes | Insert and deletion | (4) |
| GFHT | Generlized Fuzzy Hought Transform | [ | Hough transform | Hough score | Yes | 3 (expansion factor alpha, step size, lower vote threshold) | 1D NMR | No | No | NA | NA |
| RSPA | Recursive segment-wise peak alignment | [ | Recursive segmentation model | FFT cross-correlation | Yes | 6 (peak height threshold, splitting threshold, min. segment size, validation of segment alignment, max. allowable shift, alignment acceptance) | 1D NMR | Yes | Yes | Shift & Insert and deletion | (+) |
| PCANS | Progressive Consensus Alignment of NMR Spectra | [ | Segmentation model+Dynamic programming + progressive consensus alignment | Scoring by similarity between peaks calculated by height, half height and position of peaks | Yes | 5 | 1D NMR | Yes | No | Shift | (5) |
| BAA (*) | Bayesian approach for alignment | [ | Bayesian modeling | Bayesian estimation | No | 3 (noise variance, two parameter values in diagonal entries of diagonal covariance matrix) | 1D NMR | No | Yes | Polynomial model | NA |
| icoshift | interval correlation shifting | [ | Segmentation model by equal size segments or manually selecting segments | FFT cross-correlation | No | 2 (the number of intervals or the length of interval | 1D NMR | Yes | Yes | Shift & Insert and deletion | (6) |
| CluPA | hierarchial Cluster-based Peak Alignment | [ | Segmentation model by hierarchical clustering | FFT cross-correlation | Yes | 1 (max. allowable shift) | 1D NMR | Yes | Yes | Shift | (7) |
(*): This name is not from the authors, but assigned by us for convenience; NA: The software implementation information is not available; (+): The implementation of the algorithm can be requested from the authors; (1): http://www.models.life.ku.dk/DTW_COW/; (2): http://cran.r-project.org/web/packages/ptw/index.html/; (3): http://powcs.med.unsw.edu.au/research/adult-cancer-program/services-resources/specalign/; (4): http://code.google.com/p/automics/ refer to [26]; (5): http://gomezlab.bme.unc.edu/tools/; (6): http://www.models.life.ku.dk/icoshift/; (7): http://code.google.com/p/speaq/
Figure 3A general framework of Nuclear Magnetic Resonance (NMR) spectrum alignment methods. The stacked blocks with white background represent possible methodological variations.
Figure 4Examples of evaluation by using visualizations for a region in the Wine data [9,10]. (a) Spectral plot. (b) Spectra image. (c) Grey scale plot. (d) The heatmap of spectra correlation.