| Literature DB >> 18284666 |
Max Bylesjö1, Mattias Rantalainen, Jeremy K Nicholson, Elaine Holmes, Johan Trygg.
Abstract
BACKGROUND: Kernel-based classification and regression methods have been successfully applied to modelling a wide variety of biological data. The Kernel-based Orthogonal Projections to Latent Structures (K-OPLS) method offers unique properties facilitating separate modelling of predictive variation and structured noise in the feature space. While providing prediction results similar to other kernel-based methods, K-OPLS features enhanced interpretational capabilities; allowing detection of unanticipated systematic variation in the data such as instrumental drift, batch variability or unexpected biological variation.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18284666 PMCID: PMC2323673 DOI: 10.1186/1471-2105-9-106
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Pseudo-code for the K-OPLS model training algorithm. K denotes the original kernel matrix, Ki the kernel matrix deflated by i Y-orthogonal components and Qi the Ki matrix deflated by A predictive components.
| 1. | Estimate the predictive | |
| 2. | Project | |
| 3. | Calculate the predictive score matrix of | |
| 4. | Repeat for | |
| 4.1 | Estimate the | |
| 4.2. | Calculate the | |
| 4.3. | Deflate | |
| 4.4. | Update the predictive score matrix: | |
| 5. | Predictions of | |
Figure 1K-OPLS model properties of the NMR-based metabolic profiling data set. Each point represents a measured observation (biological sample).The size of each glyph in the figure is proportional to the internode number 1–8, denoting a growth gradient. In (A), the K-OPLS predictive score vector t1p is plotted against the first Y-orthogonal score vector t1o. In (B), the first K-OPLS Y-orthogonal score vector t1o is plotted against the second Y-orthogonal score vector t2o. An approximate joint internode gradient, formed by a linear combination of both vectors, is shown using the dashed arrow. In (C), the first K-OPLS Y-orthogonal score vector t1o is plotted against the second Y-orthogonal score vector t2o only for the mutant samples, colour-coded by biological replicate. Biological replicate A displays a deviating behaviour compared to biological replicates B and C; trajectory shown by the dashed line. In (D), the first KPLS latent variable t1 is plotted against the second latent variable t2. The discriminatory direction is now a linear combination of both of the latent variables.