| Literature DB >> 23216942 |
Benoit Liquet1, Kim-Anh Lê Cao, Hakim Hocini, Rodolphe Thiébaut.
Abstract
BACKGROUND: High throughput 'omics' experiments are usually designed to compare changes observed between different conditions (or interventions) and to identify biomarkers capable of characterizing each condition. We consider the complex structure of repeated measurements from different assays where different conditions are applied on the same subjects.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23216942 PMCID: PMC3627901 DOI: 10.1186/1471-2105-13-325
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Simulation study. Sample representation from multilevel sPLS-DA. Samples were projected onto a subspace spanned by the first 3 sPLS-DA components, based on the 200 genes selected on each of the 3 components.
Simulation study
| classical sPLS-DA | 58.0 | 75.0 | 87.2 | 78.2 |
| multilevel sPLS-DA | 82.8 | 95.6 | 93.1 | 92.0 |
Percentage of the number of true selected genes selected by classical sPLS-DA or multilevel sPLS-DA on each component or dimension (averaged over 100 simulation runs); 200 genes were selected on each component.
Figure 2Stimulation study. Hierarchical clustering (Euclidian distance and Ward method aggregation) of the genes selected with multilevel sPLS-DA. Samples are represented in columns and genes in rows.
Simulation study
| 25 | 0.535 | 0.369 | 0.312 | 0.500 | 0.271 | 0.024 |
| 50 | 0.530 | 0.364 | 0.311 | 0.500 | 0.265 | 0.016 |
| 75 | 0.527 | 0.360 | 0.306 | 0.500 | 0.261 | 0.013 |
| 100 | 0.524 | 0.354 | 0.300 | 0.500 | 0.258 | 0.011 |
| 125 | 0.522 | 0.351 | 0.296 | 0.500 | 0.257 | 0.009 |
| 150 | 0.520 | 0.343 | 0.285 | 0.500 | 0.250 | 0.008 |
| 175 | 0.518 | 0.335 | 0.281 | 0.500 | 0.243 | 0.009 |
| 200 | 0.516 | 0.327 | 0.500 | 0.234 | ||
| 225 | 0.514 | 0.323 | 0.269 | 0.500 | 0.227 | 0.009 |
| 250 | 0.512 | 0.316 | 0.267 | 0.500 | 0.220 | 0.008 |
| 275 | 0.510 | 0.314 | 0.266 | 0.500 | 0.207 | 0.007 |
| 300 | 0.510 | 0.306 | 0.262 | 0.500 | 0.196 | 0.007 |
| 325 | 0.509 | 0.299 | 0.260 | 0.500 | 0.182 | 0.007 |
Classification error rate estimation using leave-one-out cross-validation for classical sPLS-DA and multilevel sPLS-DA, with respect to the number of genes selected on each component (averaged over 100 simulation runs).
Figure 3Multilevel sPLS-DA analysis on the transcriptomics data with one factor (W14).(a) Unsupervised clustering analysis with Euclidian distance and Ward method of the 290 genes selected by sPLS-DA. Samples are represented in columns and genes in rows. (b) and (c) sPLS-DA sample representation for dimensions 1-2 (b) or 1-3 (c).
Figure 4Multilevel sPLS-DA analysis on the transcriptomics data with two factors stimulation and time.(a) Unsupervised clustering analysis with Euclidian distance and Ward method of the 220 genes selected by sPLS-DA. sPLS-DA sample representations for dimensions 1-2 (b) or 1-3 (c).
Figure 5Integrative analysis of gene expression and cytokine secretion for W14. Clustered Image Maps (CIM) obtained from multilevel sPLS. Selected genes are represented in columns and cytokines in rows.