| Literature DB >> 28890672 |
Izabella Surowiec1, Erik Johansson2, Frida Torell1, Helena Idborg3, Iva Gunnarsson3, Elisabet Svenungsson3, Per-Johan Jakobsson3, Johan Trygg1,2.
Abstract
INTRODUCTION: Availability of large cohorts of samples with related metadata provides scientists with extensive material for studies. At the same time, recent development of modern high-throughput 'omics' technologies, including metabolomics, has resulted in the potential for analysis of large sample sizes. Representative subset selection becomes critical for selection of samples from bigger cohorts and their division into analytical batches. This especially holds true when relative quantification of compound levels is used.Entities:
Keywords: Metabolomics; Multi-batch analysis; OPLS; Representative sample selection
Year: 2017 PMID: 28890672 PMCID: PMC5570768 DOI: 10.1007/s11306-017-1248-1
Source DB: PubMed Journal: Metabolomics ISSN: 1573-3882 Impact factor: 4.290
Fig. 1Overview of the experimental strategy applied in this study comprising of the following steps: (1) representative selection of samples from each of the studied sample classes (SLE subgroups) based on available clinical and personal sample descriptors with the application of multivariate characterization and DOE approach; (2) application of the same strategy for subdivision of samples in the representative analytical batches; (3) chemical analysis of samples; (4) OPLS modeling of samples in each batch respectively to the question of the study; (5) averaging of the OPLS p(corr) vectors from all batches to obtain combined metabolic profile
Fig. 2Sample selection from controls. From the PCA model (R2X[1] = 0.107, R2X[2] = 0.044), 20 samples from the full two-level factorial design corners (5 from each) + 3 center points were selected so that they spanned the entire multivariate space defined by samples and their associated clinical data. Selected samples are marked in green (design corners) and red (center points); samples that were not selected are marked in gray
Parameters of the OPLS-DA models used to discriminate between the SLE and population-based control groups
| Batch | A | p(1) (%) | N | R2X | R2 (cum) | Q2 | CV-ANOVA |
|---|---|---|---|---|---|---|---|
| Batch 1 | 1 + 1 + 0 | 9.1 | 35 | 0.24 | 0.75 | 0.42 | p = 0.002 |
| Batch 2 | 1 + 0 + 0 | 7.3 | 36 | 0.07 | 0.52 | −0.10 | p = 1 |
| Batch 3 | 1 + 1 + 0 | 6.0 | 34 | 0.18 | 0.81 | 0.42 | p = 0.003 |
P(corr) loadings from the OPLS-DA models between SLE and control groups in each batch and in averaged profile
| Compound name | HMDB | Compound class | Batch 1 | Batch 2 | Batch 3 | Average p(corr) value | Average confidence interval |
|---|---|---|---|---|---|---|---|
| 1-5-Anhydro- | HMDB03911 | Carbohydrate | 0.324 | −0.106 | −0.045 | ||
| 2-Oxoisocaproic acid | HMDB00695 | Organic acid |
|
|
|
| 0.387 |
| 3-Hydroxybutanoic acid | HMDB00357 | Organic acid | −0.308 | −0.122 | −0.104 | −0.178 | 0.614 |
| 4-Hydroxyphenylacetic acid | HMDB00020 | Organic acid | 0.212 | 0.196 | 0.078 | 0.162 | 0.340 |
| Adenosine-5-monophosphate | HMDB00045 | Nucleotide | −0.318 | −0.395 | −0.039 | −0.251 | 0.497 |
| Alanine | HMDB00161 | Amino acid | 0.150 | 0.267 | −0.335 | ||
| Allothreonine | HMDB60878 | Amino acid | 0.126 |
| −0.492 | ||
| α-Aminobutyric acid | HMDB00452 | Organic acid | −0.027 | −0.080 |
| −0.152 | 0.412 |
| α-Ketoglutaric acid | HMDB00208 | Organic acid | 0.297 | −0.104 | 0.169 | ||
| α-Linolenic acid (ALA) | HMDB02181 | Fatty acid | −0.647 | −0.268 | −0.159 | −0.358 | 0.684 |
| α-Tocopherol | HMDB01893 | Sterol | 0.048 | −0.088 | 0.046 | ||
| Arachidonic acid | HMDB01043 | Fatty acid | −0.411 |
| −0.176 | −0.310 | 0.526 |
| Arginine | HMDB00517 | Amino acid |
| 0.320 | 0.147 | 0.283 | 0.352 |
| Asparagine | HMDB00168 | Amino acid | 0.090 | 0.098 | −0.293 | ||
| β-Sitosterol | HMDB00852 | Sterol | 0.100 | 0.050 | 0.207 | 0.119 | 0.383 |
| Caffeine | HMDB01847 | Nucleotide | 0.105 | −0.182 | 0.138 | ||
| Campesterol | HMDB02869 | Sterol | 0.247 | 0.050 | 0.091 | 0.129 | 0.418 |
| Cholesterol | HMDB00067 | Sterol | 0.004 | 0.090 | 0.004 | 0.033 | 0.465 |
| Citric acid | HMDB00094 | Organic acid | −0.189 | −0.183 | −0.315 | −0.229 | 0.498 |
| Creatinine | HMDB00562 | Amino ketone | 0.384 | 0.241 | 0.201 | 0.275 | 0.388 |
| Cystathionine | HMDB00099 | Amino acid | 0.216 | 0.161 | 0.087 | 0.155 | 0.370 |
| Cysteine | HMDB00574 | Amino acid | −0.275 | −0.050 | −0.124 | −0.150 | 0.548 |
| Cystine | HMDB00192 | Amino acid | 0.313 | 0.409 | 0.156 | 0.293 | 0.379 |
| Docosahexaenoic acid (DHA) | HMDB03581 | Fatty acid | −0.292 |
| −0.201 | −0.344 | 0.565 |
| Elaidic acid | HMDB00573 | Fatty acid | −0.682 | −0.301 | −0.178 | −0.387 | 0.626 |
| Erythronic acid | HMDB00613 | Carbohydrate | −0.043 | −0.267 | 0.258 | ||
| Galactitol | HMDB00107 | Carbohydrate |
| 0.317 | 0.103 | 0.269 | 0.430 |
| γ-Tocopherol | HMDB01492 | Sterol | −0.152 | 0.332 | 0.180 | ||
| Gluconic acid | HMDB00625 | Organic acid | −0.029 | −0.034 | −0.514 | −0.193 | 0.363 |
| Glucosamine | HMDB01514 | Carbohydrate | −0.010 | 0.051 | −0.280 | ||
| Glucose | HMDB00122 | Carbohydrate |
| −0.108 | −0.307 | −0.272 | 0.335 |
| Glutamic acid | HMDB00148 | Amino acid | 0.096 | 0.056 |
| 0.144 | 0.259 |
| Glutamine | HMDB00641 | Amino acid | 0.176 | 0.338 | 0.108 | 0.208 | 0.282 |
| Glyceric acid | HMDB00139 | Organic acid | 0.388 | 0.023 | 0.210 | 0.207 | 0.445 |
| Glycerol | HMDB00131 | Polyol | −0.122 | −0.218 | 0.246 | ||
| Glycerol-3-phosphate | HMDB35909 | Organic acid | 0.154 | −0.041 | 0.248 | ||
| Glycine | HMDB00123 | Amino acid |
| 0.256 | 0.097 | 0.206 | 0.340 |
| Hexadecanoic acid | HMDB00220 | Fatty acid | −0.640 | −0.385 | −0.278 | −0.434 | 0.620 |
| Hippuric acid | HMDB00714 | Amino acid |
| −0.194 | 0.332 | ||
| Histidine | HMDB00177 | Amino acid | −0.045 | 0.185 | −0.633 | ||
| Inosine | HMDB00195 | Nucleoside | −0.104 |
| −0.203 | −0.225 | 0.535 |
| Lactic acid | HMDB00190 | Organic acid | 0.159 | 0.037 | 0.034 | 0.077 | 0.361 |
| Lactose | HMDB00186 | Carbohydrate | −0.418 | 0.240 | 0.235 | ||
| Lauric acid | HMDB00638 | Fatty acid |
| −0.236 | −0.133 | −0.307 | 0.391 |
| Linoleic acid | HMDB00673 | Fatty acid | −0.509 | −0.233 | −0.330 | −0.357 | 0.698 |
| Lysine | HMDB00182 | Amino acid | 0.162 | 0.347 | −0.371 | ||
| Malic acid | HMDB00156 | Organic acid | 0.114 | −0.216 | −0.037 | ||
| Maltose | HMDB00163 | Carbohydrate | −0.632 | −0.307 | −0.013 | −0.317 | 0.637 |
| Methionine | HMDB00696 | Amino acid | 0.170 | 0.174 |
| ||
| Methyl linoleate | HMDB34381 | Fatty acid methyl ester | 0.391 | 0.162 |
| 0.281 | 0.313 |
| Nonanoic acid | HMDB00847 | Fatty acid | −0.039 | −0.066 | 0.057 | ||
|
| HMDB00224 | Organic phosphoric acid | −0.391 | −0.341 | −0.060 | −0.264 | 0.475 |
| Ornithine | HMDB00214 | Amino acid |
|
| 0.093 |
| 0.323 |
| Oxalic acid | HMDB02329 | Organic acid | −0.218 | 0.180 | 0.033 | ||
| Palmitoleic acid | HMDB03229 | Fatty acid | 0.289 | 0.205 | −0.037 | ||
| Phenylalanine | HMDB00159 | Amino acid | 0.062 | −0.021 | −0.212 | ||
| Phosphoric acid | HMDB02142 | Inorganic acid | −0.105 | −0.032 | 0.209 | ||
| Proline | HMDB00162 | Amino acid | 0.102 |
| −0.065 | ||
| Pyroglutamic acid | HMDB00267 | Amino acid | −0.053 | 0.215 | 0.181 | ||
| Scyllo-inositol | HMDB06088 | Polyol | 0.275 | 0.312 | 0.085 | 0.224 | 0.410 |
| Serine | HMDB00187 | Amino acid | 0.192 | 0.069 | 0.254 | 0.172 | 0.240 |
| Squalene | HMDB00256 | Carbohydrate | −0.114 | 0.336 | 0.015 | ||
| Stearic acid | HMDB00827 | Fatty acid | −0.592 | −0.495 | −0.073 | −0.387 | 0.497 |
| Sucrose | HMDB00258 | Carbohydrate | 0.165 | 0.355 | 0.040 | 0.187 | 0.407 |
| Taurine | HMDB00251 | Amino acid | −0.340 | −0.577 | 0.103 | ||
| Threonic acid | HMDB00943 | Organic acid | −0.003 | −0.050 | 0.262 | ||
| Threonine | HMDB00167 | Amino acid | 0.251 | −0.130 |
| ||
| Tryptophan | HMDB00929 | Amino acid | −0.197 |
|
|
| 0.258 |
| Tyrosine | HMDB00158 | Amino acid | 0.097 | 0.242 | −0.118 | ||
| Uric acid | HMDB00289 | Purine | 0.309 | 0.250 | 0.186 | 0.248 | 0.508 |
| Valine | HMDB00883 | Amino acid | −0.168 | 0.022 | −0.436 | ||
| Xylitol | HMDB02917 | Polyol |
| 0.416 | 0.054 | 0.300 | 0.385 |
| Xylose | HMDB00098 | Carbohydrate | 0.180 |
| 0.238 | 0.298 | 0.304 |
Underlined metabolites significant according to jackknifing. Positive values represent metabolite increased in SLE patients compared to the population-based controls
Fig. 3SUS plot analysis of p(corr) vectors from the first and second batch (a), the second and third batch (b), and the third and first batch (c). Metabolites with the same change direction from all batches studied are indicated in black; the dashed line is the regression line; R 2 regression coefficient
Fig. 4Combined metabolic profile of SLE versus controls. The p(corr) value presented is the average p(corr) value of the three batches for the metabolites that showed the same change direction relative to SLE in all batches studied