| Literature DB >> 29036400 |
Carl Brunius1,2, Anders Pedersen3, Daniel Malmodin3, B Göran Karlsson3, Lars I Andersson4, Gunnel Tybring5, Rikard Landberg1,2,6.
Abstract
MOTIVATION: Biobanks are important infrastructures for life science research. Optimal sample handling regarding e.g. collection and processing of biological samples is highly complex, with many variables that could alter sample integrity and even more complex when considering multiple study centers or using legacy samples with limited documentation on sample management. Novel means to understand and take into account such variability would enable high-quality research on archived samples.Entities:
Mesh:
Year: 2017 PMID: 29036400 PMCID: PMC5870544 DOI: 10.1093/bioinformatics/btx442
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Drift modeling of four clusters with different drift patterns from 22 °C data (Table 1). The two upper graphs represent clusters with small to minimal drift during pre-centrifugation delay time with either significant (left) or non-significant (right) improvement of feature CV after correction. The two lower graphs represent clusters with either decreased (left) or increased (right) feature intensity with increased pre-centrifugation time and significant CV improvement after correction. For each cluster, the upper graph shows the cluster-averaged scaled feature intensities in grey and the cluster drift function in black. The lower half shows the same features in the same y-scale after application of cluster-based drift correction
Cluster-based modeling and data correction for kinetic drift of NMR plasma metabolomics data with pre-centrifugation delay time at 22 °C
| Feature CV | |||||
|---|---|---|---|---|---|
| Original | Metadata | Prediction | Dominant identified features | ||
| Cluster1 | 34 | 0.06 | 0.05*** | 0.05*** | aas |
| Cluster2 | 47 | 0.058 | 0.057*** | 0.058*** | 3OH-butyrate, citrate, aas |
| Cluster3 | 28 | 0.26 | 0.26 | 0.26 | Mostly unknown, weak signals |
| Cluster4 | 20 | 0.18 | 0.16*** | 0.16*** | Alcohols, lipids/ffas |
| Cluster5 | 43 | 0.10 | 0.086*** | 0.087*** | aas |
| Cluster6 | 56 | 0.056 | 0.055*** | 0.055*** | aas |
| Cluster7 | 20 | 0.11 | 0.077*** | 0.08*** | aas, 3OH-butyrate |
| Cluster8 | 60 | 0.11 | 0.11 | 0.11 | aas, lipids/ffas |
| Cluster9 | 21 | 0.33 | 0.15*** | 0.14*** | Glucose |
| Cluster10 | 34 | 0.13 | 0.12*** | 0.12*** | Lipids/ffas, aas |
| Cluster11 | 3 | 0.37 | 0.33 | 0.33 | Acetate |
| Cluster12 | 25 | 0.11 | 0.11*** | 0.11*** | Lipids/ffas, 3OH-butyrate, aas |
| Cluster13 | 2 | 0.35 | 0.35 | 0.35 | Unknown weak signals |
| Cluster14 | 14 | 0.38 | 0.19*** | 0.17*** | Glucose |
| Cluster15 | 12 | 0.25 | 0.12*** | 0.12*** | Glucose |
| Cluster16 | 2 | 0.52 | 0.49 | 0.46 | Glucose (suppressed) |
| Cluster17 | 7 | 0.28 | 0.1*** | 0.13*** | Pyruvate |
| Cluster18 | 11 | 0.15 | 0.079*** | 0.086*** | aas |
| Cluster19 | 6 | 0.62 | 0.11*** | 0.21*** | Lactate |
| Cluster20 | 7 | 0.20 | 0.14*** | 0.14*** | Lipids/ffas |
| Modeled | 452 | 0.14 | 0.10*** | 0.11*** | |
| Irreproducible | 26 | Mostly unknown, weak signals | |||
| Total | 478 | ||||
Note: Correction for kinetic drift was performed using recorded bench-time metadata or prediction estimates (to simulate legacy samples) as time input. aas, amino acids; ffas, free fatty acids. *p<0.05, **p<0.01, ***p<0.001.
CV of feature intensities within cluster over 1–36 h.
After drift correction based on actual pre-centrifugation time from metadata.
After drift correction based on predicted pre-centrifugation time.
Fig. 2.Cross-validated prediction of pre-centrifugation temperature (left) and pre-centrifugation time at 22 °C (center) or 4 °C (right). The pre-centrifugation temperature modeling correctly predicted 96% of samples as either stored at 4 °C (black) or 22 °C (grey). In pre-centrifugation time modeling, predicted times (y-axes) were strongly associated with actual times (x-axes). Predictions were better for pre-centrifugation time modeling at 22 °C, which reflected the larger effects on the metabolome at higher temperatures. All models were highly significant (P from permutation analysis; Supplementary Fig. S2)
Fig. 3.Predicted pre-centrifugation time at 22 °C for external validation samples (n = 111). Samples were prepared on either the same day as sampling, the next day, or the day after that. Predicted pre-centrifugation times were significantly different between levels (P < 2.2e-16). All pair-wise comparisons were significant after Tukey adjustment (P < 0.001)
Cluster-based modeling and data correction for kinetic drift of NMR plasma metabolomics data with pre-centrifugation delay time at 4 °C
| Feature CV | |||||
|---|---|---|---|---|---|
| Original | Metadata | Prediction | Dominant identified features | ||
| Cluster1 | 42 | 0.12 | 0.12*** | 0.12** | 3OH-butyrate, aas |
| Cluster2 | 32 | 0.17 | 0.17*** | 0.17*** | aas, alcohols |
| Cluster3 | 22 | 0.30 | 0.30 | 0.30 | Pyruvate, acetate, histidine |
| Cluster4 | 38 | 0.19 | 0.19*** | 0.19*** | Sugars |
| Cluster5 | 14 | 0.099 | 0.097*** | 0.098* | aas |
| Cluster6 | 49 | 0.094 | 0.093*** | 0.094 | aas, 3OH-butyrate |
| Cluster7 | 36 | 0.18 | 0.18*** | 0.18 | Lipids/ffas, 3OH-butyrate, aas, citrate |
| Cluster8 | 18 | 0.35 | 0.33*** | 0.34** | Lipids/ffas |
| Cluster9 | 13 | 0.28 | 0.27*** | 0.28* | Lipids/ffas, aas |
| Cluster10 | 35 | 0.12 | 0.097*** | 0.10*** | Glucose |
| Cluster11 | 21 | 0.10 | 0.10*** | 0.10*** | aas |
| Cluster12 | 8 | 0.10 | 0.09*** | 0.091*** | Glucose |
| Cluster13 | 24 | 0.085 | 0.085** | 0.085 | 3OH-butyrate, aas |
| Cluster14 | 41 | 0.075 | 0.075*** | 0.075*** | aas |
| Cluster15 | 7 | 0.24 | 0.14*** | 0.16*** | Lactate |
| Cluster16 | 44 | 0.087 | 0.085*** | 0.086*** | aas |
| Modeled | 444 | 0.14 | 0.14*** | 0.14*** | |
| Irreproducible | 34 | Mostly lipids/ffas and unknown, weak signals | |||
| Total | 478 | ||||
Note: Correction for kinetic drift was performed using recorded bench-time metadata or prediction estimates (to simulate legacy samples) as time input. aas, amino acids; ffas, free fatty acids. *p<0.05, **p<0.01, ***p<0.001.
CV of feature intensities within cluster over 1–36 h.
After drift correction based on actual pre-centrifugation time from metadata.
After drift correction based on predicted pre-centrifugation time.
Fig. 4.Effects of pre-centrifugation time on metabolite feature stability. The upper graphs show the effects of pre-centrifugation delay time at 4 °C as relative deviation of metabolite feature intensities compared with the reference level (at 1 h) for original data (left) and after data correction based on metadata information (right). Isobaric lines correspond to percentiles in the distribution of relative differences of 7648 measurements (16 samples × 478 features). The lower graphs show corresponding effects of pre-centrifugation delay time at 22 °C. The two horizontal lines correspond to 20% (lower line) and 30% (higher line) absolute deviation in feature intensity from the 1 h reference state