| Literature DB >> 28868138 |
Ting-Li Han1, Yang Yang1, Hua Zhang1,2, Kai P Law1.
Abstract
Background: A challenge of metabolomics is data processing the enormous amount of information generated by sophisticated analytical techniques. The raw data of an untargeted metabolomic experiment are composited with unwanted biological and technical variations that confound the biological variations of interest. The art of data normalisation to offset these variations and/or eliminate experimental or biological biases has made significant progress recently. However, published comparative studies are often biased or have omissions.Entities:
Keywords: Biomarker discovery; Gas chromatography-mass spectrometry; Gestational diabetes; Metabolomics; Normalisation method
Year: 2017 PMID: 28868138 PMCID: PMC5553085 DOI: 10.12688/f1000research.11823.1
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. Agilent MassHunter ( a) Qualitative Workflows and ( b) Profinder interface. 385 components were extracted from a typical QC sample from 14.5 to 56 min, of which 62 were confidently annotated with match factor ≥ 80. Data was then exported to a CEF file. The file was then used by Profinder for batch data extraction. The Profinder tool was designed with the use of reference spectra and retention time windows to assist data extraction.
Summary statistics for metabolite variability according to relative standard deviation (RSD) for QC and analytical samples before and after normalisation.
| RSD (%) of individual metabolites across
| ||
|---|---|---|
| QC | Analytical | |
|
| 19.34 (45.00, 12.15) | 30.11 (64.01, 17.22) |
|
| 11.75 (41.18, 1.14) | 30.89 (97.42, 1.95) |
|
| 9.771 (36.33, 2.19) | 22.11 (62.06, 6.70) |
|
| 8.916 (31.06, 1.22) | 20.80 (58.85, 9.96) |
|
| 8.196 (30.27, 1.18) | 21.16 (62.66, 2.45) |
|
| 5.733 (22.05, 1.88) | 18.18 (62.60, 2.49) |
Figure 2. Multilevel principal component analysis score plots produced by the data processed with the ( a) Eigen, ( b) PQN, and ( c) LOWESS normalisation.
Figure 3. Multilevel principal component analysis PC1 loading plots (top 10 variables) corresponding to Figure 2.
( a) Eigen, ( b) PQN, and ( c) LOWESS normalisation.
Figure 4. Heat map of are under the curve (ROC) values of 62 putative metabolites.