| Literature DB >> 25598764 |
Warwick B Dunn1, Wanchang Lin2, David Broadhurst3, Paul Begley2, Marie Brown2, Eva Zelena4, Andrew A Vaughan4, Antony Halsall4, Nadine Harding4, Joshua D Knowles5, Sue Francis-McIntyre4, Andy Tseng4, David I Ellis4, Steve O'Hagan4, Gill Aarons6, Boben Benjamin7, Stephen Chew-Graham7, Carly Moseley8, Paula Potter8, Catherine L Winder9, Catherine Potts10, Paula Thornton8, Catriona McWhirter10, Mohammed Zubair7, Martin Pan11, Alistair Burns7, J Kennedy Cruickshank12, Gordon C Jayson8, Nitin Purandare13, Frederick C W Wu10, Joe D Finn10, John N Haselden14, Andrew W Nicholls14, Ian D Wilson15, Royston Goodacre9, Douglas B Kell9.
Abstract
Phenotyping of 1,200 'healthy' adults from the UK has been performed through the investigation of diverse classes of hydrophilic and lipophilic metabolites present in serum by applying a series of chromatography-mass spectrometry platforms. These data were made robust to instrumental drift by numerical correction; this was prerequisite to allow detection of subtle metabolic differences. The variation in observed metabolite relative concentrations between the 1,200 subjects ranged from less than 5 % to more than 200 %. Variations in metabolites could be related to differences in gender, age, BMI, blood pressure, and smoking. Investigations suggest that a sample size of 600 subjects is both necessary and sufficient for robust analysis of these data. Overall, this is a large scale and non-targeted chromatographic MS-based metabolomics study, using samples from over 1,000 individuals, to provide a comprehensive measurement of their serum metabolomes. This work provides an important baseline or reference dataset for understanding the 'normal' relative concentrations and variation in the human serum metabolome. These may be related to our increasing knowledge of the human metabolic network map. Information on the Husermet study is available at http://www.husermet.org/. Importantly, all of the data are made freely available at MetaboLights (http://www.ebi.ac.uk/metabolights/).Entities:
Keywords: Clinical biochemistry; Human serum; Mass spectrometry; Metabolic phenotyping; UK population
Year: 2014 PMID: 25598764 PMCID: PMC4289517 DOI: 10.1007/s11306-014-0707-1
Source DB: PubMed Journal: Metabolomics ISSN: 1573-3882 Impact factor: 4.290
Clinical characteristics of the cohort studied defining median and inter-quartile range
| Characteristic | |
|---|---|
| Gender (male:female) | 701:490a |
| Age (median, IQR) | 48.0 (40.0,60.0)b |
| BMI (median, IQR) | 25.63 (23.20,28.71)b |
| Smokers (non:ex:current) | 502:163:176c |
| SBP (median, IQR), mmHg | 125 (115,137)d |
| DBP (median, IQR), mmHg | 76 (70,83)d |
| GLUC (median, IQR), mmol L−1 | 4.71 (4.20,5.30)e |
| CHOL (median, IQR), mmol L−1 | 5.10 (4.30,5.80)f |
| TRIG (median, IQR), mmol L−1 | 1.18 (0.80,1.80)g |
| HDLC (median, IQR), mmol L−1 | 1.26 (1.00,1.50)h |
| LDLC (median, IQR), mmol L−1 | 3.2 (2.54,3.77)i |
a Data not available for 8 subjects
b Data not available for 4 subjects
c Data not available for 355 subjects
d Data not available for 179 subjects
e Data not available for 175 subjects
f Data not available for 262 subjects
g Data not available for 326 subjects
h Data not available for 347 subjects
i Data not available for 360 subjects
Fig. 1The distribution of relative standard deviations defining the inter-subject variability in metabolite relative concentrations for each analytical platform applied, following signal correction. The data are shown as distribution plots. Top plot GC–MS, middle plot UPLC–MS(−), bottom plot UPLC–MS(+)
Fig. 2Heatmap with dendrogram of correlation network for metabolites detected by GC–MS. The twenty unique metabolites with one or more of the highest correlations are depicted. The lower bar represents the colour code of coefficients from pairwise Pearson’s correlations (Color figure online)
Fig. 3Classification analysis to assess sample size effects. The accuracy rate of discrimination with 95 % confidence intervals for data acquired applying UPLC–MS(+) for the three parameters of age (age <50 vs. age >65), BMI (BMI <25 vs. BMI >30) and gender (male vs. female). A Random Forest (RF) classifier was employed and 100 bootstrap sample sets were used for the assessment of classification accuracy
Fig. 4A boxplot showing the distribution of methionine sulfoxide for males and females across different age categories. For each box, the central line is the median, the edges of the box are the upper and lower quartiles, the whiskers extend the box by a further ±1.5 × interquartile range (IQR), and outliers (>1.5 × IQR) are plotted as individual points. Data were analysed using 2-way ANOVA showing a significant difference between males and females, [F(1,901) = 20.3, p = 7.7 × 10−6]. There was no significant difference between age categories and no significant interaction between gender and age categories
Fig. 5A boxplot showing the distribution of citric acid for males and females across different age categories. For each box, the central line is the median, the edges of the box are the upper and lower quartiles, the whiskers extend the box by a further ±1.5 × interquartile range (IQR), and outliers are plotted as individual points (>1.5 × IQR). Data were analysed using 2-way ANOVA. There was a significant difference between males and females (F(1,779) = 79.8, p = 3.1 × 10−18). There was no significant difference between age categories and no significant interaction between gender and age categories
Fig. 6A boxplot showing the distribution of tyrosine and tryptophan for males and females across different age categories. For each box, the central line is the median, the edges of the box are the upper and lower quartiles, the whiskers extend the box by a further ±1.5 × interquartile range (IQR), and outliers are plotted as individual points (>1.5 × IQR). Data were analysed using 2-way ANOVA. There was a significant difference across age categories (<50 years vs. >64 years) for tryptophan [F(1,778) = 11.7, p = 0.0007] and tyrosine [F(1,788) = 39.1, p = 6.8 × 10−10]. There was a significant difference across gender categories for tryptophan [F(1,788) = 55.4, p = 2.6 × 10−13]. There was no significant interaction between gender and age categories for tryptophan or tyrosine
Fig. 7Heatmap with dendrogram of Pearson’s correlation analysis between metabolites detected by GC–MS and clinical chemistry data. The arrangement of the clusters are produced by hierarchical clustering on both metabolites and clinical chemistry data. The lower bar represents the colour code of coefficients from pairwise Pearson’s correlations between GC–MS data and the clinical chemistry data (Color figure online)