| Literature DB >> 31797944 |
Anna Tkachev1,2, Vita Stepanova1,2, Lei Zhang3, Ekaterina Khrameeva1, Dmitry Zubkov1, Patrick Giavalisco4, Philipp Khaitovich5,6,7.
Abstract
Human populations, despite their overwhelming similarity, contain some distinct phenotypic, genetic, epigenetic, and gene expression features. In this study, we explore population differences at yet another level of molecular phenotype: the abundance of non-polar and polar low molecular weight compounds, lipids and metabolites in the prefrontal cortical region of the brain. We assessed the abundance of 1,670 lipids and 258 metabolites in 146 Han Chinese, 97 Western European, and 60 African American individuals of varying ages, covering most of the lifespan. The statistical analysis and logistic regression models both demonstrated extensive lipid and metabolic divergence of the Han Chinese individuals from the other two populations. This divergence was age-dependent, peaking in young adults, and involved metabolites and lipids clustering in specific metabolic pathways.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31797944 PMCID: PMC6893025 DOI: 10.1038/s41598-019-53762-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Assessment of the lipid and metabolite abundance variation. (A) Age distribution of samples. Each circle represents an individual. The circle colors correspond to populations: red – Han Chinese (HC), green – Western Europeans (WE), blue – African Americans (AA). The background color delineates two datasets: the darker shade – samples with ages less than 5 years (DS:0-4) and the lighter shade – samples with ages greater than 5 years (DS:5-71). (B,C) Lipid (B) and metabolite (C) abundance variation among individuals visualized using t-distributed stochastic neighbor embedding (t-SNE). Each circle represents an individual. The sizes of the circles are related to the individuals’ age – larger circles represent older individuals. Colors represent populations, as described above.
Figure 2Lipidome and metabolome population-specific differences in the DS:0-4 and DS:5-71 datasets. (A,B) Number of lipids (A) and metabolites (B) with significant abundance differences between one population and the other two combined, estimated by subsampling n individuals from each of the three populations 100 times. The numbers of subsampled individuals n used in analysis are marked on top of the panels. The results are shown for DS:5-71 (left, n = 25), DS:5-71 with same number n as DS:0-4 (middle, n = 13), and DS:0-4 (right, n = 13). The colors represent populations: red – Han Chinese (HC), green – Western Europeans (WE), blue – African Americans (AA). (C,D) The mean Area Under the ROC Curve (AUC) estimates for the lasso logistic regression models separating samples from one population and samples from the other two combined calculated using different values of hyperparameter C (inverse of regularization strength). The models were based on the lipid (C) and metabolite (D) abundance in DS:5-71 samples. Lines correspond to the means of AUC values estimated on different test sets. The shaded areas indicate the standard deviations of AUC values estimated on different test sets. The colors correspond to populations, as described above.
Figure 3The age-dependent dynamics of lipidome and metabolome differences between populations. (A) Sample distribution across six age groups. Each circle represents an individual. The colors represent populations: red – Han Chinese (HC), green – Western Europeans (WE), blue – African Americans (AA). The x-axis labels indicate the age groups’ boundaries in years. (B,C) Pairwise population differences estimated based on the abundance of age-unbiased population-distinguishing lipids (B) and metabolites. (C) Differences were calculated in each age group A1-A6 using correlations of population-mean abundances based on four samples subsampled from each population. Y-axis represents distance values calculated as one minus these correlation values. Circles represent the median distance values estimated by subsampling within each age group 10,000 times. Vertical lines extend to the upper and lower quartile values in each age group. The dotted blue line represents a smooth spline curve fitted to the average of the HC-WE and HC-AA distances. The dashed orange line represents a smooth spline curve fitted to the WE-AA distance. (D) Pathway enrichment analysis. Shown are the top 19 pathways (Benjamini-Hochberg corrected hypergeometric test p < 0.001) that show enrichment of genes linked to HC-specific lipids and metabolites. HC-specific lipids and metabolites defined from stability selection were used in this analysis. Circle sizes represent the number of genes linked to HC-specific lipid and metabolite compounds. Circle colors correspond to Benjamini-Hochberg corrected hypergeometric test p-values. Pathways associated with lipid metabolism are shaded in light blue.