| Literature DB >> 31551469 |
Rubina Tabassum1, Joel T Rämö1, Pietari Ripatti1, Jukka T Koskela1, Mitja Kurki1,2,3, Juha Karjalainen1,4,5, Priit Palta1,6, Shabbeer Hassan1, Javier Nunez-Fontarnau1, Tuomo T J Kiiskinen1,7, Sanni Söderlund8, Niina Matikainen8,9, Mathias J Gerl10, Michal A Surma10,11, Christian Klose10, Nathan O Stitziel12,13,14, Hannele Laivuori1,15,16, Aki S Havulinna1,7, Susan K Service17, Veikko Salomaa7, Matti Pirinen1,18,19, Matti Jauhiainen7,20, Mark J Daly1,4, Nelson B Freimer17, Aarno Palotie1,4,21, Marja-Riitta Taskinen8, Kai Simons10,22, Samuli Ripatti23,24,25.
Abstract
Understanding genetic architecture of plasma lipidome could provide better insights into lipid metabolism and its link to cardiovascular diseases (CVDs). Here, we perform genome-wide association analyses of 141 lipid species (n = 2,181 individuals), followed by phenome-wide scans with 25 CVD related phenotypes (n = 511,700 individuals). We identify 35 lipid-species-associated loci (P <5 ×10-8), 10 of which associate with CVD risk including five new loci-COL5A1, GLTPD2, SPTLC3, MBOAT7 and GALNT16 (false discovery rate<0.05). We identify loci for lipid species that are shown to predict CVD e.g., SPTLC3 for CER(d18:1/24:1). We show that lipoprotein lipase (LPL) may more efficiently hydrolyze medium length triacylglycerides (TAGs) than others. Polyunsaturated lipids have highest heritability and genetic correlations, suggesting considerable genetic regulation at fatty acids levels. We find low genetic correlations between traditional lipids and lipid species. Our results show that lipidomic profiles capture information beyond traditional lipids and identify genetic variants modifying lipid levels and risk of CVD.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31551469 PMCID: PMC6760179 DOI: 10.1038/s41467-019-11954-8
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Study design and work flow. The figure illustrates the study design and key findings of the study
Fig. 2Heritability of lipidomic profiles and genetic correlations among the lipid species. a Histogram and kernel density curve showing the distribution of heritability estimates across all the lipid species. b Boxplot showing the heritability estimates in each lipid class. c Boxplot showing comparison of the median heritability estimates of lipid species containing C20:4, C20:5 and C22:6 acyl chains and all others. The P-values were calculated using the Wilcoxon rank-sum test. d Hierarchical clustering of lipid species based on genetic correlations among lipid species. Lipids containing polyunsaturated fatty acids C20:5, C20:4 and C22:6 are highlighted with black bars. The data presented in the boxplots represent the interquartile range (IQR) defined by the bounds of the box with the median (middle line of the box) and whiskers extending to the largest/smallest values no further than 1.5 times the IQR. CER ceramide, DAG diacylglyceride, LPC lysophosphatidylcholine, LPE lysophosphatidylethanolamine, PC phosphatidylcholine, PCO phosphatidylcholine-ether, PE phosphatidylethanolamine, PEO phosphatidylethanolamine-ether, PI phosphatidylinositol, CE cholesteryl ester, SM sphingomyelin, ST sterol, TAG triacyglycerol, Trad traditional lipids
Fig. 3Lipidomic profiles capture information beyond traditional lipids. The genetic and phenotypic correlations between traditional lipids and molecular lipid species are shown in lower panel. The bar plot in the upper panel shows the heritability estimates of each lipid species (red bars) and the variance explained by all the known loci together (green bars). The lipid species are ordered based on the hierarchical clustering showing the correlations between the lipid species and traditional lipids. TC total cholesterol, TG triglycerides
Genomic loci associated with molecular lipid species at genome-wide significance
| SNP | Position | Gene | Change | Ref | Alt | AF | Lipid species | Effect | SE | |
|---|---|---|---|---|---|---|---|---|---|---|
| rs201385366 | 1:897866 | Intronic | C | T | 0.019 | LPE(22:6;0) | −0.87 | 0.16 | 3.6 × 10−8 | |
| rs187163948 | 1:14399146 | Intronic | G | A | 0.011 | TAG(53:3;0) | 0.95 | 0.17 | 3.5 × 10−8 | |
| rs76866386♯ | 2:44075483 | Intronic | T | C | 0.077 | CE(20:2;0) | −0.39 | 0.06 | 3.9 × 10−10 | |
| rs58029241 | 2:98701245 | Intergenic | T | A | 0.062 | TAG(50:1;0) | 0.37 | 0.07 | 1.9 × 10−8 | |
| rs13070110 | 3:21393248 | Intergenic | T | C | 0.085 | Total CER | 0.33 | 0.06 | 3.9 × 10−9 | |
| rs10212439 | 3:142655053 | Intergenic | T | C | 0.602 | PI(18:0;0–18:1;0) | 0.18 | 0.03 | 3.1 × 10−8 | |
| rs13151374 | 4:8122221 | Intronic | G | A | 0.153 | TAG(50:1;0) | 0.25 | 0.04 | 3.7 × 10−8 | |
| rs186689484 | 4:97033701 | Intergenic | G | A | 0.051 | TAG(52:4;0) | −0.40 | 0.07 | 4.2 × 10−8 | |
| rs543895501 | 6:74120350 | Intronic | C | T | 0.013 | Total LPC | 0.87 | 0.16 | 2.9 × 10−8 | |
| rs4896307 | 6:138297840 | Intergenic | C | T | 0.216 | PCO(16:1;0–16:0;0) | −0.23 | 0.04 | 3.3 × 10−8 | |
| rs534693155 | 7:101081274 | Intronic | A | G | 0.010 | LPC(16:1;0) | 1.24 | 0.23 | 3.9 × 10−8 | |
| rs10281741 | 7:157793122 | Intronic | G | C | 0.225 | TAG(54:6;0) | 0.21 | 0.04 | 2.2 × 10−8 | |
| rs1478898 | 8:11395079 | Intronic | G | A | 0.440 | PC(16:0;0–16:0;0) | 0.17 | 0.03 | 2.5 × 10−8 | |
| rs11570891 | 8:19822810 | Intronic | C | T | 0.075 | TAG(52:3;0) | −0.33 | 0.06 | 2.9 × 10−8 | |
| rs146717710 | 9:137549865 | Intronic | C | T | 0.011 | PC(16:0;0–16:1;0) | −1.03 | 0.19 | 2.8 × 10−8 | |
| rs140645847 | 10:118863255 | Intronic | G | T | 0.101 | LPE(20:4;0) | −0.32 | 0.06 | 3.3 × 10−8 | |
| rs28456♯ | 11:61589481 | Intronic | A | G | 0.405 | CE(20:4;0) | −0.59 | 0.03 | 1.1 × 10−77 | |
| rs964184 | 11:116648917 | Intergenic | G | C | 0.855 | TAG(52:3;0) | −0.258 | 0.045 | 9.5 × 10−9 | |
| rs10790495 | 11:122198706 | Intronic | A | G | 0.590 | TAG(56:4;0) | −0.20 | 0.04 | 2.1 × 10−8 | |
| rs117388573♯ | 12:78980665 | Intergenic | A | G | 0.020 | LPC(14:0;0) | −0.77 | 0.13 | 9.8 × 10−10 | |
| rs512948 | 13:52374489 | Intronic | T | C | 0.225 | LPE(18:2;0) | −0.22 | 0.04 | 1.4 × 10−8 | |
| rs8008070♯ | 14:64233720 | Intronic | A | T | 0.133 | SM(32:1;2) | 0.48 | 0.05 | 2.9 × 10−26 | |
| rs3902951 | 14:69789755 | Intronic | T | G | 0.361 | PEO(18:1;0–18:2;0) | 0.19 | 0.03 | 1.9 × 10−8 | |
| rs35861938 | 15:45637343 | Intergenic | T | C | 0.398 | PCO(18:2;0–18:1;0) | 0.18 | 0.03 | 2.7 × 10−8 | |
| rs261290♯ | 15:58678720 | Intronic | T | C | 0.617 | PE(18:0;0–20:4;0) | −0.37 | 0.03 | 4.0 × 10−31 | |
| rs35221977♯ | 16:79563576 | Intronic | G | C | 0.054 | LPC(16:0;0) | −0.46 | 0.08 | 1.3 × 10−9 | |
| rs79202680♯ | 17:4692640 | Intronic | G | T | 0.032 | SM(34:0;2) | −0.85 | 0.09 | 3.4 × 10−22 | |
| rs143203352 | 17:77293933 | Intronic | T | C | 0.024 | PC(16:0;0–18:1;0) | 0.60 | 0.11 | 3.2 × 10−8 | |
| rs151223356♯ | 18:18627427 | Intronic | A | C | 0.013 | LPC(14:0;0) | 0.97 | 0.15 | 1.9 × 10−10 | |
| rs7246617♯ | 19:8272163 | Intergenic | G | A | 0.402 | SM(38:2;2) | 0.25 | 0.03 | 2.5 × 10−15 | |
| rs2455069 | 19:51728641 | Missense | A | G | 0.383 | TAG(52:5;0) | −0.19 | 0.03 | 9.3 × 10−9 | |
| rs8736♯ | 19:54677189 | UTR | C | T | 0.388 | PI(18:0;0–20:4;0) | −0.38 | 0.03 | 9.8 × 10−28 | |
| rs4374298 | 19:55738746 | Synonymous | G | A | 0.166 | PEO(16:1;0–20:4;0) | −0.25 | 0.04 | 2.3 × 10−8 | |
| rs364585♯ | 20:12962718 | Intergenic | A | G | 0.670 | Total CER | −0.20 | 0.03 | 9.1 × 10−10 | |
| rs186680008 | 22:39754367 | Intronic | A | C | 0.015 | CE(20:3;0) | −0.81 | 0.15 | 2.6 × 10−8 |
Ref reference allele, Alt alternate allele, AF alternate allele frequency, SE standard error, UTR untranslated region
The strongest association between SNP and lipid species in the genome-wide significant loci (P < 5.0 × 10−8) are presented. The P-values were calculated from the meta-analyses using the inverse variance weighted method for fixed effects. The study-wide significant associations are marked by hash symbol. The SNPs are annotated to the nearest gene if identified in this study (marked by asterisk symbol) or to previously known gene if in linkage disequilibrium with the known loci for any lipid measure. The effect sizes presented are change in standard deviation of the lipid species per alternate allele. Chromosomal positions are based on hg19 reference sequence
Fig. 4Genetic architecture of the lipidome. a Manhattan plot showing associations for all 141 lipid species. Only the associations with P < 1.0 × 10−4 in the meta-analysis and consistent in directions in all three batches are plotted. The y-axis is capped at −log10 P-value = 30 for better representation of the data. The dotted line represents the threshold for genome-wide significant associations at P < 5.0 × 10−8. b Genome-wide significant associations between the identified lipid species-associated loci and lipid species showing effect of the loci on the lipidome. The plotted P-values were calculated from the meta-analyses using the inverse variance weighted method for fixed effects. New hits with P < 5.0 × 10−8 are shown as red dots, new independent hits in previously reported loci are presented as blue dots and hits in previously known loci are presented as black dots
Fig. 5Relationship between lipid species-associated variants and risk of CVDs. The upper panel shows the association of the identified variants with the strongest associated lipid species. Boxplots show the interquartile range (IQR) defined by the bounds of the box with the median (middle line of the box) of plasma levels of the respective lipid species for each genotype of the variants; whiskers extend to the largest/smallest values no further than 1.5 times the IQR. The lower panel depicts the relationship between the identified variants with CVD phenotypes. The effect sizes (odds ratio) with 95% confidence interval are plotted with respect to the alternate alleles. The associations with CVD phenotypes highlighted in red colour are significant at FDR <0.05
Fig. 6Patterns in associations and proposed mechanisms for the effect of identified variants on lipid metabolism and clinical outcomes. a Associations of LPL rs11570891-T and LPL activity with TAGs. Change (beta and standard errors) in plasma levels of TAGs per increase in standard deviation of LPL activity with their corresponding P-values, as calculated using linear regression model, are plotted in lower panel. The upper panel shows change (beta and standard errors) in plasma levels of TAGs per T allele with their corresponding P-values, as obtained from meta-analyses of genome-wide association analysis. b Association of LPL variant rs11570891 with LPL activity. The effect size (beta in standardised units and standard error in parenthesis) and P-value were calculated using linear mixed model. Boxplot depicts the interquartile range (IQR) defined by the bounds of the box, median (middle line) and whiskers extending to the largest/smallest values no further than 1.5 times the IQR. c Based on the patterns of the association of lipid species-associated loci with different lipid species, we propose that: (1) LPL rs11570891-T and APOA5 rs964184-C might result in more efficient hydrolysis of medium length TAGs which might results in reduced CVD risk, (2) FADS2 rs28456-G may have observed effect on PUFA metabolism through its inverse effect on FADS2 and FADS1 expressions, (3) SYNGR1 rs18680008-C might have a role in the negative regulation of either desaturation of linoleic acid (C18:2,n-6) or elongation of gamma linoleic acid (C18:3,n-6). (4) PTPRN2 rs10281741-G and MIR100HG rs10790495-G, which have very similar patterns of association with reduced level of long polyunsaturated TAGs, might have a role in negative regulation of either elongation and desaturation of fatty acids or incorporation of long chain unsaturated fatty acids in glycerol backbone during TAG biosynthesis. The positive (+) and negative (−) signs indicate increase or decrease, respectively, in level of lipid species or risk of disease as observed in our study, with different colours for different genetic variant
Fig. 7Association of known variants for traditional lipids with lipid species and traditional lipids. The P-values for the associations of the lead SNPs (557 SNPs available in our data set) identified through different genome-wide or exome-wide studies of traditional lipids (HDL-C, LDL-C, TG and TC) with lipid species (upper panel) and traditional lipids (lower panel) are plotted. The y-axis in the upper panel is capped at −log10 P-value = 30 for better representation of the data. The SNPs on the x-axis are serially arranged based on their chromosomal positions and as listed in the Supplementary Data 8. The points on the plots are colour coded by the lipid classes in the upper panel and traditional lipid in the lower panel. CER ceramide, DAG diacylglyceride, LPC lysophosphatidylcholine, LPE lysophosphatidylethanolamine, PC phosphatidylcholine, PCO phosphatidylcholine-ether, PE phosphatidylethanolamine, PEO phosphatidylethanolamine-ether, PI phosphatidylinositol, CE cholesteryl ester, SM sphingomyelin, ST sterol, TAG triacyglycerol, TC total cholesterol, TG triglycerides