| Literature DB >> 35668104 |
Gemma Cadby1, Corey Giles2,3, Phillip E Melton1,4, Kevin Huynh2,3, Natalie A Mellett2, Thy Duong2, Anh Nguyen2, Michelle Cinel2, Alex Smith2, Gavriel Olshansky2,3, Tingting Wang2,3, Marta Brozynska2, Mike Inouye2, Nina S McCarthy5, Amir Ariff6, Joseph Hung7,8,9, Jennie Hui9,10, John Beilby9,10, Marie-Pierre Dubé11, Gerald F Watts7,12, Sonia Shah13, Naomi R Wray13,14, Wei Ling Florence Lim15,16, Pratishtha Chatterjee15,17,18, Ian Martins15, Simon M Laws19,20,21, Tenielle Porter19,20,21, Michael Vacher19,20,22, Ashley I Bush23, Christopher C Rowe23,24, Victor L Villemagne24,25, David Ames26,27, Colin L Masters23, Kevin Taddei15, Matthias Arnold28,29, Gabi Kastenmüller29, Kwangsik Nho30,31,32, Andrew J Saykin30,32,33, Xianlin Han34, Rima Kaddurah-Daouk28,35,36, Ralph N Martins15,16,17,18, John Blangero37, Peter J Meikle38,39,40, Eric K Moses41,42.
Abstract
We integrated lipidomics and genomics to unravel the genetic architecture of lipid metabolism and identify genetic variants associated with lipid species putatively in the mechanistic pathway for coronary artery disease (CAD). We quantified 596 lipid species in serum from 4,492 individuals from the Busselton Health Study. The discovery GWAS identified 3,361 independent lipid-loci associations, involving 667 genomic regions (479 previously unreported), with validation in two independent cohorts. A meta-analysis revealed an additional 70 independent genomic regions associated with lipid species. We identified 134 lipid endophenotypes for CAD associated with 186 genomic loci. Associations between independent lipid-loci with coronary atherosclerosis were assessed in ∼456,000 individuals from the UK Biobank. Of the 53 lipid-loci that showed evidence of association (P < 1 × 10-3), 43 loci were associated with at least one lipid endophenotype. These findings illustrate the value of integrative biology to investigate the aetiology of atherosclerosis and CAD, with implications for other complex diseases.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35668104 PMCID: PMC9170690 DOI: 10.1038/s41467-022-30875-7
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 17.694
Fig. 1Study design for the genetic analysis of the human lipidome.
Representation of genome-wide association studies (GWAS) of the lipidome in the BHS discovery cohort (blue boxes), ADNI and AIBL validation cohorts (green boxes), discovery meta-analysis (orange box), and downstream analyses (grey boxes). ADNI Alzheimer’s Disease Neuroimaging Initiative, AIBL Australian Imaging, Biomarker & Lifestyle Flagship Study of Ageing, BHS Busselton Health Study, CAD coronary artery disease, Chol cholesterol, eQTL expression quantitative trait loci, GRM genetic relatedness matrix, GWAS genome-wide association study, IVW inverse-variance weighted, LD linkage disequilibrium, MAC minor allele count, meQTL methylation quantitative trait loci, mQTL metabolite quantitative trait loci, PC principal component, PRS polygenic risk score, pQTL protein quantitative trait loci, SD standard deviation, SNP single nucleotide polymorphism, Trig triglycerides.
Fig. 2Circular presentation of loci associated with circulating lipid species identified in our Discovery GWAS.
The −log10(P) for genetic association with lipid species are arranged by chromosomal position, indicated by alternating blue and green points. Association P-values are truncated at P < 1 × 10−60. Genome-wide significance (P < 5 × 10−8) is indicated by the red line. For details about significant associations, see Supplementary Data 2, 3. Genes identified in our candidate gene analysis are highlighted in blue, otherwise the closest gene is indicated in black. The purple band indicates lipid-loci that co-localise with coronary artery disease (CAD) or show association with CAD after adjusting for clinical lipids. The inner circle shows a Fuji plot of SNP-lipid associations, coloured by broad lipid category. Colour keys representing broad lipid categories are indicated in the plot centre. Chromosomes are indicated by numbered panels 1–22.
Fig. 3Comparison of estimated lipidomic effect sizes between clinical lipid adjusted and unadjusted models.
a Beta coefficients for independent unadjusted SNP-lipid associations (x-axis) are plotted against clinical lipid-adjusted SNP-lipid associations (y-axis). b Z-scores for unadjusted SNP-lipid associations (x-axis) are plotted against clinical lipid-adjusted SNP-lipid associations (y-axis). Z-scores for SNP associations reaching genome-wide significance (P < 5 × 10−8) in either the clinical lipid adjusted or unadjusted models. Variant effect signs are fixed so adjusted associations are positive. Variants showing greater (positive) associations in clinical lipid-adjusted analysis are shown in red, and variants showing reduced associations are shown in blue. Circle diameter is proportional of −log10(P) t-test of effect differences.
Fig. 4Identification of putative causal genes using genetic prioritisation and knowledge-based approaches.
Assignment of putative causal genes was performed using the ProGeM framework, incorporating genetic-based prioritisation (bottom-up), and biological knowledge-based approaches (top-down). a Venn diagram showing the number of loci with annotations for candidate genes using the distinct approaches and the overlap. Top-down annotations were divided into lipid-specific databases and generic databases. b Venn diagram of distinct genes identified in genetic-based prioritisation analysis. c Summary of putative causal genes with overlapping annotations for closest gene, protein consequences, eQTL and meQTL (left). Summary of putative causal SNP-gene pairs for which pQTL evidence was identified (right). eQTL expression quantitative trait loci, meQTL methylation quantitative trait loci, pQTL protein quantitative trait loci.
Fig. 5Genetic and phenotypic associations of the lipidome with coronary artery disease.
Forest plots of lipid-coronary artery disease; circles represent effect sizes and horizontal bars represent ±standard errors. a Phenotypic associations (logistic regression; two-sided) between lipid species and incident coronary artery disease in the BHS cohort (551 cases and 3703 controls), adjusted for age, sex, and the first 10 genomic principal components. b Association of lipid species with polygenic risk for coronary artery disease. Individuals in the discovery cohort (n = 4492) were assessed for risk using the metaGRS polygenic score, consisting of ∼1.7 million genetic variants. Linear regressions (two-sided) were performed to test the association between an individual’s polygenic score and lipid species concentrations, adjusting for age, sex, and the 10 first principal components. c Genetic correlations of lipid species (n = 4492) against coronary artery disease (meta-analysis of CARDIoGRAMplusC4D and UK Biobank; 122,733 cases and 424,528 controls), performed with Linkage Disequilibrium Score Regression (LDSC; v1.0.1). Nominally significant and Benjamini–Hochberg corrected significance is indicated by light- and dark-grey circles, respectively. The 10 most significant lipid species are highlighted in blue, red, or green.
Genomic regions showing co-localisation with lipid species and coronary artery disease.
| # | rsID | Positiona | EA/ OA | Co-localised lipid classes | Number of lipids co-localised | Strongest co-localisation | Minimum CAD | Nearby genesb |
|---|---|---|---|---|---|---|---|---|
| 1 | rs11591147 | 1:55505647 | G/T | CE, DE, Hex2Cer, Hex3Cer, PC(P), SHexCer, SM, TG(O) | 32 | CE(18:1) | 1.86 × 10−22 | PCSK9, USP24, BSND |
| 2 | rs602633 | 1:109821511 | G/T | HexCer | 2 | HexCer(d18:1/24:1) | 3.63 × 10−58 | PSRC1, CELSR2, MYBPHL |
| 3 | rs2281719 | 1:230297659 | C/T | DG, PI, TG [NL] | 5 | DG(18:0_18:1) | 6.41 × 10−07 | GALNT2, PGBD5, COG2 |
| 4 | rs10779835 | 1:230299949 | C/T | DG, TG [NL] | 4 | TG(54:2) [NL-18:0] | 6.41 × 10−07 | GALNT2, PGBD5, COG2 |
| 5 | rs515135 | 2:21286057 | C/T | CE, PC | 4 | PC(16:0_18:0) | 5.74 × 10−17 | APOB, TDRD15, LDAH |
| 6 | rs6713865 | 2:23899807 | A/G | AC | 2 | AC(16:0) | 2.86 × 10−05 | KLHL29, ATAD2B, UBXN2A |
| 7 | rs6544713 | 2:44073881 | C/T | CE | 6 | CE(20:1) | 1.84 × 10−18 | ABCG8, ABCG5, DYNC2LI1 |
| 8 | rs2736177 | 6:31586094 | C/T | TG [NL] | 2 | TG(50:2) [NL-18:2] | 4.86 × 10−09 | AIF1, PRRC2A, BAG6 |
| 9 | rs41279633 | 7:44580876 | G/T | CE | 1 | CE(18:0) | 1.72 × 10−06 | NPC1L1, DDX56, TMED4 |
| 10 | rs6982502 | 8:126479362 | C/T | SM | 1 | SM(d18:0/22:0) | 7.67 × 10−23 | TRIB1, NSMCE2, WASHC5 |
| 11 | rs2980869 | 8:126488250 | C/T | PC | 1 | PC(36:0) | 7.67 × 10−23 | TRIB1, NSMCE2, WASHC5 |
| 12 | rs35093463 | 9:107586238 | A/C | Hex3Cer | 2 | Hex3Cer(d18:1/22:0) | 4.00 × 10−07 | ABCA1, NIPSNAP3B, NIPSNAP3A |
| 13 | rs1800978 | 9:107665978 | C/G | Hex3Cer | 1 | Hex3Cer(d18:1/24:1) | 4.00 × 10−07 | ABCA1, NIPSNAP3B, NIPSNAP3A |
| 14 | 9:136141870 | 9:136141870 | C/T | CE | 1 | CE(18:0) | 2.03 × 10−14 | ABO, SURF6, OBP2B |
| 15 | rs603424 | 10:102075479 | A/G | AC, CE, DG, Hex2Cer, LPC, PC, PC(P), TG [NL] | 24 | LPC(16:1) [sn2] | 7.41 × 10−07 | PKD2L1, BLOC1S2, SCD |
| 16 | rs7350481 | 11:116586283 | C/T | CE, DG | 2 | DG(18:1_18:2) | 5.64 × 10−07 | BUD13, ZPR1, APOA5 |
| 17 | rs6589563 | 11:116590787 | A/G | CE, DG, TG [NL] | 4 | DG(18:0_18:1) | 5.64 × 10−07 | BUD13, ZPR1, APOA5 |
| 18 | rs1558861 | 11:116607437 | C/T | CE, DG, PI, TG [NL] | 25 | TG(54:4) [NL-18:2] | 5.64 × 10−07 | BUD13, ZPR1, APOA5 |
| 19 | rs964184 | 11:116648917 | C/G | CE, DE, DG, LPI, PC, PE, PG, PI, TG [NL] | 64 | TG(54:2) [NL-18:0] | 7.03 × 10−13 | ZPR1, BUD13, APOA5 |
| 20 | rs651821 | 11:116662579 | C/T | CE, PE | 3 | CE(22:0) | 7.03 × 10−13 | APOA5, ZPR1, BUD13 |
| 21 | rs1169288 | 12:121416650 | A/C | Cer(d), PC, SM | 6 | PC(36:0) | 1.26 × 10−18 | HNF1A, C12orf43, OASL |
| 22 | rs2244608 | 12:121416988 | A/G | SM | 1 | SM(d18:0/22:0) | 1.26 × 10−18 | HNF1A, C12orf43, OASL |
| 23 | rs2043085 | 15:58680954 | C/T | PE | 1 | PE(18:0_18:1) | 7.24 × 10−06 | ALDH1A2, LIPC, AQP9 |
| 24 | rs1532085 | 15:58683366 | A/G | PE, PG | 16 | PE(18:1_18:2) | 7.24 × 10−06 | ALDH1A2, LIPC, ADAM10 |
| 25 | rs1077835 | 15:58723426 | A/G | PE | 7 | PE(15-MHDA_22:6) | 7.24 × 10−06 | ALDH1A2, LIPC, ADAM10 |
| 26 | rs1800588 | 15:58723675 | C/T | DG, LPE, PE, PE(O), PG, TG(O) | 19 | LPE(20:4) [sn1] | 7.24 × 10−06 | ALDH1A2, LIPC, ADAM10 |
| 27 | rs2070895 | 15:58723939 | A/G | CE, PE, PG, PS | 16 | PG(34:2) | 7.24 × 10−06 | ALDH1A2, LIPC, ADAM10 |
| 28 | rs588136 | 15:58730498 | C/T | DG, PC, PC(P), PS, TG(O) | 10 | Total PC | 7.24 × 10−06 | ALDH1A2, LIPC, ADAM10 |
| 29 | rs261342 | 15:58731153 | C/G | LPE, TG [NL] | 3 | LPE(20:4) [sn1] | 7.24 × 10−06 | ALDH1A2, LIPC, ADAM10 |
| 30 | rs12446515 | 16:56987015 | C/T | PC, PC(O) | 3 | PC(16:0_16:0) | 1.19 × 10−09 | CETP, HERPUD1, NLRC5 |
| 31 | rs56156922 | 16:56987369 | C/T | Hex3Cer, PC, PC(O), PC(P), PE(P) | 22 | PC(P-16:0/16:1) | 1.19 × 10−09 | CETP, HERPUD1, NLRC5 |
| 32 | rs56228609 | 16:56987765 | C/T | CE, PC(O), PE(O), PI, TG(O) | 6 | CE(18:0) | 1.19 × 10−09 | CETP, HERPUD1, NLRC5 |
| 33 | rs247616 | 16:56989590 | C/T | PC | 1 | PC(16:0_18:3) (a) | 1.19 × 10−09 | CETP, HERPUD1, NLRC5 |
| 34 | rs12149545 | 16:56993161 | A/G | PC(O), PC(P), PE(O), PI, TG(O) | 11 | TG(O-50:1) [NL-16:0] | 1.19 × 10−09 | CETP, HERPUD1, NLRC5 |
| 35 | rs3764261 | 16:56993324 | A/C | PC | 1 | PC(18:2_18:2) | 1.19 × 10−09 | CETP, HERPUD1, NLRC5 |
| 36 | rs17231506 | 16:56994528 | C/T | Hex2Cer, Hex3Cer, PC, PC(O), PC(P), PE(P), TG(O) | 40 | TG(O-50:1) [NL-16:0] | 1.19 × 10−09 | CETP, HERPUD1, NLRC5 |
| 37 | rs56289821 | 19:11188247 | A/G | CE, Cer(d), COH, GM3, Hex2Cer, Hex3Cer, HexCer, PC, PC(O), PC(P), SHexCer, SM | 60 | SM(35:2) (b) | 1.93 × 10−36 | LDLR, SMARCA4, SPC24 |
| 38 | rs72999033 | 19:19366632 | C/T | Cer(d) | 1 | Cer(d16:1/24:1) | 3.18 × 10−07 | HAPLN4, NCAN, TM6SF2 |
| 39 | rs58542926 | 19:19379549 | C/T | LPC, PC | 2 | LPC(20:3) [sn1] | 3.18 × 10−07 | TM6SF2, HAPLN4, SUGP1 |
| 40 | rs10401969 | 19:19407718 | C/T | Cer(d), DG, LPC, PC, PE, TG [NL] | 38 | DG(18:1_20:4) | 3.18 × 10−07 | SUGP1, TM6SF2, MAU2 |
| 41 | rs73001065 | 19:19460541 | C/G | Cer(d), TG [NL] | 3 | Cer(d18:1/24:0) | 3.18 × 10−07 | MAU2, SUGP1, GATAD2A |
| 42 | rs150268548 | 19:19494483 | A/G | Cer(d) | 3 | Total Cer | 3.18 × 10−07 | GATAD2A, MAU2, SUGP1 |
| 43 | rs7412 | 19:45412079 | C/T | CE, Cer(d), COH, DE, DG, GM1, GM3, Hex2Cer, Hex3Cer, HexCer, LPC, LPC(O), LPC(P), LPE(P), PC, PC(O), PC(P), PE(P), SHexCer, SM, TG [NL], TG(O) | 184 | CE(16:0) | 2.14 × 10−35 | APOE, TOMM40, APOC1 |
Co-localisation analyses performed using coronary artery disease in UK Biobank and CARDIoGRAMplusC4D. Minimum CAD P-values were obtained from the meta-analysis performed in van der Harst & Verweij 2018.
CAD coronary artery disease, EA effect allele, OA other allele.
aGenomic position based on Genome Reference Consortium Human Build 37 (GRCh37).
bClosest three protein coding genes to causal variant.
Fig. 6Co-localisation of lipid-loci with coronary artery disease.
Summary of lipid classes which contain at least one lipid specie that co-localises with coronary artery disease. Colours indicate broad lipid categories—green, sphingolipids; orange, phospholipids; blue, neutral lipids/others. Indicated variants were identified as the most likely causal variant for each of the identified co-localisation analysis. Genetic variants are ordered according to the number of co-localisations across lipid classes. Evidence of co-localisation included H3 + H4 > 0.8 and H4/H3 > 10.
Fig. 7Genetic analysis of the LIPC gene region and circulating levels of phosphatidylethanolamine.
a Lipid-wide association with the genetic variant, rs2043085, in the BHS cohort (n = 4492). Symbol colour is used to distinguish lipid classes. The symbol orientation indicates the effect sign, inverted triangles indicate negative associations, while regular triangles indicate positive associations. The dashed line indicates genome-wide significance (P < 5 × 10−8). b Regional association plots for Total PE and coronary artery disease (van der Harst & Verweij 2018), focusing on the LIPC region. Variants are coloured based on LD with the lead variant, rs2043085. Linkage disequilibrium plot showing correlation between variants following clumping (r > 0.8; P < 5 × 10−8). Variant correlations were obtained from 10,000 unrelated individuals from the UK Biobank. c Plot of genetic instrument effect sizes against Total PE (n = 4492) and coronary artery disease (n = 547,261). Variants were selected based on association with Total PE from within the LIPC region. Eight approximately independent variants were left following clumping (r > 0.05; P < 5 × 10−8). Generalised Summary-data based Mendelian Randomisation (GSMR) was used to estimate effect of Total PE on coronary artery disease, accounting for the variant correlations and uncertainty in both b and b. Data are presented as mean ± SE. d Forest plot of single variant tests and GSMR estimates from panel c. Data presented as mean ± 95% confidence interval. e Diagram of mediated pleiotropy, showing effect sizes estimated across multiple datasets. Exposure modifying variant effect sizes were estimated in the BHS cohort, as well as odds ratio of phosphatidylethanolamine lipid species against incident cardiovascular disease. Total effect represents the sum of genetics effects on coronary artery disease, whether mediated through phosphatidylethanolamine or not. Coronary artery disease effect size was obtained from van der Harst & Verweij 2018. Source data are provided as a Source Data file. MAF minor allele frequency, MR Mendelian randomisation, OR odds ratio, PE phosphatidylethanolamine, SNP single nucleotide polymorphism.