| Literature DB >> 29212778 |
Pim van der Harst1, Niek Verweij2.
Abstract
RATIONALE: Coronary artery disease (CAD) is a complex phenotype driven by genetic and environmental factors. Ninety-seven genetic risk loci have been identified to date, but the identification of additional susceptibility loci might be important to enhance our understanding of the genetic architecture of CAD.Entities:
Keywords: computational biology; coronary artery disease; genetics; genome-wide association study; sample size
Mesh:
Substances:
Year: 2017 PMID: 29212778 PMCID: PMC5805277 DOI: 10.1161/CIRCRESAHA.117.312086
Source DB: PubMed Journal: Circ Res ISSN: 0009-7330 Impact factor: 17.367
Sixty-Four Novel Genome-Wide Significant CAD Loci
Figure 1.Network analyses of reconstituted gene sets. The total number of significant gene sets involved in coronary artery disease (CAD) increased to 13.90% since the 1000 Genome genome-wide association studies of CARDIoGRAMplusC4D, considering all possible gene sets. Clustering by modularity using Gephi software indicated that pathways specific for cardiovascular/heart development, inflammation, lipids, kidney and coagulation clustered together. PPI networks & Other indicates a remaining bin predominantly populated by protein–protein interaction networks.
Figure 2.Heatmap of associations in UK Biobank with novel loci. Heatmap of z scores for different diseases and phenotypes in UK Biobank, aligned to increased risk of coronary artery disease. Only significant associations (false discovery rate<0.01) are shown. The genetic risk score constructed with the known and novel loci, weighted using coefficients of CARDIoGRAMplusC4D, is highlighted by the red rectangle. BMI indicates body mass index; COPD, chronic obstructive pulmonary disease; RBC, red blood cell; and TIA, transient ischemic attack.
Figure 3.The role of regulatory DNA underlying coronary artery disease (CAD)-associated single-nucleotide polymorphisms (SNPs). Enrichment of genome-wide association analysis P values in Dnase I hypersensitive sites (DHS). CAD SNPs at different genome-wide association study (GWAS) threshold were significantly enriched in DHS footprints (A) and hot spots (B) across many different tissues and cell types. The fold enrichment was highly significant for most tissues and cell types (P<1×10−8) as indicated by the 4 colored circles next to the labels, 3 colored circles indicate P<1×10−7. Label sizes of tissue types were downsized because of space limitations; tissue types may be represented by multiple samples, indicated by hash marks of the same color. C, Subsequent prioritization of potential causal annotations underlying the 161 CAD loci also suggested that regions of DHS may be underlying the associations, but coding variants, conservation, 5′ untranslated region (UTR), and H3K4me1 annotations were more likely to be causal. D, Posterior probabilities for causality for each variant in the 164 CAD loci were calculated by an empirical Bayes approach implemented in the Probabilistic Annotation Integrator Framework, taking into account linkage disequilibrium (LD), association statistics, and the potentially causal annotations and summarized in Table 2 and Online Table XX. CTCF indicates transcriptional repressor CTCF; DGF, digital genomic footprint by Dnase1 hypersensitivity; FANTOM5, functional annotation of the mammalian genome V5; TFBS, transcription factor binding site; and TSS, transcription start site.
For 28 Loci, the 95% Credible Set of Causal Variants Consisted of a Single Coronary Artery Disease Variant