| Literature DB >> 32591598 |
Alexandra S Shadrina1,2, Tatiana I Shashkova3,4,5, Anna A Torgasheva3,6, Sodbo Z Sharapov3,6, Lucija Klarić7,8, Eugene D Pakhomov3, Dmitry G Alexeev3, James F Wilson8,9, Yakov A Tsepilov3,6, Peter K Joshi9, Yurii S Aulchenko10,11.
Abstract
Genome-wide association studies have led to a significant progress in identification of genomic loci affecting coronary artery disease (CAD) risk. However, revealing the causal genes responsible for the observed associations is challenging. In the present study, we aimed to prioritize CAD-relevant genes based on cumulative evidence from the published studies and our own study of colocalization between eQTLs and loci associated with CAD using SMR/HEIDI approach. Prior knowledge of candidate genes was extracted from both experimental and in silico studies, employing different prioritization algorithms. Our review systematized information for a total of 51 CAD-associated loci. We pinpointed 37 genes in 36 loci. For 27 genes we infer they are causal for CAD, and for 10 further genes we judge them most likely causal. Colocalization analysis showed that for 18 out of these loci, association with CAD can be explained by changes in gene expression in one or more CAD-relevant tissues. Furthermore, for 8 out of 36 loci, existing evidence suggested additional CAD-associated genes. For the remaining 15 loci, we concluded that evidence for gene prioritization remains inconsistent, insufficient, or absent. Our results provide deeper insights into the genetic etiology of CAD and demonstrate knowledge gaps where further research is warranted.Entities:
Mesh:
Year: 2020 PMID: 32591598 PMCID: PMC7320185 DOI: 10.1038/s41598-020-67001-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Genes in 51 CAD-associated loci (±250 kb around the lead SNP) proposed to be causal according to different lines of evidence.
| Lead SNP¥ | Chr: position* | Nearest† known gene | Genes prioritized by SMR/HEIDI‡ | Candidate genes from literature** | Genes prioritized previously based on bioinformatics approaches[ | Conclusion | |
|---|---|---|---|---|---|---|---|
| 1 | rs17114036 | 1: 56 962 821 | — | ▪ | ▪ ▪ | ||
| 2 | rs602633 | 1: 109 821 511 | ▪ ▪ ▪ | ▪ ▪ ▪ | ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ | ||
| 3 | rs4129267 | 1: 154 426 264 | ▪ | ▪ | ▪ ▪ ▪ ▪ ▪ | ||
| 4 | rs10919065 | 1: 169 093 557 | ▪ ▪ | ▪ | ▪ ▪ ▪ | ||
| 5 | rs6700559 | 1: 200 646 073 | ▪ ▪ ▪ | — | ▪ ▪ ▪ ▪ ▪ | Evidence is inconsistent. | |
| 6 | rs2820315 | 1: 201 872 264 | ▪ ▪ | ▪ | ▪ ▪ ▪ ▪ | Evidence is inconsistent. | |
| 7 | rs16986953 | 2: 19 942 473 | — | — | — | No evidence. | |
| 8 | rs515135 | 2: 21 286 057 | — | ▪ | ▪ ▪ | ||
| 9 | rs6544713 | 2: 44 073 881 | — | ▪ ▪ | ▪ ▪ | ||
| 10 | rs1561198 | 2: 85 809 989 | ▪ ▪ ▪ ▪ ▪ | ▪ ▪ | ▪ ▪ ▪ ▪ ▪ ▪ ▪ | Evidence is inconsistent. | |
| 11 | rs2252641 | 2: 145 801 461 | — | ▪ | ▪ | Evidence is inconsistent. | |
| 12 | rs2351524 | 2: 203 880 992 | ▪ ▪ ▪ ▪ | ▪ | ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ | Evidence is inconsistent. | |
| 13 | rs2306374 | 3: 138 119 952 | ▪ ▪ ▪ | ▪ | ▪ ▪ ▪ ▪ | ||
| 14 | rs1429141 | 4: 148 288 067 | — | ▪ | ▪ ▪ | ||
| 15 | rs7692387 | 4: 156 635 309 | ▪ | ▪ | ▪ | ||
| 16 | rs273909 | 5: 131 667 353 | — | — | ▪ | Insufficient evidence. | |
| 17 | rs246600 | 5: 142 516 897 | — | — | ▪ | Insufficient evidence. | |
| 18 | rs7751826 | 6: 12 900 977 | ▪ ▪ ▪ | ▪ | ▪ ▪ ▪ ▪ ▪ ▪ | ||
| 19 | rs10947789 | 6: 39 174 922 | — | — | ▪ | Insufficient evidence. | |
| 20 | rs2327429 | 6: 134 209 837 | ▪ ▪ ▪ | ▪ | ▪ | ||
| 21 | rs3103349# | 6: 160 740 721 | ▪ | ▪ | ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ | ||
| 22 | rs10455872# | 6: 161 010 118 | ▪ | ▪ | ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ | ||
| 23 | rs11556924 | 7: 129 663 496 | ▪ | ▪ ▪ | ▪ ▪ ▪ ▪ ▪ ▪ | ||
| 24 | rs10237377 | 7: 139 757 136 | — | ▪ | ▪ ▪ ▪ | ||
| 25 | rs11204085 | 8: 19 940 796 | ▪ | ▪ | ▪ ▪ ▪ | ||
| 26 | rs2954032 | 8: 126 493 392 | — | ▪ | — | ||
| 27 | rs3218020 | 9: 21 997 872 | — | ▪ | ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ | ||
| 28 | rs579459 | 9: 136 154 168 | ▪ ▪ | ▪ ▪ | ▪ ▪ ▪ ▪ ▪ | Evidence is inconsistent. | |
| 29 | rs2505083 | 10: 30 335 122 | ▪ | ▪ | ▪ ▪ ▪ | ||
| 30 | rs10793513 | 10: 44 494 546 | — | — | — | No evidence. | |
| 31 | rs523297 | 10: 44 756 557 | — | ▪ | — | ||
| 32 | rs2246833 | 10: 91 005 854 | ▪ ▪ ▪ | ▪ | ▪ ▪ ▪ | ||
| 33 | rs11191447 | 10: 104 652 323 | ▪ ▪ ▪ ▪ | ▪ | ▪ ▪ ▪ ▪ ▪ ▪ | ||
| 34 | rs12801636 | 11: 65 391 317 | ▪ ▪ ▪ ▪ | ▪ | ▪ ▪ ▪ ▪ ▪ | Evidence is inconsistent. | |
| 35 | rs974819 | 11: 103 660 567 | ▪ ▪ | ▪ | ▪ | ||
| 36 | rs3184504§ | 12: 111 884 608 | ▪ ▪ ▪ ▪ ▪ | ▪ ▪ | ▪ ▪ ▪ ▪ ▪ ▪ | ||
| 37 | rs441§ | 12: 112 228 849 | ▪ ▪ ▪ ▪ ▪ | ▪ ▪ ▪ | ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ | Evidence is inconsistent. | |
| 38 | rs2258287 | 12: 121 454 313 | ▪ ▪ ▪ | ▪ | ▪ ▪ | ||
| 39 | rs11057830 | 12: 125 307 053 | — | ▪ | ▪ ▪ ▪ ▪ | ||
| 40 | rs9319428 | 13: 28 973 621 | — | ▪ | ▪ | ||
| 41 | rs9515203 | 13: 111 049 623 | — | ▪ | ▪ ▪ ▪ ▪ ▪ ▪ | ||
| 42 | rs2895811 | 14: 100 133 942 | ▪ | ▪ ▪ ▪ | |||
| 43 | rs7178051 | 15: 79 118 296 | ▪ ▪ ▪ ▪ | ▪ | ▪ ▪ ▪ ▪ ▪ ▪ ▪ | ||
| 44 | rs17514846 | 15: 91 416 550 | ▪ ▪ ▪ | ▪ | ▪ ▪ ▪ ▪ ▪ ▪ ▪ | ||
| 45 | rs1050362 | 16: 72 130 815 | ▪ ▪ ▪ ▪ | ▪ | ▪ | ||
| 46 | rs170041 | 17: 2 170 216 | — | ▪ ▪ | ▪ ▪ | Evidence is inconsistent. | |
| 47 | rs12936587 | 17: 17 543 722 | ▪ ▪ | ▪ ▪ ▪ | ▪ ▪ | Evidence is inconsistent. | |
| 48 | rs2070783 | 17: 62 406 971 | ▪ | ▪ | ▪ ▪ | ||
| 49 | rs12052058 | 19: 11 159 525 | ▪ ▪ ▪ ▪ | ▪ ▪ ▪ | ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ | ||
| 50 | rs867186 | 20: 33 764 554 | ▪ ▪ ▪ ▪ ▪ | ▪ | ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ ▪ | ||
| 51 | rs9982601 | 21: 35 599 128 | ▪ ▪ | ▪ | ▪ ▪ ▪ |
Alternative gene names or non-coding RNA names are given in parenthesis after official gene symbols. Literature overview for each candidate gene found in literature sources is provided in Supplementary Table S4. In the studies[46–49], possible candidate genes were linked to the prioritized CAD-associated SNPs (data on those SNPs located in the51 studied loci can be found in Supplementary Table S3b). Arrows near gene names indicate that these genes have been linked to the same prioritized SNP in the locus or to SNPs in high LD with each other (r2 ≥ 0.8; Supplementary Table S3c). If there are two or more groups of such genes in the locus, single arrow indicates the genes linked to one SNP; double, triple, and quadruple arrows – genes linked to other SNPs (e.g., in locus 43). We also marked with arrows the genes found in SMR/HEIDI analysis if the top SNP (instrumental variable used for investigating relationships between gene expression and CAD) was the same or in high LD (r2 ≥ 0.8; Supplementary Table S3d) with SNPs prioritized in other studies.
¥Loci for the analysis in our study were defined as regions within ±250 kb around these lead SNPs (see Supplementary Table S1c).
*Chromosome: position of the lead SNP on the chromosome according to GRCh37.p13
†Nearest gene according to the NCBI dbSNP database (https://www.ncbi.nlm.nih.gov/snp/)
‡Information on whether increased gene expression in CAD-relevant tissue is associated with the increased or decreased CAD risk is given in Supplementary Table S2a.
**Candidate genes with the most compelling evidence for their role in CAD according to literature data are shown in bold.
¶Converging evidence of a potential functional SNP-gene mechanism (demonstrated in the study by van der Harst et al.[48]).
#, §These pairs of loci are overlapping and contain partially the same genes. Since the distance between the lead SNPs rs3103349–rs10455872 and rs3184504–rs441 was > 250 kb (269,4 kb and 344,2 kb, respectively), SMR/HEIDI analysis was performed for each locus (±250 kb around the lead SNP) separately. The genes prioritized based on literature data and revealed in the gene-based association analysis[49], if located in two loci in the pair, were attributed to both. Similarly, if the CAD-associated SNPs prioritized in the studies by Brænne et al.[46], Lempiäinen et al.[47], and van der Harst et al.[48] were located in two loci in the pair, we attributed the genes linked with these SNPs to both loci.
Figure 1Summary of findings for 51 CAD-associated loci. Matching loci numbers with chromosomal positions and lead SNPs can be found in Table 1 and Supplementary Table S1c. Prioritized genes are listed in Table 1 and Supplementary Table S4.