| Literature DB >> 30214008 |
Jihoon Choi1,2, Kelan G Tantisira3,4, Qing Ling Duan5,6.
Abstract
More than 1100 genetic loci have been correlated with drug response outcomes but disproportionately few have been translated into clinical practice. One explanation for the low rate of clinical implementation is that the majority of associated variants may be in linkage disequilibrium (LD) with the causal variants, which are often elusive. This study aims to identify and characterize likely causal variants within well-established pharmacogenomic genes using next-generation sequencing data from the 1000 Genomes Project. We identified 69,319 genetic variations within 160 pharmacogenomic genes, of which 8207 variants are in strong LD (r2>0.8) with known pharmacogenomic variants. Of the latter, eight are coding or structural variants predicted to have high impact, with 19 additional missense variants that are predicted to have moderate impact. In conclusion, we identified putatively functional variants within known pharmacogenomics loci that could account for the association signals and represent the missing causative variants underlying drug response phenotypes.Entities:
Mesh:
Year: 2018 PMID: 30214008 PMCID: PMC6417988 DOI: 10.1038/s41397-018-0048-y
Source DB: PubMed Journal: Pharmacogenomics J ISSN: 1470-269X Impact factor: 3.550
Variants with high impact predictions, which are in LD with known pharmacogenomics variants.
| Chr | New | Gene | Functional annotation | PharmGKB | Gene | Functional annotation | EUR | EAS | AMR | AFR |
|---|---|---|---|---|---|---|---|---|---|---|
| 3 | rs13146 | UMPS | structural interaction variant | rs1801019 | UMPS | missense variant | 0.98 | 1.00 | 1.00 | 0.98 |
| 4 | rs28364311 | ADH1A | splice donor variant & intron variant | rs6811453 | ADH1A | downstream gene variant | 0.99 | 1.00 | 1.00 | 1.00 |
| 6 | rs677830 | OPRM1 | stop gained | rs558025 | OPRM1 | downstream gene variant | 1.00 | 1.00 | 0.98 | <0.8 |
| 7 | rs6977165 | CYP3A5 | stop lost | rs41303343 | CYP3A5 | frameshift variant | <0.8 | <0.8 | <0.8 | 0.80 |
| 17 | rs4330 | ACE | stop gained | rs4341 | ACE | 3 prime UTR variant | 0.99 | 0.95 | 1.00 | 0.95 |
| rs4343 | ACE | synonymous variant | 0.95 | 0.95 | 0.87 | <0.8 | ||||
| rs4344 | ACE | upstream gene variant | 0.99 | 0.96 | 1.00 | 0.90 | ||||
| rs4331 | ACE | synonymous variant | 0.86 | <0.8 | 0.88 | 0.84 | ||||
| rs4359 | ACE | intron variant | 0.96 | <0.8 | 0.91 | <0.8 | ||||
| rs4363 | ACE | splice region variant & intron variant | 0.93 | <0.8 | 0.86 | <0.8 | ||||
| 19 | rs11322783 | IFNL4 | frameshift variant | rs12980275 | IFNL3P1 | upstream gene variant | <0.8 | 0.87 | 0.87 | <0.8 |
| rs8105790 | IFNL3P1 | upstream gene variant | <0.8 | 0.94 | <0.8 | <0.8 | ||||
| rs4803217 | IFNL3 | downstream gene variant | 0.83 | 0.97 | 0.87 | <0.8 | ||||
| rs11881222 | IFNL4 | downstream gene variant | 0.87 | 0.94 | 0.84 | <0.8 | ||||
| rs28416813 | IFNL3 | 5 prime UTR variant | 0.88 | 0.86 | 0.94 | <0.8 | ||||
| rs12979860 | IFNL3 | upstream gene variant | 0.94 | 0.87 | 0.93 | <0.8 | ||||
| rs8109886 | IFNL4 | upstream gene variant | <0.8 | 0.89 | <0.8 | <0.8 | ||||
| rs8113007 | IFNL4 | upstream gene variant | 0.88 | 0.97 | 0.84 | <0.8 | ||||
| rs8099917 | IFNL4 | upstream gene variant | <0.8 | 0.94 | <0.8 | <0.8 | ||||
| rs7248668 | IFNL4 | upstream gene variant | <0.8 | 0.94 | <0.8 | <0.8 | ||||
| 21 | rs881712 | CBR3 | structural interaction variant | rs8133052 | CBR3 | missense variant | 0.94 | 1.00 | 0.83 | <0.8 |
| 22 | rs3761423 | ADORA2A-AS1 | splice donor variant & intron variant | rs5996696 | ADORA2A | upstream gene variant | <0.8 | 0.90 | <0.8 | <0.8 |
Abbreviations: Chr = Chromosome, EUR r2 = linkage disequilibrium in the European Population of 1000 Genomes project measured in r-squared; EAS r2 = linkage disequilibrium in the Eastern Asian Population of 1000 Genomes project measured in r-squared; AMR r2= linkage disequilibrium in the American Population of 1000 Genomes project measured in r-squared; AFR r2 = linkage disequilibrium in the African Population of 1000 Genomes project measured in r-squared. Annotation definitions: structural interaction variant = These are “within protein” interaction loci, which are likely to be supporting the protein structure. They are calculated from single protein PDB entries, by selecting amino acids that are: a) atom within 3 Angstrom of each other; and b) are far away in the AA sequence (over 20 AA distance). The assumption is that, since they are very close in distance, they must be “interacting” and thus important for protein structure. For more information, see http://snpeff.sourceforge.net/SnpEff_manual.html.
Variants predicted with moderate impact identified in this study, which are in LD with known pharmacogenomics variants
| Chr | New Variant | Gene | Functional annotation | PharmGKBVariant | Gene | Annotation | EUR | EAS | AMR | AFR |
|---|---|---|---|---|---|---|---|---|---|---|
| r2 | r2 | r2 | r2 | |||||||
| 18 | rs2853533★ | C18orf56 | missense variant & TFBS variant | rs2853741 | RP11–806L2.5 | upstream gene variant | <0.8 | 0.85 | <0.8 | <0.8 |
| 1 | rs55867221 | C1orf167 | missense variant & TFBS variant | rs17367504 | CLCN6 | upstream gene variant | <0.8 | 0.9 | <0.8 | <0.8 |
| rs3737967 | C1orf167 | missense variant | <0.8 | 0.98 | 0.87 | <0.8 | ||||
| rs2274976 | MTHFR | missense variant | <0.8 | 0.96 | 0.87 | <0.8 | ||||
| 1 | rs1537514 | C1orf167 | missense variant | rs3737967 | C1orf167 | missense variant | <0.8 | 0.98 | 0.87 | <0.8 |
| rs2274976 | MTHFR | missense variant | <0.8 | 0.96 | 0.87 | <0.8 | ||||
| rs17367504 | CLCN6 | upstream gene variant | <0.8 | 0.9 | <0.8 | <0.8 | ||||
| 1 | rs1800595 | F5 | missense variant | rs6018 | F5 | missense variant | 1 | 1 | 1 | 1 |
| 1 | rs6027 | F5 | missense variant | rs6018 | F5 | missense variant | 0.94 | 0.89 | 0.97 | <0.8 |
| 1 | rs6033 | F5 | missense variant | rs6018 | F5 | missense variant | <0.8 | 0.83 | <0.8 | <0.8 |
| 3 | rs3732765 | MED12L | missense variant | rs9859538 | MED12L | intron variant | <0.8 | 0.97 | <0.8 | <0.8 |
| rs10935842 | P2RY12 | upstream gene variant | 1 | 0.99 | 0.97 | <0.8 | ||||
| rs6798637 | P2RY12 | upstream gene variant | 0.89 | <0.8 | <0.8 | <0.8 | ||||
| 4 | rs1693482 | ADH1C | missense variant | rs1662060 | ADH1C | downstream gene variant | 1 | 1 | 0.96 | 1 |
| rs698 | ADH1C | missense variant | 1 | 1 | 0.96 | 1 | ||||
| 4 | rs4963 | ADD1 | missense variant | rs4961 | ADD1 | missense variant | 0.88 | 0.99 | 0.96 | <0.8 |
| 7 | rs2307040 | CALU | missense variant | rs1043550 | CALU | 3 prime UTR variant | 0.82 | <0.8 | 0.96 | 0.89 |
| rs11653 | CALU | 3 prime UTR variant | 0.82 | <0.8 | 0.96 | 0.89 | ||||
| 9 | rs56350726 | SLC28A3 | missense variant | rs10868138 | SLC28A3 | missense variant | 0.81 | <0.8 | 0.83 | <0.8 |
| 11 | rs11604671 | ANKK1 | missense variant | rs2734849 | ANKK1 | missense variant | 0.97 | 1 | 0.98 | <0.8 |
| rs6277 | DRD2 | synonymous variant | <0.8 | 1 | 0.88 | <0.8 | ||||
| rs2587548 | DRD2 | upstream gene variant | <0.8 | 1 | <0.8 | <0.8 | ||||
| rs2734833 | DRD2 | upstream gene variant | <0.8 | 1 | <0.8 | <0.8 | ||||
| rs1076563 | DRD2 | upstream gene variant | <0.8 | 0.97 | <0.8 | <0.8 | ||||
| 16 | rs115629050 | CES1 | missense variant | rs2307240 | CES1 | missense variant | <0.8 | <0.8 | 0.9 | <0.8 |
| 16 | rs2307227 | CES1 | missense variant | rs2307240 | CES1 | missense variant | <0.8 | <0.8 | 0.9 | <0.8 |
| 16 | rs79711700 | CES1 | missense variant | rs2307240 | CES1 | missense variant | 0.88 | <0.8 | 1 | <0.8 |
| 19 | rs2336219 | CD3EAP | missense variant | rs967591 | CD3EAP | 5 prime UTR variant | 0.83 | 1 | 0.96 | <0.8 |
| rs735482 | CD3EAP | missense variant | 1 | 1 | 0.96 | 0.93 | ||||
| 19 | rs12971396 | IFNL4 | missense variant | rs12980275 | IFNL3P1 | upstream gene variant | <0.8 | 0.84 | <0.8 | <0.8 |
| rs8105790 | IFNL3P1 | upstream gene variant | 0.92 | 0.97 | 0.97 | <0.8 | ||||
| rs4803217 | IFNL3 | downstream gene variant | <0.8 | 0.94 | <0.8 | <0.8 | ||||
| rs11881222 | IFNL4 | downstream gene variant | <0.8 | 0.91 | <0.8 | <0.8 | ||||
| rs28416813 | IFNL3 | 5 prime UTR variant | <0.8 | 0.83 | <0.8 | <0.8 | ||||
| rs12979860 | IFNL3 | upstream gene variant | <0.8 | 0.84 | <0.8 | <0.8 | ||||
| rs8109886 | IFNL4 | upstream gene variant | <0.8 | 0.86 | <0.8 | <0.8 | ||||
| rs8113007 | IFNL4 | upstream gene variant | <0.8 | 0.94 | <0.8 | <0.8 | ||||
| rs8099917 | IFNL4 | upstream gene variant | 0.93 | 0.97 | 0.86 | <0.8 | ||||
| rs7248668 | IFNL4 | upstream gene variant | 0.93 | 0.97 | 0.86 | <0.8 | ||||
| 19 | rs4803221 | IFNL4 | missense variant | rs12980275 | IFNL3P1 | upstream gene variant | <0.8 | 0.84 | <0.8 | <0.8 |
| rs8105790 | IFNL3P1 | upstream gene variant | 0.93 | 0.97 | 0.95 | 0.81 | ||||
| rs4803217 | IFNL3 | downstream gene variant | <0.8 | 0.94 | <0.8 | <0.8 | ||||
| rs11881222 | IFNL4 | downstream gene variant | <0.8 | 0.91 | <0.8 | <0.8 | ||||
| rs28416813 | IFNL3 | 5 prime UTR variant | <0.8 | 0.83 | <0.8 | <0.8 | ||||
| rs12979860 | IFNL3 | upstream gene variant | <0.8 | 0.84 | <0.8 | <0.8 | ||||
| rs8109886 | IFNL4 | upstream gene variant | <0.8 | 0.86 | <0.8 | <0.8 | ||||
| rs8113007 | IFNL4 | upstream gene variant | <0.8 | 0.94 | <0.8 | <0.8 | ||||
| rs8099917 | IFNL4 | upstream gene variant | 0.95 | 0.97 | 0.89 | <0.8 | ||||
| rs7248668 | IFNL4 | upstream gene variant | 0.95 | 0.97 | 0.89 | <0.8 | ||||
| 19 | rs762562 | CD3EAP | missense variant | rs967591 | CD3EAP | 5 prime UTR variant | 0.83 | 1 | 0.92 | <0.8 |
| rs735482 | CD3EAP | missense variant | 1 | 1 | 1 | 1 |
rs2853533★ - phenotype association (SNPedia): Neural Tube Defects & Spina Bifida Cystica (The G variant of rs2853533 was associated with Spina Bifida in a transmission disequilibrium test. Study size: 610 families (329 trios, 281 duos) Study population/ethnicity: Patients affected with Spina Bifida and their parents; Houston, TX; Los Angeles, CA; Toronto, ON, Canada Significance metric(s): p=0.0213). Abbreviations: Chr = Chromosome, EUR r2 = linkage disequilibrium in the European Population of 1000 Genomes project measured in r-squared; EAS r2 = linkage disequilibrium in the Eastern Asian Population of 1000 Genomes project measured in r-squared; AMR r2= linkage disequilibrium in the American Population of 1000 Genomes project measured in r-squared; AFR r2 = linkage disequilibrium in the African Population of 1000 Genomes project measured in r-squared.