| Literature DB >> 30420678 |
Tõnis Tasa1,2, Kristi Krebs2, Mart Kals2, Reedik Mägi2, Volker M Lauschke3, Toomas Haller2, Tarmo Puurand4, Maido Remm4, Tõnu Esko2, Andres Metspalu2, Jaak Vilo1, Lili Milani5,6.
Abstract
Pharmacogenomics aims to tailor pharmacological treatment to each individual by considering associations between genetic polymorphisms and adverse drug effects (ADEs). With technological advances, pharmacogenomic research has evolved from candidate gene analyses to genome-wide association studies. Here, we integrate deep whole-genome sequencing (WGS) information with drug prescription and ADE data from Estonian electronic health record (EHR) databases to evaluate genome- and pharmacome-wide associations on an unprecedented scale. We leveraged WGS data of 2240 Estonian Biobank participants and imputed all single-nucleotide variants (SNVs) with allele counts over 2 for 13,986 genotyped participants. Overall, we identified 41 (10 novel) loss-of-function and 567 (134 novel) missense variants in 64 very important pharmacogenes. The majority of the detected variants were very rare with frequencies below 0.05%, and 6 of the novel loss-of-function and 99 of the missense variants were only detected as single alleles (allele count = 1). We also validated documented pharmacogenetic associations and detected new independent variants in known gene-drug pairs. Specifically, we found that CTNNA3 was associated with myositis and myopathies among individuals taking nonsteroidal anti-inflammatory oxicams and replicated this finding in an extended cohort of 706 individuals. These findings illustrate that population-based WGS-coupled EHRs are a useful tool for biomarker discovery.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30420678 PMCID: PMC6460570 DOI: 10.1038/s41431-018-0300-6
Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 4.246
Single-nucleotide variation (SNV) characteristics in whole-genome sequences from Estonian Biobank participants
| (a) Variants in the Estonian Biobank discovery set | Whole genome | ADMET genes ( | ||
|---|---|---|---|---|
|
| % |
| % | |
| Genes with variants | 18,468 | 56 | ||
| Unique variants | 29,108,287 | 1314 | ||
| Variant carriers | 2240 | 2240 | ||
| Novel variants | 11,508,281 | 39.5 | 267 | 20.3 |
| Known variants | 17,600,006 | 60.5 | 1047 | 79.7 |
| MAF > 5% | 5,403,215 | 18.6 | 164 | 12.5 |
| 1% ≤ MAF < 5% | 2,444,670 | 8.4 | 95 | 7.2 |
| 0.5% ≤ MAF < 1 % | 1,211,084 | 4.2 | 45 | 3.4 |
| 0.05% ≤ MAF < 0.5% | 6,460,248 | 22.2 | 285 | 21.7 |
| MAF < 0.05 % | 13,589,070 | 46.7 | 725 | 55.2 |
| AC = 1 | 10,617,607 | 36.5 | 560 | 42.6 |
| AC = 2 | 2,971,463 | 10.2 | 165 | 12.6 |
AC allele count, MAF Minor allele frequency, ADMET absorption, distribution, metabolism, excretion, and toxicity, n number of variants
(a) Numbers and frequencies of detected variants in whole-genome sequences and targeted pharmacogenes
(b) Characterization of targeted pharmacogenetic variations in loss of function (LoF), missense, and regulatory (transcription factor binding sites in liver 5-kb upstream of gene start site) regions
Fig. 1Overview of genetic variation, drug consumption, and adverse drug effect (ADE) data in electronic health records (EHRs). a Outline of pharmacogenomic variation, high-risk drug prescriptions, and ADEs. Drug prescriptions and medical histories in EHRs were combined with whole-genome sequencing data and imputed genotypes to investigate effects of genetic variation in 64 pharmacogenetically important genes on prevalence of ADEs among people with specific drug prescriptions. b Numbers of Estonian Biobank participants with variations in pharmacogenes (light gray bars), filled prescriptions of high-risk drugs with known genetic associations (dark gray bars), and diagnosed ADEs (black bars). c Flowchart visualizing co-occurrences of genetic variants, drug prescriptions, and ADEs among Estonian Biobank participants. Line thickness reflects the number of individuals with a given feature (minimum n = 10)
Validation of previously reported PharmGKB associations in 64 pharmacogenes
| ADE/drug | No ADE/drug | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene | Rs number/star allele (variants in LD) | Variant HGVS term (hg19) | Drug (ATC code) | AA | Level of evidence | Variant carriers | Non-carriers | Variant carriers | Non-carriers | OR (95% CI) | Post-conditional | |
| (a) Significant variant/drug associations | ||||||||||||
|
| CYP2D6 *6 | chr22:g.42525086_42525086delA | Amitriptyline (N06AA09) | *6 | 1 A | 5 | 77 | 5 | 532 | 6.0 (1.33–29.28) | 0.020 | NA |
|
| CYP2D6 *6 | chr22:g.42525086_42525086delA | Tramadol (N02AX02) | *6 | 1B | 9 | 182 | 16 | 1181 | 2.7 (1.04–6.53) | 0.035 | NA |
|
| rs2032582 | chr7:g.87160618 A > T | Atorvastatin (C10AA05) | T | 3 | 3 | 27 | 2 | 126 | 6.9 (1.09–43.1) | 0.037 | NA |
|
| rs1801252 | chr10: g.115804036 A > G | Verapamil (C08DA01) | G | 3 | 11 | 8 | 49 | 129 | 3.6 (1.36–9.46) | 0.018 | NA |
|
| rs12208357 | chr6::g.160543148 C > T | Tramadol (N02AX02) | T | 3 | 52 | 139 | 236 | 961 | 1.5 (1.07–2.15) | 0.009 | NA |
|
| rs3892097 | chr22::g.42524947 C > T | Sertraline (N06AB06) | T | 3 | 12 | 55 | 102 | 194 | 0.4(0.102–0.74) | 0.016 | NA |
| (b) Significant gene/drug associations | ||||||||||||
|
| rs7910642 | chr10:g.101541579 G > A | Simvastatin (C10AA01) | A | 3 | 9 | 55 | 138 | 366 | 0.4 (0.2–0.9) | 0.0181 | 0.0121 |
|
| rs45441199 | chr10: g.101591737 T > C | Carbamazepine (N03AF01) | C | 3 | 3 | 33 | 3 | 256 | 7.7 (1.5–39.5) | 0.0112 | 0.0177 |
|
| rs4980 (rs4358) | chr17:g.61574642 G > A (chr17:g.61571907 G > A) | Torasemide (C03CA04) | A | 3 | 7 | 100 | 13 | 635 | 3.4 (1.3–8.8) | 0.0243 | 0.0243 |
|
| rs56104268 | chr22:g.19946897 A > C | Venlafaxine (N06AX16) | C | 3 | 10 | 42 | 75 | 137 | 0.4 (0.2–0.9) | 0.0264 | 0.0430 |
|
| rs563052490 | chr10:g.96602634 C > A | Esomeprazole (A02BC05) | A | 3 | 2 | 160 | 1 | 1214 | 15.1(1.4–167.1) | 0.0322 | 0.0310 |
|
| rs34030679 | chr22:g.42610099 T > C | Sertraline (N06AB06) | C | 3 | 11 | 56 | 23 | 273 | 2.3 (1.1–5.0) | 0.0033 | 0.0076 |
|
| rs199535154 | chr22:g.42524814 A > G | Tolterodine (G04BD07) | G | 2 A | 2 | 9 | 2 | 72 | 7.7 (1.0–61.3) | 0.0133 | 0.0133 |
|
| rs1058172 | chr22:g.42523528 C > T | Mirtazapine (N06AX11) | T | 2 A | 1 | 13 | 19 | 27 | 0.1 (0.0–0.9) | 0.0387 | 0.0259 |
|
| rs145259190 | chr6:g.160683564 C > T | Metformin (A10BA02) | T | 3 | 6 | 84 | 19 | 666 | 2.5 (1.0–6.4) | 0.0473 | 0.0473 |
| (c) Insignificant after conditional analysis with previously reported association variants | ||||||||||||
|
| rs10249788 | chr7:g.17338147 C > T | Olanzapine (N05AH03) | T | 3 | 7 | 9 | 15 | 70 | NS | 0.030 | 0.051 |
|
| rs28371703 (rs28588594, rs1080989, rs1065852) | chr22:g.42525821 G > T (chr22:g.42528224 G > A chr22:g.42527793 C > T, chr22:g.42526694 G > A) | Sertraline (N06AB06) | T | 3 | 11 | 56 | 100 | 196 | NS | 0.006 | 0.079 |
|
| rs28371703 | chr22:g.42525821 G > T | Fluoxetine (N06AB03) | T | 3 | 13 | 46 | 134 | 228 | NS | 0.028 | 0.135 |
|
| rs1065852 (rs1080989, rs28588594) | chr22:g.42526694 G > A (chr22:g.42527793 C > T, chr22:g.42528224 G > A) | Mirtazapine (N06AX11) | A | 2 A | 14 | 53 | 129 | 230 | NS | 0.033 | 0.139 |
Star allele—nomenclature for pharmacogenes, discrete star allele represents either a single genetic variant or a haplotype
ADE adverse drug effect, OR odds ratio, 95% CI 95% confidence intervals, LD linkage disequilibrium, AA alternative allele, NA not available, NS not significant
(a) Significantly replicated results from previously reported variant-drug associations
(b) Significant new pharmacogenomic gene variants (n = 1314) in previously reported gene-drug associations
(c) Variants that became insignificant after correction with previously reported variants in a gene-drug association
Fig. 2Top five significant findings from genome-wide association analysis (GWAS). a Variants selected for replication with odds ratios (squares) and 95% confidence intervals (CI, horizontal lines). Discovery associations with the most significant ADE group are shown in blue and in the replication cohort in purple. The plot is annotated with p-values from the discovery (pd), replication (pr), and combined meta-analyses (pm). b–f Regional association plots for five replicated loci: NM_001127384.2:c.1047 + 29179 T > C (rs75495219); chr11:g.139896164 A > G (rs7390154); NM_020132.4:c.*7617 G > A (rs8133463); NM_018557.2:c.1014-42068 T > C (rs1882642); NM_001136534.1:c.186 + 7589 A > G (rs4767831). Color-coded dots display linkage disequilibrium values for surrounding single-nucleotide variations calculated from the 1000 Genomes Project release of 2012 (EUR population) and human hg19 assembly
Fig. 3Regional association plots around c.1047 + 29179 T > C (rs75495219) in CTNNA3 (NM_001127384.2) for adverse drug effects (ADEs) among individuals with oxicam prescriptions. Color-coded dots display linkage disequilibrium values for surrounding single-nucleotide variations calculated from the 1000 Genomes Project release of 2012 (EUR population) and human hg19 assembly. a ADEs defined as a set of 79 ICD10 codes. b ADEs restricted to a subset of myopathy-related ICD10 codes from a