| Literature DB >> 30327483 |
Dorothée Diogo1, Chao Tian2, Christopher S Franklin3, Mervi Alanne-Kinnunen4, Michael March5, Chris C A Spencer3, Ciara Vangjeli3, Michael E Weale3, Hannele Mattsson4,6, Elina Kilpeläinen4, Patrick M A Sleiman5, Dermot F Reilly7, Joshua McElwee7,8, Joseph C Maranville7,9, Arnaub K Chatterjee7,10, Aman Bhandari7,11, Khanh-Dung H Nguyen12, Karol Estrada12, Mary-Pat Reeve13, Janna Hutz13, Nan Bing14, Sally John12, Daniel G MacArthur15,16, Veikko Salomaa6, Samuli Ripatti4,15,17, Hakon Hakonarson5, Mark J Daly15,16, Aarno Palotie4,15,16,18,19, David A Hinds2, Peter Donnelly3, Caroline S Fox7, Aaron G Day-Williams7,12, Robert M Plenge7,9, Heiko Runz20,21.
Abstract
Phenome-wide association studies (PheWAS) have been proposed as a possible aid in drug development through elucidating mechanisms of action, identifying alternative indications, or predicting adverse drug events (ADEs). Here, we select 25 single nucleotide polymorphisms (SNPs) linked through genome-wide association studies (GWAS) to 19 candidate drug targets for common disease indications. We interrogate these SNPs by PheWAS in four large cohorts with extensive health information (23andMe, UK Biobank, FINRISK, CHOP) for association with 1683 binary endpoints in up to 697,815 individuals and conduct meta-analyses for 145 mapped disease endpoints. Our analyses replicate 75% of known GWAS associations (P < 0.05) and identify nine study-wide significant novel associations (of 71 with FDR < 0.1). We describe associations that may predict ADEs, e.g., acne, high cholesterol, gout, and gallstones with rs738409 (p.I148M) in PNPLA3 and asthma with rs1990760 (p.T946A) in IFIH1. Our results demonstrate PheWAS as a powerful addition to the toolkit for drug discovery.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30327483 PMCID: PMC6191429 DOI: 10.1038/s41467-018-06540-3
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Candidate drug targets investigated in the study
| Human genetics | Drug developmentb | ||
|---|---|---|---|
| Gene | Prior GWAS associationsa | Mendelian disorders (direction of effect) | Indications/status/proposed mechanism of action |
|
| CD; IBD | – | – / – / – |
|
| CD; IBD; UC | Familial candidiasis (LOF) | – / – / – |
|
| IBD; MPV; T1D | – | – / – / – |
|
| Asthma | – | – / – / – |
|
| aPTT; VTE; FXI levels | FXI deficiency (LOF) | Hemophilia C/launched/factor XI stimulant |
| Thrombosis/phase II/factor XI inhibitor | |||
|
| aPTT; FXII levels | Hereditary angioedema (GOF); FXII deficiency (LOF) | Hereditary angioedema; thrombosis/phase I/factor XII inhibitor |
| Antiphospholipid syndrome/preclinical/factor XII inhibitor | |||
|
| BMI | – | Cachexia/preclinical/GDF-15 antagonist |
|
| CD; IBD; UC | – | Cough; mastocytosis; pruritus/phase II/GPR35 agonist |
|
| CD; IBD; UC | – | – / – / – |
|
| BP; CAD; MI | Moyamoya 6 with achalasia (LOF) | – / – / – |
|
| IgAD; IBD; psoriasis; UC; SLE; T1D; vitiligo | Aicardi–Goutieres syndrome (GOF); Singleton–Merten syndrome (GOF) | Solid cancer/phase I/IFIH1 stimulant (additional targets: RIG-I; TLR3) |
|
| PBC; RA; SJO; SLE; SSc; UC | – | – / – / – |
|
| aPTT; FXI levels | – | – / – / – |
|
| Galectin-3 levels | – | Liver fibrosis; non-alcoholic steatohepatitis; psoriasis/phase II/galectin-1 and 3 antagonist |
| Pulmonary idiopathic fibrosis/phase II/galectin-3 antagonist | |||
| Atopic eczema; head and neck cancer; melanoma/phase I/galectin-1 and 3 antagonist | |||
| Arrhythmia; fibrosis: myocardial, renal; pulmonary hypertension/preclinical/galectin-1 and 3 antagonist | |||
| Cardiac and renal conditions/preclinical/galectin-3 antagonist | |||
|
| CD; IBD; PD; UC | Familial Parkinson’s disease (GOF) | Parkinson’s disease/phase I/LRRK2 inhibitor |
| Alzheimer’s disease; glaucoma/preclinical/LRRK2 inhibitor | |||
|
| Alcohol-related cirrhosis; ALT; CT; hepatic steatosis; NAFLD | – | – |
|
| Fasting glucose; T2D | – | – / – / – |
|
| PD | – | – / – / – |
|
| CD; IBD; MS; PBC; psoriasis; RA; SLE; T1D; UC | Immunodeficiency (LOF) | Atopic eczema/phase II/JAK1 and TYK2 inhibitor |
| psoriasis/phase II/TYK2 inhibitor; JAK1 and TYK2 inhibitor | |||
| SLE/phase II/TYK2 inhibitor | |||
| Alopecia areata; UC/phase II/JAK1 and TYK2 inhibitor | |||
| IBD/phase I/TYK2 inhibitor; JAK1 and TYK2 inhibitor | |||
| psoriatic arthritis/phase I/TYK2 inhibitor | |||
| CD/preclinical/JAK1-3 and TYK2 inhibitor | |||
| cancer: acute leukemia, colorectal, anaplastic large cell lymphoma; MS; RA/preclinical/JAK1 and TYK2 inhibitor | |||
| Uveitis/preclinical/TYK2 inhibitor | |||
ALT: alanine aminotransferase, aPTT: activated partial thromboplastin time, BMI: body mass index, CAD: coronary artery disease, IgAD: immunoglobulin A deficiency, MI: myocardial infarction, MPV: mean platelet volume, NAFLD: non-alcoholic fatty liver disease, RA: rheumatoid arthritis, SLE: systemic lupus erythematosus, T1D: type 1 diabetes, T2D: type 2 diabetes, VTE: venous thromboembolism, CD: Crohn’s disease, IBD: inflammatory bowel disease, MS: multiple sclerosis, PBC: primary biliary cirrhosis, PD: Parkinson’s disease, SJO: Sjogren’s syndrome, SSc: systemic sclerosis, UC: ulcerative colitis, GOF: gain-of-function, LOF: loss-of-function
aPublished associations at the genetic locus as defined in Methods. Causal gene not always unambiguously established. For details, see Supplementary Information
bAs listed in Citeline’s Pharmaprojects database. Active development with most advanced status (preclinical or clinical) as of Dec 16, 2017 is indicated
Cohorts included in this study
| Cohort | Participants geographic distribution | Phenotypes source | Max sample size | |
|---|---|---|---|---|
| 23andMe | 89% USA (adult) | Questionnaire-based self-reports | 654 | 671,151 |
| Genomics plc UK Biobank | 100% UK (adult) | Questionnaire-based self-reports, medical interviews and follow-up | 90 | 112,337 |
| FINRISK | 100% Finns (adult) | National health registries (ICD8,9,10) | 278 | 21,371 |
| CHOP | 100% USA (pediatric) | Electronic health records (ICD9-CM) | 870 | 12,044 |
| Genomics plc GWAS | Mixed | Mixed—multiple independent disease-specific cohorts | 34 | - |
aNumber of binary endpoints with N cases ≥ 20
Fig. 1Phenotypes tested and study design. a Categories of phenotypes assessed in the 23andMe, Genomics plc UK Biobank, FINRISK, and CHOP cohorts. b Manual phenotype mapping was performed to identify phenotypes shared between cohorts. One hundred and forty-five phenotypes were captured with at least 20 cases in at least 2 cohorts. After PheWAS in each cohort separately, the 145 phenotypes were meta-analyzed to increase statistical power and enable systematic comparisons of results between cohorts. c The 145 mapped phenotypes (see Supplementary Table 2) represent a broad spectrum of phenotypic categories and are captured with variable case:control ratios in the cohorts tested
Significant novel associations in the PheWAS meta-analysis
| Novel association in meta-PheWASa | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Gene | SNP | EA (EAF)b | Known associated phenotypec | Phenotype | OR (CI95) | Directiond | |||
|
| rs763361 | T (0.47) | IBD | Hypothyroidism | 1.05 (1.04–1.07) | 8.11e−11 | ++?+? | 35,428 | 412,577 |
|
| rs17724992 | A (0.73) | BMI | Heart metabolic diseasee | 1.03 (1.02–1.04) | 3.08e−09 | +???? | 275,944 | 209,302 |
| High blood pressuree | 1.03 (1.02–1.04) | 7.64e−09 | ++??? | 151,511 | 465,686 | ||||
| Blood pressure medicatione | 1.03 (1.02–1.04) | 1.76e−07 | +???? | 125,406 | 394,753 | ||||
| GERD | 1.03 (1.02–1.04) | 6.11e−07 | +???? | 130,654 | 384,572 | ||||
| Any CVDe | 1.03 (1.01–1.04) | 1.40e−06 | +???? | 148,577 | 388,405 | ||||
|
| rs1990760 | T (0.61) | T1D |
|
| 1.11e−07 | − − − − − | 57,101 | 269,659 |
|
| rs10488631 | C (0.11) | SLE | Hypothyroidism | 1.08 (1.05–1.12) | 5.78e−07 | ++?+? | 23,182 | 236,240 |
|
| rs738409 | G (0.33) | ALT |
|
| 1.47e−11 | −???? | 14,812 | 187,018 |
|
|
| 1.59e−07 | − −??? | 101,646 | 180,947 | ||||
|
| rs34536443 | G (0.89) | Psoriasis | Any immune disorder | 1.10 (1.07–1.13) | 4.27e−12 | +???? | 112,148 | 173,986 |
| Hypothyroidism | 1.14 (1.08–1.20) | 1.19e−06 | ++?−? | 23,145 | 233,757 | ||||
ALT: alanine aminotransferase, BMI: body mass index, CVD: cardiovascular disease, EA: effect allele, EAF: effect allele frequency, GERD: gastroesophageal reflux disease, IBD: inflammatory bowel disease, SLE: systemic lupus erythematosus, T1D: type diabetes, T2D: type 2 diabetes
aAssociations reaching P < 1.8e−6 (Bonferroni-corrected significance threshold) in the meta-analysis of PheWAS results with GWAS results. The full list of potential novel SNP-phenotype pairs reaching FDR < 0.1 is provided in Supplementary Table 5. Novel associations with direction of effect opposite to the known associated disease(s) effect, predicting potential adverse drug events, are highlighted in bold
bThe effect allele is the risk allele for known associated disease(s) related to the therapeutic hypothesis
cKnown associated disease related to the therapeutic hypothesis (surrogate for efficacy). The strongest association reported in the literature is indicated. The full list of known associations is provided in Supplementary Table 1
dDirection of effect in 23andMe, Genomics plc UK Biobank, FINRISK, CHOP, and GWAS
eCorrelated phenotypes
fMeta-analysis results including the 23andMe, Gplc/UK Biobank, FINRISK, CHOP, and GWAS Gabriel cohorts. When further including the independent GWAS EVE study, the association reaches P = 6.7 × 10−8
Fig. 2Meta-PheWAS results for 25 SNPs in candidate drug targets. Phenotypes associated at FDR < 0.1 (P < 7e−4) with at least one SNP in the meta-PheWAS are represented. Direction of effect of the known disease-risk increasing allele related to the therapeutic hypothesis is indicated. A positive Z score (in red) indicates increased risk, a negative Z score (in blue) indicates reduced risk. Known and novel associations reaching FDR < 0.1 are outlined in white and black respectively. Detailed association results are provided in the Supplementary Data 1
Fig. 3Pleiotropic effects of IFIH1 LOF variants. a A significant association of IFIH1 rs1990760-C (p.T946A) with increased risk of asthma was observed in the meta-analysis of PheWAS and GWAS results, with consistent effect estimate across the six cohorts tested. Odds ratios (OR) and 95% confidence intervals are represented. b Power estimation demonstrates the lack of power to detect an association at rs1990760-C in currently available asthma GWAS studies. Power to surpass various significance cutoffs (P < 0.05; FDR < 0.1, P < 7e−4; study-wide significance after Bonferroni correction, P < 1.8e−6; and genome-wide significance, P < 5e−8) in the six cohorts was estimated using the frequency of the asthma risk allele (RAF = 0.39), the OR in the PheWAS/GWAS meta-analysis (OR = 1.037), a disease prevalence of 8%, and the number of cases and controls in each of the cohorts. c Co-localization analysis demonstrates that the asthma, systemic lupus erythematosus (SLE), and ulcerative colitis (UC) associations at the IFIH1 locus are driven by a shared causal signal. Regional association results with asthma (red), SLE (blue) and UC (orange) are shown. PP, posterior probability of co-localization. d Results from this study (indicated by an asterix) combined with previously published findings suggest an allelic series of LOF IFIH1 alleles decreasing the risk of various autoimmune diseases while increasing the risk of asthma and UC. OR and 95% confidence intervals of association for the IFIH1 loss-of-function alleles rs1990760-C (p.T946A) and rs35667974-C (p. I923V) are shown
Fig. 4PheWAS for contact activation coagulation pathway targets. Three SNPs known to affect plasma protein levels of FXI (rs4253399), FXII (rs2731672), and KNG1 (rs5030062), and previously reported associated with partial thromboplastin time (aPTT) were interrogated in meta-PheWAS. Five phenotypes were observed as significantly associated (FDR < 0.1) with at least one of the three SNPs: blood clots (23andMe, FINRISK, and CHOP: 7487 cases, 273,305 controls), known association with the F11 SNP (*), blood thinners medication (23andMe: 22,985 cases, 236,431 controls), warfarin medication (23andMe: 7142 cases, 94,701 controls), pulmonary embolism (Gplc/UK Biobank: 949 cases, 111,077 controls), and bleeding tendency (23andMe: 1574 cases, 85,223 controls). Odds ratios (OR) and 95% confidence intervals of association of the aPTT-increasing alleles are shown. Detailed association results are provided in the Supplementary Data 1