| Literature DB >> 25057111 |
Abstract
BACKGROUND: Genome wide association studies (GWAS) have revealed a large number of links between genome variation and complex disease. Among other benefits, it is expected that these insights will lead to new therapeutic strategies, particularly the identification of new drug targets. In this paper, we evaluate the power of GWAS studies to find drug targets by examining how many existing drug targets have been directly 'rediscovered' by this technique, and the extent to which GWAS results may be leveraged by network information to discover known and new drug targets.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25057111 PMCID: PMC4083410 DOI: 10.1186/1471-2164-15-S4-S5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Overlap between GWAS reported genes and drug targets
| Disease | Number of Drugs | GWAS reported genes | Number of drug targets | GWAS overlap, same disease* | GWAS overlap, all diseases** |
|---|---|---|---|---|---|
| Acute lymphoblastic leukemia | 6 | 19 | 10 | 0 | 3 |
| Age-related macular degeneration | 9 | 23 | 2 | 1 | 2 |
| Allergic rhinitis | 69 | 11 | 20 | 0 | 5 |
| Alzheimer's disease | 5 | 54 | 179 | 0 | 40 |
| Amyotrophic lateral sclerosis | 3 | 26 | 2 | 0 | 1 |
| Ankylosing spondylitis | 39 | 17 | 29 | 0 | 9 |
| Arthritis | 168 | 7 | 112 | 0 | 35 |
| Asthma | 102 | 43 | 52 | 1 | 19 |
| Atopic dermatitis | 12 | 8 | 3 | 0 | 1 |
| Atrial fibrillation | 45 | 7 | 25 | 0 | 14 |
| Attention deficit hyperactivity disorder | 3 | 81 | 1 | 0 | 1 |
| Autism | 3 | 6 | 10 | 0 | 5 |
| Basal cell carcinoma | 6 | 8 | 9 | 0 | 2 |
| Bipolar disorder/Schizophrenia | 93 | 215 | 110 | 1 | 32 |
| Blood pressure/Hypertension | 351 | 100 | 114 | 3 | 35 |
| Breast cancer | 84 | 42 | 43 | 1 | 13 |
| Celiac disease | 3 | 74 | 1 | 0 | 0 |
| Chronic kidney disease | 8 | 69 | 6 | 0 | 2 |
| Chronic lymphocytic leukemia | 14 | 17 | 29 | 0 | 5 |
| Chronic myeloid leukemia | 6 | 9 | 15 | 0 | 6 |
| Chronic obstructive pulmonary disease | 14 | 18 | 7 | 0 | 2 |
| Colorectal cancer | 8 | 14 | 16 | 0 | 6 |
| Coronary heart disease | 6 | 84 | 5 | 0 | 3 |
| Crohn's disease | 7 | 136 | 23 | 0 | 9 |
| Cystic fibrosis | 8 | 7 | 11 | 0 | 5 |
| Depression/Depressive disorder | 45 | 68 | 73 | 0 | 17 |
| Diabetes | 46 | 205 | 59 | 4 | 21 |
| Duodenal ulcer | 8 | 2 | 18 | 0 | 5 |
| Emphysema | 10 | 5 | 17 | 0 | 5 |
| Endometrial cancer | 1 | 2 | 2 | 0 | 0 |
| Endometriosis | 5 | 4 | 7 | 0 | 3 |
| End-stage renal disease | 2 | 2 | 8 | 0 | 3 |
| Epilepsy | 18 | 1 | 53 | 0 | 10 |
| Esophageal cancer | 1 | 18 | 2 | 0 | 1 |
| Gallstones | 1 | 1 | 1 | 0 | 0 |
| Gastric cancer | 2 | 3 | 1 | 0 | 0 |
| Glaucoma | 24 | 13 | 31 | 0 | 6 |
| Glioblastoma | 2 | 1 | 1 | 0 | 0 |
| Heart failure | 51 | 16 | 65 | 0 | 27 |
| HIV/AIDS | 54 | 62 | 53 | 1 | 9 |
| Hodgkin's lymphoma | 8 | 7 | 31 | 0 | 7 |
| Hypertriglyceridemia | 2 | 5 | 4 | 0 | 3 |
| Hypothyroidism | 5 | 43 | 8 | 1 | 5 |
| Inflammatory bowel disease | 2 | 18 | 8 | 0 | 4 |
| Kawasaki disease | 1 | 20 | 11 | 1 | 5 |
| Malaria | 17 | 3 | 17 | 0 | 4 |
| Male infertility | 6 | 5 | 3 | 0 | 3 |
| Melanoma | 9 | 20 | 6 | 0 | 0 |
| Menopause age | 9 | 23 | 15 | 0 | 4 |
| Migraine | 20 | 7 | 46 | 0 | 10 |
| Multiple myeloma | 7 | 3 | 10 | 0 | 3 |
| Multiple sclerosis | 10 | 123 | 30 | 1 | 12 |
| Myocardial infarction | 29 | 14 | 44 | 0 | 17 |
| Narcolepsy | 2 | 4 | 6 | 0 | 1 |
| Nephropathy/Nephrotic syndrome | 20 | 26 | 38 | 0 | 9 |
| Neuroblastoma | 2 | 2 | 6 | 0 | 2 |
| Non-small cell lung cancer | 5 | 7 | 10 | 0 | 1 |
| Obesity | 4 | 40 | 11 | 0 | 4 |
| Osteoarthritis | 26 | 3 | 46 | 0 | 10 |
| Osteoporosis | 13 | 10 | 10 | 0 | 2 |
| Ovarian cancer | 5 | 10 | 4 | 0 | 1 |
| Paget's disease | 4 | 9 | 6 | 0 | 1 |
| Pancreatic cancer | 2 | 29 | 11 | 0 | 4 |
| Panic disorder | 6 | 10 | 18 | 0 | 4 |
| Parkinson's disease | 20 | 62 | 184 | 1 | 34 |
| Polycystic ovary syndrome | 2 | 7 | 2 | 0 | 1 |
| Prostate cancer | 14 | 94 | 21 | 0 | 8 |
| Psoriasis/Psoriatic arthritis | 19 | 30 | 39 | 0 | 13 |
| Refractive error | 1 | 4 | 4 | 0 | 1 |
| Restless legs syndrome | 2 | 6 | 18 | 0 | 6 |
| Rheumatoid arthritis | 46 | 67 | 80 | 2 | 29 |
| Sleepiness | 1 | 2 | 2 | 0 | 0 |
| Stevens-Johnson syndrome/toxic epidermal necrolysis | 1 | 12 | 1 | 0 | 0 |
| Stroke | 8 | 4 | 7 | 0 | 6 |
| Tardive dyskinesia | 3 | 1 | 22 | 0 | 7 |
| Testicular cancer | 4 | 7 | 6 | 0 | 2 |
| Thyroid cancer | 2 | 5 | 3 | 0 | 2 |
| Tuberculosis | 12 | 5 | 18 | 0 | 4 |
| Type 1 diabetes | 8 | 74 | 18 | 0 | 8 |
| Type 2 diabetes | 28 | 91 | 34 | 3 | 13 |
| Ulcerative colitis | 5 | 95 | 9 | 1 | 6 |
| Uterine fibroids | 1 | 7 | 1 | 0 | 0 |
| Venous thromboembolism | 1 | 7 | 3 | 0 | 2 |
| Vitiligo | 4 | 25 | 8 | 1 | 2 |
| Mean | 19.90 | 29.18 | 24.00 | 0.26 | 7.09 |
*For each disease, the number of GWAS reported genes that are also drug targets for the disease.
**For each disease, the number of GWAS reported genes that are drug targets for any disease.
Comparison of common non-synonymous SNP densities between GWAS reported genes and drug targets
| Drug Targets | GWAS reported genes | HGMD genes | All genes | |
|---|---|---|---|---|
| Density of all non-synonymous SNPs | 0.0155 | 0.0171 | 0.0166 | 0.0171 |
| Density of Common non-synonymous SNPs | 0.00169 | 0.00221 | 0.00179 | 0.00214 |
1P-value for Mann-Whitney test against the density of common non-synonymous SNPs for GWAS reported genes.2P-value for Mann-Whitney test against the density of common non-synonymous SNPs for all genes.
dN/dS analysis for GWAS reported genes and drug targets
| Number of genes | Mean dN/dS | P Value for Mann-Whitney test against all genes | P Value for Mann-Whitney test against GWAS reported genes | ||
|---|---|---|---|---|---|
| Human-Mouse orthologs | All genes | 13691 | 0.22 | ||
| GWAS reported genes | 2932 | 0.19 | 2.44e-09* | ||
| Drug targets | 1035 | 0.18 | 1.21e-04* | 0.43 | |
| Drug targets with known mechanism | 432 | 0.17 | 6.04e-06* | 0.038* | |
| HGMD genes | 720 | 0.20 | 1.0 | ||
| Human-Chimpanzee orthologs | All genes | 14173 | 0.44 | ||
| GWAS reported genes | 2911 | 0.36 | 1.26e-13* | ||
| Drug targets | 1020 | 0.33 | 2.78e-13* | 0.0098* | |
| Drug targets with known mechanism | 423 | 0.32 | 4.20e-08* | 0.013* | |
| HGMD genes | 699 | 0.36 | 0.002* |
"*" denotes significant, i.e, P < 0.05
Figure 1Distribution of the log longest transcript length for different types of genes. GWAS genes are on average substantially longer than drug target genes, and longer than the set of all genes.
Figure 2Continuous network substructure formed by 43 of the 74 GWAS (green) and 16 of the 18 drug targets (red) for Type 1 Diabetes, allowing not more than one intermediate gene (grey). GWAS and drug target genes are intermingled in the network, and short paths are sufficient to form a connected network for almost all genes. FI network, figure from Cytoscape.
Figure 3A. Distribution of shortest distances to the nearest drug target for GWAS reported genes and all genes. B. Distribution of the shortest distance to the nearest GWAS genes for drug targets and all genes. C. Distribution of degree for drug targets and all genes in the FI network. Drug targets have a slightly higher degree (Mann-Whitley test P = 0.014).
Machine learning results for different diseases, using a Random Forest.
| Disease | GWAS genes | Drug targets | True Positive | False Positive | Precision | Recall | ROC area | F-Measure |
|---|---|---|---|---|---|---|---|---|
| Ankylosing spondylitis | 17 | 29(24) | 0.36 | 0.123 | 0.074 | 0.36 | 0.73 | 0.123 |
| Menopause | 24 | 15(14) | 0.571 | 0.098 | 0.082 | 0.571 | 0.819 | 0.143 |
| Multiple sclerosis | 126 | 30(28) | 0.393 | 0.052 | 0.19 | 0.393 | 0.75 | 0.256 |
| Myocardial infarction | 14 | 44(40) | 0.175 | 0.135 | 0.055 | 0.175 | 0.571 | 0.084 |
| Nephropathy/Nephrotic syndrome | 26 | 38(35) | 0.371 | 0.245 | 0.056 | 0.371 | 0.576 | 0.097 |
| Obesity | 40 | 11(11) | 0.273 | 0.098 | 0.032 | 0.273 | 0.724 | 0.058 |
| Osteoporosis | 10 | 10(10) | 0.2 | 0.189 | 0.011 | 0.2 | 0.546 | 0.022 |
| Pancreatic cancer | 29 | 11(6) | 0.167 | 0.1 | 0.011 | 0.167 | 0.611 | 0.02 |
| Panic disorder | 10 | 18(16) | 0.438 | 0.118 | 0.061 | 0.438 | 0.754 | 0.107 |
| Parkinson's disease | 62 | 184(132) | 0.606 | 0.226 | 0.307 | 0.606 | 0.712 | 0.407 |
| Asthma | 43 | 52(47) | 0.213 | 0.102 | 0.1 | 0.213 | 0.713 | 0.136 |
| Prostate cancer | 95 | 21(18) | 0.5 | 0.073 | 0.118 | 0.5 | 0.686 | 0.191 |
| Psoriasis/Psoriatic arthritis | 31 | 39(36) | 0.5 | 0.076 | 0.209 | 0.5 | 0.852 | 0.295 |
| Rheumatoid arthritis | 67 | 80(68) | 0.324 | 0.131 | 0.163 | 0.324 | 0.677 | 0.217 |
| Type 1 diabetes | 76 | 18(16) | 0.25 | 0.104 | 0.04 | 0.25 | 0.631 | 0.07 |
| Type 2 diabetes | 92 | 34(28) | 0.214 | 0.116 | 0.054 | 0.214 | 0.595 | 0.086 |
| Bipolar disorder/Schizophrenia | 217 | 110(81) | 0.593 | 0.15 | 0.273 | 0.593 | 0.744 | 0.374 |
| Blood pressure/Hypertension | 101 | 114(102) | 0.412 | 0.143 | 0.261 | 0.412 | 0.717 | 0.319 |
| Breast cancer | 43 | 43(38) | 0.289 | 0.072 | 0.147 | 0.289 | 0.745 | 0.195 |
| Chronic lymphocytic leukemia | 17 | 29(26) | 0.423 | 0.098 | 0.11 | 0.423 | 0.653 | 0.175 |
| Colorectal cancer | 14 | 16(16) | 0.25 | 0.154 | 0.028 | 0.25 | 0.53 | 0.05 |
| Acute lymphoblastic leukemia | 19 | 10(10) | 0.7 | 0.07 | 0.097 | 0.7 | 0.889 | 0.171 |
| Crohn's disease | 139 | 23(22) | 0.455 | 0.093 | 0.105 | 0.455 | 0.764 | 0.171 |
| Depression/Depressive disorder | 68 | 73(62) | 0.597 | 0.172 | 0.198 | 0.597 | 0.722 | 0.297 |
| Diabetes | 209 | 59(51) | 0.216 | 0.081 | 0.134 | 0.216 | 0.712 | 0.165 |
| Allergic rhinitis | 11 | 20(19) | 0.263 | 0.128 | 0.041 | 0.263 | 0.589 | 0.071 |
| Glaucoma | 14 | 31(25) | 0.16 | 0.189 | 0.023 | 0.16 | 0.443 | 0.04 |
| Alzheimer's disease | 54 | 179(125) | 0.544 | 0.178 | 0.321 | 0.544 | 0.69 | 0.404 |
| Heart failure | 16 | 65(54) | 0.481 | 0.222 | 0.118 | 0.481 | 0.655 | 0.189 |
| HIV/AIDS | 63 | 53(34) | 0.353 | 0.121 | 0.099 | 0.353 | 0.715 | 0.155 |
| Kawasaki disease | 20 | 11(10) | 0.7 | 0.027 | 0.219 | 0.7 | 0.919 | 0.333 |
Top 'false positive' drug targets for acute lymphoblastic leukemia.
| Target | Description from Refseq | Random Forest Probability |
|---|---|---|
| MAPK3 | The protein encoded by this gene is a member of the MAP kinase family. MAP kinases, also known as extracellular signal-regulated kinases (ERKs), act in a signaling cascade that regulates various cellular processes such as proliferation, differentiation, and cell cycle progression in response to a variety of extracellular signals. | 1 |
| PIK3R1 | Phosphatidylinositol 3-kinase plays an important role in the metabolic actions of insulin, and a mutation in this gene has been associated with insulin resistance. | 0.96 |
| RAF1 | v-raf-1 murine leukemia viral oncogene homolog 1 | 0.96 |
| EGFR | Mutations in this gene are associated with lung cancer. Multiple alternatively spliced transcript variants that encode different protein isoforms have been found for this gene | 0.96 |
| FGFR2 | Mutations in this gene are associated with Crouzon syndrome, Pfeiffer syndrome, Craniosynostosis, Apert syndrome, Jackson-Weiss syndrome, Beare-Stevenson cutis gyrata syndrome, Saethre-Chotzen syndrome, and syndromic craniosynostosis. | 0.96 |
| KDR | This receptor, known as kinase insert domain receptor, is a type III receptor tyrosine kinase. Mutations of this gene are implicated in infantile capillary hemangiomas. | 0.94 |
| FLT1 | This protein binds to VEGFR-A, VEGFR-B and placental growth factor and plays an important role in angiogenesis and vasculogenesis. | 0.94 |
| FGFR1 | Chromosomal aberrations involving this gene are associated with stem cell myeloproliferative disorder and stem cell leukemia lymphoma syndrome. | 0.94 |
| IL2RG | The protein encoded by this gene is an important signaling component of many interleukin receptors | 0.92 |
| ERBB2 | v-erb-b2 erythroblastic leukemia viral oncogene homolog 2, neuro/glioblastoma derived oncogene homolog | 0.92 |
| FGFR3 | This particular family member binds acidic and basic fibroblast growth hormone and plays a role in bone development and maintenance. Mutations in this gene lead to craniosynostosis and multiple types of skeletal dysplasia. | 0.9 |
| AKT1 | v-akt murine thymoma viral oncogene homolog 1 | 0.9 |
| INSR | insulin receptor | 0.9 |
| IL2RA | Mutations in this gene are associated with interleukin 2 receptor alpha deficiency. | 0.9 |
| SDC2 | The syndecan-2 protein functions as an integral membrane protein and participates in cell proliferation, cell migration and cell-matrix interactions via its receptor for extracellular matrix proteins. Altered syndecan-2 expression has been detected in several different tumor types. | 0.88 |
| MAPK1 | The protein encoded by this gene is a member of the MAP kinase family. MAP kinases, also known as extracellular signal-regulated kinases (ERKs), act as an integration point for multiple biochemical signals, and are involved in a wide variety of cellular processes such as proliferation, differentiation, transcription regulation and development. | 0.86 |
| CD247 | The protein encoded by this gene is T-cell receptor zeta, which together with T-cell receptor alpha/beta and gamma/delta heterodimers, and with CD3-gamma, -delta and -epsilon, forms the T-cell receptor-CD3 complex. | 0.86 |
| RET | ret proto-oncogene | 0.86 |
| VEGFA | vascular endothelial growth factor A | 0.86 |
| PTPN1 | protein tyrosine phosphatase, non-receptor type 1 | 0.86 |
| IL3RA | The protein encoded by this gene is an interleukin 3 specific subunit of a heterodimeric cytokine receptor. | 0.84 |
| HDAC1 | histone deacetylase 1, Together with metastasis-associated protein-2, it deacetylates p53 and modulates its effect on cell growth and apoptosis. | 0.82 |
| CCND1 | The protein encoded by this gene belongs to the highly conserved cyclin family, whose members are characterized by a dramatic periodicity in protein abundance throughout the cell cycle. This protein has been shown to interact with tumor suppressor protein Rb and the expression of this gene is regulated positively by Rb. Mutations, amplification and overexpression of this gene, which alters cell cycle progression, are observed frequently in a variety of tumors and may contribute to tumorigenesis | 0.82 |
| FASN | fatty acid synthase | 0.82 |
| CD4 | The protein functions to initiate or augment the early phase of T-cell activation, and may function as an important mediator of indirect neuronal damage in infectious and immune-mediated diseases of the central nervous system. | 0.8 |