| Literature DB >> 28848388 |
Jörn Lötsch1,2, Catharina Lippmann2, Dario Kringel1, Alfred Ultsch3.
Abstract
Genes causally involved in human insensitivity to pain provide a unique molecular source of studying the pathophysiology of pain and the development of novel analgesic drugs. The increasing availability of "big data" enables novel research approaches to chronic pain while also requiring novel techniques for data mining and knowledge discovery. We used machine learning to combine the knowledge about n = 20 genes causally involved in human hereditary insensitivity to pain with the knowledge about the functions of thousands of genes. An integrated computational analysis proposed that among the functions of this set of genes, the processes related to nervous system development and to ceramide and sphingosine signaling pathways are particularly important. This is in line with earlier suggestions to use these pathways as therapeutic target in pain. Following identification of the biological processes characterizing hereditary insensitivity to pain, the biological processes were used for a similarity analysis with the functions of n = 4,834 database-queried drugs. Using emergent self-organizing maps, a cluster of n = 22 drugs was identified sharing important functional features with hereditary insensitivity to pain. Several members of this cluster had been implicated in pain in preclinical experiments. Thus, the present concept of machine-learned knowledge discovery for pain research provides biologically plausible results and seems to be suitable for drug discovery by identifying a narrow choice of repurposing candidates, demonstrating that contemporary machine-learned methods offer innovative approaches to knowledge discovery from available evidence.Entities:
Keywords: big data; computational biology; data science; genetic variation; humans; machine learning; pain; perception
Year: 2017 PMID: 28848388 PMCID: PMC5550731 DOI: 10.3389/fnmol.2017.00252
Source DB: PubMed Journal: Front Mol Neurosci ISSN: 1662-5099 Impact factor: 5.639
A set of n = 20 genes, in alphabetical order, that were reported to be causally associated with the hereditary human phenotype of insensitivity to pain.
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
|
|
| Atlastin GTPase 1 | Neuropathy, hereditary sensory, type ID | HSN1D |
| Guelly et al., |
|
|
| Atlastin GTPase 3 | Neuropathy, hereditary sensory, type IF | HSN1F |
| Kornak et al., |
|
|
| Clathrin heavy chain like 1 | Insensitivity to Pain with Preserved Temperature sensation |
| Nahorski et al., | |
|
|
| Dynamin 1 like | Encephalopathy, lethal, due to defective mitochondrial peroxisomal fission 1 | EMPF1 |
| Sheffer et al., |
|
|
| Early B-cell factor 3 | Hypotonia, ataxia, and delayed development syndrome | HADDS |
| Chao et al., |
|
|
| Family with sequence similarity 134, member B | Neuropathy, hereditary sensory and autonomic, type IIB | HSAN2B |
| Kurth et al., |
|
|
| Inhibitor of kappa light polypeptide gene enhancer in B-cells, kinase complex-associated protein | Dysautonomia, familial (Riley-Day syndrome) | HSAN III |
| Anderson et al., |
|
|
| Kinesin family member 1A | Hereditary sensory neuropathy type IIC | HSN2C |
| Riviere et al., |
|
|
| Leukemia inhibitory factor receptor alpha | Congenital pain insensitivity phenotype with progressive vertebral destruction | Elsaid et al., | ||
|
|
| MpV17 mitochondrial inner membrane protein | Navajo neurohepatopathy | NNH |
| Karadimas et al., |
|
|
| Nerve growth factor (beta polypeptide) | Neuropathy, hereditary sensory and autonomic, type V | HSAN V |
| Einarsdottir et al., |
|
|
| Neurotrophic tyrosine kinase, receptor, type 1 | Insensitivity to pain, congenital, with anhidrosis | HSAN IV |
| Indo et al., |
|
|
| PR/SET domain 12 | Insensitivity to Pain with hypohidrosis | HSAN VIII |
| Chen et al., |
|
|
| RAB7A, member RAS oncogene family | Charcot-Marie-Tooth type 2B neuropathy | CMT2B |
| Verhoeven et al., |
|
|
| Nav1.9 (sodium voltage-gated channel alpha subunit 11) | Neuropathy, hereditary sensory and autonomic, type VII | HSAN VII |
| Leipold et al., |
|
|
| Nav1.7 (sodium channel, voltage-gated, type IX, alpha subunit 9) | Insensitivity to pain, channelopathy-associated |
| Cox et al., | |
|
|
| Serine palmitoyltransferase, long chain base subunit 1 | Neuropathy, hereditary sensory and autonomic, type IA | HSAN1A |
| Dawkins et al., |
|
|
| Serine palmitoyltransferase, long chain base subunit 2 | Neuropathy, hereditary sensory and autonomic, type IC | HSAN1C |
| Rotthier et al., |
|
|
| Transthyretin | Carpal tunnel syndrome, familial | CTS1 |
| Swoboda et al., |
|
|
| WNK lysine deficient protein kinase 1 | Neuropathy, hereditary sensory and autonomic, type II (Morvan disease) | HSAN2A |
| Lafreniere et al., |
The genes were queried on March 13, 2017 from the “Online Mendelian Inheritance in Man” (OMIM) database at .
HSAN, Hereditary sensory and autonomic neuropathy; HSN, Hereditary sensory neuropathy; OMIM, Online Mendelian Inheritance in Man database.
Novel analgesic drugs developed with the purpose to antagonistically target genes associated with human hereditary insensitivity to pain, i.e., to mimic the pain-insensitivity phenotype observed in carriers of loss-of-function mutations in these genes, being currently in a clinical phase of development according to publicly available sources of information (Table 3).
|
|
|
|
|
|
|---|---|---|---|---|
| NGF | Nerve growth factor (beta polypeptide) | Tanezumab | Antibody | Pfizer |
| MEDI-7352 | Antibody | AstraZeneca | ||
| Fasinumab | Antibody | Regeneron | ||
| CRB-0089 | Antagonist | Rottapharm Biotech | ||
| NTRK1 | Neurotrophic tyrosine kinase, receptor, type 1 | ASP-7962 | Blocker | Astellas Pharma |
| VM-902A | Blocker | Purdue Pharma | ||
| ARRY-954 | Blocker | Array BioPharma | ||
| CRB-0089 | Blocker | Rottapharm Biotech | ||
| FX-007 | Blocker | Flexion Therapeutics | ||
| SCN9A | Nav1.7 (sodium channel, voltage-gated, type IX, alpha subunit 9) | Funapide | Blocker | Xenon Pharmaceuticals |
| CC-8464 | Blocker | Chromocell | ||
| DSP-2230 | Blocker | Sumitomo Dainippon Pharma | ||
| GDC-0310 | Blocker | Genentech/Xenon Pharmaceuticals |
The information was queried on March 31, 2017 from the Thomson Reuters Integrity database at .
Overview on data sources and computational tools used for the present data science approach to drug repurposing from knowledge about the functions of genes related to insensitivity to pain in humans.
|
|
|
| |
|---|---|---|---|
| Gene names and functions | AmiGO (search utility for GO) |
| Carbon et al., |
| Gene Ontology (GO) |
| Ashburner et al., | |
| HUGO Gene Nomenclature Committee |
| Seal et al., | |
| NCBI gene index database |
| ||
| GeneCards |
| Rebhan et al., | |
| Human diseases | Online Mendelian Inheritance in Man (OMIM®) database |
| |
| Drugs | DrugBank database |
| Wishart et al., |
| Thomson Reuters Integrity database |
| Nonfree | |
| Software | R software |
| R Development Core Team, |
All recourses are publically available, most of them free of charge.
Figure 1Scheme of the data analysis workflow. The analyses had two major aims, i.e., (i) assessing the biological functions of genes reportedly associated with insensitivity to pain (upper line) and (ii) using the biological processes in which these genes are involved to find repurposing candidates among DrugBank listed drugs (lower line). To this end, genes were identified in databases and their biological functions were associated based on the annotations in the Gene Ontology database; for the drug target coding genes (>4,000) with a filter for too many irrelevant terms implemented as an overrepresentation analysis. Such filter was not necessary for the only 20 genes associated with insensitivity to pain. However, for the latter, overrepresentation analysis (ORA) followed by functional abstraction was performed to obtain a comprehensible set of >10 biological functions which summarize the biological roles of these genes in an interpretable manner. The information obtained in the ORA of the 20 pain insensitivity genes was used to generate a virtual “pain drug” that was introduced into the “drug vs. biological processes” matrix of all drugs. Subsequently, unsupervised machine learning was used to find data structures among all drugs. Those drugs that, in the high dimensional vector space of associations with GO terms (biological processes), laid near the virtual “PainInsensitivity drug” were the repurposing candidates.
Functional areas representing the genetic background of hereditary insensitivity to pain presented in a polyhierarchy of GO terms with a maximum of certainty, information value, coverage and conciseness (Ultsch and Lötsch, 2014).
|
|
|
|
|---|---|---|
| GO:0033555 | Multicellular organismal response to stress | 2 |
| GO:0070997 | Neuron death | 3 |
| GO:0050877 | Neurological system process | 5 |
| GO:0009991 | Response to extracellular stimulus | 5 |
| GO:0007399 | Nervous system development | 6 |
| GO:0071704 | Organic substance metabolic process | 11 |
| GO:0016043 | Cellular component organization | 13 |
| GO:0065007 | Biological regulation | 17 |
Specifically, significant gene ontology (GO) terms were obtained by means of over-representation analysis (ORA) of the 20 genes against all human genes. The precise definition of the GO terms can be obtained using the AmiGO search tool for GO at .
Figure 23D-view of the U-matrix visualization of distance based structures of the 414 × 38 sized “drug vs. biological process” matrix, comprising the 413 drugs annotated with one or more of the n = 38 biological processes assigned to both, the set of 20 genes causally implicated in insensitivity to pain and the drug targets queried form the DrugBank database, which following ABC analysis based item selection were found in at closer Euclidian distances form the virtual “PainInsensitivity” drug (red dot) that carried all of the n = 38 processes. The U-matrix has been obtained using a projection of the data points onto a toroid grid of 4,000 neurons where opposite edges are connected. The U-Matrix was colored as a geographical map with brown (up to snow-covered) heights and green valleys with blue lakes. Valleys indicate clusters and watersheds indicate borderlines between different clusters. The dots indicate the so-called “best matching units” (BMUs) of the self-organizing map (SOM), which are those neurons whose weight vector is most similar to the input. A single neuron can be the BMU for more than one data point; hence, the number of BMUs may not be equal to the number of drugs. In the vicinity of the red dot, i.e., the virtual “PainInsensitivity” drug, a mount ridge surrounded valley was observed that represented a cluster of drugs (yellow dots) most similar to the virtual “PainInsensitivity” drug. However, the latter was located eccentrically in the cluster indicating that the similarity to any of the DrugBank queried repurposing candidates for pain therapy is incomplete. The other drugs (gray dots) lay outside the cluster of the “PainInsensitivity” drug and could therefore be rejected as repurposing candidates based on the present approach to search for drugs that with respect to the addressed biological processes are most similar to the pattern of biological processes in which the genes associated with insensitivity to pain are involved. The figure has been created using the R software package (version 3.3.3 for Linux; http://CRAN.R-project.org/; R Development Core Team, 2008) using our R library “Umatrix” (M. Thrun, F. Lerch, Marburg, Germany, http://www.uni-marburg.de/fb12/datenbionik/software; file http://www.uni-marburg.de/fb12/datenbionik/umatrix.tar.gz).
Candidate DrugBank listed substances that qualify for repurposing as treatments of pain, according to the similarity between the biological processes associated with the n = 20 genes causally involved in human insensitivity to pain and captured in the virtual “PainInsensitivity” drug, and the biological processes in which the 413 drugs queried from the DrugBank database (Wishart et al., 2006, 2008) are involved.
|
|
|
|
|---|---|---|
| “PainInsensitivity” drug | none | Not applicable (virtual drug) |
| Acetylsalicylic acid | 945 | Approved analgesic |
| Myristic acid | 8231 | Antinociceptive effects (Intahphuak et al., |
| Phosphonoserine | 4522 | – |
| Tamoxifen | 675 | – |
| Adenosine monophosphate | 131 | Its inhibition attenuated pain (Liou et al., |
| Phosphonothreonine | 2482 | – |
| Imatinib | 619 | Restored morphine analgesic potency (Donica et al., |
| Sorafenib | 398 | Might induce pain (Di Cesare Mannelli et al., |
| Flavopiridol | 3496 | Recovery from tactile allodynia (Tsuda et al., |
| Nintedanib | 9079 | – |
| Ellagic acid | 8846 | Antinociceptive effects (Mansouri et al., |
| Dasatinib | 1254 | – |
| Sunitinib | 1268 | Hyperalgesic effects (Bullon et al., |
| Staurosporine | 2010 | Counteracting capsaicin sensitization (Anand et al., |
| Lenvatinib | 9078 | – |
| Pazopanib | 6589 | – |
| MP470 | 5216 | – |
| ABT-869 | 6080 | – |
| XL999 | 5014 | – |
| Yohimbine | 1392 | Antinociceptive effects (Shannon and Lutz, |
| Ponatinib | 8901 | – |
| Caffeine | 201 | Analgesic effects shown and discussed (Baratloo et al., |
The 22 drugs are the members of the cluster in the high dimensional space to which the virtual “PainInsensitivity” drug belonged (Figure .