| Literature DB >> 29350398 |
Jörn Lötsch1,2, Dario Kringel1.
Abstract
The novel research area of functional genomics investigates biochemical, cellular, or physiological properties of gene products with the goal of understanding the relationship between the genome and the phenotype. These developments have made analgesic drug research a data-rich discipline mastered only by making use of parallel developments in computer science, including the establishment of knowledge bases, mining methods for big data, machine-learning, and artificial intelligence, (Table ) which will be exemplarily introduced in the following.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29350398 PMCID: PMC6001421 DOI: 10.1002/cpt.960
Source DB: PubMed Journal: Clin Pharmacol Ther ISSN: 0009-9236 Impact factor: 6.875
Overview on data sources and computational tools used for the present data science approach to analgesic drug repurposing from knowledge about the functions of genes related to insensitivity to pain in humans
| Site name | URL | |
|---|---|---|
| Gene names and functions | AmiGO (search utility for GO) |
|
| Human Pain Genes Database |
| |
| Gene Ontology (GO) |
| |
| HUGO Gene Nomenclature Committee |
| |
| NCBI gene index database |
| |
| GeneCards |
| |
| Diseases genes | Pain Genes Database |
|
| Online Mendelian Inheritance in Man (OMIM) database |
| |
| Drugs | DrugBank database |
|
| Thomson Reuters Integrity database |
| |
| Registry and results database of federally and privately supported clinical trials |
| |
| Reported biomedical evidence | PubMed database |
|
| Software | Gene Trail |
|
| R software |
|
All recourses except one (Thomson Reuters Integrity database) are publicly available, most of them free of charge. They were accessed on October 8, 2017.
Figure 1Structure found using unsupervised machine‐learning in the high‐dimensional data space of the analgesic drug (n = 79) vs. computational functional genomics based biological processes (d = 928) matrix. Left: The so‐called U‐matrix displays the result of a projection of the drug vs. biological process interaction matrix onto a toroid neuronal grid where opposite edges are connected. The projection was obtained using a parameter‐free polar swarm, Pswarm, consisting of so‐called DataBots, which are self‐organizing artificial “life forms” that carry vectors of the biological processes associated with the drugs via their genetic targets. During the learning phase, the DataBots were allowed to adaptively adjust their location on the grid close to DataBots, according to the Jaccard distance, carrying data with similar features, with a successively decreasing search radius. When the algorithm ends, the DataBots become projected points. To enhance the emergence of data structures on this projection, a generalized U‐matrix displaying the distance in the high‐dimensional space was added as a third dimension to this visualization. The U‐matrix was colored in hypsometric colors making the visualization appear as a geographical map with brown heights and green valleys with blue lakes. Watersheds indicate borderlines between different groups of analgesic drugs. In the present visualization, a curved “mountain range” in the “north–south” direction (marked with a light blue dotted line) separates two main clusters of drugs. These clusters completely coincided with the prior classification of analgesics into opioids and nonopioids subjects according to the pattern of repeated cold pain measurements. The data points are colored according to the emerging two‐cluster structure. Right: Ward clustering of the projected data clearly also indicated two clusters, supporting the machine results. The figure was created using the R software package (v. 3.4.2 for Linux; http://CRAN.R-project.org/), in particular the libraries “DatabionicSwarm” (M. Thrun, https://cran.r-project.org/package=DatabionicSwarm). The figure reproduces results of a previous analysis of the same data matrix; however, using a different machine‐learning method for nonredundancy.