| Literature DB >> 27295093 |
Nikolas Papanikolaou1, Georgios A Pavlopoulos1, Theodosios Theodosiou1, Ioannis S Vizirianakis2, Ioannis Iliopoulos3.
Abstract
BACKGROUND: Text mining and data integration methods are gaining ground in the field of health sciences due to the exponential growth of bio-medical literature and information stored in biological databases. While such methods mostly try to extract bioentity associations from PubMed, very few of them are dedicated in mining other types of repositories such as chemical databases.Entities:
Keywords: Chemicals; Data integration; Document clustering; Drug associations; Knowledge discovery; Name entity recognition; Text mining
Mesh:
Substances:
Year: 2016 PMID: 27295093 PMCID: PMC4905607 DOI: 10.1186/s12859-016-1041-6
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1DrugQuest’s workflow. a Queries to DrugBank and retrieval of records related to the query. b DrugBank record mining based on textual information such as: description, toxicology and pharmacology. c Name Entity Recognition techniques to identify genes/proteins, chemicals, diseases, pathways. d TextQuest algorithm to identify non tagged Significant Terms. e Partitional clustering of DrugBank records using various clustering algorithms and similarity measures. f Visual representation of results: Left: Tag cloud example of highly representative terms per cluster. Right: DrugBank records assigned to clusters
Fig. 2Aspirin Example. Tag Cloud view for term “aspirin” related query. Cluster 1: tags focusing on the anticoagulant blood effects of aspirin in related diseases including other anticoagulant drug classes along with analgesic, antipyretic and anti-inflammatory activities of aspirin. Cluster 2: tags refer to combination therapy of aspirin with other pharmacological classes of drugs. Cluster 3: tags propose combination therapy of aspirin with other analgesic drugs for the relief of pain in severe conditions. Cluster 4: tags point to a specific disease where aspirin is included in the therapeutic protocol, e.g. heart diseases
Fig. 3SSRIs Example. Tag Cloud view for drugs “citalopram”, “fluoxetine”, “paroxetine” and “sertraline”. Orange: common tags characterizing the class of SSRIs (in terms of pharmacological effects, ADRs, and/or clinical uses). Blue: tags related to specific pharmacological, chemical or clinical properties of each individual drug appearing in each respective cluster