Literature DB >> 17646305

Identification of new drug classification terms in textual resources.

Corinna Kolárik1, Martin Hofmann-Apitius, Marc Zimmermann, Juliane Fluck.   

Abstract

UNLABELLED: Knowledge about biological effects of small molecules helps in the understanding of biological processes and supports the development of new therapeutic agents. DrugBank is a high quality database providing such information about drugs that contains annotation of drug effects and classification of therapeutic effects. However, to broaden the scope of such a database in classifying and annotating drugs, systems for automatic extraction of classification terms and the corresponding annotation of drugs are needed. We have developed an approach for the identification of new terms used in unstructured text that provide information about drug properties. It is based on the identification and extraction of phrases corresponding to lexico-syntactic patterns--so-called Hearst patterns that contain drug names and directly related drug annotation terms. Such phrases could be identified with a high performance in DrugBank text (0.89 F-score) and in Medline abstracts (0.83 F-score). In comparison to DrugBank annotation terminology, a huge amount of new drug annotation terms could be found. The evaluation of terms extracted from Medline showed that 29-53% of them are new valid drug property terms. They could be assigned to existing and new drug property classes not provided by the DrugBank drug annotation. We come to the conclusion that our system can support database content update by providing additionally drug descriptions of pharmacological effects not yet found in databases like DrugBank. Moreover, we propose that automatic normalization of terms improves the annotation and the retrieval of relevant database entries. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Mesh:

Substances:

Year:  2007        PMID: 17646305     DOI: 10.1093/bioinformatics/btm196

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  14 in total

1.  Silver threads.

Authors:  Wendy A Warr
Journal:  J Comput Aided Mol Des       Date:  2011-12-09       Impact factor: 3.686

2.  Linguistic approach for identification of medication names and related information in clinical narratives.

Authors:  Thierry Hamon; Natalia Grabar
Journal:  J Am Med Inform Assoc       Date:  2010 Sep-Oct       Impact factor: 4.497

3.  Biomedical ontologies in action: role in knowledge management, data integration and decision support.

Authors:  O Bodenreider
Journal:  Yearb Med Inform       Date:  2008

Review 4.  Recent progress in automatically extracting information from the pharmacogenomic literature.

Authors:  Yael Garten; Adrien Coulet; Russ B Altman
Journal:  Pharmacogenomics       Date:  2010-10       Impact factor: 2.533

5.  Mining the pharmacogenomics literature--a survey of the state of the art.

Authors:  Udo Hahn; K Bretonnel Cohen; Yael Garten; Nigam H Shah
Journal:  Brief Bioinform       Date:  2012-07       Impact factor: 11.622

6.  Automated annotation of chemical names in the literature with tunable accuracy.

Authors:  Jun D Zhang; Lewis Y Geer; Evan E Bolton; Stephen H Bryant
Journal:  J Cheminform       Date:  2011-11-22       Impact factor: 5.514

7.  Automatic vs. manual curation of a multi-source chemical dictionary: the impact on text mining.

Authors:  Kristina M Hettne; Antony J Williams; Erik M van Mulligen; Jos Kleinjans; Valery Tkachenko; Jan A Kors
Journal:  J Cheminform       Date:  2010-03-23       Impact factor: 5.514

8.  Detection of IUPAC and IUPAC-like chemical names.

Authors:  Roman Klinger; Corinna Kolárik; Juliane Fluck; Martin Hofmann-Apitius; Christoph M Friedrich
Journal:  Bioinformatics       Date:  2008-07-01       Impact factor: 6.937

9.  Extracting Drug Names and Associated Attributes From Discharge Summaries: Text Mining Study.

Authors:  Ghada Alfattni; Maksim Belousov; Niels Peek; Goran Nenadic
Journal:  JMIR Med Inform       Date:  2021-05-05

10.  Using workflows to explore and optimise named entity recognition for chemistry.

Authors:  Balakrishna Kolluru; Lezan Hawizy; Peter Murray-Rust; Junichi Tsujii; Sophia Ananiadou
Journal:  PLoS One       Date:  2011-05-25       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.