Literature DB >> 27467152

Text Mining for Drugs and Chemical Compounds: Methods, Tools and Applications.

Miguel Vazquez1, Martin Krallinger1, Florian Leitner1, Alfonso Valencia2.   

Abstract

Providing prior knowledge about biological properties of chemicals, such as kinetic values, protein targets, or toxic effects, can facilitate many aspects of drug development. Chemical information is rapidly accumulating in all sorts of free text documents like patents, industry reports, or scientific articles, which has motivated the development of specifically tailored text mining applications. Despite the potential gains, chemical text mining still faces significant challenges. One of the most salient is the recognition of chemical entities mentioned in text. To help practitioners contribute to this area, a good portion of this review is devoted to this issue, and presents the basic concepts and principles underlying the main strategies. The technical details are introduced and accompanied by relevant bibliographic references. Other tasks discussed are retrieving relevant articles, identifying relationships between chemicals and other entities, or determining the chemical structures of chemicals mentioned in text. This review also introduces a number of published applications that can be used to build pipelines in topics like drug side effects, toxicity, and protein-disease-compound network analysis. We conclude the review with an outlook on how we expect the field to evolve, discussing its possibilities and its current limitations.
Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Entities:  

Keywords:  Chemical compounds; Drugs; Information extraction; Named entity recognition; Text mining

Year:  2011        PMID: 27467152     DOI: 10.1002/minf.201100005

Source DB:  PubMed          Journal:  Mol Inform        ISSN: 1868-1743            Impact factor:   3.353


  21 in total

1.  Wide-coverage relation extraction from MEDLINE using deep syntax.

Authors:  Nhung T H Nguyen; Makoto Miwa; Yoshimasa Tsuruoka; Takashi Chikayama; Satoshi Tojo
Journal:  BMC Bioinformatics       Date:  2015-04-01       Impact factor: 3.169

2.  LeadMine: a grammar and dictionary driven approach to entity recognition.

Authors:  Daniel M Lowe; Roger A Sayle
Journal:  J Cheminform       Date:  2015-01-19       Impact factor: 5.514

3.  TaggerOne: joint named entity recognition and normalization with semi-Markov Models.

Authors:  Robert Leaman; Zhiyong Lu
Journal:  Bioinformatics       Date:  2016-06-09       Impact factor: 6.937

4.  Exploring sets of molecules from patents and relationships to other active compounds in chemical space networks.

Authors:  Ryo Kunimoto; Jürgen Bajorath
Journal:  J Comput Aided Mol Des       Date:  2017-09-04       Impact factor: 3.686

5.  A Relation-Oriented Model With Global Context Information for Joint Extraction of Overlapping Relations and Entities.

Authors:  Huihui Han; Jian Wang; Xiaowen Wang
Journal:  Front Neurorobot       Date:  2022-07-04       Impact factor: 3.493

6.  GeneDive: A gene interaction search and visualization tool to facilitate precision medicine.

Authors:  Paul Previde; Brook Thomas; Mike Wong; Emily K Mallory; Dragutin Petkovic; Russ B Altman; Anagha Kulkarni
Journal:  Pac Symp Biocomput       Date:  2018

7.  Enhancement of chemical entity identification in text using semantic similarity validation.

Authors:  Tiago Grego; Francisco M Couto
Journal:  PLoS One       Date:  2013-05-02       Impact factor: 3.240

8.  Consistency of systematic chemical identifiers within and between small-molecule databases.

Authors:  Saber A Akhondi; Jan A Kors; Sorel Muresan
Journal:  J Cheminform       Date:  2012-12-13       Impact factor: 5.514

9.  Search and visualization of gene-drug-disease interactions for pharmacogenomics and precision medicine research using GeneDive.

Authors:  Mike Wong; Paul Previde; Jack Cole; Brook Thomas; Nayana Laxmeshwar; Emily Mallory; Jake Lever; Dragutin Petkovic; Russ B Altman; Anagha Kulkarni
Journal:  J Biomed Inform       Date:  2021-03-16       Impact factor: 8.000

10.  Ambiguity of non-systematic chemical identifiers within and between small-molecule databases.

Authors:  Saber A Akhondi; Sorel Muresan; Antony J Williams; Jan A Kors
Journal:  J Cheminform       Date:  2015-11-16       Impact factor: 5.514

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.