Literature DB >> 22500000

ChemSpot: a hybrid system for chemical named entity recognition.

Tim Rocktäschel1, Michael Weidlich, Ulf Leser.   

Abstract

MOTIVATION: The accurate identification of chemicals in text is important for many applications, including computer-assisted reconstruction of metabolic networks or retrieval of information about substances in drug development. But due to the diversity of naming conventions and traditions for such molecules, this task is highly complex and should be supported by computational tools.
RESULTS: We present ChemSpot, a named entity recognition (NER) tool for identifying mentions of chemicals in natural language texts, including trivial names, drugs, abbreviations, molecular formulas and International Union of Pure and Applied Chemistry entities. Since the different classes of relevant entities have rather different naming characteristics, ChemSpot uses a hybrid approach combining a Conditional Random Field with a dictionary. It achieves an F(1) measure of 68.1% on the SCAI corpus, outperforming the only other freely available chemical NER tool, OSCAR4, by 10.8 percentage points. AVAILABILITY: ChemSpot is freely available at: http://www.informatik.hu-berlin.de/wbi/resources.

Mesh:

Substances:

Year:  2012        PMID: 22500000     DOI: 10.1093/bioinformatics/bts183

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  59 in total

1.  Beyond accuracy: creating interoperable and scalable text-mining web services.

Authors:  Chih-Hsuan Wei; Robert Leaman; Zhiyong Lu
Journal:  Bioinformatics       Date:  2016-02-16       Impact factor: 6.937

2.  Putting hands to rest: efficient deep CNN-RNN architecture for chemical named entity recognition with no hand-crafted rules.

Authors:  Ilia Korvigo; Maxim Holmatov; Anatolii Zaikovskii; Mikhail Skoblov
Journal:  J Cheminform       Date:  2018-05-23       Impact factor: 5.514

3.  SimConcept: A Hybrid Approach for Simplifying Composite Named Entities in Biomedicine.

Authors:  Chih-Hsuan Wei; Robert Leaman; Zhiyong Lu
Journal:  ACM BCB       Date:  2014

4.  SimConcept: a hybrid approach for simplifying composite named entities in biomedical text.

Authors:  Chih-Hsuan Wei; Robert Leaman; Zhiyong Lu
Journal:  IEEE J Biomed Health Inform       Date:  2015-04-13       Impact factor: 5.772

5.  GLASS: a comprehensive database for experimentally validated GPCR-ligand associations.

Authors:  Wallace K B Chan; Hongjiu Zhang; Jianyi Yang; Jeffrey R Brender; Junguk Hur; Arzucan Özgür; Yang Zhang
Journal:  Bioinformatics       Date:  2015-05-13       Impact factor: 6.937

6.  Enhancing of chemical compound and drug name recognition using representative tag scheme and fine-grained tokenization.

Authors:  Hong-Jie Dai; Po-Ting Lai; Yung-Chun Chang; Richard Tzong-Han Tsai
Journal:  J Cheminform       Date:  2015-01-19       Impact factor: 5.514

7.  tmChem: a high performance approach for chemical named entity recognition and normalization.

Authors:  Robert Leaman; Chih-Hsuan Wei; Zhiyong Lu
Journal:  J Cheminform       Date:  2015-01-19       Impact factor: 5.514

8.  A comparison of conditional random fields and structured support vector machines for chemical entity recognition in biomedical literature.

Authors:  Buzhou Tang; Yudong Feng; Xiaolong Wang; Yonghui Wu; Yaoyun Zhang; Min Jiang; Jingqi Wang; Hua Xu
Journal:  J Cheminform       Date:  2015-01-19       Impact factor: 5.514

9.  LeadMine: a grammar and dictionary driven approach to entity recognition.

Authors:  Daniel M Lowe; Roger A Sayle
Journal:  J Cheminform       Date:  2015-01-19       Impact factor: 5.514

10.  TaggerOne: joint named entity recognition and normalization with semi-Markov Models.

Authors:  Robert Leaman; Zhiyong Lu
Journal:  Bioinformatics       Date:  2016-06-09       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.