Literature DB >> 12603047

A biological named entity recognizer.

Meenakshi Narayanaswamy1, K E Ravikumar, K Vijay-Shanker.   

Abstract

In this paper we describe a new named entity extraction system. Our system is based on a manually developed set of rules that rely heavily upon some crucial lexical information, linguistic constraints of English, and contextual information. This system achieves state of art results in the protein name detection task, which is what many of the current name extraction systems do. We discuss the need for detection of chemical names and show that we not only obtain a high degree of success in recognizing chemicals but that this task can help improve the precision of protein name detection as well. We use context and surrounding words for categorization of named entities and find the results obtained are encouraging.

Mesh:

Substances:

Year:  2003        PMID: 12603047     DOI: 10.1142/9789812776303_0040

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  21 in total

1.  A simple and practical dictionary-based approach for identification of proteins in Medline abstracts.

Authors:  Sergei Egorov; Anton Yuryev; Nikolai Daraselia
Journal:  J Am Med Inform Assoc       Date:  2004-02-05       Impact factor: 4.497

2.  NLProt: extracting protein names and sequences from papers.

Authors:  Sven Mika; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

3.  BioTagger-GM: a gene/protein name recognition system.

Authors:  Manabu Torii; Zhangzhi Hu; Cathy H Wu; Hongfang Liu
Journal:  J Am Med Inform Assoc       Date:  2008-12-11       Impact factor: 4.497

4.  RLIMS-P 2.0: A Generalizable Rule-Based Information Extraction System for Literature Mining of Protein Phosphorylation Information.

Authors:  Manabu Torii; Cecilia N Arighi; Gang Li; Qinghua Wang; Cathy H Wu; K Vijay-Shanker
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2015 Jan-Feb       Impact factor: 3.710

5.  How Do Your Biomedical Named Entity Recognition Models Generalize to Novel Entities?

Authors:  Hyunjae Kim; Jaewoo Kang
Journal:  IEEE Access       Date:  2022-03-08       Impact factor: 3.476

6.  Towards pathway curation through literature mining--a case study using PharmGKB.

Authors:  K E Ravikumar; Kavishwar B Wagholikar; Hongfang Liu
Journal:  Pac Symp Biocomput       Date:  2014

7.  Detection of IUPAC and IUPAC-like chemical names.

Authors:  Roman Klinger; Corinna Kolárik; Juliane Fluck; Martin Hofmann-Apitius; Christoph M Friedrich
Journal:  Bioinformatics       Date:  2008-07-01       Impact factor: 6.937

8.  Identifying gene and protein mentions in text using conditional random fields.

Authors:  Ryan McDonald; Fernando Pereira
Journal:  BMC Bioinformatics       Date:  2005-05-24       Impact factor: 3.169

9.  A scalable machine-learning approach to recognize chemical names within large text databases.

Authors:  Jonathan D Wren
Journal:  BMC Bioinformatics       Date:  2006-09-06       Impact factor: 3.169

10.  Automatically annotating documents with normalized gene lists.

Authors:  Jeremiah Crim; Ryan McDonald; Fernando Pereira
Journal:  BMC Bioinformatics       Date:  2005-05-24       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.