Literature DB >> 18229721

Comparing usability of matching techniques for normalising biomedical named entities.

Xinglong Wang1, Michael Matthews.   

Abstract

String matching plays an important role in biomedical Term Normalisation, the task of linking mentions of biomedical entities to identifiers in reference databases. This paper evaluates exact, rule-based and various string-similarity-based matching techniques. The matchers are compared in two ways: first, we measure precision and recall against a gold-standard dataset and second, we integrate the matchers into a curation tool and measure gains in curation speed when they were used to assist a curator in normalising protein and tissue entities. The evaluation shows that a rule-based matcher works better on the gold-standard data, while a string-similarity based system and exact string matcher win out on improving curation efficiency.

Mesh:

Year:  2008        PMID: 18229721

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  3 in total

1.  TRANSLATING BIOLOGY: TEXT MINING TOOLS THAT WORK.

Authors:  K Bretonnel Cohen; Hong Yu; Philip E Bourne; Lynette Hirschman
Journal:  Pac Symp Biocomput       Date:  2008-01-01

Review 2.  What the papers say: text mining for genomics and systems biology.

Authors:  Nathan Harmston; Wendy Filsell; Michael P H Stumpf
Journal:  Hum Genomics       Date:  2010-10       Impact factor: 4.639

3.  Distinguishing the species of biomedical named entities for term identification.

Authors:  Xinglong Wang; Michael Matthews
Journal:  BMC Bioinformatics       Date:  2008-11-19       Impact factor: 3.169

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.