Literature DB >> 15694638

Assisting medical annotation in Swiss-Prot using statistical classifiers.

Pavel B Dobrokhotov1, Cyril Goutte, Anne-Lise Veuthey, Eric Gaussier.   

Abstract

Bio-medical knowledge bases are valuable resources for the research community. Original scientific publications are the main source used to annotate them. Medical annotation in Swiss-Prot is specifically targeted at finding and extracting data about human genetic diseases and polymorphisms. Curators have to scan through hundreds of publications to select the relevant ones. This workload can be greatly reduced by using bio-text mining techniques. Using a combination of natural language processing (NLP) techniques and statistical classifiers, we achieve recall points of up to 84% on the potentially interesting documents and a precision of more than 96% in detecting irrelevant documents. Careful analysis of the document pre-processing chain allows us to measure the impact of some steps on the overall result, as well as test different classifier configurations. The best combination was used to create a prototype of a search and classification tool that is currently tested by the database curators.

Entities:  

Mesh:

Year:  2005        PMID: 15694638     DOI: 10.1016/j.ijmedinf.2004.04.017

Source DB:  PubMed          Journal:  Int J Med Inform        ISSN: 1386-5056            Impact factor:   4.046


  2 in total

1.  Reducing workload in systematic review preparation using automated citation classification.

Authors:  A M Cohen; W R Hersh; K Peterson; Po-Yin Yen
Journal:  J Am Med Inform Assoc       Date:  2005-12-15       Impact factor: 4.497

2.  Measuring the impact of screening automation on meta-analyses of diagnostic test accuracy.

Authors:  Christopher R Norman; Mariska M G Leeflang; Raphaël Porcher; Aurélie Névéol
Journal:  Syst Rev       Date:  2019-10-28
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.