Literature DB >> 16160337

Refinement of an automatic method for indexing medical literature--a preliminary study.

Michel Joubert1, Anne-Laure Peretti, Joanny Gouvernet, Marius Fieschi.   

Abstract

OBJECTIVES: to rank according to their significance MeSH terms automatically extracted from Internet sites in the framework of a French project, VUMeF, a contribution to the NLM' UMLS project.
MATERIAL AND METHODS: scores are affected to key-words of a given document on the basis of the Semantic Network of the UMLS and frequencies of co-occurring major terms in the Medline literature. If N is the number of major terms of a document, and n is the number of major terms retrieved in the N first terms ranked in descending order according to their scores, the measure of the achievement of the method is n/N.
RESULTS: a set of 1444 randomized documents have been extracted from Medline. For each document we computed the retrieved major terms among the first N terms with two methods: a statistical method using only frequencies given by co-occurrences, and our method that uses furthermore the UMLS semantic network. In 34% of cases corresponding to documents indexed by about 16 key-words, about 3 major terms among them, our method produces a better precision (7%) than the statistical method. DISCUSSION: the rough calculation of the proportion of retrieved major terms should be enhanced by the use of a probability law allowing to enlarge the list of terms to select taking into account both the number of major terms and the total number of key-words used to index each document.

Entities:  

Mesh:

Year:  2005        PMID: 16160337

Source DB:  PubMed          Journal:  Stud Health Technol Inform        ISSN: 0926-9630


  1 in total

1.  Contribution to an automated indexing of French-language health web sites.

Authors:  Michel Joubert; Anne-Laure Peretti; Stefan Darmoni; Badisse Dahamna; Marius Fieschi
Journal:  AMIA Annu Symp Proc       Date:  2006
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.