BACKGROUND: To facilitate information retrieval in the biomedical domain, a system for the automatic assignment of Medical Subject Headings to documents curated by an online quality-controlled health gateway was implemented. The French Multi-Terminology Indexer (F-MTI) implements a multiterminology approach using nine main medical terminologies in French and the mappings between them. OBJECTIVE: This paper presents recent efforts to assess the added value of (a) integrating four new terminologies (Orphanet, ATC, drug names, MeSH supplementary concepts) into F-MTI's knowledge sources and (b) performing the automatic indexing on the titles and abstracts (vs. title only) of the online health resources. METHODS: F-MTI was evaluated on a CISMeF corpus comprising 18,161 manually indexed resources. RESULTS: The performance of F-MTI including nine health terminologies on CISMeF resources with Title only was 27.9% precision and 19.7% recall, while the performance on CISMeF resources with Title and Abstract is 14.9 % precision (-13.0%) and 25.9% recall (+6.2%). CONCLUSION: In a few weeks, CISMeF will launch the indexing of resources based on title and abstract, using nine terminologies.
BACKGROUND: To facilitate information retrieval in the biomedical domain, a system for the automatic assignment of Medical Subject Headings to documents curated by an online quality-controlled health gateway was implemented. The French Multi-Terminology Indexer (F-MTI) implements a multiterminology approach using nine main medical terminologies in French and the mappings between them. OBJECTIVE: This paper presents recent efforts to assess the added value of (a) integrating four new terminologies (Orphanet, ATC, drug names, MeSH supplementary concepts) into F-MTI's knowledge sources and (b) performing the automatic indexing on the titles and abstracts (vs. title only) of the online health resources. METHODS: F-MTI was evaluated on a CISMeF corpus comprising 18,161 manually indexed resources. RESULTS: The performance of F-MTI including nine health terminologies on CISMeF resources with Title only was 27.9% precision and 19.7% recall, while the performance on CISMeF resources with Title and Abstract is 14.9 % precision (-13.0%) and 25.9% recall (+6.2%). CONCLUSION: In a few weeks, CISMeF will launch the indexing of resources based on title and abstract, using nine terminologies.
Authors: Louise Deléger; Tayeb Merabti; Thierry Lecrocq; Michel Joubert; Pierre Zweigenbaum; Stéfan Darmoni Journal: AMIA Annu Symp Proc Date: 2010-11-13