Literature DB >> 16287934

Automatic assignment of biomedical categories: toward a generic approach.

Patrick Ruch1.   

Abstract

MOTIVATION: We report on the development of a generic text categorization system designed to automatically assign biomedical categories to any input text. Unlike usual automatic text categorization systems, which rely on data-intensive models extracted from large sets of training data, our categorizer is largely data-independent.
METHODS: In order to evaluate the robustness of our approach we test the system on two different biomedical terminologies: the Medical Subject Headings (MeSH) and the Gene Ontology (GO). Our lightweight categorizer, based on two ranking modules, combines a pattern matcher and a vector space retrieval engine, and uses both stems and linguistically-motivated indexing units. RESULTS AND
CONCLUSION: Results show the effectiveness of phrase indexing for both GO and MeSH categorization, but we observe the categorization power of the tool depends on the controlled vocabulary: precision at high ranks ranges from above 90% for MeSH to <20% for GO, establishing a new baseline for categorizers based on retrieval methods.

Mesh:

Substances:

Year:  2005        PMID: 16287934     DOI: 10.1093/bioinformatics/bti783

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  35 in total

1.  A bottom-up approach to MEDLINE indexing recommendations.

Authors:  Antonio Jimeno-Yepes; Bartłomiej Wilkowski; James G Mork; Elizabeth Van Lenten; Dina Demner Fushman; Alan R Aronson
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

2.  Biomedical ontologies in action: role in knowledge management, data integration and decision support.

Authors:  O Bodenreider
Journal:  Yearb Med Inform       Date:  2008

3.  From episodes of care to diagnosis codes: automatic text categorization for medico-economic encoding.

Authors:  Patrick Ruch; Julien Gobeilla; Imad Tbahritia; Antoine Geissbühlera
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

4.  Combining evidence, specificity, and proximity towards the normalization of Gene Ontology terms in text.

Authors:  S Gaudan; A Jimeno Yepes; V Lee; D Rebholz-Schuhmann
Journal:  EURASIP J Bioinform Syst Biol       Date:  2008

5.  Optimal training sets for Bayesian prediction of MeSH assignment.

Authors:  Sunghwan Sohn; Won Kim; Donald C Comeau; W John Wilbur
Journal:  J Am Med Inform Assoc       Date:  2008-04-24       Impact factor: 4.497

6.  Predicting MeSH Beyond MEDLINE.

Authors:  Adam K Kehoe; Vetle I Torvik; Matthew B Ross; Neil R Smalheiser
Journal:  Proc 1st Workshop Sch Web Min (2017)       Date:  2017-02

Review 7.  Analyzing Medical Image Search Behavior: Semantics and Prediction of Query Results.

Authors:  Maria De-Arteaga; Ivan Eggel; Charles E Kahn; Henning Müller
Journal:  J Digit Imaging       Date:  2015-10       Impact factor: 4.056

8.  Comparison and combination of several MeSH indexing approaches.

Authors:  Antonio Jose Jimeno Yepes; James G Mork; Dina Demner-Fushman; Alan R Aronson
Journal:  AMIA Annu Symp Proc       Date:  2013-11-16

9.  Comparing a Rule Based vs. Statistical System for Automatic Categorization of MEDLINE Documents According to Biomedical Specialty.

Authors:  Susanne M Humphrey; Aurélie Névéol; Julien Gobeil; Patrick Ruch; Stéfan J Darmoni; Allen Browne
Journal:  J Am Soc Inf Sci Technol       Date:  2009-12-01

10.  Social tagging in the life sciences: characterizing a new metadata resource for bioinformatics.

Authors:  Benjamin M Good; Joseph T Tennis; Mark D Wilkinson
Journal:  BMC Bioinformatics       Date:  2009-09-25       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.