Ranking documents with a thesaurus.

R Rada, E Bicknell.   

Abstract

This article reports on exploratory experiments in evaluating and improving a thesaurus by studying its effect on retrieval. A formula called DISTANCE was developed to measure the conceptual distance between queries and documents encoded as sets of thesaurus terms. DISTANCE references MeSH (Medical Subject Headings) and assesses the degree of match between a MeSH-encoded query and a MeSH-encoded document. The performance of DISTANCE on MeSH is compared with the performance of people in assessing conceptual distance between queries and documents, and is found to simulate human performance with surprising accuracy. The power of the computer simulation stems both from the tendency of people to rely heavily on broader-than (BT) relations when making decisions about conceptual distance and from the thousands of accurate BT relations in MeSH. One source of discrepancy between the algorithm's and people's measurements of closeness between query and document is occasional inconsistency in the BT relations. Our experiments with adding non-BT relations to MeSH showed how these non-BT relations could improve document ranking, if DISTANCE were also appropriately revised to treat these relations differently from BT relations.
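
The abstract does not state the DISTANCE formula, so the following is only an illustrative sketch of the general idea: ranking documents by a path-based distance over BT relations. It assumes, as a stand-in, that the distance between two term sets is the average shortest-path length over all query-term and document-term pairs. The toy hierarchy, the term names, and the functions `build_graph`, `path_length`, `set_distance`, and `rank_documents` are hypothetical and are not taken from the article or from MeSH.

```python
from collections import deque
from itertools import product

# Hypothetical toy hierarchy (NOT real MeSH data): term -> its broader terms (BT).
BROADER = {
    "Diseases": [],
    "Heart Diseases": ["Diseases"],
    "Lung Diseases": ["Diseases"],
    "Myocardial Infarction": ["Heart Diseases"],
    "Pneumonia": ["Lung Diseases"],
}


def build_graph(broader):
    """Turn BT relations into an undirected adjacency map."""
    graph = {term: set() for term in broader}
    for term, parents in broader.items():
        for parent in parents:
            graph.setdefault(parent, set()).add(term)
            graph[term].add(parent)
    return graph


def path_length(graph, start, goal):
    """Number of BT edges on the shortest path between two terms (BFS)."""
    if start == goal:
        return 0
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        node, dist = queue.popleft()
        for neighbor in graph[node]:
            if neighbor == goal:
                return dist + 1
            if neighbor not in seen:
                seen.add(neighbor)
                queue.append((neighbor, dist + 1))
    raise ValueError(f"no path between {start!r} and {goal!r}")


def set_distance(graph, query_terms, doc_terms):
    """Assumed stand-in for DISTANCE: mean pairwise shortest-path length."""
    pairs = list(product(query_terms, doc_terms))
    return sum(path_length(graph, q, d) for q, d in pairs) / len(pairs)


def rank_documents(graph, query_terms, documents):
    """Return document names ordered from conceptually closest to farthest."""
    return sorted(
        documents,
        key=lambda name: set_distance(graph, query_terms, documents[name]),
    )
```

Under these assumptions, a query encoded as "Myocardial Infarction" would rank a document indexed under "Heart Diseases" ahead of one indexed under "Pneumonia", because the former is fewer BT edges away. Treating non-BT relations differently, as the abstract suggests, would require per-relation edge weights, which this sketch does not attempt.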

Year:  1989        PMID: 10303917     DOI: 10.1002/(SICI)1097-4571(198909)40:5<304::AID-ASI2>3.0.CO;2-6

Source DB:  PubMed          Journal:  J Am Soc Inf Sci        ISSN: 0002-8231


  5 in total

1.  A unified architecture for biomedical search engines based on semantic web technologies.

Authors:  Vahid Jalali; Mohammad Reza Matash Borujerdi
Journal:  J Med Syst       Date:  2009-08-25       Impact factor: 4.460

2.  Journal notes.

Authors:  W K Beatty
Journal:  Bull Med Libr Assoc       Date:  1990-04

3.  A performance and failure analysis of SAPHIRE with a MEDLINE test collection.

Authors:  W R Hersh; D H Hickam; R B Haynes; K A McKibbon
Journal:  J Am Med Inform Assoc       Date:  1994 Jan-Feb       Impact factor: 4.497

4.  Information theory applied to the sparse gene ontology annotation network to predict novel gene function.

Authors:  Ying Tao; Lee Sam; Jianrong Li; Carol Friedman; Yves A Lussier
Journal:  Bioinformatics       Date:  2007-07-01       Impact factor: 6.937

5.  A transversal approach to predict gene product networks from ontology-based similarity.

Authors:  Julie Chabalier; Jean Mosser; Anita Burgun
Journal:  BMC Bioinformatics       Date:  2007-07-02       Impact factor: 3.169
