Literature DB >> 12463872

Free-text medical document retrieval via phrase-based vector space model.

Wenlei Mao1, Wesley W Chu.   

Abstract

Many information retrieval systems are based on vector space model (VSM) that represents a document as a vector of index terms. Concepts have been proposed to replace word stems as the index terms to improve retrieval accuracy. However, past research revealed that such systems did not outperform the traditional stem-based systems. Incorporating conceptual similarity derived from knowledge sources should have the potential to improve retrieval accuracy. Yet the incompleteness of the knowledge source precludes significant improvement. To remedy this problem, we propose to represent documents using phrases. A phrase consists of multiple concepts and word stems. The similarity between two phrases is jointly determined by their conceptual similarity and their common word stems. The document similarity can in turn be derived from phrase similarities. Using OHSUMED as a test collection and UMLS as the knowledge source, our experiment results reveal that phrase-based VSM yields a 16% increase of retrieval accuracy compared to the stem-based model.

Mesh:

Year:  2002        PMID: 12463872      PMCID: PMC2244442     

Source DB:  PubMed          Journal:  Proc AMIA Symp        ISSN: 1531-605X


  7 in total

1.  Determining word sequence variation patterns in clinical documents using multiple sequence alignment.

Authors:  Frank Meng; Craig A Morioka; Suzie El-Saden
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

2.  A unified architecture for biomedical search engines based on semantic web technologies.

Authors:  Vahid Jalali; Mohammad Reza Matash Borujerdi
Journal:  J Med Syst       Date:  2009-08-25       Impact factor: 4.460

3.  Comparison of vector space model methodologies to reconcile cross-species neuroanatomical concepts.

Authors:  P R Srinivas; Shang-Heng Wei; Nello Cristianini; E G Jones; F A Gorin
Journal:  Neuroinformatics       Date:  2005

4.  The effect of feature representation on MEDLINE document classification.

Authors:  Meliha Yetisgen-Yildiz; Wanda Pratt
Journal:  AMIA Annu Symp Proc       Date:  2005

5.  A comparison of Intelligent Mapper and document similarity scores for mapping local radiology terms to LOINC.

Authors:  Daniel J Vreeman; Clement J McDonald
Journal:  AMIA Annu Symp Proc       Date:  2006

6.  Discovering semantic features in the literature: a foundation for building functional associations.

Authors:  Monica Chagoyen; Pedro Carmona-Saez; Hagit Shatkay; Jose M Carazo; Alberto Pascual-Montano
Journal:  BMC Bioinformatics       Date:  2006-01-26       Impact factor: 3.169

7.  A Systematic Framework for Analyzing Patient-Generated Narrative Data: Protocol for a Content Analysis.

Authors:  Maryam Zolnoori; Joyce E Balls-Berry; Tabetha A Brockman; Christi A Patten; Ming Huang; Lixia Yao
Journal:  JMIR Res Protoc       Date:  2019-08-26
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.