Literature DB >> 14764616

A frequency-based technique to improve the spelling suggestion rank in medical queries.

Jonathan Crowell1, Qing Zeng, Long Ngo, Eve-Marie Lacroix.   

Abstract

OBJECTIVE: There is an abundance of health-related information online, and millions of consumers search for such information. Spell checking is of crucial importance in returning pertinent results, so the authors propose a technique for increasing the effectiveness of spell-checking tools used for health-related information retrieval.
DESIGN: A sample of incorrectly spelled medical terms was submitted to two different spell-checking tools, and the resulting suggestions, derived under two different dictionary configurations, were re-sorted according to how frequently each term appeared in log data from a medical search engine. MEASUREMENTS: Univariable analysis was carried out to assess the effect of each factor (spell-checking tool, dictionary type, re-sort, or no re-sort) on the probability of success. The factors that were statistically significant in the univariable analysis were then used in multivariable analysis to evaluate the independent effect of each of the factors.
RESULTS: The re-sorted suggestions proved to be significantly more accurate than the original list returned by the spell-checking tool. The odds of finding the correct suggestion in the number one rank were increased by 63% after re-sorting using the authors' method. This effect was independent of both the dictionary and the spell-checking tools that were used.
CONCLUSION: Using knowledge about the frequency of a given word's occurrence in the medical domain can significantly improve spelling correction for medical queries.

Mesh:

Year:  2004        PMID: 14764616      PMCID: PMC400516          DOI: 10.1197/jamia.M1474

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  3 in total

1.  MEDLINEplus: building and maintaining the National Library of Medicine's consumer health Web service.

Authors:  N Miller; E M Lacroix; J E Backus
Journal:  Bull Med Libr Assoc       Date:  2000-01

2.  Characteristics of consumer terminology for health information retrieval.

Authors:  Q Zeng; S Kogan; N Ash; R A Greenes; A A Boxwala
Journal:  Methods Inf Med       Date:  2002       Impact factor: 2.176

3.  A technique to improve the spelling suggestion rank in medical queries.

Authors:  Jonathan B Crowell; Qing T Zeng; Sandra Kogan
Journal:  AMIA Annu Symp Proc       Date:  2003
  3 in total
  12 in total

1.  Query log analysis of an electronic health record search engine.

Authors:  Lei Yang; Qiaozhu Mei; Kai Zheng; David A Hanauer
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

2.  caTIES: a grid based system for coding and retrieval of surgical pathology reports and tissue specimens in support of translational research.

Authors:  Rebecca S Crowley; Melissa Castine; Kevin Mitchell; Girish Chavan; Tara McSherry; Michael Feldman
Journal:  J Am Med Inform Assoc       Date:  2010 May-Jun       Impact factor: 4.497

3.  Evaluation of a method to identify and categorize section headers in clinical documents.

Authors:  Joshua C Denny; Anderson Spickard; Kevin B Johnson; Neeraja B Peterson; Josh F Peterson; Randolph A Miller
Journal:  J Am Med Inform Assoc       Date:  2009-08-28       Impact factor: 4.497

4.  Spell checker for consumer language (CSpell).

Authors:  Chris J Lu; Alan R Aronson; Sonya E Shooshan; Dina Demner-Fushman
Journal:  J Am Med Inform Assoc       Date:  2019-03-01       Impact factor: 4.497

5.  Interactive use of online health resources: a comparison of consumer and professional questions.

Authors:  Kirk Roberts; Dina Demner-Fushman
Journal:  J Am Med Inform Assoc       Date:  2016-05-04       Impact factor: 4.497

6.  An Ensemble Method for Spelling Correction in Consumer Health Questions.

Authors:  Halil Kilicoglu; Marcelo Fiszman; Kirk Roberts; Dina Demner-Fushman
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

7.  Automatic classification of scanned electronic health record documents.

Authors:  Heath Goodrum; Kirk Roberts; Elmer V Bernstam
Journal:  Int J Med Inform       Date:  2020-10-17       Impact factor: 4.046

8.  Predicting emergency department orders with multilabel machine learning techniques and simulating effects on length of stay.

Authors:  Haley S Hunter-Zinck; Jordan S Peck; Tania D Strout; Stephan A Gaehde
Journal:  J Am Med Inform Assoc       Date:  2019-12-01       Impact factor: 4.497

9.  Identification of pneumonia and influenza deaths using the Death Certificate Pipeline.

Authors:  Kailah Davis; Catherine Staes; Jeff Duncan; Sean Igo; Julio C Facelli
Journal:  BMC Med Inform Decis Mak       Date:  2012-05-08       Impact factor: 2.796

10.  Matching health information seekers' queries to medical terms.

Authors:  Lina F Soualmia; Elise Prieur-Gaston; Zied Moalla; Thierry Lecroq; Stéfan J Darmoni
Journal:  BMC Bioinformatics       Date:  2012-09-07       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.