Literature DB >> 26958208

An Ensemble Method for Spelling Correction in Consumer Health Questions.

Halil Kilicoglu1, Marcelo Fiszman1, Kirk Roberts1, Dina Demner-Fushman1.   

Abstract

Orthographic and grammatical errors are a common feature of informal texts written by lay people. Health-related questions asked by consumers are a case in point. Automatic interpretation of consumer health questions is hampered by such errors. In this paper, we propose a method that combines techniques based on edit distance and frequency counts with a contextual similarity-based method for detecting and correcting orthographic errors, including misspellings, word breaks, and punctuation errors. We evaluate our method on a set of spell-corrected questions extracted from the NLM collection of consumer health questions. Our method achieves a F1 score of 0.61, compared to an informed baseline of 0.29, achieved using ESpell, a spelling correction system developed for biomedical queries. Our results show that orthographic similarity is most relevant in spelling error correction in consumer health questions and that frequency and contextual information are complementary to orthographic features.

Entities:  

Mesh:

Year:  2015        PMID: 26958208      PMCID: PMC4765565     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  5 in total

1.  A frequency-based technique to improve the spelling suggestion rank in medical queries.

Authors:  Jonathan Crowell; Qing Zeng; Long Ngo; Eve-Marie Lacroix
Journal:  J Am Med Inform Assoc       Date:  2004-02-05       Impact factor: 4.497

2.  Using lexical disambiguation and named-entity recognition to improve spelling correction in the electronic patient record.

Authors:  Patrick Ruch; Robert Baud; Antoine Geissbühler
Journal:  Artif Intell Med       Date:  2003 Sep-Oct       Impact factor: 5.326

3.  SPELLING CORRECTION IN THE PUBMED SEARCH ENGINE.

Authors:  W John Wilbur; Won Kim; Natalie Xie
Journal:  Inf Retr Boston       Date:  2006-11       Impact factor: 2.293

4.  Automatically classifying question types for consumer health questions.

Authors:  Kirk Roberts; Halil Kilicoglu; Marcelo Fiszman; Dina Demner-Fushman
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

5.  The Unified Medical Language System.

Authors:  D A Lindberg; B L Humphreys; A T McCray
Journal:  Methods Inf Med       Date:  1993-08       Impact factor: 2.176

  5 in total
  11 in total

1.  Resource and Response Type Classification for Consumer Health Question Answering.

Authors:  William R Kearns; Jason A Thomas
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

2.  Resource Classification for Medical Questions.

Authors:  Kirk Roberts; Laritza Rodriguez; Sonya E Shooshan; Dina Demner-Fushman
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

3.  Combining Open-domain and Biomedical Knowledge for Topic Recognition in Consumer Health Questions.

Authors:  Yassine Mrabet; Halil Kilicoglu; Kirk Roberts; Dina Demner-Fushman
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

Review 4.  Aspiring to Unintended Consequences of Natural Language Processing: A Review of Recent Developments in Clinical and Consumer-Generated Text Processing.

Authors:  D Demner-Fushman; N Elhadad
Journal:  Yearb Med Inform       Date:  2016-11-10

5.  Spell checker for consumer language (CSpell).

Authors:  Chris J Lu; Alan R Aronson; Sonya E Shooshan; Dina Demner-Fushman
Journal:  J Am Med Inform Assoc       Date:  2019-03-01       Impact factor: 4.497

6.  Interactive use of online health resources: a comparison of consumer and professional questions.

Authors:  Kirk Roberts; Dina Demner-Fushman
Journal:  J Am Med Inform Assoc       Date:  2016-05-04       Impact factor: 4.497

7.  Automatic classification of scanned electronic health record documents.

Authors:  Heath Goodrum; Kirk Roberts; Elmer V Bernstam
Journal:  Int J Med Inform       Date:  2020-10-17       Impact factor: 4.046

8.  Automated Misspelling Detection and Correction in Persian Clinical Text.

Authors:  Azita Yazdani; Marjan Ghazisaeedi; Nasrin Ahmadinejad; Masoumeh Giti; Habibe Amjadi; Azin Nahvijou
Journal:  J Digit Imaging       Date:  2020-06       Impact factor: 4.056

9.  Consumer health information and question answering: helping consumers find answers to their health-related information needs.

Authors:  Dina Demner-Fushman; Yassine Mrabet; Asma Ben Abacha
Journal:  J Am Med Inform Assoc       Date:  2020-02-01       Impact factor: 4.497

10.  Deep Learning from EEG Reports for Inferring Underspecified Information.

Authors:  Travis R Goodwin; Sanda M Harabagiu
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2017-07-26
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.