Literature DB >> 18693843

Classification of health webpages as expert and non expert with a reduced set of cross-language features.

Natalia Grabar1, Sonia Krivine, Marie-Christine Jaulent.   

Abstract

Making the distinction between expert and non expert health documents can help users to select the information which is more suitable for them, according to whether they are familiar or not with medical terminology. This issue is particularly important for the information retrieval area. In our work we address this purpose through stylistic corpus analysis and the application of machine learning algorithms. Our hypothesis is that this distinction can be performed on the basis of a small number of features and that such features can be language and domain independent. The used features were acquired in source corpus (Russian language, diabetes topic) and then tested on target (French language, pneumology topic) and source corpora. These cross-language features show 90% precision and 93% recall with non expert documents in source language; and 85% precision and 74% recall with expert documents in target language.

Entities:  

Mesh:

Year:  2007        PMID: 18693843      PMCID: PMC2655811     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  7 in total

1.  Building a text corpus for representing the variety of medical language.

Authors:  P Zweigenbaum; P Jacquemart; N Grabar; B Habert
Journal:  Stud Health Technol Inform       Date:  2001

2.  Filtering for medical news items using a machine learning approach.

Authors:  Wanhong Zheng; Evangelos Milios; Carolyn Watters
Journal:  Proc AMIA Symp       Date:  2002

Review 3.  Promoting health literacy.

Authors:  Alexa T McCray
Journal:  J Am Med Inform Assoc       Date:  2004-11-23       Impact factor: 4.497

4.  Using argumentation to extract key sentences from biomedical abstracts.

Authors:  Patrick Ruch; Celia Boyer; Christine Chichester; Imad Tbahriti; Antoine Geissbühler; Paul Fabry; Julien Gobeill; Violaine Pillet; Dietrich Rebholz-Schuhmann; Christian Lovis; Anne-Lise Veuthey
Journal:  Int J Med Inform       Date:  2006-07-11       Impact factor: 4.046

5.  A language classifier that automatically divides medical documents for experts and health care consumers.

Authors:  Michael Poprat; Kornél Markó; Udo Hahn
Journal:  Stud Health Technol Inform       Date:  2006

6.  A new readability yardstick.

Authors:  R FLESCH
Journal:  J Appl Psychol       Date:  1948-06

7.  Health literacy: report of the Council on Scientific Affairs. Ad Hoc Committee on Health Literacy for the Council on Scientific Affairs, American Medical Association.

Authors: 
Journal:  JAMA       Date:  1999-02-10       Impact factor: 56.272

  7 in total
  2 in total

1.  Paraphrase acquisition from comparable medical corpora of specialized and lay texts.

Authors:  Louise Deléger; Pierre Zweigenbaum
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

2.  Cross-topic learning for work prioritization in systematic review creation and update.

Authors:  Aaron M Cohen; Kyle Ambert; Marian McDonagh
Journal:  J Am Med Inform Assoc       Date:  2009-06-30       Impact factor: 4.497

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.