Literature DB >> 25759063

Toward automated classification of consumers' cancer-related questions with a new taxonomy of expected answer types.

Susan McRoy1, Sean Jones2, Adam Kurmally2.   

Abstract

This article examines methods for automated question classification applied to cancer-related questions that people have asked on the web. This work is part of a broader effort to provide automated question answering for health education. We created a new corpus of consumer-health questions related to cancer and a new taxonomy for those questions. We then compared the effectiveness of different statistical methods for developing classifiers, including weighted classification and resampling. Basic methods for building classifiers were limited by the high variability in the natural distribution of questions and typical refinement approaches of feature selection and merging categories achieved only small improvements to classifier accuracy. Best performance was achieved using weighted classification and resampling methods, the latter yielding an accuracy of F1 = 0.963. Thus, it would appear that statistical classifiers can be trained on natural data, but only if natural distributions of classes are smoothed. Such classifiers would be useful for automated question answering, for enriching web-based content, or assisting clinical professionals to answer questions.
© The Author(s) 2015.

Entities:  

Keywords:  automated question answering; question classification

Mesh:

Year:  2015        PMID: 25759063     DOI: 10.1177/1460458215571643

Source DB:  PubMed          Journal:  Health Informatics J        ISSN: 1460-4582            Impact factor:   2.681


  4 in total

1.  Resource Classification for Medical Questions.

Authors:  Kirk Roberts; Laritza Rodriguez; Sonya E Shooshan; Dina Demner-Fushman
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

2.  Detecting clinically related content in online patient posts.

Authors:  Courtland VanDam; Shaheen Kanthawala; Wanda Pratt; Joyce Chai; Jina Huh
Journal:  J Biomed Inform       Date:  2017-10-03       Impact factor: 6.317

3.  Qcorp: an annotated classification corpus of Chinese health questions.

Authors:  Haihong Guo; Xu Na; Jiao Li
Journal:  BMC Med Inform Decis Mak       Date:  2018-03-22       Impact factor: 2.796

4.  Assessing Unmet Information Needs of Breast Cancer Survivors: Exploratory Study of Online Health Forums Using Text Classification and Retrieval.

Authors:  Susan McRoy; Majid Rastegar-Mojarad; Yanshan Wang; Kathryn J Ruddy; Tufia C Haddad; Hongfang Liu
Journal:  JMIR Cancer       Date:  2018-05-15
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.