
Text classification for assisting moderators in online health communities.

Jina Huh, Meliha Yetisgen-Yildiz, Wanda Pratt.

Abstract

OBJECTIVES: Patients increasingly visit online health communities to get help managing their health. The large scale of these online communities makes it impossible for the moderators to engage in all conversations; yet, some conversations need their expertise. Our work explores applying low-cost text classification methods to the new domain of determining whether a thread in an online health forum needs moderators' help.
METHODS: We employed a binary classifier on WebMD's online diabetes community data. To train the classifier, we considered three feature types: (1) word unigrams, (2) sentiment analysis features, and (3) thread length. We applied feature selection methods based on χ² statistics and undersampling to account for unbalanced data. We then performed a qualitative error analysis to investigate the appropriateness of the gold standard.
RESULTS: Using sentiment analysis features, feature selection methods, and balanced training data increased the AUC value up to 0.75 and the F1-score up to 0.54, compared with the baseline of using word unigrams with no feature selection methods on unbalanced data (0.65 AUC and 0.40 F1-score). The error analysis uncovered additional reasons why moderators respond to patients' posts.
DISCUSSION: We showed how feature selection methods and balanced training data can improve the overall classification performance. We present implications of weighing precision versus recall for assisting moderators of online health communities. Our error analysis uncovered social, legal, and ethical issues around addressing community members' needs. We also note challenges in producing a gold standard, and discuss potential solutions for addressing these challenges.
CONCLUSION: Social media environments provide popular venues in which patients gain health-related information. Our work contributes to understanding scalable solutions for providing moderators' expertise in these large-scale, social media environments.
Copyright © 2013 Elsevier Inc. All rights reserved.
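
The abstract's methods (χ²-based feature selection over word unigrams, undersampling for unbalanced data, and F1 as an evaluation measure) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the toy threads, the label convention (1 = thread needs a moderator), and the function names `chi2_scores`, `undersample`, and `f1_score` are all assumptions made for illustration.

```python
import random
from collections import Counter

def chi2_scores(docs, labels):
    """Chi-squared score of each unigram's presence vs. a binary label (2x2 table)."""
    n = len(docs)
    n_pos = sum(labels)
    n_neg = n - n_pos
    present_any, present_pos = Counter(), Counter()
    for text, y in zip(docs, labels):
        for w in set(text.lower().split()):   # presence/absence, not counts
            present_any[w] += 1
            if y == 1:
                present_pos[w] += 1
    scores = {}
    for w, df in present_any.items():
        a = present_pos[w]   # positive threads containing w
        b = df - a           # negative threads containing w
        c = n_pos - a        # positive threads without w
        d = n_neg - b        # negative threads without w
        denom = (a + b) * (c + d) * (a + c) * (b + d)
        scores[w] = n * (a * d - b * c) ** 2 / denom if denom else 0.0
    return scores

def undersample(docs, labels, seed=0):
    """Randomly drop majority-class examples until both classes are the same size."""
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    minority, majority = (pos, neg) if len(pos) <= len(neg) else (neg, pos)
    keep = sorted(minority + rng.sample(majority, len(minority)))
    return [docs[i] for i in keep], [labels[i] for i in keep]

def f1_score(y_true, y_pred):
    """Harmonic mean of precision and recall for the positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0

# Hypothetical toy threads (not from the WebMD data); 1 = needs a moderator.
docs = [
    "my blood sugar dropped what dose should I take",
    "is this insulin dose safe",
    "thanks for the recipe",
    "happy birthday everyone",
    "great recipe thanks",
    "good morning everyone",
]
labels = [1, 1, 0, 0, 0, 0]

scores = chi2_scores(docs, labels)
top_words = sorted(scores, key=scores.get, reverse=True)[:5]  # keep top-k unigrams
balanced_docs, balanced_labels = undersample(docs, labels)
print(top_words, balanced_labels)
```

In this sketch, feature selection would keep only the highest-scoring unigrams before training, and undersampling is applied to the training split only, so that evaluation still reflects the original class imbalance. Which classifier is trained on the selected features is left open here.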

Keywords:  Consumer health; Health information seeking; Human–computer interaction; Online health communities; Text mining

Year:  2013        PMID: 24025513      PMCID: PMC3874858          DOI: 10.1016/j.jbi.2013.08.011

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


