Literature DB >> 28359728

Enriching consumer health vocabulary through mining a social Q&A site: A similarity-based approach.

Zhe He1, Zhiwei Chen2, Sanghee Oh3, Jinghui Hou4, Jiang Bian5.   

Abstract

The widely known vocabulary gap between health consumers and healthcare professionals hinders information seeking and health dialogue of consumers on end-user health applications. The Open Access and Collaborative Consumer Health Vocabulary (OAC CHV), which contains health-related terms used by lay consumers, has been created to bridge such a gap. Specifically, the OAC CHV facilitates consumers' health information retrieval by enabling consumer-facing health applications to translate between professional language and consumer friendly language. To keep up with the constantly evolving medical knowledge and language use, new terms need to be identified and added to the OAC CHV. User-generated content on social media, including social question and answer (social Q&A) sites, afford us an enormous opportunity in mining consumer health terms. Existing methods of identifying new consumer terms from text typically use ad-hoc lexical syntactic patterns and human review. Our study extends an existing method by extracting n-grams from a social Q&A textual corpus and representing them with a rich set of contextual and syntactic features. Using K-means clustering, our method, simiTerm, was able to identify terms that are both contextually and syntactically similar to the existing OAC CHV terms. We tested our method on social Q&A corpora on two disease domains: diabetes and cancer. Our method outperformed three baseline ranking methods. A post-hoc qualitative evaluation by human experts further validated that our method can effectively identify meaningful new consumer terms on social Q&A.
Copyright © 2017 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Consumer health information; Consumer health vocabulary; Controlled vocabularies; Ontology enrichment; Social Q&A

Mesh:

Year:  2017        PMID: 28359728      PMCID: PMC5488691          DOI: 10.1016/j.jbi.2017.03.016

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  22 in total

1.  ICD-9-CM coding for physician billing.

Authors:  R Finnegan
Journal:  J Am Med Rec Assoc       Date:  1989-02

2.  Reformulation of consumer health queries with professional terminology: a pilot study.

Authors:  Robert M Plovnick; Qing T Zeng
Journal:  J Med Internet Res       Date:  2004-09-03       Impact factor: 5.428

3.  Assessing the readability of ClinicalTrials.gov.

Authors:  Danny T Y Wu; David A Hanauer; Qiaozhu Mei; Patricia M Clark; Lawrence C An; Joshua Proulx; Qing T Zeng; V G Vinod Vydiswaran; Kevyn Collins-Thompson; Kai Zheng
Journal:  J Am Med Inform Assoc       Date:  2015-08-11       Impact factor: 4.497

4.  Using Web 2.0 technologies to enhance evidence-based medical information.

Authors:  Miriam J Metzger; Andrew J Flanagin
Journal:  J Health Commun       Date:  2011

5.  The readiness of SNOMED problem list concepts for meaningful use of electronic health records.

Authors:  Ankur Agrawal; Zhe He; Yehoshua Perl; Duo Wei; Michael Halper; Gai Elhanan; Yan Chen
Journal:  Artif Intell Med       Date:  2013-04-18       Impact factor: 5.326

6.  High-quality, standard, controlled healthcare terminologies come of age.

Authors:  J J Cimino
Journal:  Methods Inf Med       Date:  2011-03-17       Impact factor: 2.176

7.  Computer-assisted update of a consumer health vocabulary through mining of social network data.

Authors:  Kristina M Doing-Harris; Qing Zeng-Treitler
Journal:  J Med Internet Res       Date:  2011-05-17       Impact factor: 5.428

8.  Term identification methods for consumer health vocabulary development.

Authors:  Qing T Zeng; Tony Tse; Guy Divita; Alla Keselman; Jon Crowell; Allen C Browne; Sergey Goryachev; Long Ngo
Journal:  J Med Internet Res       Date:  2007-02-28       Impact factor: 5.428

9.  Consumers' Use of UMLS Concepts on Social Media: Diabetes-Related Textual Data Analysis in Blog and Social Q&A Sites.

Authors:  Min Sook Park; Zhe He; Zhiwei Chen; Sanghee Oh; Jiang Bian
Journal:  JMIR Med Inform       Date:  2016-11-24

10.  Identifying medical terms in patient-authored text: a crowdsourcing-based approach.

Authors:  Diana Lynn MacLean; Jeffrey Heer
Journal:  J Am Med Inform Assoc       Date:  2013-05-05       Impact factor: 4.497

View more
  11 in total

1.  Validating UMLS Semantic Type Assignments Using SNOMED CT Semantic Tags.

Authors:  Huanying Gu; Zhe He; Duo Wei; Gai Elhanan; Yan Chen
Journal:  Methods Inf Med       Date:  2018-04-05       Impact factor: 2.176

2.  Understanding Patient Information Needs About Their Clinical Laboratory Results: A Study of Social Q&A Site.

Authors:  Zhan Zhang; Yu Lu; Yubo Kou; Danny T Y Wu; Jina Huh-Yoo; Zhe He
Journal:  Stud Health Technol Inform       Date:  2019-08-21

3.  Extended Analysis of Topological-Pattern-Based Ontology Enrichment.

Authors:  Zhe He; Vipina Kuttichi Keloth; Yan Chen; James Geller
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2019-01-24

4.  An automated method to enrich consumer health vocabularies using GloVe word embeddings and an auxiliary lexical resource.

Authors:  Mohammed Ibrahim; Susan Gauch; Omar Salman; Mohammed Alqahtani
Journal:  PeerJ Comput Sci       Date:  2021-08-09

5.  Ambiguity in medical concept normalization: An analysis of types and coverage in electronic health record datasets.

Authors:  Denis Newman-Griffis; Guy Divita; Bart Desmet; Ayah Zirikly; Carolyn P Rosé; Eric Fosler-Lussier
Journal:  J Am Med Inform Assoc       Date:  2021-03-01       Impact factor: 4.497

6.  Utilizing a multi-class classification approach to detect therapeutic and recreational misuse of opioids on Twitter.

Authors:  Samah Jamal Fodeh; Mohammed Al-Garadi; Osama Elsankary; Jeanmarie Perrone; William Becker; Abeed Sarker
Journal:  Comput Biol Med       Date:  2020-11-20       Impact factor: 4.589

7.  Evaluating semantic relations in neural word embeddings with biomedical and general domain knowledge bases.

Authors:  Zhiwei Chen; Zhe He; Xiuwen Liu; Jiang Bian
Journal:  BMC Med Inform Decis Mak       Date:  2018-07-23       Impact factor: 2.796

8.  Development of a Consumer Health Vocabulary by Mining Health Forum Texts Based on Word Embedding: Semiautomatic Approach.

Authors:  Gen Gu; Xingting Zhang; Xingeng Zhu; Zhe Jian; Ken Chen; Dong Wen; Li Gao; Shaodian Zhang; Fei Wang; Handong Ma; Jianbo Lei
Journal:  JMIR Med Inform       Date:  2019-05-23

9.  Analyzing Social Media Data to Understand Consumer Information Needs on Dietary Supplements.

Authors:  Rubina F Rizvi; Yefeng Wang; Thao Nguyen; Jake Vasilakes; Jiang Bian; Zhe He; Rui Zhang
Journal:  Stud Health Technol Inform       Date:  2019-08-21

10.  An Informatics Framework to Assess Consumer Health Language Complexity Differences: Proof-of-Concept Study.

Authors:  Biyang Yu; Zhe He; Aiwen Xing; Mia Liza A Lustria
Journal:  J Med Internet Res       Date:  2020-05-21       Impact factor: 5.428

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.