Literature DB >> 28750904

A comparison of rule-based and machine learning approaches for classifying patient portal messages.

Robert M Cronin1, Daniel Fabbri2, Joshua C Denny3, S Trent Rosenbloom4, Gretchen Purcell Jackson5.   

Abstract

OBJECTIVE: Secure messaging through patient portals is an increasingly popular way that consumers interact with healthcare providers. The increasing burden of secure messaging can affect clinic staffing and workflows. Manual management of portal messages is costly and time consuming. Automated classification of portal messages could potentially expedite message triage and delivery of care.
MATERIALS AND METHODS: We developed automated patient portal message classifiers with rule-based and machine learning techniques using bag of words and natural language processing (NLP) approaches. To evaluate classifier performance, we used a gold standard of 3253 portal messages manually categorized using a taxonomy of communication types (i.e., main categories of informational, medical, logistical, social, and other communications, and subcategories including prescriptions, appointments, problems, tests, follow-up, contact information, and acknowledgement). We evaluated our classifiers' accuracies in identifying individual communication types within portal messages with area under the receiver-operator curve (AUC). Portal messages often contain more than one type of communication. To predict all communication types within single messages, we used the Jaccard Index. We extracted the variables of importance for the random forest classifiers.
RESULTS: The best performing approaches to classification for the major communication types were: logistic regression for medical communications (AUC: 0.899); basic (rule-based) for informational communications (AUC: 0.842); and random forests for social communications and logistical communications (AUCs: 0.875 and 0.925, respectively). The best performing classification approach of classifiers for individual communication subtypes was random forests for Logistical-Contact Information (AUC: 0.963). The Jaccard Indices by approach were: basic classifier, Jaccard Index: 0.674; Naïve Bayes, Jaccard Index: 0.799; random forests, Jaccard Index: 0.859; and logistic regression, Jaccard Index: 0.861. For medical communications, the most predictive variables were NLP concepts (e.g., Temporal_Concept, which maps to 'morning', 'evening' and Idea_or_Concept which maps to 'appointment' and 'refill'). For logistical communications, the most predictive variables contained similar numbers of NLP variables and words (e.g., Telephone mapping to 'phone', 'insurance'). For social and informational communications, the most predictive variables were words (e.g., social: 'thanks', 'much', informational: 'question', 'mean').
CONCLUSIONS: This study applies automated classification methods to the content of patient portal messages and evaluates the application of NLP techniques on consumer communications in patient portal messages. We demonstrated that random forest and logistic regression approaches accurately classified the content of portal messages, although the best approach to classification varied by communication type. Words were the most predictive variables for classification of most communication types, although NLP variables were most predictive for medical communication types. As adoption of patient portals increases, automated techniques could assist in understanding and managing growing volumes of messages. Further work is needed to improve classification performance to potentially support message triage and answering.
Copyright © 2017 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Machine learning; Natural language processing; Patient portal; Text classification

Mesh:

Year:  2017        PMID: 28750904      PMCID: PMC5546247          DOI: 10.1016/j.ijmedinf.2017.06.004

Source DB:  PubMed          Journal:  Int J Med Inform        ISSN: 1386-5056            Impact factor:   4.046


  38 in total

1.  Surgical textbooks: past, present, and future.

Authors:  Gretchen P Purcell
Journal:  Ann Surg       Date:  2003-12       Impact factor: 12.969

2.  The KnowledgeMap project: development of a concept-based medical school curriculum database.

Authors:  Joshua C Denny; Plomarz R Irani; Firas H Wehbe; Jeffrey D Smithers; Anderson Spickard
Journal:  AMIA Annu Symp Proc       Date:  2003

Review 3.  A systematic review of interactive computer-assisted technology in diabetes care. Interactive information technology in diabetes care.

Authors:  Chandra L Jackson; Shari Bolen; Frederick L Brancati; Marian L Batts-Turner; Tiffany L Gary
Journal:  J Gen Intern Med       Date:  2005-12-22       Impact factor: 5.128

4.  "Where do we teach what?" Finding broad concepts in the medical school curriculum.

Authors:  Joshua C Denny; Jeffrey D Smithers; Brian Armstrong; Anderson Spickard
Journal:  J Gen Intern Med       Date:  2005-10       Impact factor: 5.128

5.  Identifying UMLS concepts from ECG Impressions using KnowledgeMap.

Authors:  Joshua C Denny; Anderson Spickard; Randolph A Miller; Jonathan Schildcrout; Dawood Darbar; S Trent Rosenbloom; Josh F Peterson
Journal:  AMIA Annu Symp Proc       Date:  2005

Review 6.  A review of computer and Internet-based interventions for smoking behavior.

Authors:  Scott T Walters; Jo Anne Wright; Ross Shegog
Journal:  Addict Behav       Date:  2005-06-13       Impact factor: 3.913

7.  Opportunities to enhance patient and physician e-mail contact.

Authors:  John Hobbs; Jonathan Wald; Yamini S Jagannath; Anne Kittler; Lisa Pizziferri; Lynn A Volk; Blackford Middleton; David W Bates
Journal:  Int J Med Inform       Date:  2003-04       Impact factor: 4.046

8.  The missing link: bridging the patient-provider health information gap.

Authors:  Paul C Tang; David Lansky
Journal:  Health Aff (Millwood)       Date:  2005 Sep-Oct       Impact factor: 6.301

Review 9.  Consumer-driven, patient-centered health care in the age of electronic information.

Authors:  Nancy Calabretta
Journal:  J Med Libr Assoc       Date:  2002-01

10.  A content analysis of e-mail communication between patients and their providers: patients get the message.

Authors:  Casey B White; Cheryl A Moyer; David T Stern; Steven J Katz
Journal:  J Am Med Inform Assoc       Date:  2004-04-02       Impact factor: 4.497

View more
  16 in total

1.  A systematic literature review of machine learning in online personal health data.

Authors:  Zhijun Yin; Lina M Sulieman; Bradley A Malin
Journal:  J Am Med Inform Assoc       Date:  2019-06-01       Impact factor: 4.497

2.  Why Patient Portal Messages Indicate Risk of Readmission for Patients with Ischemic Heart Disease.

Authors:  Lina Sulieman; Zhijun Yin; Bradley A Malin
Journal:  AMIA Annu Symp Proc       Date:  2020-03-04

3.  A systematic review of natural language processing and text mining of symptoms from electronic patient-authored text data.

Authors:  Caitlin Dreisbach; Theresa A Koleck; Philip E Bourne; Suzanne Bakken
Journal:  Int J Med Inform       Date:  2019-02-20       Impact factor: 4.046

4.  Applying Natural Language Processing Neural Network Architectures to Augment Appointment Request Review of Self-Referred Patients to an Academic Medical Center.

Authors:  Christopher A Aakre
Journal:  AMIA Annu Symp Proc       Date:  2022-05-23

5.  Extracting Multiple Worries From Breast Cancer Patient Blogs Using Multilabel Classification With the Natural Language Processing Model Bidirectional Encoder Representations From Transformers: Infodemiology Study of Blogs.

Authors:  Tomomi Watanabe; Shuntaro Yada; Eiji Aramaki; Hiroshi Yajima; Hayato Kizaki; Satoko Hori
Journal:  JMIR Cancer       Date:  2022-06-03

6.  Common Consumer Health-Related Needs in the Pediatric Hospital Setting: Lessons from an Engagement Consultation Service.

Authors:  Daniel J Lee; Robert Cronin; Jamie Robinson; Shilo Anders; Kim Unertl; Katherine Kelly; Heather Hankins; Ryan Skeens; Gretchen P Jackson
Journal:  Appl Clin Inform       Date:  2018-08-08       Impact factor: 2.342

Review 7.  Can antiepileptic efficacy and epilepsy variables be studied from electronic health records? A review of current approaches.

Authors:  Barbara M Decker; Chloé E Hill; Steven N Baldassano; Pouya Khankhanian
Journal:  Seizure       Date:  2021-01-13       Impact factor: 3.184

8.  Automating the Classification of Complexity of Medical Decision-Making in Patient-Provider Messaging in a Patient Portal.

Authors:  Lina Sulieman; Jamie R Robinson; Gretchen P Jackson
Journal:  J Surg Res       Date:  2020-06-19       Impact factor: 2.192

9.  Predicting the readability of physicians' secure messages to improve health communication using novel linguistic features: Findings from the ECLIPPSE study.

Authors:  Scott A Crossley; Renu Balyan; Jennifer Liu; Andrew J Karter; Danielle McNamara; Dean Schillinger
Journal:  J Commun Healthc       Date:  2020-09-24

10.  Qcorp: an annotated classification corpus of Chinese health questions.

Authors:  Haihong Guo; Xu Na; Jiao Li
Journal:  BMC Med Inform Decis Mak       Date:  2018-03-22       Impact factor: 2.796

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.