Literature DB >> 31421211

An interpretable natural language processing system for written medical examination assessment.

Abeed Sarker1, Ari Z Klein2, Janet Mee3, Polina Harik3, Graciela Gonzalez-Hernandez2.   

Abstract

OBJECTIVE: The assessment of written medical examinations is a tedious and expensive process, requiring significant amounts of time from medical experts. Our objective was to develop a natural language processing (NLP) system that can expedite the assessment of unstructured answers in medical examinations by automatically identifying relevant concepts in the examinee responses.
MATERIALS AND METHODS: Our NLP system, Intelligent Clinical Text Evaluator (INCITE), is semi-supervised in nature. Learning from a limited set of fully annotated examples, it sequentially applies a series of customized text comparison and similarity functions to determine if a text span represents an entry in a given reference standard. Combinations of fuzzy matching and set intersection-based methods capture inexact matches and also fragmented concepts. Customizable, dynamic similarity-based matching thresholds allow the system to be tailored for examinee responses of different lengths.
RESULTS: INCITE achieved an average F1-score of 0.89 (precision = 0.87, recall = 0.91) against human annotations over held-out evaluation data. Fuzzy text matching, dynamic thresholding and the incorporation of supervision using annotated data resulted in the biggest jumps in performances. DISCUSSION: Long and non-standard expressions are difficult for INCITE to detect, but the problem is mitigated by the use of dynamic thresholding (i.e., varying the similarity threshold for a text span to be considered a match). Annotation variations within exams and disagreements between annotators were the primary causes for false positives. Small amounts of annotated data can significantly improve system performance.
CONCLUSIONS: The high performance and interpretability of INCITE will likely significantly aid the assessment process and also help mitigate the impact of manual assessment inconsistencies.
Copyright © 2019 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Automated assessment; Clinical notes; Natural language processing; Text mining

Mesh:

Year:  2019        PMID: 31421211     DOI: 10.1016/j.jbi.2019.103268

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  4 in total

1.  Development and Validation of a Machine Learning Model for Automated Assessment of Resident Clinical Reasoning Documentation.

Authors:  Verity Schaye; Benedict Guzman; Jesse Burk-Rafel; Marina Marin; Ilan Reinstein; David Kudlowitz; Louis Miller; Jonathan Chun; Yindalon Aphinyanaphongs
Journal:  J Gen Intern Med       Date:  2022-06-16       Impact factor: 6.473

2.  Comprehensive study of semi-supervised learning for DNA methylation-based supervised classification of central nervous system tumors.

Authors:  Quynh T Tran; Md Zahangir Alom; Brent A Orr
Journal:  BMC Bioinformatics       Date:  2022-06-08       Impact factor: 3.307

3.  MKA: A Scalable Medical Knowledge-Assisted Mechanism for Generative Models on Medical Conversation Tasks.

Authors:  Ke Liang; Sifan Wu; Jiayi Gu
Journal:  Comput Math Methods Med       Date:  2021-12-23       Impact factor: 2.238

4.  A Data-Driven Iterative Approach for Semi-automatically Assessing the Correctness of Medication Value Sets: A Proof of Concept Based on Opioids.

Authors:  Linyi Li; Adela Grando; Abeed Sarker
Journal:  Methods Inf Med       Date:  2021-12-29       Impact factor: 2.176

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.