MT-clinical BERT: scaling clinical information extraction with multitask learning.

Andriy Mulyar, Ozlem Uzuner, Bridget McInnes.

Abstract

OBJECTIVE: Clinical notes contain an abundance of important, but not readily accessible, information about patients. Systems that automatically extract this information rely on large amounts of training data, yet the resources available to create such data are limited. Furthermore, these systems are developed disjointly, meaning that no information can be shared among task-specific systems. This bottleneck unnecessarily complicates practical application, reduces the performance capabilities of each individual solution, and incurs the engineering debt of managing multiple information extraction systems.
MATERIALS AND METHODS: We address these challenges by developing Multitask-Clinical BERT: a single deep learning model that simultaneously performs 8 clinical tasks spanning entity extraction, personal health information identification, language entailment, and similarity by sharing representations among tasks.
RESULTS: We compare the performance of our multitasking information extraction system to state-of-the-art BERT sequential fine-tuning baselines. We observe a slight but consistent performance degradation in MT-Clinical BERT relative to sequential fine-tuning.
DISCUSSION: These results intuitively suggest that learning a general clinical text representation capable of supporting multiple tasks has the downside of losing the ability to exploit dataset or clinical note-specific properties when compared to a single, task-specific model.
CONCLUSIONS: We find that our single system performs competitively with all state-of-the-art task-specific systems while also gaining massive computational savings at inference.
© The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.
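
The shared-representation design described in the abstract can be illustrated with a toy sketch: one encoder is run once per input, and several lightweight task-specific heads reuse its output. Everything in the block is an assumption made for illustration and not the authors' implementation: `SharedEncoder` is a character-hashing stand-in rather than BERT, the heads are untrained and randomly initialized, every task is simplified to sentence-level classification, and the head names and label counts in `LABEL_COUNTS` are hypothetical, chosen only to mirror the four task families named in the abstract.

```python
import random
from typing import Dict, List

Vector = List[float]


class SharedEncoder:
    """Toy stand-in for a shared text encoder (assumption: not a real BERT)."""

    def __init__(self, dim: int = 16) -> None:
        self.dim = dim
        self.calls = 0  # counts encoder forward passes

    def encode(self, text: str) -> Vector:
        self.calls += 1
        # Hash characters into a fixed-size, normalized count vector.
        vec = [0.0] * self.dim
        for ch in text.lower():
            vec[ord(ch) % self.dim] += 1.0
        total = sum(vec) or 1.0
        return [v / total for v in vec]


class LinearHead:
    """Task-specific linear classifier over the shared representation."""

    def __init__(self, dim: int, n_labels: int, seed: int) -> None:
        rng = random.Random(seed)  # untrained, randomly initialized weights
        self.weights = [
            [rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(n_labels)
        ]

    def predict(self, rep: Vector) -> int:
        # Score each label and return the index of the highest score.
        scores = [sum(w * x for w, x in zip(row, rep)) for row in self.weights]
        return max(range(len(scores)), key=scores.__getitem__)


class MultitaskModel:
    """One shared encoder feeding several task heads."""

    def __init__(self, encoder: SharedEncoder, heads: Dict[str, LinearHead]) -> None:
        self.encoder = encoder
        self.heads = heads

    def predict_all(self, text: str) -> Dict[str, int]:
        rep = self.encoder.encode(text)  # single shared forward pass
        return {task: head.predict(rep) for task, head in self.heads.items()}


# Hypothetical head names and label counts, one per task family in the abstract.
LABEL_COUNTS = {
    "entity_extraction": 3,
    "phi_identification": 2,
    "entailment": 3,
    "similarity": 2,
}

encoder = SharedEncoder()
heads = {
    task: LinearHead(encoder.dim, n_labels, seed=i)
    for i, (task, n_labels) in enumerate(LABEL_COUNTS.items())
}
model = MultitaskModel(encoder, heads)
predictions = model.predict_all("Patient denies chest pain.")
print(predictions, "encoder passes:", encoder.calls)
```

In this sketch the encoder runs once regardless of how many heads are attached, whereas a set of separate task-specific models would each run their own encoder. That is one plausible reading of the inference-time savings the abstract reports.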

Keywords:  clinical natural language processing; named entity recognition; textual entailment; semantic text similarity; multitask learning; natural language processing

Year:  2021        PMID: 34333635      PMCID: PMC8449623          DOI: 10.1093/jamia/ocab126

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   7.942


