The 2019 National Natural language processing (NLP) Clinical Challenges (n2c2)/Open Health NLP (OHNLP) shared task on clinical concept normalization for clinical records.

Sam Henry, Yanshan Wang, Feichen Shen, Ozlem Uzuner.

Abstract

OBJECTIVE: Track 3 of the 2019 National Natural language processing (NLP) Clinical Challenges (n2c2)/Open Health NLP (OHNLP) shared task focused on medical concept normalization (MCN) in clinical records. This track aimed to assess the state of the art in identifying and matching salient medical concepts to a controlled vocabulary. In this paper, we describe the task and the data set used, compare the participating systems, present results, identify the strengths and limitations of the current state of the art, and identify directions for future research.
MATERIALS AND METHODS: Participating teams were provided with narrative discharge summaries in which text spans corresponding to medical concepts were identified. This paper refers to these text spans as mentions. Teams were tasked with normalizing these mentions to concepts, represented by concept unique identifiers, within the Unified Medical Language System. Submitted systems represented 4 broad categories of approaches: cascading dictionary matching, cosine distance, deep learning, and retrieve-and-rank systems. Disambiguation modules were common across all approaches.
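To make two of the approach categories named above concrete, the following is a minimal sketch of a cascading dictionary matcher that falls back to cosine similarity. It is not any participating team's system. The dictionary, its identifiers (placeholders, not real Unified Medical Language System concept unique identifiers), the character-trigram representation, the 0.6 threshold, and all function names are assumptions made for illustration.

```python
import math
import re
from collections import Counter

# Toy dictionary: surface form -> placeholder identifier.
# These identifiers are made up for illustration; they are NOT real UMLS CUIs.
TOY_DICTIONARY = {
    "headache": "TOY0001",
    "myocardial infarction": "TOY0002",
    "hypertension": "TOY0003",
}


def simplify(text):
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    return " ".join(re.sub(r"[^a-z0-9]+", " ", text.lower()).split())


def trigrams(text):
    """Count the character trigrams of a padded string."""
    padded = f"  {text} "
    return Counter(padded[i:i + 3] for i in range(len(padded) - 2))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def normalize_mention(mention, dictionary=TOY_DICTIONARY, threshold=0.6):
    """Return an identifier for the mention, or None if no stage matches."""
    # Stage 1: exact string match.
    if mention in dictionary:
        return dictionary[mention]
    # Stage 2: match after simplifying both the mention and the dictionary terms.
    simple = simplify(mention)
    by_simple = {simplify(term): cui for term, cui in dictionary.items()}
    if simple in by_simple:
        return by_simple[simple]
    # Stage 3: most similar dictionary term by cosine similarity over trigrams,
    # accepted only if the similarity reaches the (assumed) threshold.
    query = trigrams(simple)
    best_term = max(by_simple, key=lambda term: cosine(query, trigrams(term)))
    if cosine(query, trigrams(best_term)) >= threshold:
        return by_simple[best_term]
    return None
```

Each stage is tried only if the stricter one before it fails, so `normalize_mention("Myocardial  Infarction.")` resolves at stage 2, while a misspelling such as `"hypertenson"` only resolves at the similarity stage. A real system would also need a disambiguation step for mentions that match more than one concept, which this sketch omits.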
RESULTS: A total of 33 teams participated in the MCN task. The best-performing team achieved an accuracy of 0.8526. The median and mean performances among all teams were 0.7733 and 0.7426, respectively.
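The figures above are accuracies and their summary statistics across teams. As a hedged illustration of how such numbers are computed, the sketch below scores predicted identifiers against gold identifiers and then summarizes several team scores. The function name and all sample values are hypothetical and are not the shared task's data.

```python
from statistics import mean, median


def accuracy(gold, predicted):
    """Fraction of mentions whose predicted identifier equals the gold one."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    correct = sum(1 for g, p in zip(gold, predicted) if g == p)
    return correct / len(gold)


# Hypothetical example: 4 mentions, 3 normalized correctly.
gold_ids = ["A", "B", "C", "D"]
predicted_ids = ["A", "B", "X", "D"]
team_accuracy = accuracy(gold_ids, predicted_ids)

# Hypothetical per-team accuracies, summarized the way the abstract reports them.
team_scores = [0.5, 0.75, 1.0]
summary = {"median": median(team_scores), "mean": mean(team_scores)}
```

A mean below the median, as in the reported results, indicates that a few low-scoring systems pull the average down.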
CONCLUSIONS: Overall performance among the top 10 teams was high. However, several mention types were challenging for all teams. These included mentions requiring disambiguation of misspelled words, acronyms, abbreviations, and mentions with more than 1 possible semantic type. Also challenging were complex mentions of long, multi-word terms that may require new ways of extracting and representing mention meaning, the use of domain knowledge, parse trees, or hand-crafted rules.
© The Author(s) 2020. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Keywords:  clinical narratives; concept normalization; machine learning; natural language processing

Year: 2020; PMID: 32968800; PMCID: PMC7647359; DOI: 10.1093/jamia/ocaa106

Source DB: PubMed; Journal: J Am Med Inform Assoc; ISSN: 1067-5027; Impact factor: 4.497

