Literature DB >> 32949781

Extracting and classifying diagnosis dates from clinical notes: A case study.

Julia T Fu1, Evan Sholle2, Spencer Krichevsky3, Joseph Scandura4, Thomas R Campion5.   

Abstract

Myeloproliferative neoplasms (MPNs) are chronic hematologic malignancies that may progress over long disease courses. The original date of diagnosis is an important piece of information for patient care and research, but is not consistently documented. We describe an attempt to build a pipeline for extracting dates with natural language processing (NLP) tools and techniques and classifying them as relevant diagnoses or not. Inaccurate and incomplete date extraction and interpretation impacted the performance of the overall pipeline. Existing lightweight Python packages tended to have low specificity for identifying and interpreting partial and relative dates in clinical text. A rules-based regular expression (regex) approach achieved recall of 83.0% on dates manually annotated as diagnosis dates, and 77.4% on all annotated dates. With only 3.8% of annotated dates representing initial MPN diagnoses, additional methods of targeting candidate date instances may alleviate noise and class imbalance.
Copyright © 2020 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Classification; Clinical text; Natural language processing; Temporality

Mesh:

Year:  2020        PMID: 32949781     DOI: 10.1016/j.jbi.2020.103569

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  1 in total

1.  An architecture for research computing in health to support clinical and translational investigators with electronic patient data.

Authors:  Thomas R Campion; Evan T Sholle; Jyotishman Pathak; Stephen B Johnson; John P Leonard; Curtis L Cole
Journal:  J Am Med Inform Assoc       Date:  2022-03-15       Impact factor: 4.497

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.