Literature DB >> 33380319

Annotation and extraction of age and temporally-related events from clinical histories.

Judy Hong1, Anahita Davoudi2, Shun Yu3, Danielle L Mowery4,5.   

Abstract

BACKGROUND: Age and time information stored within the histories of clinical notes can provide valuable insights for assessing a patient's disease risk, understanding disease progression, and studying therapeutic outcomes. However, details of age and temporally-specified clinical events are not well captured, consistently codified, and readily available to research databases for study.
METHODS: We expanded upon existing annotation schemes to capture additional age and temporal information, conducted an annotation study to validate our expanded schema, and developed a prototypical, rule-based Named Entity Recognizer to extract our novel clinical named entities (NE). The annotation study was conducted on 138 discharge summaries from the pre-annotated 2014 ShARe/CLEF eHealth Challenge corpus. In addition to existing NE classes (TIMEX3, SUBJECT_CLASS, DISEASE_DISORDER), our schema proposes 3 additional NEs (AGE, PROCEDURE, OTHER_EVENTS). We also propose new attributes, e.g., "degree_relation" which captures the degree of biological relation for subjects annotated under SUBJECT_CLASS. As a proof of concept, we applied the schema to 49 H&P notes to encode pertinent history information for a lung cancer cohort study.
RESULTS: An abundance of information was captured under the new OTHER_EVENTS, PROCEDURE and AGE classes, with 23%, 10% and 8% of all annotated NEs belonging to the above classes, respectively. We observed high inter-annotator agreement of >80% for AGE and TIMEX3; the automated NLP system achieved F1 scores of 86% (AGE) and 86% (TIMEX3). Age and temporally-specified mentions within past medical, family, surgical, and social histories were common in our lung cancer data set; annotation is ongoing to support this translational research study.
CONCLUSIONS: Our annotation schema and NLP system can encode historical events from clinical notes to support clinical and translational research studies.

Entities:  

Keywords:  Medical informatics; Natural language processing; Temporality

Year:  2020        PMID: 33380319      PMCID: PMC7772895          DOI: 10.1186/s12911-020-01333-5

Source DB:  PubMed          Journal:  BMC Med Inform Decis Mak        ISSN: 1472-6947            Impact factor:   2.796


  2 in total

1.  Annotating Temporal Relations to Determine the Onset of Psychosis Symptoms.

Authors:  Natalia Viani; Joyce Kam; Lucia Yin; Somain Verma; Robert Stewart; Rashmi Patel; Sumithra Velupillai
Journal:  Stud Health Technol Inform       Date:  2019-08-21

2.  Determining Onset for Familial Breast and Colorectal Cancer from Family History Comments in the Electronic Health Record.

Authors:  Danielle L Mowery; Kensaku Kawamoto; Rick Bradshaw; Wendy Kohlmann; Joshua D Schiffman; Charlene Weir; Damian Borbolla; Wendy W Chapman; Guilherme Del Fiol
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2019-05-06
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.