Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Standardizing Heterogeneous Annotation Corpora Using HL7 FHIR for Facilitating their Reuse and Integration in Clinical NLP.

Literature DB >> 30815098

Standardizing Heterogeneous Annotation Corpora Using HL7 FHIR for Facilitating their Reuse and Integration in Clinical NLP.

Na Hong¹, Andrew Wen¹, Majid Rastegar Mojarad¹, Sunghwan Sohn¹, Hongfang Liu¹, Guoqian Jiang¹.

Abstract

Manually annotated clinical corpora are commonly used as the gold standards for the training and evaluation of clinical natural language processing (NLP) tools. The creation of these manual annotation corpora, however, is both costly and time-consuming. There is an emerging need in the clinical NLP community for reusing existing annotation corpora across different clinical NLP tasks. The objective of this study is to design, develop and evaluate a framework and accompanying tools to support the standardization and integration of annotation corpora using the HL7 Fast Healthcare Interoperability Resources (FHIR) specification. The framework contains two main modules: 1) an automatic schema transformation module, in which the annotation schema in each corpus is automatically transformed into the FHIR-based schema; 2) an expert-based verification and annotation module, in which existing annotations can be verified and new annotations can be added for new elements defined in FHIR. We evaluated the framework using various annotation corpora created as part of different clinical NLP projects at the Mayo Clinic. We demonstrated that it is feasible to leverage FHIR as a standard data model for standardizing heterogeneous annotation corpora for their reuse and integration in advanced clinical NLP research and practices.

Entities: Disease Species

Mesh：

Year: 2018 PMID： 30815098 PMCID： PMC6371380

Source DB: PubMed Journal: AMIA Annu Symp Proc ISSN： 1559-4076

10 in total

1. The SHARPn project on secondary use of Electronic Medical Record data: progress, plans, and possibilities.

Authors: Christopher G Chute; Jyotishman Pathak; Guergana K Savova; Kent R Bailey; Marshall I Schor; Lacey A Hart; Calvin E Beebe; Stanley M Huff
Journal: AMIA Annu Symp Proc Date: 2011-10-22

2. Anafora: A Web-based General Purpose Annotation Tool.

Authors: Wei-Te Chen; Will Styler
Journal: Proc Conf Date: 2013-06

3. Modeling and validating HL7 FHIR profiles using semantic web Shape Expressions (ShEx).

Authors: Harold R Solbrig; Eric Prud'hommeaux; Grahame Grieve; Lloyd McKenzie; Joshua C Mandel; Deepak K Sharma; Guoqian Jiang
Journal: J Biomed Inform Date: 2017-02-16 Impact factor: 6.317

4. Portable automatic text classification for adverse drug reaction detection via multi-corpus training.

Authors: Abeed Sarker; Graciela Gonzalez
Journal: J Biomed Inform Date: 2014-11-08 Impact factor: 6.317

5. MedXN: an open source medication extraction and normalization tool for clinical text.

Authors: Sunghwan Sohn; Cheryl Clark; Scott R Halgrim; Sean P Murphy; Christopher G Chute; Hongfang Liu
Journal: J Am Med Inform Assoc Date: 2014-03-17 Impact factor: 4.497

6. Building a robust, scalable and standards-driven infrastructure for secondary use of EHR data: the SHARPn project.

Authors: Susan Rea; Jyotishman Pathak; Guergana Savova; Thomas A Oniki; Les Westberg; Calvin E Beebe; Cui Tao; Craig G Parker; Peter J Haug; Stanley M Huff; Christopher G Chute
Journal: J Biomed Inform Date: 2012-02-04 Impact factor: 6.317

7. The CLEF corpus: semantic annotation of clinical text.

Authors: Angus Roberts; Robert Gaizauskas; Mark Hepple; Neil Davis; George Demetriou; Yikun Guo; Jay Kola; Ian Roberts; Andrea Setzer; Archana Tapuria; Bill Wheeldin
Journal: AMIA Annu Symp Proc Date: 2007-10-11

8. Pooling annotated corpora for clinical concept extraction.

Authors: Kavishwar B Wagholikar; Manabu Torii; Siddhartha R Jonnalagadda; Hongfang Liu
Journal: J Biomed Semantics Date: 2013-01-08

9. Systematic Analysis of Free-Text Family History in Electronic Health Record.

Authors: Yanshan Wang; Liwei Wang; Majid Rastegar-Mojarad; Sijia Liu; Feichen Shen; Hongfang Liu
Journal: AMIA Jt Summits Transl Sci Proc Date: 2017-07-26

10. Integrating Structured and Unstructured EHR Data Using an FHIR-based Type System: A Case Study with Medication Data.

Authors: Na Hong; Andrew Wen; Feichen Shen; Sunghwan Sohn; Sijia Liu; Hongfang Liu; Guoqian Jiang
Journal: AMIA Jt Summits Transl Sci Proc Date: 2018-05-18

10 in total

6 in total

Review 1. HL7 FHIR-based tools and initiatives to support clinical research: a scoping review.

Authors: Stephany N Duda; Nan Kennedy; Douglas Conway; Alex C Cheng; Viet Nguyen; Teresa Zayas-Cabán; Paul A Harris
Journal: J Am Med Inform Assoc Date: 2022-08-16 Impact factor: 7.942

2. Developing a FHIR-based EHR phenotyping framework: A case study for identification of patients with obesity and multiple comorbidities from discharge summaries.

Authors: Na Hong; Andrew Wen; Daniel J Stone; Shintaro Tsuji; Paul R Kingsbury; Luke V Rasmussen; Jennifer A Pacheco; Prakash Adekkanattu; Fei Wang; Yuan Luo; Jyotishman Pathak; Hongfang Liu; Guoqian Jiang
Journal: J Biomed Inform Date: 2019-10-14 Impact factor: 6.317

3. Developing an FHIR-Based Computational Pipeline for Automatic Population of Case Report Forms for Colorectal Cancer Clinical Trials Using Electronic Health Records.

Authors: Nansu Zong; Andrew Wen; Daniel J Stone; Deepak K Sharma; Chen Wang; Yue Yu; Hongfang Liu; Qian Shi; Guoqian Jiang
Journal: JCO Clin Cancer Inform Date: 2020-03

4. Developing a scalable FHIR-based clinical data normalization pipeline for standardizing and integrating unstructured and structured electronic health record data.

Authors: Na Hong; Andrew Wen; Feichen Shen; Sunghwan Sohn; Chen Wang; Hongfang Liu; Guoqian Jiang
Journal: JAMIA Open Date: 2019-10-18

5. A Framework (SOCRATex) for Hierarchical Annotation of Unstructured Electronic Health Records and Integration Into a Standardized Medical Database: Development and Usability Study.

Authors: Jimyung Park; Seng Chan You; Eugene Jeong; Chunhua Weng; Dongsu Park; Jin Roh; Dong Yun Lee; Jae Youn Cheong; Jin Wook Choi; Mira Kang; Rae Woong Park
Journal: JMIR Med Inform Date: 2021-03-30

6. Designing an openEHR-Based Pipeline for Extracting and Standardizing Unstructured Clinical Data Using Natural Language Processing.

Authors: Antje Wulff; Marcel Mast; Marcus Hassler; Sara Montag; Michael Marschollek; Thomas Jack
Journal: Methods Inf Med Date: 2020-10-14 Impact factor: 2.176

6 in total