Literature DB >> 31622801

Developing a FHIR-based EHR phenotyping framework: A case study for identification of patients with obesity and multiple comorbidities from discharge summaries.

Na Hong1, Andrew Wen1, Daniel J Stone1, Shintaro Tsuji1, Paul R Kingsbury1, Luke V Rasmussen2, Jennifer A Pacheco2, Prakash Adekkanattu3, Fei Wang3, Yuan Luo2, Jyotishman Pathak3, Hongfang Liu1, Guoqian Jiang4.   

Abstract

BACKGROUND: Standards-based clinical data normalization has become a key component of effective data integration and accurate phenotyping for secondary use of electronic healthcare records (EHR) data. HL7 Fast Healthcare Interoperability Resources (FHIR) is an emerging clinical data standard for exchanging electronic healthcare data and has been used in modeling and integrating both structured and unstructured EHR data for a variety of clinical research applications. The overall objective of this study is to develop and evaluate a FHIR-based EHR phenotyping framework for identification of patients with obesity and its multiple comorbidities from semi-structured discharge summaries leveraging a FHIR-based clinical data normalization pipeline (known as NLP2FHIR).
METHODS: We implemented a multi-class and multi-label classification system based on the i2b2 Obesity Challenge task to evaluate the FHIR-based EHR phenotyping framework. Two core parts of the framework are: (a) the conversion of discharge summaries into corresponding FHIR resources - Composition, Condition, MedicationStatement, Procedure and FamilyMemberHistory using the NLP2FHIR pipeline, and (b) the implementation of four machine learning algorithms (logistic regression, support vector machine, decision tree, and random forest) to train classifiers to predict disease state of obesity and 15 comorbidities using features extracted from standard FHIR resources and terminology expansions. We used the macro- and micro-averaged precision (P), recall (R), and F1 score (F1) measures to evaluate the classifier performance. We validated the framework using a second obesity dataset extracted from the MIMIC-III database.
RESULTS: Using the NLP2FHIR pipeline, 1237 clinical discharge summaries from the 2008 i2b2 obesity challenge dataset were represented as the instances of the FHIR Composition resource consisting of 5677 records with 16 unique section types. After the NLP processing and FHIR modeling, a set of 244,438 FHIR clinical resource instances were generated. As the results of the four machine learning classifiers, the random forest algorithm performed the best with F1-micro(0.9466)/F1-macro(0.7887) and F1-micro(0.9536)/F1-macro(0.6524) for intuitive classification (reflecting medical professionals' judgments) and textual classification (reflecting the judgments based on explicitly reported information of diseases), respectively. The MIMIC-III obesity dataset was successfully integrated for prediction with minimal configuration of the NLP2FHIR pipeline and machine learning models.
CONCLUSIONS: The study demonstrated that the FHIR-based EHR phenotyping approach could effectively identify the state of obesity and multiple comorbidities using semi-structured discharge summaries. Our FHIR-based phenotyping approach is a first concrete step towards improving the data aspect of phenotyping portability across EHR systems and enhancing interpretability of the machine learning-based phenotyping algorithms.
Copyright © 2019 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Algorithm portability; Clinical phenotyping; Electronic Health Records (EHRs); HL7 Fast Healthcare Interoperability Resources (FHIR); Natural language processing

Year:  2019        PMID: 31622801      PMCID: PMC6990976          DOI: 10.1016/j.jbi.2019.103310

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  21 in total

1.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.

Authors:  Guergana K Savova; James J Masanz; Philip V Ogren; Jiaping Zheng; Sunghwan Sohn; Karin C Kipper-Schuler; Christopher G Chute
Journal:  J Am Med Inform Assoc       Date:  2010 Sep-Oct       Impact factor: 4.497

2.  Recognizing obesity and comorbidities in sparse data.

Authors:  Ozlem Uzuner
Journal:  J Am Med Inform Assoc       Date:  2009-04-23       Impact factor: 4.497

3.  Evaluation of a method to identify and categorize section headers in clinical documents.

Authors:  Joshua C Denny; Anderson Spickard; Kevin B Johnson; Neeraja B Peterson; Josh F Peterson; Randolph A Miller
Journal:  J Am Med Inform Assoc       Date:  2009-08-28       Impact factor: 4.497

4.  MedXN: an open source medication extraction and normalization tool for clinical text.

Authors:  Sunghwan Sohn; Cheryl Clark; Scott R Halgrim; Sean P Murphy; Christopher G Chute; Hongfang Liu
Journal:  J Am Med Inform Assoc       Date:  2014-03-17       Impact factor: 4.497

5.  DeepPhe: A Natural Language Processing System for Extracting Cancer Phenotypes from Clinical Records.

Authors:  Guergana K Savova; Eugene Tseytlin; Sean Finan; Melissa Castine; Timothy Miller; Olga Medvedeva; David Harris; Harry Hochheiser; Chen Lin; Girish Chavan; Rebecca S Jacobson
Journal:  Cancer Res       Date:  2017-11-01       Impact factor: 12.701

6.  Semantic classification of diseases in discharge summaries using a context-aware rule-based classifier.

Authors:  Illés Solt; Domonkos Tikk; Viktor Gál; Zsolt T Kardkovács
Journal:  J Am Med Inform Assoc       Date:  2009-04-23       Impact factor: 4.497

7.  ConText: an algorithm for determining negation, experiencer, and temporal status from clinical reports.

Authors:  Henk Harkema; John N Dowling; Tyler Thornblade; Wendy W Chapman
Journal:  J Biomed Inform       Date:  2009-05-10       Impact factor: 6.317

8.  Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers.

Authors:  George Hripcsak; Jon D Duke; Nigam H Shah; Christian G Reich; Vojtech Huser; Martijn J Schuemie; Marc A Suchard; Rae Woong Park; Ian Chi Kei Wong; Peter R Rijnbeek; Johan van der Lei; Nicole Pratt; G Niklas Norén; Yu-Chuan Li; Paul E Stang; David Madigan; Patrick B Ryan
Journal:  Stud Health Technol Inform       Date:  2015

Review 9.  The Electronic Medical Records and Genomics (eMERGE) Network: past, present, and future.

Authors:  Omri Gottesman; Helena Kuivaniemi; Gerard Tromp; W Andrew Faucett; Rongling Li; Teri A Manolio; Saskia C Sanderson; Joseph Kannry; Randi Zinberg; Melissa A Basford; Murray Brilliant; David J Carey; Rex L Chisholm; Christopher G Chute; John J Connolly; David Crosslin; Joshua C Denny; Carlos J Gallego; Jonathan L Haines; Hakon Hakonarson; John Harley; Gail P Jarvik; Isaac Kohane; Iftikhar J Kullo; Eric B Larson; Catherine McCarty; Marylyn D Ritchie; Dan M Roden; Maureen E Smith; Erwin P Böttinger; Marc S Williams
Journal:  Genet Med       Date:  2013-06-06       Impact factor: 8.822

10.  Next-generation phenotyping of electronic health records.

Authors:  George Hripcsak; David J Albers
Journal:  J Am Med Inform Assoc       Date:  2012-09-06       Impact factor: 4.497

View more
  11 in total

Review 1.  HL7 FHIR-based tools and initiatives to support clinical research: a scoping review.

Authors:  Stephany N Duda; Nan Kennedy; Douglas Conway; Alex C Cheng; Viet Nguyen; Teresa Zayas-Cabán; Paul A Harris
Journal:  J Am Med Inform Assoc       Date:  2022-08-16       Impact factor: 7.942

2.  Design and validation of a FHIR-based EHR-driven phenotyping toolbox.

Authors:  Pascal S Brandt; Jennifer A Pacheco; Prakash Adekkanattu; Evan T Sholle; Sajjad Abedian; Daniel J Stone; David M Knaack; Jie Xu; Zhenxing Xu; Yifan Peng; Natalie C Benda; Fei Wang; Yuan Luo; Guoqian Jiang; Jyotishman Pathak; Luke V Rasmussen
Journal:  J Am Med Inform Assoc       Date:  2022-08-16       Impact factor: 7.942

3.  Inference for the Case Probability in High-dimensional Logistic Regression.

Authors:  Zijian Guo; Prabrisha Rakshit; Daniel S Herman; Jinbo Chen
Journal:  J Mach Learn Res       Date:  2021       Impact factor: 5.177

4.  FHIR-Ontop-OMOP: Building clinical knowledge graphs in FHIR RDF with the OMOP Common data Model.

Authors:  Guohui Xiao; Emily Pfaff; Eric Prud'hommeaux; David Booth; Deepak K Sharma; Nan Huo; Yue Yu; Nansu Zong; Kathryn J Ruddy; Christopher G Chute; Guoqian Jiang
Journal:  J Biomed Inform       Date:  2022-09-09       Impact factor: 8.000

5.  A corpus-driven standardization framework for encoding clinical problems with HL7 FHIR.

Authors:  Kevin J Peterson; Guoqian Jiang; Hongfang Liu
Journal:  J Biomed Inform       Date:  2020-08-16       Impact factor: 6.317

Review 6.  A Year of Papers Using Biomedical Texts.

Authors:  Cyril Grouin; Natalia Grabar
Journal:  Yearb Med Inform       Date:  2020-08-21

7.  Ontologies, Knowledge Representation, and Machine Learning for Translational Research: Recent Contributions.

Authors:  Peter N Robinson; Melissa A Haendel
Journal:  Yearb Med Inform       Date:  2020-08-21

8.  Development of a repository of computable phenotype definitions using the clinical quality language.

Authors:  Pascal S Brandt; Jennifer A Pacheco; Luke V Rasmussen
Journal:  JAMIA Open       Date:  2021-12-03

9.  Deep phenotyping: Embracing complexity and temporality-Towards scalability, portability, and interoperability.

Authors:  Chunhua Weng; Nigam H Shah; George Hripcsak
Journal:  J Biomed Inform       Date:  2020-04-23       Impact factor: 6.317

10.  Designing an openEHR-Based Pipeline for Extracting and Standardizing Unstructured Clinical Data Using Natural Language Processing.

Authors:  Antje Wulff; Marcel Mast; Marcus Hassler; Sara Montag; Michael Marschollek; Thomas Jack
Journal:  Methods Inf Med       Date:  2020-10-14       Impact factor: 2.176

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.