Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Developing a FHIR-based EHR phenotyping framework: A case study for identification of patients with obesity and multiple comorbidities from discharge summaries.

Literature DB >> 31622801

Developing a FHIR-based EHR phenotyping framework: A case study for identification of patients with obesity and multiple comorbidities from discharge summaries.

Na Hong¹, Andrew Wen¹, Daniel J Stone¹, Shintaro Tsuji¹, Paul R Kingsbury¹, Luke V Rasmussen², Jennifer A Pacheco², Prakash Adekkanattu³, Fei Wang³, Yuan Luo², Jyotishman Pathak³, Hongfang Liu¹, Guoqian Jiang⁴.

Abstract

BACKGROUND: Standards-based clinical data normalization has become a key component of effective data integration and accurate phenotyping for secondary use of electronic healthcare records (EHR) data. HL7 Fast Healthcare Interoperability Resources (FHIR) is an emerging clinical data standard for exchanging electronic healthcare data and has been used in modeling and integrating both structured and unstructured EHR data for a variety of clinical research applications. The overall objective of this study is to develop and evaluate a FHIR-based EHR phenotyping framework for identification of patients with obesity and its multiple comorbidities from semi-structured discharge summaries leveraging a FHIR-based clinical data normalization pipeline (known as NLP2FHIR).
METHODS: We implemented a multi-class and multi-label classification system based on the i2b2 Obesity Challenge task to evaluate the FHIR-based EHR phenotyping framework. Two core parts of the framework are: (a) the conversion of discharge summaries into corresponding FHIR resources - Composition, Condition, MedicationStatement, Procedure and FamilyMemberHistory using the NLP2FHIR pipeline, and (b) the implementation of four machine learning algorithms (logistic regression, support vector machine, decision tree, and random forest) to train classifiers to predict disease state of obesity and 15 comorbidities using features extracted from standard FHIR resources and terminology expansions. We used the macro- and micro-averaged precision (P), recall (R), and F1 score (F1) measures to evaluate the classifier performance. We validated the framework using a second obesity dataset extracted from the MIMIC-III database.
RESULTS: Using the NLP2FHIR pipeline, 1237 clinical discharge summaries from the 2008 i2b2 obesity challenge dataset were represented as the instances of the FHIR Composition resource consisting of 5677 records with 16 unique section types. After the NLP processing and FHIR modeling, a set of 244,438 FHIR clinical resource instances were generated. As the results of the four machine learning classifiers, the random forest algorithm performed the best with F1-micro(0.9466)/F1-macro(0.7887) and F1-micro(0.9536)/F1-macro(0.6524) for intuitive classification (reflecting medical professionals' judgments) and textual classification (reflecting the judgments based on explicitly reported information of diseases), respectively. The MIMIC-III obesity dataset was successfully integrated for prediction with minimal configuration of the NLP2FHIR pipeline and machine learning models.
CONCLUSIONS: The study demonstrated that the FHIR-based EHR phenotyping approach could effectively identify the state of obesity and multiple comorbidities using semi-structured discharge summaries. Our FHIR-based phenotyping approach is a first concrete step towards improving the data aspect of phenotyping portability across EHR systems and enhancing interpretability of the machine learning-based phenotyping algorithms.

Entities: CellLine Chemical Disease Gene Species

Keywords: Algorithm portability; Clinical phenotyping; Electronic Health Records (EHRs); HL7 Fast Healthcare Interoperability Resources (FHIR); Natural language processing

Year: 2019 PMID： 31622801 PMCID： PMC6990976 DOI： 10.1016/j.jbi.2019.103310

Source DB: PubMed Journal: J Biomed Inform ISSN： 1532-0464 Impact factor: 6.317

21 in total

1. Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.

Authors: Guergana K Savova; James J Masanz; Philip V Ogren; Jiaping Zheng; Sunghwan Sohn; Karin C Kipper-Schuler; Christopher G Chute
Journal: J Am Med Inform Assoc Date: 2010 Sep-Oct Impact factor: 4.497

2. Recognizing obesity and comorbidities in sparse data.

Authors: Ozlem Uzuner
Journal: J Am Med Inform Assoc Date: 2009-04-23 Impact factor: 4.497

3. Evaluation of a method to identify and categorize section headers in clinical documents.

Authors: Joshua C Denny; Anderson Spickard; Kevin B Johnson; Neeraja B Peterson; Josh F Peterson; Randolph A Miller
Journal: J Am Med Inform Assoc Date: 2009-08-28 Impact factor: 4.497

4. MedXN: an open source medication extraction and normalization tool for clinical text.

Authors: Sunghwan Sohn; Cheryl Clark; Scott R Halgrim; Sean P Murphy; Christopher G Chute; Hongfang Liu
Journal: J Am Med Inform Assoc Date: 2014-03-17 Impact factor: 4.497

5. DeepPhe: A Natural Language Processing System for Extracting Cancer Phenotypes from Clinical Records.

Authors: Guergana K Savova; Eugene Tseytlin; Sean Finan; Melissa Castine; Timothy Miller; Olga Medvedeva; David Harris; Harry Hochheiser; Chen Lin; Girish Chavan; Rebecca S Jacobson
Journal: Cancer Res Date: 2017-11-01 Impact factor: 12.701

6. Semantic classification of diseases in discharge summaries using a context-aware rule-based classifier.

Authors: Illés Solt; Domonkos Tikk; Viktor Gál; Zsolt T Kardkovács
Journal: J Am Med Inform Assoc Date: 2009-04-23 Impact factor: 4.497

7. ConText: an algorithm for determining negation, experiencer, and temporal status from clinical reports.

Authors: Henk Harkema; John N Dowling; Tyler Thornblade; Wendy W Chapman
Journal: J Biomed Inform Date: 2009-05-10 Impact factor: 6.317

8. Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers.

Authors: George Hripcsak; Jon D Duke; Nigam H Shah; Christian G Reich; Vojtech Huser; Martijn J Schuemie; Marc A Suchard; Rae Woong Park; Ian Chi Kei Wong; Peter R Rijnbeek; Johan van der Lei; Nicole Pratt; G Niklas Norén; Yu-Chuan Li; Paul E Stang; David Madigan; Patrick B Ryan
Journal: Stud Health Technol Inform Date: 2015

Review 9. The Electronic Medical Records and Genomics (eMERGE) Network: past, present, and future.

Authors: Omri Gottesman; Helena Kuivaniemi; Gerard Tromp; W Andrew Faucett; Rongling Li; Teri A Manolio; Saskia C Sanderson; Joseph Kannry; Randi Zinberg; Melissa A Basford; Murray Brilliant; David J Carey; Rex L Chisholm; Christopher G Chute; John J Connolly; David Crosslin; Joshua C Denny; Carlos J Gallego; Jonathan L Haines; Hakon Hakonarson; John Harley; Gail P Jarvik; Isaac Kohane; Iftikhar J Kullo; Eric B Larson; Catherine McCarty; Marylyn D Ritchie; Dan M Roden; Maureen E Smith; Erwin P Böttinger; Marc S Williams
Journal: Genet Med Date: 2013-06-06 Impact factor: 8.822

10. Next-generation phenotyping of electronic health records.

Authors: George Hripcsak; David J Albers
Journal: J Am Med Inform Assoc Date: 2012-09-06 Impact factor: 4.497

11 in total

Review 1. HL7 FHIR-based tools and initiatives to support clinical research: a scoping review.

Authors: Stephany N Duda; Nan Kennedy; Douglas Conway; Alex C Cheng; Viet Nguyen; Teresa Zayas-Cabán; Paul A Harris
Journal: J Am Med Inform Assoc Date: 2022-08-16 Impact factor: 7.942

2. Design and validation of a FHIR-based EHR-driven phenotyping toolbox.

Authors: Pascal S Brandt; Jennifer A Pacheco; Prakash Adekkanattu; Evan T Sholle; Sajjad Abedian; Daniel J Stone; David M Knaack; Jie Xu; Zhenxing Xu; Yifan Peng; Natalie C Benda; Fei Wang; Yuan Luo; Guoqian Jiang; Jyotishman Pathak; Luke V Rasmussen
Journal: J Am Med Inform Assoc Date: 2022-08-16 Impact factor: 7.942

10. Designing an openEHR-Based Pipeline for Extracting and Standardizing Unstructured Clinical Data Using Natural Language Processing.

Authors: Antje Wulff; Marcel Mast; Marcus Hassler; Sara Montag; Michael Marschollek; Thomas Jack
Journal: Methods Inf Med Date: 2020-10-14 Impact factor: 2.176

Developing a FHIR-based EHR phenotyping framework: A case study for identification of patients with obesity and multiple comorbidities from discharge summaries.

1. Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.

2. Recognizing obesity and comorbidities in sparse data.

3. Evaluation of a method to identify and categorize section headers in clinical documents.

4. MedXN: an open source medication extraction and normalization tool for clinical text.

5. DeepPhe: A Natural Language Processing System for Extracting Cancer Phenotypes from Clinical Records.

6. Semantic classification of diseases in discharge summaries using a context-aware rule-based classifier.

7. ConText: an algorithm for determining negation, experiencer, and temporal status from clinical reports.

8. Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers.

Review 9. The Electronic Medical Records and Genomics (eMERGE) Network: past, present, and future.

10. Next-generation phenotyping of electronic health records.

Review 1. HL7 FHIR-based tools and initiatives to support clinical research: a scoping review.

2. Design and validation of a FHIR-based EHR-driven phenotyping toolbox.

3. Inference for the Case Probability in High-dimensional Logistic Regression.

4. FHIR-Ontop-OMOP: Building clinical knowledge graphs in FHIR RDF with the OMOP Common data Model.

5. A corpus-driven standardization framework for encoding clinical problems with HL7 FHIR.

Review 6. A Year of Papers Using Biomedical Texts.

7. Ontologies, Knowledge Representation, and Machine Learning for Translational Research: Recent Contributions.

8. Development of a repository of computable phenotype definitions using the clinical quality language.

9. Deep phenotyping: Embracing complexity and temporality-Towards scalability, portability, and interoperability.

10. Designing an openEHR-Based Pipeline for Extracting and Standardizing Unstructured Clinical Data Using Natural Language Processing.