Literature DB >> 25841328

Building bridges across electronic health record systems through inferred phenotypic topics.

You Chen1, Joydeep Ghosh2, Cosmin Adrian Bejan3, Carl A Gunter4, Siddharth Gupta4, Abel Kho5, David Liebovitz5, Jimeng Sun6, Joshua Denny7, Bradley Malin8.   

Abstract

OBJECTIVE: Data in electronic health records (EHRs) is being increasingly leveraged for secondary uses, ranging from biomedical association studies to comparative effectiveness. To perform studies at scale and transfer knowledge from one institution to another in a meaningful way, we need to harmonize the phenotypes in such systems. Traditionally, this has been accomplished through expert specification of phenotypes via standardized terminologies, such as billing codes. However, this approach may be biased by the experience and expectations of the experts, as well as the vocabulary used to describe such patients. The goal of this work is to develop a data-driven strategy to (1) infer phenotypic topics within patient populations and (2) assess the degree to which such topics facilitate a mapping across populations in disparate healthcare systems.
METHODS: We adapt a generative topic modeling strategy, based on latent Dirichlet allocation, to infer phenotypic topics. We utilize a variance analysis to assess the projection of a patient population from one healthcare system onto the topics learned from another system. The consistency of learned phenotypic topics was evaluated using (1) the similarity of topics, (2) the stability of a patient population across topics, and (3) the transferability of a topic across sites. We evaluated our approaches using four months of inpatient data from two geographically distinct healthcare systems: (1) Northwestern Memorial Hospital (NMH) and (2) Vanderbilt University Medical Center (VUMC).
RESULTS: The method learned 25 phenotypic topics from each healthcare system. The average cosine similarity between matched topics across the two sites was 0.39, a remarkably high value given the very high dimensionality of the feature space. The average stability of VUMC and NMH patients across the topics of two sites was 0.988 and 0.812, respectively, as measured by the Pearson correlation coefficient. Also the VUMC and NMH topics have smaller variance of characterizing patient population of two sites than standard clinical terminologies (e.g., ICD9), suggesting they may be more reliably transferred across hospital systems.
CONCLUSIONS: Phenotypic topics learned from EHR data can be more stable and transferable than billing codes for characterizing the general status of a patient population. This suggests that EHR-based research may be able to leverage such phenotypic topics as variables when pooling patient populations in predictive models.
Copyright © 2015 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Clinical phenotype modeling; Computers and information processing; Data mining; Electronic medical records; Medical information systems; Pattern recognition

Mesh:

Year:  2015        PMID: 25841328      PMCID: PMC4464930          DOI: 10.1016/j.jbi.2015.03.011

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  39 in total

1.  Data warehouse and data mining in a surgical clinic.

Authors:  G Tusch; M Müller; K Rohwer-Mensching; K Heiringhoff; J Klempnauer
Journal:  Stud Health Technol Inform       Date:  2000

2.  Data warehousing in disease management programs.

Authors:  D C Ramick
Journal:  J Healthc Inf Manag       Date:  2001

3.  Results from data mining in a radiology department: the relevance of data quality.

Authors:  Martin Lang; Nanda Kirpekar; Thomas Bürkle; Susanne Laumann; Hans-Ulrich Prokosch
Journal:  Stud Health Technol Inform       Date:  2007

4.  Validation of electronic medical record-based phenotyping algorithms: results and lessons learned from the eMERGE network.

Authors:  Katherine M Newton; Peggy L Peissig; Abel Ngo Kho; Suzette J Bielinski; Richard L Berg; Vidhu Choudhary; Melissa Basford; Christopher G Chute; Iftikhar J Kullo; Rongling Li; Jennifer A Pacheco; Luke V Rasmussen; Leslie Spangler; Joshua C Denny
Journal:  J Am Med Inform Assoc       Date:  2013-03-26       Impact factor: 4.497

5.  A concordance correlation coefficient to evaluate reproducibility.

Authors:  L I Lin
Journal:  Biometrics       Date:  1989-03       Impact factor: 2.571

6.  Use of computerized algorithm to identify individuals in need of testing for celiac disease.

Authors:  Jonas F Ludvigsson; Jyotishman Pathak; Sean Murphy; Matthew Durski; Phillip S Kirsch; Christophe G Chute; Euijung Ryu; Joseph A Murray
Journal:  J Am Med Inform Assoc       Date:  2013-08-16       Impact factor: 4.497

7.  PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations.

Authors:  Joshua C Denny; Marylyn D Ritchie; Melissa A Basford; Jill M Pulley; Lisa Bastarache; Kristin Brown-Gentry; Deede Wang; Dan R Masys; Dan M Roden; Dana C Crawford
Journal:  Bioinformatics       Date:  2010-03-24       Impact factor: 6.937

8.  Supporting communication in an integrated patient record system.

Authors:  Dario A Giuse
Journal:  AMIA Annu Symp Proc       Date:  2003

9.  Normalization and standardization of electronic health records for high-throughput phenotyping: the SHARPn consortium.

Authors:  Jyotishman Pathak; Kent R Bailey; Calvin E Beebe; Steven Bethard; David C Carrell; Pei J Chen; Dmitriy Dligach; Cory M Endle; Lacey A Hart; Peter J Haug; Stanley M Huff; Vinod C Kaggal; Dingcheng Li; Hongfang Liu; Kyle Marchant; James Masanz; Timothy Miller; Thomas A Oniki; Martha Palmer; Kevin J Peterson; Susan Rea; Guergana K Savova; Craig R Stancl; Sunghwan Sohn; Harold R Solbrig; Dale B Suesse; Cui Tao; David P Taylor; Les Westberg; Stephen Wu; Ning Zhuo; Christopher G Chute
Journal:  J Am Med Inform Assoc       Date:  2013-11-04       Impact factor: 4.497

Review 10.  eMERGEing progress in genomics-the first seven years.

Authors:  Dana C Crawford; David R Crosslin; Gerard Tromp; Iftikhar J Kullo; Helena Kuivaniemi; M Geoffrey Hayes; Joshua C Denny; William S Bush; Jonathan L Haines; Dan M Roden; Catherine A McCarty; Gail P Jarvik; Marylyn D Ritchie
Journal:  Front Genet       Date:  2014-06-17       Impact factor: 4.599

View more
  16 in total

1.  Learning Clinical Workflows to Identify Subgroups of Heart Failure Patients.

Authors:  Chao Yan; You Chen; Bo Li; David Liebovitz; Bradley Malin
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

Review 2.  Clinical Data Reuse or Secondary Use: Current Status and Potential Future Progress.

Authors:  S M Meystre; C Lovis; T Bürkle; G Tognola; A Budrionis; C U Lehmann
Journal:  Yearb Med Inform       Date:  2017-09-11

3.  Detecting time-evolving phenotypic topics via tensor factorization on electronic health records: Cardiovascular disease case study.

Authors:  Juan Zhao; Yun Zhang; David J Schlueter; Patrick Wu; Vern Eric Kerchberger; S Trent Rosenbloom; Quinn S Wells; QiPing Feng; Joshua C Denny; Wei-Qi Wei
Journal:  J Biomed Inform       Date:  2019-08-22       Impact factor: 6.317

4.  Inferring Clinical Workflow Efficiency via Electronic Medical Record Utilization.

Authors:  You Chen; Wei Xie; Carl A Gunter; David Liebovitz; Sanjay Mehrotra; He Zhang; Bradley Malin
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

5.  Learning probabilistic phenotypes from heterogeneous EHR data.

Authors:  Rimma Pivovarov; Adler J Perotte; Edouard Grave; John Angiolillo; Chris H Wiggins; Noémie Elhadad
Journal:  J Biomed Inform       Date:  2015-10-14       Impact factor: 6.317

6.  AUDIT-C and ICD codes as phenotypes for harmful alcohol use: association with ADH1B polymorphisms in two US populations.

Authors:  Amy C Justice; Rachel V Smith; Janet P Tate; Kathleen McGinnis; Ke Xu; William C Becker; Kuang-Yao Lee; Kevin Lynch; Ning Sun; John Concato; David A Fiellin; Hongyu Zhao; Joel Gelernter; Henry R Kranzler
Journal:  Addiction       Date:  2018-08-01       Impact factor: 6.526

7.  Interaction patterns of trauma providers are associated with length of stay.

Authors:  You Chen; Mayur B Patel; Candace D McNaughton; Bradley A Malin
Journal:  J Am Med Inform Assoc       Date:  2018-07-01       Impact factor: 4.497

8.  Identifying collaborative care teams through electronic medical record utilization patterns.

Authors:  You Chen; Nancy M Lorenzi; Warren S Sandberg; Kelly Wolgast; Bradley A Malin
Journal:  J Am Med Inform Assoc       Date:  2017-04-01       Impact factor: 4.497

9.  Predicting Length of Stay for Obstetric Patients via Electronic Medical Records.

Authors:  Cheng Gao; Abel N Kho; Catherine Ivory; Sarah Osmundson; Bradley A Malin; You Chen
Journal:  Stud Health Technol Inform       Date:  2017

10.  Large-Scale Discovery of Disease-Disease and Disease-Gene Associations.

Authors:  Djordje Gligorijevic; Jelena Stojanovic; Nemanja Djuric; Vladan Radosavljevic; Mihajlo Grbovic; Rob J Kulathinal; Zoran Obradovic
Journal:  Sci Rep       Date:  2016-08-31       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.