Literature DB >> 29375929

Joint Learning of Representations of Medical Concepts and Words from EHR Data.

Tian Bai1, Ashis Kumar Chanda1, Brian L Egleston2, Slobodan Vucetic1.   

Abstract

There has been an increasing interest in learning low-dimensional vector representations of medical concepts from electronic health records (EHRs). While EHRs contain structured data such as diagnostic codes and laboratory tests, they also contain unstructured clinical notes, which provide more nuanced details on a patient's health status. In this work, we propose a method that jointly learns medical concept and word representations. In particular, we focus on capturing the relationship between medical codes and words by using a novel learning scheme for word2vec model. Our method exploits relationships between different parts of EHRs in the same visit and embeds both codes and words in the same continuous vector space. In the end, we are able to derive clusters which reflect distinct disease and treatment patterns. In our experiments, we qualitatively show how our methods of grouping words for given diagnostic codes compares with a topic modeling approach. We also test how well our representations can be used to predict disease patterns of the next visit. The results show that our approach outperforms several common methods.

Entities:  

Year:  2017        PMID: 29375929      PMCID: PMC5783648          DOI: 10.1109/BIBM.2017.8217752

Source DB:  PubMed          Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)        ISSN: 2156-1125


  14 in total

1.  Prediction modeling using EHR data: challenges, strategies, and a comparison of machine learning approaches.

Authors:  Jionglin Wu; Jason Roy; Walter F Stewart
Journal:  Med Care       Date:  2010-06       Impact factor: 2.983

2.  A Study of Neural Word Embeddings for Named Entity Recognition in Clinical Text.

Authors:  Yonghui Wu; Jun Xu; Min Jiang; Yaoyun Zhang; Hua Xu
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

3.  Clinical Case-based Retrieval Using Latent Topic Analysis.

Authors:  Corey W Arnold; Suzie M El-Saden; Alex A T Bui; Ricky Taira
Journal:  AMIA Annu Symp Proc       Date:  2010-11-13

4.  Learning probabilistic phenotypes from heterogeneous EHR data.

Authors:  Rimma Pivovarov; Adler J Perotte; Edouard Grave; John Angiolillo; Chris H Wiggins; Noémie Elhadad
Journal:  J Biomed Inform       Date:  2015-10-14       Impact factor: 6.317

5.  Doctor AI: Predicting Clinical Events via Recurrent Neural Networks.

Authors:  Edward Choi; Mohammad Taha Bahadori; Andy Schuetz; Walter F Stewart; Jimeng Sun
Journal:  JMLR Workshop Conf Proc       Date:  2016-12-10

6.  A Visualization of Evolving Clinical Sentiment Using Vector Representations of Clinical Notes.

Authors:  Mohammad M Ghassemi; Roger G Mark; Shamim Nemati
Journal:  Comput Cardiol (2010)       Date:  2016-02-18

Review 7.  Mining electronic health records: towards better research applications and clinical care.

Authors:  Peter B Jensen; Lars J Jensen; Søren Brunak
Journal:  Nat Rev Genet       Date:  2012-05-02       Impact factor: 53.242

8.  Modeling Healthcare Quality via Compact Representations of Electronic Health Records.

Authors:  Jelena Stojanovic; Djordje Gligorijevic; Vladan Radosavljevic; Nemanja Djuric; Mihajlo Grbovic; Zoran Obradovic
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2016-07-14       Impact factor: 3.710

9.  MIMIC-III, a freely accessible critical care database.

Authors:  Alistair E W Johnson; Tom J Pollard; Lu Shen; Li-Wei H Lehman; Mengling Feng; Mohammad Ghassemi; Benjamin Moody; Peter Szolovits; Leo Anthony Celi; Roger G Mark
Journal:  Sci Data       Date:  2016-05-24       Impact factor: 6.444

10.  Using recurrent neural network models for early detection of heart failure onset.

Authors:  Edward Choi; Andy Schuetz; Walter F Stewart; Jimeng Sun
Journal:  J Am Med Inform Assoc       Date:  2017-03-01       Impact factor: 4.497

View more
  6 in total

1.  Distributed learning from multiple EHR databases: Contextual embedding models for medical events.

Authors:  Ziyi Li; Kirk Roberts; Xiaoqian Jiang; Qi Long
Journal:  J Biomed Inform       Date:  2019-02-27       Impact factor: 6.317

2.  Mining Medication Use Patterns from Clinical Notes for Breast Cancer Patients Through a Two-Stage Topic Modeling Approach.

Authors:  Kimberley Es Kondratieff; J Thomas Brown; Marily Barron; Jeremy L Warner; Zhijun Yin
Journal:  AMIA Annu Symp Proc       Date:  2022-05-23

3.  Medical Concept Representation Learning from Multi-source Data.

Authors:  Tian Bai; Brian L Egleston; Richard Bleicher; Slobodan Vucetic
Journal:  IJCAI (U S)       Date:  2019-07

4.  EHR phenotyping via jointly embedding medical concepts and words into a unified vector space.

Authors:  Tian Bai; Ashis Kumar Chanda; Brian L Egleston; Slobodan Vucetic
Journal:  BMC Med Inform Decis Mak       Date:  2018-12-12       Impact factor: 2.796

5.  Improving medical term embeddings using UMLS Metathesaurus.

Authors:  Ashis Kumar Chanda; Tian Bai; Ziyu Yang; Slobodan Vucetic
Journal:  BMC Med Inform Decis Mak       Date:  2022-04-29       Impact factor: 3.298

6.  Generating contextual embeddings for emergency department chief complaints.

Authors:  David Chang; Woo Suk Hong; Richard Andrew Taylor
Journal:  JAMIA Open       Date:  2020-07-15
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.