Literature DB >> 29966746

Patient representation learning and interpretable evaluation using clinical notes.

Madhumita Sushil1, Simon Šuster2, Kim Luyckx3, Walter Daelemans2.   

Abstract

We have three contributions in this work: 1. We explore the utility of a stacked denoising autoencoder and a paragraph vector model to learn task-independent dense patient representations directly from clinical notes. To analyze if these representations are transferable across tasks, we evaluate them in multiple supervised setups to predict patient mortality, primary diagnostic and procedural category, and gender. We compare their performance with sparse representations obtained from a bag-of-words model. We observe that the learned generalized representations significantly outperform the sparse representations when we have few positive instances to learn from, and there is an absence of strong lexical features. 2. We compare the model performance of the feature set constructed from a bag of words to that obtained from medical concepts. In the latter case, concepts represent problems, treatments, and tests. We find that concept identification does not improve the classification performance. 3. We propose novel techniques to facilitate model interpretability. To understand and interpret the representations, we explore the best encoded features within the patient representations obtained from the autoencoder model. Further, we calculate feature sensitivity across two networks to identify the most significant input features for different classification tasks when we use these pretrained representations as the supervised input. We successfully extract the most influential features for the pipeline using this technique.
Copyright © 2018 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Model interpretability; Natural language processing; Patient representations; Representation learning; Unsupervised learning

Mesh:

Year:  2018        PMID: 29966746     DOI: 10.1016/j.jbi.2018.06.016

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  7 in total

1.  Toward a clinical text encoder: pretraining for clinical natural language processing with applications to substance misuse.

Authors:  Dmitriy Dligach; Majid Afshar; Timothy Miller
Journal:  J Am Med Inform Assoc       Date:  2019-11-01       Impact factor: 4.497

Review 2.  Deep learning in clinical natural language processing: a methodical review.

Authors:  Stephen Wu; Kirk Roberts; Surabhi Datta; Jingcheng Du; Zongcheng Ji; Yuqi Si; Sarvesh Soni; Qiong Wang; Qiang Wei; Yang Xiang; Bo Zhao; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2020-03-01       Impact factor: 4.497

3.  Real-world Patient Trajectory Prediction from Clinical Notes Using Artificial Neural Networks and UMLS-Based Extraction of Concepts.

Authors:  Jamil Zaghir; Jose F Rodrigues-Jr; Lorraine Goeuriot; Sihem Amer-Yahia
Journal:  J Healthc Inform Res       Date:  2021-06-05

4.  Pre-training phenotyping classifiers.

Authors:  Dmitriy Dligach; Majid Afshar; Timothy Miller
Journal:  J Biomed Inform       Date:  2020-11-28       Impact factor: 6.317

5.  Representation learning for clinical time series prediction tasks in electronic health records.

Authors:  Tong Ruan; Liqi Lei; Yangming Zhou; Jie Zhai; Le Zhang; Ping He; Ju Gao
Journal:  BMC Med Inform Decis Mak       Date:  2019-12-17       Impact factor: 2.796

6.  Combining structured and unstructured data for predictive models: a deep learning approach.

Authors:  Dongdong Zhang; Changchang Yin; Jucheng Zeng; Xiaohui Yuan; Ping Zhang
Journal:  BMC Med Inform Decis Mak       Date:  2020-10-29       Impact factor: 2.796

7.  Patient Representation Learning From Heterogeneous Data Sources and Knowledge Graphs Using Deep Collective Matrix Factorization: Evaluation Study.

Authors:  Sajit Kumar; Alicia Nanelia; Ragunathan Mariappan; Adithya Rajagopal; Vaibhav Rajan
Journal:  JMIR Med Inform       Date:  2022-01-20
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.