Patient representation learning and interpretable evaluation using clinical notes.

Madhumita Sushil¹, Simon Šuster², Kim Luyckx³, Walter Daelemans².

Abstract

This work makes three contributions.

1. We explore the utility of a stacked denoising autoencoder and a paragraph vector model for learning task-independent dense patient representations directly from clinical notes. To analyze whether these representations transfer across tasks, we evaluate them in multiple supervised setups to predict patient mortality, primary diagnostic and procedural category, and gender. We compare their performance with sparse representations obtained from a bag-of-words model. We observe that the learned generalized representations significantly outperform the sparse representations when there are few positive instances to learn from and strong lexical features are absent.

2. We compare the performance of a model whose feature set is constructed from a bag of words with that of a model whose feature set is constructed from medical concepts. In the latter case, concepts represent problems, treatments, and tests. We find that concept identification does not improve classification performance.

3. We propose novel techniques to facilitate model interpretability. To understand and interpret the representations, we explore the best encoded features within the patient representations obtained from the autoencoder model. Further, we calculate feature sensitivity across two networks to identify the most significant input features for different classification tasks when these pretrained representations are used as the supervised input. We successfully extract the most influential features for the pipeline using this technique.
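As a rough illustration of the first contribution, the following is a minimal sketch of a single denoising autoencoder layer trained on binary bag-of-words vectors. It is not the authors' implementation: the abstract does not give the architecture, so the toy notes, the tied weights, the sigmoid units, the masking noise, and every name and hyperparameter (`bag_of_words`, `DenoisingAutoencoder`, `drop_prob`, `lr`) are assumptions made for the example. A stacked model would train further layers of this kind on the hidden codes of the previous layer.

```python
import math
import random


def bag_of_words(notes):
    """Map each note to a binary term-presence vector over a shared vocabulary."""
    vocab = sorted({tok for note in notes for tok in note.lower().split()})
    index = {tok: i for i, tok in enumerate(vocab)}
    vectors = []
    for note in notes:
        vec = [0.0] * len(vocab)
        for tok in note.lower().split():
            vec[index[tok]] = 1.0
        vectors.append(vec)
    return vocab, vectors


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


class DenoisingAutoencoder:
    """One layer with tied weights: h = s(W x + b), x_hat = s(W^T h + c)."""

    def __init__(self, n_in, n_hidden, rng):
        self.n_in, self.n_hidden = n_in, n_hidden
        self.W = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
                  for _ in range(n_hidden)]
        self.b = [0.0] * n_hidden
        self.c = [0.0] * n_in

    def encode(self, x):
        return [sigmoid(sum(w * xi for w, xi in zip(row, x)) + bj)
                for row, bj in zip(self.W, self.b)]

    def decode(self, h):
        return [sigmoid(sum(self.W[j][i] * h[j] for j in range(self.n_hidden))
                        + self.c[i])
                for i in range(self.n_in)]

    def loss(self, x):
        """Cross-entropy between a clean input and its reconstruction."""
        x_hat = self.decode(self.encode(x))
        eps = 1e-9
        return -sum(xi * math.log(p + eps) + (1 - xi) * math.log(1 - p + eps)
                    for xi, p in zip(x, x_hat))

    def train_step(self, x, rng, drop_prob, lr):
        """One SGD step: corrupt the input, then reconstruct the clean input."""
        noisy = [0.0 if rng.random() < drop_prob else xi for xi in x]
        h = self.encode(noisy)
        x_hat = self.decode(h)
        # Gradient of the cross-entropy w.r.t. the decoder pre-activations.
        d_out = [p - xi for p, xi in zip(x_hat, x)]
        # Back-propagate to the encoder pre-activations.
        d_hid = [sum(self.W[j][i] * d_out[i] for i in range(self.n_in))
                 * h[j] * (1 - h[j]) for j in range(self.n_hidden)]
        for j in range(self.n_hidden):
            for i in range(self.n_in):
                # Tied weights receive a decoder term and an encoder term.
                self.W[j][i] -= lr * (d_out[i] * h[j] + d_hid[j] * noisy[i])
            self.b[j] -= lr * d_hid[j]
        for i in range(self.n_in):
            self.c[i] -= lr * d_out[i]


# Hypothetical toy notes, invented for illustration only.
notes = [
    "chest pain elevated troponin",
    "chest pain normal troponin",
    "fever cough pneumonia",
    "fever cough chest xray",
]
rng = random.Random(0)
vocab, vectors = bag_of_words(notes)
model = DenoisingAutoencoder(len(vocab), 3, rng)

loss_before = sum(model.loss(x) for x in vectors)
for _ in range(500):
    for x in vectors:
        model.train_step(x, rng, drop_prob=0.2, lr=0.3)
loss_after = sum(model.loss(x) for x in vectors)

# Dense representations: one hidden code per note.
representations = [model.encode(x) for x in vectors]
print(loss_before, loss_after)
```

In a setup like the one the abstract describes, the hidden codes in `representations` would play the role of the dense patient representations passed to downstream classifiers, while `vectors` corresponds to the sparse bag-of-words baseline.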

Keywords: Model interpretability; Natural language processing; Patient representations; Representation learning; Unsupervised learning

Year: 2018; PMID: 29966746; DOI: 10.1016/j.jbi.2018.06.016

Source DB: PubMed; Journal: J Biomed Inform; ISSN: 1532-0464; Impact factor: 6.317

Cited by

7 in total

1. Toward a clinical text encoder: pretraining for clinical natural language processing with applications to substance misuse.

Authors: Dmitriy Dligach; Majid Afshar; Timothy Miller
Journal: J Am Med Inform Assoc; Date: 2019-11-01; Impact factor: 4.497

2. Deep learning in clinical natural language processing: a methodical review. (Review)

Authors: Stephen Wu; Kirk Roberts; Surabhi Datta; Jingcheng Du; Zongcheng Ji; Yuqi Si; Sarvesh Soni; Qiong Wang; Qiang Wei; Yang Xiang; Bo Zhao; Hua Xu
Journal: J Am Med Inform Assoc; Date: 2020-03-01; Impact factor: 4.497

3. Real-world Patient Trajectory Prediction from Clinical Notes Using Artificial Neural Networks and UMLS-Based Extraction of Concepts.

Authors: Jamil Zaghir; Jose F Rodrigues-Jr; Lorraine Goeuriot; Sihem Amer-Yahia
Journal: J Healthc Inform Res; Date: 2021-06-05

4. Pre-training phenotyping classifiers.

5. Representation learning for clinical time series prediction tasks in electronic health records.

6. Combining structured and unstructured data for predictive models: a deep learning approach.

7. Patient Representation Learning From Heterogeneous Data Sources and Knowledge Graphs Using Deep Collective Matrix Factorization: Evaluation Study.

Authors: Sajit Kumar; Alicia Nanelia; Ragunathan Mariappan; Adithya Rajagopal; Vaibhav Rajan
Journal: JMIR Med Inform; Date: 2022-01-20