Literature DB >> 33875163

A novel method for clinical risk prediction with low-quality data.

Zeyuan Wang1, Josiah Poon2, Shuze Wang3, Shiding Sun4, Simon Poon5.   

Abstract

In real-world data, predictive models for clinical risks (such as adverse drug reactions, hospital readmission, and chronic disease onset) are constantly struggling with low-quality issues, namely redundant and highly correlated features, extreme category imbalances, and most importantly, a large number of missing values. In most existing work, each patient is represented as a value vector with the fixed-length from some feature space, and missing values are forced to be imputed, which introduces much noise for prediction if the data set is highly incomplete. Besides, other challenges are either remaining unresolved or only partially solved when modeling, but without a systematic approach. In this paper, we propose a novel framework to address these low-quality problems, that we first treat patients as bags with the various number of feature-value pairs, called instances, and map them to an embedding space through our proposed feature embedding method to learn from it directly. In this way, predictive models can avoid the negative impact of missing data naturally. A novel multi-instance neural network is then connected, using two computational modules to deal with the problems of correlated and redundant features: multi-head attention and attention-based multi-instance pooling. They are capable of capturing the instance correlations and locating valuable information in each instance or bag. The feature embedding and multi-instance neural network are parameterized and optimized jointly in an end-to-end manner. Moreover, the training process is under both main and auxiliary supervision with focal loss functions to avoid the caveat of a highly imbalanced label set. This proposed framework is named AMI-Net3. We evaluate it on three suitable data sets from real-world settings with different clinical risk prediction tasks: adverse drug reaction of risperidone, schizophrenia relapse, and invasive fungi infection, respectively. The comprehensive experimental results demonstrate the effectiveness and superiority of our proposed method over competitive baselines.
Copyright © 2021 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Clinical risk prediction; Feature embedding; Incomplete data; Multi-instance learning; Transformer

Year:  2021        PMID: 33875163     DOI: 10.1016/j.artmed.2021.102052

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  1 in total

1.  Prediction Model of International Trade Risk Based on Stochastic Time-Series Neural Network.

Authors:  Lei Xu; Guicai Dong
Journal:  Comput Intell Neurosci       Date:  2022-06-16
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.