| Literature DB >> 26306255 |
Kenney Ng1, Jimeng Sun2, Jianying Hu1, Fei Wang3.
Abstract
Personalized predictive models are customized for an individual patient and trained using information from similar patients. Compared to global models trained on all patients, they have the potential to produce more accurate risk scores and capture more relevant risk factors for individual patients. This paper presents an approach for building personalized predictive models and generating personalized risk factor profiles. A locally supervised metric learning (LSML) similarity measure is trained for diabetes onset and used to find clinically similar patients. Personalized risk profiles are created by analyzing the parameters of the trained personalized logistic regression models. A 15,000 patient data set, derived from electronic health records, is used to evaluate the approach. The predictive results show that the personalized models can outperform the global model. Cluster analysis of the risk profiles show groups of patients with similar risk factors, differences in the top risk factors for different groups of patients and differences between the individual and global risk factors.Entities:
Year: 2015 PMID: 26306255 PMCID: PMC4525240
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
Figure 1:Performance of the personalized logistic regression model in terms of AUC as a function of the number of nearest neighbor training patients for different classifier configurations.
Figure 2:Hierarchical heat map plot showing the top risk factors for diabetes onset identified by the personalized predictive models for 500 randomly selected patients. Patient specific risk factor profiles (the columns) are clustered along the horizontal axis. Risk factors (the rows) are clustered along the vertical axis. Risk factors captured by the global model are highlighted and have a * prefix in the name. The risk score for each patient is plotted as a vertical bar along the bottom.