Bankole Olatosi1, Xiaowen Sun2, Shujie Chen2, Jiajia Zhang2, Chen Liang1, Sharon Weissman3, Xiaoming Li4. 1. Department of Health Services Policy and Management. 2. Department of Epidemiology and Biostatistics. 3. Department of Internal Medicine, School of Medicine, University of South Carolina. 4. Department of Health Promotion, Education, and Behavior, Arnold School of Public Health, Columbia, South Carolina, USA.
Abstract
OBJECTIVES: Ending the HIV epidemic requires innovative use of data for intelligent decision-making from surveillance through treatment. This study sought to examine the usefulness of using linked integrated PLWH health data to predict PLWH's future HIV care status and compare the performance of machine-learning methods for predicting future HIV care status for SC PLWH. DESIGN: We employed supervised machine learning for its ability to predict PLWH's future care status by synthesizing and learning from PLWH's existing health data. This method is appropriate for the nature of integrated PLWH data because of its high volume and dimensionality. METHODS: A data set of 8888 distinct PLWH's health records were retrieved from an integrated PLWH data repository. We experimented and scored seven representative machine-learning models including Bayesian Network, Automated Neural Network, Support Vector Machine, Logistic Regression, LASSO, Decision Trees and Random Forest to best predict PLWH's care status. We further identified principal factors that can predict the retention-in-care based on the champion model. RESULTS: Bayesian Network (F = 0.87, AUC = 0.94, precision = 0.87, recall = 0.86) was the best predictive model, followed by Random Forest (F = 0.78, AUC = 0.81, precision = 0.72, recall = 0.85), Decision Tree (F = 0.76, AUC = 0.75, precision = 0.70, recall = 0.82) and Neural Network (cluster) (F = 0.75, AUC = 0.71, precision = 0.69, recall = 0.81). CONCLUSION: These algorithmic applications of Bayesian Networks and other machine-learning algorithms hold promise for predicting future HIV care status at the individual level. Prediction of future care patterns for SC PLWH can help optimize health service resources for effective interventions. Predictions can also help improve retention across the HIV continuum.
OBJECTIVES: Ending the HIV epidemic requires innovative use of data for intelligent decision-making from surveillance through treatment. This study sought to examine the usefulness of using linked integrated PLWH health data to predict PLWH's future HIV care status and compare the performance of machine-learning methods for predicting future HIV care status for SC PLWH. DESIGN: We employed supervised machine learning for its ability to predict PLWH's future care status by synthesizing and learning from PLWH's existing health data. This method is appropriate for the nature of integrated PLWH data because of its high volume and dimensionality. METHODS: A data set of 8888 distinct PLWH's health records were retrieved from an integrated PLWH data repository. We experimented and scored seven representative machine-learning models including Bayesian Network, Automated Neural Network, Support Vector Machine, Logistic Regression, LASSO, Decision Trees and Random Forest to best predict PLWH's care status. We further identified principal factors that can predict the retention-in-care based on the champion model. RESULTS: Bayesian Network (F = 0.87, AUC = 0.94, precision = 0.87, recall = 0.86) was the best predictive model, followed by Random Forest (F = 0.78, AUC = 0.81, precision = 0.72, recall = 0.85), Decision Tree (F = 0.76, AUC = 0.75, precision = 0.70, recall = 0.82) and Neural Network (cluster) (F = 0.75, AUC = 0.71, precision = 0.69, recall = 0.81). CONCLUSION: These algorithmic applications of Bayesian Networks and other machine-learning algorithms hold promise for predicting future HIV care status at the individual level. Prediction of future care patterns for SC PLWH can help optimize health service resources for effective interventions. Predictions can also help improve retention across the HIV continuum.
Authors: Bryan Hartzler; Julia C Dombrowski; Jason R Williams; Heidi M Crane; Joseph J Eron; Elvin H Geng; Christopher Mathews; Kenneth H Mayer; Richard D Moore; Michael J Mugavero; Sonia Napravnik; Benigno Rodriguez; Dennis M Donovan Journal: AIDS Behav Date: 2018-03
Authors: Melanie A Thompson; Michael J Mugavero; K Rivet Amico; Victoria A Cargill; Larry W Chang; Robert Gross; Catherine Orrell; Frederick L Altice; David R Bangsberg; John G Bartlett; Curt G Beckwith; Nadia Dowshen; Christopher M Gordon; Tim Horn; Princy Kumar; James D Scott; Michael J Stirratt; Robert H Remien; Jane M Simoni; Jean B Nachega Journal: Ann Intern Med Date: 2012-03-05 Impact factor: 25.391
Authors: Michael J Mugavero; Hui-Yi Lin; James H Willig; Andrew O Westfall; Kimberly B Ulett; Justin S Routman; Sarah Abroms; James L Raper; Michael S Saag; Jeroan J Allison Journal: Clin Infect Dis Date: 2009-01-15 Impact factor: 9.079
Authors: Lindsey R Haas; Paul Y Takahashi; Nilay D Shah; Robert J Stroebel; Matthew E Bernard; Dawn M Finnie; James M Naessens Journal: Am J Manag Care Date: 2013-09 Impact factor: 2.229