| Literature DB >> 32386061 |
Jan A Roth1,2, Gorjan Radevski3, Catia Marzolini1, Andri Rauch4, Huldrych F Günthard5,6, Roger D Kouyos5,6, Christoph A Fux7, Alexandra U Scherrer5,6, Alexandra Calmy8, Matthias Cavassini9, Christian R Kahlert10,11, Enos Bernasconi12, Jasmina Bogojeska3, Manuel Battegay1.
Abstract
BACKGROUND: It is unclear whether data-driven machine learning models, which are trained on large epidemiological cohorts, may improve prediction of comorbidities in people living with human immunodeficiency virus (HIV).Entities:
Keywords: HIV; chronic kidney disease; digital epidemiology; machine learning; prediction
Mesh:
Year: 2021 PMID: 32386061 PMCID: PMC8514185 DOI: 10.1093/infdis/jiaa236
Source DB: PubMed Journal: J Infect Dis ISSN: 0022-1899 Impact factor: 7.759
Figure 1.Study population. a Calculated using the Chronic Kidney Disease Epidemiology Collaboration equation. b Baseline is defined as the first creatinine measurement after 1 January 2002. Abbreviation: SHCS, Swiss HIV Cohort Study.
Main Characteristics of the Study Population
| Characteristic | All (N = 12 761) | Individuals Without CKDa (n = 11 569) | Individuals With CKDa (n = 1192) | |||
|---|---|---|---|---|---|---|
| Age, y, median (IQR) | ||||||
| Baseline | 39 | (33–46) | 48 | (33–45) | 38 | (40–57) |
| End of follow-up | 49 | (41–56) | 56 | (41–55) | 49 | (50–65) |
| Sex | ||||||
| Male | 9156 | (72) | 8319 | (72) | 837 | (70) |
| Female | 3605 | (28) | 3250 | (28) | 355 | (30) |
| Race/ethnicity | ||||||
| White | 9964 | (78) | 8851 | (77) | 1113 | (93) |
| Black | 1825 | (14) | 1783 | (15) | 42 | (4) |
| Hispanic | 444 | (3) | 433 | (4) | 11 | (1) |
| Asian | 482 | (4) | 458 | (4) | 24 | (2) |
| Other/unknown | 46 | (0.4) | 44 | (0.4) | 2 | (0.2) |
| IDU prior to HIV diagnosis | ||||||
| Yes | 2287 | (18) | 2047 | (18) | 240 | (20) |
| No | 10 408 | (82) | 9465 | (82) | 943 | (79) |
| Unknown | 66 | (0.005) | 57 | (0.005) | 9 | (0.008) |
| Ever smoked | ||||||
| Yes | 7906 | (62) | 7158 | (62) | 748 | (63) |
| No | 4815 | (38) | 4372 | (38) | 443 | (37) |
| Unknown | 40 | (0.3) | 39 | (0.3) | 1 | (0.1) |
| Hypertension | ||||||
| Yes | 729 | (5.7) | 575 | (5.7) | 154 | (12.9) |
| No | 11 963 | (94) | 10 928 | (94) | 1035 | (86.8) |
| Unknown | 69 | (0.5) | 66 | (0.5) | 3 | (0.3) |
| eGFRb, mL/min/1.73 m2, median (IQR) | ||||||
| Baseline | 103 | (90–114) | 105 | (92–115) | 84 | (73–96) |
| End of study | 90 | (75–104) | 93 | (80–106) | 55 | (50–58) |
| CD4 count, cells/µL, median (IQR) | ||||||
| Baseline | 407 | (252–597) | 410 | (255–600) | 366 | (228–561) |
| End of study | 615 | (426–830) | 621 | (437–839) | 536 | (362–759) |
| Viral load, copies/mL, median (IQR) | ||||||
| Baseline | 883 | (0–35 173) | 1040 | (0–36 000) | 174 | (0–23 459) |
| End of study | 0 | (0–0) | 0 | (0–0) | 0 | (0–0) |
| Hepatitis B | ||||||
| Positive | 510 | (4) | 464 | (4) | 46 | (4) |
| Negative | 8208 | (64) | 7563 | (65) | 645 | (54) |
| Unknown | 4043 | (32) | 3542 | (30) | 501 | (42) |
| Hepatitis C | ||||||
| Positive | 1407 | (11) | 1272 | (11) | 135 | (11) |
| Negative | 10 022 | (79) | 9142 | (79) | 880 | (74) |
| Unknown | 1332 | (10) | 1155 | (10) | 177 | (15) |
| Ever exposed to TDF | ||||||
| Baseline | 2259 | (18) | 2100 | (18) | 159 | (13) |
| End of study | 9800 | (77) | 8814 | (76) | 986 | (83) |
| Ever exposed to ATV/r | ||||||
| Baseline | 481 | (4) | 441 | (4) | 40 | (3) |
| End of study | 3629 | (28) | 3135 | (27) | 494 | (41) |
| Ever exposed to LPV/r | ||||||
| Baseline | 1783 | (14) | 1577 | (14) | 206 | (17) |
| End of study | 4043 | (32) | 3604 | (31) | 439 | (37) |
Data are presented as No. (%) unless otherwise indicated. All values are presented at baseline if not stated otherwise. Baseline is defined as the first creatinine measurement after 1 January 2002. Some potential risk factors are not presented, as these variables were not recorded during the entire study period.
Abbreviations: ATV/r, ritonavir-boosted atazanavir; CKD, chronic kidney disease; eGFR, estimated glomerular filtration rate; HIV, human immunodeficiency virus, IDU, intravenous drug use; IQR, interquartile range; LPV/r, ritonavir-boosted lopinavir; TDF, tenofovir disoproxil fumarate.
a Within the observation period.
b Calculated using the Chronic Kidney Disease Epidemiology Collaboration equation.
Figure 2.Overall glomerular filtration rates (GFRs; mL/minute/1.73 m2) in people living with human immunodeficiency virus (N = 12 761). This figure refers to the GFR at the last visit of the visit sequences in the considered observation period that is used to make predictions for 90 days, 180 days, 270 days, and 365 days ahead. The middle line and box indicate the median and interquartile range (IQR), respectively. Whiskers cover the 1.5 IQR. Abbreviations: CKD, chronic kidney disease; GFR, glomerular filtration rate.
Performance of Models to Predict Chronic Kidney Disease Across Different Prediction Horizons (n = 1276 Individuals; Test Set)
| Algorithm | Visits Used | Imputation Method | F-score | Precision | Recall | ROC-AUC | PR-AUC |
|---|---|---|---|---|---|---|---|
| Prediction 90 d in advance | |||||||
| Data-driven machine learning models (full models) | |||||||
| Multilayer perceptron | Last 2 visitsa | Zero imputation | 0.782 | 0.703 | 0.879 | 0.979 | 0.829 |
| Median forward | 0.847 | 0.858 | 0.836 | 0.990 | 0.890 | ||
| Gradient boosting | Last 2 visitsa | Zero imputation | 0.874 | 0.852 | 0.897 | 0.994 | 0.933 |
| Median forward | 0.890 | 0.875 | 0.905 | 0.996 | 0.956 | ||
| Random forest | Last 2 visitsa | Zero imputation | 0.583 | 0.942 | 0.422 | 0.995 | 0.943 |
| Median forward | 0.836 | 0.918 | 0.767 | 0.994 | 0.931 | ||
| Elastic net | Last 2 visitsa | Zero imputation | 0.774 | 0.649 | 0.957 | 0.984 | 0.861 |
| Median forward | 0.846 | 0.800 | 0.897 | 0.992 | 0.904 | ||
| Bidirectional recurrent neural network | Full sequence; all previous visits | Zero imputation | 0.818 | 0.786 | 0.853 | 0.984 | 0.874 |
| Median forward | 0.856 | 0.819 | 0.897 | 0.989 | 0.916 | ||
| Bidirectional attention recurrent neural network | Full sequence; all previous visits | Zero imputation | 0.803 | 0.797 | 0.810 | 0.981 | 0.867 |
| Median forward | 0.852 | 0.812 | 0.897 | 0.986 | 0.901 | ||
| Manually built logistic regression model (short model) | Last 2 visitsa | None | 0.807 | 0.689 | 0.974 | 0.990 | 0.881 |
| Prediction 180 d in advance | |||||||
| Data-driven machine learning models (full models) | |||||||
| Multilayer perceptron | Last 2 visitsa | Zero imputation | 0.719 | 0.716 | 0.722 | 0.960 | 0.777 |
| Median forward | 0.718 | 0.798 | 0.652 | 0.963 | 0.803 | ||
| Gradient boosting | Last 2 visitsa | Zero imputation | 0.656 | 0.859 | 0.530 | 0.969 | 0.833 |
| Median forward | 0.789 | 0.815 | 0.765 | 0.970 | 0.860 | ||
| Random forest | Last 2 visitsa | Zero imputation | 0.115 | > 0.999 | 0.061 | 0.955 | 0.803 |
| Median forward | 0.677 | 0.844 | 0.565 | 0.968 | 0.814 | ||
| Elastic net | Last 2 visitsa | Zero imputation | 0.698 | 0.629 | 0.783 | 0.952 | 0.768 |
| Median forward | 0.767 | 0.777 | 0.757 | 0.959 | 0.787 | ||
| Bidirectional recurrent neural network | Full sequence; all previous visits | Zero imputation | 0.722 | 0.732 | 0.713 | 0.965 | 0.759 |
| Median forward | 0.718 | 0.706 | 0.730 | 0.956 | 0.730 | ||
| Bidirectional attention recurrent neural network | Full sequence; all previous visits | Zero imputation | 0.694 | 0.720 | 0.670 | 0.963 | 0.755 |
| Median forward | 0.721 | 0.712 | 0.730 | 0.945 | 0.792 | ||
| Manually built logistic regression model (short model) | Last 2 visitsa | None | 0.559 | 0.405 | 0.904 | 0.934 | 0.646 |
| Prediction 270 d in advance | |||||||
| Data-driven machine learning models (full models) | |||||||
| Multilayer perceptron | Last 2 visitsa | Zero imputation | 0.678 | 0.634 | 0.728 | 0.948 | 0.666 |
| Median forward | 0.660 | 0.753 | 0.588 | 0.952 | 0.735 | ||
| Gradient boosting | Last 2 visitsa | Zero imputation | 0.290 | 0.833 | 0.175 | 0.944 | 0.702 |
| Median forward | 0.689 | 0.745 | 0.640 | 0.957 | 0.728 | ||
| Random forest | Last 2 visitsa | Zero imputation | 0.068 | > 0.999 | 0.035 | 0.928 | 0.661 |
| Median forward | 0.578 | 0.788 | 0.456 | 0.955 | 0.739 | ||
| Elastic net | Last 2 visitsa | Zero imputation | 0.647 | 0.566 | 0.754 | 0.942 | 0.702 |
| Median forward | 0.650 | 0.756 | 0.570 | 0.943 | 0.716 | ||
| Bidirectional recurrent neural network | Full sequence; all previous visits | Zero imputation | 0.605 | 0.581 | 0.632 | 0.938 | 0.649 |
| Median forward | 0.661 | 0.632 | 0.693 | 0.940 | 0.737 | ||
| Bidirectional attention recurrent neural network | Full sequence; all previous visits | Zero imputation | 0.664 | 0.630 | 0.702 | 0.931 | 0.678 |
| Median forward | 0.664 | 0.699 | 0.632 | 0.934 | 0.693 | ||
| Manually built logistic regression model (short model) | Last 2 visitsa | None | 0.453 | 0.310 | 0.842 | 0.893 | 0.504 |
| Prediction 365 d in advance | |||||||
| Data-driven machine learning models (full models) | |||||||
| Multilayer perceptron | Last 2 visitsa | Zero imputation | 0.641 | 0.691 | 0.598 | 0.950 | 0.699 |
| Median forward | 0.628 | 0.776 | 0.527 | 0.950 | 0.722 | ||
| Gradient boosting | Last 2 visitsa | Zero imputation | 0.220 | 0.933 | 0.125 | 0.945 | 0.700 |
| Median forward | 0.619 | 0.663 | 0.580 | 0.941 | 0.710 | ||
| Random forest | Last 2 visitsa | Zero imputation | 0.018 | > 0.999 | 0.009 | 0.941 | 0.705 |
| Median forward | 0.527 | 0.800 | 0.393 | 0.952 | 0.725 | ||
| Elastic net | Last 2 visitsa | Zero imputation | 0.588 | 0.626 | 0.554 | 0.938 | 0.673 |
| Median forward | 0.512 | 0.808 | 0.375 | 0.935 | 0.681 | ||
| Bidirectional recurrent neural network | Full sequence; all previous visits | Zero imputation | 0.606 | 0.656 | 0.562 | 0.945 | 0.631 |
| Median forward | 0.678 | 0.661 | 0.696 | 0.935 | 0.694 | ||
| Bidirectional attention recurrent neural network | Full sequence; all previous visits | Zero imputation | 0.600 | 0.643 | 0.562 | 0.928 | 0.632 |
| Median forward | 0.633 | 0.554 | 0.738 | 0.926 | 0.692 | ||
| Manually built logistic regression model (short model) | Last 2 visitsa | None | 0.423 | 0.286 | 0.812 | 0.883 | 0.468 |
Abbreviations: PR-AUC; area under the precision-recall curve; ROC-AUC, area under the receiver operating characteristic curve.
a And summary statistics from earlier visits during the target observation period, as detailed in the Methods.
Figure 3.Variable importance plot of the gradient-boosting model; 180 days prediction horizon. This hypothesis-generating plot is for illustrative purposes only. Suffix “2” signifies that information from the latest visit was used, whereas suffix “1” signifies that information from the preceding (penultimate) visit was used, both specified with respect to the visit sequence in the considered observation period. The different statistics (median, standard deviation for numerical and maximum for the nominal variables) were computed for all the remaining visits in the target observed hospital visit sequence. The Shapley additive explanation values describe for each variable and individual the change in the expected model prediction when conditioning on that variable. Abbreviations: GFR, glomerular filtration rate; max, maximum; SD, standard deviation; SHAP, Shapley additive explanation.
How Would You Decide? Predicted and Observed Chronic Kidney Disease Outcomes Among 3 Complex Cases Across Prediction Horizons (Gradient-Boosting Model Estimates for Illustrative Purposes)
| Individual | Predicted Outcome (CKD Probability) | Observed Outcome | Brief Interpretation and Key Predictor for Single Individuals | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Prediction Horizon, d | Prediction Horizon, d | ||||||||
| 90 | 180 | 270 | 365 | 90 | 180 | 270 | 365 | ||
| 1 | No CKD (0.34) | CKD (0.99) | CKD (0.51) | No CKD (0.01) | CKD | CKD | CKD | CKD | Platelet counts and various hematological parameters were strong predictors for CKD in this individual; however, this did not prevent false-negative predictions at 90 d and 365 d. There were dozens of moderate predictors of unclear clinical relevance: These factors have cancelled out at 365 d, as some were preventive and others suggested an incremental CKD risk. This example highlights that a clinician should review every machine learning prediction. |
| 2 | No CKD (0.18) | No CKD (0.00) | No CKD (0.00) | No CKD (0.00) | No CKD | No CKD | No CKD | No CKD | Absent cardiovascular risk factors (eg, smoking) were strong predictors against CKD development. However, there were dozens of moderate predictors (potential preventive factors and risk factors) of unclear clinical relevance. The low CKD probability score across prediction horizons, together with a careful review of medical records, may be an indication for clinicians that CKD development is unlikely. |
| 3 | No CKD (0.28) | CKD (0.71) | No CKD (0.00) | No CKD (0.02) | No CKD | No CKD | No CKD | No CKD | Cardiovascular risk factors (eg, high systolic blood pressure) and alcohol binge drinking increased the predicted CKD probability substantially—resulting in a false-positive prediction at 180 d; however, high preceding eGFR values were strong predictors against CKD across prediction horizons. |
Abbreviations: CKD, chronic kidney disease; eGFR, estimated glomerular filtration rate.