| Literature DB >> 32618575 |
Zakhriya Alhassan1,2, David Budgen1, Riyad Alshammari3,4, Noura Al Moubayed1.
Abstract
BACKGROUND: Electronic health record (EHR) systems generate large datasets that can significantly enrich the development of medical predictive models. Several attempts have been made to investigate the effect of glycated hemoglobin (HbA1c) elevation on the prediction of diabetes onset. However, there is still a need for validation of these models using EHR data collected from different populations.Entities:
Keywords: EHR; HbA1c; diabetes; differentiated replication; electronic health records; glycated hemoglobin; hemoglobin; logistic regression; medical informatics; prediction
Year: 2020 PMID: 32618575 PMCID: PMC7367516 DOI: 10.2196/18963
Source DB: PubMed Journal: JMIR Med Inform
Predictors available in the original study versus King Abdullah International Medical Research Center datasets.
| Predictors | Original study dataset | KAIMRCa dataset |
| Age | √ | √ |
| Body mass index | √ | √ (calculated) |
| Estimated glomerular filtration rate | √ | √ |
| Random blood sugar (glucose) level | √ | √ |
| Non–high density lipoprotein | √ | √ (calculated) |
| Total cholesterol | √ | √ |
| Race | √ | x |
| Smoking status | √ | x |
aKAIMRC: King Abdullah International Medical Research Center, Saudi Arabia.
Figure 1Dataset preprocessing details. HbA1c: glycated hemoglobin.
Descriptive statistics for King Abdullah International Medical Research Center and original study datasets.
| Variablesa | KAIMRCb dataset | Original studyc dataset | ||||
|
| HbA1cd <5.7% (n=14,332) | HbA1c ≥5.7% (n=22,046) | HbA1c <5.7% (n=16,743) | HbA1c ≥5.7% (n=5892) | ||
| Age (years), mean (SD) | 45.5 (17.01) | 60.5 (14.13) | <.001 | 48.1 (15.4) | 54.8 (14.0) | |
| BMI (kg/m2), mean (SD) | 29.61 (10.74) | 31.50 (12.13) | <.001 | 30.1 (7.44) | 33.0 (8.41) | |
| eGFRe (mL/min/1.73 m2), mean (SD) | 93.40 (35.19) | 82.02 (28.86) | <.001 | 92.0 (33.0) | 87.9 (30.8) | |
|
|
|
| <.001 |
|
| |
|
| RBS (mmol/L), mean (SD) | 5.47 (1.28) | 8.30 (4.30) |
| 4.9 (0.7) | 5.3 (0.9) |
|
| RBS (mg/dL), mean (SD) | 98.5 (23.00) | 149.4 (77.47) |
| 88.4 (12.7) | 96.1 (16.0) |
|
|
|
| <.001 |
|
| |
|
| Cholesterol (mmol/L), mean (SD) | 4.59 (1.19) | 4.17 (1.16) |
| 4.80 (1.01) | 4.96 (1.11) |
|
| Cholesterol (mg/dL), mean (SD) | 177.49 (46.01) | 161.25 (44.85) |
| 186 (39.4) | 192 (43.1) |
|
|
|
| <.001 |
|
| |
|
| Non-HDL (mmol/L), mean (SD) | 2.85 (1.06) | 2.49 (0.99) |
| 3.49 (0.96) | 3.72 (1.07) |
|
| Non-HDL (mg/dL), mean (SD) | 110.2 (40.99) | 96.28 (38.28) |
| 135 (37.4) | 144 (41.7) |
aRefer to Multimedia Appendix 3 for unit conversion formulae.
bKAIMRC: King Abdullah International Medical Research Center, Saudi Arabia.
cWake Forest Baptist Medical Center, North Carolina, United States.
dHbA1c: glycated hemoglobin.
eeGFR: estimated glomerular filtration rate.
fRBS: random blood sugar.
gHDL: high-density lipoproteins.
Performance of models for glycated hemoglobin elevation prediction.
| Model | Variables used | Number of RCSa knots | AUR ROCb | 95% CI | Recall | Precision | F1 |
| PMc1 | Completed | 3 | 73.67 | 74.71-77.51 | 85.24 | 77.58 | 81.23 |
| PM2 | Complete | N/Ae | 74.04 | 74.35-77.16 | 82.18 | 78.76 | 80.43 |
| PM3 | Reducedf | 5 | 74.73 | 75.38-78.15 | 84.40 | 78.80 | 81.50 |
aRCS: restricted cubic splines.
bAUR ROC: area under the receiver operating characteristic.
cPM: predictive model.
dAll variables (age, random blood sugar, cholesterol, non–high-density lipoproteins, estimated glomerular filtration rate, and BMI).
eN/A: not applicable.
fReduced variables (age, random blood sugar, cholesterol, non–high-density lipoproteins, and estimated glomerular filtration rate).
Figure 2The calibration curve for PM1. HbA1c: glycated hemoglobin. PM: predictive model.
Figure 3The calibration curve for PM2. HbA1c: glycated hemoglobin. PM: predictive model.
Figure 4Order of importance of predictors for PM1. Chol: cholesterol. eGFR: estimated glomerular filtration rate. HDL: high-density lipoproteins. PM: predictive model. RBS: random blood sugar.
Figure 5Order of importance of predictors for PM3. Chol: cholesterol. eGFR: estimated glomerular filtration rate. HDL: high-density lipoproteins. PM: predictive model. RBS: random blood sugar.
Predictors importance rankings.
| Study | 1st | 2nd | 3rd | 4th | 5th | 6th | 7th | 8th | |
| Original study | Age | BMI | RBSa | Race | Non-HDLb | Cholesterol | eGFRc | Smoking status | |
|
|
|
|
|
|
|
|
|
| |
|
| PMd1 | RBS | Age | Cholesterol | Non-HDL | eGFR | BMI | N/Ae | N/A |
|
| PM2 | Age | RBS | Cholesterol | Non-HDL | BMI | eGFR | N/A | N/A |
|
| PM3 | RBS | Age | eGFR | Cholesterol | Non-HDL | BMI (excluded) | N/A | N/A |
aRBS: random blood sugar.
bHDL: high-density lipoproteins.
ceGFR: estimated glomerular filtration rate.
dPM: predictive model.
eN/A: not applicable.
Figure 6Box plots of the reported measures for the models. AUC ROC: area under the receiver operating characteristic. PM: predictive model.
Figure 7HbA1c elevation for BMI ranges of King Abdullah International Medical Research Center patients. HbA1c: glycated hemoglobin.