| Literature DB >> 30658456 |
Antonio Martinez-Millana1, María Argente-Pla2,3, Bernardo Valdivieso Martinez4,5, Vicente Traver Salcedo6,7, Juan Francisco Merino-Torres8,9.
Abstract
Electronic health records and computational modelling have paved the way for the development of Type 2 Diabetes risk scores to identify subjects at high risk. Unfortunately, few risk scores have been externally validated, and their performance can be compromised when routine clinical data is used. The aim of this study was to assess the performance of well-established risk scores for Type 2 Diabetes using routinely collected clinical data and to quantify their impact on the decision making process of endocrinologists. We tested six risk models that have been validated in external cohorts, as opposed to model development, on electronic health records collected from 2008-2015 from a population of 10,730 subjects. Unavailable or missing data in electronic health records was imputed using an existing validated Bayesian Network. Risk scores were assessed on the basis of statistical performance to differentiate between subjects who developed diabetes and those who did not. Eight endocrinologists provided clinical recommendations based on the risk score output. Due to inaccuracies and discrepancies regarding the exact date of Type 2 Diabetes onset, 76 subjects from the initial population were eligible for the study. Risk scores were useful for identifying subjects who developed diabetes (Framingham risk score yielded a c-statistic of 85%), however, our findings suggest that electronic health records are not prepared to massively use this type of risk scores. Use of a Bayesian Network was key for completion of the risk estimation and did not affect the risk score calculation (p > 0.05). Risk score estimation did not have a significant effect on the clinical recommendation except for starting pharmacological treatment (p = 0.004) and dietary counselling (p = 0.039). Despite their potential use, electronic health records should be carefully analyzed before the massive use of Type 2 Diabetes risk scores for the identification of high-risk subjects, and subsequent targeting of preventive actions.Entities:
Keywords: Risk scores; T2DM; clinical data; prediction; screening
Year: 2019 PMID: 30658456 PMCID: PMC6352264 DOI: 10.3390/jcm8010107
Source DB: PubMed Journal: J Clin Med ISSN: 2077-0383 Impact factor: 4.241
Figure 1Schedule of the study for the Risk Stratification and the Support to doctor’s assessment.
Externally validated list of Risk Scores and description of the derivation sample population characteristics, mathematical approach, and criteria for type 2 diabetes diagnoses.
| FINDRISC | ARIC | San Antonio | PREDIMED | Framingham | CAMBRIDGE | |
|---|---|---|---|---|---|---|
|
| −5.51 | −9.981 | −13.415 | −18.607 | −6.322 | |
|
| 45–54: 0.63 | 0.0173 | 0.028 | 50–64: −0.010 | 0.063 | |
|
| Female: 0.661 | Male: 0.4308 | Female: −0.879 | |||
|
| African-American: 0.443 | Hispanic: 0.44 | ||||
|
| 0.71 | 0.838 | 0.336 | 1.222 | ||
|
| >110: 2.14 * | 0.088 | 0.079 | ≥100: 1.929 | 0.1398 | |
|
| 25–30: 0.17 | 0.070 | ≥27: 0.315 | 0.03922 | 25–27.49: 0.6 | |
|
| 0.0012 | 0.039 | −0.0488 | |||
|
| 0.00271 | ≥150: 0.405 | ||||
|
| Systolic: 0.0111 | Systolic: 0.018 |
| Systolic: | ||
|
| 0.498 | 0.481 | 0.506 | 0.4383 | 0.728 ** | |
|
| 0.547 | 0.855 ** | ||||
|
| 0.427 | |||||
|
| Men 94–102 Women 80–88 0.86 | 0.0273 | 0.0488 | |||
|
| 0.0326 |
* Refers to any hipoglycemia. ** The original model foresees two more categories not available in the study dataset. *** The model calculates high blood pressure as an analysis of Systolic and Diastolic blood pressure or anti-hypertensive medication prescription, only one of the predictors is used.
Externally validated list of Risk Scores and description of the derivation sample population characteristics, mathematical approach, and criteria for type 2 diabetes diagnoses.
| Risk Score Name and Validation Study | Population Characteristics for Internal Validation | Population Characteristics for External Validation | Mathematical Model | T2DM Diagnosis Criteria |
|---|---|---|---|---|
| FINDRISC [ | NS | North European, Dutch, Australian, African | Logistic regression | WHO (FPG or 2h-PG) |
| Ages: 35–64 | Ages: 35.2–71 | |||
| Follow-up: 5 years | Follow-up: 5 Years | |||
| ARIC [ | United States | United States | Logistic regression | WHO criteria |
| Ages: 45–64 | Ages: 45–84 | |||
| Follow-up: 9 years | Follow-up: 4.75 years | |||
| San Antonio Internal [ | Mexican-Americans | Finland and | Linear | ADA criteria |
| Ages: NS | Ages: 44–55 | |||
| Follow-up: 7.5 years | Follow-up: 7.5 years | |||
| QDScore Internal [ | Caucasian | Caucasian (93%) and other ethnic groups | Proportional | Diagnosis read code |
| Ages: 25–79 | Ages: 25–79 | |||
| Retrospective (15 years) | Retrospective (15 years) THIN DataBase | |||
| Cambridge Internal [ | UK population | UK population | Logistic | Diagnostic Code |
| Ages: 40–79 | Ages: 35–55 | |||
| Follow-up: 5 years | Retrospective | |||
| PREDIMED Internal [ | Spanish Caucasian | Spanish Caucasian (High Risk) | Multivariate | ADA criteria |
| Ages: 55–80 | Ages: 45–75 | |||
| Follow-up: 3.8 years | Follow-up: 4.2 years | |||
| Framingham Internal [ | Caucasian | Caucasian, | Logistic | ADA criteria |
| Ages: 44.2–63.9 | Ages: 45–84 | |||
| Follow-up: 7 years | Follow-up: 4.75 years |
World Health Organization (WHO); 2h- Plasgma Glucose (2h-PG); American Diabetes Association (ADA); Electronic Health Record (EHR).
Sample size, threshold, and discrimination performance of the externally validated risk models selected for the assessment.
| Risk Score | Sample Size | Incident | Cut-Off | S | Sp | PPV | NPV | AUC |
|---|---|---|---|---|---|---|---|---|
| FINDRISC Internal [ | 4586 | 182 | ≥9 | 0.78 | 0.77 | 0.13 | 0.99 | 0.85 |
| FINDRISC External [ | 18,301 | 844 | ≥7 | 0.76 | 0.63 | 0.11 | NA | 0.76 |
| ARIC Internal [ | 7915 | 1292 | ≥0.18 | 0.67 | 0.77 | 0.36 | 0.92 | 0.80 |
| ARIC External [ | 5329 | 446 | NS | NS | NS | NS | NS | 0.84 * |
| San Antonio Internal [ | 2903 | 275 | NA | NS | NS | NS | NS | 0.84 |
| San Antonio External [ | 2395 | 124 | >0.0065 | 0.75 | 0.72 | 0.119 | NS | 0.83 * |
| QDScore Internal [ | 3,773,585 | 115,616 | NS | NS | NS | NS | NS | 0.83 men |
| QDScore External [ | 2,396,392 | 72,986 | NS | NS | NS | NS | NS | 0.80 men |
| Cambridge Internal [ | 24,495 | 323 | >0.37 | 0.55 | 0.80 | NS | NS | 0.75 |
| Cambridge External [ | 5135 | 302 | >0.37 | NS | NS | NS | NS | 0.72 |
| PREDIMED Internal [ | 1381 | 155 | ≥6 | 0.72 | 0.72 | 0.25 | 0.95 | 0.78 |
| PREDIMED External [ | 552 | 124 | ≥6 | 0.85 | 0.26 | 0.25 | 0.86 | 0.66 |
| Framingham Internal ** [ | 3140 | 160 | NS | NS | NS | NS | NS | 0.84 |
| Framingham External ** [ | 5329 | 446 | NS | NS | NS | NS | NS | 0.83 * |
* Indicates recalibration. ** Specific model for the prediction of Type 2 Diabetes Mellitus (T2DM) derived from the Framingham Offspring Study. S = Sensitivity; Sp = Specificity; PPV = Positive Predictive Value; NPV = Negative Predictive Value; AUC = Area Under the Curve; and NS = metric not specified.
Figure 2Difference between coding onset year and real onset year for Type 2 Diabetes Mellitus diagnoses. T1DM = Type 1 Diabetes Mellitus.
Figure 3Risk Score outcome comparison between cases and controls.
Discrimination and calibration of the risk models for recalculated cut-off points
| S | Sp | PPV | NPV | AUC | Cut-off | HL Score | ||
|---|---|---|---|---|---|---|---|---|
|
| 0.38 | 1 | 1 | 0.6 | 0.69 | 0.180 | 0.003 | 0.043 |
|
| 0.53 | 1 | 1 | 0.67 | 0.73 | 0.821 | 0.271 | 0.397 |
|
| 0.61 | 1 | 1 | 0.71 | 0.76 | 0.065 | 0.018 | 0.107 |
|
| 0.54 | 0.91 | 0.83 | 0.57 | 0.66 | 16.297 | 0.049 | 0.175 |
|
| 0.76 | 0.33 | 0.55 | 0.57 | 0.53 | 0.345 | 0.288 | 0.408 |
|
| 0.85 | 0.83 | 0.84 | 0.83 | 0.875 | 0.034 | <0.001 | 0.020 |
S = Sensitivity; Sp = Specificity; PPV = Positive Predictive Value; NPV = Negative Predictive Value; AUC = Area Under the Curve; HL: Hosmer–Lemershow.
Figure 4Calibration performance of risk scores with suggested and calculated cut-off points. (A) Calibration plot for suggested cut-off. (B) Calibration plot for re-calculated cut-off. Cambridge and Framingham scores do not suggest cut-off points, so the performance descriptors are not applicable in chart (A).
Figure 5Comparison of the c-statistic (AUC Receiver Operating Characteristics (ROC) curve) for the 2h-OGTT high-risk probability and the two gold standard procedures.Fasting Glucose (FG) and HbA1c).
Descriptive distribution, dependency analysis, and missing data rate for Cases and Controls of the prediction analysis.
| VARIABLE | CONTROLS ( | CASES | MISSING DATA (%) | |||
|---|---|---|---|---|---|---|
| Gender | 4 M/9 F | 5 M/7 F | ||||
|
|
|
|
| |||
| Age | 65.76 | 8.20 | 59.41 | 9.28 | 0.082 | 0 |
| Body Mass Index | 28.78 | 5.20 | 32.16 | 8.46 | 0.433 | 56 |
| Waist | 98.66 | 5.13 | 92.00 | 0.00 | 0.377 | 84 |
| Systolic Blood Pressure | 130.00 | 12.94 | 136.67 | 21.82 | 0.451 | 36 |
| Diastolic Blood Pressure | 75.30 | 9.86 | 89.83 | 12.30 | 0.020 | 36 |
| Pulse | 70.85 | 8.78 | 74.00 | 12.20 | 0.613 | 52 |
| Cholesterol | 198.31 | 48.62 | 208.50 | 31.53 | 0.544 | 0 |
| Triglyceride | 149.23 | 60.63 | 175.75 | 61.96 | 0.290 | 0 |
| High-Density Lipoprotein (HDL) | 45.58 | 17.16 | 49.11 | 13.67 | 0.618 | 16 |
| Fasting Glucose | 101.55 | 12.34 | 98.27 | 10.51 | 0.510 | 12 |
| HbA1C | 5.89 | 0.37 | 5.58 | 0.40 | 0.132 | 32 |
Descriptive distribution, dependency analysis, and missing data rate for Cases and Controls of the detection.
| VARIABLE | CONTROLS ( | CASES | MISSING DATA (%) | |||
|---|---|---|---|---|---|---|
| Gender | 12 M/13 F | 13 M/10 F | ||||
|
|
|
|
| |||
| Age | 61.6 | 8.98 | 62.35 | 11.18 | 0.800 | 0.00 |
| Body Mass Index | 29.22 | 6.14 | 32.13 | 7.87 | 0.319 | 45.80 |
| Waist | 96 | 6.10 | 115 | 24.95 | 0.262 | 85.40 |
| Systolic Blood Pressure | 135.41 | 18.514 | 128 | 16.749 | 0.237 | 31.25 |
| Diastolic Blood Pressure | 82.41 | 12.76 | 79.5 | 9.07 | 0.020 | 36.00 |
| Pulse | 71.25 | 10.83 | 81.92 | 12.62 | 0.030 | 45.83 |
| Cholesterol | 204.76 | 41.43 | 203.23 | 41.75 | 0.900 | 2.08 |
| Triglyceride | 177.52 | 94.29 | 195.9 | 68.36 | 0.290 | 0.00 |
| HDL | 45.58 | 17.16 | 49.11 | 13.67 | 0.643 | 4.16 |
| Fasting Glucose | 100.82 | 11.083 | 108.13 | 8.95 | <0.05 | 6.00 |
| HbA1C | 5.75 | 0.41 | 6.17 | 0.19 | <0.05 | 54.00 |
Endocrinologists evaluating the two clinical scenarios. Information Technology (IT).
|
| Male(2)/Female (6) | |
|
| 42 ± 13 | |
|
| 14 ± 10 | |
|
| High = 3; Medium = 3; Low = 2 | |
|
| Overall | 319.33 ± 247.66 |
| TD2M Patients | 127.44 ± 75.22 | |
| High Risk of developing T2DM | 48.00 ± 33.79 | |
Number of recommendations for each subject according to the risk outcome. Low and high risk discrimination is done at the recommended cut-off point.
| Recommendation | Risk Outcome | Statistical Analysis | ||
|---|---|---|---|---|
| LOW RISK | HIGH RISK |
| Chi2 | |
| Order an 2h-OGTT for this patient | 4 | 6 | 0.654 | 0.20 |
| Order an HbA1c test for this patient | 15 | 19 | 0.466 | 0.52 |
| Refer to General endocrinologist | 1 | 2 | - | - |
| Refer to General Practitioner | 11 | 12 | - | - |
| Start Pharmacological | 1 | 8 | 0.004 | 8.00 |
| Start Dietary Habits | 5 | 12 | 0.039 | 4.23 |
| Start Moderate Physical Activity Habits | 6 | 11 | 0.170 | 1.88 |
| Counsel about healthy lifestyle | 15 | 11 | 0.405 | 0.69 |
| Counsel about diet, physical activity, and weight control | 6 | 11 | 0.170 | 1.88 |