| Literature DB >> 31093547 |
S van Doorn1, T B Brakenhoff1, K G M Moons1, F H Rutten1, A W Hoes1, R H H Groenwold1, G J Geersing1.
Abstract
BACKGROUND: Research on prognostic prediction models frequently uses data from routine healthcare. However, potential misclassification of predictors when using such data may strongly affect the studied associations. There is no doubt that such misclassification could lead to the derivation of suboptimal prediction models. The extent to which misclassification affects the validation of existing prediction models is currently unclear.We aimed to quantify the amount of misclassification in routine care data and its effect on the validation of the existing risk prediction model. As an illustrative example, we validated the CHA2DS2-VASc prediction rule for predicting mortality in patients with atrial fibrillation (AF).Entities:
Keywords: Atrial fibrillation; CHA2DS2-VASc; Misclassification; Prediction model; Routine care data; Validation
Year: 2017 PMID: 31093547 PMCID: PMC6460749 DOI: 10.1186/s41512-017-0018-x
Source DB: PubMed Journal: Diagn Progn Res ISSN: 2397-7523
The original CHA2DS2-VASc score [12]
| Predictor | Score |
|---|---|
| Congestive heart failure/LV dysfunction | 1 |
| Hypertension | 1 |
| Age ≥ 75 years | 2 |
| Diabetes mellitus | 1 |
| Stroke/TIA/TE | 2 |
| Vascular disease (prior myocardial infarction, peripheral artery disease, or aortic plaque) | 1 |
| Age 65–74 years | 1 |
| Sex category (i.e., female sex) | 1 |
TE thromboembolism
The annual risks of thromboembolism (ischemic stroke, peripheral embolism, or pulmonary embolism) for CHA2DS2-VASc, adjusted for aspirin use [12]
| CHA2DS2-VASc score | Risk (events/persons) |
|---|---|
| 0 | 0 (0/103) |
| 1 | 0.7 (1/162) |
| 2 | 1.9 (3/184) |
| 3 | 4.7 (8/203) |
| 4 | 2.3 (4/208) |
| 5 | 3.9 (3/95) |
| 6 | 4.5 (2/57) |
| 7 | 10.1 (2/25) |
| 8 | 14.2 (1/9) |
| 9 | 100 (1/1) |
The original study deriving the CHA2DS2-VASc consisting of 1084 AF patients with a follow-up of 1 year, considering ischemic stroke, peripheral embolism, or pulmonary embolism as outcomes for thromboembolism
Automatically extracted ICPC codes for the index predictors in the CHA2DS2-VASc model and the definition of the reference predictors used for manually scrutinizing the electronic patient file
| Predictor | ICPC code(s) for index predictors | Definition for reference predictors |
|---|---|---|
| Congestive heart failure | K77 heart failure | Signs and symptoms suggestive of heart failure, with structural or functional abnormalities on echocardiography, either with preserved or reduced ejection fraction |
| Hypertension | K86 hypertension without organ damage | Repeated systolic blood pressure measurement of 140 mmHg or higher |
| Age | Age in years | Age in years |
| Diabetes | T90 type 1 and type 2 diabetes | Repeated fasting blood glucose measurement of ≥ 7.0 mmol/L (126 mg/dL) or a non-fasting glucose measurement of ≥ 11.1 mmol/L (200 mg/dL) |
| Stroke/TIA | K89 TIA | Focal neurological deficit of sudden onset lasting > 24 or < 24 h, respectively |
| Vascular disease | K74 angina pectoris | • Coronary heart disease: prior myocardial infarction (both ST-elevated myocardial infarction or non-ST-elevated myocardial infarction), angina pectoris or prior percutaneous coronary intervention (PCI) or coronary artery bypass graft surgery (CABG) |
| Sex category | Female sex | Female sex |
Prevalence of individual ICPC codes (index predictors) and manually verified diagnoses (reference predictors) and measures of misclassification
| ICPC codes (index predictors) | Manually verified diagnoses (reference predictors) | Kappa | Sensitivity | Specificity | PPV | NPV | |
|---|---|---|---|---|---|---|---|
| Congestive heart failure/LV dysfunction | 28.1 | 18.3 | 56.1 | 54.5 | 95.7 | 83.3 | 84.3 |
| Hypertension | 60.8 | 59.9 | 70.9 | 87.8 | 83.3 | 89.1 | 81.6 |
| Diabetes | 24.3 | 22.5 | 89.7 | 88.6 | 98.8 | 95.8 | 96.4 |
| Stroke/TIA/TE | 18.7 | 16.4 | 75.5 | 74.8 | 97.1 | 85.5 | 94.4 |
| Vascular disease | 34.6 | 26 | 60.4 | 63.1 | 93.7 | 84.2 | 82.7 |
ICPC International Classification of Primary Care, PPV positive predictive value, NPV negative predictive value
Fig. 1The concordance of CHA2DS2-VASc scores as calculated using the index predictors (x-axis) and as calculated using the reference predictors (y-axis). Numbers are counts (percentages)
Incidence rate of all-cause mortality for each CHA2DS2-VASc score as calculated with ICPC codes (index predictors) or manually verified diagnoses (reference predictors)
| ICPC codes (index predictors) | Manually verified diagnoses (reference predictors) | |||||||
|---|---|---|---|---|---|---|---|---|
| Score | No. of patients (%) | No. of events | py | IR | No. of patients (%) | No. of events | py | IR |
| 0 | 124 (5.3) | 2 | 338 | 0.6 | 125 (5.3) | 2 | 346 | 0.6 |
| 1 | 194 (8.2) | 2 | 541 | 0.4 | 203 (8.6) | 4 | 567 | 0.7 |
| 2 | 307 (13.0) | 29 | 892 | 3.3 | 344 (14.6) | 37 | 994 | 3.7 |
| 3 | 356 (15.1) | 48 | 1041 | 4.6 | 417 (17.7) | 54 | 1208 | 4.5 |
| 4 | 404 (17.2) | 67 | 1186 | 5.6 | 431 (18.3) | 83 | 1274 | 6.5 |
| 5 | 292 (12.4) | 86 | 887 | 9.7 | 254 (10.8) | 89 | 795 | 11.2 |
| 6 | 187 (7.9) | 70 | 590 | 11.9 | 139 (5.9) | 60 | 441 | 13.6 |
| 7 | 82 (3.5) | 37 | 262 | 14.1 | 54 (2.3) | 27 | 187 | 14.4 |
| 8 | 33 (1.4) | 26 | 127 | 20.5 | 16 (0.7) | 10 | 60 | 16.7 |
| 9 | 8 (0.3) | 1 | 23 | 4.3 | 4 (0.2) | 2 | 14 | 14.3 |
py person-years/100, IR incidence rate no. as of events/100 person-years
Fig. 2Calibration plot showing deciles of observed and predicted probabilities of survival from the CHA2DS2-VASc model developed using the reference predictors and validated using the baseline hazard and coefficients for validation with the index predictors as input values