| Literature DB >> 32375693 |
Jenna M Reps1, Ross D Williams2, Seng Chan You3, Thomas Falconer4, Evan Minty5, Alison Callahan6, Patrick B Ryan7, Rae Woong Park3,8, Hong-Seok Lim9, Peter Rijnbeek2.
Abstract
BACKGROUND: To demonstrate how the Observational Healthcare Data Science and Informatics (OHDSI) collaborative network and standardization can be utilized to scale-up external validation of patient-level prediction models by enabling validation across a large number of heterogeneous observational healthcare datasets.Entities:
Keywords: Collaborative network; External validation; Patient-level prediction; Prognostic model; Transportability
Mesh:
Year: 2020 PMID: 32375693 PMCID: PMC7201646 DOI: 10.1186/s12874-020-00991-3
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
The covariates included in ATRIA, Framingham, CHADS2, CHA2DS2VASc and Q-Stroke
| Age >= 85 | x | ||||
| Age 75–84 | x | ||||
| Age 65–74 | x | x | |||
| Age 60–62 | x | ||||
| Age 63–66 | x | ||||
| Age 67–71 | x | ||||
| Age 72–74 | x | ||||
| Age 75–77 | x | ||||
| Age 78–81 | x | ||||
| Age 82–85 | x | ||||
| Age 86–90 | x | ||||
| Age 91–93 | x | ||||
| Age > 93 | x | ||||
| Age >= 75 | x | x | |||
| Female | x | x | x | ||
| Diabetes | x | x | x | x | x |
| Congestive heart failure | x | x | x | ||
| Prior Stroke or transient ischemic attack | x | x | x | ||
| Hypertension | x | x | x | x | |
| Systolic blood pressure+ | x | x | |||
| Total cholesterol: HDLa cholesterol ratio+ | x | ||||
| Townsend deprivation score+ | x | ||||
| Proteinuria | x | ||||
| eGFRa < 45 or End stage renal disease | x | ||||
| Vascular disease | x | ||||
| Congestive heart failure or Liver disease | x | ||||
| Smoking status+ | x | ||||
| Ethnicity+ | x | ||||
| Coronary heart disease | x | ||||
| Family history of congestive heart failure+ | x | ||||
| Atrial fibrillation | x | ||||
| Rheumatoid arthritis | x | ||||
| Chronic renal disease | x | ||||
| Valvular heart disease | x |
Existing models for predicting stroke risk. + indicates predictors are often poorly recorded or missing in claims data
aHDL High-density lipoproteins, eGFR Estimated glomerular filtration rate
The internal and external validation performances of the existing stroke prediction models
| 0.72 | 0.66 | 0.82 | 0.61 | 0.65 | |
| UK Electronic Medical Records (EMR) 2015 [ | 0.7 (0.69–0.71) | – | 0.68 (0.67–0.69) | 0.68 (0.67–0.69) | – |
| Swedish EMR 2016 [ | 0.71 (0.70–0.71) | – | 0.69 (0.69–0.70) | 0.69 (0.69–0.70) | – |
| Taiwan 2016 [ | – | – | 0.66 | 0.70 | – |
| New Zealand, Russia and the Netherlands 2014 [ | – | 0.70 (0.68–0.73) | – | – | 0.71 (0.69–0.73) |
| UK EMR 2010 [ | – | 0.65 (0.63–0.68) | 0.66 (0.64–0.68) | 0.67 (0.65–0.69) | – |
Internal and previously published external model fit statistics for each of the five models that predict stroke in atrial fibrillation patients
The stroke rate (% of target population) across the datasets
| Outcome rate % (Target population size) | ||||||||
|---|---|---|---|---|---|---|---|---|
| Target Population | CCAE | MDCD | MDCR | Optum claims | Optum EHR | CUMC | AUSOM | STRIDE |
| T1: Females aged 65+ with atrial fibrillation no prior stroke or anticoagulants | – | 4.95 (25,880) | 4.40 (89,156) | 4.07 (110,905) | 1.30 (149,906) | 5.75 (4312) | 2.61 (268) | 1.37 (3366) |
| T2: Females with atrial fibrillation no prior stroke or anticoagulants | 1.33 (61,224) | 4.61 (33,262) | – | 3.49 (139,376) | 1.13 (189,815) | 5.00 (5758) | 1.76 (455) | 1.28 (4456) |
| Sensitivity T1: Females aged 65+ with atrial fibrillation no prior stroke or anticoagulants (no anticoagulants during tar) | – | 5.04 (23,586) | 5.26 (56,511) | 4.48 (78,353) | 1.44 (99,212) | 6.23 (3403) | 4.17 (144) | 1.29 (2094) |
| Sensitivity T2: Females with atrial fibrillation no prior stroke or anticoagulants (no anticoagulants during tar) | 1.28 (46,054) | 4.69 (29,546) | – | 3.73 (100,757) | 1.22 (128,409) | 5.35 (4546) | 2.73 (256) | 1.22 (2786) |
Target population size in each dataset and the percentage of patients with stroke within 1 year of initial atrial fibrillation diagnosis
Discrimination performance of the existing models externally validated across the OHDSI datasets
| Database AUROC (95% CIs) | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Target Populationa | Model | CCAE | MDCD | MDCR | Optum claims | Optum EHR | CUMC | AUSOM | STRIDE |
| T1: Females aged 65+ with atrial fibrillation no prior stroke or anticoagulants | ATRIA | – | 0.57 (0.55–0.58) | 0.63 (0.62–0.64) | 0.61 | 0.62 | 0.64 (0.61–0.68) | 0.60 (0.33–0.87) | 0.49 (0.40–0.58) |
| CHADS2 | – | 0.54 (0.53–0.56) | 0.60 (0.59–0.61) | 0.59 | 0.60 | 0.60 (0.57–0.64) | 0.51 (0.27–0.75) | 0.48 (0.39–0.57) | |
| CHA2DS2VASc | – | 0.55 (0.53–0.57) | 0.60 (0.59–0.61) | 0.59 | 0.62 | 0.61 (0.58–0.65) | 0.53 (0.32–0.74) | 0.52 (0.42–0.62) | |
| Framingham | – | 0.56 (0.54–0.57) | 0.62 (0.61–0.63) | 0.59 | 0.61 | 0.63 (0.60–0.66) | 0.58 (0.33–0.83) | 0.61 (0.52–0.70) | |
| Q-Stroke | – | 0.53 (0.52–0.55) | 0.56 (0.55–0.57) | 0.55 | 0.56 | 0.55 (0.51–0.59) | 0.56 (0.29–0.84) | 0.50 (0.41–0.59) | |
| T2: Females with atrial fibrillation no prior stroke or anticoagulants | ATRIA | 0.62 (0.60–0.64) | 0.58 (0.56–0.59) | – | 0.65 | 0.65 | 0.66 (0.62–0.69) | 0.73 (0.58–0.89) | 0.52 (0.44–0.60) |
| CHADS2 | 0.61 (0.59–0.62) | 0.56 (0.55–0.57) | – | 0.62 | 0.63 | 0.63 (0.60–0.66) | 0.63 (0.43–0.83) | 0.50 (0.42–0.57) | |
| CHA2DS2VASc | 0.63 (0.61–0.65) | 0.58 (0.56–0.59) | – | 0.64 | 0.65 | 0.64 (0.61–0.67) | 0.73 (0.60–0.85) | 0.55 (0.47–0.62) | |
| Framingham | 0.62 (0.60–0.64) | 0.57 (0.56–0.59) | – | 0.64 | 0.65 | 0.65 (0.62–0.68) | 0.70 (0.53–0.86) | 0.61 (0.53–0.69) | |
| Q-Stroke | 0.61 (0.59–0.63) | 0.54 (0.53–0.56) | – | 0.57 | 0.58 | 0.56 (0.53–0.60) | 0.63 (0.39–0.88) | 0.51 (0.43–0.59) | |
| Sensitivity T1: Females aged 65+ with atrial fibrillation no prior stroke or anticoagulants (no anticoagulants during 1 year time-at-risk) | ATRIA | – | 0.56 (0.55–0.58) | 0.63 (0.62–0.64) | 0.61 (0.61–0.62) | 0.63 (0.61–0.64) | 0.65 (0.62–0.69) | 0.69 (0.43–0.95) | 0.55 (0.47–0.62) |
| CHADS2 | – | 0.54 (0.53–0.56) | 0.61 (0.60–0.62) | 0.59 (0.58–0.60) | 0.61 (0.59–0.62) | 0.62 (0.58–0.65) | 0.61 (0.36–0.85) | 0.51 (0.38–0.63) | |
| CHA2DS2VASc | – | 0.55 (0.54–0.57) | 0.61 (0.60–0.62) | 0.59 (0.58–0.60) | 0.63 (0.61–0.64) | 0.63 (0.59–0.66) | 0.64 (0.45–0.83) | 0.55 (0.42–0.67) | |
| Framingham | – | 0.55 (0.54–0.57) | 0.62 (0.61–0.63) | 0.59 (0.59–0.60) | 0.62 (0.61–0.63) | 0.64 (0.61–0.68) | 0.68 (0.44–0.93) | 0.64 (0.53–0.74) | |
| Q-Stroke | – | 0.53 (0.52–0.55) | 0.57 (0.55–0.58) | 0.55 (0.54–0.56) | 0.57 (0.55–0.58) | 0.56 (0.52–0.60) | 0.61 (0.30–0.92) | 0.47 (0.35–0.58) | |
| Sensitivity T2: Females with atrial fibrillation no prior stroke or anticoagulants (no anticoagulants during 1-year time-at-risk) | ATRIA | 0.63 (0.61–0.66) | 0.58 (0.56–0.59) | – | 0.67 | 0.67 | 0.67 (0.64–0.70) | 0.79 (0.63–0.94) | 0.53 (0.43–0.63) |
| CHADS2 | 0.62 (0.60–0.65) | 0.56 (0.55–0.58) | – | 0.64 | 0.65 | 0.64 (0.61–0.68) | 0.72 (0.53–0.91) | 0.51 (0.41–0.62) | |
| CHA2DS2VASc | 0.65 (0.62–0.67) | 0.58 (0.56–0.59) | – | 0.65 | 0.67 | 0.66 (0.63–0.69) | 0.81 (0.71–0.90) | 0.55 (0.44–0.65) | |
| Framingham | 0.64 (0.61–0.66) | 0.57 (0.56–0.59) | – | 0.65 | 0.66 | 0.66 (0.63–0.69) | 0.76 (0.59–0.93) | 0.62 (0.51–0.72) | |
| Q-Stroke | 0.62 (0.60–0.64) | 0.55 (0.53–0.56) | – | 0.58 | 0.6 | 0.57 (0.53–0.61) | 0.68 (0.42–0.94) | 0.47 (0.36–0.57) | |
Discrimination performance of the existing models across the datasets. The AUROC 95% confidence intervals were only calculated when the outcome count was less than 1000. a See section ‘Participants’ for full inclusion/exclusion criteria