| Literature DB >> 30904868 |
Cynthia A Jackevicius1,2,3,4,5, JaeJin An1, Dennis T Ko2,3,6, Joseph S Ross7,8,9, Suveen Angraal9, Joshua D Wallach10,11, Maria Koh2, Jeeeun Song1, Harlan M Krumholz8,9,12.
Abstract
OBJECTIVES: To collate and systematically characterise the methods, results and clinical performance of the clinical risk prediction submissions to the Systolic Blood Pressure Intervention Trial (SPRINT) Data Analysis Challenge.Entities:
Keywords: clinical prediction; open science; risk prediction; sprint
Year: 2019 PMID: 30904868 PMCID: PMC6475140 DOI: 10.1136/bmjopen-2018-025936
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Figure 1This figure illustrates the selection process of the submissions included in the systematic evaluation and the reasons for exclusion. SPRINT, Systolic Blood Pressure Intervention Trial.
Characteristics of prediction models
| Characteristic | N | % |
| Study population (n=29) | 29 | |
| Overall cohort | 26 | 90 |
| Others (patients without CKD, patients without primary end point, unclear) | 3 | 10 |
| Outcomes of prediction models (n=29) | ||
| Both efficacy and safety outcomes | 16 | 55 |
| Efficacy models (a) | 12 | 41 |
| Safety models (b) | 12 | 41 |
| Efficacy and safety combined models | 4 | 14 |
| Efficacy outcome only (c) | 11 | 37 |
| Safety outcome only (d) | 2 | 7 |
| Efficacy outcome model (a), (c) (n=23) | ||
| SPRINT primary composite outcome* | 21 | 91 |
| Safety outcome model (b), (d) (n=14) | ||
| Composite outcome | 8 | 57 |
| Single outcome for each prediction model | 6 | 43 |
| Safety outcome frequencies used in the model | ||
| Hypotension | 9 | 64 |
| Syncope | 9 | 64 |
| Electrolyte abnormality | 9 | 64 |
| Acute kidney injury or acute renal failure | 9 | 64 |
| Bradycardia | 6 | 43 |
| Injurious fall | 6 | 43 |
| Model approach (n=29) | ||
| Multivariable Cox proportional hazards | 11 | 38 |
| Multivariable Cox proportional hazards and machine learning† | 9 | 31 |
| Machine learning only† | 5 | 17 |
| Others | 4 | 14 |
| Absolute net benefit calculated (n=29) | 10 | 34 |
| Risk prediction tools (n=29) | ||
| Risk prediction tools developed | 7 | 24 |
| Risk prediction tools provided | 2 | 7 |
| Clinical scores developed (n=29) | ||
| Efficacy clinical scores | 4 | 14 |
| Safety clinical scores | 2 | 7 |
| Efficacy/safety combined clinical scores | 2 | 7 |
| Risk prediction tools/clinical scores provided in a usable format (n=29) | 9 | 31 |
| Web-based risk calculators | 2 | 7 |
| Risk equation | 1 | 3 |
| Clinical scores | 3 | 10 |
| Risk stratification algorithms | 3 | 10 |
*Myocardial infarction, acute coronary syndrome, stroke, heart failure or death from cardiovascular causes.
†Machine learning techniques include least absolute shrinkage and selection operator (LASSO), generalised, unbiased, interaction detection and estimation (GUIDE) regression tree, weighted k-nearest neighbour model, support vector machines, supervised learning, elastic net regularisation, elastic net binary linear classifier, recursive partition model, random forest, random survival forest, causal forest, boosted classification trees, supervised learning classification and regression trees (CART).
CKD, chronic kidney disease; SPRINT, Systolic Blood Pressure Intervention Trial.
Variables used in the prediction models
| Efficacy model (abstract, n=23) | Safety model (abstract, n=14) | Efficacy/safety combined models (abstract, n=4) | |
| Candidate variables | |||
| Numbers (%) specified in the abstract | 11 (48%) | 6 (43%) | 2 (50%) |
| Median number of candidate variables (IQI, range) | 21 (IQI: 18–27, | 20 (IQI: 17–26, | 24 (IQI: 22–26, |
| All baseline variables/candidate variables | 5 (22%) | 5 (36%) | 1 (25%) |
| All baseline+blood pressure trajectory | 2 (9%) | – | – |
| Unclear/not available/other | 5 (22%) | 3 (21%) | 1 (25%) |
| Final variables | |||
| Clearly presented | 15 (65%) | 10 (71%) | 2 (50%) |
| Median number of final variables (IQI, range) | 7 (IQI: 5–9, | 7 (IQI: 5–11, | 12.5 (IQI: 9–16, |
| Unclear/not specified | 7 (30%) | 4 (29%) | 2 (50%) |
| All baseline variables | 1 (4%) | – | – |
One abstract may report both efficacy and safety models separately, and this abstract is counted twice, as an efficacy model abstract and a safety model abstract.
One abstract may build and report multiple efficacy models, but they are counted as one abstract here.
Note, this table shows the number of abstracts reporting an efficacy, a safety or a combined prediction model.
IQI, interquartile interval.
Figure 2This figure is a bar chart that shows the frequency of variables included in the efficacy, safety and combined efficacy/safety models for the submissions included in the systematic evaluation. The x-axis lists the variables (with abbreviations defined in the footnote) and the y-axis shows the number of models that included each variable in their final prediction models. AGECAT, age category; ASA, daily aspirin use; ASCVD, atherosclerotic cardiovascular disease risk; BG, serum glucose; BMI, body mass index; CKD, indicator of eGFR <60 mL/min/1.73m2; CLINCVDHX, history of clinical cardiovascular disease; DBP, diastolic blood pressure; EGFR, estimated glomerular filtration rate; FRS, indicator whether 10-year Framingham Risk Score is >15%; HDL, high-density lipoprotein cholesterol; HTNRX, number of distinct antihypertensive agents prescribed; INT/NITX, treatment assignment (either intensive or standard treatment); SBP, systolic blood pressure; SCR, serum creatinine; STATIN, on any statin medication; SUBCLINCVDHX, history of subclinical cardiovascular disease; TC, total cholesterol; TG, triglycerides; UACR, urine albumin/creatinine ratio.
Prediction model performance measures
| Performance measures | Efficacy model | Safety model | Efficacy/safety combined model | |||
| Abstract, N | % | Abstract, N | % | Abstract, N | % | |
| Total number of abstracts | 23 | 100% | 14 | 100% | 4 | 100% |
| Number of abstracts that reported any model performance measures | 14 | 61% | 9 | 64% | 1 | 25% |
| Discrimination measures | ||||||
| C-statistics from development | 6 | 26% | 5 | 36% | – | – |
| Median (IQI, range)* | 0.70 | (IQI: 0.69–0.71, | 0.68 | (IQI: 0.68–0.70, | – | – |
| Median (IQI, range) for the best-case scenario† | 0.71 | (IQI: 0.70–0.77, | 0.69 | (IQI: 0.68–0.78, | ||
| Median (IQI, range) for the worst-case scenario‡ | 0.69 | (IQI: 0.63–0.70, | 0.62 | (IQI: 0.61–0.68, | ||
| C-statistics from internal validation | 7 | 30% | 4 | 29% | - | - |
| Median | 0.69 | (IQI: 0.69–0.71, | 0.68 | (IQI: 0.66–0.72, | - | - |
| C-statistics from external validation | – | – | – | – | – | – |
| Calibration measures | 6 | 26% | 5 | 36% | – | – |
| Internal validation | 13 | 57% | 9 | 64% | 3 | 75% |
| Bootstrapping | 7 | 30% | 6 | 43% | – | – |
| Cross-validation | 5 | 22% | 2 | 14% | 1 | 25% |
| Split-sample | 1 | 4% | 1 | 7% | 2 | 50% |
| External validation | 2 | 9% | 1 | 7% | – | – |
| Correlation between efficacy and safety models | 1 | 4% | – | – | – | – |
This table shows number of abstracts that reported efficacy, safety or combined prediction model. One abstract may report both efficacy and safety models separately, and this abstract was included both in the efficacy model abstract and in the safety model abstract.
*In case of multiple C-statistics from one abstract, the median of the ranges was used to summarise the data (two abstracts reported multiple C-statistics).
†Best-case scenario is using the highest C-statistics in case the abstract provided ranges of C-statistics from multiple different models.
‡Worst-case scenario is using the highest C-statistics in case the abstract provided ranges of C-statistics from multiple different models.