| Literature DB >> 34906262 |
Orouba Almilaji1,2, Gwilym Webb3, Alec Maynard4, Thomas P Chapman5, Brian S F Shine6, Antony J Ellis7, John Hebden8, Sharon Docherty9, Elizabeth J Williams10, Jonathon Snook10.
Abstract
BACKGROUND: Using two large datasets from Dorset, we previously reported an internally validated multivariable risk model for predicting the risk of GI malignancy in IDA-the IDIOM score. The aim of this retrospective observational study was to validate the IDIOM model using two independent external datasets.Entities:
Keywords: External validation; Gastrointestinal cancer; IDIOM app; Iron deficiency anaemia; TRIPOD; Temporal validation
Year: 2021 PMID: 34906262 PMCID: PMC8672477 DOI: 10.1186/s41512-021-00112-8
Source DB: PubMed Journal: Diagn Progn Res ISSN: 2397-7523
Risk groups cut-offs after regulating IDIOM model based on the quartiles of PPV
| PPV quarters | PPV values range % | Corresponding predicted risk cut-offs % | Risk group |
|---|---|---|---|
| Lower half of the 1st quarter of PPVa | [8.4-9.4] | ≤ 1.18 | Very low risk |
| Upper half of the 1st quarter of PPV | ]9.4-10.8] | ]1.18-2.16] | Low risk |
| 2nd quarter of PPV | ]10.8-14.7] | ]2.16-4.24] | Moderate risk |
| 3rd quarter of PPV | ]14.7-19.6] | ]4.24-7.97] | High risk |
| 4th quarter of PPV | > 19.6 | > 7.97 | Very high risk |
aThe risk group at which PPV values are in the lower quarter, and NPV = 100
PPV is the number of positive cases that were correctly classified divided by the total number of positive cases predicted. NPV is the number of negative cases that were correctly classified divided by the total number of negative cases predicted
Descriptive statistics for the three datasets
| Dataset | Dorset | Oxford | Sheffield | |
|---|---|---|---|---|
| Dataset size | 2390 | 1117 | 474 | |
| 200 (8.4%) | 86 (7.7%) | 36 (7.6%) | ||
| 862 (36%) | 446 (40%) | 227 (48%) | ||
| 71 (16, 96) | 74 (22, 97) | 69 (18, 93) | ||
| 104 (32, 159) | 91 (29, 129) | 104 (54, 152) | ||
| 80 (53, 112) | 81 (55, 125) | 80 (32, 104) | ||
Fig. 1Receiver operating characteristic curve shows the sensitivity on y-axis, and specificity on x-axis for the Dorset (black), Oxford (dark grey), Sheffield (grey) and combined validation (dotted black) datasets with the highest GMean value in each dataset. AUC, area under curve; GMean, geometric mean of sensitivity and specificity
Fig. 2Flexible calibration curve for the combined external datasets, showing the relationship between the estimated risks (on the x-axis) and the observed proportion of events (on the y-axis)
Fig. 3Decision curve analysis for GI investigation using Dorset data. Grey line: penalised IDIOM model. Black line: investigate no-one strategy. Dashed line: investigate all strategy. The vertical axis displays standardised net benefit. The horizontal axis shows the risk thresholds
Fig. 4Decision curve analysis for GI investigation using the combined external datasets. Grey line: penalised IDIOM model. Black line: ‘investigate no-one’ strategy. Dashed line: ‘investigate all’ strategy. The vertical axis displays standardised net benefit. The horizontal axis shows the risk thresholds