| Literature DB >> 32213167 |
Jennifer Anne Cooper1, Ronan Ryan2, Nick Parsons3, Chris Stinton3, Tom Marshall2, Sian Taylor-Phillips3.
Abstract
BACKGROUND: The database used for the NHS Bowel Cancer Screening Programme (BCSP) derives participant information from primary care records. Combining predictors with FOBTs has shown to improve referral decisions and accuracy. The richer data available from GP databases could be used to complement screening referral decisions by identifying those at greatest risk of colorectal cancer. We determined the availability of data for key predictors and whether this information could be used to inform more accurate screening referral decisions.Entities:
Keywords: Colorectal neoplasms; Early detection of cancer; Electronic health records; Mass screening; Occult blood; Prediction model
Mesh:
Year: 2020 PMID: 32213167 PMCID: PMC7093989 DOI: 10.1186/s12876-020-01206-1
Source DB: PubMed Journal: BMC Gastroenterol ISSN: 1471-230X Impact factor: 3.067
Variable completeness and univariable associations with colorectal cancer and polyps
| Variable | Percentage with this variable recorded ( | Prevalence of variable (%) | Hazard Ratio (95% Confidence Interval) | Standard Error | ||||
|---|---|---|---|---|---|---|---|---|
| Sex | 100 | |||||||
| Male (baseline) | – | 46.74% | (136,518/292,059) | – | (---) | – | – | |
| Female | – | 53.26% | (155,541/292,059) | 0.655 | (0.609–0.706) | 0.025 | 0.000* | |
| Age at Latest FOBT (continuous) | 100 | Mean 66.43 (SD 4.47) | 1.025 | (1.017–1.033) | 0.004 | 0.000* | ||
| 100 | (292,059/292,059) | |||||||
| BCSP FOB test normal (baseline) | 97.82 | (285,697/292,059) | 97.82% | (285,697/292,059) | – | (---) | – | – |
| BCSP FOB test abnormal | 2.18 | (6362/292,059) | 2.18% | (6362/292,059) | 55.936 | (51.988–60.183) | 2.089 | 0.000* |
| Previous Positive BCSP FOBTs (continuous) | (Recorded if observed) | 0 (99.47%) 1 (0.51%) 2 (0.02%) 3 (0.00068%) | (290,515/292,059) (1488/292,059) (54/292,059) (2/292,059) | 5.028 | (4.180–6.047) | 0.473 | 0.000* | |
| Previous Negative BCSP FOBTs (continuous) | (Recorded if observed) | 0 (60.44%) 1 (31.18%) 2 (8.03%) 3 (0.34%) 4 (0.0027%) | (176,523/292,059) (91,076/292,059) (23,465/292,059) (987/292,059) (8/292,059) | 0.769 | (0.720–0.821) | 0.026 | 0.000* | |
| Previously screened with a BCSP FOBT | (Recorded if observed) | 39.94% with 60.06% without | (116,641/292,059) (175,418/292,059) | 0.783 | (0.723–0.847) | 0.032 | 0.000* | |
| Alcohol (units per week) (continuous) | 78.00 | (227,792/292,059) | Mean 9.49 (SD 12.27) | 1.010 | (1.008–1.011) | 0.001 | 0.000* | |
| 99.44 | (290,429/292,059) | |||||||
| Never-Smoked | 57.48 | (167,880/292,059) | 57.80% | (167,880/290,429) | ||||
| Ex-Smoker | 33.32 | (97,310/292,059) | 33.51% | (97,310/290,429) | 1.532 | (1.417–1.656) | 0.061 | 0.000* |
| Current Smoker | 8.64 | (25,239/292,059) | 8.69% | (25,239/290,429) | 1.619 | (1.437–1.824) | 0.099 | 0.000* |
| BMI (continuous) | 95.85 | (279,927/292,059) | Mean 27.48 (SD 5.01) | 1.029 | (1.022–1.036) | 0.004 | 0.000* | |
| Primary care FOBT | (Recorded if observed) | 0.01% with 99.99% without | (32/292,059) (292,027/292,059) | 2.868 | (0.404–20.369) | 2.869 | 0.292 | |
| Hb g/dL (continuous) within 365 days prior to the latest FOBT | 44.51 | (129,996/292,059) | Mean 13.92 (SD 1.30) | 0.990 | (0.953–1.029) | 0.019 | 0.606 | |
| Hb < 11 g/dL (reference category ≥11 g/dL) within 365 days prior to the latest FOBT | 44.51 | (129,996/292,059) | 1.50% < 11 g/dL 98.50% ≥11 g/dL | (1947/129,996) (128,049/129,996) | 2.231 | (1.679–2.966) | 0.324 | 0.000* |
| Mean Cell Volume fL (continuous) within 365 days prior to the latest FOBT | 44.33 | (129,481/292,059) | Mean 91.11 (SD 5.08) | 0.996 | (0.986–1.005) | 0.005 | 0.382 | |
| Mean Cell Volume < 80 fL (reference category ≥80 fL) within 365 days prior to the latest FOBT | 44.33 | (129,481/292,059) | 1.60% < 80 fL 98.40% ≥80 fL | (2073/129,481) (127,408/129,481) | 2.419 | (1.856–3.151) | 0.326 | 0.000* |
| Ferritin 15 μg/L (continuous) within 365 days prior to the latest FOBT | 8.59 | (25,082/292,059) | Mean 127.07 (SD 201.66) | 0.999 | (0.998–1.000) | 0.000 | 0.069 | |
| Ferritin < 15 μg/L (reference category ≥15 μg/L) within 365 days prior to the latest FOBT | 8.59 | (25,082/292,059) | 4.99% < 15 μg/L 95.01% ≥15 μg/L | (1252/25,082) (23,830/25,082) | 2.054 | (1.434–2.943) | 0.377 | 0.000* |
| Platelet Count × 109/L (continuous) within 365 days prior to the latest FOBT | 44.40 | (129,685/292,059) | Mean 245.61 (SD 66.00) | 1.000 | (0.999–1.001) | 0.000 | 0.691 | |
| Platelet Count > 400 × 109/L (reference category ≤400 109/L) within 365 days prior to the latest FOBT | 44.40 | (129,685/292,059) | 2.13% > 400 × 109/L) 97.87% ≤400 109/L | (2764/129,685) (126,921/129,685) | 1.155 | (0.837–1.594) | 0.190 | 0.379 |
| GP has ordered a blood test 365 days prior to their latest BCSP FOBT | (Recorded if observed) | 44.72% with 55.28% without | (130,611/292,059) (161,448/292,059) | 1.441 | (1.339–1.550) | 0.054 | 0.000* | |
| Previous polyps diagnosed | (Recorded if observed) | 2.49% with 97.51% without | (7269/292,059) (284,790/292,059) | 3.181 | (2.767–3.658) | 0.226 | 0.000* | |
| Diabetes | (Recorded if observed) | 11.05% with 88.95% without | (32,272/292,059) (259,787/292,059) | 1.470 | (1.329–1.627) | 0.076 | 0.000* | |
| Crohn’s disease | (Recorded if observed) | 0.30% with 99.70% without | (884/292,059) (291,175/292,059) | 1.038 | (0.539–1.997) | 0.346 | 0.911 | |
| Ulcerative Colitis | (Recorded if observed) | 0.61% with 99.39% without | (1796/292,059) (290,263/292,059) | 1.686 | (1.177–2.416) | 0.309 | 0.004* | |
| Irritable Bowel Syndrome | (Recorded if observed) | 9.28% with 90.72% without | (27,103/292,059) (264,956/292,059) | 1.141 | (1.013–1.286) | 0.069 | 0.030* | |
| Diverticulitis | (Recorded if observed) | 6.37% with 93.63% without | (18,606/292,059) (273,453/292,059) | 1.226 | (1.069–1.406) | 0.086 | 0.004* | |
| Venous Thromboembolism | (Recorded if observed) | 0.31% with 99.69% without | (916/292,059) (291,143/292,059) | 1.421 | (0.824–2.451) | 0.395 | 0.206 | |
| Family History of Gastro-Intestinal Cancer | (Recorded if observed) | 1.51% with 98.49% without | (4423/292,059) (287,636/292,059) | 1.591 | (1.251, 2.023) | 0.195 | 0.000* | |
| Constipation | (Recorded if observed) | 1.46% with, 98.54% without | (4260/292,059) (287,799/292,059) | 1.654 | (1.305–2.097) | 0.200 | 0.000* | |
| Diarrhoea | (Recorded if observed) | 2.01% with, 97.99% without | (5867/292,059) (286,192/292,059) | 1.779 | (1.464–2.161) | 0.177 | 0.000* | |
| Loss of Appetite | (Recorded if observed) | 0.04% with, 99.96% without | (117/292,059) (291,942/292,059) | 2.614 | (0.843–8.109) | 1.510 | 0.096 | |
| Flatulence | (Recorded if observed) | 0.17% with 99.83% without | (498/292,059) (291,561/292,059) | 2.481 | (1.439–4.278) | 0.670 | 0.001* | |
| Tiredness | (Recorded if observed) | 2.46% with 97.54% without | (7173/292,059) (284,886/292,059) | 1.358 | (1.108–1.665) | 0.141 | 0.003* | |
| Weight Loss | (Recorded if observed) | 0.36% with 99.64% without | (1057/292,059) (291,002/292,059) | 1.705 | (1.073–2.710) | 0.403 | 0.024* | |
| Change in Bowel Habit | (Recorded if observed) | 0.57% with, 99.43% without | (1655/292,059) (290,404/292,059) | 2.610 | (1.924–3.539) | 0.406 | 0.000* | |
| Abdominal Pain† | (Recorded if observed) | 7.12% with, 92.88% without | (20,790/292,059) (271,269/292,059) | 1.425 | (1.261–1.610) | 0.089 | 0.000* | |
| Abdominal Pain | (Recorded if observed) | 4.86% with 95.14% without | (14,206/292,059) (277,853/292,059) | 1.424 | (1.232–1.646) | 0.105 | 0.000* | |
| Abdominal Mass | (Recorded if observed) | 0.06% with 99.94% without | (165/292,059) (291,894/292,059) | 1.258 | (0.314–5.032) | 0.890 | 0.746 | |
| Rectal Bleeding/melaena | (Recorded if observed) | 0.92% with 99.08% without | (2694/292,059) (289,365/292,059) | 3.118 | (2.504–3.884) | 0.349 | 0.000* | |
| Antispasmodic drug prescription | (Recorded if observed) | 3.31% with 96.69% without | (9661/292,0590 (282,398/292,059) | 1.450 | (1.221–1.721) | 0.127 | 0.000* | |
| Anti-motility drug prescription | (Recorded if observed) | 1.24% with 98.76% without | (3613/292,0590 (288,446/292,059) | 1.535 | (1.176–2.005) | 0.209 | 0.002* | |
| Laxative Drug | (Recorded if observed) | 7.96% with 92.04% without | (23,234/292,059) (268,825/292,059) | 1.390 | (1.235–1.564) | 0.084 | 0.000* | |
* Significant at the p value of 0.05
† Includes prescriptions of anti-spasmodic drug.
FOBT Faecal Occult Blood Test, BCSP bowel cancer screening programme, BMI body mass index, MCV mean cell volume, IBS irritable bowel syndrome, Hb haemoglobin concentration
Cox regression multivariable prediction model for participants with a FOBT result (either positive or negative) N = 191,081, 1676 events
| Variable | Hazard Ratio | Observed Coefficient | Bootstrapped Standard Error | z | [95% Confidence Intervals] | ||
|---|---|---|---|---|---|---|---|
| 70.173 | 4.251 | 0.057 | 74.19 | <0.001 | 4.139 | 4.363 | |
| Ex-smoker (reference category non-smoker) | 1.141 | 0.132 | 0.050 | 2.61 | 0.009 | 0.033 | 0.230 |
| Current smoker (reference category non-smoker) | 1.265 | 0.235 | 0.090 | 2.61 | 0.009 | 0.058 | 0.411 |
| − | 3.147 | 1.180 | 2.67 | 0.008 | 0.835 | 5.460 | |
| − | −4.177 | 1.557 | −2.68 | 0.007 | −7.229 | −1.125 | |
| 0.850 | −0.162 | 0.054 | −2.99 | 0.003 | −0.269 | −0.056 | |
| − | 5.859 | 2.064 | 2.84 | 0.005 | 1.814 | 9.904 | |
| − | −0.419 | 0.154 | −2.71 | 0.007 | −0.722 | −0.116 | |
| 0.862 | −0.149 | 0.049 | −3.05 | 0.002 | −0.245 | −0.053 | |
| 1.560 | 0.444 | 0.168 | 2.64 | 0.008 | 0.115 | 0.774 | |
Abbreviations: CI confidence intervals, FOBT faecal occult blood test (specifically guaiac). The continuous variables (Age/10) has been centred at 6.639, (Age/10)at 44.077, ((Alcohol + 1)/100)at 0.011, ((Alcohol + 1)/100)at 0.001, Previous negative BCSP FOBTs at 0.507. A ‘*’ indicates that the variable is treated as continuous.
0.9932 baseline CRC free survival at 2 years S0(2) (the re-estimated shrunken baseline CRC free survival at 2 years was also 0.9932 when rounded) the heuristic shrinkage factor was 0.998.
Where S(2) is the survival probability at 2 years (probability of not being diagnosed with colorectal cancer/polyps)
P = 1 – S(2).
Where P is the probability of colorectal cancer/polyp being diagnosed within 2 years of the latest FOBT date; x1Latest FOBT result; x2ex-smoker; x3current smoker; x4alcohol consumption; x5sex; x6age at FOBT; x7Number of previous negative BCSP FOBTs < 80 fL; x8Family History of GI Cancer.
The dataset derived for the multivariable modelling analysis had 1676 colorectal cancers and polyp diagnoses (sample population = 191,081) and considered 17 degrees of freedom in the model building process giving 98.59 events per variable. The final model had 10 degrees of freedom with an AIC of 34,050.33 and BIC 34,104.77 (N = 1676 when calculating BIC). Overall model fit was assessed using adjusted Rwhich was 0.600 (bootstrapped CI 100 reps: 0.580, 0.622) and adjusted D was 2.509. Regular Rwas 0.602 with a D statistic of 2.519. The linear predictor from the final model had a mean of − 0.021 and a standard deviation of 1.630 (range: -446.458 to 5.048, IQR: -0.235 to 0.781).
Cox regression multivariable prediction model for patients with negative FOBTs only n = 187,470, 735 events
| Variable | Hazard Ratio | Observed Coefficient | Bootstrapped Standard Error | z | [95% Confidence Intervals] | ||
|---|---|---|---|---|---|---|---|
| Ex-smoker (reference category non-smoker) | 1.238 | 0.214 | 0.078 | 2.75 | 0.006 | 0.061 | 0.366 |
| Current smoker (reference category non-smoker) | 1.499 | 0.405 | 0.148 | 2.74 | 0.006 | 0.116 | 0.694 |
| 0.777 | −0.252 | 0.074 | −3.42 | 0.001 | −0.397 | −0.108 | |
| − | − 1581.596 | 639.251 | −2.47 | 0.013 | − 2834.505 | − 328.687 | |
| − | 1094.918 | 460.929 | 2.38 | 0.018 | 191.514 | 1998.322 | |
| 0.761 | −0.272 | 0.066 | −4.11 | <0.001 | −0.403 | − 0.142 | |
| 1.286 | 0.251 | 0.067 | 3.76 | <0.001 | 0.121 | 0.382 | |
| 1.415 | 0.347 | 0.123 | 2.83 | 0.005 | 0.106 | 0.588 | |
Abbreviations: CI = confidence intervals, FOBT = faecal occult blood test (specifically guaiac). The continuous variables (Age/10)has been centred at 0.023, (Age/10)x ln(Age/10) at 0.043, Previous negative BCSP FOBTs at 0.510. A ‘*’ indicates that the variable is treated as continuous
0.9909 baseline CRC free survival at 2 years S0(2) (the re-estimated shrunken baseline CRC free survival at 2 years was also 0.9909 when rounded) the heuristic shrinkage factor was 0.914 where S(2) is the survival probability at 2 years (probability of not being diagnosed with colorectal cancer/polyps)
P = 1 – S(2).
Where P is the probability of colorectal cancer/polyp being diagnosed within 2 years of the latest FOBT date; x1ex-smoker; x2current smoker; x3sex; x4age at FOBT; x5Previous negative BCSP FOBT; x6GP ordered blood test; x7presence of IBS.
There were 735 events (sample population = 187,470) and considered 16 degrees of freedom giving 45.94 events. The final model had 8 degrees of freedom with an AIC of 16,686.66 and BIC of 16,723.46 (N = 735 when calculating BIC)). Overall model fit was assessed using adjusted Rwhich was 0.066 (bootstrapped CI 100 reps: 0.046, 0.100). Regular Rwas 0.072 (95% CI: 0.047, 0.102) with D statistic of 0.572. The linear predictor from this model had a mean of − 0.021 and a standard deviation of 0.363 (range: -1.418 to 1.206, IQR: -0.287 to 0.211).
Optimism calculated performance for the C statistic, c-slope, D statistic and R2 for the multivariable models
| Statistic | Apparent Performance | Optimism (100 bootstrap replications) | Optimism adjusted performance (apparent minus optimism) |
|---|---|---|---|
| C statistic | 0.863 | 0.002 | 0.860 |
| c-slope | 1.000 | 0.003 | 0.997 |
| D statistic | 2.519 | 0.028 | 2.491 |
| R2 | 0.602 | 0.005 | 0.597 |
| C statistic | 0.604 | 0.007 | 0.597 |
| c-slope | 1.000 | 0.060 | 0.940 |
| D statistic | 0.572 | 0.039 | 0.533 |
| R2 | 0.072 | 0.010 | 0.062 |
Fig. 1Calibration curves for the multivariable models adjusted for optimism and for deciles of risk. Left: Model for participants with both positive and negative FOBTs. Right: Model for participants with negative FOBTs only
Fig. 2PPV and NPV for different thresholds of risk for the prediction model applied to those with negative results and with two year follow up. A PPV of 3% corresponds to a risk probability threshold of 0.0168. The plot displays the locally weighted regression lines of PPV and NPV against the risk probability determined from the model
Fig. 3ROC curve sensitivity and specificity pairs for different thresholds of risk determined from the risk prediction model applied to those with negative results and with two year follow up (n = 24,297). A risk probability cutoff of 0.0168 is indicated on the curve and refers to a PPV level of 3%
2 by 2 table for the FOBT only, model only and a combined approach of FOBT positive plus risk positive at PPV 3% level (probability: 0.0168) for those with negative results
| Diagnostic Positive: Record/diagnosis of CRC (over 2 year follow up) | Diagnostic Negative: No record/diagnosis of CRC (over 2 year follow up) | ||||
|---|---|---|---|---|---|
| CRC | Polyp | Total | Total | ||
| FOBT only | 158 | 291 | |||
| Model Only (negative population only, | 13 | 28 | |||
| Combined | 171 | 319 | |||
| FOBT only | 165 | 219 | |||
| Model Only (negative population only, | 152 | 191 | |||
| Combined | 152 | 191 | |||
FOBT Only: Sensitivity 53.90%, Specificity 96.58%, PPV 34.67%, NPV 98.42%.
Model Only: Sensitivity 10.68%, Specificity 94.61%, PPV 3.00%, NPV 98.51%.
Combined (FOBT Positive or Risk Positive): Sensitivity 58.82%, Specificity 91.38%, PPV 18.67%, NPV 98.51%.