| Literature DB >> 36230702 |
Pradeep S Virdee1, Julietta Patnick2, Peter Watkinson3, Tim Holt1, Jacqueline Birks4.
Abstract
Colorectal cancer has low survival rates when late-stage, so earlier detection is important. The full blood count (FBC) is a common blood test performed in primary care. Relevant trends in repeated FBCs are related to colorectal cancer presence. We developed and internally validated dynamic prediction models utilising trends for early detection. We performed a cohort study. Sex-stratified multivariate joint models included age at baseline (most recent FBC) and simultaneous trends over historical haemoglobin, mean corpuscular volume (MCV), and platelet measurements up to baseline FBC for two-year risk of diagnosis. Performance measures included the c-statistic and calibration slope. We analysed 250,716 males and 246,695 females in the development cohort and 312,444 males and 462,900 females in the validation cohort, with 0.4% of males and 0.3% of females diagnosed two years after baseline FBC. Compared to average population trends, patient-level declines in haemoglobin and MCV and rise in platelets up to baseline FBC increased risk of diagnosis in two years. C-statistic: 0.751 (males) and 0.763 (females). Calibration slope: 1.06 (males) and 1.05 (females). Our models perform well, with low miscalibration. Utilising trends could bring forward diagnoses to earlier stages and improve survival rates. External validation is now required.Entities:
Keywords: blood test; colorectal cancer; full blood count; joint modelling of longitudinal and time-to-event data; prediction model; primary care
Year: 2022 PMID: 36230702 PMCID: PMC9563332 DOI: 10.3390/cancers14194779
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.575
Figure 1Flow of (dummy) longitudinal data for two-year risk of colorectal cancer diagnosis. Red X indicates tests that were excluded.
Figure 2Patient flow diagram. 1 Number of patients available in the CPRD data extract. Abbreviations: FBC = full blood count; NCRAS = National Cancer Registration and Analysis Service; Hb = haemoglobin; MCV = mean corpuscular volume.
Summary of patient characteristics.
| Summary Statistic | Males | Females | ||
|---|---|---|---|---|
| Diagnosed | Not Diagnosed | Diagnosed | Not Diagnosed | |
|
| ||||
|
| 865 (0.4%) | 249,851 (99.6%) | 677 (0.3%) | 246,018 (99.7%) |
|
| 70.9 (10.0) | 60.7 (13.0) | 73.2 (11.0) | 61.9 (14.6) |
|
| 40–95 | 40–104 | 40–96 | 40–108 |
|
| ||||
|
|
| |||
|
|
|
|
| |
|
| 1,040 (0.3%) | 311,404 (99.7%) | 1,200 (0.3%) | 461,700 (99.7%) |
|
| 71.6 (10.2) | 60.6 (13.0) | 73.4 (11.2) | 61.7 (14.6) |
|
| 40–95 | 40–109 | 40–98 | 40–107 |
1 Age (years) at baseline FBC.
Cox sub-model from the joint models.
| Variable | Males | Females |
|---|---|---|
| HR (95% CI) | HR (95% CI) | |
| Age 2 (years) 1 | 1.015 (1.013, 1.017) | 1.014 (1.012, 1.016) |
| Age 2 × log(Age) (years) 1 | 0.997 (0.997, 0.997) | 0.997 (0.997, 0.998) |
| Trend: haemoglobin (g/dL) 2 | 0.868 (0.824, 0.916) | 0.863 (0.805, 0.926) |
| Trend: mean cell volume (fL) 2 | 0.996 (0.983, 1.009) | 0.986 (0.972, 1.000) |
| Trend: platelets (1012/L) 2 | 1.001 (0.999, 1.002) | 1.002 (1.001, 1.003) |
|
|
|
|
1 Age (years) at baseline FBC (most recent FBC available prior to two years before diagnosis/censor). 2 These HRs indicate how an increase in the patient’s blood parameter from the average population trend effects risk of diagnosis. 3 Breslow estimate. Abbreviations: HR = hazard ratio; CI = confidence interval.
Performance measures of the joint models.
| Performance Measure | Males | Females | ||
|---|---|---|---|---|
| Development | Validation | Development | Validation | |
| Brier score | 0.0034 | 0.0033 | 0.0027 | 0.0028 |
| RD2 | 0.28 | 0.30 | 0.31 | 0.34 |
| C-statistic | 0.739 | 0.751 | 0.753 | 0.763 |
| D-statistic | 1.27 | 1.33 | 1.38 | 1.46 |
| Calibration slope | 1.00 | 1.06 | 1.00 | 1.05 |
Figure 3Calibration plots for the joint models. Abbreviations: KM = Kaplan–Meier.
Diagnostic accuracy measures of the joint models (validation cohort).
| Risk Centile | Risk Cut-Off | True Positives | False Positives | True Negatives | False Negatives | Sensitivity (%) | Specificity (%) | PPV (%) | NPV (%) |
|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||
| 75% | 0.3670% | 600 | 77511 | 233893 | 440 | 57.69 | 75.11 | 0.77 | 99.81 |
| 80% | 0.4036% | 505 | 61984 | 249420 | 535 | 48.56 | 80.10 | 0.81 | 99.79 |
| 85% | 0.4406% | 401 | 46466 | 264938 | 639 | 38.56 | 85.08 | 0.86 | 99.76 |
| 90% | 0.4839% | 291 | 30954 | 280450 | 749 | 27.98 | 90.06 | 0.93 | 99.73 |
| 95% | 0.5525% | 180 | 15443 | 295961 | 860 | 17.31 | 95.04 | 1.15 | 99.71 |
| 99% | 0.7232% | 49 | 3076 | 308328 | 991 | 4.71 | 99.01 | 1.57 | 99.68 |
|
| |||||||||
| 75% | 0.2767% | 710 | 115018 | 346682 | 490 | 59.17 | 75.09 | 0.61 | 99.86 |
| 80% | 0.3043% | 614 | 91967 | 369733 | 586 | 51.17 | 80.08 | 0.66 | 99.84 |
| 85% | 0.3348% | 513 | 68922 | 392778 | 687 | 42.75 | 85.07 | 0.74 | 99.83 |
| 90% | 0.3747% | 397 | 45893 | 415807 | 803 | 33.08 | 90.06 | 0.86 | 99.81 |
| 95% | 0.4426% | 237 | 22909 | 438791 | 963 | 19.75 | 95.04 | 1.02 | 99.78 |
| 99% | 0.6446% | 75 | 4554 | 457146 | 1125 | 6.25 | 99.01 | 1.62 | 99.75 |
Figure 4ROC curves for the joint models and ColonFlag for males (left) and females (right) (validation cohort).