| Literature DB >> 30616086 |
Darren S Thomas1, Aleksandra Gentry-Maharaj2, Andy Ryan3, Evangelia-Ourania Fourkala1, Sophia Apostolidou2, Matthew Burnell2, Wendy Alderton4, Julie Barnes4, John F Timms1, Usha Menon5.
Abstract
BACKGROUND: Electronic health records are frequently used for cancer epidemiology. We report on their quality for ascertaining colorectal cancer (CRC) in UK women.Entities:
Keywords: Colorectal neoplasms; Death certificates; Electronic health records; Hospital episode statistics; Registries; Self report
Mesh:
Year: 2019 PMID: 30616086 PMCID: PMC6363963 DOI: 10.1016/j.canep.2018.11.011
Source DB: PubMed Journal: Cancer Epidemiol ISSN: 1877-7821 Impact factor: 2.984
Fig. 1Study overview. All women enrolled in the UKCTOCS were monitored for CRCs diagnosed since being randomised via linkage to electronic health records, self-reporting, and by information discerned via the trial itself. The treating clinician or general practitioner of each woman notified to us were contacted for clinician’s confirmation of all events. Apparent performances of electronic health records and self-reporting were assessed against the outcome of clinician’s confirmation. Completeness of cancer registrations at 19 May 2010 (England & Wales)/ 23 February 2009 (Northern Ireland) was compared to those received by 04 September 2016 (England & Wales)/ 15 April 2015 (Northern Ireland). 1 England only. Abbreviations: UKCTOCS, UK Collaborative Trial of Ovarian Cancer Screening; CR, cancer registration; DC, death certification; HES, Hospital Episode Statistics; SR, self-reporting; CRCQ, colorectal cancer questionnaire; HC, clinician’s confirmation; GP, general practitioner; IMD, Index of Multiple Deprivation; EHR, electronic health records (CR, DC, HES).
Non-responders show notable similarities to True Positives and differences to False Positives.
| Cohort | ||||
|---|---|---|---|---|
| Responders (TP) | Responders (FP) | Non-responders (NR) | ||
| Median (Range) | ||||
| Age | 72 (57–83) | 68 (57–82) | 72 (57–83) | NR vs. FP (0.018) |
| BMI (Kg m-2) | 25.8 (10.3–110.8) | 26.4 (18.7–150.0) | 25.6 (0.1–47.2) | 0.753 |
| IMD score | 13.6 (1.6–73.9) | 11.4 (2.5–74.4) | 16.4 (1.6–70.8) | NR vs. TP (0.011) |
| Count (%) | ||||
| Cohort size | 514 (100) | 127 (100) | 425 (100) | |
| Alcohol | NR vs. FP (< 0.001) | |||
| Non-drinker | 83 (16) | 16 (13) | 71 (17) | |
| < 1 unit a day | 178 (35) | 67 (53) | 124 (29) | |
| ≥ 1 unit a day | 71 (14) | 37 (29) | 55 (13) | |
| Missing | 182 (35) | 7 (6) | 175 (41) | |
| Deaths | NR Vs. TP (< 0.001) | |||
| Alive | 390 (76) | 121 (95) | 263 (62) | |
| Deceased | 124 (24) | 6 (5) | 162 (38) | |
| Education | NR vs. FP (< 0.001) | |||
| Low | 131 (25) | 50 (39) | 104 (24) | |
| High | 89 (17) | 43 (34) | 54 (13) | |
| Other | 106 (21) | 25 (20) | 95 (22) | |
| Missing | 188 (37) | 9 (7) | 172 (40) | |
| Ethnicity | 0.821 | |||
| White | 493 (96) | 124 (98) | 411 (97) | |
| Black | 9 (2) | 2 (2) | 7 (2) | |
| Other | 9 (2) | 1 (1) | 3 (1) | |
| Missing | 3 (1) | 0 (0) | 4 (1) | |
| Smoking | NR vs. FP (< 0.001) | |||
| Ever | 109 (21) | 43 (34) | 120 (28) | |
| Never | 187 (36) | 64 (50) | 135 (32) | |
| Missing | 218 (42) | 20 (16) | 170 (40) | |
| Notifications | 1,173 (100) | 152 (100) | 841 (100) | NR vs. TP (0.001) |
| CR | 453 (39) | 25 (16) | 312 (37) | |
| DC | 98 (8) | 2 (1) | 125 (15) | |
| HES | 327 (28) | 22 (14) | 220 (26) | |
| SR | 211 (18) | 95 (62) | 137 (16) | |
| Trial | 84 (7) | 8 (5) | 47 (6) | |
p values refer to the multivariable test (Kruskal-Wallis/Fisher’s Exact) if no significance detected or the Bonferroni-adjusted, pairwise post-hoc test(s) (pairwise Wilcox rank sum/χ2) if significance detected.
at clinician’s confirmation. Abbreviations: TP; True Positives (Responders); FP, False Positives (Responders); NR, Non-responders; CRCQ, Colorectal Cancer Questionnaire.
Distribution of true and false notifications by data source.
Performance estimates for electronic health records and self-reporting.
| Dataset | Sensitivity | PPV | Specificity | NPV |
|---|---|---|---|---|
| CR | 0.92 (0.90–0.94) | 0.95 (0.92–0.97) | 1.00 (1.00–1.00) | 1.00 (1.00–1.00) |
| CR | 0.99 (0.97–1.00) | 0.95 (0.93–0.97) | — | — |
| DC | 0.97 (0.92–0.99) | 0.98 (0.93–1.00) | 0.97 (0.97–0.97) | 1.00 (1.00–1.00) |
| HES | 0.82 (0.78–0.86) | 0.94 (0.91–0.96) | 1.00 (1.00–1.00) | 1.00 (1.00–1.00) |
| SR | 0.91 (0.86–0.94) | 0.69 (0.63–0.74) | 1.00 (1.00–1.00) | 1.00 (1.00–1.00) |
relative to clinician’s confirmation.
relative to expected cases.
1–9 years curation (median 4.1, IQR 3.2).
6–14 years curation (median 9.4, IQR 3.2). Abbreviations: PPV, positive predictive value; NPV, negative predictive value; CIs, confidence intervals; CR, cancer registration; DC, death certificate; HES, Hospital Episode Statistics; SR, self-reporting.
Fig. 2Sensitivity of cancer registrations curated after a minimum of 1 & 6.3 years. Sensitivities by year of diagnosis (2002–10 inclusively) were 100, 92, 100, 99, 99, 99, 73, 88, and 91% with allowance of 1.0–9.1 years curation (CR update in 2010/11) and 100, 92, 100, 99, 99, 100, 100, 98, and 100% when supplemented with registrations received until 2015/16 (6.3–14.4 years curation).
Variables associated with self-reporting concordant with clinician’s confirmation.
| Variable | OR | |
|---|---|---|
| (Intercept) | 0.22 (0.01–7.53) | 0.395 |
| Age (year) | 1.05 (1.01–1.10) | 0.026* |
| BMI (Kg/m2) | 0.98 (0.91–1.05) | 0.504 |
| IMD score | 1.01 (0.98–1.04) | 0.441 |
| Alcohol | ||
| 0 | 1.00 | |
| < 1 U/day | 0.48 (0.20–1.13) | 0.091 |
| > 1 U/day | 0.38 (0.15–1.00) | 0.051 |
| Smoking | ||
| Never | 1.00 | |
| Ever | 1.18 (0.63–2.19) | 0.604 |
| Education | ||
| Low | 1.00 | |
| High | 1.19 (0.60–2.36) | 0.613 |
| Other | 0.86 (0.41–1.83) | 0.698 |
Adjusted for all variables listed. n 353. Age at self-report. Abbreviations: OR, odds ratio; BMI, Body Mass Index; IMD, Index of Multiple Deprivation.