| Literature DB >> 28748189 |
Aamir Khakwani1, Ruth H Jack2, Sally Vernon3, Rosie Dickinson4, Natasha Wood3, Susan Harden5, Paul Beckett6, Ian Woolhouse7, Richard B Hubbard1.
Abstract
In 2014, the method of data collection from NHS trusts in England for the National Lung Cancer Audit (NLCA) was changed from a bespoke dataset called LUCADA (Lung Cancer Data). Under the new contract, data are submitted via the Cancer Outcome and Service Dataset (COSD) system and linked additional cancer registry datasets. In 2014, trusts were given opportunity to submit LUCADA data as well as registry data. 132 NHS trusts submitted LUCADA data, and all 151 trusts submitted COSD data. This transitional year therefore provided the opportunity to compare both datasets for data completeness and reliability. We linked the two datasets at the patient level to assess the completeness of key patient and treatment variables. We also assessed the interdata agreement of these variables using Cohen's kappa statistic, κ. We identified 26 001 patients in both datasets. Overall, the recording of sex, age, performance status and stage had more than 90% agreement between datasets, but there were more patients with missing performance status in the registry dataset. Although levels of agreement for surgery, chemotherapy and external-beam radiotherapy were high between datasets, the new COSD system identified more instances of active treatment. There seems to be a high agreement of data between the datasets, and the findings suggest that the registry dataset coupled with COSD provides a richer dataset than LUCADA. However, it lagged behind LUCADA in performance status recording, which needs to improve over time.Entities:
Year: 2017 PMID: 28748189 PMCID: PMC5521232 DOI: 10.1183/23120541.00003-2017
Source DB: PubMed Journal: ERJ Open Res ISSN: 2312-0541
FIGURE 1Data merging of registry dataset and LUCADA (Lung Cancer Data).
Key patient features across the registry dataset and LUCADA (Lung Cancer Data) for the 132 National Health Service trusts and level of agreement (n=26 001)
| Female | 11 990 (46%) | 11 987 (46%) | ||
| Male | 14 011 (54%) | 14 014 (54%) | 99% | 0.99 |
| <65 | 6032 (23%) | 6025 (23%) | ||
| 65–80 | 14 249 (55%) | 14 242 (55%) | ||
| >80 | 5720 (22%) | 5734 (22%) | 99% | 0.99 |
| No | 6341 (24%) | 7664 (29) | ||
| Yes | 19 660 (76%) | 18 337 (71%) | 89% | 0.73 |
| Small cell | 2975 (11%) | 2958 (11%) | ||
| Carcinoid | 189 (1%) | 234 (1%) | ||
| Nonsmall cell | 22 837 (88%) | 22 809 (88%) | 97% | 0.87 |
| 0 | 3816 (15%) | 4278 (16%) | ||
| 1 | 6550 (25%) | 7869 (30%) | ||
| 2 | 4025 (15%) | 5109 (20%) | ||
| 3 | 3553 (14%) | 4527 (17%) | ||
| 4 | 1152 (4%) | 1424 (5%) | 97% | 0.91#,¶ |
| Missing | 6905 (27%) | 2794 (11%) | 83% | 0.57# |
| IA | 2976 (8%) | 2226 (9%) | ||
| IB | 2296 (6%) | 1714 (7%) | ||
| IIA | 1405 (4%) | 1087 (4%) | ||
| IIB | 1236 (4%) | 1025 (4%) | ||
| IIIA | 3981 (11%) | 3330 (13%) | ||
| IIIB | 2844 (8%) | 2470 (9%) | ||
| IV | 16 758 (47%) | 12 258 (47%) | 96% | 0.90#,¶ |
| Missing | 4022 (11%) | 1891 (7%) | 94% | 0.81# |
#: weighted κ; ¶: excluding missing data.
Treatment data agreement across registry dataset and LUCADA (Lung Cancer Data) for matched patients (n=26 001)
| No | 21 631 | 243 | ||||
| Yes | 713 | 3414 | 2961 (87%) | 3301 (97%) | 96% | 0.86 |
| No | 16 958 | 268 | ||||
| Yes | 1125 | 7650 | 6865 (90%) | 6928 (91%) | 95% | 0.88 |
| No | 17 214 | 1048 | ||||
| Yes | 1370 | 6369 | 6002 (94%) | 6073 (95%) | 91% | 0.77 |
| No | 25 951 | 16 | ||||
| Yes | 23 | 11 | 10 (91%) | 10 (91%) | 99% | 0.36 |
Patient feature comparison for matched and unmatched patients in the registry dataset (n=32 036)
| 26 001 | 6035 | |
| Female | 11 990 (46%) | 2867 (48%) |
| Male | 14 011 (54%) | 3168 (52%) |
| <65 | 6032 (23%) | 1287 (21) |
| 65–80 | 14 249 (55%) | 3023 (50) |
| >80 | 5720 (22%) | 1725 (29) |
| 0 | 3816 (15%) | 394 (7%) |
| 1 | 6550 (25%) | 639 (11%) |
| 2 | 4025 (15%) | 342 (6%) |
| 3 | 3553 (14%) | 384 (6%) |
| 4 | 1152 (4%) | 131 (2%) |
| Missing | 6905 (27%) | 4145 (69%) |
| IA | 2190 (8%) | 502 (8%) |
| IB | 1836 (7%) | 271 (4%) |
| IIA | 1096 (4%) | 184 (3%) |
| IIB | 1018 (4%) | 131 (2%) |
| IIIA | 3267 (13%) | 385 (6%) |
| IIIB | 2353 (9%) | 230 (4%) |
| IV | 12 699 (49%) | 2443 (41%) |
| Missing | 1542 (6%) | 1889 (31%) |
| Small cell | 2975 (11%) | 382 (6%) |
| Carcinoid | 189 (1%) | 98 (2%) |
| Nonsmall cell | 22 837 (88%) | 5555 (92%) |
| No | 21 825 (84%) | 5252 (87%) |
| Yes | 4176 (16%) | 783 (13%) |
| No | 17 226 (66%) | 5031 (83%) |
| Yes | 8775 (34%) | 1004 (17%) |
| No | 18 262 (70%) | 5198 (86%) |
| Yes | 7739 (30%) | 837 (14%) |
| No | 25 967 (99.9%) | 6026 (99.9%) |
| Yes | 34 (0.1%) | 9 (0.1%) |
LUCADA: Lung Cancer Data.
FIGURE 2Kaplan–Meier curve of survival (in days) by data source. LUCADA: Lung Cancer Data; COSD: Cancer Outcome and Service Dataset.
FIGURE 3Kaplan–Meier curve of survival (in days) by data source adjusted for age, sex, performance status, cancer stage and lung cancer type. LUCADA: Lung Cancer Data; COSD: Cancer Outcome and Service Dataset.