| Literature DB >> 32398093 |
Sarah Lensen1, Archie Macnair1, Sharon B Love2, Victoria Yorke-Edwards1, Nurulamin M Noor1, Meredith Martyn1, Alexandra Blenkinsop1, Carlos Diaz-Montana1, Graham Powell3, Elizabeth Williamson4, James Carpenter1,5, Matthew R Sydes1.
Abstract
BACKGROUND: Clinical trials generally each collect their own data despite routinely collected health data (RCHD) increasing in quality and breadth. Our aim is to quantify UK-based randomised controlled trials (RCTs) accessing RCHD for participant data, characterise how these data are used and thereby recommend how more trials could use RCHD.Entities:
Keywords: RCT; Registry; Routinely collected health data; Systematic review
Mesh:
Year: 2020 PMID: 32398093 PMCID: PMC7218527 DOI: 10.1186/s13063-020-04329-8
Source DB: PubMed Journal: Trials ISSN: 1745-6215 Impact factor: 2.279
Fig. 1Identification of trials from registries. Each trial is counted only once. For instance, trials identified through both release register searches and notification by registries are captured only once. Of 13 registries with release registers available, 10 published comprehensive release registers and three provided a brief lists of projects receiving routinely collected health data (RCHD) on the website
Trial characteristics
| Trial characteristic | Summary extraction: 2013–16, n (%) | Detailed extraction: 2017–18, n (%) | Total, |
|---|---|---|---|
| Screening | 6 (9%) | 10 (11%) | 16 (10%) |
| Treatment | 51 (74%) | 65 (71%) | 116 (73%) |
| Primary prevention | 12 (17%) | 16 (18%) | 28 (18%) |
| Yes | 6 (9%) | 11 (12%) | 17 (11%) |
| No | 63 (91%) | 80 (88%) | 143 (89%) |
| Primary care | 23 (33%) | 18 (20%) | 41 (26%) |
| Secondary care | 46 (67%) | 73 (80%) | 119 (74%) |
| Cancer | 17 (25%) | 30 (33%) | 47 (29%) |
| Cardiovascular and stroke | 21 (30%) | 25 (27%) | 46 (29%) |
| Pregnancy and childbirth | 4 (6%) | 5 (5%) | 9 (6%) |
| Mental/neurological health | 6 (9%) | 6 (7%) | 12 (8%) |
| Infection | 5 (7%) | 3 (3%) | 8 (5%) |
| Endocrine and diabetes | 1 (1%) | 3 (3%) | 4 (3%) |
| Inflammatory disorder | 2 (3%) | 3 (3%) | 5 (3%) |
| Other | 13 (19%) | 16 (18%) | 29 (18%) |
| Drug | 38 (55%) | 38 (42%) | 76 (48%) |
| Surgical | 5 (7%) | 8 (9%) | 13 (8%) |
| Other | 26 (38%) | 45 (49%) | 71 (44%) |
| Survival related | 33 (48%) | 39 (43%) | 72 (45%) |
| Other | 36 (52%) | 52 (57%) | 88 (55%) |
| Individual | 61 (88%) | 75 (82%) | 136 (85%) |
| Cluster | 8 (12%) | 16 (18%) | 24 (15%) |
| 1103 | 2611 | 1590 | |
| 41–170,432 | 53–6,000,000 | 41–6,000,000 | |
| 1–500 | 21 (30%) | 20 (22%) | 41 (26%) |
| 500–5000 | 31 (45%) | 43 (47%) | 74 (46%) |
| >5000 | 17 (25%) | 25 (27%) | 42 (26%) |
| Unclear | 0 (0%) | 3 (3%) | 3 (2%) |
| Yes | 13 (19%) | 19 (21%) | 32 (20%) |
| No (UK only) | 56 (81%) | 69 (76%) | 125 (78%) |
| Unclear | 0 (0%) | 3 (3%) | 3 (2%) |
| Yes | 40 (58%) | 63 (69%) | 103 (64%) |
| No | 10 (14%) | 12 (13%) | 22 (14%) |
| Unclear | 19 (28%) | 16 (18%) | 35 (22%) |
| NA | 2 (2%) | 2 (1%) | |
| NA | 6 (7%) | 6 (4%) | |
| NA | 16 (18%) | 16 (10%) | |
| NA | 2 (2%) | 2 (1%) | |
| NA | 3 (3%) | 3 (2%) | |
| Other | NA | 6 (7%) | 6 (4%) |
| Not yet published | NA | 56 (62%) | 56 (35%) |
| Not captured | 69 (100%) | NA | 69 (43%) |
Comparison of RCTs accessing RCHD with trials evaluated by the HRA in 2015
| RCTs accessing RCHD | HRA in 2015 | |
|---|---|---|
| Primary care | 41 (26%) | 48 (5%) |
| Secondary care | 119 (74%) | 846 (95%) |
| Unclear/missing | 0 | 69 |
| Cancer | 47 (29%) | 168 (17%) |
| Cardiovascular and stroke | 46 (29%) | 121 (13%) |
| Pregnancy and childbirth | 9 (6%) | 30 (3%) |
| Infection | 8 (5%) | 55 (6%) |
| Inflammatory disorder | 5 (3%) | 72 (7%) |
| 76 (48%) | 515 (53%) | |
| Individual | 136 (85%) | 934 (97%) |
| Cluster trial | 24 (15%) | 29 (3%) |
| 17 (11%) | 177 (18%) | |
| 1590 (41–6,000,000) | 275 (6–30,000) | |
| Unclear/missing | 0 | 440 |
| UK only | 125 (78%) | 450 (50%) |
| International trials | 32 (20%) | 443 (50%) |
| Unclear/missing | 0 | 70 |
This table only includes data fields that were comparable between the two sources. Sample size targets in the UK Health Research Authority (HRA) cohort are limited to those not described as phase I/II trials. Data obtained from Clark et al. [11] (2018), including unpublished supplementary appendices [1]. Abbreviations: RCHD routinely collected health data, RCT randomised controlled trial
aClark et al. [11].
bFor recruitment setting and location, the unclear/missing values were omitted from the percentage calculation.
Registries and datasets accessed
| Registry | Total trials | Total trials (2017–2018) | Datasets accessed | |||
|---|---|---|---|---|---|---|
| Death | Hospital visits | Cancer registration | Other | |||
| 108 (68%) | 59 (65%) | 58 (84%) | 34 (68%) | 22 (76%) | 4 (15%) | |
| 35 (22%) | 25 (27%) | 16 (23%) | 13 (26%) | 7 (24%) | 2 (8%) | |
| 15 (9%) | 11 (12%) | 3 (4%) | 6 (12%) | 10 (34%) | 1 (4%) | |
| 9 (6%) | 6 (7%) | 2 (3%) | 5 (10%) | 2 (7%) | 1 (4%) | |
| 7 (4%) | 4 (4%) | 1 (1%) | 0 (0%) | 0 (0%) | 4 (15%) | |
| 7 (4%) | 6 (7%) | 1 (1%) | 6 (12%) | 1 (3%) | 0 (0%) | |
| 6 (4%) | 2 (2%) | 0 (0%) | 0 (0%) | 0 (0%) | 2 (8%) | |
| 4 (3%) | 1 (1%) | 0 (0%) | 1 (2%) | 0 (0%) | 1 (4%) | |
| 3 (2%) | 3 (3%) | 0 (0%) | 0 (0%) | 0 (0%) | 2 (8%) | |
| 3 (2%) | 2 (2%) | 1 (1%) | 1 (2%) | 0 (0%) | 2 (8%) | |
| 2 (1%) | 2 (2%) | 0 (0%) | 0 (0%) | 0 (0%) | 2 (8%) | |
| 2 (1%) | 2 (2%) | 1 (1%) | 1 (2%) | 0 (0%) | 2 (8%) | |
| 2 (1%) | 1 (1%) | 0 (0%) | 0 (0%) | 0 (0%) | 1 (4%) | |
| 2 (1%) | 2 (2%) | 0 (0%) | 1 (2%) | 1 (3%) | 0 (0%) | |
| 2 (1%) | 2 (2%) | 0 (0%) | 0 (0%) | 0 (0%) | 2 (8%) | |
| 1 (1%) | 0 (0%) | 0 (0%) | 0 (0%) | 0 (0%) | 0 (0%) | |
| 1 (1%) | 1 (1%) | 0 (0%) | 0 (0%) | 0 (0%) | 1 (4%) | |
| 1 (1%) | 1 (1%) | 0 (0%) | 0 (0%) | 0 (0%) | 1 (4%) | |
| 1 (1%) | 1 (1%) | 1 (1%) | 0 (0%) | 0 (0%) | 0 (0%) | |
| 1 (1%) | 1 (1%) | 0 (0%) | 0 (0%) | 1 (3%) | 0 (0%) | |
| 1 (1%) | 1 (1%) | 0 (0%) | 0 (0%) | 0 (0%) | 1 (4%) | |
| 1 (1%) | 1 (1%) | 0 (0%) | 0 (0%) | 0 (0%) | 1 (4%) | |
Registries accessed was captured for all 160 trials. Information about datasets accessed from these registries was captured only for those 91 accessing routinely collected health data (RCHD) between 2017 and 2018. The fields are not mutually exclusive as one trial may have accessed data from multiple registries, and multiple datasets can be accessed via a single registry. Percentages are calculated by using the column header denominators. Hospital visits includes all Hospital Episode Statistics (Outpatient, Inpatient, Accident and Emergency, and Critical Care), Patient Episode Database for Wales (PEDW), and Scottish Morbidity Records (SMR). Acronyms: Information Services Division (ISD), Public Health England (PHE), Secure Anonymised Information Linkage (SAIL), Intensive Care National Audit & Research Centre (ICNARC), NHS Wales Informatics Service (NWIS), Paediatric Intensive Care Audit Network (PICANet), Clinical Practice Research Datalink (CPRD), NHS Blood and Transplant (NHSBT), Trauma Audit and Research Network - Major Trauma Audit (TARN), National Emergency Laparotomy Audit (NELA), Neonatal Research Database (NNRD), Public Health Wales (PHW), UK Renal Registry (UKRR), Department of Health (DOH), Falls and Fragility Fractures Audit programme (FFFAP), Honest Broker Service, Northern Ireland Statistics and Research Agency (HBS), National Institute for Cardiovascular Outcomes Research (NICOR), Northern Ireland Cancer Registry (NICR), Out-of-Hospital Cardiac Arrest Outcomes (OHCAO) Registry, UK Cystic Fibrosis Registry (UKCFR)
Categories describing how RCHD was used to inform or supplement participant data
| Data use category | Description | Example | All trials | Primary | Long-term follow-up | Both |
|---|---|---|---|---|---|---|
| The RCHD alone is used as trial data, and there is no cross-checking or comparison against any other data. | One of the included RCTs was a cluster-randomised trial of GP practices. Outcome data specific to the trial were extracted at the level of the cluster (GP practice) from CPRD. | 52 (57%) | 33 (45%) | 34 (65%) | 15 (42%) | |
| RCHD is used to cross-check against or verify known trial data, namely data already being collected on CRFs as part of the trial (arising from clinical observations and measurements). | In one case, death and cause of death of participants are captured on CRFs at each site. NHS Digital also sends the trial team quarterly reports of all new deaths. The trial team then compares the events and cause of death from both sources. | 27 (30%) | 24 (32%) | 6 (12%) | 3 (8%) | |
| RCHD is used to cross-check against or verify self-reported trial data, namely data already being collected from participants, such as by questionnaire. | In one trial, participants are asked to complete a questionnaire every 3 months which asks whether they had any unexpected stays in hospital. This information is cross-checked against Hospital Episode Statistics data obtained from NHS Digital. | 28 (31%) | 22 (30%) | 11 (21%) | 5 (14%) | |
| RCHD is used to alert or flag trial teams to a potential outcome/event, prompting medical note review to confirm the outcome/event. The specific outcome/events being flagged are not otherwise being captured as trial data. This may be accompanied by clinical end-point review or adjudication of events and outcomes. | In one RCT, participants were flagged in the UK Transplant Registry for notification of transplant rejection and failure. When participants were identified as having a transplant rejection or failure, study staff sought extra information from hospital records. The collated information was redacted and used for central adjudication by trained clinicians. | 22 (24%) | 19 (26%) | 9 (17%) | 6 (17%) | |
| RCHD from one source is used to cross-check against or compared with RCHD from another source. | One of the included RCTs accessed RCHD from NHS Digital, ICNARC and OHCAO, and all had provided the same fields, such as length of stay in intensive care. | 9 (10%) | 7 (9%) | 6 (12%) | 4 (11%) | |
| RCHD is being used for health economic analysis or cost-effectiveness purposes, rather than a clinical outcome. | In one study, Hospital Episode Statistics data were used to calculate the cost of secondary resource use within 90 days of randomisation. | 25 (27%) | 21 (28%) | 12 (23%) | 8 (22%) | |
| RCHD is not used directly for trial purposes but to evaluate the quality of these data compared with trial data or other RCHD, or RCHD is used to generate an algorithm or equation that hopes to predict or replicate the frequency of events/outcomes. | In a breast cancer trial, cancer data from NCRAS were accessed and compared against the trial data to assess the completeness, validity and consistency of the two data sources. | 11 (12%) | 11 (15%) | 1 (2%) | 1 (3%) | |
| – | – | 13 (14%) | 9 (12%) | 9 (17%) | 5 (14%) |
These categories were developed for the purpose of this review and are not mutually exclusive. For example, randomised controlled trials (RCTs) may use routinely collected health data (RCHD) for both cross-checking against existing trial data and against a second source of RCHD. Additionally, RCTs may use RCHD from multiple sources in different ways. Percentages are calculated by using the column header denominators. Abbreviations: CPRD Clinical Practice Research Datalink, CRF case report form, GP general practitioner, ICNARC Intensive Care National Audit & Research Centre, NCRAS National Cancer Registration and Analysis Service, NHS National Health Service, OHCAO Out-of-Hospital Cardiac Arrest Outcomes
Barriers to use of routinely collected health data and potential solutions
| Barrier | Example/explanation | Potential solution |
|---|---|---|
| Lack of comprehensive list of RCHD sources and the data they hold | There is no one point where you can find out about all sources of RCHD. | A searchable database – HDR UK or NHS Digital is responsible for update. |
| Clear terminology to describe data from registries | The source of the trial data is not always mentioned in trial publications. | Publication of consensus terminology and a description of the way in which RCHD can be used. |
| Publications of trials using RCHD did not make mention of it. | In methods section, details of the sources of RCHD should be stated. | Soon-to-be-released EHR CONSORT extension for routinely collected health data should improve this. |
| Lack of awareness by trialists of the availability and utility of RCHD | Shown by a small percentage of trials using RCHD | More publicity on available RCHD and the use of RCHD |
| Poor accessibility of data | Registry name change invalidating consent Long delays of several years in application process RCHD not provided despite agreement | Streamline the RCHD application process and render it efficient. |
| Poor data retention and no possibility of onward sharing | Time limit to keeping the data is shorter than the data storage time limit for the trial. Data sharing is often necessary to achieve funding for a trial. | RCHD needs to align with trial data retention rules and data-sharing requirements. |
| Data quality and timeliness | Trial data are monitored and checked and a lot of registry data are not. | Registry data need a validation process to ensure that their RCHD can be used as a verifiable GCP-compliant data source. A comparison of trial and registry data in several trials, facilitated by a SWAT, is required to educate all about the accuracy and completeness of registry data. |
| Regulator ready RCHD | Trial data require underlying source data whilst registry data are not source data and do not often have checked underlying source data. | Regulators and registries need to agree a solution to underlying source data. |
Abbreviations: EHR CONSORT electronic health record Consolidated Standards of Reporting Trials, GCP Good Clinical Practice, HDR UK Health Data Research UK, NHS National Health Service, RCHD routinely collected health data, RCT randomised controlled trial, SWAT study-within-a-trial