| Literature DB >> 29325575 |
Kimberly A Mc Cord1, Rustam Al-Shahi Salman2, Shaun Treweek3, Heidi Gardner3, Daniel Strech4, William Whiteley2, John P A Ioannidis5,6,7,8,9, Lars G Hemkens10.
Abstract
BACKGROUND: Routinely collected health data (RCD) are increasingly used for randomized controlled trials (RCTs). This can provide three major benefits: increasing value through better feasibility (reducing costs, time, and resources), expanding the research agenda (performing trials for research questions otherwise not amenable to trials), and offering novel design and data collection options (e.g., point-of-care trials and other designs directly embedded in routine care). However, numerous hurdles and barriers must be considered pertaining to regulatory, ethical, and data aspects, as well as the costs of setting up the RCD infrastructure. Methodological considerations may be different from those in traditional RCTs: RCD are often collected by individuals not involved in the study and who are therefore blinded to the allocation of trial participants. Another consideration is that RCD trials may lead to greater misclassification biases or dilution effects, although these may be offset by randomization and larger sample sizes. Finally, valuable insights into external validity may be provided when using RCD because it allows pragmatic trials to be performed.Entities:
Keywords: Clinical epidemiology; Electronic health records; Evidence-based medicine; Registries; Routinely collected health data; Trials
Mesh:
Year: 2018 PMID: 29325575 PMCID: PMC5765645 DOI: 10.1186/s13063-017-2394-5
Source DB: PubMed Journal: Trials ISSN: 1745-6215 Impact factor: 2.279
Common limitations of randomized controlled trials and whether they can be amended by routinely collected health data
| Limitations of RCTs [ | What using RCD for RCTs can offer | Challenges | Potential of RCD to improve RCTs |
|---|---|---|---|
| Generalizability and real-world relevance | No specific data collection processes (follow-up visits, measurements) outside routine care, avoiding artificial situations | Random allocation of interventions may still require some deviation from routine care processes (e.g., obtaining informed consent). | Very high |
| Costs and resources | No costs to the trial for data collection processes and related activities (study site setup, study staff salary, monitoring and auditing activities, training costs) | Potential costs for obtaining the RCD (if the collecting entity does not provide it for free; e.g., data brokers); additional costs for data management, processing, merging, cleaning, and so forth | Very high |
| Specific conditions/subgroup effects | Larger sample sizes that are less influenced by resource constraints and feasibility issues may provide sufficient power for evaluating subgroups. | More opportunities for exploratory analyses with spurious findings | High |
| Late outcomes | RCD can provide long-term outcome data without actively following patients and often reducing the number of patients lost to follow-up | Patients moving away from RCD infrastructure will be lost and may still require active contact, highly dependent on RCD infrastructure | High |
| Speed | No cumbersome outcome ascertainment (follow-up contacts, data recording and collection) and no need for setting up the data collection infrastructure, thus results can be obtained faster | Management, processing, merging, and “cleaning” of large datasets may be time-consuming. Reporting of specific adverse events may be delayed. | High to moderate |
| Conflicts of interest/sponsorship bias | Collection of RCD is more objective and less easily manipulated to obtain a desired result. | Data may still be analyzed and reported nonobjectively to convey preferred conclusions. | Moderate |
| Understudied healthcare questions | Providing information on routine care allows researchers to address understudied healthcare questions because more resources are spared or different outcomes are collected. | Not all desired endpoints might be available; funding may not be the sole barrier | Moderate |
| Regulations | Obtaining approval for intervention imposes several bureaucratic loopholes; RCD are already available and might require different ethical clearance. | RCD still require approval in terms of data protection and confidentiality. | Moderate |
| Rare or uncommon conditions | Recruiting an appropriate sample size may be hard with rare diseases; larger samples with RCD and easier EHR or registry recruitment can reduce these difficulties. | Only possible if RCD resources are extensive, highly dependent on RCD infrastructure | Moderate |
Abbreviations: EHR Electronic health record, RCD Routinely collected health data, RCT Randomized controlled trial
Fig. 1The role of routinely collected health data (RCD) in randomized controlled trials in various phases of a clinical trial (based on the Consolidated Standards of Reporting Trials [CONSORT] flow diagram [11]). (1) During enrollment, RCD sources can be screened retrospectively for eligible patients, but they can also be used prospectively as targeted screening and recruitment tools. (2) Informed consent could be given both for data use and for trial participation, so that when patients decline to participate in the randomized component of the trial, their usual care can still be followed. (3) Allocation can be facilitated by RCD through point-of-care randomization. Patients who are not allocated to an intervention but select care on the basis of personal and clinical preferences can be observed with RCD. (4) During the follow-up phase, RCD allows patients who would otherwise be lost to follow-up to be tracked, and thus less missing data may be encountered. (5) Long-term follow-up, such as in registries, may be possible with RCD even after formal completion of the primary study phase. (6) RCD allows analysis of both nonrandomized and randomized patients and direct supplementation of information to the randomized part of the trial
Barriers in the use of routinely collected health data for randomized controlled trials and options for improvement
| General barriers or issues | Pressing questions | Possible solutions, actions and additional comments |
|---|---|---|
| Data | ||
| ▪ Availability ▪ Management ▪ Linkage ▪ Accuracy ▪ Validity | ▪ Is the desired outcome variable or RCD source available? ▪ Will it be possible to achieve the same data quality and accuracy with RCD as in traditional trials? ▪ Is the data linkage and management feasible in institutions with limited IT infrastructure? | ▪ A central register of databases available for clinical trial research would be helpful, ideally with details about data quality. ▪ Establish core outcomes and structured outcome assessments in routine care ▪ Create RCD trial guidelines and RCD source validation guidelines to help standardize their use and reduce sources of bias or uncertainty ▪ Increase IT presence (particularly data analysts) to health research teams ▪ The more RCD is sought out and used in research, the greater is its availability and differentiation. |
| Regulatory and ethics | ||
| ▪ Collecting and obtaining the data ▪ Using and sharing the data | ▪ What type of release must be given by the patients before their data can be collected or shared? ▪ Is it ethical to use RCD without asking for their permission, even if their data are anonymized? ▪ Can this data be considered of value and morally be sold? ▪ How are concerns about privacy and informed consent approached (particularly in the context of population-wide trials or Zelen designs)? ▪ Are data safety standards applied to RCD just as strictly as they are to traditional actively collected data? ▪ Who is responsible for the safety of the data? | ▪ Ethical guidelines specifically regarding the collection and dissemination of RCD should be developed. ▪ Ethics and approval committees should deepen their knowledge of these novel ethical challenges. ▪ Whereas personal data are collected daily from many sources (e.g., phone use), collection, storage, and dissemination of data related to health require more definite ethical oversight and greater transparency to the general public. ▪ After safety issues are defined, researchers and stakeholders must ensure that data are safely handled, with full transparency of access. |
| Costs | ||
| ▪ Obtaining the data ▪ Managing the data | ▪ Will data collectors (e.g. health insurers) share their data? Freely or at a cost? ▪ Is a constant increase in the generation of routine data really reducing the overall trial costs if the same institution collected the data in the first place? ▪ When is the use of RCD cost-effective? | ▪ The financial worth of health data is not defined or explored; empirical data are necessary to determine the cost of both producing and maintaining health data ▪ Health data are already legally sold to many industries, and regulations/legislation must catch up with this aspect. |
| Novelty | ||
| ▪ Bureaucratic obstacles ▪ Unawareness ▪ Training to generate, collect, prepare, manage and analyze RCD for trials | ▪ Will approval committees understand the implications of using RCD sources for clinical trials? ▪ What are the challenges that can be expected bureaucratically because most submission templates do not assume the use of RCD and absence of patient contact? ▪ Are data anonymization techniques clear? ▪ What training is required to qualify individuals who generate, collect, prepare, and manage RCD for clinical trial research? | ▪ Develop, in collaboration with approval committees, RCD-specific templates and submission forms, especially in such studies where no patient contact is foreseen and therefore speedy approval is desired. ▪ Educate regarding data anonymization and confidentiality risks ▪ Include the concept of using RCD for RCT in clinical research education and teaching ▪ Create and use reporting guidance specifically for RCD-RCTs |
IT Information technology, RCD Routinely collected health data, RCT Randomized controlled trial