Literature DB >> 33421321

Contextualizing single-arm trials with real-world data: An emulated target trial comparing therapies for neovascular age-related macular degeneration.

Darren S Thomas¹, Aaron Y Lee², Philipp L Müller^3,4,5, Roy Schwartz^4,5,6, Abraham Olvera-Barrios^4,5, Alasdair N Warwick^4,7, Praven J Patel⁶, Tjebo F C Heeren^4,5, Catherine Egan^4,5,6, Paul Taylor¹, Adnan Tufail^4,5,6.

Abstract

One-in-four ophthalmology trials are single-armed, which poses challenges to their interpretation. We demonstrate how real-world cohorts used as external/synthetic control arms can contextualize such trials. We herein emulated a target trial on the intention-to-treat efficacy of off-label bevacizumab (q6w) pro re nata relative to fixed-interval aflibercept (q8w) for improving week 54 visual acuity of eyes affected by neovascular age-related macular degeneration. The bevacizumab arm (n = 65) was taken from the ABC randomized controlled trial. A total of 4,471 aflibercept-treated eyes aligning with the ABC trial eligibility were identified from electronic health records and synthetic control arms were created by emulating randomization conditional on age, sex, and baseline visual read via exact matching and propensity score methods. We undertook an inferiority analysis on mean difference at 54 weeks; outcomes regression on achieving a change in visual acuity of greater than or equal to 15, greater than or equal to 10, and less than or equal to -15 Early Treatment Diabetic Retinopathy (ETDRS) letters at week 54; and a time-to-event analysis on achieving a change in visual acuity of greater than or equal to 15, greater than or equal to 10, and less than or equal to -15 ETDRS letters by week 54. The findings suggest off-label bevacizumab to be neither inferior nor superior to licensed aflibercept. Our study highlights how real-world cohorts representing the counterfactual intervention could aid the interpretation of single-armed trials when analyzed in accord to the target trial framework. Study Highlights WHAT IS THE CURRENT KNOWLEDGE ON THE TOPIC? One-in-four randomized controlled trials in ophthalmology are single-armed, which poses challenges for interpreting their efficacy relative to standard of care. Recent conceptual advances in the methods of causal inference and in the emulation of target trials suggests that the standard-of-care arms representing the counterfactual intervention can be approximated with observational data. WHAT QUESTION DID THIS STUDY ADDRESS? How real-world cohorts representing the counterfactual intervention can aid the interpretation of single-armed ophthalmological trials. WHAT DOES THIS STUDY ADD TO OUR KNOWLEDGE? Our study highlights how real-world cohorts representing the counterfactual intervention could aid the interpretation of single-armed ophthalmological trials when undertaken in accord with the target trial framework. HOW MIGHT THIS CHANGE CLINICAL PHARMACOLOGY OR TRANSLATIONAL SCIENCE? External counterfactual arms could reduce the time and cost to reach potential regulatory approval.

Entities: Chemical

Mesh：

Substances：

Year: 2021 PMID： 33421321 PMCID： PMC8212729 DOI： 10.1111/cts.12974

Source DB: PubMed Journal: Clin Transl Sci ISSN： 1752-8054 Impact factor: 4.689

INTRODUCTION

One‐in‐four ophthalmology trials are single‐armed, which poses challenges to their interpretation in the absence of a standard‐of‐care control arm. Nevertheless, recent conceptual advances in the methods of causal inference and in the emulation of target trials , , , suggests that the standard‐of‐care arms representing the counterfactual intervention can be approximated (the external/synthetic control arm henceforth), and that real‐world analogues of trial findings are concordant with the trials they emulate. Central to these successes is the emulated target trial design and the target trial framework. A target trial is the hypothetical trial that would in ideal circumstances be undertaken. The target trial framework outlines a structured approach to the design and analysis of observational research as if they were randomized trials. By conceptualizing observational research as an attempt to emulate the target trial, and making explicit the experimental design and statistical analysis as one would in a trial protocol, the target trial framework can help researchers to identify and avoid biases resulting from an inappropriate selection of patients or misalignment of time zero. An additional consideration for leveraging real‐world cohorts as external controls is that one must also attempt conditional randomization on measured confounding, because the nonrandom treatment assignment in the real‐world introduces confounding biases. This is often achieved either within subsets defined by levels of confounding through matching or within levels of confounding through inverse probability weighting. , , , We herein emulated a target trial on the relative efficacy of intravitreal administration of two inhibitors of VEGF (off‐label bevacizumab vs. licensed aflibercept) for improving the visual acuity of eyes affected by neovascular age‐related macular degeneration (nAMD) after 54 weeks of treatment. Our emulated target trial compared the bevacizumab arm (n = 65) of the ABC trial —a prospective, double‐masked, randomized controlled trial—with a synthetic control arm from 31,151 eyes receiving aflibercept during routine care as recorded by electronic health records (EHRs). The choice of intervention and comparator is based on the lack of randomized controlled trials comparing the two therapies, despite off‐label bevacizumab being the most commonly used drug to treat nAMD in the United States. The size of the ABC trial bevacizumab arm is similar to that used in early phase single‐armed clinical trials. Our report follows the emulated target trial framework.

METHODS

Study design

An emulated target trial was undertaken comparing week 54 outcomes of eyes that, during the ABC trial, received bevacizumab (1.25 mg) pro re nata at 6‐week intervals (q6w) after 3 loading injections (i.e., the bevacizumab trial arm) with an external synthetic control arm of eyes that received aflibercept (2 mg) during routine care on the intention to treat at fixed 8‐week intervals (q8w) reflective of the posology at launch of the therapy (i.e., the aflibercept synthetic arm). The target trial protocol is outlined in Table S1.

Informed consent and ethics

The ABC trial was approved by the ethics committee at each clinical site with all patients giving written informed consent to participate. The Caldicott data protection guardian at each EHR site gave signed permission for the de‐identifiable EHRs to be extracted for analysis.

Subjects

Bevacizumab trial arm

The ABC trial was a prospective, double‐masked, multicenter randomized controlled trial undertaken in the United Kingdom during 2006–2008. The method has previously been reported. Briefly, 131 patients were randomized 1:1 to receive in the study eye: (I) 1.25 mg bevacizumab intravitreally at q6w pro ne rata until week 48 after 3 mandatory loading injections at weeks 0, 6, and 12 (n = 65) or (II) the standard of care at the time (n = 66); the latter arm being irrelevant to this study. Patients could receive 3–9 bevacizumab treatments (0–6 maintenance injections). Administration of injection at each visit was determined pro ne rata based on the presence of subretinal fluid, development of hemorrhage, or classic choroidal neovascularization, loss of greater than five Early Treatment Diabetic Retinopathy Study (ETDRS) letters in association with new intraretinal fluid or persistent intraretinal fluid. Patients underwent assessments for visual acuity at each visit before treatment was administered, with a final assessment at week 54. Best‐corrected visual acuity measurements were assessed using ETDRS reading charts.

Aflibercept synthetic arm

Synthetic control arms were sampled from EHRs recording ophthalmologic care across 27 sites in England during 2012–2018 (Medisoft Ophthalmology platform (http://www.medisoft.co.uk), Leeds, UK). Medisoft EHRs hold anonymized longitudinal data on all episodes of ophthalmologic care from treatments and surgeries through to visual acuity measurements, diagnostics, and complications. All visual acuity measurements were the best‐recorded unaided and were measured via, or converted to, ETDRS letters. Qualitative assessments of counting fingers, hand motion, light perception, and no light perception were converted to ETDRS letter scores of 2, 0, 0, and 0, respectively. Study eyes were the first diagnosed (or randomly sampled for bilateral diagnosis on the same day) that initiated treatment with aflibercept (2 mg q8w fixed‐dose after 3 loading injections at ~ 4 weekly intervals within 70 days) for nAMD (for EHR phenotype, see Table S2). The pool of potential synthetic controls were eyes that, as far as we could judge, would have been eligible for the ABC trial using the eligibility criteria in Table S1 (for criteria not recorded by EHRs, see Table S3). In addition to the ABC eligibility criteria, we also excluded all eyes having an incomplete loading phase; all those that had unrecorded sex, age, or baseline visual acuity required for matching or propensity scoring; and all eyes that initiated treatment after 1 December 2017 (allowing for 54 weeks follow‐up).

Emulating randomization

Three methods (exact matching [EM], inverse probability treatment weighting [IPTW], and propensity score matching [PSM]) for achieving conditional exchangeability were compared with an unconditional (UC) design wherein all eyes eligible were analyzed post protocol alignment but without addressing exchangeability. Hypothesized confounders were age, sex, and visual acuity at baseline read. , The study workflow is presented in Figure 1.

Figure 1

An emulated target trial estimating the causal effect of bevacizumab relative to aflibercept on the week 54 visual acuity of eyes affected by neovascular age‐related macular degeneration. Age, sex, and ETDRS at baseline read (L) are common causes of both treatment assignment (A) and outcome (Y), whereas the number of maintenance injections received during the study period (M) mediates between A and Y. Thus, blocking the backdoor criterion L–Y via emulated randomization (denoted by □), which is assumed to naturally arise in the bevacizumab trial arm through randomization, permits unbiased estimates of the causal pathway A–Y. The backdoor criterion remains open under UC. EHRs, electronic health records; EM, exact matching; ETDRS, Early Treatment Diabetic Retinopathy; IPTW, inverse probability of treatment weighting; PSM, propensity score matching; SOC, standard of care; UC, unconditional

Unconditional analysis

The UC analysis included all potential synthetic controls that aligned with the ABC and emulated target trial‐specific criteria analyzed without emulating randomization.

Exact matching

The aflibercept arm was exactly matched 1:1 with the bevacizumab trial arm on baseline confounders (age, sex, and visual acuity). Those with greater than one exact matches were paired with one randomly selected. Those with no exact matches were excluded from the analyses.

Inverse propensity score weighting and propensity score matching

The propensity score (Pr[A|L]) describes the probability of being in the ABC bevacizumab trial arm conditional on confounding variables (age, sex, and visual acuity). The score was derived from a binomial logistic model trained on all 131 ABC trial subjects (65 and 66 randomized to receive bevacizumab and standard of care, respectively (Equation 1)). The synthetic distribution tails not overlapping with the propensity scores of the bevacizumab arm were trimmed prior to weighting and matching. The IPTW analysis included all 65 bevacizumab‐treated eyes and all aflibercept‐treated eyes that fulfilled the ABC trial and target trial eligibility criteria. Outcomes for each eye were thereafter weighted on the conditional inverse probability of treatment (1/Pr[A|L] for the bevacizumab arm and 1/(1 – Pr[A|L]) for the aflibercept arm). For PSM, synthetic controls were matched 1:1 to each bevacizumab eye (n = 65) within a caliper of 0.1 standard deviation. Equation 1 = Propensity score trained on ABC trial data.

Statistical analyses

We estimated the intention‐to‐treat effects under each method for emulating randomization, including all eyes that started treatment with bevacizumab or aflibercept regardless of the drug load received, treatment crossover, or number of post‐baseline measurements during the study period. Time zero (baseline henceforth) for all analyses was synchronized to the date of first treatment (also the day of randomization for the bevacizumab arm). The date of time zero was also the date that we established patient eligibility. Baseline reads were taken on the date of first treatment, or the nearest up to 30 days prior for external controls (but rarely is visual acuity not measured immediately prior to treatment). Study exits were dependent on the analysis, as detailed later. Exchangeability in confounding variables was assessed pre‐emulating and post‐emulating randomization via standardized differences calculated as reported. A standardized difference less than 0.1 was considered evidence for conditional exchangeability. Statistical inferences were made at an α threshold of less than 0.05.

Noninferiority

An ordinary least squares regression was modeled on the change in visual acuity from baseline to week 54 measurement, stratified by treatment (aflibercept arm as reference). Noninferiority was declared if the lower confidence interval (CI) bound was greater than or equal to a noninferiority margin of minus four ETDRS letters. This margin of noninferiority mirrors that of the HAWK and HARRIER randomized trials recently undertaken, which we took as the current consensus. If no measurement was taken during week 54 owing to the inexactness of routine healthcare attendances, we used the measurement taken nearest to week 54 during weeks 50–58, taking the earliest date if 2 dates were equally tied. Missing data were imputed using the last observation carried forward.

Binomial regression

A binomial logistic regression, stratified by treatment, was modeled on the outcomes of achieving a change in visual acuity from baseline to week 54 measurement of greater than or equal to 15 letters, greater than or equal to 10 letters, and less than or equal to −15 letters. We present the odds ratios (ORs) and 95% CIs relative to the aflibercept synthetic arm as reference. If no measurement was taken during week 54, we used the measurement taken nearest to week 54 during weeks 50–58, taking the earliest date if 2 dates were equally tied. Missing data were imputed using the last observation carried forward.

Time‐to‐event

Kaplan‐Meier estimators were modeled on the time from baseline to achieve a change in visual acuity of greater than or equal to 15 letters, greater than or equal to 10 letters, and less than or equal to −15 letters, stratified by treatment (aflibercept arm as reference). Right censorship was on the last visual acuity measurement taken during the study period, up to 54 weeks maximum. Inferences were made on the outputs of the log rank test comparing event distributions.

Software

EHRs were extracted by Medisoft (Leeds, UK) and stored as relational tables on a MySQL server (version 5.5.62). All analyses were undertaken with R version 4.0.2 with particular use of tidyverse version 1.3.0, survival version 3.2–3, and fuzzyjoin version 0.1.6. All code and an exhaustive listing of all R packages used are available at https://github.com/dsthom/synthetic_trial_amd.

RESULTS

A pool of 31,151 eyes that started aflibercept treatment for nAMD was identified. Of these, 26,680 (86% of 31,151) were ineligible, for either not fulfilling the ABC (n = 20,823, 67%) or the emulated trial criteria (n = 5,857, 19%). Thus, a pool of 4,471 (14%) eyes were available for study. All baseline reads for these 4,471 eligible were taken on the same day as treatment. Of these, 186 eyes were trimmed from the IPTW and PSM analyses for having nonoverlapping propensity scores for a pool of 4,141 (13%) eyes. A stepwise breakdown of ineligibility is presented in Table S4. Eyes included in each bevacizumab trial and aflibercept synthetic arms were respectively, 65 and 4,471 for UC; 131 and 8,506 for IPTW (pseudo‐population); 43 and 43 for EM; and 65 and 65 for PSM. The aim of emulating randomization was to achieve conditional exchangeability in measured confounding (summarized in Table 1). Prior to aligning protocols, arms were imbalanced on age and baseline ETDRS, and after aligning protocols (UC) on age only. IPTW and EM were balanced for all measured confounding, whereas imbalances remained post‐PSM for age and the process in itself seemed to imbalance baseline ETDRS. The causal assumption of positivity was satisfied for all levels of confounding.

Table 1

Baseline traits of emulated target trial arms

Variable	Aflibercept synthetic arm	Bevacizumab trial arm	Standardized difference
Pre‐aligning of trial protocols (n = 25,199)
Arm size	25,134 ^a	65	—
Sex (female)	15,860 (63%)	39 (60%)	0.06
Age in years	x̄ 80 (σx 8)	x̄ 79 (σx 8)	0.14
Baseline ETDRS	x̄ 56 (σx 17)	x̄ 51 (σx12)	0.34
Propensity score	x̄ 0.48 (σx 0.12)	x̄ 0.52 (σx 0.10)	0.34
Unconditional (n = 4536)
Sample n	4,471	65	—
Sex (female)	2,817 (63%)	39 (60%)	0.06
Age in years	x̄ 80 (σx 8)	x̄ 79 (σx 8)	0.14
Baseline ETDRS	x̄ 52 (σx13)	x̄ 51 (σx 12)	0.09
Propensity score	x̄ 0.50 (σx 0.11)	x̄ 0.52 (σx 0.10)	0.17
Inverse probability of treatment weighting (n = 8637 ^b )
Sample n	8,506 ^b	131 ^b	—
Sex (female)	5,372 (63%)	81 (62%)	0.03
Age in years	x̄ 80 (σx 7)	x̄ 80 (σx 7)	0.06
Baseline ETDRS	x̄ 52(σx 13)	x̄ 52(σx 12)	0.03
Propensity score	x̄ 0.50 (σx 0.10)	x̄ 0.49 (σx 0.10)	0.09
Exact matching (n = 86)
Sample n	43	43	—
Sex (female)	29 (67%)	29 (67%)	0
Age in years	x̄ 80 (σx 6)	x̄ 80 (σx 6)	0
Baseline ETDRS	x̄ 51 (σx 11)	x̄ 51 (σx 11)	0
Propensity score	x̄ 0.50 (σx 0.09)	x̄ 0.50 (σx 0.09)	0
Propensity Score Matching (n = 130)
Sample n	65	65	—
Sex (female)	40 (62%)	39 (60%)	0.03
Age in years	x̄ 78 (σx 8)	x̄ 79 (σx 8)	0.12
Baseline ETDRS	x̄ 53 (σx 12)	x̄ 51(σx 12)	0.20
Propensity score	x̄ 0.52 (σx 0.10)	x̄ 0.52 (σx 0.10)	0

Abbreviation: ETDRS, Early Treatment Diabetic Retinopathy.

There were 31,151 minus eyes with unrecorded sex, age, or baseline ETDRS.

Pseudo‐population created by weighting.

Baseline traits of emulated target trial arms Abbreviation: ETDRS, Early Treatment Diabetic Retinopathy. There were 31,151 minus eyes with unrecorded sex, age, or baseline ETDRS. Pseudo‐population created by weighting.

Noninferiority

Approval of novel nAMD therapies is often based on proof of their noninferiority to currently approved drugs. Our results generally suggest, with exception of PSM, that bevacizumab is not inferior to aflibercept assessed on a noninferiority margin of four ETDRS letters (Figure 2). There was, however, insufficient evidence to conclude noninferiority under PSM (−0.1 mean ETDRS difference relative to aflibercept). Confidence bounds for matching methods were notably wider than those for UC and IPTW.

Figure 2

Mean ETDRS difference in change in visual acuity from baseline to week 54 measurement for bevacizumab‐treated eyes relative to those treated with aflibercept. Dashed vertical line denotes a noninferiority margin of minus four ETDRS letters. EM, exact matching; ETDRS, Early Treatment Diabetic Retinopathy; IPTW, inverse probability of treatment weighting; PSM, propensity score matching; UC, unconditional

Superiority

We mirrored the primary analysis of the ABC trial in which the proportion of eyes in each treatment arm that gained greater than or equal to 15 ETDRS letters at week 54 were compared (Figure 3). Secondary analysis on the proportion of eyes having a change in visual acuity from baseline to week 54 measurement of greater than or equal to 10 letters and less than or equal to –15 letters are also presented. Under no trials tested was bevacizumab shown to be superior to aflibercept.

Figure 3

Odds of bevacizumab‐treated eyes achieving a priori outcomes at week 54 relative to those treated with aflibercept. EM, exact matching; IPTW, inverse probability of treatment weighting; PSM, propensity score matching; UC, unconditional

Time‐to‐event

It may be that the drugs themselves, or the protocols under which they were administered, are beneficial at varying rates during the study period. We explored this possibility with the Kaplan‐Meier estimator modeled on the time to a change in visual acuity from baseline of greater than or equal to 15, greater than or equal to 10 and less than or equal to −15 ETDRS letters (Figure 4). The log rank statistic suggests there being no difference in the event distribution of eyes receiving bevacizumab relative to aflibercept. The Kaplan‐Meier curves, too, suggest disproportional hazards over time, although this may be an artifact of the fixed‐interval measurements of the trial arm.

Figure 4

Hazards of bevacizumab‐treated eyes achieving a priori outcomes during the first 54 weeks of treatment relative to those treated with aflibercept. Hued ribbons represent 95% confidence intervals. EM, exact matching; IPTW, inverse probability of treatment weighting; PSM, propensity score matching; UC, unconditional

DISCUSSION

Principal findings

One‐in‐four ophthalmology trials are single‐armed, which poses challenges for their interpretation. Our study highlights how real‐world cohorts representing the counterfactual intervention could aid the interpretation of these single‐armed trials when analyzed in accord to the target trial framework. It also suggests that week 54 visual outcomes of eyes treated with off‐label bevacizumab pro re nata (q6w) are neither inferior nor superior to licensed aflibercept administered at fixed intervals (q8w).

In context with the evidence base

The results suggest off‐label bevacizumab administered pro re nata (q6w) to be neither inferior nor superior to licensed aflibercept administered at fixed intervals (q8w) for improving the week 54 vision of eyes affected by nAMD. Although we have no trial reference to validate the present findings with, we can attempt to contextualize ours in relation to the literature. At writing, two randomized trials have reported on the efficacy of inhibitors of VEGF for treating nAMD; each having a common comparator arm, ranibizumab, from which we can triangulate. The CATT trial demonstrated bevacizumab to be noninferior to ranibizumab. The VIEW trial reported aflibercept to be noninferior to ranibizumab. These findings taken together suggests there being no biological reason to suspect bevacizumab to be inferior to aflibercept, which is consistent with the results of this emulated trial. Nevertheless, it should be acknowledged that this finding remains a hypothesis that should be consolidated with randomized interventions. Second, the methods of causal inference used herein have been validated by a number of studies that assessed the extent to which a target trial can be emulated using real‐world data. In a study on statins and cancer, the authors revisited the past failures of observational analogues to arrive at the findings of clinical trials. It was argued that this failure is not primarily due to confounding biases introduced by nonrandom treatment assignment, as is often thought, but also to biases introduced by subjects and analyses deviating from an explicit target trial protocol; advice that later evolved into the target trial framework that we followed herein. Under the target trial approach, observational estimates were concordant with those of the trial that was emulated. In another study most applicable to ours, the estimates obtained by emulated target trials composed of one trial arm and an external control arm derived from real‐world data were compared with that of a reference two‐armed trial. The paper reports, through alignment of protocols and propensity‐based adjustment, concordance in 10 out of the 11 trials emulated. It is on these validated methods that our findings are dependent on. A criticism of PSM is that its use may inadvertently exacerbate exchangeability, and we question whether there is a link between this possibility and the discordance in the PSM noninferiority estimate relative to other methods. Given the evidence base is that visual acuity on starting treatment is the most important determinant of visual improvement, , , it may therefore be plausible that the imbalance of the PSM aflibercept arm toward greater baseline acuity—despite exchangeability in the dimension of propensity score—nullified the relative efficacy of bevacizumab. Notably, of all methods for emulating randomization, PSM was the only to lead to inexchangeability in baseline read, and more curiously was that this was being exchangeable prior to emulating randomization. Another notable limitation of matching in general is the reduced sample size leading to widened confidence bounds and potential biases from discarding unmatched observations. IPTW, in contrast, amplifies the weighting of these rare events without discarding data.

Strengths and limitations

The target trial framework provides a structured approach to improve the rigor of observational research, by applying the best practices for the design and analysis of randomized trials. Adhering to the framework can help researchers identify and avoid biases in an inappropriate selection or the accrual of immortal time. A major strength of our research is, because we adhered to the framework, the risk of these biases is low to nonexistent. For the vast majority of eligibility criteria, we were also able to align the synthetic arms with the ABC trial protocol so that the trial and real‐world arms were comparable. For those criteria not explicitly recorded by Medisoft EHRs (Table S3), some of these would have been excluded through other criteria (exclusion of eyes ever treated for cataracts or myopia, for example); are implied through treatment not to be present (inadequate contraception in premenopausal women, no contraindications or active infection, or recent stroke or cardiac event); or do not lie on a backdoor path between treatment and outcome (concurrent warfarin use and fluorescein allergy). Any potential risk for bias would be through an inability to exclude aphakic eyes that have poorer prognoses, but given their rarity we believe that the risk of bias to be low. Naturally, we also acknowledge several limitations. The first being the potential for residual confounding. Where trials can assume exchangeability across all confounders—measured and unmeasured—due to random treatment assignment, emulated target trials in contrast are limited to exchangeability only in measured variables. Second, the nonoverlap in treatment periods between the bevacizumab (2006–2008) and aflibercept (2012–2018) arms and differences between trial and real‐world cohorts could bias our estimates. As shown in Table 1, the alignment of the real‐world cohorts to the ABC trial population (mean baseline read 51) lowers the mean from 56 ETDRS in those initiating treatment in the real‐world to 51–53 for the external cohorts. Because eyes with starting vision approaching the ceiling stand to gain less improvement, one would expect the pragmatic treatment effects for a contemporaneous sample to shrink toward the null relative to those reported here.

Implications for stakeholders

It is evident through the founding of the FDA Real‐World Evidence Program that regulatory decisions may increasingly rely on real‐world evidence. Indeed, the use of external counterfactual arms has already expedited the regulatory approval of blinatumomab (Amgen) and alectinib (Roche) for oncologic indications. Nevertheless, barring a small evidence base, the extent to which randomized trials can be emulated by real‐world data is largely unknown. In consequence, the US Food and Drug Administration (FDA) has funded the RCT DUPLICATE initiative, which aims to determine the validity of using real‐world evidence for regulatory decisions. More immediately, our study has implications for the economics of treating nAMD, for bevacizumab is somewhat more cost‐effective than licensed therapies. For context, the treatment of retinal disorders with inhibitors of VEGF accounts for 12% of all Medicare expenditure, and a 2018 survey by the American Society of Retina Specialists reports aflibercept being the first‐line therapy for 13%, 16%, 37%, 41%, and 47% of ophthalmologists surveyed across Africa, the United States, Europe, Asia, and Central and South America.

Unanswered questions and future research

With no existing trials comparing the efficacy of bevacizumab to aflibercept and with bevacizumab not being prescribed per ABC protocol during routine health care, it was not possible for us to triangulate our findings against a reference standard. We nevertheless acknowledge the importance of validating our findings, and aim to use the Medisoft EHRs to triangulate our methods relative to existing clinical trials. Furthermore, the estimation of per protocol effects, either by naive definition or by weighting on the conditional probability of treatment adherence, would be an important contribution to the evidence base. This would require the delineation of nonadherence from pro re nata.

CONCLUSIONS

Our study highlights how real‐world cohorts representing the counterfactual intervention could aid the interpretation of single‐armed ophthalmological trials when undertaken in accord with the target trial framework.

CONFLICT OF INTEREST

Aaron Y. Lee is an employee of the FDA, has received research funding from Novartis, Santen, Carl Zeiss, and Meditec, and has acted as a consultant for Genentech, Topcon, and Verana Health. Praven J. Patel has received research funding from Bayer UK, and has acted as a consultant for Bayer UK, Genentech, Novartis UK, and Roche. Roy Schwartz has received travel expenses from Allergan (AbbVie) and is a board member of Eye‐Wise diagnostics. Paul Taylor has received a grant from Novartis Pharmaceuticals. Adnan Tufail, Praven J. Patel, and Catherine Egan received a proportion of their funding from the Department of Health’s NIHR Biomedical Research Centre for Ophthalmology at Moorfields Eye Hospital and UCL Institute of Ophthalmology. The views expressed in this publication are those of the authors and not necessarily those of the Department of Health. All other authors declared that there is no conflict of interest that could be perceived as prejudicing the impartiality of the research reported.

DISCLAIMER

The opinions in this manuscript are those of the authors and should not be interpreted as the position of the US Food and Drug Administration.

AUTHOR CONTRIBUTIONS

All authors wrote the manuscript. A.Y.L., A.T., D.S.T., and P.T. designed the research. D.S.T. performed the research. All authors analyzed the data. Table S1 Click here for additional data file. Table S2 Click here for additional data file. Table S3 Click here for additional data file. Table S4 Click here for additional data file.

23 in total

1. Bevacizumab for neovascular age related macular degeneration (ABC Trial): multicentre randomised double masked study.

Authors: Adnan Tufail; Praveen J Patel; Catherine Egan; Philip Hykin; Lyndon da Cruz; Zdenek Gregor; Jonathan Dowler; Mohammed A Majid; Clare Bailey; Quresh Mohamed; Robert Johnston; Catey Bunce; Wen Xing
Journal: BMJ Date: 2010-06-09

2. Using Big Data to Emulate a Target Trial When a Randomized Trial Is Not Available.

Authors: Miguel A Hernán; James M Robins
Journal: Am J Epidemiol Date: 2016-03-18 Impact factor: 4.897

3. Characteristics of Ophthalmology Trials Registered in ClinicalTrials.gov, 2007-2018.

Authors: Brandon Turner; Nitya Rajeshuni; Elaine M Tran; Cassie A Ludwig; Zujaja Tauqeer; Brannon Weeks; Benyam Kinde; Suzann Pershing
Journal: Am J Ophthalmol Date: 2019-11-13 Impact factor: 5.258

4. Avoidable flaws in observational analyses: an application to statins and cancer.

Authors: Barbra A Dickerman; Xabier García-Albéniz; Roger W Logan; Spiros Denaxas; Miguel A Hernán
Journal: Nat Med Date: 2019-10-07 Impact factor: 53.440

5. Ranibizumab and bevacizumab for neovascular age-related macular degeneration.

Authors: Daniel F Martin; Maureen G Maguire; Gui-shuang Ying; Juan E Grunwald; Stuart L Fine; Glenn J Jaffe
Journal: N Engl J Med Date: 2011-04-28 Impact factor: 91.245

6. Adherence adjustment in the Coronary Drug Project: A call for better per-protocol effect estimates in randomized trials.

Authors: Eleanor J Murray; Miguel A Hernán
Journal: Clin Trials Date: 2016-03-07 Impact factor: 2.486

7. The neovascular age-related macular degeneration database: multicenter study of 92 976 ranibizumab injections: report 1: visual acuity.

Authors:
Journal: Ophthalmology Date: 2014-01-23 Impact factor: 12.079

8. Comparative Efficacy of Ceritinib and Crizotinib as Initial ALK-Targeted Therapies in Previously Treated Advanced NSCLC: An Adjusted Comparison with External Controls.

Authors: Daniel Shao-Weng Tan; António Araújo; Jie Zhang; James Signorovitch; Zheng-Yi Zhou; Xiaopeng Cai; Geoffrey Liu
Journal: J Thorac Oncol Date: 2016-06-08 Impact factor: 15.609

9. Intravitreal aflibercept (VEGF trap-eye) in wet age-related macular degeneration.

Authors: Jeffrey S Heier; David M Brown; Victor Chong; Jean-Francois Korobelnik; Peter K Kaiser; Quan Dong Nguyen; Bernd Kirchhof; Allen Ho; Yuichiro Ogura; George D Yancopoulos; Neil Stahl; Robert Vitti; Alyson J Berliner; Yuhwen Soo; Majid Anderesi; Georg Groetzbach; Bernd Sommerauer; Rupert Sandbrink; Christian Simader; Ursula Schmidt-Erfurth
Journal: Ophthalmology Date: 2012-10-17 Impact factor: 12.079

10. Using Electronic Health Records to Derive Control Arms for Early Phase Single-Arm Lung Cancer Trials: Proof-of-Concept in Randomized Controlled Trials.

Authors: Gillis Carrigan; Samuel Whipple; William B Capra; Michael D Taylor; Jeffrey S Brown; Michael Lu; Brandon Arnieri; Ryan Copping; Kenneth J Rothman
Journal: Clin Pharmacol Ther Date: 2019-10-11 Impact factor: 6.875