| Literature DB >> 30419965 |
Marian Mitroiu1, Katrien Oude Rengerink1, Caridad Pontes2,3, Aranzazu Sancho4,5, Roser Vives4,6, Stella Pesiou4, Juan Manuel Fontanet7, Ferran Torres8,9, Stavros Nikolakopoulos1, Konstantinos Pateras1, Gerd Rosenkranz10, Martin Posch10, Susanne Urach10, Robin Ristl10, Armin Koch11, Spineli Loukia11, Johanna H van der Lee12, Kit C B Roes1.
Abstract
BACKGROUND: The ASTERIX project developed a number of novel methods suited to study small populations. The objective of this exercise was to evaluate the applicability and added value of novel methods to improve drug development in small populations, using real world drug development programmes as reported in European Public Assessment Reports.Entities:
Keywords: Clinical trials; Orphan; Rare condition; Small population; Statistical methods
Mesh:
Year: 2018 PMID: 30419965 PMCID: PMC6233569 DOI: 10.1186/s13023-018-0925-0
Source DB: PubMed Journal: Orphanet J Rare Dis ISSN: 1750-1172 Impact factor: 4.123
Overview of the methods that were evaluated
| Description of the method | Requirements for use of the method | Potential advantages | Potential disadvantages |
|---|---|---|---|
| compared to developmental plans that supported approval | |||
| LEVEL OF EVIDENCE | |||
| Extrapolation [ | |||
| In small populations, a full independent drug development program to demonstrate efficacy may not be ethical, feasible or necessary. Extrapolations of evidence from a larger population to the smaller target population is widely used to support decisions in this situation.For the justification of requirements specified in EMA Paediatric Investigation Plans, this paper discusses how to specify the clinical trial design in the target population, when the data from the source population at the time of planning is not available but development in the target population will only start, after a treatment effect in the source population has been demonstrated. A framework based on prior beliefs is formulated to investigate whether the significance level for the test of the primary endpoint in confirmatory trials can be relaxed, and the sample size reduced, while controlling a certain level of certainty about the effects. The procedure is based on a so called skepticism factor, that quantifies the belief that a treatment effect observed in the larger population can be extrapolated to the target population. | Factors that influence the possibility for extrapolation: | ▪ Optimised use of available evidence for the entire development programme | ▪ Difficulty lies in its novelty and application |
| META-ANALYSIS | |||
| Prior distributions for variance parameters in sparse-event meta-analysis (Pateras K, personnal communication) | |||
| The small sample sizes in rare diseases make it particularly valuable to pool the data of small studies in a meta-analysis. When the primary outcome is binary, small sample sizes increase the chance of observing zero events. | ▪ > = 2 RCTs | ▪ Optimised use and variance estimation in a sparse-event meta-analysis | ▪ Use of informative priors (even for heterogeneity) may be controversial. |
| Heterogeneity estimators in zero cells meta-analysis [ | |||
| When a meta-analysis consists of a few small trials that report zero events, accounting for heterogeneity in the estimation of the overall effect is challenging. In practice, the data poses restrictions on the meta-analysis method employed that lead to deviations from the pre-planned analysis, such as the presence of zero events in at least one study arm. | ▪ Quicker and optimal selection of heterogeneity estimator in a sparse-event meta-analysis | ▪ Niche method and does not cover all heterogeneity estimators | |
| INNOVATIVE TRIAL DESIGNS | |||
| Critical appraisal of delayed-start design proposed as alternative to randomized controlled design in the field of rare diseases [ | |||
| In a delayed start randomization design, patients are randomised at baseline to receive either the intervention (early-start group) or placebo (delayed-start group) and after a certain period of time, the latter switch to the intervention until trial completion, therefore, reducing the time in placebo. Data collected at the end of placebo-phase allow for causal inferences, whereas the data collected at trial completion allow for investigation of disease-modifying effects. | ▪ The comparator needs to be placebo | ▪ All patients eventually receive treatment | ▪ Delay in some patients receiving treatment, compared to a single arm trial (but not different from parallel control arm) |
| Sample size reassessment and hypothesis testing in adaptive survival trials [ | |||
| This design allows a sample size reassessment during a trial where the primary outcome is the time to the occurrence of an event. The sample size reassessment is performed in an interim analysis and may be based on unblinded interim data, including secondary endpoints. | ▪ In case the sample size re-assessment is unmasked | ▪ Increased precision for sample size reassessment | ▪ Logistically resource-wise more demanding |
| Multi-arm group sequential designs with a simultaneous stopping rule [ | |||
| A design with 3 arms or more, with planned interim analyses with a simultaneous stopping rule using predefined boundaries. This rule aims to detect at least one efficacious treatment out of all tested arms. The trial may stop for one or more arms because of futility, or for all arms when efficacy is proven for at least one of them. | ▪ At least 3 arms including control (placebo) | ▪ More patients are randomized to a treatment arm due to the common control arm | ▪ Not applicable to historically/externally controlled studies |
| Sequential design for small samples starting from a maximum sample size [ | |||
| Using a group sequential design, an analysis will be performed before the trial is finished, based on the available data collected at that (pre-defined) moment. The aim of this design is to pick up large benefits or lack of benefit signals earlier. | ▪ Needs to start from maximum sample size that can be recruited | ▪ Increased precision when using prior knowledge (from historical data or previous trials) to estimate treatment effect size, and thereby increased precision for the adjustment of boundaries | ▪ More interim analyses will provide extra work |
| Bayesian sample size re-estimation using power priors [ | |||
| Bayesian statistics, use probability distributions, often including a probability of the belief in the intervention before the start of the trial (the prior). For normally distributed outcomes, an assumption for the variance needs to be made to inform the sample size needed, which is usually based on limited prior information, especially in small populations. When using a Bayesian approach, the aggregation of prior information on the variance with newly collected data is more formalized. The uncertainty surrounding prior estimates can be modelled with prior distributions. The authors adapt the previously suggested methodology to facilitate sample size re-estimation. In, addition, they suggest the employment of power priors in order for operational characteristics to be controlled. | ▪ At least 1 interim analysis | ▪ More efficient use of available patients for the development programme (i.e. smaller sample size) | ▪ Extra patients needed in case of effect size overestimation |
| Dynamic borrowing using power priors that control type I error [ | |||
| In rare diseases, where available data is scarce and heterogeneity between trials is less well understood, the current methods of meta-analysis fall short. The concept of power priors can be useful, particularly for borrowing evidence from a single historical study. Such power priors are expressed as a parameter, which in most situations has a direct translation as a fraction of the sample size of the historical study that is included in the analysis of the new study. However, the possibility of borrowing data from a historical trial will usually be associated with an inflation of the type I error. Therefore in this paper a new, simple method of estimating the power parameter in the power prior formulation is suggested, suitable when only one historical dataset is available. | ▪ Essential to have robust data from ideally previous similar studies | ▪ More efficient use of available patients for the development programme (i.e. smaller sample size) | ▪ Extra patients needed in case of effect size overestimation |
| STUDY ENDPOINTS AND STATISTICAL ANALYSIS | |||
| Fallback tests for co-primary endpoints [ | |||
| Usually, when the efficacy of an intervention is measured by co-primary endpoints, efficacy may be claimed only if for each endpoint an individual statistical test is significant. While this strategy controls the type I error, it is often very conservative, and does not allow for inference if only one of the co-primary endpoints shows significance.This paper describes the use of fall-back tests. They reject the null hypothesis in exactly the same way as the classical tests, with the advantage that they allow for inference in settings where only some of the co-primary endpoints show a significant effect. Similarly to the fall-back tests defined for hierarchical testing procedures, these fall-back tests for co-primary endpoints allow to continue testing, even the primary objective of the trial was not met. | ▪ At least 2 co-primary endpoints | ▪ No need for hierarchical pre-specification and testing of multiple co-primary endpoints | ▪ Potentially more patients needed |
| Optimal exact tests for multiple binary endpoints [ | |||
| In confirmatory trials with small sample sizes, hypothesis tests developed for large samples - based on asymptotic distributions - are often not valid. Exact non-parametric procedures are applied instead. However, exact non-parametric procedures are based on discrete test statistics and can become very conservative. With standard adjustments for multiple testing, they become even more conservative. | ▪ Multiple dichotomous/binary outcomes | ▪ Optimised multiple testing procedure for dichotomous endpoints | ▪ Potentially more patients needed |
| Simultaneous inference for multiple marginal GEE models [ | |||
| A framework is proposed for using generalized estimating equation models for each endpoint marginally considering dependencies within the same subject. The asymptotic joint normality of the stacked vector of marginal estimating equations is used to derive Wald-type simultaneous confidence intervals and hypothesis tests for linear contrasts of regression coefficients of the multiple marginal models. | ▪ Repeated measurements | ▪ Robust evidence from longitudinal data | ▪ Technically more complex |
| Goal Attainment Scaling [ | |||
| Goal Attainment Scaling is a measurement instrument that measures the attainment of different goals of patients in a standardized way. The goals are measured in the same way for every patient, but the content of the goals can be different between patients. To apply goal attainment scaling, the caregiver and the patient sit together to decide what the goals of the patient are, and how they can be defined in five levels. Next, the patient receives the intervention (preferably blinded). Then after the intervention the patient and doctor assess how well the goals have been attained. | ▪ Essential that there is no primary endpoint that is relevant for all patients | ▪ The goals are individually defined in consultation with patients and chosen per patient, hence customised measurement of therapeutic effect | ▪ Time-consuming to set (multiple) goals individually per patient |
EPARs included in the evaluation
| Cluster | Drug | Date opinion | Rare or ultra-rare* | Repurposed/ new drug? |
|---|---|---|---|---|
| Acute: single episodes | ||||
| Antracycline extravasation | Savene | 2006 | Ultra-rare | New drug |
| Patent ductus arteriosus | Pedea | 2009 | Rare | Repurposed |
| Hepatic venooclusive disease | Defitelio | 2013 | Rare | New drug |
| Tuberculosis | Sirturo | 2014 | Rare | New drug |
| Acute: recurrent episodes | ||||
| Cryopirine periodic syndromes | Ilaris | 2009 | Ultra-rare | New drug |
| Gram negative lung infection in cystic fibrosis | Cayston | 2009 | Rare | Repurposed |
| Narcolepsy | Xyrem | 2007 | Rare | New drug |
| Dravet syndrome | Diacomit | 2009 | Rare | New drug |
| Sickle cell disease | Sicklos | 2007 | Rare | New drug |
| Systemic sclerosis | Tracleer | 2009 | Rare | New drug |
| Chronic: stable/slow progression | ||||
| Short bowel syndrome | Revestive | 2012 | Rare | New drug |
| Adrenal insufficiency | Plenadren | 2011 | Rare | Repurposed |
| Thrombocytemia | Xagrid | 2009 | Rare | New drug |
| Deficit of lipoprotein lipase | Glybera | 2012 | Ultra-rare | New drug |
| Chronic: progressive, one system/organ | ||||
| Nocturnal Paroxysmal haemoglobinuria | Soliris | 2009 | Rare | New drug |
| Wilson’s disease | Wilzin | 2006 | Rare | New drug |
| Congenital errors of bile synthesis | Orphacol | 2013 | Ultra-rare | Repurposed |
| Gastrointestinal stromal tumours | Glivec | 2009 | Ultra-rare | Repurposed |
| Chronic: progressive, multiple systems/organs | ||||
| Fabry disease | Fabrazyme | 2008 | Ultra-rare | New drug |
| Cystic fibrosis | Kalydeco | 2013 | Rare | New drug |
| Familial amyloid polyneuropathy | Vyndaqel | 2011 | Rare | New drug |
| Gaucher disease | Zavesca | 2009 | Rare | New drug |
| Chronic: staged condition | ||||
| Renal carcinoma | Afinitor | 2009 | Rare | Repurposed |
| Pulmonary hypertension | Opsumit | 2014 | Rare | New drug |
| Indolent non-Hodgkin lymphoma | Litak | 2006 | Rare | New drug |
| Myelodysplastic syndrome | Revlimid | 2008 | Rare | New drug |
EPAR European Public Assessment Report
* Rare if prevalence = or > 5/10.000 and > 0.1/10.000 inhabitants; Ultrarare if prevalence = or < 0.1/10.000 inhabitants
Percentage of EPARs where the methods are applicable
| METHOD | Applicability in percentage of EPARs | |||
|---|---|---|---|---|
| Step | Static step 1 (no adjustments) | Dynamic step 2 (adjustments) | ||
| Statistic | Percentage of EPARs | Percentage of clusters | Percentage of EPARs | Percentage of clusters |
| Extrapolation | 35% [9/26] | 83% | 46% [12/26] | 100% |
| Heterogeneity estimators | 4% [1/26] | 17% | 4% [1/26] | 17% |
| Prior distributions for variance parameters in sparse-event meta-analysis | 4% [1/26] | 17% | 4% [1/26] | 17% |
| Delayed-start randomisation | 13% [3/26] | 50% | 12% [3/26] | 50% |
| Sample size reassessment and hypothesis testing in adaptive survival trials | 35% [9/26] | 83% | 58% [15/26] | 100% |
| Multi-arm group sequential designs with a simultaneous stopping rule | 23% [6/26] | 67% | 58% [15/26] | 100% |
| Sequential designs for small samples | 31% [8/26] | 67% | 66% [17/26] | 100% |
| Bayesian sample size re-estimation using power priors | 12% [3/26] | 33% | 50% [13/26] | 100% |
| Dynamic borrowing through empirical power priors that control type I error | 15% [4/26] | 33% | 50% [13/26] | 100% |
| Fallback tests for co-primary endpoints | 15% [4/26] | 50% | 50% [13/26] | 100% |
| Optimal exact tests for multiple binary endpoints | 4% [1/26] | 17% | 31% [8/26] | 83% |
| Simultaneous inference for multiple marginal GEE models | 19% [5/26] | 50% | 23% [6/26] | 67% |
| Goal Attainment Scaling | 31% [8/26] | 67% | 31% [8/26] | 67% |
EPAR European Public Assessment Report
Fig. 1Header: Percentage of EPARs where the methods are applicable