Literature DB >> 29403314

Guidance for using pilot studies to inform the design of intervention trials with continuous outcomes.

Melanie L Bell¹, Amy L Whitehead², Steven A Julious².

Abstract

BACKGROUND: A pilot study can be an important step in the assessment of an intervention by providing information to design the future definitive trial. Pilot studies can be used to estimate the recruitment and retention rates and population variance and to provide preliminary evidence of efficacy potential. However, estimation is poor because pilot studies are small, so sensitivity analyses for the main trial's sample size calculations should be undertaken.
METHODS: We demonstrate how to carry out easy-to-perform sensitivity analysis for designing trials based on pilot data using an example. Furthermore, we introduce rules of thumb for the size of the pilot study so that the overall sample size, for both pilot and main trials, is minimized.
RESULTS: The example illustrates how sample size estimates for the main trial can alter dramatically by plausibly varying assumptions. Required sample size for 90% power varied from 392 to 692 depending on assumptions. Some scenarios were not feasible based on the pilot study recruitment and retention rates.
CONCLUSION: Pilot studies can be used to help design the main trial, but caution should be exercised. We recommend the use of sensitivity analyses to assess the robustness of the design assumptions for a main trial.

Entities: Chemical

Keywords: feasibility; pilot; power; randomized controlled trial; sample size; sensitivity analysis

Year: 2018 PMID： 29403314 PMCID： PMC5779280 DOI： 10.2147/CLEP.S146397

Source DB: PubMed Journal: Clin Epidemiol ISSN： 1179-1349 Impact factor: 4.790

Introduction

Prior to a definitive intervention trial, a pilot study may be undertaken. Pilot trials are often small versions of the main trial, undertaken to test trial methods and procedures.1,2 The overall aim of pilot studies is to demonstrate that a future trial can be undertaken. To address this aim, there are a number of objectives for a pilot study including assessing recruitment and retention rates, obtaining estimates of parameters required for sample size calculation, and providing preliminary evidence of efficacy potential.3–6 We illustrate how to use pilot studies to inform the design of future randomized controlled trials (RCTs) so that the likelihood of answering the research question is high. We show how pilot studies can address each of the objectives listed earlier, how to optimally design a pilot trial, and how to perform sample size sensitivity analysis. Our example uses a continuous outcome, but most of the content can be applied to pilot studies in general.

Considerations for trial design

When designing a definitive trial, one must consider The target effect size, such as the difference in means for continuous outcomes; The variance about the estimates for continuous outcomes, which is used to give a range of responses for individuals in the trial; Feasibility, including referral, recruitment, and retention rates. Pilot trial results can inform each of these elements. Factors such as type I error and power are set independent of the pilot and are discussed in detail elsewhere.7 We focus on external pilot studies, where the trial is run before the main trial, and results are not combined.8

Feasibility

The first consideration is feasibility: will the researchers be able to recruit the required number of participants within the study timeframe and retain them in the main trial? While review of clinical records can be used to give some indication of potential participant pool, pilot studies provide estimates of the number of participants that actually enroll and consent to randomization, and these estimates should be included in the manuscripts that report the pilot study results.9 Many trials struggle to reach their sample size goal, which can result in trial extensions or failure to recruit to their prespecified sample size.10 Failure to recruit is a major issue in UK publicly funded trials, where 45% fail to reach target sample size.10 Along with review of prior trials at the same centers in similar populations, pilot studies can also give estimates of retention rates and adherence rates.11 Missing data and dropouts are issues in most RCTs12 and need to be considered at each step of the research process,13 including design, reporting,9 and progression to a larger definitive trial.11,14

Target effect size and potential efficacy

Hislop et al15 undertook a systematic review to identify seven approaches for determining the target effect size for an RCT and classified them as clinically important and/or realistic. A specific type of clinically important difference is the smallest value that would make a difference to patients or that could change care, a quantity referred to as the minimum important difference (MID), or sometimes minimally clinical important difference. The MID can be difficult to determine, particularly as it can change with patient population. However, researchers in various fields have investigated MID estimation and provide guidance on estimation.16,17 In the absence of a known MID for continuous outcomes, particularly patient-reported outcomes, a standardized effect size15 between 0.3 and 0.5 has been recommended.17,18 Expert opinion is also used to specify important differences.15 Although some researchers use the pilot effect size to power the definitive trial, this is a practice that should be avoided in general, as estimation is poor due to the small sample size, and is likely to mislead.19 The target effect size must also be realistic, and the estimated effect size and confidence interval (CI) from the pilot can give some evidence here, ie, whether there is any indication that the intervention is effective and important differences might be obtained in the main trial.5 The small sample size of a pilot makes estimation uncertain, so caution must be exercised.19,20 One approach for handling this uncertainty is to use significance levels other than the “traditional” 5% to provide preliminary evidence for efficacy, with corresponding CIs such as 85 and 75% in addition to 95% CIs.21 A figure showing these CIs, the MID, and the null value can be a helpful way of displaying pilot results, by facilitating an assessment of both statistical significance and the potential for clinical significance.31 While some authors argue against carrying out hypothesis tests and assessing efficacy from pilots, even potential efficacy, most pilot studies do undertake hypothesis tests.6 We strongly stress that preliminary efficacy evidence from a pilot study should not be overstated, and researchers should avoid temptation to forgo the main trial.20,22

Estimating the standard deviation (SD)

The population SD is another key element of sample size estimation for continuous outcomes, and its estimation is one of the objectives for conducting a pilot study. However, similar to the effect size, the SD can be imprecisely estimated due to the pilot’s small sample size. Using a pilot study’s SD to design a future sample size has been shown to often result in an underpowered study.23,24 Thus, sensitivity analyses should be undertaken.

Sensitivity analysis for sample size

Sensitivity analyses are important to assess the robustness of study results to the assumptions made in the primary analysis.25 Sensitivity analyses should also be performed in the design stage26 and can take the form of accounting for the uncertainty in estimation by calculating sample sizes based on a range of plausible SDs and retention/dropout rates. Browne23 suggested using the pilot study’s upper limit of the 80% CI for the SD to calculate sample size in the subsequent trial. One may also consider SDs from the literature.

Pilot study sample size

In order to have the best chance of answering the research question, researchers should carefully consider the size of not only the definitive trial but also the pilot as well. Although traditional power calculations are inappropriate for pilot studies (since the primary aim of a pilot study is not to test superiority of one treatment over the other), a sample size justification is important. While there are several rules of thumb for the size of a pilot study, ranging from 12 to 35 individuals per arm,5,27 none of these guidelines account for the likely size of the future trial. Whitehead et al27 showed how, if you know the main trial’s target effect size, you can estimate the pilot study’s optimum sample size, minimizing the number of patients recruited across the two studies. From this work, they proposed stepped rules of thumb for pilot studies based on the target effect size and the size of the future trial. These rules are summarized in Table 1. For example, if the future trial will be designed around a small effect, then the number of patients per arm for the pilot study should be 25 for 90% power. Using these rules increases the likelihood of appropriate power for the future trial. Cocks and Torgerson5 also recommend basing the pilot study size on the future trial’s size, if the SD is known.

Table 1

Stepped rules of thumb for pilot study sample size per arm, as a function of the target effect size (standardized difference) and power of the main trial

Standardized difference, da	80% powered main trial		90% powered main trial
Standardized difference, da	Pilot N per arm	Main trial N per arm	Pilot N per arm	Main trial N per arm
Extra small (d<0.1)	50	>1571	75	>2103
Small (0.1≤d<0.3)	20	176–1571	25	235–2103
Medium (0.3≤d<0.7)	10	34–176	15	44–235
Large (d≥0.7)	10	≤34	10	≤44

Notes:

, where . The corresponding likely size of the main trial is also shown. int is the intervention arm, and ctl is the control arm.

Example

Suppose a research team is planning a pilot in the anticipation of designing a definitive trial. The main trial will be a two-arm RCT comparing a new supportive care regimen for cancer patients to usual care, with assessments at baseline, 6 weeks, and 3 months. Their primary outcome is the quality of life at 3 months as measured by the Functional Assessment of Cancer Therapy-General (FACT-G), a 27-item questionnaire covering aspects of physical, social, family, emotional, and functional well-being.28

Pilot study sample size

To use the stepped rules of thumb for pilot sample size, the researchers must consider the target effect size and SD for the main trial in order to calculate the standardized difference (effect size). They find that the estimated FACT-G MID is between three and six points29 and an SD estimate from the literature30 is 14 in similar populations. Using an MID estimate of four points, and an SD of 14, the standardized effect size is 4/14=0.29. For a 90% powered main trial, they should use a sample size of 25 per arm for the pilot (Table 1).

Pilot study results

Suppose now the researchers undertake the pilot study of 50 participants with recruitment over 2 months. Of the 100 potential participants, 70 participants were referred by their oncologist, 60 participants met eligibility criteria, and 50 participants agreed to participate. This indicates a recruitment rate of 50% of eligible patients, at 25 recruitments per month. Of the 50 participants, 40 participants completed all three assessments; retention is 80%. These rates will aid in estimating the main trial duration. The difference in the quality of life between the arms at 3 months is estimated at 3.1 points, with 95% CI −1.8 to 8.0, and SD =11.2. Figure 1 shows several CIs demonstrating that the intervention is promising, as each CI contains the MID of 4. Thus, the objective of the pilot study to provide preliminary evidence of efficacy has been met.

Figure 1

Mean difference in FACT-G scores between pilot study intervention and control arms with confidence intervals.

Abbreviations: FACT-G, Functional Assessment of Cancer Therapy-Genera; MID, minimum important difference.

Sample size calculations and sensitivity analyses

Table 2 shows sample sizes based on the pilot study’s SD, its upper 80% CI limit (taken as the square root of the CI for the variance), and the original estimate from the literature. Sample sizes are also given for the observed dropout rate (20%) and for >5 and <5%. For 90% power, sample size ranges from 392 to 692. For 80% power, sample sizes range from 296 to 518. Note that the sensitivity analysis is quantified in terms of the effect of assumptions on the sample size. An alternative approach is to fix the sample size (at 392 say) and observe how power varies based on assumptions.

Table 2

A range of sample sizes varying dropout, recruitment rate, and estimated SD assuming an effect size of four points

SD justification	SD	Dropout (%)	90% powered main trial			80% powered main trial
SD justification	SD	Dropout (%)	N (total)	Required recruitment rate per montha	Feasibleb	N (total)	Required recruitment rate per montha	Feasibleb
Pilot study SD	11.2	15	392	22	Yes	296	16	Yes
		20	416	23	Yes	314	17	Yes
		25	444	25	Yes	334	19	Yes
Upper 80% confidence limit from pilot	13.2	15	542	30	No	406	23	Yes
		20	576	32	No	430	24	Yes
		25	614	34	No	460	26	No
Literature	14.0	15	610	34	No	458	26	No
		20	648	36	No	486	27	No
		25	692	38	No	518	29	No

Notes:

Based on 1.5 years of recruitment.

Based on the pilot study recruitment rate of 25 participants per month.

Abbreviation: SD, standard deviation.

Feasibility of the main trial

We now consider feasibility. Specifically, are the researchers likely to be able to recruit the required number of participants within the study timeframe? Based on the funding and the follow-up time of 3 months, recruitment can take 1.5 years. If the pilot recruitment rate of 25 participants per month is a good estimate, then the study will be able to recruit and enroll 450 participants. This falls below several of the estimates in Table 2. Further consideration may be needed how to expand the pool of participants.

Conclusion

We have illustrated how pilot studies can aid in the design of future trials with continuous outcomes by providing estimates of population SD, evidence of potential for intervention effectiveness, and quantification of feasibility in the form of recruitment and retention rates. We have introduced guidelines on pilot study sample size and demonstrated sample size sensitivity analysis. The example demonstrated how main trial sample size estimates can vary dramatically by plausibly altering assumptions. The decision to progress from a pilot trial to a main trial is generally made using feasibility estimates, as well as issues such as protocol nonadherence. For more information on progression, refer to Avery et al,11 and for information on the context of internal pilots, refer to Hampson et al.14 Whether researchers decide to progress to a definitive trial or not, results of pilot studies should be published. A CONSORT extension for reporting results of pilot and feasibility studies gives detailed guidelines.9

31 in total

1. Trials in primary care: statistical issues in the design, conduct and evaluation of complex interventions.

Authors: G A Lancaster; M J Campbell; S Eldridge; A Farrin; M Marchant; S Muller; R Perera; T J Peters; A T Prevost; G Rait
Journal: Stat Methods Med Res Date: 2010-05-04 Impact factor: 3.021

2. The truly remarkable universality of half a standard deviation: confirmation through another look.

Authors: Geoffrey R Norman; Jeff A Sloan; Kathleen W Wyrwich
Journal: Expert Rev Pharmacoecon Outcomes Res Date: 2004-10 Impact factor: 2.217

Review 3. A point of minimal important difference (MID): a critique of terminology and methods.

Authors: Madeleine T King
Journal: Expert Rev Pharmacoecon Outcomes Res Date: 2011-04 Impact factor: 2.217

4. A framework for prospectively defining progression rules for internal pilot studies monitoring recruitment.

Authors: Lisa V Hampson; Paula R Williamson; Martin J Wilby; Thomas Jaki
Journal: Stat Methods Med Res Date: 2017-06-07 Impact factor: 3.021

5. Designing psycho-oncology randomised trials and cluster randomised trials: variance components and intra-cluster correlation of commonly used psychosocial measures.

Authors: Melanie L Bell; Joanne E McKenzie
Journal: Psychooncology Date: 2012-10-18 Impact factor: 3.894

6. Practical and statistical issues in missing data for longitudinal patient-reported outcomes.

Authors: Melanie L Bell; Diane L Fairclough
Journal: Stat Methods Med Res Date: 2013-02-19 Impact factor: 3.021

7. A tutorial on sensitivity analyses in clinical trials: the what, why, when and how.

Authors: Lehana Thabane; Lawrence Mbuagbaw; Shiyuan Zhang; Zainab Samaan; Maura Marcucci; Chenglin Ye; Marroon Thabane; Lora Giangregorio; Brittany Dennis; Daisy Kosa; Victoria Borg Debono; Rejane Dillenburg; Vincent Fruci; Monica Bawor; Juneyoung Lee; George Wells; Charles H Goldsmith
Journal: BMC Med Res Methodol Date: 2013-07-16 Impact factor: 4.615

Review 8. Handling missing data in RCTs; a review of the top medical journals.

Authors: Melanie L Bell; Mallorie Fiero; Nicholas J Horton; Chiu-Hsieh Hsu
Journal: BMC Med Res Methodol Date: 2014-11-19 Impact factor: 4.615

Review 9. Informing efficient randomised controlled trials: exploration of challenges in developing progression criteria for internal pilot studies.

Authors: Kerry N L Avery; Paula R Williamson; Carrol Gamble; Elaine O'Connell Francischetto; Chris Metcalfe; Peter Davidson; Hywel Williams; Jane M Blazeby
Journal: BMJ Open Date: 2017-02-17 Impact factor: 2.692

10. A tutorial on pilot studies: the what, why and how.

Authors: Lehana Thabane; Jinhui Ma; Rong Chu; Ji Cheng; Afisi Ismaila; Lorena P Rios; Reid Robson; Marroon Thabane; Lora Giangregorio; Charles H Goldsmith
Journal: BMC Med Res Methodol Date: 2010-01-06 Impact factor: 4.615

44 in total

1. Effect of the Triceps Brachii Facilitation Technique on Scapulohumeral Muscle Activation during Reach and Point in a Healthy Population.

Authors: Olive Lennon; Kaushika Logeswaran; Srushti Mistry; Tara Moore; Giacomo Severini; Catherine Cornall; Cliona O'Sullivan; Ulrik McCarthy Persson
Journal: Physiother Can Date: 2019 Impact factor: 1.037

2. The Surveillance of Physical Activity, Sedentary Behavior, and Sleep: Protocol for the Development and Feasibility Evaluation of a Novel Measurement System.

Authors: Patrick Crowley; Erika Ikeda; Sheikh Mohammed Shariful Islam; Rasmus Kildedal; Sandra Schade Jacobsen; Jon Roslyng Larsen; Peter J Johansson; Pasan Hettiarachchi; Mette Aadahl; Paul Jarle Mork; Leon Straker; Emmanuel Stamatakis; Andreas Holtermann; Nidhi Gupta
Journal: JMIR Res Protoc Date: 2022-06-06

3. Including migrant oncology patients in research: A multisite pilot randomised controlled trial testing consultation audio-recordings and question prompt lists.

Authors: Amelia Hyatt; Ruby Lipson-Smith; Karla Gough; Phyllis Butow; Michael Jefford; Thomas F Hack; Sandra Hale; Emiliano Zucchi; Shane White; Uldis Ozolins; Penelope Schofield
Journal: Contemp Clin Trials Commun Date: 2022-05-28

Review 4. The practice of pilot/feasibility studies in informing the conduct of HIV related clinical trials in sub-Saharan Africa: A scoping review.

Authors: Sylivia Nalubega; Lawrence Obado Osuwat; Poku Brenda Agyeiwaa; Catrin Evans; John Bosco Matovu
Journal: Contemp Clin Trials Commun Date: 2022-07-08

5. Randomized Trial on the Effects of a Group EMDR Intervention on Narrative Complexity and Specificity of Autobiographical Memories: A Path Analytic and Supervised Machine-Learning Study.

Authors: Andrea Poli; Angelo Gemignani; Mario Miccoli
Journal: Int J Environ Res Public Health Date: 2022-06-23 Impact factor: 4.614

6. Pilot Trial of a Sleep-Promoting Intervention for Children With Type 1 Diabetes.

Authors: Sarah S Jaser; Erin M Bergner; Emily R Hamburger; Shivani Bhatia; Morgan Lyttle; Grace E Bell; J Christopher Slaughter; Beth A Malow; Jill H Simmons
Journal: J Pediatr Psychol Date: 2021-03-18

7. Keep on running - a randomized controlled trial to test a digital evidence-based intervention for sustained adoption of recreational running: rationale, design and pilot feasibility study.

Authors: Hugo V Pereira; Pedro J Teixeira; Marta M Marques; Eliana V Carraça; Marlene N Silva; Jorge Encantado; Inês Santos; António L Palmeira
Journal: Health Psychol Behav Med Date: 2021-03-01

8. Nasal High-flow Oxygen Versus Conventional Oxygen Therapy for Acute Severe Asthma Patients: A Pilot Randomized Controlled Trial.

Authors: Onlak Ruangsomboon; Chok Limsuwat; Nattakarn Praphruetkit; Apichaya Monsomboon; Tipa Chakorn
Journal: Acad Emerg Med Date: 2020-12-22 Impact factor: 3.451

9. Introspective Meditation before Seeking Pleasurable Activities as a Stress Reduction Tool among College Students: A Multi-Theory Model-Based Pilot Study.

Authors: Manoj Sharma; Amar Kanekar; Kavita Batra; Traci Hayes; Ram Lakhan
Journal: Healthcare (Basel) Date: 2022-03-25

10. Pilot randomized controlled trial of eHealth cognitive-behavioral therapy for insomnia among Spanish-speaking breast cancer survivors.

Authors: Laura B Oswald; Jennifer Morales-Cruz; Sarah L Eisel; Jocelyn Del Rio; Aasha I Hoogland; Vanessa Ortiz-Rosado; Grace Soto-Lopez; Edgar Rodriguez-Rivera; Josée Savard; Eida Castro; Heather S L Jim; Brian D Gonzalez
Journal: J Behav Med Date: 2022-04-19