Should treatment effects be estimated in pilot and feasibility studies?

Julius Sim, Julius Sim

Abstract

Background: Feasibility studies and external pilot studies are used increasingly to inform planning decisions related to a definitive randomized controlled trial. These studies can provide information on process measures, such as consent rates, treatment fidelity and compliance, and methods of outcome measurement. Additionally, they can provide initial parameter estimates for a sample size calculation, such as a standard deviation or the 'success' rate for a binary outcome in the control group. However, the issue of estimating treatment effects in pilot or feasibility studies is controversial.

Methodological discussion: Between-group estimates of treatment effect from pilot studies are sometimes used to calculate the sample size for a main trial, alongside estimated standard deviations. However, whilst estimating a standard deviation is an empirical matter, a targeted treatment effect should be established in terms of clinical judgement, as a minimum important difference (MID), not through analysis of pilot data. Secondly, between-group effects measured in pilot studies are sometimes used to indicate the magnitude of an effect that might be obtained in a main trial, and a decision on progression made with reference to the associated confidence interval. Such estimates will be imprecise in typically small pilot studies and therefore do not allow a robust decision on a main trial; both a decision to proceed and a decision not to proceed may be made too readily. Thirdly, a within-group change might be estimated from a pilot or a feasibility study in a desire to assess the potential efficacy of a novel intervention prior to testing it in a main trial, but again such estimates are liable to be imprecise and do not allow sound causal inferences.

Conclusion: Treatment effects calculated from pilot or feasibility studies should not be the basis of a sample size calculation for a main trial, as the MID to be detected should be based primarily on clinical judgement rather than statistics. Deciding on progression to a main trial based on these treatment effects is also misguided, as they will normally be imprecise, and may be biased if the pilot or feasibility study is unrepresentative of the main trial.

Keywords: Feasibility studies; Pilot studies; Randomized controlled trials; Treatment effects.

Conflict of interest statement

Competing interestsThe author declares that he has no competing interests

Figures

Fig. 1
Fig. 1
Estimated treatment effects, with 95% confidence intervals, reported for four outcomes by Sheehan et al. [25]. A difference in percentage improvement of 40% (indicated by the dashed horizontal reference line) was taken to be the minimum important difference
Fig. 2
Fig. 2
Simulated data for 20 pilot studies (each n = 34) estimating an unknown ‘true’ treatment effect of 10, with a standard deviation of 20, and with an assumed minimum important difference of 4 (dashed horizontal reference line). Error bars are 95% confidence intervals, and the observed underlying standard deviations range from 12.6 to 24.8
Fig. 3
Fig. 3
Confidence intervals for the estimated treatment effect in study 18 (n = 34) in the simulated example, at varying confidence levels from 95 to 70%. The dashed horizontal reference line indicates the minimum important difference

References

    1. Thabane L, Ma J, Chu R, Cheng J, Ismaila A, Rios LP, Robson R, Thabane M, Giangregorio L, Goldsmith CH. A tutorial on pilot studies: the what, why and how. BMC Med Res Methodol. 2010;10:1. doi: 10.1186/1471-2288-10-1.
    1. Lancaster GA, Dodd S, Williamson PR. Design and analysis of pilot studies: recommendations for good practice. J Eval Clin Pract. 2004;10(2):307–312. doi: 10.1111/j.2002.384.doc.x.
    1. Leon AC, Davis LL, Kraemer HC. The role and interpretation of pilot studies in clinical research. J Psychiatr Res. 2011;45(5):626–629. doi: 10.1016/j.jpsychires.2010.10.008.
    1. Abbott JH. The distinction between randomized clinical trials (RCTs) and preliminary feasibility and pilot studies: what they are and are not. J Orthop Sports Phys Ther. 2014;44(8):555–558. doi: 10.2519/jospt.2014.0110.
    1. Sim J, Lewis M. The size of a pilot study for a clinical trial should be calculated in relation to considerations of precision and efficiency. J Clin Epidemiol. 2012;65(3):301–308. doi: 10.1016/j.jclinepi.2011.07.011.
    1. Whitehead AL, Julious SA, Cooper CL, Campbell MJ. Estimating the sample size for a pilot randomised trial to minimise the overall trial sample size for the external pilot and main trial for a continuous outcome variable. Stat Methods Med Res. 2016;25(3):1057–1073. doi: 10.1177/0962280215588241.
    1. Eldridge SM, Costelloe CE, Kahan BC, Lancaster GA, Kerry SM. How big should the pilot study for my cluster randomised trial be? Stat Methods Med Res. 2016;25(3):1039–1056. doi: 10.1177/0962280215588242.
    1. Arain M, Campbell MJ, Cooper CL, Lancaster GA. What is a pilot or feasibility study? A review of current practice and editorial policy. BMC Med Res Methodol. 2010;10:67. doi: 10.1186/1471-2288-10-67.
    1. Eldridge SM, Lancaster GA, Campbell MJ, Thabana L, Hopewell S, Coleman CL, Bond CM. Defining feasibility and pilot studies in preparation for randomised controlled trials: development of a conceptual framework. PLoS One. 2016;11(3):e0150205. doi: 10.1371/journal.pone.0150205.
    1. Eldridge SM, Chan CL, Campbell MJ, Bond CM, Hopewell S, Thabane L, Lancaster GA. and on behalf of the PAFS consensus group. CONSORT 2010 statement: extension to randomised pilot and feasibility trials. Pilot Feasibility Stud. 2016;2:64. doi: 10.1186/s40814-016-0105-8.
    1. Cook JA, Julious SA, Sones W, Hampson LV, Hewitt C, Berlin JA, Ashby D, Emsley R, Fergusson DA, Walters SJ, Wilson ECF, Maclennan G, Stallard N, Rothwell JC, Bland M, Brown L, Ramsay CR, Cook A, Armstrong D, Altman D, Vale LD. DELTA2 guidance on choosing the target difference and undertaking and reporting the sample size calculation for a randomised controlled trial. Trials. 2018;19(1):606. doi: 10.1186/s13063-018-2884-0.
    1. Shanyinde M, Pickering RM, Weatherall M. Questions asked and answered in pilot and feasibility randomized controlled trials. BMC Med Res Methodol. 2010;11:117. doi: 10.1186/1471-2288-11-117.
    1. Hislop J, Adewuyi TE, Vale LD, Harrild K, Fraser C, Gurung T, Altman DG, Briggs AH, Fayers P, Ramsay CR, Norrie JD, Harvey IM, Buckley B, Cook JA, DELTA group Methods for specifying the target difference in a randomised controlled trial: the Difference ELicitation in TriAls (DELTA) systematic review. PloS Med. 2014;11(5):e1001645. doi: 10.1371/journal.pmed.1001645.
    1. Abdulatif M, Mukhtar A, Obayah G. Pitfalls in reporting sample size calculation in randomized controlled trials published in leading anaesthesia journals: a systematic review. Br J Anaesth. 2015;115(5):699–707. doi: 10.1093/bja/aev166.
    1. Kaur N, Figueiredo S, Bouchard V, Moriello C, Mayo N. Where have all the pilot studies gone? A follow-up on 30 years of pilot studies in Clinical Rehabilitation. Clin Rehabil. 2017;31(9):1238–1248. doi: 10.1177/0269215517692129.
    1. Uszynski MK, Purtill H, Donnelly A, Coote S. Comparing the effects of whole-body vibration to standard exercise in ambulatory people with multiple sclerosis: a randomised controlled feasibility study. Clin Rehabil. 2016;30(7):657–668. doi: 10.1177/0269215515595522.
    1. Kemp JL, Coburn SL, Jones DM, Crossley KM. The Physiotherapy for Femoroacetabular Impingement Rehabilitation STudy (physioFIRST): a pilot randomized controlled trial. J Orthop Sports Phys Ther. 2018;48(4):307–315. doi: 10.2519/jospt.2018.7941.
    1. Mollart L, Skinner V, Foureur M. A feasibility randomised controlled trial of acupressure to assist spontaneous labour for primigravid women experiencing a post-date pregnancy. Midwifery. 2016;36:21–27. doi: 10.1016/j.midw.2016.02.020.
    1. Kraemer HC, Mintz J, Noda A, Tinklenberg J, Yesavage JA. Caution regarding the use of pilot studies to guide power calculations for study proposals. Arch Gen Psychiatry. 2006;63(5):484–489. doi: 10.1001/archpsyc.63.5.484.
    1. Friedman LM, Furberg CD, DeMets DL, Reboussin DM, Granger CB. Fundamentals of clinical trials. 5. Heidelberg: Springer; 2015.
    1. Ruzicka M, McCormick B, Magner P, Ramsay T, Edwards C, Bugela A, Hiremath S. Thiazide diuretic-caused hyponatremia in the elderly hypertensive: will a bottle of Nepro a day keep hyponatremia and the doctor away? Study protocol for a proof-of-concept feasibility trial. Pilot Feasibility Stud. 2018;4:71. doi: 10.1186/s40814-018-0263-y.
    1. Westlund E, Stuart EA. The nonuse, misuse, and proper use of pilot studies in experimental evaluation research. Am J Eval. 2016;38(2):246–261. doi: 10.1177/1098214016651489.
    1. Lee EC, Whitehead AL, Jacques RM, Julious SA. The statistical interpretation of pilot trials: should significance thresholds be reconsidered? BMC Med Res Methodol. 2014;14:41. doi: 10.1186/1471-2288-14-41.
    1. Arnold DM, Burns KEA, Adhikari NKJ, Kho ME, Meade MO. Cook DJ; for the McMaster Critical Care Interest Group. The design and interpretation of pilot trials in clinical research in critical care. Crit Care Med. 2009;37(suppl):S69–S74. doi: 10.1097/CCM.0b013e3181920e33.
    1. Sheehan JL, Winzeler-Merçay U, Mudie MH. A randomized controlled pilot study to obtain the best estimate of the size of the effect of a thermoplastic resting splint on spasticity in the stroke-affected wrist and fingers. Clin Rehabil. 2006;20(12):1032–1037. doi: 10.1177/0269215506071267.
    1. Roberts C, Torgerson DJ. Baseline imbalance in randomised controlled trials. BMJ. 1999;319(7203):185. doi: 10.1136/bmj.319.7203.185.
    1. Chester R, Costa ML, Shepstone L, Cooper A, Donell ST. Eccentric calf muscle training compared with therapeutic ultrasound for chronic Achilles tendon pain – a pilot study. Man Ther. 2008;13(6):484–491. doi: 10.1016/j.math.2007.05.014.
    1. Garcia MK, Driver L, Haddad R, Lee R, Palmer JL, Wei Q, Frenkel M, Cohen L. Acupuncture for treatment of uncontrolled pain in cancer patients: a pragmatic pilot study. Integr Cancer Ther. 2014;13(2):133–140. doi: 10.1177/1534735413510558.
    1. Galantino ML, Callens ML, Cardena GJ, Piela NL, Mao JJ. Tai chi for well-being of breast cancer survivors with aromatase inhibitor-associated arthralgias: a feasibility study. Altern Ther Health Med. 2013;19(6):38–44. PMID: 24254037.
    1. Julious SA, Swank DJ. Moving statistics beyond the individual clinical trial: applying decision science to optimize a clinical development plan. Pharm Stat. 2005;4(1):37–46. doi: 10.1002/pst.149.
    1. Proschan MA. Sample size re-estimation in clinical trials. Biom J. 2009;51(2):348–357. doi: 10.1002/bimj.200800266.

Source: PubMed

3
Subskrybuj