The batched stepped wedge design: A design robust to delays in cluster recruitment

Jessica Kasza, Rhys Bowden, Richard Hooper, Andrew B Forbes, Jessica Kasza, Rhys Bowden, Richard Hooper, Andrew B Forbes

Abstract

Stepped wedge designs are an increasingly popular variant of longitudinal cluster randomized trial designs, and roll out interventions across clusters in a randomized, but step-wise fashion. In the standard stepped wedge design, assumptions regarding the effect of time on outcomes may require that all clusters start and end trial participation at the same time. This would require ethics approvals and data collection procedures to be in place in all clusters before a stepped wedge trial can start in any cluster. Hence, although stepped wedge designs are useful for testing the impacts of many cluster-based interventions on outcomes, there can be lengthy delays before a trial can commence. In this article, we introduce "batched" stepped wedge designs. Batched stepped wedge designs allow clusters to commence the study in batches, instead of all at once, allowing for staggered cluster recruitment. Like the stepped wedge, the batched stepped wedge rolls out the intervention to all clusters in a randomized and step-wise fashion: a series of self-contained stepped wedge designs. Provided that separate period effects are included for each batch, software for standard stepped wedge sample size calculations can be used. With this time parameterization, in many situations including when linear models are assumed, sample size calculations reduce to the setting of a single stepped wedge design with multiple clusters per sequence. In these situations, sample size calculations will not depend on the delays between the commencement of batches. Hence, the power of batched stepped wedge designs is robust to unexpected delays between batches.

Keywords: cluster randomized trial; intracluster correlation; sample size calculation; within-cluster correlation structure.

Conflict of interest statement

The authors declare no potential conflict of interests.

© 2022 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd.

Figures

FIGURE 1
FIGURE 1
An example of a standard stepped wedge design, with 4 periods and 3 sequences (0 indicates periods in which the control condition is implemented; 1 indicates periods in which the intervention is implemented)
FIGURE 2
FIGURE 2
Three different ways in which the effect of time can be parameterized in a design where clusters commence study participation in three batches: the βs parameterize the time effects. For example, in the top panel, β1 parameterizes the effect of month 1 on outcomes. The top panel indicates how effects of time are shared across batches when the effects of calendar time are assumed to be constant across batches; the middle panel indicates how the effects of time are shared across batches when time‐on‐trial effects are assumed to be constant across batches; the bottom panel indicates that no time effects are shared across batches when separate time effects are estimated in each batch
FIGURE 3
FIGURE 3
Four examples of batched stepped wedge designs with identical component designs (0 indicates control periods; 1 indicates intervention periods). Each of these designs has three batches of three‐period stepped wedge designs, with differing degrees of overlap between successive batches. Design 1 (top row): no overlap between successive batches; Design 2 (second from top): overlap of one period between successive batches; Design 3 (second from bottom): overlap of two periods between successive batches; Design 4 (bottom row): variable overlap between successive batches
FIGURE 4
FIGURE 4
Two examples of batched stepped wedge designs without identical component designs (0 indicates control periods; 1 indicates intervention periods)
FIGURE 5
FIGURE 5
The design schematic for the PACT‐HF trial: Two batches of a 5‐sequence stepped wedge design
FIGURE 6
FIGURE 6
Empirical and theoretical type I error rates (left panel) and power (right panel) for the simulated continuous outcomes. ICC, intracluster correlation; CAC, cluster autocorrelation. Within each panel, subpanels correspond to a different value of the CAC. The theoretical and empirical type I error rate or power is displayed for each combination of number of periods of overlap, ICC, and CAC, with the empirical result plus and minus 2 standard errors also displayed
FIGURE 7
FIGURE 7
Empirical and theoretical type I error rates (left panel) and power (right panel) for the simulated binary outcomes analyzed via GEE. ICC, intracluster correlation. The theoretical and empirical type I error rate or power is displayed for each combination of number of periods of overlap and ICC, with the empirical result plus and minus 2 standard errors also displayed

References

    1. Mdege ND, Man MS, Taylor CA, Torgerson DJ. Systematic review of stepped wedge cluster randomized trials shows that design is particularly used to evaluate interventions during routine implementation. J Clin Epidemiol. 2011;64(9):936‐948.
    1. Matthews JNS, Forbes AB. Stepped wedge designs: insights from a design of experiments perspective. Stat Med. 2017;36(24):3772‐3790.
    1. Hemming K, Taljaard M, McKenzie JE, et al. Reporting of stepped wedge cluster randomised trials: extension of the CONSORT 2010 statement with explanation and elaboration. BMJ. 2020;363:k1614.
    1. Hussey MA, Hughes JP. Design and analysis of stepped wedge cluster randomized trials. Contemp Clin Trials. 2007;28:182‐191.
    1. Hemming K, Kasza J, Hooper R, Forbes A, Taljaard M. A tutorial on sample size calculation for multiple‐period cluster randomized parallel, cross‐over and stepped‐wedge trials using the shiny CRT calculator. Int J Epidemiol. 2020;49:979‐995.
    1. Li F, Turner EL, Preisser JS. Sample size determination for GEE analyses of stepped wedge cluster randomized trials. Biometrics. 2018;74:1450‐1458.
    1. Hastings SN, Stechuchak KM, Choate A, et al. Implementation of a stepped wedge cluster randomized trial to evaluate a hospital mobility program. Trials. 2020;21:863.
    1. ESCP EAGLE Safe Anastomosis Collective . ESCP Safe Anastomosis Programme in Colorectal Surgery (EAGLE): study protocol for an international cluster randomised trial of a quality improvement intervention to reduce anastomotic leak following right colectomy. Color Dis. 2021;23:2761‐2771. doi:10.1111/codi.15806
    1. Hooper R, Bourke L. Cluster randomised trials with repeated cross sections: alternatives to parallel group designs. BMJ. 2015;350:h2925.
    1. Taljaard M, Teerenstra S, Ivers NM, Fergusson DA. Substantial risks associated with few clusters in cluster randomized and stepped wedge designs. Clin Trials. 2016;13:459‐463.
    1. Hemming K, Haines TP, Chilton PJ, Girling AJ, Lilford R. The stepped‐wedge cluster randomised trial: rationale, design, analysis and reporting. BMJ. 2015;350:h391.
    1. Kasza J, Hemming K, Hooper R, Matthews JNS, Forbes AB. Impact of non‐uniform correlation structure on sample size and power in multiple‐period cluster randomised trials. Stat Methods Med Res. 2019;28:703‐716.
    1. Grantham KL, Kasza J, Heritier S, Hemming K, Forbes AB. Accounting for a decaying correlation structure in cluster randomized trials with continuous recruitment. Stat Med. 2019;38:1918‐1934.
    1. Kasza J, Hooper R, Copas A, Forbes AB. Sample size and power calculations for open cohort longitudinal cluster randomized trials. Stat Med. 2020;39:1871‐1883.
    1. Kasza J, Taljaard M, Forbes AB. Information content of stepped‐wedge designs when treatment effect heterogeneity and/or implementation periods are present. Stat Med. 2019;38:4686‐4701.
    1. Kasza J, Bowden R, Forbes AB. Information content of stepped wedge designs with unequal cluster‐period sizes in linear mixed models: informing incomplete designs. Stat Med. 2021;40:1736‐1751. doi:10.1002/sim.8867
    1. Harrison LJ, Chen T, Wang R. Power calculation for cross‐sectional stepped wedge cluster randomized trials with variable cluster sizes. Biometrics. 2020;76:951‐962.
    1. Zhou X, Liao X, Kunz LM, Normand SLT, Wang M, Spiegelman D. A maximum likelihood approach to power calculations for stepped wedge designs of binary outcomes. Biostatistics. 2020;21:102‐121.
    1. Li F, Forbes A, Turner EL, Preisser JS. Power and sample size requirements for GEE analyses of cluster randomized crossover trials. Stat Med. 2019;38:636‐649.
    1. Unni RR, Lee SF, Thabane L, Connolly S, Van Spall HG. Variations in stepped‐wedge cluster randomized trial design: insights from the patient‐centered care transitions in heart failure trial. Am Heart J. 2020;220:116‐126.
    1. Hemming K, Girling A. A menu‐driven facility for power and detectable‐difference calculations in stepped‐wedge cluster‐randomized trials. Stata J. 2014;14:363‐380.
    1. Chen J, Zhou X, Li F, Spiegelman D. swdpwr: a SAS macro and an R package for power calculation in stepped wedge cluster randomized trials; 2020. arxiv:2011.06031v1.
    1. Morris TP, White IR, Crowther MJ. Using simulation studies to evaluate statistical methods. Stat Med. 2019;38:2074‐2102.
    1. Kammer M. looplot: create nested loop plots; 2022. R package version 0.5.0.9002.
    1. Qaqish BF. A family of multivariate binary distributions for simulating correlated binary variables. Biometrika. 2003;90:455‐463.

Source: PubMed

Подписаться