Choosing an imbalance metric for covariate-constrained randomization in multiple-arm cluster-randomized trials

Jody D Ciolino, Alicia Diebold, Jessica K Jensen, Gerald W Rouleau, Kimberly K Koloms, Darius Tandon, Jody D Ciolino, Alicia Diebold, Jessica K Jensen, Gerald W Rouleau, Kimberly K Koloms, Darius Tandon

Abstract

Background: In cluster-randomized controlled trials (C-RCTs), covariate-constrained randomization (CCR) methods efficiently control imbalance in multiple baseline cluster-level variables, but the choice of imbalance metric to define the subset of "adequately balanced" possible allocation schemes for C-RCTs involving more than two arms and continuous variables is unclear. In an ongoing three-armed C-RCT, we chose the min(three Kruskal-Wallis [KW] test P values) > 0.30 as our metric. We use simulation studies to explore the performance of this and other metrics of baseline variable imbalance in CCR.

Methods: We simulated three continuous variables across three arms under varying allocation ratios and assumptions. We compared the performance of min(analysis of variance [ANOVA] P value) > 0.30, min(KW P value) > 0.30, multivariate analysis of variance (MANOVA) P value > 0.30, min(nine possible t test P values) > 0.30, and min(Wilcoxon rank-sum [WRS] P values) > 0.30.

Results: Pairwise comparison metrics (t test and WRS) tended to be the most conservative, providing the smallest subset of allocation schemes (10%-13%) meeting criteria for acceptable balance. Sensitivity of the min(t test P values) > 0.30 for detecting non-trivial imbalance was 100% for both hypothetical and resampled simulation scenarios. The KW criterion maintained higher sensitivity than both the MANOVA and ANOVA criteria (89% to over 99%) but was not as sensitive as pairwise criteria.

Conclusions: Our criterion, the KW P value > 0.30, to signify "acceptable" balance was not the most conservative, but it appropriately identified imbalance in the majority of simulations. Since all are related, CCR algorithms involving any of these imbalance metrics for continuous baseline variables will ensure robust simultaneous control over multiple continuous baseline variables, but we recommend care in determining the threshold of "acceptable" levels of (im)balance.

Trial registration: This trial is registered on ClinicalTrials.gov (initial post: December 1, 2016; identifier: NCT02979444 ).

Keywords: Cluster randomization; Cluster-randomized controlled trial; Continuous covariate; Covariate-constrained randomization; Imbalance.

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Distribution of site-level randomization variables by arm for trial NCT02979444. We planned for a total of 42 randomized sites (6:18:18), but owing to dropout we have 38 active sites. Sample size and power considerations accounted for dropout that we observed. Medians (interquartile ranges) are displayed in each arm for each variable above
Fig. 2
Fig. 2
Maximum pairwise imbalance observed for scenarios meeting adequacy threshold (P > 0.30) in simulated trials by criterion. Each panel represents the distribution of the max(mean difference) for simulated scenarios meeting the criterion for “adequate” overall variable balance across arms. All simulated schemes meet criteria for adequate under simple random allocation (panel a), but the remaining panels illustrate only those allocation schemes meeting the P > 0.30 criterion for each metric. The mean difference depicted is on the standard deviation unit scale. Those meeting this criterion would ideally have a small max(mean difference), and we deem a max(mean difference) > 1.0 (vertical line) unacceptable since previously a value of 0.8 would be deemed “large” [16]
Fig. 3
Fig. 3
Pairwise scatterplots exploring associations between Kruskal–Wallis (KW) P value with other measures. The panels here present a selection of pairwise plots to illustrate the relationships between the imbalance metric we used in our randomization algorithm, the KW test P value, and additional candidate imbalance metrics explored. Each plot includes 5000 observations from the resampled scenarios using 1:3:3 allocation as in our present study. In each plot, there is often a non-linear relationship. For example, the first plot illustrating min(KW P value) in comparison with the multivariate analysis of variance (MANOVA) demonstrates a somewhat noisy relationship between the two whereby the min(KW P value) tends to be lower than the overall MANOVA P value, but the two are related. The comparison of the min(KW P value) versus the min(Wilcoxon rank-sum [WRS] P value) shows a more pronounced relationship and a clearer, non-linear pattern. All metrics of imbalance as determined are highly related; broadly, the more global tests (e.g., MANOVA) tended to be less conservative (i.e., have larger P value) than the corresponding more specific tests based on more than one comparison (e.g., WRS). The line y = x has been added for reference

References

    1. Campbell M, Donner A, Klar N. Developments in cluster randomized trials and Statistics in Medicine. Stat Med. 2007;26:2–19. doi: 10.1002/sim.2731.
    1. Turner Elizabeth L., Li Fan, Gallis John A., Prague Melanie, Murray David M. Review of Recent Methodological Developments in Group-Randomized Trials: Part 1—Design. American Journal of Public Health. 2017;107(6):907–915. doi: 10.2105/AJPH.2017.303706.
    1. Donner A, Klar N. Pitfalls of and controversies in cluster randomization trials. Am J Public Health. 2004;94:416–422. doi: 10.2105/AJPH.94.3.416.
    1. Ivers NM, Halperin IJ, Barnsley J, Grimshaw JM, Shah BR, Tu K, et al. Allocation techniques for balance at baseline in cluster randomized trials: a methodological review. Trials. 2012;13:120. doi: 10.1186/1745-6215-13-120.
    1. Moerbeek M, van Schie S. How large are the consequences of covariate imbalance in cluster randomized trials: a simulation study with a continuous outcome and a binary covariate at the cluster level. BMC Med Res Methodol. 2016;16:79. doi: 10.1186/s12874-016-0182-7.
    1. Raab GM, Butcher I. Balance in cluster randomized trials. Stat Med. 2001;20:351–365. doi: 10.1002/1097-0258(20010215)20:3<351::AID-SIM797>;2-C.
    1. Ciolino JD, Martin RH, Zhao W, Hill MD, Jauch EC, Palesch YY. Measuring continuous baseline covariate imbalances in clinical trial data. Stat Methods Med Res. 2015;24:255–272. doi: 10.1177/0962280211416038.
    1. Ciolino JD, Martin RH, Zhao W, Jauch EC, Hill MD, Palesch YY. Covariate imbalance and adjustment for logistic regression analysis of clinical trial data. J Biopharm Stat. 2013;23:1383–1402. doi: 10.1080/10543406.2013.834912.
    1. Zhao W, Hill MD, Palesch Y. Minimal sufficient balance—a new strategy to balance baseline covariates and preserve randomness of treatment allocation. Stat Methods Med Res. 2015;24:989–1002. doi: 10.1177/0962280212436447.
    1. de Hoop E, Teerenstra S, van Gaal BG, Moerbeek M, Borm GF. The “best balance” allocation led to optimal balance in cluster-controlled trials. J Clin Epidemiol. 2012;65:132–137. doi: 10.1016/j.jclinepi.2011.05.006.
    1. Li F, Lokhnygina Y, Murray DM, Heagerty PJ, DeLong ER. An evaluation of constrained randomization for the design and analysis of group-randomized trials. Stat Med. 2016;35:1565–1579. doi: 10.1002/sim.6813.
    1. Li F, Turner EL, Heagerty PJ, Murray DM, Vollmer WM, DeLong ER. An evaluation of constrained randomization for the design and analysis of group-randomized trials with binary outcomes. Stat Med. 2017;36:3791–3806. doi: 10.1002/sim.7410.
    1. Nietert PJ, Jenkins RG, Nemeth LS, Ornstein SM. An application of a modified constrained randomization process to a practice-based cluster randomized trial to improve colorectal cancer screening. Contemp Clin Trials. 2009;30:129–132. doi: 10.1016/j.cct.2008.10.002.
    1. Tandon SD, Leis JA, Mendelson T, Perry DF, Kemp K. Six-month outcomes from a randomized controlled trial to prevent perinatal depression in low-income home visiting clients. Matern Child Health J. 2014;18:873–881. doi: 10.1007/s10995-013-1313-y.
    1. Tandon SD, Perry DF, Mendelson T, Kemp K, Leis JA. Preventing perinatal depression in low-income home visiting clients: A randomized controlled trial. J Consult Clin Psychol. 2011;79:707. doi: 10.1037/a0024895.
    1. Cohen J. A power primer. Psychol Bull. 1992;112:155. doi: 10.1037/0033-2909.112.1.155.
    1. Turner EL, Prague M, Gallis JA, Li F, Murray DM. Review of recent methodological developments in group-randomized trials: part 2—analysis. Am J Public Health. 2017;107:1078–1086. doi: 10.2105/AJPH.2017.303707.
    1. Murray DM, Varnell SP, Blitstein JL. Design and analysis of group-randomized trials: a review of recent methodological developments. Am J Public Health. 2004;94:423–432. doi: 10.2105/AJPH.94.3.423.
    1. Raudenbush SW, Martinez A, Spybrook J. Strategies for improving precision in group-randomized experiments. Educ Eval Policy Anal. 2007;29:5–29. doi: 10.3102/0162373707299460.
    1. Altman DG, Royston P. The cost of dichotomising continuous variables. BMJ. 2006;332:1080. doi: 10.1136/bmj.332.7549.1080.

Source: PubMed

3
Subscribe