Bounding the per-protocol effect in randomized trials: an application to colorectal cancer screening

Sonja A Swanson, Øyvind Holme, Magnus Løberg, Mette Kalager, Michael Bretthauer, Geir Hoff, Eline Aas, Miguel A Hernán, Sonja A Swanson, Øyvind Holme, Magnus Løberg, Mette Kalager, Michael Bretthauer, Geir Hoff, Eline Aas, Miguel A Hernán

Abstract

Background: The per-protocol effect is the effect that would have been observed in a randomized trial had everybody followed the protocol. Though obtaining a valid point estimate for the per-protocol effect requires assumptions that are unverifiable and often implausible, lower and upper bounds for the per-protocol effect may be estimated under more plausible assumptions. Strategies for obtaining bounds, known as "partial identification" methods, are especially promising in randomized trials.

Results: We estimated bounds for the per-protocol effect of colorectal cancer screening in the Norwegian Colorectal Cancer Prevention trial, a randomized trial of one-time sigmoidoscopy screening in 98,792 men and women aged 50-64 years. The screening was not available to the control arm, while approximately two thirds of individuals in the treatment arm attended the screening. Study outcomes included colorectal cancer incidence and mortality over 10 years of follow-up. Without any assumptions, the data alone provide little information about the size of the effect. Under the assumption that randomization had no effect on the outcome except through screening, a point estimate for the risk under no screening and bounds for the risk under screening are achievable. Thus, the 10-year risk difference for colorectal cancer was estimated to be at least -0.6 % but less than 37.0 %. Bounds for the risk difference for colorectal cancer mortality (-0.2 to 37.4 %) and all-cause mortality (-5.1 to 32.6 %) had similar widths. These bounds appear helpful in quantifying the maximum possible effectiveness, but cannot rule out harm. By making further assumptions about the effect in the subpopulation who would not attend screening regardless of their randomization arm, narrower bounds can be achieved.

Conclusions: Bounding the per-protocol effect under several sets of assumptions illuminates our reliance on unverifiable assumptions, highlights the range of effect sizes we are most confident in, and can sometimes demonstrate whether to expect certain subpopulations to receive more benefit or harm than others.

Trial registration: Clinicaltrials.gov identifier NCT00119912 (registered 6 July 2005).

Figures

Fig. 1
Fig. 1
Bounds for the per-protocol 10-year risk difference when restricting the maximum value of the risk under screening in the “never-takers”, aged 55–64 years. Gray area in nested plots indicates the area of detail presented in the outer plots
Fig. 2
Fig. 2
Age-standardized bounds (for ages 50–64) for the per-protocol 10-year risk difference under various sets of assumptions. Sets of assumptions include: a No assumptions. b The instrumental conditions (relevance, exclusion restriction, and exchangeability). c The instrumental conditions plus an assumed maximum risk under screening in the “never-takers” of 2 %, 1 %, and 40 % for the CRC incidence, CRC mortality, and all-cause mortality, respectively. d The instrumental conditions plus an assumed maximum risk under screening in the “never-takers” of 1.5 %, 0.75 %, and 30 % for the CRC incidence, CRC mortality, and all-cause mortality, respectively. e The instrumental conditions plus an assumed maximum risk under screening in the “never-takers” of 1 %, 0.5 %, and 20 % for the CRC incidence, CRC mortality, and all-cause mortality, respectively. f The instrumental conditions plus additive effect homogeneity. g The instrumental conditions plus multiplicative effect homogeneity. The dotted line indicates the intention-to-treat effect
Fig. 3
Fig. 3
Age-standardized bounds (for ages 50–64) for the per-protocol 10-year risk ratio under various sets of assumptions. Sets of assumptions include: a No assumptions. b The instrumental conditions (relevance, exclusion restriction, and exchangeability). c The instrumental conditions plus an assumed maximum risk under screening in the “never-takers” of 2 %, 1 %, and 40 % for the CRC incidence, CRC mortality, and all-cause mortality, respectively. d The instrumental conditions plus an assumed maximum risk under screening in the “never-takers” of 1.5 %, 0.75 %, and 30 % for the CRC incidence, CRC mortality, and all-cause mortality, respectively. e The instrumental conditions plus an assumed maximum risk under screening in the “never-takers” of 1 %, 0.5 %, and 20 % for the CRC incidence, CRC mortality, and all-cause mortality, respectively. f The instrumental conditions plus additive effect homogeneity. g The instrumental conditions plus multiplicative effect homogeneity. The dotted line indicates the intention-to-treat effect

References

    1. Hernán MA, Hernandez-Diaz S, Robins JM. Randomized trials analyzed as observational studies. Ann Intern Med. 2013;159(8):560–2.
    1. Hernán MA, Hernandez-Diaz S. Beyond the intention-to-treat in comparative effectiveness research. Clin Trials (London, England) 2012;9(1):48–55. doi: 10.1177/1740774511420743.
    1. Balke A, Pearl J. Bounds on treatment effects for studies with imperfect compliance. J Am Stat Assoc. 1997;92(439):1171–6. doi: 10.1080/01621459.1997.10474074.
    1. Hernan MA, Hernandez-Diaz S, Werler MM, Mitchell AA. Causal knowledge as a prerequisite for confounding evaluation: an application to birth defects epidemiology. Am J Epidemiol. 2002;155(2):176–84. doi: 10.1093/aje/155.2.176.
    1. Richardson T, Robins JM. Analysis of the binary instrumental variable model. In: Dechter R, Geffner H, Halpern JY, editors. Heuristics, probability, and causality: a tribute to Judea Pearl. London: College Publications; 2010: 415–44.
    1. Robins JM. The analysis of randomized and nonrandomized AIDS treatment trials using a new approach to causal inference in longitudinal studies. In: Sechrest L, Freeman H, Mulley A, editors. Health service research methodology: a focus on AIDS. Washington, DC: US Public Health Service; 1989. pp. 113–59.
    1. Robins JM. Correcting for non-compliance in randomized trials using structural nested mean models. Commun Stat. 1994;23:2379–412. doi: 10.1080/03610929408831393.
    1. Manski CF. Nonparametric bounds on treatment effects. Am Econ Rev. 1990;80(2):319-23
    1. Bretthauer M, Gondal G, Larsen K, Carlsen E, Eide TJ, Grotmol T, et al. Design, organization and management of a controlled population screening study for detection of colorectal neoplasia: attendance rates in the NORCCAP study (Norwegian Colorectal Cancer Prevention) Scand J Gastroenterol. 2002;37(5):568–73. doi: 10.1080/00365520252903125.
    1. Holme O, Loberg M, Kalager M, Bretthauer M, Hernán MA, Aas E, et al. Colorectal cancer incidence and mortality after flexible sigmoidoscopy screening – First population-based randomized trial. JAMA. 2014;312(6):606–615. doi: 10.1001/jama.2014.8266.
    1. Hoff G, Grotmol T, Skovlund E, Bretthauer M. Norwegian Colorectal Cancer Prevention Study G. Risk of colorectal cancer seven years after flexible sigmoidoscopy screening: randomised controlled trial. BMJ. 2009;338:b1846. doi: 10.1136/bmj.b1846.
    1. Richardson T, Robins JM. ACE bounds; SEMs with equilibrium conditions. Stat Sci. 2014;29(3):363-366.
    1. Frangakis CE, Rubin DB. Principal stratification in causal inference. Biometrics. 2002;58(1):21–9. doi: 10.1111/j.0006-341X.2002.00021.x.
    1. VanderWeele TJ. Principal stratification – uses and limitations. Int J Biostat. 2011;7(1):1–14. doi: 10.2202/1557-4679.1329.
    1. Angrist JD, Imbens GW, Rubin DB. Identification of causal effects using instrumental variables. J Am Stat Assoc. 1996;91(434):444–55. doi: 10.1080/01621459.1996.10476902.
    1. Hernán MA, Robins JM. Instruments for causal inference: an epidemiologist’s dream? Epidemiology (Cambridge, Mass) 2006;17(4):360–72. doi: 10.1097/01.ede.0000222409.00878.37.
    1. Swanson SA, Hernán MA. Think globally, act globally: an epidemiologist’s perspective on instrumental variable estimation. Stat Sci. 2014;29(3):371–4. doi: 10.1214/14-STS491.
    1. Pearl J. Imperfect experiments: bounding effects and counterfactuals. Causality. New York City: Cambridge University Press; 2009. pp. 259–81.
    1. Little RJ, D’Agostino R, Cohen ML, Dickersin K, Emerson SS, Farrar JT, et al. The prevention and treatment of missing data in clinical trials. New Engl J Med. 2012;367(14):1355–60. doi: 10.1056/NEJMsr1203730.
    1. Robins JM. Structural nested failure time models. In: Anderson PK, Keiding N, editors. The encyclopedia of biostatistics. Chichester, UK: Wiley; 1998. pp. 4372–89.
    1. Tamer E. Partial identification in econometrics. Annu Rev Econ. 2010;2(1):167–95. doi: 10.1146/annurev.economics.050708.143401.
    1. Chernozhukov V, Hong H, Tamer E. Estimation and confidence regions for parameter sets in econometric models. Econometrica. 2007;75(5):1243–84. doi: 10.1111/j.1468-0262.2007.00794.x.
    1. Horowitz JL, Manski CF. Nonparametric analysis of randomized experiments with missing covariate and outcome data. J Am Stat Assoc. 2000;95(449):77–84. doi: 10.1080/01621459.2000.10473902.
    1. Manski CF, Sandefur GD, McLanahan S, Powers D. Alternative estimates of the effect of family structure during adolescence on high school graduation. J Am Stat Assoc. 1992;87(417):25–37. doi: 10.1080/01621459.1992.10475171.
    1. Ramsahai R, Lauritzen S. Likelihood analysis of the binary instrumental variable model. Biometrika. 2011;98(4):987-994.
    1. Romano JP, Shaikh AM. Inference for identifiable parameters in partially identified econometric models. J Stat Plann Infer. 2008;138(9):2786–807. doi: 10.1016/j.jspi.2008.03.015.
    1. Imbens GW, Manski CF. Confidence intervals for partially identified parameters. Econometrica. 2004;72(6):1845-57.
    1. Vansteelandt S, Goetghebeur E, Kenward MG, Molenberghs G. Ignorance and uncertainty regions as inferential tools in a sensitivity analysis. 2006;16(3):953-979.

Source: PubMed

3
Abonnere