A Bayesian non-inferiority approach using experts' margin elicitation - application to the monitoring of safety events

Camille Aupiais, Corinne Alberti, Thomas Schmitz, Olivier Baud, Moreno Ursino, Sarah Zohar, Camille Aupiais, Corinne Alberti, Thomas Schmitz, Olivier Baud, Moreno Ursino, Sarah Zohar

Abstract

Background: When conducing Phase-III trial, regulatory agencies and investigators might want to get reliable information about rare but serious safety outcomes during the trial. Bayesian non-inferiority approaches have been developed, but commonly utilize historical placebo-controlled data to define the margin, depend on a single final analysis, and no recommendation is provided to define the prespecified decision threshold. In this study, we propose a non-inferiority Bayesian approach for sequential monitoring of rare dichotomous safety events incorporating experts' opinions on margins.

Methods: A Bayesian decision criterion was constructed to monitor four safety events during a non-inferiority trial conducted on pregnant women at risk for premature delivery. Based on experts' elicitation, margins were built using mixtures of beta distributions that preserve experts' variability. Non-informative and informative prior distributions and several decision thresholds were evaluated through an extensive sensitivity analysis. The parameters were selected in order to maintain two rates of misclassifications under prespecified rates, that is, trials that wrongly concluded an unacceptable excess in the experimental arm, or otherwise.

Results: The opinions of 44 experts were elicited about each event non-inferiority margins and its relative severity. In the illustrative trial, the maximal misclassification rates were adapted to events' severity. Using those maximal rates, several priors gave good results and one of them was retained for all events. Each event was associated with a specific decision threshold choice, allowing for the consideration of some differences in their prevalence, margins and severity. Our decision rule has been applied to a simulated dataset.

Conclusions: In settings where evidence is lacking and where some rare but serious safety events have to be monitored during non-inferiority trials, we propose a methodology that avoids an arbitrary margin choice and helps in the decision making at each interim analysis. This decision rule is parametrized to consider the rarity and the relative severity of the events and requires a strong collaboration between physicians and the trial statisticians for the benefit of all. This Bayesian approach could be applied as a complement to the frequentist analysis, so both Data Safety Monitoring Boards and investigators can benefit from such an approach.

Keywords: Bayesian inference; Children; Clinical trial; Elicitation; Mixture model; Non-inferiority.

Conflict of interest statement

The authors declared declare that they have no competing interests with respect to the research, authorship, and/or publication of this article.

Figures

Fig. 1
Fig. 1
General framework describing the two steps of the decision rule building. This figure summarizes the general framework, divided in two steps: 1 Fit margins from experts’ elicitation (Dj); 2 Sensitivity analysis to choose the prior and the decision thresholds
Fig. 2
Fig. 2
Histogram of the acceptable difference of severe intraventricular haemorrhage between arms, and mixtures of Beta distributions fitted from experts’ elicitation, through 3 different methods, with their criteria for goodness of fit. The histogram represents the acceptable difference of IVH among the E experts (dj,e). The 3 lines represent the fits of this difference (Dj), obtained through the 3 different methods. The legend gives the parameters of the fits and their criteria for goodness of fit
Fig. 3
Fig. 3
Plots of posterior class a and class b misclassifications according to the decision thresholds for each of the 13 pairs of priors for severe intraventricular haemorrhage. This figure represents the posterior rates of misclassifications for each pair of priors. Prior 1 is the non-informative prior, with α1,j=α0,j=β1,j=β0,j=1; Prior 2 to 13 are distinguished by (i) the means for the difference between the two arms: E(π1,jπ0,j)=0 for prior 2, 3, 4 and 5; E(π1,jπ0,j)=median(dj,e) for prior 6, 7, 8 and 9; and E(π1,jπ0,j)=π0,j for prior 10, 11, 12 and 13; (ii) their precision: 1 for prior 2, 6 and 10; 1/3 for prior 3, 7 and 11; 1/10 for prior 4, 8 and 12; and 1/20 for prior 5, 9 and 13. For each prior, the red solid line represents the number of posterior class a misclassifications (trials that conclude that the difference between arms is Unacceptable, while it is not true) at the final analysis, according to each final threshold. The blue solid line represents the number of posterior class b misclassifications (trials that conclude that the difference between arms is Acceptable, while it is not true)
Fig. 4
Fig. 4
Distribution of the successive conclusions and errors, obtained by applying the decision rule to the 5000 simulated trials, at each interim analysis and in overall, for severe intraventricular haemorrhage. The left part of the plot represents the conclusions at each interim analysis. The right part represents the overall count of conclusions among the 11 analyses. The upper part of the plot represents the trials with an Acceptable difference between arms: orange area correspond to trials that conclude that the difference between arms is Acceptable, while it is true; red area correspond to trials that conclude that the difference between arms is Unacceptable, while it is not true (class a misclassifications). The bottom part of the plot represents the trials with an Unacceptable difference between arms: green area correspond to trials that conclude that the difference between arms is Unacceptable, while it is true; blue area correspond to trials that conclude that the difference between arms is Acceptable, while it is not true (class b misclassifications)
Fig. 5
Fig. 5
Distribution of the overall conclusions and errors, obtained by applying the decision rule to the 5000 simulated trials, according to the event and the scenario. This plot presents the overall numbers or misclassifications obtained by applying this decision rule, according to the 5 scenario and to the 4 events. The left part of the plot represents the trials with an Acceptable difference between arms: orange area correspond to trials that conclude that the difference between arms is Acceptable, while it is true; red area correspond to trials that conclude that the difference between arms is Unacceptable, while it is not true (class a misclassifications). The right part of the plot represents the trials with an Unacceptable difference between arms: green area correspond to trials that conclude that the difference between arms is Unacceptable, while it is true; blue area correspond to trials that conclude that the difference between arms is Acceptable, while it is not true (class b misclassifications). IVH: Intraventricular haemorrhage; NEC: Necrotizing enterocolitis

References

    1. U S Department of Health and Human Services, Food and Drug Administration, Center for Drug Evaluation and Research (CDER), Center for Biologics Evaluation and Research (CBER). Non-Inferiority Clinical Trials to Establish Effectiveness - Guidance for Industry; 2016.
    1. Sorbello A, Komo S, Valappil T. Noninferiority Margin for Clinical Trials of Antibacterial Drugs for Nosocomial Pneumonia. Drug Inf J. 2010;44(2):165–76. doi: 10.1177/009286151004400208.
    1. Aupiais C, Zohar S, Taverny G, Le Roux E, Boulkedid R, Alberti C. Exploring how non-inferiority and equivalence are assessed in paediatrics: a systematic review. Arch Dis Child. 2018;103(11):1067–75.
    1. Bradburn MJ, Deeks JJ, Berlin JA, Russell Localio A. Much ado about nothing: a comparison of the performance of meta-analytical methods with rare events. Stat Med. 2007;26(1):53–77. doi: 10.1002/sim.2528.
    1. Gamalo-Siebers Margaret, Gao Aijun, Lakshminarayanan Mani, Liu Guanghan, Natanegara Fanni, Railkar Radha, Schmidli Heinz, Song Guochen. Bayesian methods for the design and analysis of noninferiority trials. Journal of Biopharmaceutical Statistics. 2015;26(5):823–841. doi: 10.1080/10543406.2015.1074920.
    1. Spiegelhalter DJ, Freedman LS, Parmar MKB. Bayesian Approaches to Randomized Trials. J R Stat Soc Ser A (Stat Soc) 1994;157(3):357–416. doi: 10.2307/2983527.
    1. Gamalo MA, Wu R, Tiwari RC. Bayesian approach to noninferiority trials for proportions. J Biopharm Stat. 2011;21(5):902–19. doi: 10.1080/10543406.2011.589646.
    1. Schmitz T, Alberti C, Ursino M, Baud O, Aupiais C, BETADOSE study group and the GROG(GroupedeRechercheenGynécologie Obstétrique) Full versus half dose of antenatal betamethasone to prevent severe neonatal respiratory distress syndrome associated with preterm birth: study protocol for a randomised, multicenter, double blind, placebo-controlled, non-inferiority trial (BETADOSE) BMC Pregnancy Childbirth. 2019;19(1):67. doi: 10.1186/s12884-019-2206-x.
    1. Liggins GC, Howie RN. A controlled trial of antepartum glucocorticoid treatment for prevention of the respiratory distress syndrome in premature infants. Pediatrics. 1972;50(4):515–25.
    1. Effect of corticosteroids for fetal maturation on perinatal outcomes. NIH Consens Statement. 1994; 12(2):1–24.
    1. ACOG committee opinion Antenatal corticosteroid therapy for fetal maturation. Number 210, October 1998 (Replaces Number 147, December 1994). Committee on Obstetric Practice. American College of Obstetricians and Gynecologists. Int J Gynaecol Obstet Off Organ Int Fed Gynaecol Obstet. 1999;64(3):334–5.
    1. Senat MV. Corticosteroid for fetal lung maturation: indication and treatment protocols. J Gynecol Obstet Biol Reprod. 2002;31(7 Suppl):5S105–113.
    1. Royal College of Obstetricians & Gynaecologists. Green-top Guideline No 7: Antenatal corticosteroids to reduce neonatal morbidity and mortality. London; 2010.
    1. Roberts D, Dalziel S. Antenatal corticosteroids for accelerating fetal lung maturation for women at risk of preterm birth. Cochrane Database Syst Rev. 2006;3:CD004454.
    1. Brownfoot FC, Gagliardi DI, Bain E, Middleton P, Crowther CA. Different corticosteroids and regimens for accelerating fetal lung maturation for women at risk of preterm birth. Cochrane Database Syst Rev. 2013;8:CD006764.
    1. Crowther A, McKinlay CJD, Middleton P, Harding JE. Repeat doses of prenatal corticosteroids for women at risk of preterm birth for improving neonatal health outcomes. Cochrane Database Syst Rev. 2015;7:CD003935.
    1. Ancel PY, Goffinet F, EPIPAGE-2 Writing Group, Kuhn P, Langer B, Matis J, et al.Survival and morbidity of preterm children born at 22 through 34 weeks’ gestation in France in 2011: results of the EPIPAGE-2 cohort study. JAMA Pediatr. 2015; 169(3):230–8.
    1. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing; 2017. .
    1. Grün B, Kosmidis I, Zeileis A. Extended Beta Regression in R: Shaken, Stirred, Mixed, and Partitioned. J Stat Softw. 2012;48(11):1–25. doi: 10.18637/jss.v048.i11.
    1. Stan Development Team. RStan: the R interface to Stan. 2018. R package version 2.11.1. .
    1. Carpenter B, Gelman A, Hoffman MD, Lee D, Goodrich B, Betancourt M, et al.Stan: A Probabilistic Programming Language. J Stat Softw. 2017; 76(1):1–32.
    1. Thall PF, Ursino. M. Baudouin V, Alberti C, Zohar S. Bayesian treatment comparison using parametric mixture priors computed from elicited histograms. Stat Methods Med Res. 2019;28(2):404–18. doi: 10.1177/0962280217726803.

Source: PubMed

3
Předplatit