Restricted mean survival time: an alternative to the hazard ratio for the design and analysis of randomized trials with a time-to-event outcome

Patrick Royston, Mahesh K B Parmar, Patrick Royston, Mahesh K B Parmar

Abstract

Background: Designs and analyses of clinical trials with a time-to-event outcome almost invariably rely on the hazard ratio to estimate the treatment effect and implicitly, therefore, on the proportional hazards assumption. However, the results of some recent trials indicate that there is no guarantee that the assumption will hold. Here, we describe the use of the restricted mean survival time as a possible alternative tool in the design and analysis of these trials.

Methods: The restricted mean is a measure of average survival from time 0 to a specified time point, and may be estimated as the area under the survival curve up to that point. We consider the design of such trials according to a wide range of possible survival distributions in the control and research arm(s). The distributions are conveniently defined as piecewise exponential distributions and can be specified through piecewise constant hazards and time-fixed or time-dependent hazard ratios. Such designs can embody proportional or non-proportional hazards of the treatment effect.

Results: We demonstrate the use of restricted mean survival time and a test of the difference in restricted means as an alternative measure of treatment effect. We support the approach through the results of simulation studies and in real examples from several cancer trials. We illustrate the required sample size under proportional and non-proportional hazards, also the significance level and power of the proposed test. Values are compared with those from the standard approach which utilizes the logrank test.

Conclusions: We conclude that the hazard ratio cannot be recommended as a general measure of the treatment effect in a randomized controlled trial, nor is it always appropriate when designing a trial. Restricted mean survival time may provide a practical way forward and deserves greater attention.

Figures

Figure 1
Figure 1
Example of sample sizes as a function of the time horizon t for PH (solid lines) and non-PH (dashed lines) trial designs. The designs assume recruitment over K1 = 5 yr and follow-up over K2 = 3 yr.
Figure 2
Figure 2
Percent maturity (pmat) and power curves as a function of t for the RE04 trial. Vertical lines show tfinal∗.
Figure 3
Figure 3
Evolution over time (t) of z -statistics for RMST (truncated, solid lines; non-truncated, short dashed lines) and Cox (truncated, long dashed lines) tests in four randomized controlled trials in cancer.

References

    1. Royston P, Parmar MKB. The use of restricted mean survival time to estimate the treatment effect in randomized clinical trials when the proportional hazards assumption is in doubt. Stat Med. 2011;13:2409–2421. doi: 10.1002/sim.4274.
    1. Schemper M, Wakounig S, Heinze G. The estimation of average hazard ratios by weighted Cox regression. Stat Med. 2009;13:2473–2489. doi: 10.1002/sim.3623.
    1. Mok TS, Wu YL, Thongprasert S, Yang CH, Chu DT, Saijo N, Sunpaweravong P, Han B, Margono B, Ichinose Y, Nishiwaki Y, Ohe Y, Yang JJ, Chewaskulyong B, Jiang H, Duffield EL, Watkins CL, Armour AA, Fukuoka M. Gefitinib or carboplatin - paclitaxel in pulmonary adenocarcinoma. N Engl J Med. 2009;13:947–957. doi: 10.1056/NEJMoa0810699.
    1. Kristensen G, Perren T, Qian W, Pfisterer J, Ledermann JA, Joly F, Carey MS. Result of interim analysis of overall survival in the GCIG ICON7 phase III randomized trial of bevacizumab in women with newly diagnosed ovarian cancer. J Clin Oncol. 2011;13(S):LBA5006.
    1. Andersen PK, Perme MP. Pseudo-observations in survival analysis. Stat Methods Med Res. 2010;13:71–99. doi: 10.1177/0962280209105020.
    1. Barthel FMS, Babiker A, Royston P, Parmar MKB. Evaluation of sample size and power for multi-arm survival trials allowing for non-uniform accrual, non-proportional hazards, loss to follow-up and cross-over. Stat Med. 2006;13:2521–2542. doi: 10.1002/sim.2517.
    1. Irwin JO. The standard error of an estimate of expectation of life, with special reference to expectation of tumourless life in experiments with mice. J Hyg. 1949;13:188–189. doi: 10.1017/S0022172400014443.
    1. Barthel FMS, Royston P, Babiker A. A menu-driven facility for complex sample size calculation in randomized controlled trials with a survival or a binary outcome: update. Stat J. 2005;13:123–129.
    1. Rosner B. Fundamentals of Biostatistics. Belmont, Ca: Duxbury Press; 2006.
    1. Royston P, Parmar MKB. Flexible proportional-hazards and proportional-odds models for censored survival data, with application to prognostic modelling and estimation of treatment effects. Stat Med. 2002;13:2175–2197. doi: 10.1002/sim.1203.
    1. Royston P, Lambert PC. Flexible parametric survival analysis using Stata beyond the Cox model. College Station, Tx: Stata Press; 2011.
    1. Lambert PC, Royston P. Further development of flexible parametric models for survival analysis. Stat J. 2009;13:265–290.
    1. McGuire WP, Hoskins WJ, Brady MF, Kucera PR, Partridge EE, Look KY, Clarke-Pearson DL, Davidson M. Cyclophosphamide and cisplatin compared with paclitaxel and cisplatin in patients with stage III and stage IV ovarian cancer. N Engl J Med. 1996;13:1–6. doi: 10.1056/NEJM199601043340101.
    1. Leibovich BC, Blute ML, Cheville JC, Lohse CM, Frank I, Kwon ED, Weaver AL, Parker AS, Zincke H. Prediction Of progression after radical Nephrectomy for patients with clear cell renal cell carcinoma. Cancer. 2003;13:1663–1771. doi: 10.1002/cncr.11234.
    1. Gore ME, Griffin CL, Hancock B, Patel PM, Pyle L, Aitchison M, James N, Oliver RTD, Mardiak J, Hussain T, Sylvester R, Parmar MKB, Royston P, Mulders PFA. Interferon alfa-2a versus combination therapy with interferon alfa-2a, interleukin-2, and fluorouracil in patients with untreated metastatic renal cell carcinoma (MRC RE04 / EORTC GU 30012): an open-label randomised trial. Lancet. 2010;13:641–648. doi: 10.1016/S0140-6736(09)61921-8.
    1. Kitchener H, Swart AM, Qian W. Efficacy of systematic pelvic lymphadenectomy in endometrial cancer (MRC ASTEC trial): a randomised study. Lancet. 2009;13:125–136.
    1. International collaboration of trialists on behalf of the Medical Research Council Advanced Bladder Cancer Working Party. Neoadjuvant cisplatin, methotrexate and vinblastine chemotherapy for muscle-invasive bladder cancer: a randomised controlled trial. Lancet. 1999;13:533–540.
    1. Parmar MK, Ledermann JA, Colombo N. Paclitaxel plus platinum-based chemotherapy versus conventional platinum-based chemotherapy in women with relapsed ovarian cancer: the ICON4/AGO-OVAR-2.2 trial. Lancet. 2003;13:2099–2106.
    1. Medical Research Council Oesophageal Cancer Working Party. Surgical resection with or without preoperative chemotherapy in oesophageal cancer: a randomised controlled trial. Lancet. 2002;13:1727–1733.
    1. Gigerenzer G. Reckoning with risk. London, UK: Allen Lane; 2002.
    1. Royston P, Parmar MKB, Altman DG. Visualizing length of survival in time-to-event studies: a complement to Kaplan-Meier plots. J Natl Cancer Inst. 2008;13:92–97. doi: 10.1093/jnci/djm265.

Source: PubMed

3
Subscribe