Head-to-head comparison of 14 prediction models for postoperative delirium in elderly non-ICU patients: an external validation study

Chung Kwan Wong, Barbara C van Munster, Athanasios Hatseras, Else Huis In 't Veld, Barbara L van Leeuwen, Sophia E de Rooij, Rick G Pleijhuis, Chung Kwan Wong, Barbara C van Munster, Athanasios Hatseras, Else Huis In 't Veld, Barbara L van Leeuwen, Sophia E de Rooij, Rick G Pleijhuis

Abstract

Objectives: Delirium is associated with increased morbidity, mortality, prolonged hospitalisation and increased healthcare costs. The number of clinical prediction models (CPM) to predict postoperative delirium has increased exponentially. Our goal is to perform a head-to-head comparison of CPMs predicting postoperative delirium in non-intensive care unit (non-ICU) elderly patients to identify the best performing models.

Setting: Single-site university hospital.

Design: Secondary analysis of prospective cohort study.

Participants and inclusion: CPMs published within the timeframe of 1 January 1990 to 1 May 2020 were checked for eligibility (Preferred Reporting Items for Systematic Reviews and Meta-Analyses). For the time period of 1 January 1990 to 1 January 2017, included CPMs were identified in systematic reviews based on prespecified inclusion and exclusion criteria. An extended literature search for original studies was performed independently by two authors, including CPMs published between 1 January 2017 and 1 May 2020. External validation was performed using a surgical cohort consisting of 292 elderly non-ICU patients.

Primary outcome measures: Discrimination, calibration and clinical usefulness.

Results: 14 CPMs were eligible for analysis out of 366 full texts reviewed. External validation was previously published for 8/14 (57%) CPMs. C-indices ranged from 0.52 to 0.74, intercepts from -0.02 to 0.34, slopes from -0.74 to 1.96 and scaled Brier from -1.29 to 0.088. Based on predefined criteria, the two best performing models were those of Dai et al (c-index: 0.739; (95% CI: 0.664 to 0.813); intercept: -0.018; slope: 1.96; scaled Brier: 0.049) and Litaker et al (c-index: 0.706 (95% CI: 0.590 to 0.823); intercept: -0.015; slope: 0.995; scaled Brier: 0.088). For the remaining CPMs, model discrimination was considered poor with corresponding c-indices <0.70.

Conclusion: Our head-to-head analysis identified 2 out of 14 CPMs as best-performing models with a fair discrimination and acceptable calibration. Based on our findings, these models might assist physicians in postoperative delirium risk estimation and patient selection for preventive measures.

Keywords: delirium & cognitive disorders; geriatric medicine; internal medicine; risk management; surgery.

Conflict of interest statement

Competing interests: None declared.

© Author(s) (or their employer(s)) 2022. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.

Figures

Figure 1
Figure 1
Flow chart indicating the selection process of included delirium prediction models.
Figure 2
Figure 2
Head-to-head comparison of discriminative power of delirium prediction models. Discriminative power of externally validated delirium prediction models is reported as c-indices with associated 95% CIs, ranked from low to high. A c-index of 0.5 resembles a situation in which the model has no discriminative power, that is, the model predicts no better than flipping a coin. Only 2 out of 14 validated models showed fair discrimination with c-indices >0.70 (0.71 and 0.74 for the models developed by Litaker et al and Dai et al, respectively) and 95% CIs with lower bounds >0.50. Discriminative power of the remaining 12 models was considered poor.
Figure 3
Figure 3
Discrimination, calibration and clinical utility of best performing models. Panels A and B show the receiver operating characteristic (ROC) curve of the delirium prediction models by Litaker et al and Dai et al, respectively, with the area under the ROC curve (c-index) indicating the discriminative power of the model. A graphical representation of the calibration of both models is shown in panels C and D, plotting the predicted probability (x-axis) with corresponding 95% CI against the actually observed occurrence of delirium in the validation cohort (y-axis). The model by Litaker et al showed adequate calibration (panel C), correctly differentiating patients at low risk of delirium (20%). The model by Dai et al correctly identified patients at low risk (20%). Panels E and F show decision curve analyses as a measure of clinical utility of both models. For the models by Litaker et al and Dai et al, a positive net benefit was observed in the 10%–35% threshold probability range (panel E) and the 5%–20% threshold probability range (panel F), respectively.

References

    1. Haley MN, Casey P, Kane RY, et al. . Delirium management: let's get physical? A systematic review and meta-analysis. Australas J Ageing 2019;38:231–41. 10.1111/ajag.12636
    1. Maldonado JR. Acute brain failure: pathophysiology, diagnosis, management, and sequelae of delirium. Crit Care Clin 2017;33:461–519. 10.1016/j.ccc.2017.03.013
    1. Schenning KJ, Deiner SG. Postoperative delirium in the geriatric patient. Anesthesiol Clin 2015;33:505–16. 10.1016/j.anclin.2015.05.007
    1. Inouye SK, Bogardus ST, Charpentier PA, et al. . A multicomponent intervention to prevent delirium in hospitalized older patients. N Engl J Med 1999;340:669–76. 10.1056/NEJM199903043400901
    1. Marcantonio ER, Flacker JM, Wright RJ, et al. . Reducing delirium after hip fracture: a randomized trial. J Am Geriatr Soc 2001;49:516–22. 10.1046/j.1532-5415.2001.49108.x
    1. Salvi F, Young J, Lucarelli M, et al. . Non-pharmacological approaches in the prevention of delirium. Eur Geriatr Med 2020;11:71–81. 10.1007/s41999-019-00260-7
    1. Adibi A, Sadatsafavi M, Ioannidis JPA. Validation and utility testing of clinical prediction models: time to change the approach. JAMA 2020;324:235-236. 10.1001/jama.2020.1230
    1. Collins GS, Reitsma JB, Altman DG, et al. . Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ 2014;350:g7594. 10.1136/bmj.g7594
    1. Jansen CJ, Absalom AR, de Bock GH, et al. . Performance and agreement of risk stratification instruments for postoperative delirium in persons aged 50 years or older. PLoS One 2014;9:e113946. 10.1371/journal.pone.0113946
    1. Steyerberg EW, Vergouwe Y. Towards better clinical prediction models: seven steps for development and an ABCD for validation. Eur Heart J 2014;35:1925–31. 10.1093/eurheartj/ehu207
    1. Hosmer DW, Lemeshow S, Sturdivant RX. Applied logistic regression. New York: John Wiley & Sons, 2013.
    1. Steyerberg EW, Vickers AJ, Cook NR, et al. . Assessing the performance of prediction models: a framework for traditional and novel measures. Epidemiology 2010;21:128–38. 10.1097/EDE.0b013e3181c30fb2
    1. Stevens RJ, Poppe KK. Validation of clinical prediction models: what does the "calibration slope" really measure? J Clin Epidemiol 2020;118:93–9. 10.1016/j.jclinepi.2019.09.016
    1. Rufibach K. Use of Brier score to assess binary predictions. J Clin Epidemiol 2010;63:938–9. 10.1016/j.jclinepi.2009.11.009
    1. Vickers AJ, Elkin EB. Decision curve analysis: a novel method for evaluating prediction models. Med Decis Making 2006;26:565–74. 10.1177/0272989X06295361
    1. Vickers AJ, van Calster B, Steyerberg EW. A simple, step-by-step guide to interpreting decision curve analysis. Diagn Progn Res 2019;3:18. 10.1186/s41512-019-0064-7
    1. Harrell FE. Regression modeling strategies: with applications to linear models, logistic regression, and survival analysis. New York, NY: Springer, 2001.
    1. Peek N, Arts DGT, Bosman RJ, et al. . External validation of prognostic models for critically ill patients required substantial sample sizes. J Clin Epidemiol 2007;60:491.e1–491.e13. 10.1016/j.jclinepi.2006.08.011
    1. van Steenbeek CD, van Maaren MC, Siesling S, et al. . Facilitating validation of prediction models: a comparison of manual and semi-automated validation using registry-based data of breast cancer patients in the Netherlands. BMC Med Res Methodol 2019;19:117. 10.1186/s12874-019-0761-5
    1. Tripod checklist: prediction model development. Available: [Accessed 24 Feb 2021].
    1. van Meenen LCC, van Meenen DMP, de Rooij SE, et al. . Risk prediction models for postoperative delirium: a systematic review and meta-analysis. J Am Geriatr Soc 2014;62:2383–90. 10.1111/jgs.13138
    1. Kalimisetty S, Askar W, Fay B, et al. . Models for predicting incident delirium in hospitalized older adults: a systematic review. J Patient Cent Res Rev 2017;4:69–77. 10.17294/2330-0698.1414
    1. Lindroth H, Bratzke L, Purvis S, et al. . Systematic review of prediction models for delirium in the older adult inpatient. BMJ Open 2018;8:e019223–019223. 10.1136/bmjopen-2017-019223
    1. Pompei P, Foreman M, Rudberg MA, et al. . Delirium in hospitalized older persons: outcomes and predictors. J Am Geriatr Soc 1994;42:809–15. 10.1111/j.1532-5415.1994.tb06551.x
    1. Rudolph JL, Jones RN, Levkoff SE, et al. . Derivation and validation of a preoperative prediction rule for delirium after cardiac surgery. Circulation 2009;119:229–36. 10.1161/CIRCULATIONAHA.108.795260
    1. Rudolph JL, Doherty K, Kelly B, et al. . Validation of a delirium risk assessment using electronic medical record information. J Am Med Dir Assoc 2016;17:244–8. 10.1016/j.jamda.2015.10.020
    1. Carrasco MP, Villarroel L, Andrade M, et al. . Development and validation of a delirium predictive score in older people. Age Ageing 2014;43:346–51. 10.1093/ageing/aft141
    1. Kim MY, Park UJ, Kim HT, et al. . Delirium prediction based on hospital information (Delphi) in general surgery patients. Medicine 2016;95:e3072. 10.1097/MD.0000000000003072
    1. de Wit HAJM, Winkens B, Mestres Gonzalvo C, et al. . The development of an automated ward independent delirium risk prediction model. Int J Clin Pharm 2016;38:915–23. 10.1007/s11096-016-0312-7
    1. Pendlebury ST, Lovett NG, Smith SC, et al. . Delirium risk stratification in consecutive unselected admissions to acute medicine: validation of a susceptibility score based on factors identified externally in pooled data for use at entry to the acute care pathway. Age Ageing 2017;46:226–31. 10.1093/ageing/afw198
    1. Halladay CW, Sillner AY, Rudolph JL. Performance of electronic prediction rules for prevalent delirium at hospital admission. JAMA Netw Open 2018;1:e181405. 10.1001/jamanetworkopen.2018.1405
    1. Ten Broeke M, Koster S, Konings T, et al. . Can we predict a delirium after cardiac surgery? A validation study of a delirium risk checklist. Eur J Cardiovasc Nurs 2018;17:255–61. 10.1177/1474515117733365
    1. Zhang X, Tong D-K, Ji F, et al. . Predictive nomogram for postoperative delirium in elderly patients with a hip fracture. Injury 2019;50:392–7. 10.1016/j.injury.2018.10.034
    1. Dai YT, Lou MF, Yip PK, et al. . Risk factors and incidence of postoperative delirium in elderly Chinese patients. Gerontology 2000;46:28–35. 10.1159/000022130
    1. Litaker D, Locala J, Franco K, et al. . Preoperative risk factors for postoperative delirium. Gen Hosp Psychiatry 2001;23:84–9. 10.1016/S0163-8343(01)00117-7
    1. Wang C-G, Qin Y-F, Wan X, et al. . Incidence and risk factors of postoperative delirium in the elderly patients with hip fracture. J Orthop Surg Res 2018;13:186. 10.1186/s13018-018-0897-8
    1. Raats JW, van Eijsden WA, Crolla RMPH, et al. . Risk factors and outcomes for postoperative delirium after major surgery in elderly patients. PLoS One 2015;10:e0136071. 10.1371/journal.pone.0136071
    1. Setters B, Solberg LM. Delirium. Prim Care 2017;44:541–59. 10.1016/j.pop.2017.04.010
    1. Chen Z, Chen R, Zheng D, et al. . Efficacy and safety of haloperidol for delirium prevention in adult patients: an updated meta-analysis with trial sequential analysis of randomized controlled trials. J Clin Anesth 2020;61:109623. 10.1016/j.jclinane.2019.09.017
    1. Van Calster B, McLernon DJ, van Smeden M, et al. . Calibration: the Achilles heel of predictive analytics. BMC Med 2019;17:230–7. 10.1186/s12916-019-1466-7
    1. Bouwmeester W, Zuithoff NPA, Mallett S, et al. . Reporting and methods in clinical prediction research: a systematic review. PLoS Med 2012;9:1001221. 10.1371/journal.pmed.1001221
    1. Collins GS, de Groot JA, Dutton S, et al. . External validation of multivariable prediction models: a systematic review of methodological conduct and reporting. BMC Med Res Methodol 2014;14:40. 10.1186/1471-2288-14-40
    1. Collins GS, Omar O, Shanyinde M, et al. . A systematic review finds prediction models for chronic kidney disease were poorly reported and often developed using inappropriate methods. J Clin Epidemiol 2013;66:268–77. 10.1016/j.jclinepi.2012.06.020
    1. Moons KGM, Altman DG, Reitsma JB, et al. . Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med 2015;162:W1–73. 10.7326/M14-0698
    1. Ettema R, Heim N, Hamaker M, et al. . Validity of a screening method for delirium risk in older patients admitted to a general Hospital in the Netherlands. Gen Hosp Psychiatry 2018;55:44–50. 10.1016/j.genhosppsych.2018.09.004
    1. Freter SH, Dunbar MJ, MacLeod H, et al. . Predicting post-operative delirium in elective orthopaedic patients: the delirium elderly at-risk (dear) instrument. Age Ageing 2005;34:169–71. 10.1093/ageing/afh245

Source: PubMed

3
Sottoscrivi