Predicting intervention onset in the ICU with switching state space models

Marzyeh Ghassemi, Mike Wu, Michael C Hughes, Peter Szolovits, Finale Doshi-Velez, Marzyeh Ghassemi, Mike Wu, Michael C Hughes, Peter Szolovits, Finale Doshi-Velez

Abstract

The impact of many intensive care unit interventions has not been fully quantified, especially in heterogeneous patient populations. We train unsupervised switching state autoregressive models on vital signs from the public MIMIC-III database to capture patient movement between physiological states. We compare our learned states to static demographics and raw vital signs in the prediction of five ICU treatments: ventilation, vasopressor administra tion, and three transfusions. We show that our learned states, when combined with demographics and raw vital signs, improve prediction for most interventions even 4 or 8 hours ahead of onset. Our results are competitive with existing work while using a substantially larger and more diverse cohort of 36,050 patients. While custom classifiers can only target a specific clinical event, our model learns physiological states which can help with many interventions. Our robust patient state representations provide a path towards evidence-driven administration of clinical interventions.

Figures

Figure 1:
Figure 1:
Illustration of data processing pipeline. (1) We extract vital signs and lab results (xn) are extracted from the database for a filtered selection of patients. (2) A switching-state autoregressive model is used the model the time series, generating belief states bn (the probability of each state at each time). (3) Static features are extracted for all patients (sn) - these are based on admission data and do not change over the course of the subject’s stay. (4) Given three possible sets of features for each timestep t and patient n - sn, xnt, and bnt - we train a classifier to predict the per-timestep outcome of interest ynt (e.g. vasopressor administration). Our system predicts the outcome ynt using features from either the immediately previous timestep fn,(t–1), or some further delay fn,(t–d).
Figure 2:
Figure 2:
AUC scores for different features predicting the onset of each intervention at a delay of d ∈ {1, 2, 4, 8} hours ahead of the current timestep. Features: Each bar color denotes one feature or feature concatenation: static observa tions s (10 dimensions using one-hot encoding), dynamic time-series observations x (18 dimensions), and belief state vectors b (K = 10 dimensions) from the switching state model in Eq. (2). Interventions: fresh-frozen-plasma transfu sion (ffp), platelet transfusion, red-blood-cell (rbc) transfusion, vasopressor administration, and ventilator intubation.
Figure 3:
Figure 3:
Learned classifier weights for each belief state under each separate intervention task, using fixed delay of 1 hour. The learned set of K = 10 hidden states is indexed by an integer from 0, 1, … 9. Large weight values indicate a state’s presence will cause the logistic regression classifier to raise the probability of the intervention.
Figure 4:
Figure 4:
Average value of dynamic features xnt assigned to timesteps strongly associated with state index 9. Values are z-score standardized per variable. State 9 had very low observed spo2 and bicarbonate levels as compared to other states. Lactate levels were the highest observed across all states - no other state had significant positive lactate z-scores.

References

    1. Arbus G S, Hebert L A, Levesque P R, Etsten B E, Schwartz W B. Characterization and clinical application of the significance band for acute respiratory alkalosis. New England Journal of Medicine. 1969;280(3):117–123.
    1. Beal M J. Variational algorithms for approximate Bayesian inference. PhD thesis, University of London; 2003.
    1. Caballero Barajas K L, Akella R. Dynamically modeling patient’s health state from electronic medical records: A time series approach. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2015
    1. Che Z, Kale D, Li W, Bahadori M T, Liu Y. Deep computational phenotyping. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2015
    1. D’Aragon F, Belley Cote E P, Meade M O, et al. Blood pressure targets for vasopressor therapy: A systematic review. Shock. 2015;43(6):530–539.
    1. Fialho A, Celi L, Cismondi F, Vieira S, Reti S, Sousa J, Finkelstein S, et al. Disease-based modeling to predict fluid response in intensive care units. Methods Inf Med. 2013;52(6):494–502.
    1. Fox E B. Bayesian Nonparametric Learning of Complex Dynamical Phenomena. PhD thesis, Massachusetts Institute of Technology; 2009.
    1. Ghahramani Z. An introduction to hidden Markov models and Bayesian networks. International Journal of Pattern Recognition and Artificial Intelligence. 2001;15(1):9–42.
    1. Ghassemi M, Naumann T, Doshi-Velez F, Brimmer N, Joshi R, Rumshisky A, Szolovits P. Unfolding physiological state: Mortality modelling in intensive care units. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2014
    1. Ghassemi M, Pimentel M A, Naumann T, Brennan T, Clifton D A, Szolovits P, Feng M. A multivariate timeseries modeling approach to severity of illness assessment and forecasting in ICU with sparse, heterogeneous clinical data. Proc Twenty-Ninth AAAI Conf on Artificial Intelligence. 2015
    1. Henry K E, Hager D N, Pronovost P J, Saria S. A targeted real-time early warning score (TREWScore) for septic shock. Science Translational Medicine. 2015;7(299):299ra122.
    1. Holcomb J B, Tilley B C, Baraniuk S, Fox E E, Wade C E, Podbielski J M, et al. Transfusion of plasma, platelets, and red blood cells in a 1:1:1 vs a 1:1:2 ratio and mortality in patients with severe trauma: the PROPPR randomized clinical trial. JAMA. 2015;313(5):471–482.
    1. Hug C W, Szolovits P. ICU acuity: real-time models versus daily models. AMIA Annual Symposium Proceedings. 2009
    1. Johnson A E, Pollard T J, Shen L, Lehman L H, Feng M, Ghassemi M, Moody B, Szolovits P, Celi L A, Mark R G. MIMIC-III, a freely accessible critical care database. Scientific Data. 2016;3
    1. Joshi R, Szolovits P. Prognostic physiology: modeling patient severity in intensive care units using radial domain folding. AMIA Annual Symposium Proceedings. 2012
    1. Karkouti K, Cohen M M, McCluskey S A, Sher G D. A multivariable model for predicting the need for blood transfusion in patients undergoing first-time elective coronary bypass graft surgery. Transfusion. 2001;41(10):1193–1203.
    1. Knaus W A, Wagner D, Draper E, Zimmerman J, et al. The APACHE III prognostic system. risk prediction of hospital mortality for critically ill hospitalized adults. Chest. 1991;100(6):1619–1636.
    1. Krishnan R G, Shalit U, Sontag D. Deep Kalman Filters. arXiv preprint arXiv:1511.05121. 2015
    1. Le Gall J, Lemeshow S, Saulnier F. A new simplified acute physiology score (SAPS II) based on a European/North American multicenter study. JAMA. 1993;270(24):2957–2963.
    1. Lee J, Mark R G. An investigation of patterns in hemodynamic data indicative of impending hypotension in intensive care. Biomedical engineering online. 2010;9(1):62.
    1. Lehman L, Adams R, Mayaud L, Moody G, Malhotra A, Mark R, Nemati S. A physiological time series dynamics-based approach to patient monitoring and outcome prediction. IEEE journal of biomedical and health informatics. 2015;19(3):1068.
    1. Lin J, Keogh E, Wei L, Lonardi S. Experiencing SAX: a novel symbolic representation of time series. Data Mining and Knowledge Discovery. 2007;15(2):107–144.
    1. Marik P, Corwin H. Efficacy of red blood cell transfusion in the critically ill: a systematic review of the literature. Critical care medicine. 2008;36(9):2667–2674.
    1. Müllner M, Urbanek B, Havel C, Losert H, Gamper G, Herkner H. Vasopressors for shock. The Cochrane Library. 2004
    1. Murad M H, Stubbs J R, Gandhi M J, Wang A T, Paul A, Erwin P J, Montori V M, Roback J D. The effect of plasma transfusion on morbidity and mortality: a systematic review and meta-analysis. Transfusion. 2010;50(6):1370–1383.
    1. Nichol A, Bailey M, Egi M, Pettila V, French C, Stachowski E, Reade M C, Cooper D J, Bellomo R. Dynamic lactate indices as predictors of outcome in critically ill patients. Critical Care. 2011;15(5):1.
    1. Nichol A D, Egi M, Pettila V, Bellomo R, French C, Hart G, Davies A, Stachowski E, Reade M C, Bailey M, et al. Relative hyperlactatemia and hospital mortality in critically ill patients: a retrospective multi centre study. Critical care. 2010;14(1):1.
    1. Ospina Tascón G A, Bóchele G L, Vincent J.-L. Multicenter, randomized, controlled trials evaluating mortality in intensive care: Doomed to fail? Critical care medicine. 2008;36(4):1311–1322.
    1. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, et al. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research. 2011;12(Oct)
    1. Quinn J, Williams C K, McIntosh N, et al. Factorial switching linear dynamical systems applied to physiolog ical condition monitoring. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2009;31(9):1537–1551.
    1. Quintana J M, West M. An analysis of international exchange rates using multivariate DLM’s. The Statis tician. 1987;36:275–281.
    1. Rabiner L R, Juang B.-H. An introduction to hidden Markov models. ASSP Magazine, IEEE. 1986;3(1):4–16.
    1. Raper J D, Wang H E. Urine output changes during postcardiac arrest therapeutic hypothermia. Therapeutic hypothermia and temperature management. 2013;3(4):173–177.
    1. Salgado C M, Vieira S M, Mendonça L F, Finkelstein S, Sousa J M. Ensemble fuzzy models in person alized medicine: Application to vasopressors administration. Engineering Applications ofArtificial Intelligence. 2016;49:141–148.
    1. Scott S L. Bayesian methods for hidden Markov models: Recursive computing in the 21st century. Journal of the American Statistical Association. 2002;97(457):337–351.
    1. Slichter S J, Kaufman R M, Assmann S F, McCullough J, Triulzi D J, et al. Dose of prophylactic platelet transfusions and prevention of hemorrhage. New England Journal of Medicine. 2010;362(7):600–613.
    1. Stanworth S, Brunskill S, Hyde C, McClelland D, Murphy M. Is fresh frozen plasma clinically effective? a systematic review of randomized controlled trials. British journal of haematology. 2004;126(1):139–152.
    1. Tobin M J, editor. Principles and practice of mechanical ventilation. McGraw-Hill Medical Pub. Division; 2006.
    1. Vincent J.-L. Critical care-where have we been and where are we going? Critical Care. 2013;17(1):1.
    1. Vincent J.-L, Moreno R, Takala J, Willatts S, De Mendonça A, Bruining H, Reinhart C, Suter P, Thijs L. The SOFA (Sepsis-related Organ Failure Assessment) score to describe organ dysfunction/failure. Intensive care medicine. 1996;22(7):707–710.
    1. Vozarova B, Weyer C, Lindsay R S, Pratley R E, et al. High white blood cell count is associated with a worsening of insulin sensitivity and predicts the development of type 2 diabetes. Diabetes. 2002;51(2):455–461.
    1. Wainwright M J, Jordan M I. Graphical models, exponential families, and variational inference. Founda tions and Trends R in Machine Learning. 2008;1(1-2):1–305.
    1. Walczak S. Artificial neural network medical decision support tool: predicting transfusion requirements of er patients. IEEE Transactions on Information Technology in Biomedicine. 2005;9(3):468–474.
    1. Wu M, Ghassemi M, Feng M, Celi L, Szolovits P, Doshi-Velez F. Understanding vasopressor intervention and weaning: Risk prediction in a public heterogeneous clinical time series database. Journal of the American Medical Informatics Association. 2017
    1. Yang K L, Tobin M J. A prospective study of indexes predicting the outcome of trials of weaning from mechanical ventilation. New England Journal of Medicine. 1991;324(21):1445–1450.

Source: PubMed

3
Abonnere