Predictive modeling of treatment resistant depression using data from STAR*D and an independent clinical study

Zhi Nie, Srinivasan Vairavan, Vaibhav A Narayan, Jieping Ye, Qingqin S Li, Zhi Nie, Srinivasan Vairavan, Vaibhav A Narayan, Jieping Ye, Qingqin S Li

Abstract

Identification of risk factors of treatment resistance may be useful to guide treatment selection, avoid inefficient trial-and-error, and improve major depressive disorder (MDD) care. We extended the work in predictive modeling of treatment resistant depression (TRD) via partition of the data from the Sequenced Treatment Alternatives to Relieve Depression (STAR*D) cohort into a training and a testing dataset. We also included data from a small yet completely independent cohort RIS-INT-93 as an external test dataset. We used features from enrollment and level 1 treatment (up to week 2 response only) of STAR*D to explore the feature space comprehensively and applied machine learning methods to model TRD outcome at level 2. For TRD defined using QIDS-C16 remission criteria, multiple machine learning models were internally cross-validated in the STAR*D training dataset and externally validated in both the STAR*D testing dataset and RIS-INT-93 independent dataset with an area under the receiver operating characteristic curve (AUC) of 0.70-0.78 and 0.72-0.77, respectively. The upper bound for the AUC achievable with the full set of features could be as high as 0.78 in the STAR*D testing dataset. Model developed using top 30 features identified using feature selection technique (k-means clustering followed by χ2 test) achieved an AUC of 0.77 in the STAR*D testing dataset. In addition, the model developed using overlapping features between STAR*D and RIS-INT-93, achieved an AUC of > 0.70 in both the STAR*D testing and RIS-INT-93 datasets. Among all the features explored in STAR*D and RIS-INT-93 datasets, the most important feature was early or initial treatment response or symptom severity at week 2. These results indicate that prediction of TRD prior to undergoing a second round of antidepressant treatment could be feasible even in the absence of biomarker data.

Trial registration: ClinicalTrials.gov NCT00021528.

Conflict of interest statement

Drs. Vairavan, Narayan and Li are employees of Janssen Research & Development, LLC. Drs. Vairavan, Narayan and Li may be shareholders in Johnson & Johnson, which is the parent company of the Janssen companies. Drs. Vairavan, Narayan and Li declare that, except for income received from their primary employer, no financial support or compensation has been received from any individual or corporate entity over the past three years for research or professional service, and there are no personal financial holdings that could be perceived as constituting a potential conflict of interest. Dr. Ye reports past consultancy relationship with Janssen Research & Development, LLC. Mr. Nie reports no biomedical financial interests or potential conflicts of interest and can now be contacted at Samsung Electronics Co., Ltd. This study was funded by Janssen Research & Development, LLC, Titusville, NJ, USA, and by a research funding from Janssen Research & Development, LLC to University of Michigan for the collaboration with Dr. Ye’s laboratory. This manuscript reflects the views of the authors and may not reflect the opinions or views of the STAR*D Study Investigators or the funders. This does not alter our adherence to PLOS ONE policies on sharing data and materials.

Figures

Fig 1. Schematic diagram showing the predictive…
Fig 1. Schematic diagram showing the predictive modeling process.
The STAR*D training dataset was used to create 30 subsamples with equal ratios of cases and controls and 30 models were constructed using the entire training dataset. The 30 models were used to predict the outcome for the independent STAR*D test dataset and RIS-INT-93 dataset and the predicted outcome were average across 30 models.
Fig 2. AUC for models containing top…
Fig 2. AUC for models containing top 2 to 75 representative predictors from k-means cluster (k = 75) was plotted against the number of predictors for each of the machine learning methods in the STAR*D training and test datasets, respectively.
Remission status was used to define TRD using QIDS-C16 data.
Fig 3. Receiver operating characteristic curves in…
Fig 3. Receiver operating characteristic curves in training and test dataset (STAR*D) using the full set of features, top n features (n ~ 30), and the overlapping features where remission status was used to define TRD (STAR*D remission status was defined using QIDS-C16 data, and RIS-INT-93 remission status was defined using HAM-D17).
Fig 4. Permutation process to access the…
Fig 4. Permutation process to access the model robustness.
The outcome label of the STAR*D training dataset was randomly shuffled 1,000 times and the AUC distribution of the 1,000 null models were plotted for each machine learning machine method (A) XGBoost, (B) Random Forest, (C) l2 penalized logistic regression, and (D) GBDT. In all cases, the observed AUC out-perform the random noise from the 1,000 null models.
Fig 5
Fig 5
variable of importance in statistical learning approaches for outcomes defined by (A) remission status (B) responder status. In both cases, the outcomes were defined using QIDS-C16. SFHS: Short Form Health Survey (SF-12); WSAS: The Work and Social Adjustment Scale; *from PRISE: The Patient Rated Inventory of Side Effects, which collected symptoms one had experienced in the past week. Those symptoms may or may not have been caused by the treatment.

References

    1. Berlim MT, Turecki G. What is the meaning of treatment resistant/refractory major depression (TRD)? A systematic review of current randomized trials. European neuropsychopharmacology: the journal of the European College of Neuropsychopharmacology. 2007;17(11):696–707. doi: .
    1. Ruhe HG, van Rooijen G, Spijker J, Peeters FP, Schene AH. Staging methods for treatment resistant depression. A systematic review. Journal of affective disorders. 2012;137(1–3):35–45. doi: .
    1. Peeters FP, Ruhe HG, Wichers M, Abidi L, Kaub K, van der Lande HJ, et al. The Dutch Measure for quantification of Treatment Resistance in Depression (DM-TRD): an extension of the Maudsley Staging Method. Journal of affective disorders. 2016;205:365–71. doi: .
    1. Rush AJ, Trivedi MH, Wisniewski SR, Nierenberg AA, Stewart JW, Warden D, et al. Acute and longer-term outcomes in depressed outpatients requiring one or several treatment steps: a STAR*D report. The American journal of psychiatry. 2006;163(11):1905–17. doi: .
    1. Bennabi D, Aouizerate B, El-Hage W, Doumy O, Moliere F, Courtet P, et al. Risk factors for treatment resistance in unipolar depression: A systematic review. Journal of affective disorders. 2014;171C:137–41. doi: .
    1. Souery D, Oswald P, Massat I, Bailer U, Bollen J, Demyttenaere K, et al. Clinical factors associated with treatment resistance in major depressive disorder: results from a European multicenter study. The Journal of clinical psychiatry. 2007;68(7):1062–70. .
    1. Dudek D, Rybakowski JK, Siwek M, Pawlowski T, Lojko D, Roczen R, et al. Risk factors of treatment resistance in major depression: association with bipolarity. Journal of affective disorders. 2010;126(1–2):268–71. doi: .
    1. Takahashi M, Shirayama Y, Muneoka K, Suzuki M, Sato K, Hashimoto K. Personality traits as risk factors for treatment-resistant depression. PloS one. 2013;8(5):e63756 doi: ; PubMed Central PMCID: PMC3661718.
    1. Takahashi M, Shirayama Y, Muneoka K, Suzuki M, Sato K, Hashimoto K. Low openness on the revised NEO personality inventory as a risk factor for treatment-resistant depression. PloS one. 2013;8(9):e71964 doi: ; PubMed Central PMCID: PMC3760870.
    1. Sharma V, Khan M, Smith A. A closer look at treatment resistant depression: is it due to a bipolar diathesis? Journal of affective disorders. 2005;84(2–3):251–7. doi: .
    1. Chekroud AM, Zotti RJ, Shehzad Z, Gueorguieva R, Johnson MK, Trivedi MH, et al. Cross-trial prediction of treatment outcome in depression: a machine learning approach. The lancet Psychiatry. 2016;3(3):243–50. doi: .
    1. Dodd S, Berk M, Kelin K, Zhang Q, Eriksson E, Deberdt W, et al. Application of the Gradient Boosted method in randomised clinical trials: Participant variables that contribute to depression treatment efficacy of duloxetine, SSRIs or placebo. Journal of affective disorders. 2014;168:284–93. doi: .
    1. Kuk AY, Li J, Rush AJ. Recursive subsetting to identify patients in the STAR*D: a method to enhance the accuracy of early prediction of treatment outcome and to inform personalized care. The Journal of clinical psychiatry. 2010;71(11):1502–8. doi: .
    1. Chekroud AM, Gueorguieva R, Krumholz HM, Trivedi MH, Krystal JH, McCarthy G. Reevaluating the Efficacy and Predictability of Antidepressant Treatments: A Symptom Clustering Approach. JAMA psychiatry. 2017;74(4):370–8. doi: .
    1. Perlis RH. A clinical risk stratification tool for predicting treatment resistance in major depressive disorder. Biological psychiatry. 2013;74(1):7–14. doi: ; PubMed Central PMCID: PMC3690142.
    1. Fava M, Rush AJ, Trivedi MH, Nierenberg AA, Thase ME, Sackeim HA, et al. Background and rationale for the sequenced treatment alternatives to relieve depression (STAR*D) study. The Psychiatric clinics of North America. 2003;26(2):457–94, x. .
    1. Rush AJ, Fava M, Wisniewski SR, Lavori PW, Trivedi MH, Sackeim HA, et al. Sequenced treatment alternatives to relieve depression (STAR*D): rationale and design. Controlled clinical trials. 2004;25(1):119–42. .
    1. Rapaport MH, Gharabawi GM, Canuso CM, Mahmoud RA, Keller MB, Bossie CA, et al. Effects of risperidone augmentation in patients with treatment-resistant depression: Results of open-label treatment followed by double-blind continuation. Neuropsychopharmacology: official publication of the American College of Neuropsychopharmacology. 2006;31(11):2505–13. doi: .
    1. Hamilton M. A rating scale for depression. Journal of neurology, neurosurgery, and psychiatry. 1960;23:56–62. ; PubMed Central PMCID: PMC495331.
    1. Hamilton M. Development of a rating scale for primary depressive illness. The British journal of social and clinical psychology. 1967;6(4):278–96. .
    1. Boessen R, Groenwold RH, Knol MJ, Grobbee DE, Roes KC. Comparing HAMD(17) and HAMD subscales on their ability to differentiate active treatment from placebo in randomized controlled trials. Journal of affective disorders. 2013;145(3):363–9. doi: .
    1. Bech P, Allerup P, Gram LF, Reisby N, Rosenberg R, Jacobsen O, et al. The Hamilton depression scale. Evaluation of objectivity using logistic models. Acta psychiatrica Scandinavica. 1981;63(3):290–9. .
    1. Bech P, Gram LF, Dein E, Jacobsen O, Vitger J, Bolwig TG. Quantitative rating of depressive states. Acta psychiatrica Scandinavica. 1975;51(3):161–70. .
    1. Bech P, Fava M, Trivedi MH, Wisniewski SR, Rush AJ. Factor structure and dimensionality of the two depression scales in STAR*D using level 1 datasets. Journal of affective disorders. 2011;132(3):396–400. doi: .
    1. Maier W, Heuser I, Philipp M, Frommberger U, Demuth W. Improving depression severity assessment—II. Content, concurrent and external validity of three observer depression scales. Journal of psychiatric research. 1988;22(1):13–9. .
    1. Maier W, Philipp M, Heuser I, Schlegel S, Buller R, Wetzel H. Improving depression severity assessment—I. Reliability, internal validity and sensitivity to change of three observer depression scales. Journal of psychiatric research. 1988;22(1):3–12. .
    1. Maier W, Philipp M. Comparative analysis of observer depression scales. Acta psychiatrica Scandinavica. 1985;72(3):239–45. .
    1. Santen G, Gomeni R, Danhof M, Della Pasqua O. Sensitivity of the individual items of the Hamilton depression rating scale to response and its consequences for the assessment of efficacy. Journal of psychiatric research. 2008;42(12):1000–9. doi: .
    1. Gibbons RD, Clark DC, Kupfer DJ. Exactly what does the Hamilton Depression Rating Scale measure? Journal of psychiatric research. 1993;27(3):259–73. .
    1. McIntyre RS, Konarski JZ, Mancini DA, Fulton KA, Parikh SV, Grigoriadis S, et al. Measuring the severity of depression and remission in primary care: validation of the HAMD-7 scale. CMAJ: Canadian Medical Association journal = journal de l'Association medicale canadienne. 2005;173(11):1327–34. doi: ; PubMed Central PMCID: PMC1283499.
    1. Ostergaard SD, Bech P, Trivedi MH, Wisniewski SR, Rush AJ, Fava M. Brief, unidimensional melancholia rating scales are highly sensitive to the effect of citalopram and may have biological validity: implications for the research domain criteria (RDoC). Journal of affective disorders. 2014;163:18–24. doi: .
    1. Bühlmann P, Rütimann P, van de Geer S, Zhang C-H. Correlated variables in regression: clustering and sparse estimation (with discussion). Journal of Statistical Planning and Inference. 2013;143: 1835–71.
    1. Liu H, Setiono R, editors. Chi2: Feature selection and discretization of numeric attributes. tai; 1995: IEEE.
    1. Zou H, Hastie T. Regularization and variable selection via the elastic net. J R Statist Soc B. 2005;67:301–20.
    1. Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J Stat Softw. 2010;33:1–22.
    1. Schimek MG, editor Penalized logistic regression in gene expression analysis. Proceedings to 2003 Semiparametric Conference; 2003; Berlin, Germany.
    1. Shen L, Than EC. Dimension Reduction-Based Penalized Logistic Regression for Cancer Classification Using Microarray Data. EEE/ACM Trans Comput Biol Bioinformatics. 2005;2(2):166–75.
    1. Breiman L. Random Forests. Machine Learning. 2001;45(1):5–32.
    1. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: Machine Learning in Python. The Journal of Machine Learning Research. 2011;12:2825–30.
    1. Nie Z, Yang T, Liu Y, Lin B, Li Q, Narayan VA, et al. Melancholic depression prediction by identifying representative features in metabolic and microarray profiles with missing values. Pacific Symposium on Biocomputing 2015. p. 455–66.
    1. Westfall PH, Young SS. Resampling-based multiple testing: examples and methods for P-value adjustment New York: Wiley; 1993. xvii, 340 p. p.
    1. Chase HW, Fournier JC, Greenberg T, Almeida JR, Stiffler R, Zevallos CR, et al. Accounting for Dynamic Fluctuations across Time when Examining fMRI Test-Retest Reliability: Analysis of a Reward Paradigm in the EMBARC Study. PloS one. 2015;10(5):e0126326 doi: ; PubMed Central PMCID: PMC4427400.
    1. Greenberg T, Chase HW, Almeida JR, Stiffler R, Zevallos CR, Aslam HA, et al. Moderation of the Relationship Between Reward Expectancy and Prediction Error-Related Ventral Striatal Reactivity by Anhedonia in Unmedicated Major Depressive Disorder: Findings From the EMBARC Study. The American journal of psychiatry. 2015;172(9):881–91. doi: .
    1. Phillips ML, Chase HW, Sheline YI, Etkin A, Almeida JR, Deckersbach T, et al. Identifying predictors, moderators, and mediators of antidepressant response in major depressive disorder: neuroimaging approaches. The American journal of psychiatry. 2015;172(2):124–38. doi: ; PubMed Central PMCID: PMC4464814.
    1. Webb CA, Dillon DG, Pechtel P, Goer FK, Murray L, Huys QJ, et al. Neural Correlates of Three Promising Endophenotypes of Depression: Evidence from the EMBARC Study. Neuropsychopharmacology: official publication of the American College of Neuropsychopharmacology. 2016;41(2):454–63. doi: .
    1. Kennedy SH, Downar J, Evans KR, Feilotter H, Lam RW, MacQueen GM, et al. The Canadian Biomarker Integration Network in Depression (CAN-BIND): advances in response prediction. Current pharmaceutical design. 2012;18(36):5976–89. .

Source: PubMed

3
Abonnere