The Patient Health Questionnaire-9 for detection of major depressive disorder in primary care: consequences of current thresholds in a cross-sectional study

Nicolaas P A Zuithoff, Yvonne Vergouwe, Michael King, Irwin Nazareth, Manja J van Wezep, Karel G M Moons, Mirjam I Geerlings

Abstract

Background: There is a need for brief instruments to ascertain the diagnosis of major depressive disorder. In this study, we present the reliability, construct validity and accuracy of the PHQ-9 and PHQ-2 to detect major depressive disorder in primary care.

Methods: Cross-sectional analyses within a large prospective cohort study (PREDICT-NL). Data were collected in seven large general practices in the centre of the Netherlands. A total of 1338 subjects were recruited in the general practice waiting room, irrespective of their presenting complaint. The diagnostic accuracy (the area under the ROC curve, and sensitivities and specificities for various thresholds) was calculated against a diagnosis of major depressive disorder determined with the Composite International Diagnostic Interview (CIDI).

Results: The PHQ-9 showed a high degree of internal consistency (ICC = 0.88) and test-retest reliability (correlation = 0.94). With respect to construct validity, it showed a clear association with functional status measurements, sick days and number of consultations. The discriminative ability was good for the PHQ-9 (area under the ROC curve = 0.87, 95% CI: 0.84-0.90) and the PHQ-2 (area under the ROC curve = 0.83, 95% CI: 0.80-0.87). Sensitivities at the recommended thresholds were 0.49 for the PHQ-9 at a score of 10 and 0.28 for a categorical algorithm. Adjustment of the threshold and the algorithm improved sensitivities to 0.82 and 0.84, respectively, but the specificity decreased from 0.95 to 0.82 (threshold) and from 0.98 to 0.81 (algorithm). Similar results were found for the PHQ-2: the recommended threshold of 3 had a sensitivity of 0.42, and lowering the threshold improved the sensitivity to 0.81.

Conclusion: The PHQ-9 and the PHQ-2 are useful instruments to detect major depressive disorder in primary care, provided a high score is followed by an additional diagnostic work-up. However, the commonly recommended thresholds for the PHQ-9 and the PHQ-2 left many cases of major depressive disorder undetected.
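The trade-off described in the Results, where lowering a threshold raises sensitivity at the cost of specificity, can be illustrated with a minimal sketch. The function name `sensitivity_specificity` and the toy scores and diagnoses below are hypothetical and are not data from the study; the sketch assumes a patient screens positive when the summed score is at or above the threshold, and treats the reference diagnosis as a simple true/false flag.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(
    scores: Sequence[int], has_mdd: Sequence[bool], threshold: int
) -> Tuple[float, float]:
    """Return (sensitivity, specificity) when score >= threshold counts as positive.

    Hypothetical helper for illustration only; not the authors' analysis code.
    """
    tp = fn = tn = fp = 0
    for score, diseased in zip(scores, has_mdd):
        positive = score >= threshold
        if diseased and positive:
            tp += 1  # true positive: disorder present, screen positive
        elif diseased:
            fn += 1  # false negative: disorder present, screen negative
        elif positive:
            fp += 1  # false positive: disorder absent, screen positive
        else:
            tn += 1  # true negative: disorder absent, screen negative
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one patient with and one without the disorder")
    return tp / (tp + fn), tn / (tn + fp)


# Made-up toy data (NOT from the study): questionnaire sum scores and
# reference-standard diagnoses for ten imaginary patients.
toy_scores = [2, 4, 5, 7, 8, 9, 11, 12, 14, 3]
toy_mdd = [False, False, False, True, False, True, True, False, True, False]

for cutoff in (10, 6):
    sens, spec = sensitivity_specificity(toy_scores, toy_mdd, cutoff)
    print(f"threshold {cutoff}: sensitivity {sens:.2f}, specificity {spec:.2f}")
```

In this toy example the lower threshold detects more of the patients with the disorder but also flags more patients without it, which is the same direction of change the abstract reports for the PHQ-9 and PHQ-2.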

Figures

Figure 1
Flow chart of the inclusion of patients.

Source: PubMed
