Comparing SF-36 Scores Collected Through Web-Based Questionnaire Self-completions and Telephone Interviews: An Ancillary Study of the SENTIPAT Multicenter Randomized Controlled Trial

Ayşe Açma, Fabrice Carrat, Gilles Hejblum, Ayşe Açma, Fabrice Carrat, Gilles Hejblum

Abstract

Background: The 36-Item Short Form Health Survey (SF-36) is a popular questionnaire for measuring the self-perception of quality of life in a given population of interest. Processing the answers of a participant comprises the calculation of 10 scores corresponding to 8 scales measuring several aspects of perceived health and 2 summary components (physical and mental). Surprisingly, no study has compared score values issued from a telephone interview versus those from an internet-based questionnaire self-completion.

Objective: This study aims to compare the SF-36 score values issued from a telephone interview versus those from an internet-based questionnaire self-completion.

Methods: Patients with an internet connection and returning home after hospital discharge were enrolled in the SENTIPAT multicenter randomized trial on the day of discharge. They were randomized to either self-completing a set of questionnaires using a dedicated website (internet group) or providing answers to the same questionnaires administered during a telephone interview (telephone group). This ancillary study of the trial compared SF-36 data related to the posthospitalization period in these 2 groups. To anticipate the potential unbalanced characteristics of the responders in the 2 groups, the impact of the mode of administration of the questionnaire on score differences was investigated using a matched sample of individuals originating from the internet and telephone groups (1:1 ratio), in which the matching procedure was based on a propensity score approach. SF-36 scores observed in the internet and telephone groups were compared using the Wilcoxon-Mann-Whitney test, and the score differences between the 2 groups were also examined according to Cohen effect size.

Results: Overall, 29.2% (245/840) and 75% (630/840) of SF-36 questionnaires were completed in the internet and telephone groups, respectively (P<.001). Globally, the score differences between groups before matching were similar to those observed in the matched sample. Mean scores observed in the telephone group were all above the corresponding values observed in the internet group. After matching, score differences in 6 out of the 8 SF-36 scales were statistically significant, with a mean difference greater than 5 for 4 scales and an associated mild effect size ranging from 0.22 to 0.29, and with a mean difference near this threshold for 2 other scales (4.57 and 4.56) and a low corresponding effect size (0.18 and 0.16, respectively).

Conclusions: The telephone mode of administration of SF-36 involved an interviewer effect, increasing SF-36 scores. Questionnaire self-completion via the internet should be preferred, and surveys combining various administration methods should be avoided.

Trial registration: ClinicalTrials.gov NCT01769261; https://www.clinicaltrials.gov/ct2/show/record/NCT01769261.

Keywords: Bias, Epidemiologic; Effect Modifier, Epidemiologic; Forms as Topic; Internet; Interviews, Telephone; Patient Reported Outcome Measures; Patient Satisfaction; Quality of Life; Surveys and Questionnaires.

Conflict of interest statement

Conflicts of Interest: None declared.

©Ayşe Açma, Fabrice Carrat, Gilles Hejblum. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 10.03.2022.

Figures

Figure 1
Figure 1
Flow of participants through the study. SF-36: 36-Item Short Form Health Survey.
Figure 2
Figure 2
Differences in baseline variables between the internet and telephone responders before and after the matching procedure.
Figure 3
Figure 3
Observed mean score differences (telephone–internet) of SF-36 scales and summary components before and after matching. SF-36: 36-Item Short Form Health Survey.

References

    1. Busija L, Pausenberger E, Haines TP, Haymes S, Buchbinder R, Osborne RH. Adult measures of general health and health-related quality of life: medical outcomes study short form 36-item (SF-36) and short form 12-item (SF-12) health surveys, Nottingham Health Profile (NHP), Sickness Impact Profile (SIP), Medical Outcomes Study Short Form 6D (SF-6D), Health Utilities Index Mark 3 (HUI3), Quality of Well-Being Scale (QWB), and Assessment of Quality of Life (AQoL) Arthritis Care Res (Hoboken) 2011 Nov;63 Suppl 11:S383–412. doi: 10.1002/acr.20541. doi: 10.1002/acr.20541.
    1. Contopoulos-Ioannidis DG, Karvouni A, Kouri I, Ioannidis JPA. Reporting and interpretation of SF-36 outcomes in randomised trials: systematic review. BMJ. 2009;338:a3006.
    1. Ware JE. SF-36 health survey update. Spine (Phila Pa 1976) 2000 Dec 15;25(24):3130–9.
    1. Leplège A, Ecosse E, Verdier A, Perneger TV. The French SF-36 Health Survey: translation, cultural adaptation and preliminary psychometric evaluation. J Clin Epidemiol. 1998 Nov;51(11):1013–23. doi: 10.1016/s0895-4356(98)00093-6.S0895-4356(98)00093-6
    1. Counsell SR, Callahan CM, Clark DO, Tu W, Buttar AB, Stump TE, Ricketts GD. Geriatric care management for low-income seniors: a randomized controlled trial. JAMA. 2007 Dec 12;298(22):2623–33. doi: 10.1001/jama.298.22.2623.298/22/2623
    1. Husky MM, Ferdous Farin F, Compagnone P, Fermanian C, Kovess-Masfety V. Chronic back pain and its association with quality of life in a large French population survey. Health Qual Life Outcomes. 2018 Sep 26;16(1):195. doi: 10.1186/s12955-018-1018-4. 10.1186/s12955-018-1018-4
    1. Middleton S, McElduff P, Ward J, Grimshaw JM, Dale S, D'Este C, Drury P, Griffiths R, Cheung NW, Quinn C, Evans M, Cadilhac D, Levi C, QASC Trialists Group Implementation of evidence-based treatment protocols to manage fever, hyperglycaemia, and swallowing dysfunction in acute stroke (QASC): a cluster randomised controlled trial. Lancet. 2011 Nov 12;378(9804):1699–706. doi: 10.1016/S0140-6736(11)61485-2.S0140-6736(11)61485-2
    1. Needham DM, Dinglas VD, Bienvenu OJ, Colantuoni E, Wozniak AW, Rice TW, Hopkins RO, NIH NHLBI ARDS Network One year outcomes in patients with acute lung injury randomised to initial trophic or full enteral feeding: prospective follow-up of EDEN randomised trial. BMJ. 2013 Mar 19;346:f1532. doi: 10.1136/bmj.f1532.
    1. Bell DS, Kahn CE. Health status assessment via the World Wide Web. Proc AMIA Annu Fall Symp. 1996:338–42.
    1. Cunha-Miranda L, Santos H, Miguel C, Silva C, Barcelos F, Borges J, Trinca R, Vicente V, Silva T. Validation of Portuguese-translated computer touch-screen questionnaires in patients with rheumatoid arthritis and spondyloarthritis, compared with paper formats. Rheumatol Int. 2015 Dec;35(12):2029–35. doi: 10.1007/s00296-015-3347-5.10.1007/s00296-015-3347-5
    1. Hidalgo-Mazzei D, Reinares M, Mateu A, Nikolova VL, Bonnín CD, Samalin L, García-Estela A, Pérez-Solá V, Young AH, Strejilevich S, Vieta E, Colom F. OpenSIMPLe: a real-world implementation feasibility study of a smartphone-based psychoeducation programme for bipolar disorder. J Affect Disord. 2018 Dec 01;241:436–45. doi: 10.1016/j.jad.2018.08.048.S0165-0327(18)30848-6
    1. Kao H, Wu W, Liang T, Lee K, Hou M, Shi H. Cloud-based service information system for evaluating quality of life after breast cancer surgery. PLoS One. 2015;10(9):e0139252. doi: 10.1371/journal.pone.0139252. PONE-D-15-15874
    1. Basnov M, Kongsved SM, Bech P, Hjollund NH. Reliability of short form-36 in an internet- and a pen-and-paper version. Inform Health Soc Care. 2009 Jan;34(1):53–8. doi: 10.1080/17538150902779527.909744214
    1. Broering JM, Paciorek A, Carroll PR, Wilson LS, Litwin MS, Miaskowski C. Measurement equivalence using a mixed-mode approach to administer health-related quality of life instruments. Qual Life Res. 2014 Mar;23(2):495–508. doi: 10.1007/s11136-013-0493-7.
    1. Kongsved SM, Basnov M, Holm-Christensen K, Hjollund NH. Response rate and completeness of questionnaires: a randomized study of internet versus paper-and-pencil versions. J Med Internet Res. 2007;9(3):e25. doi: 10.2196/jmir.9.3.e25. v9i3e25
    1. MacKenzie H, Thavaneswaran A, Chandran V, Gladman DD. Patient-reported outcome in psoriatic arthritis: a comparison of web-based versus paper-completed questionnaires. J Rheumatol. 2011 Dec;38(12):2619–24. doi: 10.3899/jrheum.110165.jrheum.110165
    1. Naus MJ, Philipp LM, Samsi M. From paper to pixels: a comparison of paper and computer formats in psychological assessment. Comput Hum Behav. 2009 Jan;25(1):1–7. doi: 10.1016/j.chb.2008.05.012.
    1. Buskirk TD, Stein KD. Telephone vs. mail survey gives different SF-36 quality-of-life scores among cancer survivors. J Clin Epidemiol. 2008 Oct;61(10):1049–55. doi: 10.1016/j.jclinepi.2007.11.012.S0895-4356(07)00447-7
    1. Cerrada CJ, Weinberg J, Sherman KJ, Saper RB. Inter-method reliability of paper surveys and computer assisted telephone interviews in a randomized controlled trial of yoga for low back pain. BMC Res Notes. 2014 Apr 09;7:227. doi: 10.1186/1756-0500-7-227. 1756-0500-7-227
    1. García M, Rohlfs I, Vila J, Sala J, Pena A, Masiá R, Marrugat J, REGICOR Investigators Comparison between telephone and self-administration of Short Form Health Survey Questionnaire (SF-36) Gac Sanit. 2005;19(6):433–9. doi: 10.1016/s0213-9111(05)71393-5. 13082788
    1. Hays RD, Kim S, Spritzer KL, Kaplan RM, Tally S, Feeny D, Liu H, Fryback DG. Effects of mode and order of administration on generic health-related quality of life scores. Value Health. 2009 Sep;12(6):1035–9. doi: 10.1111/j.1524-4733.2009.00566.x. S1098-3015(10)60305-1
    1. Jörngården A, Wettergen L, von Essen L. Measuring health-related quality of life in adolescents and young adults: Swedish normative data for the SF-36 and the HADS, and the influence of age, gender, and method of administration. Health Qual Life Outcomes. 2006 Dec 01;4:91. doi: 10.1186/1477-7525-4-91. 1477-7525-4-91
    1. McHorney CA, Kosinski M, Ware JE. Comparisons of the costs and quality of norms for the SF-36 health survey collected by mail versus telephone interview: results from a national survey. Med Care. 1994 Jun;32(6):551–67.
    1. Perkins JJ, Sanson-Fisher RW. An examination of self- and telephone-administered modes of administration for the Australian SF-36. J Clin Epidemiol. 1998 Nov;51(11):969–73.S0895-4356(98)00088-2
    1. Powers JR, Mishra G, Young AF. Differences in mail and telephone responses to self-rated health: use of multiple imputation in correcting for response bias. Aust N Z J Public Health. 2005 Apr;29(2):149–54. doi: 10.1111/j.1467-842x.2005.tb00065.x.
    1. Wettergren L, Mattsson E, von Essen L. Mode of administration only has a small effect on data quality and self-reported health status and emotional distress among Swedish adolescents and young adults. J Clin Nurs. 2011 Jun;20(11-12):1568–77. doi: 10.1111/j.1365-2702.2010.03481.x.
    1. Sentinel patients: value of an information system collecting patient's own report on his healthcare management (SENTIPAT) NIH U.S. National Library of Medicines. [2020-12-17]. .
    1. Couturier B, Carrat F, Hejblum G. Comparing patients' opinions on the hospital discharge process collected with a self-reported questionnaire completed via the internet or through a telephone survey: an ancillary study of the SENTIPAT randomized controlled trial. J Med Internet Res. 2015 Jun 24;17(6):e158. doi: 10.2196/jmir.4379. v17i6e158
    1. Feldman SF, Lapidus N, Cosnes J, Tiret E, Fonquernie L, Cabane J, Chazouilleres O, Surgers L, Beaussier M, Valleron A, Carrat F, Hejblum G. Comparing inpatient satisfaction collected via a web-based questionnaire self-completion and through a telephone interview: an ancillary study of the SENTIPAT randomized controlled trial. J Med Internet Res. 2017 Aug 23;19(8):e293. doi: 10.2196/jmir.7061. v19i8e293
    1. Leplège A, Ecosse E, Pouchot J, Coste J, Perneger T. Le questionnaire MOS SF-36: manuel de l'utilisateur et guide d'interprétation des scores. Paris, France: Éditions ESTEM; 2001.
    1. Angst F, Aeschlimann A, Angst J. The minimal clinically important difference raised the significance of outcome effects above the statistical level, with methodological implications for future studies. J Clin Epidemiol. 2017 Feb;82:128–36. doi: 10.1016/j.jclinepi.2016.11.016. S0895-4356(16)30776-4
    1. Rouquette A, Blanchin M, Sébille V, Guillemin F, Côté SM, Falissard B, Hardouin J. The minimal clinically important difference determined using item response theory models: an attempt to solve the issue of the association with baseline score. J Clin Epidemiol. 2014 Apr;67(4):433–40. doi: 10.1016/j.jclinepi.2013.10.009.S0895-4356(13)00440-X
    1. Ware J, Snow K, Kosinski M, Gandek B. SF-36 Health Survey: Manual and Interpretation Guide. Lincoln, RI: Quality Metric Incorporated; 1993.
    1. Cohen J. A power primer. Psychol Bull. 1992 Jul;112(1):155–9.
    1. Ho DE, Imai K, King G, Stuart EA. MatchIt: nonparametric preprocessing for parametric causal inference. J Stat Soft. 2011;42(8):1–28. doi: 10.18637/jss.v042.i08.
    1. Greenland S, Senn SJ, Rothman KJ, Carlin JB, Poole C, Goodman SN, Altman DG. Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations. Eur J Epidemiol. 2016 May 21;31(4):337–50. doi: 10.1007/s10654-016-0149-3.
    1. Nuzzo R. Scientific method: statistical errors. Nature. 2014 Feb 13;506(7487):150–2. doi: 10.1038/506150a.506150a
    1. Hays J, Ockene JK, Brunner RL, Kotchen JM, Manson JE, Patterson RE, Aragaki AK, Shumaker SA, Brzyski RG, LaCroix AZ, Granek IA, Valanis BG, Women's Health Initiative Investigators Effects of estrogen plus progestin on health-related quality of life. N Engl J Med. 2003 May 08;348(19):1839–54. doi: 10.1056/NEJMoa030311.NEJMoa030311
    1. Yost KJ, Haan MN, Levine RA, Gold EB. Comparing SF-36 scores across three groups of women with different health profiles. Qual Life Res. 2005 Jun;14(5):1251–61. doi: 10.1007/s11136-004-6673-8.
    1. Younossi ZM, Stepanova M, Afdhal N, Kowdley KV, Zeuzem S, Henry L, Hunt SL, Marcellin P. Improvement of health-related quality of life and work productivity in chronic hepatitis C patients with early and advanced fibrosis treated with ledipasvir and sofosbuvir. J Hepatol. 2015 Aug;63(2):337–45. doi: 10.1016/j.jhep.2015.03.014.S0168-8278(15)00192-0
    1. Lumley T. Complex Surveys: A Guide to Analysis Using R. Oxford, UK: Wiley-Blackwell; 2010. pp. 1–296.
    1. Lyons RA, Fielder H, Littlepage BN. Measuring health status with the SF-36: the need for regional norms. J Public Health Med. 1995 Mar;17(1):46–50.
    1. Lyons RA, Wareham K, Lucas M, Price D, Williams J, Hutchings HA. SF-36 scores vary by method of administration: implications for study design. J Public Health Med. 1999 Mar;21(1):41–5. doi: 10.1093/pubmed/21.1.41.
    1. Dijkstra W. How interviewer variance can bias the results of research on interviewer effects. Qual Quant. 1983 Jun;17(3):179–87. doi: 10.1007/BF00167582.
    1. Kreuter F, Presser S, Tourangeau R. Social desirability bias in CATI, IVR, and web surveys: the effects of mode and question sensitivity. Public Opin Quart. 2009 Jan 26;72(5):847–65. doi: 10.1093/poq/nfn063.
    1. Blumenberg C, Barros AJ. Response rate differences between web and alternative data collection methods for public health research: a systematic review of the literature. Int J Public Health. 2018 Jul;63(6):765–73. doi: 10.1007/s00038-018-1108-4.10.1007/s00038-018-1108-4
    1. Daikeler J, Bosnjak M, Manfreda K. Web versus other survey modes: an updated and extended meta-analysis comparing response rates. J Surv Stat Methodol. 2020;8(3):513–39. doi: 10.1093/jssam/smz008.
    1. Colley RC, Bushnik T, Langlois K. Exercise and screen time during the COVID-19 pandemic. Health Rep. 2020 Jul 15;31(6):3–11. doi: 10.25318/82-003-x202000600001-eng. doi: 10.25318/82-003-x202000600001-eng.82-003-X202000600001
    1. Sun S, Folarin AA, Ranjan Y, Rashid Z, Conde P, Stewart C, Cummins N, Matcham F, Dalla Costa G, Simblett S, Leocani L, Lamers F, Sørensen PS, Buron M, Zabalza A, Guerrero Pérez AI, Penninx BW, Siddi S, Haro JM, Myin-Germeys I, Rintala A, Wykes T, Narayan VA, Comi G, Hotopf M, Dobson RJ, RADAR-CNS Consortium Using smartphones and wearable devices to monitor behavioral changes during COVID-19. J Med Internet Res. 2020 Sep 25;22(9):e19992. doi: 10.2196/19992. v22i9e19992

Source: PubMed

3
Abonner