Performance of seven consumer sleep-tracking devices compared with polysomnography

Evan D Chinoy, Joseph A Cuellar, Kirbie E Huwa, Jason T Jameson, Catherine H Watson, Sara C Bessman, Dale A Hirsch, Adam D Cooper, Sean P A Drummond, Rachel R Markwald, Evan D Chinoy, Joseph A Cuellar, Kirbie E Huwa, Jason T Jameson, Catherine H Watson, Sara C Bessman, Dale A Hirsch, Adam D Cooper, Sean P A Drummond, Rachel R Markwald

Abstract

Study objectives: Consumer sleep-tracking devices are widely used and becoming more technologically advanced, creating strong interest from researchers and clinicians for their possible use as alternatives to standard actigraphy. We, therefore, tested the performance of many of the latest consumer sleep-tracking devices, alongside actigraphy, versus the gold-standard sleep assessment technique, polysomnography (PSG).

Methods: In total, 34 healthy young adults (22 women; 28.1 ± 3.9 years, mean ± SD) were tested on three consecutive nights (including a disrupted sleep condition) in a sleep laboratory with PSG, along with actigraphy (Philips Respironics Actiwatch 2) and a subset of consumer sleep-tracking devices. Altogether, four wearable (Fatigue Science Readiband, Fitbit Alta HR, Garmin Fenix 5S, Garmin Vivosmart 3) and three nonwearable (EarlySense Live, ResMed S+, SleepScore Max) devices were tested. Sleep/wake summary and epoch-by-epoch agreement measures were compared with PSG.

Results: Most devices (Fatigue Science Readiband, Fitbit Alta HR, EarlySense Live, ResMed S+, SleepScore Max) performed as well as or better than actigraphy on sleep/wake performance measures, while the Garmin devices performed worse. Overall, epoch-by-epoch sensitivity was high (all ≥0.93), specificity was low-to-medium (0.18-0.54), sleep stage comparisons were mixed, and devices tended to perform worse on nights with poorer/disrupted sleep.

Conclusions: Consumer sleep-tracking devices exhibited high performance in detecting sleep, and most performed equivalent to (or better than) actigraphy in detecting wake. Device sleep stage assessments were inconsistent. Findings indicate that many newer sleep-tracking devices demonstrate promising performance for tracking sleep and wake. Devices should be tested in different populations and settings to further examine their wider validity and utility.

Keywords: actigraphy; nearables; noncontact; polysomnography; sensors; sleep technology; validation; wearables.

© Sleep Research Society 2020. Published by Oxford University Press on behalf of the Sleep Research Society.

Figures

Figure 1.
Figure 1.
Bland–Altman plots: total sleep time (TST). Bland–Altman plots depicting the mean bias (blue dashed line) and upper and lower limits of agreement (two standard deviations from bias; black dashed lines) for minutes of TST for the devices compared with polysomnography (PSG). Black circles are individual nights. Solid blue curves represent the best-fit of data, with surrounding gray shaded regions representing 95% confidence bands. The solid black line at zero represents no difference, with positive and negative y-axis values indicating an overestimation or underestimation, respectively, compared with PSG.
Figure 2.
Figure 2.
Bland–Altman plots: sleep efficiency (SE). Bland–Altman plots depicting the percentage of SE for the devices compared with polysomnography (PSG). See Figure 1 caption for additional details on the interpretation of Bland–Altman plots.
Figure 3.
Figure 3.
Bland–Altman plots: sleep onset latency (SOL). Bland–Altman plots depicting the minutes of SOL for the devices compared with polysomnography (PSG). See Figure 1 caption for additional details on the interpretation of Bland–Altman plots.
Figure 4.
Figure 4.
Bland–Altman plots: wake after sleep onset (WASO). Bland–Altman plots depicting the minutes of WASO from sleep onset latency (SOL) for the devices compared with polysomnography (PSG). See Figure 1 caption for additional details on the interpretation of Bland–Altman plots.
Figure 5.
Figure 5.
Bland–Altman plots: light sleep. Bland–Altman plots depicting the minutes of light sleep for the devices compared with polysomnography (PSG). For PSG, light sleep was calculated as the combination of N1 and N2 sleep stages. Only devices that output data on sleep stages are depicted. See Figure 1 caption for additional details on the interpretation of Bland–Altman plots.
Figure 6.
Figure 6.
Bland–Altman plots: deep sleep. Bland–Altman plots depicting the minutes of deep sleep for the devices compared with polysomnography (PSG). For PSG, deep sleep was calculated as the N3 sleep stage. Only devices that output data on sleep stages are depicted. See Figure 1 caption for additional details on the interpretation of Bland–Altman plots.
Figure 7.
Figure 7.
Bland–Altman plots: rapid eye movement (REM) sleep. Bland–Altman plots depicting the minutes of REM sleep for the devices compared with polysomnography (PSG). Only devices that output data on sleep stages are depicted. See Figure 1 caption for additional details on the interpretation of Bland–Altman plots.

References

    1. Breslau N, et al. . Sleep disturbance and psychiatric disorders: a longitudinal epidemiological study of young adults. Biol Psychiatry. 1996;39(6):411–418.
    1. Knutson KL, et al. . The metabolic consequences of sleep deprivation. Sleep Med Rev. 2007;11(3):163–178.
    1. Luyster FS, et al. ; Boards of Directors of the American Academy of Sleep Medicine and the Sleep Research Society. Sleep: a health imperative. Sleep. 2012;35(6):727–734.
    1. Baglioni C, et al. . Sleep and mental disorders: a meta-analysis of polysomnographic research. Psychol Bull. 2016;142(9):969–990.
    1. Belenky G, et al. . Patterns of performance degradation and restoration during sleep restriction and subsequent recovery: a sleep dose–response study. J Sleep Res. 2003;12(1):1–12.
    1. Van Dongen HPA, et al. . The cumulative cost of additional wakefulness: dose-response effects on neurobehavioral functions and sleep physiology from chronic sleep restriction and total sleep deprivation. Sleep. 2003;26(2):117–126. doi:10.1093/sleep/26.2.117
    1. Goel N, et al. . Neurocognitive consequences of sleep deprivation. Semin Neurol. 2009;29(4):320–339.
    1. Iber C, et al. . The AASM Manual for the Scoring of Sleep and Associated Events: Rules, Terminology and Technical Specifications. Westchester, IL: American Academy of Sleep Medicine; 2007.
    1. Cole RJ, et al. . Automatic sleep/wake identification from wrist activity. Sleep. 1992;15(5):461–469.
    1. Sadeh A, et al. . The role of actigraphy in the evaluation of sleep disorders. Sleep. 1995;18(4):288–302.
    1. Ancoli-Israel S, et al. . The role of actigraphy in the study of sleep and circadian rhythms. Sleep. 2003;26(3):342–392.
    1. Morgenthaler T, et al. . Practice parameters for the use of actigraphy in the assessment of sleep and sleep disorders: an update for 2007. Sleep. 2007;30(4):519–529. doi:10.1093/sleep/30.4.519
    1. Paquet J, et al. . Wake detection capacity of actigraphy during sleep. Sleep. 2007;30(10):1362–1369.
    1. Sadeh A. The role and validity of actigraphy in sleep medicine: an update. Sleep Med Rev. 2011;15(4):259–267. doi:10.1016/j.smrv.2010.10.001
    1. Van de Water ATM, et al. . Objective measurements of sleep for non-laboratory settings as alternatives to polysomnography--a systematic review. J Sleep Res. 2011;20(1 Pt 2):183–200. doi:10.1111/j.1365-2869.2009.00814.x
    1. Marino M, et al. . Measuring sleep: accuracy, sensitivity, and specificity of wrist actigraphy compared to polysomnography. Sleep. 2013;36(11):1747–1755.
    1. Grandner MA, et al. . Chapter 12 – Actigraphic sleep tracking and wearables: historical context, scientific applications and guidelines, limitations, and considerations for commercial sleep devices. In: Grandner MA, ed. Sleep and Health. Cambridge, MA: Academic Press; 2019: 147–157. doi:10.1016/B978-0-12-815373-4.00012-5
    1. Kolla BP, et al. . Consumer sleep tracking devices: a review of mechanisms, validity and utility. Expert Rev Med Devices. 2016;13(5):497–506.
    1. Khosla S, et al. ; American Academy of Sleep Medicine Board of Directors. Consumer sleep technology: an American Academy of Sleep Medicine position statement. J Clin Sleep Med. 2018;14(5):877–880.
    1. Baron KG, et al. . Feeling validated yet? A scoping review of the use of consumer-targeted wearable and mobile technology to measure and improve sleep. Sleep Med Rev. 2018;40:151–159.
    1. Bianchi MT. Sleep devices: wearables and nearables, informational and interventional, consumer and clinical. Metabolism. 2018;84:99–108.
    1. de Zambotti M, et al. . Wearable sleep technology in clinical and research settings. Med Sci Sports Exerc. 2019;51(7):1538–1557.
    1. Depner CM, et al. .. Wearable technologies for developing sleep and circadian biomarkers: a summary of workshop discussions. Sleep. 2020;43(2). doi:10.1093/sleep/zsz254
    1. Tal A, et al. . Validation of contact-free sleep monitoring device with comparison to polysomnography. J Clin Sleep Med. 2017;13(3):517–522.
    1. Jumabhoy R, et al. . Wrist-worn activity monitoring devices overestimate sleep duration and efficiency in health adults. Sleep. 2017;40(Suppl 1):A288–A289. doi:10.1093/sleepj/zsx050.778
    1. Schade MM, et al. . Sleep validity of a non-contact bedside movement and respiration-sensing device. J Clin Sleep Med. 2019;15(7):1051–1061. doi:10.5664/jcsm.7892
    1. Lee XK, et al. . Validation of a consumer sleep wearable device with actigraphy and polysomnography in adolescents across sleep opportunity manipulations. J Clin Sleep Med. 2019;15(9):1337–1346.
    1. Moreno-Pino F, et al. . Validation of Fitbit Charge 2 and Fitbit Alta HR against polysomnography for assessing sleep in adults with obstructive sleep apnea. J Clin Sleep Med. 2019;15(11):1645–1653.
    1. Kahawage P, et al. . Validity, potential clinical utility, and comparison of consumer and research-grade activity trackers in Insomnia Disorder I: in-lab validation against polysomnography. J Sleep Res. 2020;29(1):e12931.
    1. Liu S. Fitness & Activity Tracker – Statistics & Facts. Published May 22, 2019. . Accessed January 28, 2020.
    1. Moar J. Where Now for Wearables? Published March 21, 2018. . Accessed January 28, 2020.
    1. Kunst A. Percentage of U.S. Adults that Use Apps to Track their Sleep as of 2017, by Gender. Published December 20, 2019. /. Accessed January 28, 2020.
    1. Syngene Research. Global Smart Sleep Tracking Device Market Analysis 2019. Published November 2019. . Accessed January 28, 2020.
    1. Menghini L, et al. . A standardized framework for testing the performance of sleep-tracking technology: step-by-step guidelines and open-source code. Sleep. 2020. doi:10.1093/sleep/zsaa170
    1. de Zambotti M, et al. . A validation study of Fitbit Charge 2TM compared with polysomnography in adults. Chronobiol Int. 2018;35(4):465–476. doi:10.1080/07420528.2017.1413578
    1. Cook JD, et al. . Ability of the Fitbit Alta HR to quantify and classify sleep in patients with suspected central disorders of hypersomnolence: a comparison against polysomnography. J Sleep Res. 2019;28(4):e12789.
    1. de Zambotti M, et al. . The sleep of the ring: comparison of the ŌURA Sleep Tracker against polysomnography. Behav Sleep Med. 2017; 17(2):124–136. doi:10.1080/15402002.2017.1300587
    1. Bland JM, et al. . Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;1(8476):307–310.
    1. Bland JM, et al. . Measuring agreement in method comparison studies. Stat Methods Med Res. 1999;8(2):135–160.
    1. Scott H, et al. . A systematic review of the accuracy of sleep wearable devices for estimating sleep onset. Sleep Med Rev. 2020;49:101227.
    1. Montgomery-Downs HE, et al. . Movement toward a novel activity monitoring device. Sleep Breath. 2012;16(3):913–917.
    1. de Zambotti M, et al. . Evaluation of a consumer fitness-tracking device to assess sleep in adults. Chronobiol Int. 2015;32(7):1024–1028.
    1. Meltzer LJ, et al. . Comparison of a commercial accelerometer with polysomnography and actigraphy in children and adolescents. Sleep. 2015;38(8):1323–1330.
    1. de Zambotti M, et al. . Validation of sleep-tracking technology compared with polysomnography in adolescents. Sleep. 2015;38(9):1461–1468.
    1. Mantua J, et al. . Reliability of sleep measures from four personal health monitoring devices compared to research-based actigraphy and polysomnography. Sensors (Basel). 2016;16(5):646. doi:10.3390/s16050646
    1. Haghayegh S, et al. . Accuracy of wristband Fitbit models in assessing sleep: systematic review and meta-analysis. J Med Internet Res. 2019;21(11):e16273.
    1. Colvonen PJ, et al. . Limiting racial disparities and bias for wearable devices in health science research. Sleep. 2020;43(10). doi:10.1093/sleep/zsaa159
    1. Hamill K, et al. . Validity, potential clinical utility and comparison of a consumer activity tracker and a research-grade activity tracker in insomnia disorder II: outside the laboratory. J Sleep Res. 2020;29(1):e12944.

Source: PubMed

3
订阅