A Validation of Six Wearable Devices for Estimating Sleep, Heart Rate and Heart Rate Variability in Healthy Adults

Dean J Miller, Charli Sargent, Gregory D Roach, Dean J Miller, Charli Sargent, Gregory D Roach

Abstract

The primary aim of this study was to examine the validity of six commonly used wearable devices, i.e., Apple Watch S6, Garmin Forerunner 245 Music, Polar Vantage V, Oura Ring Generation 2, WHOOP 3.0 and Somfit, for assessing sleep. The secondary aim was to examine the validity of the six devices for assessing heart rate and heart rate variability during, or just prior to, night-time sleep. Fifty-three adults (26 F, 27 M, aged 25.4 ± 5.9 years) spent a single night in a sleep laboratory with 9 h in bed (23:00-08:00 h). Participants were fitted with all six wearable devices-and with polysomnography and electrocardiography for gold-standard assessment of sleep and heart rate, respectively. Compared with polysomnography, agreement (and Cohen's kappa) for two-state categorisation of sleep periods (as sleep or wake) was 88% (κ = 0.30) for Apple Watch; 89% (κ = 0.35) for Garmin; 87% (κ = 0.44) for Polar; 89% (κ = 0.51) for Oura; 86% (κ = 0.44) for WHOOP and 87% (κ = 0.48) for Somfit. Compared with polysomnography, agreement (and Cohen's kappa) for multi-state categorisation of sleep periods (as a specific sleep stage or wake) was 53% (κ = 0.20) for Apple Watch; 50% (κ = 0.25) for Garmin; 51% (κ = 0.28) for Polar; 61% (κ = 0.43) for Oura; 60% (κ = 0.44) for WHOOP and 65% (κ = 0.52) for Somfit. Analyses regarding the two-state categorisation of sleep indicate that all six devices are valid for the field-based assessment of the timing and duration of sleep. However, analyses regarding the multi-state categorisation of sleep indicate that all six devices require improvement for the assessment of specific sleep stages. As the use of wearable devices that are valid for the assessment of sleep increases in the general community, so too does the potential to answer research questions that were previously impractical or impossible to address-in some way, we could consider that the whole world is becoming a sleep laboratory.

Keywords: autonomic modulation; autonomic nervous system; cardiovascular health; consumer sleep technology; photoplethysmography; sleep quality; sleep quantity; sleep staging; wearable sleep monitor.

Conflict of interest statement

G.D.R., C.S., and D.J.M. are members of a research group at Central Queensland University (i.e., The Sleep Lab) that receives support for research (i.e., funding, equipment) from WHOOP Inc. However, WHOOP was not involved in the design, conduct or reporting of this study.

Figures

Figure 1
Figure 1
Bland–Altman plots for device–derived and PSG–derived measures of total sleep time. The y-axes represent the difference between the measures—positive values indicate devices overestimate relative to PSG, and negative values indicate devices underestimate relative to PSG. Solid red lines indicate the device bias relative to PSG. Solid grey lines indicate the 95% limits of agreement (±1.96 standard deviation).
Figure 2
Figure 2
Bland–Altman plots for device–derived and PSG–derived measures of heart rate. The y-axes represent the difference between the measures—positive values indicate devices overestimate relative to PSG, and negative values indicate devices underestimate relative to PSG. Solid red lines indicate the device bias relative to PSG. Solid grey lines indicate the 95% limits of agreement (±1.96 standard deviation). * manufacturers provided raw data; ** y-axis was adjusted in plot B to accommodate for one data point outside of the common y–axis range.
Figure 3
Figure 3
Bland–Altman plots for device–derived and PSG–derived measures of heart rate variability. The y–axes represent the difference between the measures—positive values indicate devices overestimate relative to PSG, and negative values indicate devices underestimate relative to PSG. Solid red lines indicate the device bias relative to PSG. Solid grey lines indicate the 95% limits of agreement (±1.96 standard deviation). * Manufacturers provided raw data.

References

    1. Wearable Technology Market Size, Share & Trends Analysis Report by Product (Wrist-Wear, Eye-Wear & Head-Wear, Foot-Wear, Neck-Wear, Body-Wear), by Application, by Region, and Segment Forecasts, 2020–2027. [(accessed on 17 June 2022)]. Available online: .
    1. Pevnick J.M., Birkeland K., Zimmer R., Elad Y., Kedan I. Wearable Technology for Cardiology: An Update and Framework for the Future. Trends Cardiovasc. Med. 2018;28:144–150. doi: 10.1016/j.tcm.2017.08.003.
    1. Piwek L., Ellis D.A., Andrews S., Joinson A. The Rise of Consumer Health Wearables: Promises and Barriers. PLoS Med. 2016;13:e1001953. doi: 10.1371/journal.pmed.1001953.
    1. Han P., Li L., Zhang H., Guan L., Marques C., Savović S., Ortega B., Min R., Li X. Low-Cost Plastic Optical Fiber Sensor Embedded in Mattress for Sleep Performance Monitoring. Opt. Fiber Technol. 2021;64:102541. doi: 10.1016/j.yofte.2021.102541.
    1. Kushida C.A., Littner M.R., Morgenthaler T., Alessi C.A., Bailey D., Coleman J., Jr., Friedman L., Hirshkowitz M., Kapen S., Kramer M., et al. Practice Parameters for the Indications for Polysomnography and Related Procedures: An Update for 2005. Sleep. 2005;28:499–521. doi: 10.1093/sleep/28.4.499.
    1. Berry R.B., Brooks R., Gamaldo C.E., Harding S.M. The AASM Manual for the Scoring of Sleep and Associated Events. American Academy of Sleep Medicine; Darien, IL, USA: 2015.
    1. de Zambotti M., Cellini N., Goldstone A., Colrain I.M., Baker F.C. Wearable Sleep Technology in Clinical and Research Settings. Med. Sci. Sports Exerc. 2019;51:1538–1557. doi: 10.1249/MSS.0000000000001947.
    1. Pollak C.P., Tryon W.W., Nagaraja H., Dzwonczyk R. How Accurately Does Wrist Actigraphy Identify the States of Sleep and Wakefulness? Sleep. 2001;24:957–965. doi: 10.1093/sleep/24.8.957.
    1. Sargent C., Lastella M., Romyn G., Versey N., Miller D.J., Roach G.D. How Well Does a Commercially Available Wearable Device Measure Sleep in Young Athletes? Chronobiol. Int. 2018;35:754–758. doi: 10.1080/07420528.2018.1466800.
    1. Miller D.J., Roach G.D., Lastella M., Scanlan A.T., Bellenger C.R., Halson S.L., Sargent C. A Validation Study of a Commercial Wearable Device to Automatically Detect and Estimate Sleep. Biosensors. 2021;11:185. doi: 10.3390/bios11060185.
    1. Plews D.J., Laursen P.B., Stanley J., Kilding A.E., Buchheit M. Training Adaptation and Heart Rate Variability in Elite Endurance Athletes: Opening the Door to Effective Monitoring. Sports Med. 2013;43:773–781. doi: 10.1007/s40279-013-0071-8.
    1. Plews D.J., Scott B., Altini M., Wood M., Kilding A.E., Laursen P.B. Comparison of Heart-Rate-Variability Recording With Smartphone Photoplethysmography, Polar H7 Chest Strap, and Electrocardiography. Int. J. Sports Physiol. Perform. 2017;12:1324–1328. doi: 10.1123/ijspp.2016-0668.
    1. de Zambotti M., Godino J.G., Baker F.C., Cheung J., Patrick K., Colrain I.M. The Boom in Wearable Technology: Cause for Alarm or Just What Is Needed to Better Understand Sleep? Sleep. 2016;39:1761–1762. doi: 10.5665/sleep.6108.
    1. Halson S.L. Sleep Monitoring in Athletes: Motivation, Methods, Miscalculations and Why It Matters. Sports Med. 2019;49:1487–1497. doi: 10.1007/s40279-019-01119-4.
    1. Miller D.J., Lastella M., Scanlan A.T., Bellenger C., Halson S.L., Roach G.D., Sargent C. A Validation Study of the WHOOP Strap against Polysomnography to Assess Sleep. J. Sports Sci. 2020;38:2631–2636. doi: 10.1080/02640414.2020.1797448.
    1. Iber C. The AASM Manual for the Scoring of Sleep and Associated Events: Rules. American Academy of Sleep Medicine; Darien, IL, USA: 2007. Terminology and Technical Specification.
    1. Nielsen T. Cognition in REM and NREM Sleep: A Review and Possible Reconciliation of Two Models of Sleep Mentation. Behav. Brain Sci. 2000;23:851–866. doi: 10.1017/S0140525X0000399X.
    1. Pichot V., Roche F., Celle S., Barthélémy J.-C., Chouchou F. HRVanalysis: A Free Software for Analyzing Cardiac Autonomic Activity. Front. Physiol. 2016;7:557. doi: 10.3389/fphys.2016.00557.
    1. Menghini L., Cellini N., Goldstone A., Baker F.C., de Zambotti M. A Standardized Framework for Testing the Performance of Sleep-Tracking Technology: Step-by-Step Guidelines and Open-Source Code. Sleep. 2021;44:zsaa170. doi: 10.1093/sleep/zsaa170.
    1. Magalang U.J., Chen N.-H., Cistulli P.A., Fedson A.C., Gíslason T., Hillman D., Penzel T., Tamisier R., Tufik S., Phillips G., et al. Agreement in the Scoring of Respiratory Events and Sleep among International Sleep Centers. Sleep. 2013;36:591–596. doi: 10.5665/sleep.2552.
    1. Kosmadopoulos A., Sargent C., Darwent D., Zhou X., Roach G.D. Alternatives to Polysomnography (PSG): A Validation of Wrist Actigraphy and a Partial-PSG System. Behav. Res. Methods. 2014;46:1032–1041. doi: 10.3758/s13428-013-0438-7.
    1. Bland J.M., Altman D.G. Agreement between Methods of Measurement with Multiple Observations per Individual. J. Biopharm. Stat. 2007;17:571–582. doi: 10.1080/10543400701329422.
    1. Koo T.K., Li M.Y. A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. J. Chiropr. Med. 2016;15:155–163. doi: 10.1016/j.jcm.2016.02.012.
    1. Sargent C., Lastella M., Halson S.L., Roach G.D. The Validity of Activity Monitors for Measuring Sleep in Elite Athletes. J. Sci. Med. Sport. 2016;19:848–853. doi: 10.1016/j.jsams.2015.12.007.

Source: PubMed

3
구독하다