The Dreem Headband compared to polysomnography for electroencephalographic signal acquisition and sleep staging

Pierrick J Arnal, Valentin Thorey, Eden Debellemaniere, Michael E Ballard, Albert Bou Hernandez, Antoine Guillot, Hugo Jourde, Mason Harris, Mathias Guillard, Pascal Van Beers, Mounir Chennaoui, Fabien Sauvet, Pierrick J Arnal, Valentin Thorey, Eden Debellemaniere, Michael E Ballard, Albert Bou Hernandez, Antoine Guillot, Hugo Jourde, Mason Harris, Mathias Guillard, Pascal Van Beers, Mounir Chennaoui, Fabien Sauvet

Abstract

Study objectives: The development of ambulatory technologies capable of monitoring brain activity during sleep longitudinally is critical for advancing sleep science. The aim of this study was to assess the signal acquisition and the performance of the automatic sleep staging algorithms of a reduced-montage dry-electroencephalographic (EEG) device (Dreem headband, DH) compared to the gold-standard polysomnography (PSG) scored by five sleep experts.

Methods: A total of 25 subjects who completed an overnight sleep study at a sleep center while wearing both a PSG and the DH simultaneously have been included in the analysis. We assessed (1) similarity of measured EEG brain waves between the DH and the PSG; (2) the heart rate, breathing frequency, and respiration rate variability (RRV) agreement between the DH and the PSG; and (3) the performance of the DH's automatic sleep staging according to American Academy of Sleep Medicine guidelines versus PSG sleep experts manual scoring.

Results: The mean percentage error between the EEG signals acquired by the DH and those from the PSG for the monitoring of α was 15 ± 3.5%, 16 ± 4.3% for β, 16 ± 6.1% for λ, and 10 ± 1.4% for θ frequencies during sleep. The mean absolute error for heart rate, breathing frequency, and RRV was 1.2 ± 0.5 bpm, 0.3 ± 0.2 cpm, and 3.2 ± 0.6%, respectively. Automatic sleep staging reached an overall accuracy of 83.5 ± 6.4% (F1 score: 83.8 ± 6.3) for the DH to be compared with an average of 86.4 ± 8.0% (F1 score: 86.3 ± 7.4) for the 5 sleep experts.

Conclusions: These results demonstrate the capacity of the DH to both monitor sleep-related physiological signals and process them accurately into sleep stages. This device paves the way for, large-scale, longitudinal sleep studies.

Clinical trial registration: NCT03725943.

Keywords: EEG; device; machine learning; sleep; sleep stages.

© Sleep Research Society 2020. Published by Oxford University Press on behalf of the Sleep Research Society.

Figures

Figure 1.
Figure 1.
20-s samples of raw signals recorded by DH (pink) and PSG (black) on the same record during each sleep stages (N1, N2, N3, REM, and Wake). The derivations are F7–O1 for the DH and F3–O1 for the PSG. The signals are presented between −150 and 150 μV.
Figure 2.
Figure 2.
Relative spectral power (α, β, λ, and θ frequencies, AU), heart rate (beats per minute, BPM), breathing frequency (cycles per minute, CPM), and respiratory rate variability (RRV, %) for a representative record (i.e. with a MPE similar to the mean of the group). These signals are presented for a whole record for both the DH (pink) and PSG (black).
Figure 3.
Figure 3.
Confusion matrix for the DH versus PSG scoring consensuses (top) and the overall confusion matrix for scorers versus the other scorers’ consensuses (bottom). Values are normalized by row with the number of epochs in parentheses.
Figure 4.
Figure 4.
Hypnograms for the five first participants showing both the consensuses of the four top-ranked scorers (gray) and the DH automated sleep stage classifications. Accuracies are presented as average obtained by the five scorers on the consensus hypnogram, and scores obtained for the DH versus the consensuses.
Figure 5.
Figure 5.
Bland Altman plots for each sleep variable measures by the DH versus the consensus sleep metrics computed for each record.

References

    1. National Center on Sleep Disorders Research and others. National Institutes of Health Sleep Disorders Research Plan. Bethesda, MD: National Institutes of Health;2011. Report No. DOT HS 808 707.
    1. Sateia MJ. International classification of sleep disorders—third edition: highlights and modifications. Chest. 2014;146:1387–1394.
    1. Ohayon MM. Epidemiological overview of sleep disorders in the general population. Sleep Med Res. 2011;2(1):1–9.
    1. Iber C, et al. The AASM Manual for the Scoring of Sleep and Associated Events : Rules, Terminology, and Technical Specifications. Westchester, IL: American Academy of Sleep Medicine; 2007.
    1. White LH, et al. Night-to-night variability in obstructive sleep apnea severity: relationship to overnight rostral fluid shift. J Clin Sleep Med. 2015;11(2):149–156.
    1. Bittencourt LR, et al. The variability of the apnoea-hypopnoea index. J Sleep Res. 2001;10:245–251.
    1. Rosenberg RS, et al. The American Academy of Sleep Medicine Inter-scorer Reliability program: respiratory events. J Clin Sleep Med. 2014;10:447–454.
    1. Danker-Hopfe H, et al. Interrater reliability for sleep scoring according to the Rechtschaffen & Kales and the new AASM standard. J Sleep Res. 2009;18:74–84.
    1. Sun H, et al. Large-scale automated sleep staging. Sleep. 2017;40(10). doi:10.1093/sleep/zsx139
    1. Patanaik A, et al. An end-to-end framework for real-time automatic sleep stage classification. Sleep. 2018;41:1–11.
    1. Chambon S, et al. A deep learning architecture for temporal sleep stage classification using multivariate and multimodal time series. IEEE Trans Neural Syst Rehabil Eng. 2018;26:758–769.
    1. Stephansen JB, et al. Neural network analysis of sleep stages enables efficient diagnosis of narcolepsy. Nat Commun. 2018;9:5229.
    1. Biswal S, et al. Automated sleep staging system via deep learning. SLEEPNET. [Submitted on 26 Jul 2017] .
    1. Kuna ST, et al. Agreement in computer-assisted manual scoring of polysomnograms across sleep centers. Sleep. 2013;36(4):583–589.
    1. Chambon S, et al. DOSED: a deep learning approach to detect multiple sleep micro-events in EEG signal. J Neurosci Methods. 2019;321:64–78.
    1. Gandhi M, et al. The Future of Biosensing Wearables. Rock Health. Published 2014 Accessed May 28, 2020.
    1. Marino M, et al. Measuring sleep: accuracy, sensitivity, and specificity of wrist actigraphy compared to polysomnography. Sleep. 2013;36:1747–1755.
    1. Montgomery-Downs HE, et al. Movement toward a novel activity monitoring device. Sleep Breath. 2012;16:913–917.
    1. Debellemaniere E, et al. Performance of an ambulatory dry-EEG device for auditory closed-loop stimulation of sleep slow oscillations in the home environment. Front Hum Neurosci. 2018;12:88.
    1. Shambroom JR, et al. Validation of an automated wireless system to monitor sleep in healthy adults. J Sleep Res. 2012;21:221–230.
    1. Garcia-Molina G, et al. Closed-loop system to enhance slow-wave activity. J Neural Eng. 2018;15:066018.
    1. Finan PH, et al. Validation of a wireless, self-application, ambulatory electroencephalographic sleep monitoring device in healthy volunteers. J Clin Sleep Med. 2016; 12:1443–1451.
    1. Sterr A, et al. Sleep EEG derived from behind-the-ear electrodes (cEEGrid) compared to standard polysomnography: a proof of concept study. Front Hum Neurosci. 2018;12:452.
    1. Mikkelsen KB, et al. EEG Recorded from the Ear: characterizing the Ear-EEG Method. Front Neurosci. 2015;9:438.
    1. Depner CM, et al. Wearable technologies for developing sleep and circadian biomarkers: a summary of workshop discussions. Sleep. 2020;43(2). doi:10.1093/sleep/zsz254
    1. Bastien CH, et al. Validation of the insomnia severity index as an outcome measure for insomnia research. Sleep Med. 2001;2(4):297–307.
    1. Johns MW. A new method for measuring daytime sleepiness: The Epworth sleepiness scale. Sleep. 1991;14(6):540–545.
    1. Spitzer RL, et al. A brief measure for assessing generalized anxiety disorder: the GAD-7. Arch Intern Med. 2006;166:1092–1097.
    1. Kroenke K, et al. The PHQ-9 : a new depression measure. Psychiatr Ann. 2002;32(9):509–515.
    1. Gutierrez G, et al. Respiratory rate variability in sleeping adults without obstructive sleep apnea. Physiol Rep. 2016;4(17):e12949.
    1. Van Hout S. The American Academy of Sleep Medicine inter-scorer reliability program: sleep stage scoring Richard S. Rosenberg1. J Clin Sleep Med. 2013;9(1):81–87.
    1. Powers DMW. Evaluation: from precision, recall and F-factor to ROC, informedness, markedness & correlation. J Mach Learn Technol. 2011.
    1. Hochreiter S, et al. Long short-term memory. Neural Comput. 1997;9:1735–1780.
    1. Paszke A, et al. Automatic differentiation in PyTorch. NIPS 2017 Workshop Autodiff Decision Program Chairs. 2017;22:2– 8.
    1. Lopez-Gordo MA, et al. Dry EEG electrodes. Sensors (Switzerland). 2014;14(7):12847–12870.
    1. Srinivasa MG, et al. Dry electrodes for bio-potential measurement in wearable systems. In: 2017 2nd IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT), Bangalore, 2017; pp. 270–276.
    1. Lu G, et al. Limitations of oximetry to measure heart rate variability measures. Cardiovasc Eng. 2009;9:119–125.

Source: PubMed

3
Tilaa