Machine-learning-derived sleep-wake staging from around-the-ear electroencephalogram outperforms manual scoring and actigraphy

Kaare B Mikkelsen, James K Ebajemito, Maria A Bonmati-Carrion, Nayantara Santhi, Victoria L Revell, Giuseppe Atzori, Ciro Della Monica, Stefan Debener, Derk-Jan Dijk, Annette Sterr, Maarten de Vos, Kaare B Mikkelsen, James K Ebajemito, Maria A Bonmati-Carrion, Nayantara Santhi, Victoria L Revell, Giuseppe Atzori, Ciro Della Monica, Stefan Debener, Derk-Jan Dijk, Annette Sterr, Maarten de Vos

Abstract

Quantification of sleep is important for the diagnosis of sleep disorders and sleep research. However, the only widely accepted method to obtain sleep staging is by visual analysis of polysomnography (PSG), which is expensive and time consuming. Here, we investigate automated sleep scoring based on a low-cost, mobile electroencephalogram (EEG) platform consisting of a lightweight EEG amplifier combined with flex-printed cEEGrid electrodes placed around the ear, which can be implemented as a fully self-applicable sleep system. However, cEEGrid signals have different amplitude characteristics to normal scalp PSG signals, which might be challenging for visual scoring. Therefore, this study evaluates the potential of automatic scoring of cEEGrid signals using a machine learning classifier ("random forests") and compares its performance with manual scoring of standard PSG. In addition, the automatic scoring of cEEGrid signals is compared with manual annotation of the cEEGrid recording and with simultaneous actigraphy. Acceptable recordings were obtained in 15 healthy volunteers (aged 35 ± 14.3 years) during an extended nocturnal sleep opportunity, which induced disrupted sleep with a large inter-individual variation in sleep parameters. The results demonstrate that machine-learning-based scoring of around-the-ear EEG outperforms actigraphy with respect to sleep onset and total sleep time assessments. The automated scoring outperforms human scoring of cEEGrid by standard criteria. The accuracy of machine-learning-based automated scoring of cEEGrid sleep recordings compared with manual scoring of standard PSG was satisfactory. The findings show that cEEGrid recordings combined with machine-learning-based scoring holds promise for large-scale sleep studies.

Keywords: EEG; automated sleep scoring; ear EEG; home EEG; mobile EEG.

Conflict of interest statement

JKE, MABC, NS, VLR, SD, DJD and AS declare no conflicts of interest, including any involvement in organizations with a financial interest in the subject matter of the paper. KBM and MDV received a grant from Circadian Therapeutics to perform this study. MDV is a founding member of Circadian Therapeutics.

© 2018 The Authors. Journal of Sleep Research published by John Wiley & Sons Ltd on behalf of European Sleep Research Society.

Figures

Figure 1
Figure 1
Participant wearing both cEEGrid electrode array and polysomnography electrodes. Permission was obtained from the individual for the publication of this image
Figure 2
Figure 2
Comparison of electrode derivations. The metric is “CI” as defined in Equation (1). “TB” in this context means “top versus bottom” and FB(L) and FB(R) mean “front versus back”, for left and right ear, respectively. Values for TB(L) and TB(R) have been plotted as “detached” from the rest of the derivations because they have been positioned on the x‐axis outside of their place in the ordering (which would have hidden them inside the group of 56 “single‐electrode” derivations)
Figure 3
Figure 3
(a) Visualization of the three chosen derivations. FB(L) and FB(R) are within‐C, and L–R is the difference between the average within each C. (b) Analysis of reliability of the three chosen derivations. “Accepted data” is equivalent to gi in (1). FB(L) and FB(R) mean “front versus back”, for left and right ear, respectively; L–R, average of left electrodes versus average of right electrodes.
Figure 4
Figure 4
Representative hypnograms for a sleep recording. Top shows the manual scoring, using polysomnography data. Middle shows manual scoring using cEEGrid data and bottom shows automatic scoring after post‐processing, using leave‐one‐subject‐out cross‐validation
Figure 5
Figure 5
Comparison of classification performance for various automatic sleep staging algorithms and one manual classification. (a) Using only sleep–wake scores, (b) using five‐stage scoring. Labels on the x‐axis are described in detail in Table 2
Figure 6
Figure 6
Sleep statistics. See Table 2 for a description of the methods compared. For each plot four r2‐values can be calculated, according to how well the straight line fits to the scatter plot. The r2‐values for each plot are (in the order of the legend): total sleep (0.72; 0.84; 0.89; 0.61; 0.78), sleep efficiency (0.34; 0.49; 0.65; 0.25; 0.27), WASO (0.55; 0.70; 0.52; 0.42; 0.32), SOL (0.94; 0.96; 0.61; 0.67; 0.63), REM latency (0.04; 0.00; 0.03; 0.21; NaN). WASO, wake‐after‐sleep‐onset; REM, rapid eye movement; SOL, sleep onset latency
Figure 7
Figure 7
Analysis of classifier performance as a function of number of features. Box plots show results of random sampling from the feature space; solid line shows performance of N highest ranked features. We notice that the optimal number of features appears to be around 20–30, but also that the overfitting resulting from using the full set is relatively minor (given that the decrease in kappa is only from 0.58 to 0.53)

References

    1. Bassetti, C. L. (2005). Sleep and stroke. Seminars in Neurology, 25(1), 19–32. 10.1055/s-2005-867073
    1. Berry, R. B. , Brooks, R. , Gamaldo, C. , Harding, S. M. , Lloyd, R. M. , Quan, S. F. , … Vaughn, B. V. (2017). AASM scoring manual updates for 2017 (version 2.4). Journal of Clinical Sleep Medicine, 13(5), 665–666. 10.5664/jcsm.6576
    1. Bleichner, M. G. , & Debener S. (2017). Concealed, Unobtrusive Ear–Centered EEG Acquisition: cEEGrids for Transparent EEG. Front Hum Neurosci, 11, 163.
    1. Boostani, R. , Karimzadeh, F. , & Nami, M. (2017). A comparative review on sleep stage classification methods in patients and healthy individuals. Computer Methods and Programs in Biomedicine, 140, 77–91. 10.1016/j.cmpb.2016.12.004
    1. Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. 10.1023/a:1010933404324
    1. Buysse, D. J. , Reynolds, C. F. , Monk, T. H. , Berman, S. R. , & Kupfer, D. J. (1989). The Pittsburgh Sleep Quality Index: A new instrument for psychiatric practice and research. Psychiatry Research, 28(2), 193–213. 10.1016/0165-1781(89)90047-4
    1. Carr, O. , Saunders, K. E. A. , Tsanas, A. , Bilderbeck, A. C. , Palmius, N. , Geddes, J. R. , … De Vos, M. (2018). Variability in phase and amplitude of diurnal rhythms is related to variation of mood in bipolar and borderline personality disorder. Scientific Reports, 8, Article number: 1649 (2018).
    1. Ceriani, L. , & Verme, P. (2011). The origins of the Gini index: Extracts from Variabilit`a e Mutabilit`a (1912) by Corrado Gini. Journal of Economic Inequality, 10, 421–443.
    1. Cho, K. , Ennaceur, A. , Cole, J. C. , & Suh, C. K. (2000). Chronic jet lag produces cognitive deficits. Journal of Neuroscience, 20(6), RC66 10.1523/jneurosci.20-06-j0005.2000
    1. De Vos, M. , Gandras, K. , & Debener, S. (2014). Towards a truly mobile auditory brain–computer interface: Exploring the P300 to take away. International Journal of Psychophysiology, 91(1), 46–53. 10.1016/j.ijpsycho.2013.08.010
    1. Debener, S. , Emkes, R. , de Vos, M. , & Bleichner, M. (2015). Unobtrusive ambulatory EEG using a smartphone and flexible printed electrodes around the ear. Scientific Reports, 5, 16743 10.1038/srep16743
    1. Debener, S. , Minow, F. , Emkes, R. , Gandras, K. , & de Vos, M. (2012). How about taking a low‐cost, small, and wireless EEG for a walk? Psychophysiology, 49(11), 1617–1621. 10.1111/j.1469-8986.2012.01471.x
    1. Delorme, A. , & Makeig, S. (2004). EEGLAB: An open source toolbox for analysis of single‐trial EEG dynamics including independent component analysis. Journal of Neuroscience Methods, 134(1), 9–21. 10.1016/j.jneumeth.2003.10.009
    1. Garaulet, M. , Ordovás, J. M. , & Madrid, J. A. (2010). The chronobiology, etiology and pathophysiology of obesity. International Journal of Obesity, 34(12), 1667–1683. 10.1038/ijo.2010.118
    1. Griessenberger, H. , Heib, D. P. J. , Kunz, A. B. , Hoedlmoser, K. , & Schabus, M. (2013). Assessment of a wireless headband for automatic sleep scoring. Sleep and Breathing, 17(2), 747–752. 10.1007/s11325-012-0757-4
    1. Horne, J. A., & Ostberg O. (1976). A self–assessment questionnaire to determine morningness‐eveningness in human circadian rhythms. Int J Chronobiol. 4(2), 97–110. PMID: 1027738
    1. Knutsson, A. , & Bøggild, H. (2011). Shiftwork and cardiovascular disease: Review of disease mechanisms. Reviews on Environmental Health, 15(4), 359–372.
    1. Koley, B. , & Dey, D. (2012). An ensemble system for automatic sleep stage classification using single channel EEG signal. Computers in Biology and Medicine, 42(12), 1186–1195. 10.1016/j.compbiomed.2012.09.012
    1. Luay, F. , Khaldon, L. , Khasawneh, N. , Wenz, H. , & Hartmut, D. (2012). Automated sleep stage identification system based on time–frequency analysis of a single EEG channel and random forest classifier. Computer Methods and Programs in Biomedicine, 108(1), 10–19. 10.1016/j.cmpb.2011.11.005. ()
    1. Levendowski, D. J. , Ferini‐Strambi, L. , Gamaldo, C. , Cetel, M. , Rosenberg, R. , & Westbrook, P. R. (2017). The accuracy, night‐to‐night variability, and stability of frontopolar sleep electroencephalography biomarkers. Journal of Clinical Sleep Medicine, 13(6), 791–803. 10.5664/jcsm.6618
    1. Looney, D. , Goverdovsky, V. , Rosenzwei, I. , Morrell, M. J. , & Mandic, D. P. (2016). A wearable in‐ear encephalography sensor for monitoring sleep: Preliminary observations from nap studies. Annals of the American Thoracic Society, 13, 2229–2233. 10.1513/annalsats.201605-342bc
    1. Lucey, B. P. , Mcleland, J. S. , Toedebusch, C. D. , Boyd, J. , Morris, J. C. , Landsness, E. C. , … Holtzman, D. M. (2016). Comparison of a single‐channel EEG sleep study to polysomnography. Journal of sleep research, 25(6), 625–635. 10.1111/jsr.12417
    1. McGraw, K. O. , & Wong, S. P. (1996). Forming inferences about some intraclass correlation coefficients. Psychological Methods, 1, 30–46.
    1. Mikkelsen, K. , & de Vos, M. (2018). Personalizing deep learning models for automatic sleep staging [Online]. Retrieved from .
    1. Mikkelsen, K. B. , Kappel, S. L. , Mandic, D. P. , & Kidmose, P. (2015). EEG recorded from the ear: Characterizing the ear‐EEG method. Frontiers in Neuroscience, 9, 438.
    1. Mikkelsen, K. , Kidmose, P. , & Hansen, L. K. (2017). On the keyhole hypothesis: High mutual information between Ear and Scalp EEG. Frontiers in Human Neuroscience, 11, 341 10.3389/fnhum.2017.00341
    1. Mikkelsen, K. B. , Villadsen, D. B. , Otto, M. , & Kidmose, P. (2017). Automatic sleep staging using ear–EEG. Biomedical Engineering Online, 16(1), 111.
    1. Myllymaa, S. , Muraja‐Murro, A. , Westeren‐Punnonen, S. , Hukkanen, T. , Lappalainen, R. , Mervaala, E. , … Myllymaa, K. (2016). Assessment of the suitability of using a forehead EEG electrode set and chin EMG electrodes for sleep staging in polysomnography. Journal of Sleep Research, 25(6), 636–645. 10.1111/jsr.12425
    1. Pacharra, M. , Debener, S. , & Wascher, E. (2017). Concealed around‐the‐ear EEG captures cognitive processing in a visual Simon task. Frontiers in Human Neuroscience, 11, 290 10.3389/fnhum.2017.00290
    1. Phan, H. , Andreotti, F. , Cooray, N. , Chén, O. Y. , & de Vos, M. (2018). Joint classification and prediction CNN framework for automatic sleep stage classification. Retrieved from .
    1. Popovic, D. , Khoo, M. , & Westbrook, P. (2014). Automatic scoring of sleep stages and cortical arousals using two electrodes on the forehead: Validation in healthy adults. Journal of Sleep Research, 23(2), 211–221. 10.1111/jsr.12105
    1. Salarian, A. (2016). Intraclass correlation coefficient (version 1.3.1.0). Retrieved from , At: MATLAB Central File Exchange.
    1. Shambroom, J. R. , Fabregas, S. E. , & Johnstone, J. (2012). Validation of an automated wireless system to monitor sleep in healthy adults. Journal of sleep research, 21(2), 221–230. 10.1111/j.1365-2869.2011.00944.x
    1. Sterr, A. , Ebajemito, J. K. , Mikkelsen, K. B. , Bonmati‐Carrion, M. , Santhi, N. , Atzori, G. , … De Vos, M. (2018). Reliable detection of sleep stages derived from behind‐the‐ear electrodes (cEEGrid): A comparison to standard polysomnography. Frontiers in Human Neuroscience, in press.
    1. Stochholm, A. , Mikkelsen, K. B. , & Kidmose, P. (2016). Automatic sleep stage classification using ear‐EEG 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). Orlando, FL, USA: IEEE, 4751–4754. 10.1109/embc.2016.7591789
    1. Werth, E. , & Borbely, A. A. (1995). Recording the sleep EEG with periorbital skin electrodes. Electroencephalography and Clinical Neurophysiology, 94(6), 406–413. 10.1016/0013-4694(94)00337-k
    1. Younes, M. , Soiferman, M. , Thompson, W. , & Giannouli, E. (2017). Performance of a new portable wireless sleep monitor. Journal of Clinical Sleep Medicine, 13(2), 245–258. 10.5664/jcsm.6456
    1. Zibrandtsen, I. , Kidmose, P. , Otto, M. , Ibsen, J. , & Kjaer, T. W. (2016). Case comparison of sleep features from ear‐EEG and scalp‐EEG. Sleep Science, 9, 69–72. 10.1016/j.slsci.2016.05.006
    1. Zich, C. , Debener, S. , Kranczioch, C. , Bleichner, M. G. , Gutberlet, I. , & de Vos, M. (2015). Real‐time EEG feedback during simultaneous EEG–fMRI identifies the cortical signature of motor imagery. NeuroImage, 114, 438–447. 10.1016/j.neuroimage.2015.04.020

Source: PubMed

3
Abonner