Automatic sleep stage classification based on subcutaneous EEG in patients with epilepsy

Sirin W Gangstad, Kaare B Mikkelsen, Preben Kidmose, Yousef R Tabar, Sigge Weisdorf, Maja H Lauritzen, Martin C Hemmsen, Lars K Hansen, Troels W Kjaer, Jonas Duun-Henriksen, Sirin W Gangstad, Kaare B Mikkelsen, Preben Kidmose, Yousef R Tabar, Sigge Weisdorf, Maja H Lauritzen, Martin C Hemmsen, Lars K Hansen, Troels W Kjaer, Jonas Duun-Henriksen

Abstract

Background: The interplay between sleep structure and seizure probability has previously been studied using electroencephalography (EEG). Combining sleep assessment and detection of epileptic activity in ultralong-term EEG could potentially optimize seizure treatment and sleep quality of patients with epilepsy. However, the current gold standard polysomnography (PSG) limits sleep recording to a few nights. A novel subcutaneous device was developed to record ultralong-term EEG, and has been shown to measure events of clinical relevance for patients with epilepsy. We investigated whether subcutaneous EEG recordings can also be used to automatically assess the sleep architecture of epilepsy patients.

Method: Four adult inpatients with probable or definite temporal lobe epilepsy were monitored simultaneously with long-term video scalp EEG (LTV EEG) and subcutaneous EEG. In total, 11 nights with concurrent recordings were obtained. The sleep EEG in the two modalities was scored independently by a trained expert according to the American Academy of Sleep Medicine (AASM) rules. By using the sleep stage labels from the LTV EEG as ground truth, an automatic sleep stage classifier based on 30 descriptive features computed from the subcutaneous EEG was trained and tested.

Results: An average Cohen's kappa of [Formula: see text] was achieved using patient specific leave-one-night-out cross validation. When merging all sleep stages into a single class and thereby evaluating an awake-sleep classifier, we achieved a sensitivity of 94.8% and a specificity of 96.6%. Compared to manually labeled video-EEG, the model underestimated total sleep time and sleep efficiency by 8.6 and 1.8 min, respectively, and overestimated wakefulness after sleep onset by 13.6 min.

Conclusion: This proof-of-concept study shows that it is possible to automatically sleep score patients with epilepsy based on two-channel subcutaneous EEG. The results are comparable with the methods currently used in clinical practice. In contrast to comparable studies with wearable EEG devices, several nights were recorded per patient, allowing for the training of patient specific algorithms that can account for the individual brain dynamics of each patient. Clinical trial registered at ClinicalTrial.gov on 19 October 2016 (ID:NCT02946151).

Keywords: Automatic sleep scoring; Epilepsy; Sleep; Subcutaneous EEG; Wearable EEG.

Conflict of interest statement

SWG, JDH and MCH are employed by UNEEGTM medical A/S. MHL and SW is partially funded by UNEEGTM medical A/S. TWK consults for UNEEGTM medical A/S. The remaining authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Box plot of Cohen’s kappa values. The midline in the boxes represent the medians, and the dots represent the means. Red crosses are outliers. The mean value ± standard deviation of the mean for the five-class problem: κPS.=0.78±0.02, κLONO=0.74±0.02 and κexpert=0.66±0.04. Mean value ± standard deviation of the mean for the two-class problem: κPS.=0.85±0.03, κLONO=0.82±0.03 and κexpert=0.81±0.04. The horizontal lines represent intervals of the level of agreement as interpreted by McHugh et al. [18]
Fig. 2
Fig. 2
Representative night: the second night of patient B. Top panel: spectrogram of the proximal subcutaneous EEG channel (P–C). Middle panel: spectrogram of the corresponding scalp channel (P7–T7). Bottom panel: manually scored hypnogram based on scalp EEG and the predicted hypnograms by the PS and LONO algorithm and the human expert
Fig. 3
Fig. 3
Confusion matrices for the five- and two-class problems. Each entry in the matrices provides the percentage P of epochs known to belong to class i that were classified as belonging to class j, for i,j∈{1,…NumberOfClasses, and the raw count. The percentage P in the diagonal equals the class sensitivity. The coloring reflects the magnitude of P, which ranges from 0 to 100 %
Fig. 4
Fig. 4
Comparison of ground truth sleep measures to estimated sleep measures. The blue squares indicate results from the PS algorithm, the red circles are the LONO algorithm and the yellow diamonds are the human expert. Left: scatter plot with Deming regression line, slope of regression line (β) and Pearson’s correlation coefficient (r). Right: Bland–Altman plots. The solid line is the mean difference, and the dotted lines are 1.96 times the standard deviation of the mean
Fig. 5
Fig. 5
Illustration of the subcutaneous recordings system. Left: illustration of the implant and the beta-version of the external device used to collect data in the present study. The placement of the Proximal (P), Center (C) and Distal (D) electrodes are indicated by the letters. The length of the implant is approximately 11 cm. Right: illustration of the commercially available device. The device is worn under the shirt and secured in place by a magnet (gray circle)
Fig. 6
Fig. 6
Manually scored hypnograms based on scalp EEG. The five tick marks on the y-axis represent (from top to bottom) wake, REM sleep, N1, N2 and N3. REM sleep is marked with a red, bold line. Three nights were recorded for each of patients A, C and D, and two nights were recorded for patient B

References

    1. Smith MT, McCrae CS, Cheung J, Martin JL, Harrod CG, Heald JL, Carden KA. Use of actigraphy for the evaluation of sleep disorders and circadian rhythm sleep-wake disorders: an american academy of sleep medicine systematic review, meta-analysis, and grade assessment. J Clin Sleep Med. 2018;14(7):1209–1230. doi: 10.5664/jcsm.7228.
    1. Weisdorf S, Gangstad SW, Duun-Henriksen J, Mosholt KSS, Kjær TW. High similarity between eeg from subcutaneous and proximate scalp electrodes in patients with temporal lobe epilepsy. J Neurophysiol. 2018;120(3):1451–1460. doi: 10.1152/jn.00320.2018.
    1. Sadeh A. The role and validity of actigraphy in sleep medicine: an update. Sleep Med Rev. 2011;15(4):259–267. doi: 10.1016/j.smrv.2010.10.001.
    1. Sadaka Y, Sadeh A, Bradbury L, Massicotte C, Zak M, Go C, Shorer Z, Weiss SK. Validation of actigraphy with continuous video-electroencephalography in children with epilepsy. Sleep Med. 2014;15(9):1075–1081. doi: 10.1016/j.sleep.2014.04.021.
    1. Vilamala A, Madsen KH, Hansen LK. Deep convolutional neural networks for interpretable analysis of EEG sleep stage scoring. 2017;8.
    1. Tsinalis O, Matthews PM, Guo Y. Automatic sleep stage scoring using time-frequency analysis and stacked sparse autoencoders. Ann Biomed Eng. 2016;44(5):1587–1597. doi: 10.1007/s10439-015-1444-y.
    1. Tsinalis O, Matthews PM, Guo Y, Zafeiriou S. Automatic sleep stage scoring with single-channel EEG using convolutional neural networks. 2016;12.
    1. Mousavi S, Afghah F, Acharya UR. Sleepeegnet: automated sleep stage scoring with sequence to sequence deep learning approach. PLos ONE. 2019;14(5):0216456. doi: 10.1371/journal.pone.0216456.
    1. O’Reilly C, Gosselin N, Carrier J, Nielsen T. Montreal archive of sleep studies: an open-access resource for instrument benchmarking and exploratory research. J Sleep Res. 2014;23(6):628–35. 10.1111/jsr.12169, .
    1. Aboalayon KAI, Faezipour M, Almuhammadi WS, Moslehpour S. Sleep stage classification using EEG signal analysis: a comprehensive survey and new investigation. Entropy. 2016;18(9):272. doi: 10.3390/e18090272.
    1. Mikkelsen KB, Villadsen DB, Otto M, Kidmose P. Automatic sleep staging using ear-EEG. Biomed Eng Online. 2017;16(1):111. doi: 10.1186/s12938-017-0400-5.
    1. Mikkelsen KB, Ebajemito JK, Bonmati-Carrion MA, Santhi N, Revell VL, Atzori G, della Monica C, Debener S, Dijk DJ, Sterr A, de Vos M. Machine-learning-derived sleep-wake staging from around-the-ear electroencephalogram outperforms manual scoring and actigraphy. J Sleep Res. 2019;28(2):12786. doi: 10.1111/jsr.12786.
    1. Nakamura T, Goverdovsky V, Morrell MJ, Mandic DP. Automatic sleep monitoring using ear-EEG. IEEE J Transl Eng Health Med. 2017;5:7959059. doi: 10.1109/JTEHM.2017.2702558.
    1. Griessenberger H, Heib DPJ, Kunz AB, Hoedlmoser K, Schabus M. Assessment of a wireless headband for automatic sleep scoring. Sleep Breath. 2013;17(2):747–752. doi: 10.1007/s11325-012-0757-4.
    1. Myllymaa S, Muraja-Murro A, Westeren-Punnonen S, Hukkanen T, Lappalainen R, Mervaala E, Töyräs J, Sipilä K, Myllymaa K. Assessment of the suitability of using a forehead EEG electrode set and chin emg electrodes for sleep staging in polysomnography. J Sleep Res. 2016;25(6):636–645. doi: 10.1111/jsr.12425.
    1. Baumgartner C, Koren JP. Seizure detection using scalp-EEG. Epilepsia. 2018;59(Suppl. 1):14–22. doi: 10.1111/epi.14052.
    1. Duun-Henriksen J, Kjaer TW, Looney D, Atkins MD, Sørensen JA, Rose M, Mandic DP, Madsen RE, Juhl CB. Eeg signal quality of a subcutaneous recording system compared to standard surface electrodes. J Sensors. 2015;2015:341208. doi: 10.1155/2015/341208.
    1. McHugh ML. Interrater reliability: the kappa statistic. Biochem Med. 2012 doi: 10.11613/BM.2012.031.
    1. Bazil CW. Epilepsy and sleep disturbance. Epilepsy Behav. 2003;4(2):39–453945. doi: 10.1016/j.yebeh.2003.07.005.
    1. Lanigar S, Bandyopadhyay S. Sleep and epilepsy: a complex interplay. Mo Med. 2017;114(6):453–457453457.
    1. Kosmadopoulos A, Sargent C, Darwent D, Zhou X, Roach GD. Alternatives to polysomnography (PSG): a validation of wrist actigraphy and a partial-PSG system. Behav Res Methods. 2014;46(4):1032–1041. doi: 10.3758/s13428-013-0438-7.
    1. Slater JA, Botsis T, Walsh J, King S, Straker LM, Eastwood PR. Assessing sleep using hip and wrist actigraphy. Sleep Biol Rhythm. 2015;13(2):172–180. doi: 10.1111/sbr.12103.
    1. Werner H, Molinari L, Guyer C, Jenni OG. Agreement rates between actigraphy, diary, and questionnaire for children’s sleep patterns. Archiv Pediatr Adolesc Med. 2008;162(4):350–358. doi: 10.1001/archpedi.162.4.350.
    1. Danker-Hopfe H, Kunz D, Gruber G, Klösch G, Lorenzo JL, Himanen SL, Kemp B, Penzel T, Röschke J, Dorn H, Schlögl A, Trenker E, Dorffner G. Interrater reliability between scorers from eight European sleep laboratories in subjects with different sleep disorders. J Sleep Res. 2004;13(1):63–9. doi: 10.1046/j.1365-2869.2003.00375.x.
    1. Pornsriniyom D, Kim HW, Bena J, Andrews ND, Moul D, Foldvary-Schaefer N. Effect of positive airway pressure therapy on seizure control in patients with epilepsy and obstructive sleep apnea. Epilepsy Behav. 2014;37:270–275. doi: 10.1016/j.yebeh.014.07.005.
    1. Munk AM, Olesen KV, Gangstad SW, Hansen LK. Semi-supervised sleep-stage scoring based on single channel EEG. In: Proceedings of 2018 IEEE international conference on acoustics, speech and signal processing. 2018;2018-:2551–2555. 10.1109/ICASSP.2018.8461982.
    1. Berry R, Brooks R, Gamaldo C et al. The AASM manual for the scoring of sleep and associated events: rules, terminology and technical specifications. Version 2.4 ed. 2017.
    1. Consumer Technology Association, National Sleep Foundation. Definitions and characteristics for wearable sleep monitors, Ansi/cta-2052.1 edn. Consumer Technology Association, National Sleep Foundation; 2016.
    1. Reed DL, Sacco WP. Measuring sleep efficiency: what should the denominator be? J Clin Sleep Med. 2016;12(2):263–266. doi: 10.5664/jcsm.5498.
    1. Bland J, Altman D. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;1(8476):307–310. doi: 10.1016/S0140-6736(86)90837-8.
    1. Ernst MD. Permutation methods: a basis for exact inference. Stat Sci. 2004;19(4):676–685. doi: 10.1214/088342304000000396.

Source: PubMed

3
Předplatit