Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Interrater sleep stage scoring reliability between manual scoring from two European sleep centers and automatic scoring performed by the artificial intelligence-based Stanford-STAGES algorithm.

Literature DB >> 33599203

Interrater sleep stage scoring reliability between manual scoring from two European sleep centers and automatic scoring performed by the artificial intelligence-based Stanford-STAGES algorithm.

Matteo Cesari¹, Ambra Stefani¹, Thomas Penzel^2,3, Abubaker Ibrahim¹, Heinz Hackner¹, Anna Heidbreder¹, András Szentkirályi⁴, Beate Stubbe⁵, Henry Völzke⁶, Klaus Berger⁴, Birgit Högl¹.

Abstract

STUDY
OBJECTIVES: The objective of this study was to evaluate interrater reliability between manual sleep stage scoring performed in 2 European sleep centers and automatic sleep stage scoring performed by the previously validated artificial intelligence-based Stanford-STAGES algorithm.
METHODS: Full night polysomnographies of 1,066 participants were included. Sleep stages were manually scored in Berlin and Innsbruck sleep centers and automatically scored with the Stanford-STAGES algorithm. For each participant, we compared (1) Innsbruck to Berlin scorings (INN vs BER); (2) Innsbruck to automatic scorings (INN vs AUTO); (3) Berlin to automatic scorings (BER vs AUTO); (4) epochs where scorers from Innsbruck and Berlin had consensus to automatic scoring (CONS vs AUTO); and (5) both Innsbruck and Berlin manual scorings (MAN) to the automatic ones (MAN vs AUTO). Interrater reliability was evaluated with several measures, including overall and sleep stage-specific Cohen's κ.
RESULTS: Overall agreement across participants was substantial for INN vs BER (κ = 0.66 ± 0.13), INN vs AUTO (κ = 0.68 ± 0.14), CONS vs AUTO (κ = 0.73 ± 0.14), and MAN vs AUTO (κ = 0.61 ± 0.14), and moderate for BER vs AUTO (κ = 0.55 ± 0.15). Human scorers had the highest disagreement for N1 sleep (κN1 = 0.40 ± 0.16 for INN vs BER). Automatic scoring had lowest agreement with manual scorings for N1 and N3 sleep (κN1 = 0.25 ± 0.14 and κN3 = 0.42 ± 0.32 for MAN vs AUTO).
CONCLUSIONS: Interrater reliability for sleep stage scoring between human scorers was in line with previous findings, and the algorithm achieved an overall substantial agreement with manual scoring. In this cohort, the Stanford-STAGES algorithm showed similar performances to the ones achieved in the original study, suggesting that it is generalizable to new cohorts. Before its integration in clinical practice, future independent studies should further evaluate it in other cohorts.

Entities: Chemical

Keywords: automatic scoring; computerized analysis; deep neural networks; interrater variability; slow wave activity; study of health in Pomerania

Mesh：

Year: 2021 PMID： 33599203 PMCID： PMC8314654 DOI： 10.5664/jcsm.9174

Source DB: PubMed Journal: J Clin Sleep Med ISSN： 1550-9389 Impact factor: 4.324

34 in total

Review 1. Digital analysis and technical specifications.

Authors: Thomas Penzel; Max Hirshkowitz; John Harsh; Ron D Chervin; Nic Butkov; Meir Kryger; Beth Malow; Michael V Vitiello; Michael H Silber; Clete A Kushida; Andrew L Chesson
Journal: J Clin Sleep Med Date: 2007-03-15 Impact factor: 4.062

2. Cohort profile: the study of health in Pomerania.

Authors: Henry Völzke; Dietrich Alte; Carsten Oliver Schmidt; Dörte Radke; Roberto Lorbeer; Nele Friedrich; Nicole Aumann; Katharina Lau; Michael Piontek; Gabriele Born; Christoph Havemann; Till Ittermann; Sabine Schipf; Robin Haring; Sebastian E Baumeister; Henri Wallaschofski; Matthias Nauck; Stephanie Frick; Andreas Arnold; Michael Jünger; Julia Mayerle; Matthias Kraft; Markus M Lerch; Marcus Dörr; Thorsten Reffelmann; Klaus Empen; Stephan B Felix; Anne Obst; Beate Koch; Sven Gläser; Ralf Ewert; Ingo Fietze; Thomas Penzel; Martina Dören; Wolfgang Rathmann; Johannes Haerting; Mario Hannemann; Jürgen Röpcke; Ulf Schminke; Clemens Jürgens; Frank Tost; Rainer Rettig; Jan A Kors; Saskia Ungerer; Katrin Hegenscheid; Jens-Peter Kühn; Julia Kühn; Norbert Hosten; Ralf Puls; Jörg Henke; Oliver Gloger; Alexander Teumer; Georg Homuth; Uwe Völker; Christian Schwahn; Birte Holtfreter; Ines Polzer; Thomas Kohlmann; Hans J Grabe; Dieter Rosskopf; Heyo K Kroemer; Thomas Kocher; Reiner Biffar; Ulrich John; Wolfgang Hoffmann
Journal: Int J Epidemiol Date: 2010-02-18 Impact factor: 7.196

3. Learning machines and sleeping brains: Automatic sleep stage classification using decision-tree multi-class support vector machines.

Authors: Tarek Lajnef; Sahbi Chaibi; Perrine Ruby; Pierre-Emmanuel Aguera; Jean-Baptiste Eichenlaub; Mounir Samet; Abdennaceur Kachouri; Karim Jerbi
Journal: J Neurosci Methods Date: 2015-01-25 Impact factor: 2.390

4. Minimizing Interrater Variability in Staging Sleep by Use of Computer-Derived Features.

Authors: Magdy Younes; Patrick J Hanly
Journal: J Clin Sleep Med Date: 2016-10-15 Impact factor: 4.062

5. Prevalence and associated risk factors of periodic limb movement in sleep in two German population-based studies.

Authors: András Szentkirályi; Ambra Stefani; Heinz Hackner; Maria Czira; Inga K Teismann; Henry Völzke; Beate Stubbe; Sven Gläser; Ralf Ewert; Thomas Penzel; Ingo Fietze; Peter Young; Birgit Högl; Klaus Berger
Journal: Sleep Date: 2019-03-01 Impact factor: 5.849

Review 6. A comparative review on sleep stage classification methods in patients and healthy individuals.

Authors: Reza Boostani; Foroozan Karimzadeh; Mohammad Nami
Journal: Comput Methods Programs Biomed Date: 2016-12-10 Impact factor: 5.428

7. A study of the diagnostic utility of HLA typing, CSF hypocretin-1 measurements, and MSLT testing for the diagnosis of narcolepsy in 163 Korean patients with unexplained excessive daytime sleepiness.

Authors: Seung-Chul Hong; Ling Lin; Jong-Hyun Jeong; Yoon-Kyung Shin; Jin-Hee Han; Ji-Hyun Lee; Sung-Pil Lee; Jing Zhang; Mali Einen; Emmanuel Mignot
Journal: Sleep Date: 2006-11 Impact factor: 5.849

Review 8. Reinventing polysomnography in the age of precision medicine.

Authors: Diane C Lim; Diego R Mazzotti; Kate Sutherland; Jesse W Mindel; Jinyoung Kim; Peter A Cistulli; Ulysses J Magalang; Allan I Pack; Philip de Chazal; Thomas Penzel
Journal: Sleep Med Rev Date: 2020-03-20 Impact factor: 11.609

9. Actigraphic sleep duration and fragmentation are related to obesity in the elderly: the Rotterdam Study.

Authors: J F van den Berg; A Knvistingh Neven; J H M Tulen; A Hofman; J C M Witteman; H M E Miedema; H Tiemeier
Journal: Int J Obes (Lond) Date: 2008-04-15 Impact factor: 5.095

10. Agreement in the scoring of respiratory events and sleep among international sleep centers.

Authors: Ulysses J Magalang; Ning-Hung Chen; Peter A Cistulli; Annette C Fedson; Thorarinn Gíslason; David Hillman; Thomas Penzel; Renaud Tamisier; Sergio Tufik; Gary Phillips; Allan I Pack
Journal: Sleep Date: 2013-04-01 Impact factor: 5.849

6 in total

1. Automated Scoring of Sleep and Associated Events.

Authors: Peter Anderer; Marco Ross; Andreas Cerny; Edmund Shaw
Journal: Adv Exp Med Biol Date: 2022 Impact factor: 3.650

2. Objective underpinnings of self-reported sleep quality in middle-aged and older adults: The importance of N2 and wakefulness.

Authors: Renske Lok; Dwijen Chawra; Flora Hon; Michelle Ha; Katherine A Kaplan; Jamie M Zeitzer
Journal: Biol Psychol Date: 2022-02-19 Impact factor: 3.111

6. Computer-assisted analysis of polysomnographic recordings improves inter-scorer associated agreement and scoring times.

Authors: Diego Alvarez-Estevez; Roselyne M Rijsman
Journal: PLoS One Date: 2022-09-29 Impact factor: 3.752