Magdy Younes1,2,3, Jill Raneri2, Patrick Hanly2. 1. YRT Ltd, Winnipeg, MB, Canada. 2. Sleep Centre, Foothills Medical Centre, Calgary, Alberta, Canada. 3. Sleep Disorders Centre, Winnipeg, Manitoba, Canada.
Abstract
STUDY OBJECTIVES: To determine the reasons for inter-scorer variability in sleep staging of polysomnograms (PSGs). METHODS: Fifty-six PSGs were scored (5-stage sleep scoring) by 2 experienced technologists, (first manual, M1). Months later, the technologists edited their own scoring (second manual, M2) based upon feedback from the investigators that highlighted differences between their scoring. The PSGs were then scored with an automatic system (Auto) and the technologists edited them, epoch-by-epoch (Edited-Auto). This resulted in 6 different manual scores for each PSG. Epochs were classified as scorer errors (one M1 score differed from the other 5 scores), scorer bias (all 3 scores of each technologist were similar, but differed from the other technologist) and equivocal (sleep scoring was inconsistent within and between technologists). RESULTS: Percent agreement after M1 was 78.9% ± 9.0% and was unchanged after M2 (78.1% ± 9.7%) despite numerous edits (≈40/PSG) by the scorers. Agreement in Edited-Auto was higher (86.5% ± 6.4%, p < 1E-9). Scorer errors (< 2% of epochs) and scorer bias (3.5% ± 2.3% of epochs) together accounted for < 20% of M1 disagreements. A large number of epochs (92 ± 44/PSG) with scoring agreement in M1 were subsequently changed in M2 and/or Edited-Auto. Equivocal epochs, which showed scoring inconsistency, accounted for 28% ± 12% of all epochs, and up to 76% of all epochs in individual patients. Disagreements were largely between awake/NREM, N1/N2, and N2/N3 sleep. CONCLUSION: Inter-scorer variability is largely due to epochs that are difficult to classify. Availability of digitally identified events (e.g., spindles) or calculated variables (e.g., depth of sleep, delta wave duration) during scoring may greatly reduce scoring variability.
STUDY OBJECTIVES: To determine the reasons for inter-scorer variability in sleep staging of polysomnograms (PSGs). METHODS: Fifty-six PSGs were scored (5-stage sleep scoring) by 2 experienced technologists, (first manual, M1). Months later, the technologists edited their own scoring (second manual, M2) based upon feedback from the investigators that highlighted differences between their scoring. The PSGs were then scored with an automatic system (Auto) and the technologists edited them, epoch-by-epoch (Edited-Auto). This resulted in 6 different manual scores for each PSG. Epochs were classified as scorer errors (one M1 score differed from the other 5 scores), scorer bias (all 3 scores of each technologist were similar, but differed from the other technologist) and equivocal (sleep scoring was inconsistent within and between technologists). RESULTS: Percent agreement after M1 was 78.9% ± 9.0% and was unchanged after M2 (78.1% ± 9.7%) despite numerous edits (≈40/PSG) by the scorers. Agreement in Edited-Auto was higher (86.5% ± 6.4%, p < 1E-9). Scorer errors (< 2% of epochs) and scorer bias (3.5% ± 2.3% of epochs) together accounted for < 20% of M1 disagreements. A large number of epochs (92 ± 44/PSG) with scoring agreement in M1 were subsequently changed in M2 and/or Edited-Auto. Equivocal epochs, which showed scoring inconsistency, accounted for 28% ± 12% of all epochs, and up to 76% of all epochs in individual patients. Disagreements were largely between awake/NREM, N1/N2, and N2/N3 sleep. CONCLUSION: Inter-scorer variability is largely due to epochs that are difficult to classify. Availability of digitally identified events (e.g., spindles) or calculated variables (e.g., depth of sleep, delta wave duration) during scoring may greatly reduce scoring variability.
Authors: Stephen D Pittman; Mary M MacDonald; Robert B Fogel; Atul Malhotra; Koby Todros; Baruch Levy; Amir B Geva; David P White Journal: Sleep Date: 2004-11-01 Impact factor: 5.849
Authors: Atul Malhotra; Magdy Younes; Samuel T Kuna; Ruth Benca; Clete A Kushida; James Walsh; Alexandra Hanlon; Bethany Staley; Allan I Pack; Grace W Pien Journal: Sleep Date: 2013-04-01 Impact factor: 5.849
Authors: Peter Anderer; Georg Gruber; Silvia Parapatics; Michael Woertz; Tatiana Miazhynskaia; Gerhard Klosch; Bernd Saletu; Josef Zeitlhofer; Manuel J Barbanoj; Heidi Danker-Hopfe; Sari-Leena Himanen; Bob Kemp; Thomas Penzel; Michael Grozinger; Dieter Kunz; Peter Rappelsberger; Alois Schlogl; Georg Dorffner Journal: Neuropsychobiology Date: 2005-04-18 Impact factor: 2.328
Authors: Xiaozhe Zhang; Xiaosong Dong; Jan W Kantelhardt; Jing Li; Long Zhao; Carmen Garcia; Martin Glos; Thomas Penzel; Fang Han Journal: Sleep Breath Date: 2014-05-07 Impact factor: 2.816
Authors: Heidi Danker-Hopfe; D Kunz; G Gruber; G Klösch; J L Lorenzo; S L Himanen; B Kemp; T Penzel; J Röschke; H Dorn; A Schlögl; E Trenker; G Dorffner Journal: J Sleep Res Date: 2004-03 Impact factor: 3.981
Authors: Samuel T Kuna; Ruth Benca; Clete A Kushida; James Walsh; Magdy Younes; Bethany Staley; Alexandra Hanlon; Allan I Pack; Grace W Pien; Atul Malhotra Journal: Sleep Date: 2013-04-01 Impact factor: 5.849
Authors: Laura B Ray; Stéphane Sockeel; Melissa Soon; Arnaud Bore; Ayako Myhr; Bobby Stojanoski; Rhodri Cusack; Adrian M Owen; Julien Doyon; Stuart M Fogel Journal: Front Hum Neurosci Date: 2015-09-24 Impact factor: 3.169
Authors: Magdy Younes; Samuel T Kuna; Allan I Pack; James K Walsh; Clete A Kushida; Bethany Staley; Grace W Pien Journal: J Clin Sleep Med Date: 2018-02-15 Impact factor: 4.062
Authors: H Kloefkorn; L M Aiani; A Lakhani; S Nagesh; A Moss; W Goolsby; J M Rehg; N P Pedersen; S Hochman Journal: J Neurosci Methods Date: 2020-06-30 Impact factor: 2.390
Authors: Massimiliano de Zambotti; Nicola Cellini; Aimée Goldstone; Ian M Colrain; Fiona C Baker Journal: Med Sci Sports Exerc Date: 2019-07 Impact factor: 5.411
Authors: Daniel J Levendowski; Luigi Ferini-Strambi; Charlene Gamaldo; Mindy Cetel; Robert Rosenberg; Philip R Westbrook Journal: J Clin Sleep Med Date: 2017-06-15 Impact factor: 4.062
Authors: Diane C Lim; Diego R Mazzotti; Kate Sutherland; Jesse W Mindel; Jinyoung Kim; Peter A Cistulli; Ulysses J Magalang; Allan I Pack; Philip de Chazal; Thomas Penzel Journal: Sleep Med Rev Date: 2020-03-20 Impact factor: 11.609
Authors: Elie Gottlieb; Leonid Churilov; Emilio Werden; Thomas Churchward; Matthew P Pase; Natalia Egorova; Mark E Howard; Amy Brodtmann Journal: J Clin Sleep Med Date: 2021-02-01 Impact factor: 4.062
Authors: Brice V McConnell; Eugene Kronberg; Peter D Teale; Stefan H Sillau; Grace M Fishback; Rini I Kaplan; Angela J Fought; A Ranjitha Dhanasekaran; Brian D Berman; Alberto R Ramos; Rachel L McClure; Brianne M Bettcher Journal: Sleep Date: 2021-10-11 Impact factor: 5.849