| Literature DB >> 26734654 |
Nathan J Stevenson1, Robert R Clancy2, Sampsa Vanhatalo3, Ingmar Rosén4, Janet M Rennie5, Geraldine B Boylan1.
Abstract
OBJECTIVE: To determine the interobserver agreement (IOA) of neonatal seizure detection using the gold standard of conventional, multichannel EEG.Entities:
Year: 2015 PMID: 26734654 PMCID: PMC4693620 DOI: 10.1002/acn3.249
Source DB: PubMed Journal: Ann Clin Transl Neurol ISSN: 2328-9503 Impact factor: 4.511
Figure 1The assessment of agreement between the annotations of two simulated observers. The annotation of observer 2 is compared to observer 1. A – positive (seizure) agreement, B – negative (nonseizure) agreement, C – positive disagreement, D – negative disagreement. In this case, an event‐based assessment results in a positive agreement rate of 66.6% (2/3 events) and 0.6 positive disagreements per hour (one disagreement in 100 min). A temporal assessment results in a positive agreement rate of 50% (15/30 min) and a negative agreement rate of 71.4% (50/70 min). Kappa is calculated using a combination of temporal measures (A–D). These are one‐sided measurements of agreement; in order to form a complete estimate, the average of these measures based on a comparison of observer 1 against observer 2 and observer 2 against observer 1 is used. This averaging is inherent in kappa, which is 0.205, and in this example, equivalent to modified kappa.
A summary of the various analyses of interobserver agreement applied to the cohort of 70 neonates
| Kappa | ICC | PA | NA | PD/h | |
|---|---|---|---|---|---|
| Raw annotations | |||||
| Temporal |
|
|
| ||
| Event |
|
| |||
| Interpreted annotations | |||||
| Presence of seizure |
| ||||
| Presence of SE |
| ||||
| Characteristics of seizures |
| ||||
Results based on the analysis of the concatenated recording will be summarized as value (95% CI) and results based on the analysis of individual recordings will be summarized across the cohort as median (IQR). ICC, intraclass correlation coefficient; PA, positive (seizure) agreement; NA, negative (nonseizure) agreement; PD/h, positive disagreements per hour.
1, Estimated on concatenated annotations.
2, Estimated on annotations of seizure neonates individually.
3, Estimated on annotations of all neonates individually.
Demographics and EEG recording information relating to the 70 neonates used in this study
| Gestational age (weeks+days) | 40+3 (39+2 to 41+2) |
| Birthweight (g) | 3526 (3140 to 3920) |
| Gender (male:female) | 37:33 |
| Age at EEG onset (h) | 7.0 (3.6 to 19.0) |
| EEG recording duration (h) | 51.6 (21.5 to 84.4) |
Median (interquartile range).
HIE ‐ hypoxic iscaemic encephaloptahy. Ten mild, 19 moderate, eight severe HIE. Twenty‐two were treated with therapeutic hypothermia (TH).
Asphyxia was defined as depressed Apgar scores with no subsequent signs of encephalopathy.
Other: includes one or more conditions such as meningitis, viral encephalitis, sepsis, meconium aspiration syndrome, persistent pulmonary hypertension, respiratory distress syndrome, benign familial neonatal seizures, benign sleep myoclonus, and unknown diagnosis.
Figure 2Example seizures from the EEG recordings of neonates with high agreement. (A) kappa was 0.982, total seizure burden was [55, 55, and 56] minutes, and seizure number was [3, 2, and 2], seizure duration = [1106, 1656, and 1675] seconds, for observers 3, 2 and 1 respectively. (B) kappa was 0.950, total seizure burden was [201, 192, and 205] minutes, seizure number was [24, 26, and 24] and seizure duration was [502, 442, and 512] seconds, for observers 1, 2, and 3, respectively. Obs1, Obs2, and Obs3 are the annotations of observer 3, 2, and 1, respectively, S is seizure and NS is nonseizure.
Figure 3The decrease in interobserver agreement due to seizures with rare occurrence or short duration; in total and on a per neonate basis. (A) The positive agreement (event‐based assessment) per neonate in those with seizures annotated by at least one observer (n = 42). The agreement was reduced in neonates with a limited number of seizures, particularly if these seizures were of short duration. The black asterisks are the kappa value (expressed as a percent) for each neonate. Neonates 1–33 had annotations from all 3 observers, neonates 34–35 had annotations from 2 observers (n = 2) and neonates 36–42 had annotations from 1 observer (n = 7). (B) The number of seizures in each neonate (plotted on a logarithmic scale, base 10). (C) The mean seizure duration in each neonate (plotted on a logarithmic scale, base 10). (D) Positive agreement rate with respect to seizure duration summarized across all seizures. Observers were more likely to disagree if the seizure duration was less than 30 sec. (E) The number of seizures with a given seizure duration. All measures are averaged across observers. For (D and E) the last bin of the histogram contains seizures of duration greater than 330 sec (Inf is infinity).
The relationship between interobserver agreement and the degree of peak hourly seizure burden (HSBp)
| HSBp cutoff | |||
|---|---|---|---|
| >5 mins/h | >12 mins/h | >30 mins/h | |
| Kappa | 0.904 (0.809–0.980) | 0.818 (0.696–0.920) | 0.815 (0.641–0.950) |
| PI | 0.105 | 0.238 | 0.619 |
| BI | 0.010 | 0.019 | 0.010 |
|
| 34 | 31 | 17 |
| 3 Observers | 29 | 22 | 11 |
| 2 Observers | 2 | 5 | 1 |
| 1 Observer | 3 | 4 | 5 |
Kappa is presented as value (95% CI). PI is the prevalence index, BI is the bias index, N is the number of neonates identified as exceeding the HSBp by at least one observer, and X observers defines the number of observers who determined that the HSBp was exceeded.
Seizure characteristics in nine neonates where at least one observer did not agree on the presence of seizures
| Neonate | Cohen's | Observers | TSB (minutes) | Seizure number | Mean seizure duration (seconds) | Channel localization |
|---|---|---|---|---|---|---|
| 1 | – | 3 | 2.2 | 1 | 133 | Generalized |
| 2 | – | 3 | 3.7 | 2 | 110 | Focal (P4) |
| 3 | – | 1 | 3 | 2 | 90 | Focal (F3, T3) |
| 4 | – | 3 | 0.4 | 1 | 23 | Focal (F4) |
| 5 | – | 2 | 4.3 | 8 | 32 | Generalized and Focal (P3, T3) |
| 6 | – | 1 | 1.4 | 1 | 81 | Focal (O2) |
| 7 | – | 1 | 0.8 | 2 | 24 | Focal (Cz) |
| 8 | 0.592 | 2, 3 | 23.5, 26.1 | 12, 11 | 118, 142 | Focal (O1, O2) |
| 9 | 0.302 | 1, 3 | 20.1, 3.7 | 4, 1 | 310, 221 | Focal (T4, O2) |
In the case where two observers identified seizures, the individual characteristics of each observer's annotation have been listed. κ is kappa, TSB is the total seizure burden.
Estimated when at least two observers have detected seizures in a neonate.
Figure 4Example EEG where observers disagreed on the presence of seizure. The montage used at the time of recording is used to present the EEGs. (A) Seizure identified by one observer, total seizure burden was 3 min and seizure number was 2 – predominant seizure activity on O2. (B) Seizure identified by two observers, total seizure burden was [23.5, 26.1] minutes and seizure number was [12, 11] – predominant seizure activity on O1. (C) Seizure identified by one observer, total seizure burden was 3.7 min and seizure number was 2 – predominant seizure activity on F3. The scale is identical for all EEG segments.
Characteristics of seizures as annotated by each observer summarized across neonates with seizures
| Observer 1 ( | Observer 2 ( | Observer 3 ( | ICC ( | |
|---|---|---|---|---|
| Total seizure burden (mins) | 79.3 (25.4–199.8) | 87.7 (32.7–197.8) | 74.7 (24.6–189.5) | 0.996 (0.994–0.997) |
| Seizure number | 24 (6–73) | 28 (11–106) | 27 (5–77) | 0.974 (0.962–0.983) |
| Seizure duration (mean) (sec) | 155 (84–319) | 126 (90–253) | 128 (90–221) | 0.923 (0.885–0.950) |
| Seizure Duration (median) (sec) | 83 (56–140) | 93 (56–155) | 83 (55–140) | 0.866 (0.801–0.913) |
| Seizure onset (h) | 17.2 (10.8–34.6) | 17.7 (10.6–35.8) | 19.0 (10.9–33.8) | 0.982 |
| Seizure offset (h) | 50.8 (32.2–92.1) | 50.6 (34.5–100.0) | 54.8 (33.9–106.1) | 0.912 |
| Seizure period (h) | 20.3 (8.6–40.8) | 19.3 (7.6–54.2) | 22.8 (10.6–52.3) | 0.954 |
Total seizure burden is the accumulated duration of seizure, seizure onset/offset are the age of first/last seizures, respectively and seizure period is the time between first and last seizures in a neonate. Results are reported as median (IQR). ICC, intraclass correlation coefficient. The ICC is the average ICC, is presented as ICC (95% CI) and is estimated across the entire cohort (n = 70), unless1 at which it is estimated on the 33 neonates where all observers detected at least one seizure.