| Literature DB >> 35237228 |
Aaron M Spring1,2,3, Daniel J Pittman2,3, Arsalan Rizwan3,4, Yahya Aghakhani1, Jeffrey Jirsch5, Mary Connolly6, Samuel Wiebe1,2, Juan Pablo Appendino1,7, Anita Datta6, Trevor Steve5, Neelan Pillay1, Manouchehr Javidan8, Morris Scantlebury1,7, Chantelle Hrazdil8, Colin Bruce Josephson1,2, Cyrus Boelman6, Donald Gross5, Shaily Singh1, Luis Bello-Espinosa1,7, Linda Huh6, Nathalie Jetté9, Paolo Federico1,2,3.
Abstract
OBJECTIVE: We examined the effect of a simple Delphi-method feedback on visual identification of high frequency oscillations (HFOs) in the ripple (80-250 Hz) band, and assessed the impact of this training intervention on the interrater reliability and generalizability of HFO evaluations.Entities:
Keywords: Delphi method; epilepsy; feedback; generalizability theory; high frequency oscillations (HFO); interrater reliability; intracranial electroencephalography (iEEG); training
Year: 2022 PMID: 35237228 PMCID: PMC8884138 DOI: 10.3389/fneur.2022.794668
Source DB: PubMed Journal: Front Neurol ISSN: 1664-2295 Impact factor: 4.003
Figure 1Diagram outlining the flow of the study. At the start, reviewers were given an instructional video and practice dataset. After this, each week, reviewers were presented with one or two datasets, each comprised of two to four “blocks” of data. These blocks may be new (black text and outline) or repeated from a previous dataset (white text and outline), but all consist of exactly 30 epochs (1 epoch per event type per patient) that were randomized. During Week 3, reviewers were presented with feedback before or after they complete Dataset 2a depending on which group they are randomized to. The corresponding phase(s) of the study are indicated on the right-hand side—those above the dashed line are included in the present work.
Figure 2Example of histograms given during the feedback for one reviewer. Each histogram illustrates the distribution of the differences between the group consensus and the reviewer's HFO ratings. Cyan bars indicate epochs where there was agreement between the reviewer and the group consensus on whether an HFO is present, while magenta bars indicate disagreement. The zero bin indicates epochs where the reviewer's rating matches the group consensus. Positive bins indicate epochs that the reviewer had marked as more likely containing an HFO than the group consensus, where the magnitude equals the difference between the group consensus and the reviewer's rating. Negative bins indicate epochs that the reviewer had marked as less likely containing an HFO. The top histogram represents epochs marked by the Group DIs containing HFOs, while the bottom histogram reflects those marked by the Group DIs not containing HFOs. In this particular example, the top histogram illustrates that when the group marked an HFO, the reviewer agreed 75% of the time; this is indicated by 75% of the bars being cyan, and only 25% being magenta. In all cases of agreement, the reviewer was more certain than the consensus, typically by 1 or 2 confidence points. The bottom histogram here illustrates that when the group marked a non-HFO, the reviewer agreed 100% of the time, and was more confident than the group consensus in all but one instance, again typically by 1 or 2 confidence points. Overall, this reviewer rates both HFOs and non-HFOs confidently, but is generally less likely to identify an event as an HFO than the group on average.
Figure 3Screenshot of the program used for the visual review component of the study. Three seconds of raw data are shown in the right pane. 250 ms of filtered data are shown in the left pane, and the corresponding raw data are highlighted in yellow. The top pane contains the evaluation form for the current epoch, as well as the current progress. A detailed description of the evaluation program is available in our previous work (8).
Generalizability study details for each of the groups of reviewers.
|
|
| ||||||
|---|---|---|---|---|---|---|---|
|
|
| ||||||
|
|
|
|
|
|
|
|
|
|
| 0.425 | 0.000–1.039 | 19 | 19 | 0.0224 | 0.000–0.0547 | Δ |
|
| n/a | 3 | |||||
|
| 1.238 | 0.604–1.872 | 57 | 0.0217 | 0.0106–0.0329 | Δ | |
|
|
|
|
|
|
| ||
|
| 5.366 | 5.056–5.706 | 19 | 0.282 | 0.266–0.300 | Δ,δ | |
|
|
|
|
| ||||
| Object variance component | σ2(τ) | 0.610 | 0.380–0.839 | ||||
| Relative residual variance | σ2(δ) | 0.282 | 0.266–0.300 | ||||
| Absolute residual variance | σ2(Δ) | 0.327 | 0.277–0.388 | ||||
|
|
|
|
| ||||
| Absolute dependability | Φ2 | 0.651 | 0.495–0.752 | ||||
|
|
| ||||||
|
|
| ||||||
|
|
|
|
|
|
|
|
|
|
| 0.572 | 0.000–1.848 | 6 | 6 | 0.0953 | 0.000–0.308 | Δ |
|
| n/a | 3 | |||||
|
| 1.094 | 0.00379–2.185 | 18 | 0.0608 | 0.00021–0.121 | Δ | |
|
|
|
|
|
|
| ||
|
| 5.987 | 5.356–6.737 | 6 | 0.998 | 0.893–1.123 | Δ,δ | |
|
|
|
|
| ||||
| Object variance component | σ2(τ) | 1.268 | 0.676–1.859 | ||||
| Relative residual variance | σ2(δ) | 0.998 | 0.893–1.123 | ||||
| Absolute residual variance | σ2(Δ) | 1.154 | 0.893–1.552 | ||||
|
|
|
|
| ||||
| Absolute dependability | Φ2 | 0.523 | 0.303–0.676 | ||||
|
|
| ||||||
|
|
| ||||||
|
|
|
|
|
|
|
|
|
|
| 0.496 | 0.000–1.202 | 13 | 13 | 0.0381 | 0.000–0.0925 | Δ |
|
| n/a | 3 | |||||
|
| 0.930 | 0.333–1.527 | 39 | 0.0238 | 0.00853–0.0392 | Δ | |
|
|
|
|
|
|
| ||
|
| 4.998 | 4.648–5.390 | 13 | 0.384 | 0.358–0.415 | Δ,δ | |
|
|
|
|
| ||||
| Object variance component | σ2(τ) | 0.387 | 0.187–0.586 | ||||
| Relative residual variance | σ2(δ) | 0.384 | 0.358–0.415 | ||||
| Absolute residual variance | σ2(Δ) | 0.446 | 0.366–0.546 | ||||
|
|
|
|
| ||||
| Absolute dependability | Φ2 | 0.464 | 0.255–0.616 | ||||
In each case, the left pane provides an overview of variance components, and the right pane contains the calculated coefficients. The sets of columns in the left pane outline the variance components, (first column set) as well as the normalized variance components where the object of measurement is set to epoch (second column set). Confidence intervals are noted for variance components, normalized variance components, and generalizability coefficients. Generalizability studies are depicted for (a) all reviewers, (b) the six reviewers who have previously participated in similar studies, and (c) the other 13 reviewers. The object of measurement and the relative generalizability coefficient are highlighted in bold.
Figure 4Decision study projections for epoch generalizability at baseline. Projections of the epoch generalizability based on the number of reviewers are presented for all 19 reviewers (blue line), for the six experienced reviewers (brown line), and for the 13 inexperienced reviewers (green line). The decision study projections from the previous study (9) are indicated by the dashed red line. The threshold of 0.8 is indicated by the dotted black line, and the number of reviewers projected to be required to achieve the threshold in each case is indicated by a colored marker.
Summary of ANOVAs for intervention, reviewer, and group, including F statistics and p-values for each effect and their interactions.
|
|
| |||||
|---|---|---|---|---|---|---|
|
|
| |||||
| Intervention | 10.74 | 0.006 | ||||
| Group | 2.17 | 0.165 | ||||
| Reviewer (Group) | 4.55 | 0.005 | ||||
| Intervention*Group | 0.69 | 0.419 | ||||
| Intervention*Reviewer (Group) |
|
| ||||
|
|
| |||||
|
|
| |||||
| Intervention |
|
| ||||
| Reviewer |
|
| ||||
| Intervention*Reviewer | 0.99 | 0.428 | ||||
|
|
|
|
| |||
|
|
|
|
|
|
| |
| Intervention | 6.56 | 0.037 |
|
|
|
|
| Reviewer | 3.52 | 0.060 |
|
| ||
| Intervention*Reviewer |
|
| 0.64 | 0.697 | ||
Significant interactions precluding further interpretation are bolded and highlighted in orange. Significant effects are bolded highlighted in blue. (a) Three-way omnibus ANOVA including all reviewers. (b) two-way ANOVA for all reviewers within Group DI. (c) two-way ANOVA for all reviewers within Group EI, two-way ANOVA excluding Reviewer EI-1, and one-way ANOVA for Reviewer EI-1 only.
Figure 5The root mean square (RMS) deviation of each reviewer's evaluations from the group mean, before and after the intervention. The RMS deviation for each reviewer is indicated by a solid line. The overall RMS deviation for each group is indicated by a thick dashed line. The identified special cases, indicated by a dot-dashed line, are DI-7 for Group DI—the only reviewer to deviate more from the group after the intervention—and EI-1 for Group EI—the only reviewer to improve visibly more than the other reviewers in the group.
Summary of ANOVAs for session, reviewer, and group, including F statistics and p-values for each effect and their interactions.
|
|
| |||||
|---|---|---|---|---|---|---|
|
|
| |||||
| Session | 6.38 | 0.025 | ||||
| Group | 4.24 | 0.060 | ||||
| Reviewer (Group) | 4.07 | 0.008 | ||||
| Session*Group | 2.93 | 0.111 | ||||
| Session*Reviewer (Group) |
|
| ||||
|
|
| |||||
|
|
| |||||
| Session | 0.77 | 0.414 | ||||
| Reviewer |
|
| ||||
| Session*Reviewer | 0.98 | 0.435 | ||||
|
|
|
|
| |||
|
|
|
|
|
|
| |
| Session | 6.28 | 0.041 |
|
|
|
|
| Reviewer | 2.90 | 0.091 |
|
| ||
| Session*Reviewer |
|
| 0.96 | 0.451 | ||
Significant interactions precluding further interpretation are bolded and highlighted in orange. Significant effects are bolded highlighted in blue. (a) Three-way omnibus ANOVA including all reviewers. (b) two-way ANOVA for all reviewers within Group DI. (c) two-way ANOVA for all reviewers within Group EI, two-way ANOVA excluding Reviewer EI-1, and one-way ANOVA for Reviewer EI-1 only.
Figure 6Root mean square (RMS) deviation of each reviewer's evaluations from the group mean for the first two datasets, regardless of whether an intervention has taken place. The RMS deviation for each reviewer is indicated by a solid line. The overall RMS deviation for each group is indicated by a thick dashed line. The special cases identified based on the intervention analysis—DI-7 for Group DI and EI-1 for Group EI—are indicated by dot-dashed lines. The timing of the intervention for each group is indicated by a vertical dotted line. This intervention occurs after Dataset 2a for Group DI, and between Datasets 1 and 2a for Group EI, as indicated on the figures.
Figure 7Decision study projections for epoch generalizability before and after the Delphi-style intervention. Projections of the epoch generalizability based on the number of reviewers are presented for all 15 reviewers before (dot-dashed blue line) and after (solid blue line) the intervention. The decision study projections from the previous study (9) are indicated by the dashed red line. The threshold of 0.8 is indicated by the dotted black line, and the number of reviewers projected to be required to achieve the threshold in each case is indicated by a colored marker.