| Literature DB >> 30689731 |
Angela Noufaily1, Roger A Morbey2, Felipe J Colón-González3, Alex J Elliot2, Gillian E Smith2, Iain R Lake3, Noel McCarthy4.
Abstract
MOTIVATION: Public health authorities can provide more effective and timely interventions to protect populations during health events if they have effective multi-purpose surveillance systems. These systems rely on aberration detection algorithms to identify potential threats within large datasets. Ensuring the algorithms are sensitive, specific and timely is crucial for protecting public health. Here, we evaluate the performance of three detection algorithms extensively used for syndromic surveillance: the 'rising activity, multilevel mixed effects, indicator emphasis' (RAMMIE) method and the improved quasi-Poisson regression-based method known as 'Farrington Flexible' both currently used at Public Health England, and the 'Early Aberration Reporting System' (EARS) method used at the US Centre for Disease Control and Prevention. We model the wide range of data structures encountered within the daily syndromic surveillance systems used by PHE. We undertake extensive simulations to identify which algorithms work best across different types of syndromes and different outbreak sizes. We evaluate RAMMIE for the first time since its introduction. Performance metrics were computed and compared in the presence of a range of simulated outbreak types that were added to baseline data.Entities:
Mesh:
Year: 2019 PMID: 30689731 PMCID: PMC6736430 DOI: 10.1093/bioinformatics/bty997
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Parameters and criteria used to generate the 16 representative signals
| Signal |
|
|
|
|
|
|
|
|
|
| Trend |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 6 | 0 | 0.2 | 0.2 | 0.5 | 0.4 | 2 | 29 | 1 | 2 | 0 |
| 2 | 0.5 | 0 | 1.5 | 1.4 | 0.5 | 0.4 | 1 | −167 | 1 | 2 | 0 |
| 3 | 5.5 | 0 | 0 | 0 | 0.3 | 0.25 | 1 | 1 | 0 | 2 | 0 |
| 4 | 2 | 0 | 0 | 0 | 0.3 | 0.25 | 1 | 1 | 0 | 2 | 0 |
| 5 | 6 | 0 | 0.3 | 2 | 0.3 | 0.5 | 1.5 | −50 | 1 | 2 | 0 |
| 6 | 1 | 0 | 0.1 | 2 | 0.05 | 0.05 | 1 | −50 | 1 | 1 | 0 |
| 7 | 6 | 0.0001 | 0 | 0 | 0.6 | 0.9 | 1.5 | 0 | 0 | 1 | 1 |
| 8 | 3 | 0 | 1.5 | 0.1 | 0.2 | 0.3 | 1 | −150 | 1 | 1 | 0 |
| 9 | 3 | 0 | 0.2 | 0.1 | 0.05 | 0.15 | 1 | −200 | 1 | 1 | 0 |
| 10 | 5 | 0 | 0.2 | 0.1 | 0.05 | 0.1 | 1 | 0 | 1 | 1 | 0 |
| 11 | 0.5 | 0 | 0.4 | 0 | 0.05 | 0.15 | 1 | 0 | 2 | 1 | 0 |
| 12 | 9 | 0 | 0.5 | 0.2 | 0.2 | 0.5 | 1 | 0 | 1 | 1 | 0 |
| 13 | 2 | 0.0005 | 0.8 | 0.8 | 0.8 | 0.4 | 4 | 57 | 1 | 2 | 1 |
| 14 | 0.05 | 0 | 0.01 | 0.01 | 1.8 | 0.1 | 1 | −85 | 4 | 1 | 0 |
| 15 | 3 | 0 | 0.8 | 0.6 | 0.8 | 0.4 | 4 | 29 | 1 | 2 | 0 |
| 16 | 6 | 0 | 0 | 0 | 0.8 | 0.4 | 4 | 1 | 0 | 2 | 0 |
Characteristics of 16 syndromes representative of the 16 simulated data signals
| Signal ID | Related system | Related syndrome | Mean daily count | Yearly variation | 5/7 day service | Trend |
|---|---|---|---|---|---|---|
| 1 | NHS111 | Diarrhoea | >100 | Moderate | 7 | No |
| 2 | ED | Arthropod bites | <10 | Large summer peak | 7 | No |
| 3 | ED | Cardiac | <500 | Small | 7 | No |
| 4 | ED | Cardiac admissions (HCU/ICU) | <10 | Small | 7 | No |
| 5 | GPIHSS | Allergic rhinitis | >100 | Large peak with variable timing | 5 | No |
| 6 | GPIHSS | Heat stroke | <10 | Large peak variable with timing | 5 | No |
| 7 | GPIHSS | Herpes zoster | >100 | Small | 5 | Yes |
| 8 | GPIHSS | Insect bite | 10–100 | Large summer peak | 5 | No |
| 9 | GPIHSS | Pertussis | 10–100 | Moderate | 5 | No |
| 10 | GPIHSS | Pneumonia | >100 | Moderate | 5 | No |
| 11 | GPIHSS | Rubella | <10 | Moderate | 5 | No |
| 12 | GPIHSS | Upper tract respiratory infection | >100 | Moderate | 5 | No |
| 13 | GPOOHSS | Bronchitis | 10–100 | Moderate | 7 | Yes |
| 14 | GPOOHSS | Hepatitis | <10 | Moderate | 7 | No |
| 15 | GPOOHSS | Influenza-like illness | 10–100 | Large peak with variable timing | 7 | No |
| 16 | GPOOHSS | Urinary tract infection | >100 | Small | 7 | No |
Fig. 1.Plots of the 16 simulated data signals
Fig. 2.Average (across the 16 signals) sensitivity (lower end of plot) and POD (upper end of plot) versus timeliness for evaluating the impact of ‘spiked outbreak’ size on detection capabilities obtained from applying RAMMIE, Farrington Flexible, EARS-C1, EARS-C2, EARS-C3 and EARS-NB to the most recent 49 weeks of each of the 100 simulations of the 16 signals. Marker size is proportional to outbreak size (i.e. largest point refers to large outbreaks; second largest refers to medium outbreaks; third largest refers to small outbreaks; smallest point refers to very small outbreaks)
Fig. 3.POD, sensitivity, specificity and timeliness for each of the simulated signals, with added medium ‘spiked outbreaks’, obtained from applying RAMMIE (dashed lines), Farrington Flexible (solid lines) and EARS-NB (dot dash lines) to the most recent 49 weeks of each of the 100 simulations from each signal