Howard S Burkom1, Y Elbert, A Feldman, J Lin. 1. Johns Hopkins University Applied Physics Laboratory, 11100 Johns Hopkins Road, Mailstop 8-224, Laurel, MD 20723, USA. Howard.Burkom@jhuapl.edu
Abstract
INTRODUCTION: Syndromic surveillance systems are used to monitor daily electronic data streams for anomalous counts of features of varying specificity. The monitored quantities might be counts of clinical diagnoses, sales of over-the-counter influenza remedies, school absenteeism among a given age group, and so forth. Basic data-aggregation decisions for these systems include determining which records to count and how to group them in space and time. OBJECTIVES: This paper discusses the application of spatial and temporal data-aggregation strategies for multiple data streams to alerting algorithms appropriate to the surveillance region and public health threat of interest. Such a strategy was applied and evaluated for a complex, authentic, multisource, multiregion environment, including >2 years of data records from a system-evaluation exercise for the Defense Advanced Research Project Agency (DARPA). METHODS: Multivariate and multiple univariate statistical process control methods were adapted and applied to the DARPA data collection. Comparative parametric analyses based on temporal aggregation were used to optimize the performance of these algorithms for timely detection of a set of outbreaks identified in the data by a team of epidemiologists. RESULTS: The sensitivity and timeliness of the most promising detection methods were tested at empirically calculated thresholds corresponding to multiple practical false-alert rates. Even at the strictest false-alert rate, all but one of the outbreaks were detected by the best method, and the best methods achieved a 1-day median time before alert over the set of test outbreaks. CONCLUSIONS: These results indicate that a biosurveillance system can provide a substantial alerting-timeliness advantage over traditional public health monitoring for certain outbreaks. Comparative analyses of individual algorithm results indicate further achievable improvement in sensitivity and specificity.
INTRODUCTION: Syndromic surveillance systems are used to monitor daily electronic data streams for anomalous counts of features of varying specificity. The monitored quantities might be counts of clinical diagnoses, sales of over-the-counter influenza remedies, school absenteeism among a given age group, and so forth. Basic data-aggregation decisions for these systems include determining which records to count and how to group them in space and time. OBJECTIVES: This paper discusses the application of spatial and temporal data-aggregation strategies for multiple data streams to alerting algorithms appropriate to the surveillance region and public health threat of interest. Such a strategy was applied and evaluated for a complex, authentic, multisource, multiregion environment, including >2 years of data records from a system-evaluation exercise for the Defense Advanced Research Project Agency (DARPA). METHODS: Multivariate and multiple univariate statistical process control methods were adapted and applied to the DARPA data collection. Comparative parametric analyses based on temporal aggregation were used to optimize the performance of these algorithms for timely detection of a set of outbreaks identified in the data by a team of epidemiologists. RESULTS: The sensitivity and timeliness of the most promising detection methods were tested at empirically calculated thresholds corresponding to multiple practical false-alert rates. Even at the strictest false-alert rate, all but one of the outbreaks were detected by the best method, and the best methods achieved a 1-day median time before alert over the set of test outbreaks. CONCLUSIONS: These results indicate that a biosurveillance system can provide a substantial alerting-timeliness advantage over traditional public health monitoring for certain outbreaks. Comparative analyses of individual algorithm results indicate further achievable improvement in sensitivity and specificity.
Authors: Richard S Hopkins; Catherine C Tong; Howard S Burkom; Judy E Akkina; John Berezowski; Mika Shigematsu; Patrick D Finley; Ian Painter; Roland Gamache; Victor J Del Rio Vilas; Laura C Streichert Journal: Public Health Rep Date: 2017 Jul/Aug Impact factor: 2.792
Authors: Geoffrey B Crawford; Sara McKelvey; Janet Crooks; Karen Siska; Kelly Russo; Jinlene Chan Journal: Public Health Rep Date: 2011 Jul-Aug Impact factor: 2.792
Authors: Cynthia Lucero-Obusan; Carla A Winston; Patricia L Schirmer; Gina Oda; Mark Holodniy Journal: Public Health Rep Date: 2017 Jul/Aug Impact factor: 2.792
Authors: Cynthia A Lucero; Gina Oda; Kenneth Cox; Frank Maldonado; Joseph Lombardo; Richard Wojcik; Mark Holodniy Journal: BMC Med Inform Decis Mak Date: 2011-09-19 Impact factor: 2.796
Authors: Nicola Marsden-Haug; Virginia B Foster; Philip L Gould; Eugene Elbert; Hailiang Wang; Julie A Pavlin Journal: Emerg Infect Dis Date: 2007-02 Impact factor: 6.883
Authors: Patricia L Schirmer; Cynthia A Lucero-Obusan; Stephen R Benoit; Luis M Santiago; Danielle Stanek; Achintya Dey; Mirsonia Martinez; Gina Oda; Mark Holodniy Journal: PLoS Negl Trop Dis Date: 2013-03-14