Jialan Que1, Fu-Chiang Tsui. 1. Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA. jiq4@pitt.edu
Abstract
OBJECTIVE: Public health surveillance requires outbreak detection algorithms with computational efficiency sufficient to handle the increasing volume of disease surveillance data. In response to this need, the authors propose a spatial clustering algorithm, rank-based spatial clustering (RSC), that detects rapidly infectious but non-contagious disease outbreaks. DESIGN: The authors compared the outbreak-detection performance of RSC with that of three well established algorithms-the wavelet anomaly detector (WAD), the spatial scan statistic (KSS), and the Bayesian spatial scan statistic (BSS)-using real disease surveillance data on to which they superimposed simulated disease outbreaks. MEASUREMENTS: The following outbreak-detection performance metrics were measured: receiver operating characteristic curve, activity monitoring operating curve curve, cluster positive predictive value, cluster sensitivity, and algorithm run time. RESULTS: RSC was computationally efficient. It outperformed the other two spatial algorithms in terms of detection timeliness, and outbreak localization. RSC also had overall better timeliness than the time-series algorithm WAD at low false alarm rates. CONCLUSION: RSC is an ideal algorithm for analyzing large datasets when the application of other spatial algorithms is not practical. It also allows timely investigation for public health practitioners by providing early detection and well-localized outbreak clusters.
OBJECTIVE: Public health surveillance requires outbreak detection algorithms with computational efficiency sufficient to handle the increasing volume of disease surveillance data. In response to this need, the authors propose a spatial clustering algorithm, rank-based spatial clustering (RSC), that detects rapidly infectious but non-contagious disease outbreaks. DESIGN: The authors compared the outbreak-detection performance of RSC with that of three well established algorithms-the wavelet anomaly detector (WAD), the spatial scan statistic (KSS), and the Bayesian spatial scan statistic (BSS)-using real disease surveillance data on to which they superimposed simulated disease outbreaks. MEASUREMENTS: The following outbreak-detection performance metrics were measured: receiver operating characteristic curve, activity monitoring operating curve curve, cluster positive predictive value, cluster sensitivity, and algorithm run time. RESULTS: RSC was computationally efficient. It outperformed the other two spatial algorithms in terms of detection timeliness, and outbreak localization. RSC also had overall better timeliness than the time-series algorithm WAD at low false alarm rates. CONCLUSION: RSC is an ideal algorithm for analyzing large datasets when the application of other spatial algorithms is not practical. It also allows timely investigation for public health practitioners by providing early detection and well-localized outbreak clusters.
Authors: M M Wagner; F C Tsui; J U Espino; V M Dato; D F Sittig; R A Caruana; L F McGinnis; D W Deerfield; M J Druzdzel; D B Fridsma Journal: J Public Health Manag Pract Date: 2001-11
Authors: David L Buckeridge; Howard Burkom; Murray Campbell; William R Hogan; Andrew W Moore Journal: J Biomed Inform Date: 2005-04 Impact factor: 6.317
Authors: Fu-Chiang Tsui; Jeremy U Espino; Virginia M Dato; Per H Gesteland; Judith Hutman; Michael M Wagner Journal: J Am Med Inform Assoc Date: 2003-06-04 Impact factor: 4.497