Literature DB >> 18598647

Optimizing use of multistream influenza sentinel surveillance data.

Eric H Y Lau1, Benjamin J Cowling, Lai-Ming Ho, Gabriel M Leung.   

Abstract

We applied time-series methods to multivariate sentinel surveillance data recorded in Hong Kong during 1998-2007. Our study demonstrates that simultaneous monitoring of multiple streams of influenza surveillance data can improve the accuracy and timeliness of alerts compared with monitoring of aggregate data or of any single stream alone.

Entities:  

Mesh:

Year:  2008        PMID: 18598647      PMCID: PMC2600317          DOI: 10.3201/eid1407.080060

Source DB:  PubMed          Journal:  Emerg Infect Dis        ISSN: 1080-6040            Impact factor:   6.883


The use of separate data streams based on sentinel surveillance has long been an accepted approach to monitor community incidence and to enable timely detection of infectious disease outbreaks (,). Recently, more attention has been given to the combined analysis of multivariate sentinel data (–). In this study we explored the possibility of improving the ability to more quickly detect peak periods of influenza activity in Hong Kong through simultaneous monitoring of multiple streams of sentinel surveillance data. Our findings have general implications in the choice of surveillance algorithms where multistream data are available.

The Study

The local Department of Health publishes weekly reports () from a network of 50 private-sector sentinel general practitioners (GP) and 62 public-sector sentinel general outpatient clinics (GOPC) on the proportion of patients seeking treatment for influenza-like illness (ILI), defined as fever plus cough or sore throat (). In this study, we used the GP and GOPC sentinel surveillance data in 9 annual influenza seasons from 1998–1999 to 2006–2007, stratified by 4 geographic regions in Hong Kong—Hong Kong Island, Kowloon, New Territories East, and New Territories West—resulting in 8 separate data streams (Figure).
Figure

Nine annual cycles (unbroken lines) of general practitioner (A) and general outpatient clinic (B) geographic sentinel surveillance data from Hong Kong Island, Kowloon, New Territories East, and New Territories West, 1998–2007. The monthly proportions of laboratory samples testing positive for influenza isolates are overlaid as gray bars, and the beginning of each annual period of peak activity (inferred from the laboratory data) is marked with a vertical dotted line. ILI, influenza-like illness.

Nine annual cycles (unbroken lines) of general practitioner (A) and general outpatient clinic (B) geographic sentinel surveillance data from Hong Kong Island, Kowloon, New Territories East, and New Territories West, 1998–2007. The monthly proportions of laboratory samples testing positive for influenza isolates are overlaid as gray bars, and the beginning of each annual period of peak activity (inferred from the laboratory data) is marked with a vertical dotted line. ILI, influenza-like illness. Each month a median of 1,555 specimens (interquartile range 1,140–2,740), primarily from hospitals, were sent to the Government Virus Unit of the Department of Health (). We calculated the highest proportion of positive influenza isolations each season, and used these laboratory data to define the onset of each peak activity period when the proportion of positive influenza A or B isolates exceeded 30% of the maximum seasonal level (). Dynamic linear models () were used to generate alerts (Technical Appendix). We determined that an aberration had occurred when the current observation fell outside a forecast interval generated by the model. For methods based on monitoring of single data streams only, an aberration triggers an alert. For simultaneous monitoring of all 8 data streams, we monitored separate aberrations as above and generated alerts based on the first occurrence of any aberration (M1), 2 simultaneous aberrations (M2), the first occurrence of 3 simultaneous aberrations (M3), any 2 aberrations within a 2-week period (M4), and any 3 aberrations within a 2-week period (M5). In the multistream analyses, we compared alerts produced by univariate models, which effectively assumed independence between the data streams, and multivariate models, which allowed for correlation between the data streams (Technical Appendix). Alerts were compared in terms of their sensitivity, specificity, and timeliness in detecting the onset of peak levels of influenza activity (). We combined these metrics and estimated the area under the weighted receiver operating characteristic curve (AUWROC) as an overall measure of performance (). The Table shows the highest AUWROC, for each method, from a predefined selection of parameter combinations and the sensitivity and timeliness at a fixed specificity of 95%. On the basis of aggregated data, we determined that alerts generated from the GOPC network achieved a higher AUWROC and better timeliness than those from the GP network. However, the best AUWROC from each of the data streams was produced by the GP New Territories East data, which outperformed the aggregate GP data. Conversely, for GOPC data, the performance of aggregate data was superior to that of any single data stream. The Table also shows simultaneous monitoring results for all 8 geographic data streams from both GPs and GOPCs. For the univariate (independent) models for each data stream, methods based on simultaneous alerts perform well. The optimal methods were M2 and M3 with AUWROC of 0.89 and 0.90 and timeliness of 1.22 and 1.47 weeks, respectively, for a fixed specificity of 0.95. In general, univariate models performed better than multivariate models. Empirical correlation derived from one of the fitted multivariate models is shown in the Technical Appendix; correlation structures under other models were similar (data not shown). Results were insensitive to the choice of parameters (Technical Appendix). The results also held when we varied the definition of the start of peak influenza activity between 10% and 50% of peak seasonal levels (Technical Appendix).

Conclusions

A primary objective of sentinel surveillance is to provide sensitive, specific, and timely alerts at the beginning of increased disease activity (). We evaluated the performance of multistream sentinel surveillance of ILI in detecting the onset of peak influenza activity. Splitting sentinel data into separate geographic-based streams and monitoring all 8 streams for 2 or 3 simultaneous aberrations provided substantial improvements in AUWROC and also in timeliness for a fixed specificity when compared with monitoring aggregated data or any single data stream. We also used multivariate models with various alternative correlation structures between data streams, but use of these more complex models did not appear to improve performance (Table), possibly because correlation between streams vary year to year; the multivariate model is based on constant correlations (Technical Appendix). It is possible that other complex multivariate models may allow even greater improvement in performance; however, simultaneous monitoring of data streams may be more practical because univariate models may be applied in a spreadsheet ().
Table

Performance of alerts generated by individual monitoring of aggregate data and separate data streams, and simultaneous monitoring of multiple data streams by using univariate and multivariate time series models, Hong Kong, 1998–2007*

DataUnivariate models
Multivariate models†
AUWROCSensitivity‡Timeliness, wk‡AUWROCSensitivity‡Timeliness, wk‡
Aggregated data
GP0.781.002.41
GOPC
0.86
1.00
1.50




Single stream
GP
HK0.751.002.360.730.872.64
KL0.661.002.710.620.883.06
NTE0.891.002.000.760.902.04
NTW0.801.002.070.800.912.24
GOPC
HK0.791.002.210.710.892.42
KL0.781.002.460.620.963.15
NTE0.790.952.220.790.962.26
NTW
0.73
1.00
2.55

0.72
1.00
2.52
Multiple streams
M1: First aberration0.841.001.570.861.001.66
M2: 2 simultaneous aberrations0.891.001.220.821.001.77
M3: 3 simultaneous aberrations0.901.001.470.801.001.70
M4: Any 2 aberrations in 2 wk0.811.002.630.721.002.43
M5: Any 2 aberrations in 2 wk0.831.002.440.771.002.11

*AUWROC, area under the weighted receiver operating characteristic curve; GP, general practitioner; GOPC, general outpatient clinic; HK, Hong Kong Island; KL, Kowloon; NTE, New Territories East; NTW, New Territories West.
†See Technical Appendix for more detailed description of the multivariate model.
‡At a fixed specificity of 0.95.

*AUWROC, area under the weighted receiver operating characteristic curve; GP, general practitioner; GOPC, general outpatient clinic; HK, Hong Kong Island; KL, Kowloon; NTE, New Territories East; NTW, New Territories West.
†See Technical Appendix for more detailed description of the multivariate model.
‡At a fixed specificity of 0.95. Although the relative performance of GP and GOPC sentinels may not be directly generalizable to other settings with differences in infectious disease dynamics and healthcare systems, the implications for data collection are nevertheless relevant. Inclusion of data streams should be based on their value to the overall surveillance system, rather than independent performance. For example, simultaneous monitoring of data streams where some have lower specificity and others have higher specificity could still improve overall timeliness. Specifically regarding Hong Kong, it is unclear why alerts from the private GP network have better timeliness than those from the public GOPC network. Although we note that both networks have different catchment populations, the GOPC network typically serves elderly and lower income groups (), whereas influenza would be more likely to affect children at the start of the influenza season (). Differences between geographic regions could be real, when disease progresses from 1 region to another (); however, this circumstance is unlikely in Hong Kong, an area of only 1,000 km2, where a high degree of mixing occurs among a population of 7 million persons. Geographic heterogeneity could also be explained by differential socioeconomics and demographics between different regions, associated differences in access to healthcare and health-seeking behavior issues, or small area variations in reporting behavior among the sentinel practices. A potential caveat of our analysis is the small number of annual cycles of sentinel data available for study. However, until recently, few subtropical or tropical regions had begun influenza sentinel surveillance. Another limitation is the absence of a generally agreed-upon standard in defining a peak influenza season. In our analysis, the start of peak activity was defined as laboratory isolation rates exceeding 30% of the annual level; however, we found that our results were not sensitive to other reasonable thresholds. In addition, we compared methods with only a few chosen parameter combinations; sensitivity analyses showed that the results were not sensitive to the smoothing parameter or the specification of correlations between streams. Finally, alerts generated by other more complicated combinations of aberrations might provide further enhancements. However, the value of simultaneously monitoring separate data streams () has already been demonstrated by the simple combinations chosen here.

Technical Appendix

Optimizing Use of Multistream Influenza Sentinel Surveillance Data
  12 in total

1.  Early statistical detection of anthrax outbreaks by tracking over-the-counter medication sales.

Authors:  Anna Goldenberg; Galit Shmueli; Richard A Caruana; Stephen E Fienberg
Journal:  Proc Natl Acad Sci U S A       Date:  2002-04-16       Impact factor: 11.205

2.  Simulation for assessing statistical methods of biologic terrorism surveillance.

Authors:  Ken P Kleinman; A Abrams; K Mandl; R Platt
Journal:  MMWR Suppl       Date:  2005-08-26

3.  Methods for monitoring influenza surveillance data.

Authors:  Benjamin J Cowling; Irene O L Wong; Lai-Ming Ho; Steven Riley; Gabriel M Leung
Journal:  Int J Epidemiol       Date:  2006-08-22       Impact factor: 7.196

4.  Assessing surveillance using sensitivity, specificity and timeliness.

Authors:  Ken P Kleinman; Allyson M Abrams
Journal:  Stat Methods Med Res       Date:  2006-10       Impact factor: 3.021

5.  Interpandemic influenza in the Houston area, 1974-76.

Authors:  W P Glezen; R B Couch
Journal:  N Engl J Med       Date:  1978-03-16       Impact factor: 91.245

6.  Using laboratory-based surveillance data for prevention: an algorithm for detecting Salmonella outbreaks.

Authors:  L C Hutwagner; E K Maloney; N H Bean; L Slutsker; S M Martin
Journal:  Emerg Infect Dis       Date:  1997 Jul-Sep       Impact factor: 6.883

7.  Multi-syndrome analysis of time series using PCA: a new concept for outbreak investigation.

Authors:  Mojdeh Mohtashemi; Ken Kleinman; W Katherine Yih
Journal:  Stat Med       Date:  2007-12-20       Impact factor: 2.373

8.  Comparing aberration detection methods with simulated data.

Authors:  Lori Hutwagner; Timothy Browne; G Matthew Seeman; Aaron T Fleischauer
Journal:  Emerg Infect Dis       Date:  2005-02       Impact factor: 6.883

9.  Evaluation of school absenteeism data for early outbreak detection, New York City.

Authors:  Melanie Besculides; Richard Heffernan; Farzad Mostashari; Don Weiss
Journal:  BMC Public Health       Date:  2005-10-07       Impact factor: 3.295

10.  An epidemiological network model for disease outbreak detection.

Authors:  Ben Y Reis; Isaac S Kohane; Kenneth D Mandl
Journal:  PLoS Med       Date:  2007-06       Impact factor: 11.069

View more
  16 in total

1.  International society for disease surveillance conference 2011: building the future of public health surveillance.

Authors:  Daniel B Neill; Karl A Soetebier
Journal:  Emerg Health Threats J       Date:  2011-12-06

2.  Early detection of influenza outbreaks using the DC Department of Health's syndromic surveillance system.

Authors:  Beth Ann Griffin; Arvind K Jain; John Davies-Cole; Chevelle Glymph; Garret Lum; Samuel C Washington; Michael A Stoto
Journal:  BMC Public Health       Date:  2009-12-22       Impact factor: 3.295

3.  Timely detection of localized excess influenza activity in Northern California across patient care, prescription, and laboratory data.

Authors:  Sharon K Greene; Martin Kulldorff; Jie Huang; Richard J Brand; Kenneth P Kleinman; John Hsu; Richard Platt
Journal:  Stat Med       Date:  2011-02-28       Impact factor: 2.373

4.  Data-driven approach for creating synthetic electronic medical records.

Authors:  Anna L Buczak; Steven Babin; Linda Moniz
Journal:  BMC Med Inform Decis Mak       Date:  2010-10-14       Impact factor: 2.796

5.  Situational awareness of influenza activity based on multiple streams of surveillance data using multivariate dynamic linear model.

Authors:  Eric H Y Lau; Calvin K Y Cheng; Dennis K M Ip; Benjamin J Cowling
Journal:  PLoS One       Date:  2012-05-31       Impact factor: 3.240

6.  Age-specific epidemic waves of influenza and respiratory syncytial virus in a subtropical city.

Authors:  Lin Yang; Kwok Hung Chan; Lorna K P Suen; King Pan Chan; Xiling Wang; Peihua Cao; Daihai He; J S Malik Peiris; Chit Ming Wong
Journal:  Sci Rep       Date:  2015-05-18       Impact factor: 4.379

7.  Forecasting influenza epidemics from multi-stream surveillance data in a subtropical city of China.

Authors:  Pei-Hua Cao; Xin Wang; Shi-Song Fang; Xiao-Wen Cheng; King-Pan Chan; Xi-Ling Wang; Xing Lu; Chun-Li Wu; Xiu-Juan Tang; Ren-Li Zhang; Han-Wu Ma; Jin-Quan Cheng; Chit-Ming Wong; Lin Yang
Journal:  PLoS One       Date:  2014-03-27       Impact factor: 3.240

8.  Effects of school closures, 2008 winter influenza season, Hong Kong.

Authors:  Benjamin J Cowling; Eric H Y Lau; Conrad L H Lam; Calvin K Y Cheng; Jana Kovar; Kwok Hung Chan; J S Malik Peiris; Gabriel M Leung
Journal:  Emerg Infect Dis       Date:  2008-10       Impact factor: 6.883

9.  Forecasting Influenza Epidemics in Hong Kong.

Authors:  Wan Yang; Benjamin J Cowling; Eric H Y Lau; Jeffrey Shaman
Journal:  PLoS Comput Biol       Date:  2015-07-30       Impact factor: 4.475

Review 10.  A review of the evidence to support influenza vaccine introduction in countries and areas of WHO's Western Pacific Region.

Authors:  Gina Samaan; Michelle McPherson; Jeffrey Partridge
Journal:  PLoS One       Date:  2013-07-16       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.