| Literature DB >> 27648777 |
Sharon K Greene, Eric R Peterson, Deborah Kapell, Annie D Fine, Martin Kulldorff.
Abstract
Each day, the New York City Department of Health and Mental Hygiene uses the free SaTScan software to apply prospective space-time permutation scan statistics to strengthen early outbreak detection for 35 reportable diseases. This method prompted early detection of outbreaks of community-acquired legionellosis and shigellosis.Entities:
Keywords: Legionella; New York; New York City; Shigella; USA; bacteria; communicable diseases; detection; disease clustering; epidemiology; foodborne diseases; legionellosis; outbreaks; shigellosis; surveillance; zoonoses
Mesh:
Year: 2016 PMID: 27648777 PMCID: PMC5038417 DOI: 10.3201/eid2210.160097
Source DB: PubMed Journal: Emerg Infect Dis ISSN: 1080-6040 Impact factor: 6.883
Case file specifications for routine reportable disease analyses in New York City, New York, using the prospective space–time permutation scan statistic
| Feature | Selection | Notes |
|---|---|---|
| Geographic aggregation | Census tract (defined using US Census 2000 boundaries) of residential address at time of report* | The less data are spatially aggregated, the more precisely areas with elevated rates can be identified. New York City has 2,216 census tracts in an area of 305 square miles. |
| Date of interest for analysis | Event date, defined using hierarchy of onset date → diagnosis date (collection date of first specimen testing positive) → report date → date event created in surveillance database | Defining reportable disease clusters according to when case-patients became ill is preferred. However, onset date is missing for most case-patients who have not yet been interviewed, and each case needs a date to be included in analysis. Thus, the best available proxy for onset date is used. Because we use daily data (rather than weekly, monthly, or yearly data), the time precision is specified as day on the SaTScan ( |
| Study period | 1 y for most diseases, ending the day before analysis† | One year is a reasonable choice, balancing the need for a period long enough to establish a stable local baseline for each spatial unit, yet short enough to avoid variable secular trends (e.g., geographically different increases in the underlying population of a spatial unit). Analyses are run each morning using data with event dates through the previous day. |
| Case inclusion criteria | Include all reported cases, regardless of current status (e.g., confirmed, probable, suspected, pending, noncase)† | Depending on the disease, cases initially might be assigned a transient pending status and, upon investigation, be reclassified as a case (confirmed, probable, or suspected) or a noncase. Timeliness is preserved by analyzing all reported cases, including noncases and pending cases, regardless of whether they ultimately will be confirmed. By analyzing all reported cases, case inclusion criteria are consistent across the study period. If instead the case file were restricted to confirmed and pending cases, then analyses would be biased toward false signaling, as some cases with an initial pending status will be ultimately reclassified after investigation as a noncase. This reclassification process is complete for the baseline but ongoing for the current period of interest ( |
| Day-of-week variable | Include a variable that indicates the day of the week (1–7) | The analysis automatically adjusts for day-of-week effects but not for space by day-of-week interaction. Including this variable in the SaTScan case file accounts for how the daily pattern of health-seeking behavior and diagnoses might vary geographically. |
*Exception to residential address at time of report: if the residential address is not geocodable (e.g., because the case-patient is not a resident of the city or because a post office box is reported instead of a street address), then the geocoded work address, if available, is substituted. †For exceptions, see online Technical Appendix (http://wwwnc.cdc.gov/EID/article/22/10/16-0097-Techapp1.pdf).
FigureAutomated output from spatiotemporal analysis on July 17, 2015, indicating a cluster (dark gray) of 8 legionellosis cases over 8 days centered in the South Bronx, New York City, New York, USA. In subsequent days, this cluster expanded in space and time into the second largest US outbreak of community-acquired legionellosis.
Signaling rates at 3 recurrence interval thresholds for 35 reportable diseases under surveillance in New York City, New York, USA, 2014–2015*
| Disease | Annual no. cases‡ | No. signals during 365 d of prospective surveillance† | ||
|---|---|---|---|---|
| Recurrence interval | Recurrence interval | Recurrence interval | ||
| Amebiasis | 476 | 0 | 1.2 | 4.3 |
| Babesiosis | 57 | 0 | 0 | 0 |
| Campylobacteriosis | 1,663 | 0.6 | 0.6 | 4.9 |
| Chikungunya | 171 | 0.6 | 1.8 | 3.1 |
| Cholera | 0 | 0 | 0 | 0 |
| Cryptosporidiosis | 135 | 0 | 0 | 0.6 |
| Cyclosporiasis | 51 | 0 | 0 | 1.2 |
| Dengue | 57 | 0 | 0 | 1.8 |
| Encephalitis | 2 | 0 | 0 | 0 |
| Giardiasis | 871 | 1.2 | 1.8 | 5.5 |
| Hemolytic uremic syndrome | 4 | 0 | 0 | 0 |
| Hepatitis A | 78 | 1.9 | 1.9 | 5.8 |
| Acute hepatitis B | 51 | 0.6 | 1.2 | 3.7 |
| Hepatitis D | 0 | 0 | 0 | 0 |
| Hepatitis E | 0 | 0 | 0.6 | 0.6 |
| Human granulocytic anaplasmosis | 51 | 0.6 | 0.6 | 0.6 |
| Human monocytic ehrlichiosis | 8 | 0 | 0.6 | 0.6 |
| Invasive group A | 263 | 0 | 0 | 1.8 |
| Invasive group B | 33 | 0.6 | 1.2 | 2.4 |
| Invasive | 97 | 0 | 0 | 1.8 |
| Invasive | 647 | 0 | 1.2 | 1.8 |
| Legionellosis | 434 | 9.1 | 9.1 | 11.4 |
| Listeriosis | 34 | 0 | 0 | 0.6 |
| Malaria | 187 | 0.6 | 1.8 | 4.3 |
| Meningococcal disease | 8 | 0 | 0 | 0.6 |
| Noncholera | 18 | 0 | 0 | 0 |
| Paratyphoid fever | 11 | 0 | 0 | 0 |
| Rickettisalpox | 9 | 0 | 0 | 0 |
| Rocky Mountain spotted fever | 6 | 0 | 0 | 2.4 |
| Shiga toxin–producing | 96 | 0 | 0 | 0 |
| Shigellosis | 806 | 1.8 | 1.8 | 6.1 |
| Typhoid fever | 31 | 0 | 1.9 | 3.8 |
| Vancomycin-intermediate | 28 | 0 | 0 | 0 |
| West Nile virus disease | 19 | 0 | 0 | 0 |
| Yersiniosis | 25 | 0 | 0 | 0 |
| Total signals across all diseases under surveillance | NA | 17.8 | 27.6 | 69.8 |
*Signals were detected by using the prospective space–time permutation scan statistic. NA, not applicable. †A signal for a particular disease was defined as unique if the first most likely cluster on a particular day did not encompass any of the same census tracts as the first most likely cluster on the prior day. The signaling rate for most diseases was based on 598 d of surveillance (February 10, 2014–September 30, 2015). For 5 diseases, the signaling rate was based on a shorter surveillance period to reflect analytic adjustments: hepatitis A, paratyphoid fever, and typhoid fever (190 d under surveillance after extending to a 60-d maximum temporal cluster size); legionellosis (160 d under surveillance after excluding unresolved cases); and Shiga toxin–producing E. coli (21 d under surveillance after excluding cases with only a positive multiplex PCR gastrointestinal panel test). ‡Confirmed, probable, and suspected cases among residents with event dates October 1, 2014–September 30, 2015. §The signal was detected at the lower ≥100-d threshold on the same day for 50% of the signals, 1 d earlier for 19% of signals, 2 d earlier for 19% of signals, 3 d earlier for 6% of signals, and 7 d earlier for 6% of signals.