| Literature DB >> 16018815 |
Christopher A Cassa1, Karin Iancu, Karen L Olson, Kenneth D Mandl.
Abstract
BACKGROUND: Evaluating surveillance systems for the early detection of bioterrorism is particularly challenging when systems are designed to detect events for which there are few or no historical examples. One approach to benchmarking outbreak detection performance is to create semi-synthetic datasets containing authentic baseline patient data (noise) and injected artificial patient clusters, as signal.Entities:
Mesh:
Year: 2005 PMID: 16018815 PMCID: PMC1182374 DOI: 10.1186/1472-6947-5-22
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Parameters that can be altered when creating a single cluster.
| User specified reference or identification number for each cluster | |
| Number of patients or points in the generated cluster. | |
| The latitude-longitude coordinates of a reference point, which could be a hospital or a primary care facility, for example. | |
| The distance between the outermost point in the cluster and the center of the cluster. | |
| The angle of the cluster measured counter-clockwise from due east of the reference point as zero degrees, using unit circle convention. | |
| The distance between the center point of the cluster and the reference point. | |
| The number of days from when the first person shows symptoms to when the last person does. | |
| This specifies which of the three models of temporal progression to use. Additional models can be incorporated into the software. | |
| The user can specify where the cluster data and user-specified cluster description will be written. |
Figure 1An example of the linear date algorithm estimation for thirty points spanning three days. The x-axis represents the day number.
Figure 2An example of the exponential date algorithm estimation for thirty points spanning three days. The x-axis represents the day number.
Example of single cluster parameters and multiple cluster parameters.
| Cluster ID | 100 | 101 [1:4] |
| Number of Points in Cluster | 30 | 30 |
| Reference Point Latitude | 42.35666 | 42.35666 |
| Reference Point Longitude | -71.09516 | -71.09516 |
| Cluster Radius | 600 m | 400 m |
| Angle from reference point | 90 | [varies, see below] |
| Distance from reference point | 1600 m | 3000 m |
| Number of Days | 5 | 5 |
| Time-growth Pattern | Linear | Linear |
| Cluster Description | Linear time-growth cluster north of center point | Varied angle around center point and created 4 clusters. |
| Number of Clusters | N/A | 4 |
| Minimum Angle | N/A | 0 |
| Maximum Angle | N/A | 270 |
Sample output to a comma separated value file from AEGIS-CCT. Note: Values are point identification number, longitude, latitude and day number.
| 0,-71.09600452536358,42.37455407329273,1 |
| 1,-71.10149672138236,42.365894560466806,1 |
| 2,-71.0954755413253,42.373890954435524,2 |
| 3,-71.08968377859539,42.37242100053542,2 |
| 4,-71.09281946336338,42.36955324904336,2 |
| 5,-71.09564524977307,42.371694560897,2 |
| 6,-71.09345472615571,42.370979504450304,3 |
| 7,-71.09983295495935,42.369683605959985,3 |
| 8,-71.09781606117451,42.37282397457113,3 |
| 9,-71.09685871099056,42.37540065852763,3 |
| 10,-71.0921214185705,42.37216701505921,3 |
| ... (to point with ClusterID 29) |
Figure 3A single linear time-growth cluster north of center point.
Figure 4Artificially-Injected Temporal Cluster into Low and High-Volume Weeks of Children's Hospital Boston ED Visit Data: An artificially-generated cluster (dashed line at bottom) containing a total of 56 additional points, linearly increasing in magnitude over a 7 day span, was added to two separate weeks of Children's Hospital Boston temporal visit data. The first week of data was from a low-volume week, containing a total of 120 authentic patient visits with an additional 56 artificially-generated visits while the second series contains a total of 472 authentic visits with the same 56 artificially-generated visits appended. While growth in the low-volume week is visible by inspection, it is difficult to notice the artificially added visits in a higher-volume week.
Figure 5Creation of a series of four clusters around the center point (with the angle varied.)