Literature DB >> 27837655

Snow's case revisited: new tool in geographic profiling of epidemiology.

Alessio Papini1, Ugo Santosuosso2.   

Abstract

Geographic Profiling technique is used to find the origin of a series of crimes. The method was recently extended to other fields. One of the best renowned data in epidemiology is that by John Snow during an outburst of cholera in London. We wrote Python scripts to perform the analyses to apply the Geographic Profiling for individuating the starting origin of an infection by using the old Snow's data set. We modified the method by applying a weight to each point of the map where cases of cholera were reported. The weight was proportional to the number of cases in a given location. This modification of the Geographic Profiling method allowed to individuate in the map an area of maximum probability of the infection source, which was a few meters wide and including the historically known source of cholera, that is the "classical" water pump at Broad Street. The method appears to be a useful complement in order to individuate the source of epidemics when available data about the cases of the infections can be summarized on a map.
Copyright © 2016 Sociedade Brasileira de Infectologia. Published by Elsevier Editora Ltda. All rights reserved.

Entities:  

Keywords:  Cholera; Geographic epidemiology; Geographic profiling; John Snow

Mesh:

Year:  2016        PMID: 27837655      PMCID: PMC9425470          DOI: 10.1016/j.bjid.2016.09.010

Source DB:  PubMed          Journal:  Braz J Infect Dis        ISSN: 1413-8670            Impact factor:   3.257


Introduction

Geographic Profiling (GP) is an analytic tool widely used in criminology in order to identify on a map an area of highest probability assumed to contain the origin of linked events, typically crimes executed by a serial offender. The method was extended from criminology to other fields where it was possible to identify a series of linked events which might have originated from a starting point in the space (represented on a two dimensional map). Fields of application other than criminology have been: invasion by alien species,2, 3, 4, 5 bumblebees foraging and nest location,6, 7 and infectious diseases targeting.8, 9 GP uses the coordinates on the mapped events, creating a probability surface, the so-called geoprofile. The geoprofile does not indicate the exact origin of the events, but rather prioritize a series of geographical points, based on the data. The geoprofile will provide on the map a decreasing probability density of finding the source of the events drawn on the map. The model does not search simply the geographical center of the events, but instead it considers a distance-decay function, such that the probability of an event will be lower by increasing the distance from the center of origin; and a buffer zone, within which the probability of an event tends to zero. The distance-decay function is related to maximizing parsimony in movement, in economical and energy terms. Surprisingly, these functions revealed to be found not only for humans (criminals), but also even for invasive (not human) species2, 3 and infectious diseases.8, 9, 10 The need for analytical tools to recognize the source of the spreading of “something” (generally a threat) has always been an important task. One of the best known cases is, in epidemiology, that of cholera outbreak in London, 1854, studied by John Snow and widely cited as a seminal work in spatial epidemiology [13 and references therein]. Dr. Snow tagged the cholera cases and the water pumps on the map of London and searched for the area with the highest number of cases, so discovering that the origin of the outbreak (the so-called focus of infection) was a contaminated water pump in Broad Street. The tagged cholera cases drawn by Snow on the map of London can be converted in a data set of coordinates, that was already used by Le Comber et al. to test the GP method for targeting infectious diseases. Le Comber et al. were able to mark a restricted area in the map of London containing the famous water pump of Broad Street (see Fig. 1C and D in their article). These authors used as input data the individual addresses where case of deaths due to cholera had occurred, that is 321 addresses, while the total number of cases amounted to 575, since more than one case might have occurred at the same address. Le Comber et al. used this approach “to avoid the possible problem of spatial temporal non-independence due to secondary infections at a given address”. Our approach included, instead, all cases assigning a weight to each point (addresses) proportional to the number of cases. We overlooked possible secondary human-to-human contagions, since cholera should not easily transmit from person-to-person, while its transmission is known to be more food- or water-born. For this reason, we interpreted more than one case in the same address as independent events and hence summable. Therefore, here we propose a new method of applying GP in which a different weight is assigned to each point of the map proportionally to the number of cases occurred in each point.

Methods

The data about the positions of cases on the map were acquired with Neuronmorpho (http://www.southampton.ac.uk/∼dales/morpho/), a plugin of ImageJ (National Institute of Health; http://rsb.info.nih.gov/ij/), that can read a map position with a mouse click, building a csv file containing the coordinates point by point. Weights were added manually. Our method calculates the GP by weighting each point of the map in direct proportionality with the number of cases occurred in a given point of the map. That is, some points of the map are more important than others. The data were analyzed with a Python script (Geoprof3.0.2.py). Crucial for the GP analysis is the assignment of the values B, corresponding to the radius of the buffer zone. In our analysis we used B = 30, corresponding to a buffer zone of 30 pixels (about 15 m on our map), that is quite small, with respect to other GP analyses in other fields, such as those on malaria cases in Cairo. We evaluated more B values, calculating the impact on the analysis. The GP technique is described in detail in Papini et al. The variable B (the buffer zone) is of course dependent on the map magnification and on the map resolution, since B is expressed in pixels, while the actual meaning of the buffer zone can be understood only if expressed in meters or km. The Python scripts were written by the authors and can be retrieved from the site www.unifi.it/caryologia/PapiniPrograms.html. The scripts were executed with Python 2.7.3 (http://www.python.org/), running in Ubuntu 12.04 LTS operating system, kernel 2.6.32. The Python (>=2.6 version) programs need NumPy (http://www.numpy.org/), SciPy (http://www.scipy.org/), Matplotlib (http://matplotlib.org/), Scikit-learn (http://scikit-learn.org), and Python Image Library – PIL – (http://www.pythonware.com/products/pil/) libraries installed. A note about the software is provided as Supplementary material (SoftwareUsesupplementary.pdf).

Results and discussion

Fig. 1 shows the results obtained by considering only the addresses on the map as data sets, corresponding to the analysis by Le Comber et al., that is, no weight was assigned to an address on the basis of the number of recorded cases. In Fig. 2 we show the GP analysis with weights assigned to each point of the map on the basis of the number of cases. The result is quite striking, since the red area, representing the area of the map with the points with 95% of highest probability comprised the pump of Broad Street. This area was about 30 m in diameter. With respect to the method that does not consider the number of cases as weights (shown in Fig. 1), the total area of highest probability of the presence of the source was hence much smaller.
Fig. 1

Results obtained by considering only the addresses on the map as data sets. No weight is assigned to each address on the basis of the number of recorded cases.

Fig. 2

GP analysis with weights assigned to each point of the map on the basis of the number of cholera cases. The red area (that with highest probability to find the infection source) is only about 30 m in diameter and it comprises the famous pump of Broad Street.

Results obtained by considering only the addresses on the map as data sets. No weight is assigned to each address on the basis of the number of recorded cases. GP analysis with weights assigned to each point of the map on the basis of the number of cholera cases. The red area (that with highest probability to find the infection source) is only about 30 m in diameter and it comprises the famous pump of Broad Street. Counting the pixels with highest probability of finding the source of the crimes, we found that the red pixels (those with highest probability) decreased substantially passing from considering only the addresses to using the whole data set with weights, that is from 36533 to 10068 (visible from the reduction in dimension of the red area from Fig. 1 to Fig. 2). Calculating each case as a single point, also if located in the same position on the map (that is at the same address), produced an area of red pixels only slightly higher with respect to the use of weights (data not shown). Calculating the distance on the map, the GP analysis with weights produced an area of maximum probability of finding the source of about 30 m in diameter, which contains the well known source of cholera cases in London, that is the famous pump of Broad Street recognized by Snow. This result shows that the use of weights proportional to the number of cases in each address largely increase the precision of the analysis, that is, it reduces the area of maximum probability where to look for the source with respect to other GP techniques as those employed by Le Comber et al. and Verity et al.

Conclusion

The weighted geoprofiling can be a useful method to identify a center of origin of an outbreak of a disease, in cases when more cases of infection can be found in the same point of the map (normally corresponding to a residence), largely reducing the priority points and hence showing the highest precision in delimiting the source search area. The use of weights for more cases of infections at the same address, can be a good choice only in cases where secondary person-to-person infections can be considered not probable (as it is likely the case of cholera), otherwise, as stated by Le Comber et al. it is necessary to use as input data each address (point on the map) as points with the same weight = 1.

Funding

Financial support by the Italian Ministry of Research (MUR), Fondi di Ateneo.

Conflicts of interest

The authors declare no conflicts of interest.
  7 in total

1.  From Jack the Ripper to epidemiology and ecology.

Authors:  Steven C Le Comber; Mark D Stevenson
Journal:  Trends Ecol Evol       Date:  2012-04-09       Impact factor: 17.712

2.  Applying geographic profiling used in the field of criminology for predicting the nest locations of bumble bees.

Authors:  Yukari Suzuki-Ohno; Maki N Inoue; Kazunori Ohno
Journal:  J Theor Biol       Date:  2010-04-13       Impact factor: 2.691

Review 3.  Cholera.

Authors:  David A Sack; R Bradley Sack; G Balakrish Nair; A K Siddique
Journal:  Lancet       Date:  2004-01-17       Impact factor: 79.321

4.  Geographic profiling as a novel spatial tool for targeting infectious disease control.

Authors:  Steven C Le Comber; D Kim Rossmo; Ali N Hassan; Douglas O Fuller; John C Beier
Journal:  Int J Health Geogr       Date:  2011-05-18       Impact factor: 3.918

5.  The mortality rates and the space-time patterns of John Snow's cholera epidemic map.

Authors:  Narushige Shiode; Shino Shiode; Elodie Rod-Thatcher; Sanjay Rana; Peter Vinten-Johansen
Journal:  Int J Health Geogr       Date:  2015-06-17       Impact factor: 3.918

6.  Geographic profiling applied to testing models of bumble-bee foraging.

Authors:  Nigel E Raine; D Kim Rossmo; Steven C Le Comber
Journal:  J R Soc Interface       Date:  2009-03-06       Impact factor: 4.118

7.  Spatial Targeting for Bovine Tuberculosis Control: Can the Locations of Infected Cattle Be Used to Find Infected Badgers?

Authors:  Catherine M Smith; Sara H Downs; Andy Mitchell; Andrew C Hayward; Hannah Fry; Steven C Le Comber
Journal:  PLoS One       Date:  2015-11-13       Impact factor: 3.240

  7 in total
  2 in total

1.  History vs. legend: Retracing invasion and spread of Oxalis pes-caprae L. in Europe and the Mediterranean area.

Authors:  Alessio Papini; Maria Adele Signorini; Bruno Foggi; Enrico Della Giovampaola; Luca Ongaro; Laura Vivona; Ugo Santosuosso; Corrado Tani; Piero Bruschi
Journal:  PLoS One       Date:  2017-12-29       Impact factor: 3.240

2.  Cholera deaths in Soho, London, 1854: Risk Terrain Modeling for epidemiological investigations.

Authors:  Joel M Caplan; Leslie W Kennedy; Christine H Neudecker
Journal:  PLoS One       Date:  2020-03-30       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.