| Literature DB >> 22738820 |
R Ryan Lash1, Darin S Carroll, Christine M Hughes, Yoshinori Nakazawa, Kevin Karem, Inger K Damon, A Townsend Peterson.
Abstract
BACKGROUND: Maps of disease occurrences and GIS-based models of disease transmission risk are increasingly common, and both rely on georeferenced diseases data. Automated methods for georeferencing disease data have been widely studied for developed countries with rich sources of geographic referenced data. However, the transferability of these methods to countries without comparable geographic reference data, particularly when working with historical disease data, has not been as widely studied. Historically, precise geographic information about where individual cases occur has been collected and stored verbally, identifying specific locations using place names. Georeferencing historic data is challenging however, because it is difficult to find appropriate geographic reference data to match the place names to. Here, we assess the degree of care and research invested in converting textual descriptions of disease occurrence locations to numerical grid coordinates (latitude and longitude). Specifically, we develop three datasets from the same, original monkeypox disease occurrence data, with varying levels of care and effort: the first based on an automated web-service, the second improving on the first by reference to additional maps and digital gazetteers, and the third improving still more based on extensive consultation of legacy surveillance records that provided considerable additional information about each case. To illustrate the implications of these seemingly subtle improvements in data quality, we develop ecological niche models and predictive maps of monkeypox transmission risk based on each of the three occurrence data sets.Entities:
Mesh:
Year: 2012 PMID: 22738820 PMCID: PMC3724478 DOI: 10.1186/1476-072X-11-23
Source DB: PubMed Journal: Int J Health Geogr ISSN: 1476-072X Impact factor: 3.918
Figure 1Total reported MPX case distribution across Central and West Africa, 1970–1986. The distribution of MPX cases in seven countries where MPX cases were reported through the joint WHO/CDC surveillance efforts, including the total number of cases identified within each county [24]. Countries labeled in gray without numbers indicate locations where additional MPX or MPX-related disease have occurred since 1986 [20-23].
Comparison of georeferencing match rates across countries and sub-national units for each different method
| | | | | | |||
|---|---|---|---|---|---|---|---|
| | | | | | | | |
| 2 | 0/2 | 0 | 1/2 | 50 | 0/2 | 0 | |
| Centre | 2 | 0/2 | 0 | 1/2 | 50 | 0/2 | 0 |
| 2 | 0/2 | 0 | 0/2 | 0 | 0/2 | 0 | |
| Sangha | 2 | 0/2 | 0 | 0/2 | 0 | 0/2 | 0 |
| 220 | 99/220 | 45 | 112/220 | 51 | 67/220 | 30 | |
| Bandundu | 37 | 14/37 | 38 | 23/37 | 62 | 12/37 | 32 |
| 0 | 0/0 | n/a | 0/0 | n/a | 1/0* | n/a | |
| Equateur | 143 | 62/143 | 43 | 71/143 | 50 | 38/143 | 27 |
| 3 | 2/3 | 67 | 9/3* | n/a | 8/3* | n/a | |
| Kasai Occidental | 3 | 2/3 | 67 | 1/3 | 33 | 2/3 | 67 |
| Kasai Oriental | 31 | 19/31 | 61 | 6/31 | 19 | 5/31 | 16 |
| Kivu | 3 | 0/3 | 0 | 2/3 | 67 | 0/3 | 0 |
| 0 | 0/0 | n/a | 0/0 | n/a | 1/0* | n/a | |
| 2 | 2/2 | 100 | 2/2 | 100 | 1/2 | 50 | |
| Abengourou | 1 | 1/1 | 100 | 1/1 | 100 | 0/1 | 0 |
| Haut-Sassandra | 1 | 1/1 | 100 | 1/1 | 100 | 1/1 | 100 |
| 2 | 2/2 | 100 | 0/2 | 0 | 0/2 | 0 | |
| Grand Gedeh | 2 | 2/2 | 100 | 0/2 | 0 | 0/2 | 0 |
| 2 | 2/2 | 100 | 1/2 | 50 | 1/2 | 50 | |
| East Central | 1 | 1/1 | 100 | 0/1 | 0 | 0/1 | 0 |
| Oyo | 1 | 1/1 | 100 | 1/1 | 100 | 1/1 | 100 |
| 1 | 1/1 | 100 | 0/1 | 0 | 0/1 | 0 | |
| Southern | 1 | 1/1 | 100 | 0/1 | 0 | 0/1 | 0 |
The number of MPX case localities were matched at different rates in different national and sub-national units (i.e. state or province), which are expressed as fractions and percentages, relative to numbers of unique localities reported there in the WHO spreadsheet. Bolded regions in the DRC represent likely errors of commission, where more localities were georeferenced than would be expected based on the WHO spreadsheet. Asterisks identify probable specific instances of this type of error, such that calculating match rate percentages are not useful.
Geographic information resources consulted for “researched” dataset
| 43 | Joint Operation Graphic’s (JOG’s) | [ |
| 18 | Legacy CDC case forms | |
| | | |
| 4 | Report of Meeting on the implementation of Post-Smallpox Eradication Policy | [ |
| 3 | Human infections with MPX virus: Liberia and Sierra Leone | [ |
| | | |
| 3 | The role of squirrels in sustaining MPX virus transmission. | [ |
| 2 | Ebola haemorrhagic fever in Zaire, 1976. | [ |
| 4 | A search for Ebola virus in animals in the Democratic Republic of the Congo and Cameroon: ecologic, virologic, and serologic surveys, 1979–1980. | [ |
| 1 | Human MPX. | [ |
| 1 | Human poxvirus disease after smallpox eradication. | [ |
| 1 | Four generations of probable person-to-person transmission of human MPX. | [ |
| 1 | Results of Ebola antibody surveys in various populations groups | [ |
The number of MPX case localities which benefited from more detailed CDC legacy data and other historic materials, by resource name.
Figure 2Exploration of effects of different levels of care and detail in georeferencing of human MPX cases on derivative transmission risk maps. Models derived from the automated and worked occurrence data differ in environmental and geographic dimensions from those based on the carefully researched occurrence data points. See text for additional detail. Red and orange areas in panels C and D are those that are more extensive in the researched data set, while blue areas are those that are less extensive. Panel E highlights portions of the ecological niche unique to the West African countries (Nigeria, Ivory Coast, Liberia, Sierra Leone) which were located using the researched method, but largely missed by the other two methods.
Figure 3Example of application of complex spatial logic to georeferencing a difficult locality. A portion of a JOG map is shown, with GNS gazetteer data overlaid as orange dots with orange labels. The village of Libela did not appear on either the JOG map or in the GNS database, but anecdotal reference was made to it as being 38 km south of Yambuku [61]. Using ArcGIS, a 38 km distance (solid white line) from Yambuku Mission (church symbol on JOG map highlighted in white) to the south led to an unnamed village on the JOG map 38 km away, which could reasonably be inferred to be Libela.