| Literature DB >> 23565065 |
Simon I Hay1, Dylan B George, Catherine L Moyes, John S Brownstein.
Abstract
Simon Hay and colleagues discuss the potential and challenges of producing continually updated infectious disease risk maps using diverse and large volume data sources such as social media.Entities:
Mesh:
Year: 2013 PMID: 23565065 PMCID: PMC3614504 DOI: 10.1371/journal.pmed.1001413
Source DB: PubMed Journal: PLoS Med ISSN: 1549-1277 Impact factor: 11.069
Figure 1A schematic overview of the process of predicting spatial disease risk.
The definitive extent of infectious disease occurrence at the national level (red is certain presence, green is certain absence) [16] is combined with assemblies of known occurrence, presence points (red dots), to generate putative pseudo-absence points (blue dots). The presence and pseudo-absence data are then used in the analyses, with selected environmental covariates to predict disease risk, formally the probability of occurrence of the target disease. In this example a risk map of dengue is shown, shaded from low probability of occurrence in blue to high probability of occurrence in red [8]. The arrows represent data flows.
An assessment of the challenges of using Big Data in disease mapping.
| Definitive Extent | Occurrence Point | Pseudo-Absence Point | Environmental Covariates | Risk Prediction | |
| Volume (scale) | +++ | +++ | + | +++ | +++ |
| Velocity (frequency) | +++ | +++ | ++ | +++ | +++ |
| Variety (diversity) | ++ | ++ | + | + | + |
The potential Big Data challenges in each stage of an iterative mapping process are highlighted in the table. The columns represent each of the mapping stages defined in Figure 1. The rows reflect the volume, velocity, and variety descriptors of data contributions. The future Big Data challenge in relation to infectious disease risk mapping is as follows: low (+), medium (++), and high (+++).