| Literature DB >> 28830109 |
Elizabeth C Lee1, Jason M Asher2, Sandra Goldlust1, John D Kraemer3, Andrew B Lawson4, Shweta Bansal1,5.
Abstract
Spatial big data have the velocity, volume, and variety of big data sources and contain additional geographic information. Digital data sources, such as medical claims, mobile phone call data records, and geographically tagged tweets, have entered infectious diseases epidemiology as novel sources of data to complement traditional infectious disease surveillance. In this work, we provide examples of how spatial big data have been used thus far in epidemiological analyses and describe opportunities for these sources to improve disease-mitigation strategies and public health coordination. In addition, we consider the technical, practical, and ethical challenges with the use of spatial big data in infectious disease surveillance and inference. Finally, we discuss the implications of the rising use of spatial big data in epidemiology to health risk communication, and public health policy recommendations and coordination across scales.Entities:
Keywords: digital epidemiology; disease mapping; infectious diseases; spatial big data; spatial epidemiology; statistical bias
Mesh:
Year: 2016 PMID: 28830109 PMCID: PMC5144899 DOI: 10.1093/infdis/jiw344
Source DB: PubMed Journal: J Infect Dis ISSN: 0022-1899 Impact factor: 5.226
Figure 1.A, Spatial big data have spatial biases in the populations they represent. For instance, as reported by the 2013 American Community Survey, there is spatial variation in home Internet access across the United States, which might affect the populations generating search query data in Google Trends. B, With static spatial data (left), individuals (represented with different colors) report case events (points) at fixed locations. For instance, 2 individuals visited the same physician's office with symptoms multiple times (points along the time axis), so their events are recorded at the same position along the space axis (see overlapping trajectories in the lower part of the space axis), while another individual visited a different physician's office with symptoms 3 times in a similar period (upper part of space axis). Events from the same individual are connected with a dashed line. With dynamic spatial data (right), events are recorded as individuals move through space. For example, the dark blue individual (see trajectory that begins earliest on the time axis) recorded 4 events when they tweeted about symptoms at work, at the grocery store, at the pharmacy, and at home, so their case events occur at 4 different positions along the space axis. Events occur in time dynamically (as shown in this figure), but events may also be aggregated to regular intervals (eg, weekly). C, Data at different spatial scales may have different magnitudes and variability in time, after adjustment for population size, even if they are derived from the same data source. For instance, we observe time-varying fluctuations and variation in epidemic peak timing and magnitude in the county-level disease data (gray) that are lost in the state-level data (black). D, One possible method to protect privacy is to mask individual-level data by aggregating collected data to larger spatial resolutions. In reality, individuals (black circles) may be connected to other individuals through mobile phone calls (black lines). The publicly released data may be aggregated to the level of neighborhoods (green circles), and the number of calls between individuals from different neighborhoods (green lines) would be represented with different weights (here, depicted with varying thickness according to number of individual calls).