| Literature DB >> 34034534 |
Sophie A Lee1,2,3, Christopher I Jarvis1,3, W John Edmunds1,3, Theodoros Economou4, Rachel Lowe1,2,3.
Abstract
Spatial connectivity plays an important role in mosquito-borne disease transmission. Connectivity can arise for many reasons, including shared environments, vector ecology and human movement. This systematic review synthesizes the spatial methods used to model mosquito-borne diseases, their spatial connectivity assumptions and the data used to inform spatial model components. We identified 248 papers eligible for inclusion. Most used statistical models (84.2%), although mechanistic are increasingly used. We identified 17 spatial models which used one of four methods (spatial covariates, local regression, random effects/fields and movement matrices). Over 80% of studies assumed that connectivity was distance-based despite this approach ignoring distant connections and potentially oversimplifying the process of transmission. Studies were more likely to assume connectivity was driven by human movement if the disease was transmitted by an Aedes mosquito. Connectivity arising from human movement was more commonly assumed in studies using a mechanistic model, likely influenced by a lack of statistical models able to account for these connections. Although models have been increasing in complexity, it is important to select the most appropriate, parsimonious model available based on the research question, disease transmission process, the spatial scale and availability of data, and the way spatial connectivity is assumed to occur.Entities:
Keywords: epidemiology; infectious disease dynamics; machine learning; spatial analysis; vector-borne disease
Year: 2021 PMID: 34034534 PMCID: PMC8150046 DOI: 10.1098/rsif.2021.0096
Source DB: PubMed Journal: J R Soc Interface ISSN: 1742-5662 Impact factor: 4.118
Search terms used to search Medline, Embase, Global Health and Web of Science related to mosquito-borne diseases, modelling and spatial connectivity.
| mosquitoa,b diseaseb | (mathb OR statisticb)a modelb | (spatib OR cluster)a analysis |
| chikungunya | (gravity OR radiation)a modelb | autocorrelb OR neighbb OR hierarchb OR adjacenb OR proximity OR network OR commutb OR connectb |
| dengue | (spatib OR Bayesb)a modelb | randoma effectb |
| ‘Japanese encephalitis’ | (ecologb OR environmentb)a modelb | (BYM OR ‘Besagb Yorke and Mollie’)a modelb |
| malaria | (dynamic OR stochastic OR deterministb OR mechanb OR compartmentb)a modelb | ‘conditional autoregressb’ OR CAR |
| (Rift Valley)a (fever OR virus) | (regression OR generalb)a modelb | humana (mobility OR movement OR travel) |
| sindbis | (SIR OR SEIR)a modelb | spatba dependb |
| (‘West Nile’)a (fever OR diseaseb or virus) | patcha modelb | metapopulation |
| ‘yellow fever’ | (empirical OR correlb OR movement)a modelb | spatiba (structure OR matrix) |
| Zika | ||
| Aedes | ||
| Anopheles | ||
| Culex |
aProximity searching was used, search terms had to be within three words of each other. ADJ3 was used for Embase, Medline and Global Health, NEAR/3 was used for Web of Science.
bDenotes truncation. MeSH terms related to terms above were also searched.
Figure 1PRISMA flow diagram of the search and exclusion process.
Figure 2Number of spatial modelling studies published per year by model type. Statistical models were classified as a fixed effect if parameters were treated as fixed, non-random values or mixed effect if they also included random parameters to account for unobserved heterogeneity or clustering (also known as hierarchical or multilevel models). Machine learning models used algorithms to learn patterns from the data. Compartmental models were mechanistic models that simulated the movement of hosts and/or vectors through disease compartments. Models classified as 'other' did not fall into any of these categories, this included mechanistic models that did not explicitly model movement through compartments, or bespoke statistical models.
The advantages, disadvantages and uses of spatial modelling methods.
| model type | spatial method | description | advantages | disadvantages | application |
|---|---|---|---|---|---|
| statistical or machine learning | spatial covariate | inclusion of a covariate that aims to describe spatial connectivity within a regression model. For example, incidence of surrounding regions, distance between observations, or number of people moving between regions. The covariate is treated as a fixed effect and included into a model as any other covariate | compatible with all statistical or machine learning methods | models assume that the relationship between the outcome and spatial covariates is stationary and isotropic | exploratory tool for statistical or machine learning studies carried out on a small scale where few spatial connections are expected. Statistical or machine learning modelling studies where spatial connectivity is assumed to arise from human movement |
| statistical | local regression models | local regression models are fitted to each region using data from nearby regions, weighted by distance. Also known as GWR. Coefficients are calculated separately for each regression model | relatively simple to carry out and interpret | does not provide a global model to make interpretations about a region as a whole | exploratory tool to generate hypotheses about how relationships differ across space. Cannot be used to make inferences about regions as a whole. Only appropriate when studying areal data |
| statistical | random effects and fields | random effects or fields with a spatially structured covariance function are included in a regression model to account for additional correlation or heterogeneity arising from spatial connectivity. Users must choose an appropriate spatial structure before fitting the model, usually assuming that regions are connected if and only if they are adjacent (areal data) or that connections decay exponentially as the distance between them increases (individual-level data) | relatively easy to obtain connectivity data (if using structure based on adjacency or distance) | more complex to fit and interpret models than other statistical models | statistical models where spatial connectivity is expected to exist between nearby regions. Can be carried out in small- or large-scale studies. Recommended for established diseases rather than a newly emerging setting as requires large amounts of data for precise estimates |
| machine learning | movement matrix | movement matrices reflecting the movement of humans around a network used to weight connections between hidden layers of a neural network | allows complex, dynamic connectivity structures to be explored | requires human movement data (or a representative proxy) to create which can be difficult to obtain | inclusion in a neural network where human mobility is known to drive transmission. Studies that require accurate predictions based on a large amount of data but quantifying this process is not the focus |
| mechanistic | spatial parameter | spatial parameters are included in mechanistic model equations, either to take account for a spatial process or to update populations within each disease compartment of the model. Examples include diffusion parameters allowing hosts and vectors to move across a region or mosquito abundance that borrows information from connected regions | models can be fitted with few data and used to make causal inferences | requires knowledge and information regarding the underlying process of transmission | models aiming to make causal inferences about the underlying process of transmission. Able to fit models where few data are available making it useful for newly emerging diseases or areas with low transmission. More appropriate in small-scale studies where stationarity can be assumed |
| mechanistic | movement matrix | movement matrices that reflect the movement of hosts and/or vectors around a network are included within a mechanistic model. These allow interaction between hosts and vectors in different locations and update the population at each node of the network | allows complex, dynamic connectivity structures to be explored | adequate movement data are difficult to obtain | models taking account of human and/or vector movement or other complex connectivity structures. Able to fit models where few data exist as well as large amounts, useful for newly emerging diseases. Able to study the process of transmission or causal structures. Works well with agent-based or metapopulation mechanistic models where the population is described using a network |
Figure 3Comparison of spatial connectivity using different data sources and assumptions. The level of connectivity between regions represented in models can differ substantially depending on the assumptions made about how connectivity arises, and the data used to weight connections. The heat plots and connectivity matrices show the strength of connectivity between states in Southeast Brazil (a), represented by nodes in the matrices, using assumptions and methods identified in this review. Numbers within the heat plot and along edges of the connectivity matrix represent the weight of connections. These techniques were used to weight observations in GWR models, to structure random effects and random fields, or to weight movement matrices in neural networks, metapopulation models, and agent-based models. (b) Neighbourhood based: assumes states are connected if and only if they share a border. Application: to structure random effects in a CAR model. (c) Distance-based: assumes connectivity between states decays exponentially as distance between centroids (denoted x on the map) increases, where weight = exp(dij /1000) and dij is the distance between states i and j. Application: used to weight observations from neighbouring regions in a GWR model. (d) Human movement data: assumes connectivity between states arises due to human movement. In this case, based on the number of air travel passengers moving between capital cities of each state. Application: to weight hidden layers within a neural network. (e) Movement model: assumes connectivity between states arises due to human movement, estimated using a movement model (in this case, a gravity model). Application: used to weight movement between nodes in a metapopulation model.
The advantages, disadvantages and application of connectivity assumptions.
| connectivity assumption | advantages | disadvantages | application |
|---|---|---|---|
| distance based | easy to obtain data | oversimplifies process of transmission | small-scale studies where unobservable processes, such as shared behaviours, create spatial connectivity. Not appropriate where long-distance connections are expected to exist due to travel. Basis of most statistical approaches identified in this review, e.g. GWR and mixed effect models |
| human movement | shown to be an important part of disease transmission for mosquito-borne diseases | difficult to quantify and obtain data, often requiring a proxy such as distance to be used | |
| vector movement | an important part of the disease transmission process for all mosquito-borne diseases | difficult or impossible to obtain data | small-scale studies or long-term forecasts, particularly malaria studies where transmission generally occurs at night. Due to a lack of data, a proxy must be used such as distance based on known flight distances of mosquitoes. May be included to account for differences in exposure levels across space |
Figure 4Connectivity assumption by model type. The number of spatial modelling studies that assumed connectivity is based on distance, human movement or vector movement (bars) separated by model type. The vast majority of statistical models (fixed and mixed effect models) assumed that connectivity was based on distance, whereas compartmental models were more likely to assume human movement drives connectivity.
Figure 5Connectivity assumptions by mosquito species. The percentage of studies modelling a disease transmitted by each mosquito species that assumed spatial connectivity is related to the distance between regions or observations (using a distance-based function or a neighbourhood structure), human movement or vector movement. Dengue fever, chikungunya, yellow fever and Zika were transmitted by mosquitoes of the Aedes genus; malaria was transmitted by mosquitoes of the Anopheles genus, and Japanese encephalitis, Rift Valley fever and West Nile fever were transmitted by mosquitoes of the Culex genus.