Literature DB >> 30792449

A climate-driven and field data-assimilated population dynamics model of sand flies.

Kamil Erguler1, Irene Pontiki2, George Zittis2, Yiannis Proestos2, Vasiliki Christodoulou3, Nikolaos Tsirigotakis3, Maria Antoniou3, Ozge Erisoz Kasap4, Bulent Alten4, Jos Lelieveld2,5.   

Abstract

Sand flies are responsible for the transmission of leishmaniasis, a neglected tropical disease claiming more than 50,000 lives annually. Leishmaniasis is an emerging health risk in tropical and Mediterranean countries as well as temperate regions in North America and Europe. There is an increasing demand for predicting population dynamics and spreading of sand flies to support management and control, yet phenotypic diversity and complex environmental dependence hamper model development. Here, we present the principles for developing predictive species-specific population dynamics models for important disease vectors. Based on these principles, we developed a sand fly population dynamics model with a generic structure where model parameters are inferred using a surveillance dataset collected from Greece and Cyprus. The model incorporates distinct life stages and explicit dependence on a carefully selected set of environmental variables. The model successfully replicates the observations and demonstrates high predictive capacity on the validation dataset from Turkey. The surveillance datasets inform about biological processes, even in the absence of laboratory experiments. Our findings suggest that the methodology can be applied to other vector species to predict abundance, control dispersion, and help to manage the global burden of vector-borne diseases.

Entities:  

Year:  2019        PMID: 30792449      PMCID: PMC6385250          DOI: 10.1038/s41598-019-38994-w

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Leishmaniasis is a neglected tropical disease caused by protozoan flagellates vectored by phlebotomine sand flies. The disease is associated with about 20 Leishmania species, and it manifests itself in various forms (the leishmaniases) including zoonotic and anthroponotic visceral and cutaneous leishmaniasis[1]. Primary vectors are Phlebotomus spp. in Eurasia and Lutzomyia app. in the Americas, and a diverse zoonotic reservoir including rodents and domestic dogs has been described[2]. Historically, leishmaniasis has been associated with poor living conditions and insufficient health-care[3]; however, crowding and poor urban planning in combination with warming trends in global climate are rapidly exacerbating conditions that lead sand flies and leishmaniasis towards temperate territories[4,5]. The diseases caused by the Leishmania parasite are dynamic because sand flies are dependent on environmental, demographic and human behavioural factors. Such factors bring about changes in the habitat not only of the vectors but also of their natural hosts. At the same time, conditions resulting in immunosuppression in humans (like infection with human immunodeficiency virus (HIV) or organ transplantation-associated therapies) as well as the consequences of armed conflict, result in important changes in the epidemiology of the disease[6]. In order to safeguard public health, it is necessary to take account of sand fly and leishmaniasis risk factors. Progress in mathematical modelling of vector-parasite systems is limited by the complex epidemiology and the availability of reliable surveillance data[7,8]. Nevertheless, known vector presence combined with climatic variables have been exploited for large-scale projections of habitat suitability. Fischer et al. performed environmental niche modelling with the maximum entropy methodology (MaxEnt) to model habitat suitability and project routes of dispersal across Central Europe driven by climate change[5]. Pigott et al. used boosted regression trees to infer environmental constraints and project global habitat suitability[2]. In a more recent study, Meneguzzi et al.[9] performed environmental niche modelling to associate disease vectors with cutaneous leishmaniasis incidents in Southeastern Brazil[9]. While environmental niche modelling provides a large-scale overview of possible environmental limitations of the vector species, mechanistic modelling with explicit links to the underlying biological processes offers a more detailed understanding of the environmental dependence of vector biology and disease epidemiology. Oshaghi et al. developed a population dynamics model of Ph. papatasi driven by local meteorological station data in Iran[10]. The authors used temperature thresholds from the literature[11] to estimate accumulated degree days for life history processes, and adjusted laboratory derived parameters for field populations. The resulting correlation between predictions and observed population abundance suggests that laboratory-informed population dynamics modelling could be predictive provided that the parameters are properly adjusted for field conditions. One of the key limitations of most population dynamics and disease transmission models, including the ones proposed for sand flies and leishmaniasis[12-16], is their direct exploitation of laboratory experiments on key life-history traits, such as survival and development. Parameters derived from controlled experimental conditions may not be readily applicable for field populations especially while microclimate conditions around potential breeding sites cannot be reliably predicted with existing computational capacity and climate models. For these reasons, we recently proposed a Bayesian approach to develop population dynamics and disease transmission models by combining experimental data with field observations and observing the dynamics under the influence of large-scale environmental variables[17,18]. The emerging models demonstrated wide applicability and high predictive ability. However, the amount of experimental data on a vector or disease of interest is often limited due to the small number of experimental studies on model systems, the variety of vector and pathogen species, and the lack of host specificity[7]. Here, we propose an adaptation of our Bayesian approach to assess the information content of field data obtained from multiple locations in informing predictive population dynamics models. We focus on three important vector species: Phlebotomus neglectus, Phlebotomus tobbi and Phlebotomus papatasi. While Ph. neglectus and Ph. tobbi are potent vectors of Leishmania infantum, which causes anthroponotic and zoonotic visceral leishmaniasis[19-21], Ph. papatasi is a vector of Leishmania major, which causes zoonotic cutaneous leishmaniasis[22]. We developed a flexible and extensible framework for developing population dynamics models from field observations and performing climate-driven projections at a regional or global scale. We demonstrate that our approach is easily adjustable to study multiple vector species based on available surveillance data.

Methods

We developed a generic discrete-time stochastic model to represent the environmental dependence of sand fly populations. We present a schematic representation of the model in Fig. 1, and list model parameters in Table 1 together with domain boundaries used for parameter inference (see sec. Vector dynamics model). We implemented the model in ANSI C and integrated it with the albopictus Python package (v.1.6), which is available from the Python package index[23].
Figure 1

Flow diagram of the stochastic sand fly population dynamics model. The model is composed of four life stages including both male and female adult sand flies. Life stage parameters, such as survival, development, and fecundity are weather-driven, and a state of temperature-induced dormancy is assumed for the larval stage.

Table 1

Model parameters with reference intervals.

ParameterDefinitionPrior intervalNotes
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${n}_{E}(0)$$\end{document}nE(0), \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${n}_{L}(0)$$\end{document}nL(0), \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${n}_{P}(0)$$\end{document}nP(0), \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${n}_{A}(0)$$\end{document}nA(0)Initial number of eggs, larvae, pupae, and adults, respectivelyU (0,104)[1]
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${d}_{{E}_{1}}$$\end{document}dE1, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${d}_{{L}_{1}}$$\end{document}dL1, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${d}_{{P}_{1}}$$\end{document}dP1Minimum mean development time of eggs, larvae, and pupae, respectivelyU (1, 50) days
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${d}_{{E}_{2}}$$\end{document}dE2, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${d}_{{L}_{2}}$$\end{document}dL2, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${d}_{{P}_{2}}$$\end{document}dP2Maximum mean development time of eggs, larvae, and pupae, respectivelyU (1, 50) daysMust be higher than the minimum
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${d}_{{E}_{3}}$$\end{document}dE3, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${d}_{{L}_{3}}$$\end{document}dL3, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${d}_{{P}_{3}}$$\end{document}dP3Median developmental temperature for eggs, larvae, and pupae, respectivelyU (−10, 50) °C
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${d}_{{A}_{1}}$$\end{document}dA1 Minimum gonotrophic cycle durationU (1, 50) days
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${d}_{{A}_{2}}$$\end{document}dA2 Maximum gonotrophic cycle durationU (1, 50) daysMust be higher than the minimum
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${d}_{{A}_{3}}$$\end{document}dA3 Median temperature for gonotrophic cycle durationU (−10, 50) °C
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${p}_{{E}_{1}}$$\end{document}pE1, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${p}_{{L}_{1}}$$\end{document}pL1, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${p}_{{P}_{1}}$$\end{document}pP1, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${p}_{{A}_{1}}$$\end{document}pA1Maximum daily survival probability of eggs, larvae, pupae, and adults respectivelyU (0, 0.999)
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${p}_{{E}_{2}}$$\end{document}pE2, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${p}_{{L}_{2}}$$\end{document}pL2, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${p}_{{P}_{2}}$$\end{document}pP2, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${p}_{{A}_{2}}$$\end{document}pA2Minimum survival temperature threshold for eggs, larvae, pupae, and adults respectivelyU (−10, 50) °C
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${p}_{{E}_{3}}$$\end{document}pE3, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${p}_{{L}_{3}}$$\end{document}pL3, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${p}_{{P}_{3}}$$\end{document}pP3, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${p}_{{A}_{3}}$$\end{document}pA3Maximum survival temperature threshold for eggs, larvae, pupae, and adults respectivelyU (−10, 50) °CMust be higher than the minimum
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${F}_{{A}_{1}}$$\end{document}FA1 Maximum number of eggs laid in a single gonotrophic cycleU (0, 100)
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${F}_{{A}_{2}}$$\end{document}FA2 Minimum temperature threshold for ovipositionU (−10, 50) °C
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${F}_{{A}_{3}}$$\end{document}FA3 Maximum temperature threshold for ovipositionU (−10, 50) °CMust be higher than the minimum
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${p}_{g}$$\end{document}pg Probability of surviving ovipositionU (0, 0.5)
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${p}_{x}$$\end{document}px Probability that a female adult emerges from a larvaU (0, 1)
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${p}_{\eta }$$\end{document}pη Daily probability of getting caughtU (10−4, 10−1)0.01–10% coverage is assumed
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\rm{\Psi }}}_{T}$$\end{document}ΨT Temperature threshold for dormancyU (−10, 50) °C
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\rm{\Psi }}}_{H}$$\end{document}ΨH Relative humidity threshold for egg and larva survivalU (0, 1)
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{\rm{\Psi }}}_{B}$$\end{document}ΨB Reduction in larva and pupa development probability and adult egg laying due to limited habitability of a breeding siteU (0, 1)
Flow diagram of the stochastic sand fly population dynamics model. The model is composed of four life stages including both male and female adult sand flies. Life stage parameters, such as survival, development, and fecundity are weather-driven, and a state of temperature-induced dormancy is assumed for the larval stage. Model parameters with reference intervals.

Vector dynamics model

We designed a stochastic difference equations model with Markov property, which implies that the state of a population on a given day is affected only by the state of the population and the environmental conditions on the previous day. We defined the population with the number of sand flies and the age structure in each stage. A single iteration comprises the cumulative effect of all stage transformations taking place during a day. Transformations include daily survival, development, egg laying, and, in case of an ongoing surveillance activity, being captured. For each transformation, we defined daily probabilities such as the daily probability of death or egg hatching. Life history parameters of sand flies depend mainly on temperature, and, to a certain extent, on humidity[11,24,25]. In an attempt to standardise environmental dependency for the extensively diverse Phlebotomus genera, we employed single- or double-sided sigmoid curves relating temperature and humidity to survival and development. Sigmoid curves are frequently used in machine learning and fuzzy-logic classification where class boundaries are poorly defined[26]. Here, we employed sigmoid curves as generic approximations of environmental dependence where experimental observations are limited and observational error is large. In order to limit complexity, we took account of the scarcity of experimental data, and assumed that temperature and humidity affect survival independently. We assumed that temperature strongly drives development and survival of all stages, and humidity is a requirement for the survival of the early immature stages, i.e. eggs and larvae. In essence, sufficient humidity is required for eggs and larvae to sustain viability[27], and when humidity is sufficiently high, survival becomes dependent mainly on temperature. We used a single-sided sigmoid curve to describe egg survival as a function of relative humidity,where RH represents daily average relative humidity and represents the minimum RH for survival. The normalisation factor, 0.01, is to constrain the left and right tails of the curve within 0–1. We used a double-sided sigmoid curve to describe temperature-driven survival. For instance, we described the daily probability of temperature-driven egg survival withwhere T is daily average near-surface temperature (2 meters above ground). The parameters constraining the curve are defined in Table 1. Survival probability changes daily as a result of temperature and humidity differences. In a hypothetical scenario where these variables are fixed, the survival process can be described with a geometric distribution with a constant probability . In this case, the average life span can be calculated as (1 − p)/p. We assumed that the development process is independent of survival, and modelled development time for eggs, larvae, pupae, and the gonotrophic cycle duration for adult females with a gamma distribution. Under constant environmental conditions, we assumed that each process lasts on average μ days with a standard deviation of σ. Together, μ and σ describe the probability of the duration of a development process. The resulting daily probability for the completion of development, pdev, can be written asfor day d, where . In order to limit the total number of parameters for this generic model structure, we assumed that the variance of the gamma distribution is equal to its mean (). We argue that this assumption is reasonable with regards to the experimental data (see sec. Experimental datasets); however, improvements and customisations are advisable on specific life history traits as more data become available. Immature stage development and the gonotrophic cycle are hampered at extremely low temperatures[11,24]. In order to model these processes, we used a single-sided sigmoid curve,where dmin is the minimum, dmax is the maximum duration, and Tmed is the temperature with the median duration. Equation 1 results in a mirrored S-shaped curve yielding long development times at low temperatures and a rapid development process at high temperatures. Although female sand flies usually require blood feeding to develop eggs, egg laying and feeding patterns are discordant (egg laying can be delayed until after multiple blood meals) in certain species[19], and reproduction is autogenous (egg laying requires no blood meal) in others[22]. Therefore, we assumed that the duration of the gonotrophic cycle is influenced mainly by temperature[24]. Mortality is strongly elevated after oviposition, where only a few females survive for the second, probably last, oviposition[28]. In order to model this unusually high death rate, we defined p as the probability of surviving egg laying for each female. At each gonotrophic cycle, a healthy female can lay up to 100 eggs[29], which is observed to depend on temperature[24]. According to this, we used a double-sigmoid curve for the number of eggs laid,where is maximum number of eggs a female can lay per cycle, and and are the minimum and maximum, respectively, temperatures for fecundity. Aside from egg laying, we assumed that male and female sand flies share similar survival and developmental characteristics in all life stages[24,30]. In order to determine the number of fertile female adults, we defined p as the probability of emergence of a female adult sand fly from a developing pupa. Facultative diapause, a mechanism of sustaining development in response to environmental factors, is reported in the 4 instar larva in several species including Ph. neglectus, Ph. tobbi, and Ph. papatasi[10,19,31]. In order to accommodate the state of environmentally driven dormancy, we defined and updated the development time given in Eqn. 1. According to this, when temperature is lower than this threshold, , larval development time is set to an extreme value (1000 days) to delay for as long as possible. Otherwise, development proceeds as given in Eqn. 1. Sand fly abundance and diversity are affected strongly by the availability of appropriate breeding grounds. Although precipitation is not a direct influence, existence of rocky slopes, vegetation, availability of decaying organic matter, and an alkaline soil pH are known to contribute to abundance and diversity[32-34]. In order to capture breeding site conditions, we defined as a location-specific fraction, which reduces fecundity, prolongs egg laying interval, and extends the development times of larvae and pupae. According to this, indicates perfect breeding conditions where development and fecundity depend only on temperature and humidity. When , development rate is reduced due to adverse conditions such as limited availability of food or resting sites. We implemented this effect by increasing average development times (μ and σ) and reducing fecundity () by a factor of . We note that can only be inferred when performing inference on multiple locations. When a single location is in focus, merges with the rest of the parameters that drive development rate, e.g. , , , and . The constant indicator implies that a breeding site provides sustained levels of availability and habitability. In addition, we assumed that the main drivers of population dynamics are climatic variables, and that population growth under the influence of these drivers would not reach high levels at which density or predation becomes a limiting factor. The effect of breeding site conditions on survival, therefore, is indirect as delayed development increases exposure to adverse weather conditions. In the current form of the model, is an arbitrary indicator and a first-order approximation of a complex and time-dependent phenomenon. Although this might be adequate for a generic model, it is subject of improvement in future work. Although the flight range of adult sand flies is limited, they are known to be dispersed over long distances with the help of wind[5,27,31]. We postpone the discussion of the effect of migration and dispersion to future work where spatiotemporal modelling will be considered. In the present context, we assumed that migration and wind have negligible effects on the abundance of sand fly populations. In addition to environmental constraints, we modelled the impact of surveillance activities on the abundance of adult sand flies. We defined as the daily probability of either a male or a female adult being captured and removed from the system. Accordingly, we defined the probability of an observation (δ) as the product of the binomial capture probabilities at each time point,where y(t) is the predicted number of adult males or females, and δ(t) is the number of captured sand flies for day t. Predictions and observations comprise the number of female or male sand flies collected over three days as given in the surveillance procedure (see sec. Surveillance and environmental datasets below). In this equation, Binom(n, m, p) represents the binomial probability of choosing m sand flies out of a total of n with probability p. We note that y(t) needs to be greater than or equal to δ(t) for all time points for the binomial probability to be defined. We handled the special case when this condition is not satisfied by replacing the binomial probability with an infinitesimal indicator value ( Likelihood of an observation in the context of Bayesian inference can be calculated as the weighted average of the binomial capture probability,where y is a prediction, i.e. a sample, from the model when its parameter values are given by θ. Due to limited computational resources, we approximated the likelihood with a Monte Carlo approximation,where N is the number of random samples from the model used to approximate the likelihood. More accurate approximations can be derived by using a larger sample size at the expense of computational resources; however, we found that N = 3 performs well in identifying an optimum fit (see Results and Discussion). The model captures the essence of the environmental dependency of sand fly populations; however, we expect that species-specific differences, microclimate conditions around the breeding sites, day-night differences, intrinsic variability associated with the limited population size, and many other unaccounted factors may contribute to deviations between simulations and observations. Inevitably, such differences yield a very small likelihood value. In order to facilitate working with small values, we opted to work with the log-transformed likelihood, which we designated as the score,

Inference and posterior sampling

Having defined an objective function (Eqn. 4), we employed a uniform prior distribution as presented in Table 1 (f(y|θ) = c), performed parameter optimisation, and obtained posterior samples of θ using the python (v2.7) implementation of the hoppMCMC (v0.6) algorithm[17,18]. The hoppMCMC algorithm uses an adaptive Markov chain Monte Carlo (MCMC) method to obtain samples from the posterior distribution[35]. For sufficiently large N, the Monte Carlo approximation (Eqn. 3) converges to the true likelihood, and the algorithm yields accurate posterior samples. When N is small, however, the error in the estimate, which is proportional to , may result in a prohibitively low acceptance rate. In order to compensate for small sample size, we performed posterior sampling with a tolerance. We scaled down the likelihood estimate exponentially with a constant, C < 1, to ensure that a minimum acceptance rate of 1% can be achieved. The constant corresponds to the inverse of the annealing temperature of the hoppMCMC algorithm. We report C where applicable. Details of inference and posterior sampling can be found in Supplementary File S1. When reporting inference results, we adopted the concept of posterior modes as reported in previous work[17,18]. In essence, we assumed that the posterior distribution comprises a set of local maxima analogous to the local minima frequently encountered in optimisation. We assumed that the locality of each maximum leads to similar model behaviour, and the extent of each locality is dictated by the sensitivity around the maxima. In case of high sensitivity, only the parameter values in the vicinity of the maximum yield similar behaviour, and the behaviour changes drastically at greater distance. Although it is not trivial to sample from the entire posterior distribution, it is relatively straightforward to sample from the vicinity of a local maximum by starting a Markov chain from around that point[17]. Posterior samples obtained as such are labelled as samples from the same posterior mode. Throughout the text, we report posterior modes (single or multiple) as possible explanations for the observations.

Experimental Datasets

Due to the cost and effort required to reliably determine life history parameters, comprehensive information on many sand fly species is yet to be obtained. Kasap et al. compiled a reliable data source on Ph. papatasi in two reports: Kasap et al.[11] and Kasap et al.[24]. Here, we derived daily survival probabilities, development times, and fecundity for a range of temperatures using these data (Fig. 2).
Figure 2

Life history parameters of Ph. papatasi derived under controlled environmental conditions. (a) Daily survival probabilities of eggs, larvae, pupae, and adults (both male and female). (b) Average development times of eggs, larvae, and pupae, and average gonotropic cycle lengths of adults. (c) Average number of eggs laid by an adult female at the end of each gonotropic cycle. Vertical ranges indicated in (b,c) correspond to the standard deviation.

Life history parameters of Ph. papatasi derived under controlled environmental conditions. (a) Daily survival probabilities of eggs, larvae, pupae, and adults (both male and female). (b) Average development times of eggs, larvae, and pupae, and average gonotropic cycle lengths of adults. (c) Average number of eggs laid by an adult female at the end of each gonotropic cycle. Vertical ranges indicated in (b,c) correspond to the standard deviation. We emphasise that these parameters originate from controlled environmental conditions, which include temperature, humidity, feeding, and photoperiod. Therefore, care must be taken when extrapolating them to be used with mathematical models due to pertaining structural restrictions (such as the existence of four distinct life stages) and simplifying assumptions (such as the use of daily average temperatures). In this context, instead of using experimentally-derived values as informed prior probabilities, we reserved them for validation. By adopting this strategy, we were able to assess the information content of field observations and apply the same inference procedure to other Phlebotomus species for which have fewer experimental data. We assumed that each observation is independent and normally distributed with the reported mean d and standard deviation s, where T is the temperature. We estimated and empirically with the exception of for survival. We assumed for survival, which corresponds to 1% of the range (0–1). In order to assess the agreement between observed and inferred parameter values, we defined a score function,where is the corresponding life history parameter indicated by the parameter configuration θ for temperature T. Consequently, the probability of θ given the experimental observations is proportional to .

Surveillance and environmental datasets

We allocated three surveillance datasets for parameter inference (training set) and one additional dataset for model validation (test set). The surveillance datasets comprise longitudinal abundance data from Greece (Fodele), Cyprus (Steni and Geri), and Turkey (Adana) collected as part of the comprehensive surveillance study reported in Alten et al.[36]. For each dataset, we obtained daily average near-surface temperature (2 meters above ground) and relative humidity from the open-access datasets of nearby meteorological stations. For Geri and Adana, we used the Integrated Surface Database Station History dataset of National Oceanic and Atmospheric Administration (NOAA)[37]. For Steni and Fodele, we used the blended meteorological datasets from the European Climate Assessment & Dataset (ECA&D)[38]. We list the collection sites and the sources of weather data in Table 2. In Fig. 3, we plot the abundance and weather data together for all the locations.
Table 2

Longitudinal datasets and corresponding weather stations.

Test/trainingCollection siteWeather station
CountryAreaLongitudeLatitudeID/datasetLongitudeLatitude
TrainingGreeceFodele24.95835.381Heraklion25.18335.333
TrainingCyprusSteni32.47134.998Polis32.43335.033
TrainingCyprusGeri32.47134.998Ercan_17521033.50035.150
TestTurkeyAdana (Koyunevi)35.65537.289İncirlik35.41737.000
Figure 3

Surveillance and environmental datasets used in this study. Male and female sand fly counts from 2011 to 2013 together with the meteorological data (daily mean temperature and relative humidity) are plotted at the top. Coordinates of the traps are plotted as blue marks in the maps at the bottom. Green marks on the maps indicate the nearest weather stations from which environmental data are obtained. The maps were generated using the Leaflet (v1.1.0) library (http://leafletjs.com/) with the Stamen-TerrainBackground tiles to emphasise topography. Map tiles by Stamen Design (http://stamen.com, CC BY 3.0, http://creativecommons.org/licenses/by/3.0), and map data ©OpenStreetMap contributors (http://www.openstreetmap.org/copyright).

Longitudinal datasets and corresponding weather stations. Surveillance and environmental datasets used in this study. Male and female sand fly counts from 2011 to 2013 together with the meteorological data (daily mean temperature and relative humidity) are plotted at the top. Coordinates of the traps are plotted as blue marks in the maps at the bottom. Green marks on the maps indicate the nearest weather stations from which environmental data are obtained. The maps were generated using the Leaflet (v1.1.0) library (http://leafletjs.com/) with the Stamen-TerrainBackground tiles to emphasise topography. Map tiles by Stamen Design (http://stamen.com, CC BY 3.0, http://creativecommons.org/licenses/by/3.0), and map data ©OpenStreetMap contributors (http://www.openstreetmap.org/copyright). In the training set, we included three patterns frequently encountered in surveillance studies: (i) An extended observational dataset in terms of temporal coverage, which follows two consecutive years in Fodele, (ii) a complete dataset, which follows the start and end of the sand fly high season in Steni, and an incomplete one, which starts late summer in Geri. We selected Adana for validation due to the similarity of its climate and land cover in spite of the distance and isolation with respect to the two islands. As reported by Alten et al., data collection was carried out using a combination of light and sticky traps between April and November during three consecutive days (13–15 days) in each month during 2011–2013[36]. Since sand fly presence is unlikely after November until the beginning of the next high season in April, no data was collected during this period. In order to constrain inference in the light of this expectation, we assumed zero presence for the unobserved months of the training set. The surveillance data comprise counts of male and female adult Ph. papatasi, Ph. neglectus, and Ph. tobbi, which were identified morphologically. It is important to note that morphological identification of Ph. neglectus, which belongs to the Ph. major s.l. species complex, was not definitive in Adana as Ph. neglectus and Ph. syriacus of the complex share the same habitat in the region[39]. Nevertheless, in order to validate model predictions for Ph. neglectus, we used the combined abundance of the Ph. major s.l. complex as the closest available dataset.

Results and Discussion

We assessed the inferability of a biologically-plausible environmentally driven population dynamics model using only sand fly surveillance data as the basis of inference. We assessed the predictive capacity of models inferred using various degrees of information, and presented guidelines for extending this procedure to other species.

Multiple versions of the model are informed by the surveillance data

The sand fly model comprises a total of 37 parameters including 6 that are location-specific (the four initial conditions, the degree of habitability, and the daily probability of capture — see sec. Vector dynamics model). We inferred all of these using only a single location for each of the three species: Ph. papatasi, Ph. neglectus, and Ph. tobbi (see sec. Inference and posterior sampling). Then, we combined the three locations to infer all for Ph. papatasi, the only species found in all locations, while allowing three different values for the location-specific parameters. As a result, when we used only one location for inference, we identified one major posterior mode for each. We identified multiple posterior modes when we combined all locations, and, here, we report three of the most representative and most probable ones labelled as “Combined A”, “Combined B”, and “Combined C”. In Table 3, we report the annealing temperature (C), the degree of success (), the average score (), and the absolute value of the determinant of the covariance matrix (). We report as a measure of dispersion in posterior modes.
Table 3

Posterior modes obtained using the training datasets.

SpeciesTypeArea C \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\rho $$\end{document}ρ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathscr{S}}$$\end{document}S(δ|θ) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathscr{U}}$$\end{document}U
Ph. neglectusSingleFodele4100.0%733.53 ± 19.1279.6714
Ph. tobbiSingleSteni2100.0%44.29 ± 5.49147.879
Ph. papatasiSingleGeri2100.0%31.36 ± 5.43168.334
SingleSteni299.7%41.71 ± 6.18133.308
SingleFodele2100.0%72.90 ± 5.51138.493
Combined AGeri2100.0%33.76 ± 3.85104.499
Steni100.0%54.61 ± 5.76106.085
Fodele100.0%96.11 ± 5.50105.249
Combined BGeri2100.0%33.40 ± 3.6196.1643
Steni100.0%47.82 ± 5.99100.002
Fodele99.9%92.73 ± 5.1498.1116
Combined CGeri2100.0%44.34 ± 5.17103.678
Steni100.0%134.80 ± 10.39106.022
Fodele100.0%85.51 ± 6.56106.926

Single or combined inferences are listed with the associated sand fly species, annealing temperature (C), degree of success (), average and standard deviation of the score ((δ|θ)), and the extent of dispersion in posterior modes ().

Posterior modes obtained using the training datasets. Single or combined inferences are listed with the associated sand fly species, annealing temperature (C), degree of success (), average and standard deviation of the score ((δ|θ)), and the extent of dispersion in posterior modes (). The likelihood function is proportional to the size of a dataset and the number of sand flies collected (Eqn. 2); therefore, a large number of collections or more data points yield high likelihood values. In turn, this constrains the extent of a posterior mode. In agreement with this, Fodele (the largest dataset) yields a less dispersed posterior mode than Geri (the smallest dataset). Combined inferences yield highly constrained posterior modes due to the cumulative size of the combined dataset. We observed the lowest dispersion for Ph. neglectus which was collected in large numbers. In order to preserve 1% acceptance rate in sampling, a higher annealing temperature was needed for this dataset. Estimates of kernel density of the scores associated with the inference of Ph. papatasi can be seen in Fig. 4. We found that single location inferences explain each location better than combined inferences. Posterior modes labelled “Combined A” and “Combined B” yield balanced fits to the three locations, while “Combined C” tends to fit Fodele better than the other two locations. Resulting matches with the data for all inferences are shown in Supplementary Fig. S1.
Figure 4

Score distributions of the posterior samples for Ph. papatasi. Kernel density estimates for samples from 6 posterior modes (3 single + 3 combined inferences) are shown for the three locations in the training set. Posterior modes from single inferences are different from each other, and yet, they are shown in red for simplicity. Blue: Combined A, green: Combined B, and purple: Combined C.

Score distributions of the posterior samples for Ph. papatasi. Kernel density estimates for samples from 6 posterior modes (3 single + 3 combined inferences) are shown for the three locations in the training set. Posterior modes from single inferences are different from each other, and yet, they are shown in red for simplicity. Blue: Combined A, green: Combined B, and purple: Combined C. In Fig. 5, we present the match between surveillance data and model predictions with Combined A. We note that the median predictions for winter, despite being explicitly set to zero in the training set, are relatively low, but rarely zero in Fodele. This suggests that a continuous, yet minimal, sand fly activity might be present in the region during winter. One of the consistent predictions by all posterior modes (Supplementary Fig. S1) is that the beginning of the sand fly high season may be several months earlier in Fodele than the beginning of the surveillance period (April).
Figure 5

Agreement between Ph. papatasi surveillance data and model predictions with Combined A. Number of adult sand flies collected from Geri, Steni, and Fodele are plotted as bars in (a–c), respectively. Solid lines and shaded areas indicate the median and the 95% range of the predictions. Female and male adult sand flies are marked in red and blue, respectively.

Agreement between Ph. papatasi surveillance data and model predictions with Combined A. Number of adult sand flies collected from Geri, Steni, and Fodele are plotted as bars in (a–c), respectively. Solid lines and shaded areas indicate the median and the 95% range of the predictions. Female and male adult sand flies are marked in red and blue, respectively. An interesting pattern in the surveillance data from Fodele is a peak occurrence both in males and females in June in both years of observations. Neither the single-location inference nor the combined ones fully explain these peaks for both sexes. The mismatch is indicative of a biological mechanism not accounted for in this version of the model. This could be, for instance, an alternative type of dormancy, such as obligate diapause, where termination is spontaneous and independent of external stimuli. Seasonal changes in breeding sites or feeding patterns could also result in the observed peaks. The understanding of this mechanism requires further iterations of the model, which will be the subject of continued research.

Surveillance data sufficiently informs predictive models

In this context, we constrained environmental dependence with model structure and longitudinal surveillance data. Although not included in the likelihood nor considered as a prior probability, we assessed whether laboratory-driven life history parameters can be inferred correctly for Ph. papatasi. In Table 4, we display the average matching scores () and their standard deviations for each posterior mode. In Supplementary Fig. S2, we plot all functional forms of environmental dependence included in the model, and in Fig. 6, four discerning dependences inferred for Fodele where the posterior modes are the least dispersed.
Table 4

Inference of environmental dependence using longitudinal surveillance data.

TypeAreaΠ(θ)
SingleGeri77596.15 ± 134275.64
SingleSteni12162.72 ± 3026.91
SingleFodele16415.80 ± 2494.93
Combined AGeri1366.70 ± 211.05
Steni2342.83 ± 468.60
Fodele1871.73 ± 341.43
Combined BGeri1032.60 ± 196.06
Steni1333.26 ± 243.98
Fodele1011.27 ± 178.32
Combined CGeri14050.91 ± 1695.24
Steni13706.81 ± 1574.20
Fodele14730.04 ± 1722.69
Figure 6

Representative functional forms of environmental dependence inferred for Fodele. In (a–c), temperature-dependent development times of eggs, larvae, and pupae, respectively, are plotted in units of days. In (d), temperature-dependent daily egg survival probabilities are plotted. Solid lines indicate the median and the shaded areas indicate the 95% range of the values predicted by a posterior mode. Circles and vertical lines represent the average and the standard deviation, respectively, of the experimental data (see sec. Experimental datasets).

Inference of environmental dependence using longitudinal surveillance data. Representative functional forms of environmental dependence inferred for Fodele. In (a–c), temperature-dependent development times of eggs, larvae, and pupae, respectively, are plotted in units of days. In (d), temperature-dependent daily egg survival probabilities are plotted. Solid lines indicate the median and the shaded areas indicate the 95% range of the values predicted by a posterior mode. Circles and vertical lines represent the average and the standard deviation, respectively, of the experimental data (see sec. Experimental datasets). We found that combined inferences that fit each location equally (Combined A and B) closely resemble the environmental dependence determined with laboratory experiments. The analysis indicated that although they yield more constrained posterior modes, larger datasets from single locations do not necessarily yield better predictions of environmental dependence. In addition, the unbalanced fit among multi-location inferences, Combined C, which leans towards Fodele, does not agree well with laboratory data. In the framework of this study, we obtained the best-possible agreement by omitting the likelihood and performing inference only on the prior probability. When we compared this best fit with the agreement scores () of Combined A and B, we found that although Combined A and B demonstrate a close match, they do not overlap with the best fit (Fig. 7). The evident divergence from the maximum agreement indicates that a certain degree of deviation from the experimentally-derived values is needed in order to explain the surveillance data. Accordingly, we argue that direct use of experimental results in mathematical models of population dynamics might be misleading.
Figure 7

Agreement of Combined A and B with laboratory-derived parameter values. Kernel density estimates of the parameter agreement scores () are plotted for the three versions of Combined A and B along with the best fit, which was obtained by excluding the likelihood. Frequency of the best fit distribution (black) is scaled down by a factor of 0.05 to aid in visualisation.

Agreement of Combined A and B with laboratory-derived parameter values. Kernel density estimates of the parameter agreement scores () are plotted for the three versions of Combined A and B along with the best fit, which was obtained by excluding the likelihood. Frequency of the best fit distribution (black) is scaled down by a factor of 0.05 to aid in visualisation. In order to assess the information content of the surveillance data, we performed parameter sensitivity analysis using a multivariate Gaussian approximation around the maximum of each posterior mode[18,40]. We calculated the sensitivity matrix as the inverse of the variation of each Gaussian, and normalised with the range of the search domain for each parameter (see Table 1). We report, in Fig. 8, the logarithm of the n diagonal element of as the sensitivity of the n parameter.
Figure 8

Parameter sensitivity analysis of posterior modes for Ph. papatasi. Color gradient shows the logarithm of the parameter sensitivities where darker colours indicate higher values. Parameters are sorted from left to right with a descending order with respect to the sum of their sensitivities in posterior modes Combined A and B.

Parameter sensitivity analysis of posterior modes for Ph. papatasi. Color gradient shows the logarithm of the parameter sensitivities where darker colours indicate higher values. Parameters are sorted from left to right with a descending order with respect to the sum of their sensitivities in posterior modes Combined A and B. As a result, we found that surveillance datasets constrain parameters in different ways and sugest alternative environmental dependences except from Combined A and B. According to Table 4, Combined A and B exhibit the highest resemblance to the laboratory data, therefore, are more likely to suggest biologically plausible environmental dependences. Our results reveal that, parameters consistently associated with high sensitivity are (i) minimum duration of the gonotropic cycle (), (ii) median developmental temperature for pupae (), (iii) probability of a female progeny (), (iv) breeding site habitability (), and (v) capture probability (). In addition, for Combined A and B, sensitivity is also high for minimum mean development time of pupae () and larvae (), and maximum daily survival probability of adults (). On the other hand, the relative humidity threshold (), the temperature threshold for dormancy (), and initial conditions do not have a high impact on model dynamics. High sensitivity is an a priori expectation for , as it is strongly informed by the presence of both female and male sand fly counts in surveillance datasets. High sensitivity was also expected for , being directly linked with multiple biological processes such as larva and pupa development, and fecundity. Lastly, high sensitivity for was expected due to the direct effect on the observed number of sand flies. We note that the sensitivity measure we employed depends largely on the characterisation of the posterior distribution. In this context, sensitivity is a measure of impact of a parameter on the dynamics of male and female sand flies. Therefore, the low impact parameters, such as temperature and relative humidity threshold for dormancy, and initial conditions are relatively less potent in determining adult sand fly dynamics according to most of the posterior modes. On the other hand, parameters controlling pupa development and adult survival have high sensitivities due to being directly related to adult population size. An interesting finding is the high sensitivity of the gonotrophic cycle duration that is likely due to the negative effect of egg laying on the survival of adult sand flies.

Multi-location inferences yield predictive models

We tested model predictability over the surveillance dataset from Adana. The test set contains sightings of Ph. papatasi, Ph. tobbi, and the Ph. major s.l. species complex (corresponding to either Ph. neglectus or Ph. syriacus in Adana) for the two consecutive years 2011–2012 (see sec. Surveillance and environmental datasets). As a result, we found that the single-location inferences and Combined C do not yield useful predictions for any of the species (see Supplementary Fig. S3). The majority of model predictions using these posterior modes comprise very few or no comparable trajectories with the observations (Table 5). Inclusion of laboratory data as a prior probability for single-location inferences elevated the success rate only for the case of Ph. papatasi in Steni (raised to from ). Despite this, the fit did not improve considerably (attenuated to 2301.96 ± 2242.85 from 1252.01 ± 153.29).
Table 5

Model validation over the test set from Adana.

SpeciesTypeArea ρ \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathscr{S}}$$\end{document}S(δ|θ)
Ph. major s.l.SingleFodele0
Ph. tobbiSingleSteni17.5%1358.76 ± 1273.42
Ph. papatasiSingleGeri0
SingleSteni0.6%1252.01 ± 153.29
SingleFodele0.7%4423.84 ± 161.76
Combined AGeri0.2%2221.41 ± 124.49
Steni99.3%12522.04 ± 12859.06
Fodele96.6%744.00 ± 478.24
Combined BGeri0
Steni91.9%5654.38 ± 7651.19
Fodele62.4%1250.66 ± 740.88
Combined CGeri0
Steni0
Fodele0
Model validation over the test set from Adana. We note that the test set does not contain many observations of Ph. major s.l., which led us to assume that low abundance may indicate low suitability for the establishment of any species in the complex including Ph. neglectus. In turn, the model may correctly predict low suitability. Although the success rate for Ph. tobbi is low, we found that the successful trajectories (17.5%) agree well with the observations. We note that Ph. neglectus and Ph. tobbi were observed each in only one location in the training set; therefore, more data from alternative locations are needed to draw reliable predictions for these species. We found that Combined A and B — the multi-location inferences that demonstrate a balanced fit to each location — successfully predict Ph. papatasi abundance (Fig. 9). We obtained high variability in the predictions due to the intrinsic stochasticity of the model and the uncertainty in parameter values. Despite this, we calculated high success rates. We followed the single-location inference procedure to identify an optimum posterior mode for the Ph. papatasi in the test set. We found that the median trajectories with Combined A and B are close to the median of the optimum posterior mode endorsing the reliability of the predictions.
Figure 9

Model validation over Adana, Turkey. Median (solid lines — red and blue) and the 95% range (shaded regions) of predictions using Combined A for Fodele (left) and Combined B for Steni (right) over the test set. The black lines indicate the median trajectories (black lines) obtained using the optimum posterior mode inferred for the region.

Model validation over Adana, Turkey. Median (solid lines — red and blue) and the 95% range (shaded regions) of predictions using Combined A for Fodele (left) and Combined B for Steni (right) over the test set. The black lines indicate the median trajectories (black lines) obtained using the optimum posterior mode inferred for the region. We note that Combined A and B each have different variations for the three locations in the training set. It appears that the main difference between each variation is the overall vector abundance, which is mainly controlled by breeding site habitability, (Fig. 10). Despite the differences between Combined A and B in the remaining parameters, we observed that the variations with between 3.5 and 4.5 explain better the observed abundance in the test set.
Figure 10

Marginal distributions of breeding site habitability, , in different variations of Combined A and B. Kernel density estimates of the marginal distributions of are plotted for Combined A and B in Geri, Steni, and Fodele.

Marginal distributions of breeding site habitability, , in different variations of Combined A and B. Kernel density estimates of the marginal distributions of are plotted for Combined A and B in Geri, Steni, and Fodele.

Conclusion

Comprehensive studies concerning the environmental dependence of vector species require considerable effort and the availability of suitable rearing facilities[41]. For this reason, vital information required for predictive modelling is scarce for many important vectors. It is clear that flexible strategies exploiting already available or relatively easily obtainable data are needed. Vector surveillance, which is usually performed by authorities for monitoring and control, provides a viable source of information. Here, we demonstrate that predictive models can be constructed by using field observations, while laboratory studies inform model structure and impose known biological constraints. The model we presented is structurally informed by the known biological mechanism and the environment-dependence of well-studied sand fly species. Parameters of the model are configured to represent a specific vector species by exploiting various amounts of surveillance data. Our analyses corroborate that in order to construct predictive models, surveillance data collected from multiple independent sites are needed. In addition, a posterior sample of parameters should demonstrate a balanced fit by representing each site adequately to yield a predictive model. As a result, we observed that Ph. papatasi dynamics in Adana, Turkey, can be predicted with parameters inferred using surveillance data from Crete and Cyprus. We note that the surveillance areas considered in this study share many environmental characteristics. Although our results suggest that the model may be applicable to other regions with similar characteristics, large-scale regional and global applications should take account of diverse surveillance datasets comprising observations from different environmental conditions. Our results suggest that the most predictive parameter configurations are also informative about the environmental dependence of Ph. papatasi. Although laboratory-derived life-history parameters were not explicitly imposed in the form of prior probability, the inferred parameter values agree well with the experimentally-determined values. Regardless of the agreement, we obtained large variations especially in the lower and upper bounds of temperature dependence. This is indicative of the absence of observations during such extreme conditions, which can be compensated by constraining the corresponding parameters explicitly with regards to expert knowledge. We note that humidity, despite being indicated as a key environmental dependence[11,24,25], was not informed adequately by the surveillance data — only a substantially low threshold was identified as critical. It is possible that a more biologically-tailored model structure and more detailed observational datasets are needed to provide the required resolution to discern humidity dependence. We suggest the inclusion of day-and-night temperature and humidity differences as a possible next step since such data were reported to correlate strongly with vector abundance[42]. We also note the possibility of the existence of a posterior mode with a higher probability and predictive capacity than the ones identified in this study. Further improvement of the model and the identification of predictive models for Ph. neglectus and Ph. tobbi will be the subject of future research. An additional factor contributing to the variance between observations and predictions is the assumed range of applicability of our model. The distance between collection sites and the closest available meteorological stations is a limiting factor. Furthermore, it is not trivial to include microclimatic conditions due to the difficulty of obtaining or predicting such information. At present, the most detailed global climate simulations are provided at 0.25 degree resolution, which corresponds to about 25 km near the equator[43]. With the improvement of global and regional climate models and the public availability of such data, we expect that the accuracy of vector and disease dynamics predictions will improve considerably. It is important to note that none of the inferred parameter values, despite some demonstrating high predictive ability, provides the maximum possible fit to the laboratory-derived values. This indicates that extra care is needed when applying laboratory data to a larger scale. Experiments reflect vector biology under controlled environmental conditions. The scale of such studies may be related, to some extent, to the microclimatic environment around breeding sites. The scale of a typical field study, however, is several square kilometers at best. The methodology we presented offers a potential solution to this problem by allowing environmental dependence to be informed, and therefore adjusted to data collected in the filed. Supplementary Figures S1 S2 S3 Supplementary file S1
  31 in total

Review 1.  The biology and control of phlebotomine sand flies.

Authors:  R Killick-Kendrick
Journal:  Clin Dermatol       Date:  1999 May-Jun       Impact factor: 3.541

2.  Colonization of Phlebotomus neglectus (Diptera: Psychodidae), the major vector of visceral leishmaniasis in Greece.

Authors:  B Chaniotis; I Spyridaki; E Scoulika; M Antoniou
Journal:  J Med Entomol       Date:  2000-05       Impact factor: 2.278

3.  Serological and entomological survey in a zoonotic visceral leishmaniasis focus of North Central Anatolia, Turkey: Corum province.

Authors:  Hatice Ertabaklar; Seray Ozensoy Toz; Aysegul Taylan Ozkan; Samiye Rastgeldi; I Cuneyt Balcioglu; Yusuf Ozbel
Journal:  Acta Trop       Date:  2005-03       Impact factor: 3.112

4.  Comparative demography of the sand fly Phlebotomus papatasi (Diptera: Psychodidae) at constant temperatures.

Authors:  Ozge Erisoz Kasap; Bulent Alten
Journal:  J Vector Ecol       Date:  2006-12       Impact factor: 1.671

5.  The epidemic threshold of vector-borne diseases with seasonality: the case of cutaneous leishmaniasis in Chichaoua, Morocco.

Authors:  Nicolas Bacaër; Souad Guernaoui
Journal:  J Math Biol       Date:  2006-07-05       Impact factor: 2.259

Review 6.  The increase in risk factors for leishmaniasis worldwide.

Authors:  P Desjeux
Journal:  Trans R Soc Trop Med Hyg       Date:  2001 May-Jun       Impact factor: 2.184

7.  Laboratory estimation of degree-day developmental requirements of Phlebotomus papatasi (Diptera: Psychodidae).

Authors:  Ozge Erisoz Kasap; Bulent Alten
Journal:  J Vector Ecol       Date:  2005-12       Impact factor: 1.671

8.  Cutaneous leishmaniasis caused by Leishmania infantum transmitted by Phlebotomus tobbi.

Authors:  Milena Svobodová; Bulent Alten; Lenka Zídková; Vít Dvorák; Jitka Hlavacková; Jitka Mysková; Veronika Seblová; Ozge Erisoz Kasap; Asli Belen; Jan Votýpka; Petr Volf
Journal:  Int J Parasitol       Date:  2008-08-14       Impact factor: 3.981

9.  Biology of Phlebotomus papatasi (Diptera: Psychodidae) in the laboratory.

Authors:  I Chelbi; E Zhioua
Journal:  J Med Entomol       Date:  2007-07       Impact factor: 2.278

Review 10.  Spread of vector-borne diseases and neglect of Leishmaniasis, Europe.

Authors:  Jean-Claude Dujardin; Lenea Campino; Carmen Cañavate; Jean-Pierre Dedet; Luigi Gradoni; Ketty Soteriadou; Apostolos Mazeris; Yusuf Ozbel; Marleen Boelaert
Journal:  Emerg Infect Dis       Date:  2008-07       Impact factor: 6.883

View more
  4 in total

1.  Species diversity and molecular insights into phlebotomine sand flies in Sardinia (Italy)-an endemic region for leishmaniasis.

Authors:  S Carta; D Sanna; F Scarpa; Antonio Varcasia; L Cavallo; M P Meloni; C Tamponi; P A Cabras; G Dessi; M Casu; V D Tarallo; D Otranto; A Scala
Journal:  Parasitol Res       Date:  2019-12-07       Impact factor: 2.289

2.  Climate change impacts on infectious diseases in the Eastern Mediterranean and the Middle East (EMME)-risks and recommendations.

Authors:  Shlomit Paz; Azeem Majeed; George K Christophides
Journal:  Clim Change       Date:  2021-12-30       Impact factor: 4.743

Review 3.  Sand flies: Basic information on the vectors of leishmaniasis and their interactions with Leishmania parasites.

Authors:  Pedro Cecílio; Anabela Cordeiro-da-Silva; Fabiano Oliveira
Journal:  Commun Biol       Date:  2022-04-04

4.  A dynamically structured matrix population model for insect life histories observed under variable environmental conditions.

Authors:  Kamil Erguler; Jacob Mendel; Dušan Veljko Petrić; Mina Petrić; Mihaela Kavran; Murat Can Demirok; Filiz Gunay; Pantelis Georgiades; Bulent Alten; Jos Lelieveld
Journal:  Sci Rep       Date:  2022-07-08       Impact factor: 4.996

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.