| Literature DB >> 33168841 |
Lene Jung Kjær1, Kirstine Klitgaard2, Arnulf Soleng3, Kristin Skarsfjord Edgar3, Heidi Elisabeth H Lindstedt3, Katrine M Paulsen4,5, Åshild Kristine Andreassen4, Lars Korslund6, Vivian Kjelland6,7, Audun Slettan6, Snorre Stuen8, Petter Kjellander9, Madeleine Christensson9, Malin Teräväinen9, Andreas Baum10, Laura Mark Jensen11, René Bødker11.
Abstract
Tick-borne pathogens cause diseases in animals and humans, and tick-borne disease incidence is increasing in many parts of the world. There is a need to assess the distribution of tick-borne pathogens and identify potential risk areas. We collected 29,440 tick nymphs from 50 sites in Scandinavia from August to September, 2016. We tested ticks in a real-time PCR chip, screening for 19 vector-associated pathogens. We analysed spatial patterns, mapped the prevalence of each pathogen and used machine learning algorithms and environmental variables to develop predictive prevalence models. All 50 sites had a pool prevalence of at least 33% for one or more pathogens, the most prevalent being Borrelia afzelii, B. garinii, Rickettsia helvetica, Anaplasma phagocytophilum, and Neoehrlichia mikurensis. There were large differences in pathogen prevalence between sites, but we identified only limited geographical clustering. The prevalence models performed poorly, with only models for R. helvetica and N. mikurensis having moderate predictive power (normalized RMSE from 0.74-0.75, R2 from 0.43-0.48). The poor performance of the majority of our prevalence models suggest that the used environmental and climatic variables alone do not explain pathogen prevalence patterns in Scandinavia, although previously the same variables successfully predicted spatial patterns of ticks in the same area.Entities:
Year: 2020 PMID: 33168841 PMCID: PMC7652892 DOI: 10.1038/s41598-020-76334-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Individual prevalence. Individual prevalence at the 50 sample sites for the 15 pathogens found within the study region. Individual prevalence was calculated using method 3 from Cowling et al.[77] that assumes 100% test sensitivity and specificity and fixed pool size. Clusters were analysed using SatScan on pool prevalence, and only significant clusters with the maximum Gini coefficient are depicted. Pool prevalence was calculated as positive pools out of the total number of pools at each site, whereas relative risk is calculated by SatScan as the estimated risk within a cluster divided by the estimated risk outside the cluster.
Figure 2Pathogen prevalence ranges. Percentage of sites (50 total) having different ranges of individual pathogen prevalence for the 15 pathogens found within the study region.
Pearson's chi square test for equality of proportions without continuity correction, testing for significant differences in pathogen prevalence (pool prevalence) between the 50 sites in Denmark, Norway and Sweden.
| Pathogen | χ2 | Df | |
|---|---|---|---|
| 83.27 | 49 | < 0.01 | |
| 655.92 | 49 | < 0.0001 | |
| 241.57 | 49 | < 0.0001 | |
| 399.29 | 49 | < 0.0001 | |
| 78.86 | 49 | < 0.01 | |
| 209.84 | 49 | < 0.0001 | |
| 364.36 | 49 | < 0.0001 | |
| 348.04 | 49 | < 0.0001 | |
| 61.04 | 49 | 0.12 | |
| 46.67 | 49 | 0.57 | |
| 65.69 | 49 | 0.056 | |
| 555.36 | 49 | < 0.0001 | |
| 483.17 | 49 | < 0.0001 |
Only tick-borne pathogens identified from I. ricinus ticks collected from Denmark, Norway and Sweden, 2016 are depicted (excluding B. burgdorferi s.l. and SFG rickettsiae).
Average and range of pathogen prevalence (estimated individual nymph prevalence) in southern Scandinavia.
| Pathogen | Average (%) | Range (%) |
|---|---|---|
| 0.4 | 0.0–1.8 | |
| 13.0 | 3.5–28.8 | |
| 7.9 | 0.3–28.8 | |
| 0.9 | 0.0–5.0 | |
| 2.9 | 0.2–9.1 | |
| 0.01 | 0.0–0.3 | |
| 1.2 | 0.0–3.5 | |
| 1.4 | 0.0–11.3 | |
| 1.6 | 0.0–6.7 | |
| 0.2 | 0.0–0.7 | |
| 0.01 | 0.0–0.2 | |
| 0.3 | 0.0–1.1 | |
| 3.0 | 0.0–12.9 | |
| SFG rickettsiae | 5.8 | 0.2–20.6 |
| 5.8 | 0.0–22.0 |
Only the 15 tick-borne pathogens identified from I. ricinus ticks collected from Denmark, Norway and Sweden, 2016 are depicted (including B. burgdorferi s.l. and SFG rickettsiae).
Global Moran’s I test, testing for spatial autocorrelation of pathogen prevalence (pool prevalence) between the 50 sites in Denmark, Norway and Sweden.
| Pathogen | Moran’s index | z-score | |
|---|---|---|---|
| − 0.01 | 0.08 | 0.93 | |
| 0.34 | 4.85 | < 0.0001 | |
| 0.10 | 0.62 | 0.10 | |
| − 0.03 | − 0.07 | 0.94 | |
| − 0.02 | 0.03 | 0.98 | |
| − 0.08 | − 0.79 | 0.43 | |
| − 0.08 | 49 | 0.33 | |
| 0.01 | 0.43 | 0.67 | |
| − 0.07 | − 0.63 | 0.53 | |
| 0.14 | 2.45 | 0.014 | |
| − 0.03 | − 0.14 | 0.89 | |
| 0.36 | 5.11 | < 0.0001 | |
| 0.47 | 6.68 | < 0.0001 |
Only tick-borne pathogens identified from I. ricinus ticks collected from Denmark, Norway and Sweden, 2016 are depicted (excluding B. burgdorferi s.l. and SFG rickettsiae).
Comparison of the best SVR and BRT models (lowest NRMSE) for each of the 13 pathogens (excluding B. microti and B. lusitaniae).
| Pathogen | SVM model type | R2 | NRMSE | # variables | BRT model | R2 | NRMSE |
|---|---|---|---|---|---|---|---|
| Polynomial kernel | 0.07 | 1.01 | 65a | Poisson | 0.06 | 1.17 | |
| Linear kernel | 0.22 | 0.89 | 65 | Gaussian | 0.05 | 1.05 | |
| Polynomial kernel | 0.24 | 0.89 | 40 | Gaussian | 0.23 | 0.89 | |
| Linear kernel | 0.07 | 0.96 | 65a | Gaussian | 0.11 | 0.96 | |
| Polynomial kernel | 0.03 | 0.99 | 50 | Gaussian | 0.05 | 1.25 | |
| Radial kernel | 0.00 | 0.97 | 65a | Gaussian | 0.15 | 1.29 | |
| Radial kernel | 0.21 | 1.03 | 65a | Gaussian | 0.01 | 1.10 | |
| Radial kernel | 0.19 | 0.90 | 5 | Gaussian | 0.04 | 1.04 | |
| Polynomial kernel | 0.08 | 0.95 | 20 | Gaussian | 0.02 | 1.19 | |
| Polynomial kernel | 0.17 | 0.93 | 30 | Gaussian | 0.17 | 0.92 | |
| Polynomial kernel | 0.43 | 0.75 | 10 | Gaussian | 0.29 | 0.84 | |
| SFG rickettsiae | Linear kernel | 0.45 | 0.72 | 5 | Gaussian | 0.40 | 0.77 |
| Linear kernel | 0.48 | 0.74 | 10 | Gaussian | 0.41 | 0.76 |
NRMSE is the normalized root-mean-square error (root-mean-square error divided by the standard deviation).
aBoth SVR and BRT methods transform factorial predictors into dummy variables, resulting in more predictors than the 56 predictors originally entered in the models.
Figure 3Observed versus predicted prevalence. Observed prevalence plotted against predicted prevalence (pool prevalence, both arc-sine-square-root transformed) for (a) N. mikurensis and (b) R. helvetica, based on LOOCV results from the final best SVR models. The red line is a linear regression line on the observed and predicted values, and the black line depicts a 1:1 relationship between observed and predicted values. NRMSE is the normalized root-mean-square error.
Figure 4Prevalence maps. Maps of predicted prevalence (pool prevalence, back-transformed from arc-sine-square-root) for (a) N. mikurensis and (b) R. helvetica from the final SVR models. Observed pool prevalence at the 50 study sites is also depicted. White areas are altitudes above 450 m or lakes, rivers and streams, or habitats other than forest or meadow (not predicted). The maps were created using ArcMap 10.6.1[78].