| Literature DB >> 31261713 |
William H Kessler1, Claudia Ganser2, Gregory E Glass3.
Abstract
The lone star (Amblyomma americanum), black-legged (Ixodes scapularis) and American dog ticks (Dermacentor variabilis) are species of great public health importance as they are competent vectors of several notable pathogens. While the regional distributions of these species are well characterized, more localized distribution estimates are sparse. We used records of field collected ticks and an ensemble modeling approach to predict habitat suitability for each of these species in Florida. Environmental variables capturing climatic extremes were common contributors to habitat suitability. Most frequently, annual precipitation (Bio12), mean temperature of the driest quarter (Bio9), minimum temperature of the coldest month (Bio6), and mean Normalized Difference Vegetation Index (NDVI) were included in the final models for each species. Agreement between the modeling algorithms used in this study was high and indicated the distribution of suitable habitat for all three species was reduced at lower latitudes. These findings are important for raising awareness of the potential for tick-borne pathogens in Florida.Entities:
Keywords: American dog; Black-legged; Ixodid ticks; Lone star; distribution; ensemble; geography; modeling; niche
Year: 2019 PMID: 31261713 PMCID: PMC6681331 DOI: 10.3390/insects10070190
Source DB: PubMed Journal: Insects ISSN: 2075-4450 Impact factor: 3.139
Number of presence/absence observations used for modeling each species. The total number (n = 560) and geographic location of observations is conserved for each species, with only observed occurrence (presence) varying.
| Species | Total | Presence | Absence |
|---|---|---|---|
|
| 560 | 98 | 462 |
|
| 560 | 65 | 495 |
|
| 560 | 30 | 530 |
Deviance explained by considered variables and rank via univariate generalized additive models (GAM) for each species. NDVI is the Normalized Difference Vegetation Index, a proxy for vegetation health.
| Variable |
|
|
| |||
|---|---|---|---|---|---|---|
| Rank | Deviance Explained | Rank | Deviance Explained | Rank | Deviance Explained | |
| Mean Diurnal Range (Bio2) | ---- | ---- | ---- | ---- | 1 | 0.137 |
| Isothermality (Bio3) | 19 | 0.078 | ---- | ---- | 23 | 0.006 |
| Tmax of Warmest Month (Bio5) | ---- | ---- | ---- | ---- | ---- | ---- |
| Tmin of Coldest Month (Bio6) | ---- | ---- | 3 | 0.16 | ---- | ---- |
| Tmean of Wettest Quarter (Bio8) | ---- | ---- | 1 | 0.201 | 14 | 0.025 |
| Tmean of Driest Quarter (Bio9) | 8 | 0.169 | 11 | 0.102 | 6 | 0.096 |
| Tmean of Coldest Quarter (Bio11) | 1 | 0.253 | ---- | ---- | ---- | ---- |
| Annual Precipitation (Bio12) | 6 | 0.226 | 2 | 0.163 | 8 | 0.067 |
| Precipitation of Wettest Month (Bio13) | 5 | 0.232 | 9 | 0.106 | 10 | 0.047 |
| Precipitation of Driest Month (Bio14) | 13 | 0.151 | 21 | 0.026 | 22 | 0.007 |
| Precipitation Seasonality (Bio15) | 14 | 0.15 | ---- | ---- | 9 | 0.054 |
| Precipitation of Driest Quarter (Bio17) | ---- | ---- | 10 | 0.103 | ---- | ---- |
| Curvature (curv) | 23 | 0.027 | 25 | 0.004 | 26 | 0.0004 |
| Depth to Water (detwt) | ---- | ---- | 23 | 0.011 | 16 | 0.013 |
| Distance to Water (distwater) | 24 | 0.026 | 24 | 0.007 | 11 | 0.039 |
| Elevation (DEM) | 26 | 0.012 | 26 | 0.003 | 21 | 0.01 |
| Maximum NDVI (NDVImax) | ---- | ---- | ---- | ---- | 15 | 0.02 |
| Mean NDVI (NDVImean) | 10 | 0.167 | 16 | 0.059 | 20 | 0.01 |
| Minimum NDVI (NDVImin) | ---- | ---- | 18 | 0.044 | 24 | 0.005 |
Tmax = Maximum Temperature in Celsius; Tmean = Mean Temperature in Celsius; Tmin = Minimum Temperature in Celsius; NDVI = Normalized Difference Vegetation Index.
Variables included in final models for each species. The machine learning algorithms consider all variables initially, and unimportant variables are pruned back. As a result, all considered variables are listed for these models, though the contribution of some variables is quite small.
| Species | Final Model Variables |
|---|---|
|
| |
| LR | Bio13 + Bio12 + Bio9 + NDVImean + Bio3 + curv + distwater + dem + shrub |
| BRT | Bio12 + NDVImean + Bio13 + Bio9 + Bio14 + Bio15 + distwater + Bio3 + curv + dem + NDVImin + Bio11 + detwt + forest + wetlands + shrub + grass |
| RF | Bio11 + Bio13 + Bio12 + Bio9 + NDVImean + Bio14 + Bio15 + dtwt + Bio3 + curv + distwater + NDVImin + DEM + forest + grass + shrub + wetlands |
| MARS | Bio12 + NDVImean + Bio14 + Bio13 + distwater + DEM + Bio3 + NDVImin |
| MaxEnt | Bio12 + Bio9 + NDVImean + distwater |
|
| |
| LR | Bio12 + Bio6 + distwater + curv + shrub + wetlands |
| BRT | Bio12 + Bio13 + Bio17 + Bio8 + NDVImean + Bio9 + NDVImin + distwater + Curv + Bio14 + DEM + Bio6 + detwt + wetlands + shrub + forest + grass |
| RF | Bio8 + Bio12 + Bio6 + Bio13 + Bio17 + Bio9 + NDVImean + NDVImin + Bio14 + detwt + distwater + curv + DEM + forest + grass + shrub + wetlands |
| MARS | Bio12 + Bio6 + Bio8 + Bio13 + curv |
| MaxEnt | Bio12 + Shrub + Bio6 + distwater + Bio13 |
|
| |
| LR | Bio2 + Bio9 + Bio15 + Bio13 + NDVImax + DEM + Bio14 + curv |
| BRT | Bio12 + Bio15 + Bio9 + NDVImax + Bio3 + Bio13 + Bio2 + distwater + NDVImin + Bio14 + DEM + curv + Bio8 + forest + detwt + shrub + grass + wetlands |
| RF | Bio2 + Bio9 + Bio12 + Bio15 + Bio13 + distwater + Bio8 + NDVImax + detwt + DEM + Bio14 + Bio3 + NDVImin + curv + forest + grass + shrub + wetlands |
| MARS | Bio2 + Bio13 + Bio15 + Bio3 + Bio15 + Bio3 + detwt + NDVImax + curv + NDVImin |
| MaxEnt | Bio12 + NDVImax + forest + grass |
LR = Logistic Regression; BRT = Boosted Regression Trees; RF = Random Forests; MARS = Multivariate Adaptive Regression Splines; MaxEnt = Maximum Entropy.
Performance metrics associated with each of the five modeling algorithms for each species. AUC ranges correspond to the upper and lower bounds of a 95% CI calculated from across the 10-fold cross validation.
| Threshold 1 | AUC 2 (95% C.I.) | s.e. 3 | Accuracy 4 (95% C.I.) | Kappa 5 | Sensitivity 6 | Specificity 7 | Positive Predictive Value 8 | Negative Predictive Value 9 | |
|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||
| Logistic | 0.18 | 0.90 (0.87–0.93) | 0.0152 | 0.828 (0.794–0.859) | 0.525 | 0.827 | 0.829 | 0.506 | 0.957 |
| BRT | 0.26 | 0.92 (0.89–0.95) | 0.0143 | 0.934 (0.91–0.953) | 0.792 | 0.939 | 0.933 | 0.748 | 0.986 |
| RF | 0.23 | 0.92 (0.89–0.95) | 0.0151 | 0.916 (0.89–0.938) | 0.749 | 0.959 | 0.907 | 0.686 | 0.991 |
| MARS | 0.3 | 0.92 (0.89–0.94) | 0.0132 | 0.889 (0.86–0.914) | 0.660 | 0.847 | 0.898 | 0.638 | 0.965 |
| MaxEnt | 0.13 | 0.89 (0.85–0.92) | 0.0168 | 0.617 (0.575–0.658) | 0.276 | 0.959 | 0.544 | 0.309 | 0.984 |
|
| |||||||||
| Logistic | 0.14 | 0.83 (0.78–0.88) | 0.0254 | 0.923 (0.898–0.944) | 0.693 | 0.923 | 0.923 | 0.612 | 0.989 |
| BRT | 0.16 | 0.88 (0.83–0.92) | 0.0227 | 0.794 (0.758–0.827) | 0.367 | 0.785 | 0.796 | 0.336 | 0.966 |
| RF | 0.09 | 0.90 (0.86–0.93) | 0.0188 | 0.878 (0.848–0.904) | 0.593 | 1.000 | 0.862 | 0.489 | 1.000 |
| MARS | 0.15 | 0.84 (0.78–0.89) | 0.0279 | 0.875 (0.844–0.901) | 0.524 | 0.785 | 0.887 | 0.477 | 0.969 |
| MaxEnt | 0.24 | 0.85 (0.80–0.90) | 0.0241 | 0.556 (0.514–0.598) | 0.166 | 0.923 | 0.508 | 0.198 | 0.980 |
|
| |||||||||
| Logistic | 0.06 | 0.82 (0.77–0.87) | 0.0246 | 0.809 (0.773–0.84) | 0.244 | 0.800 | 0.809 | 0.192 | 0.986 |
| BRT | 0.1 | 0.82 (0.76–0.88) | 0.0301 | 0.952 (0.931–0.968) | 0.659 | 0.967 | 0.951 | 0.527 | 0.998 |
| RF | 0.04 | 0.83 (0.78–0.88) | 0.0265 | 0.86 (0.829–0.888) | 0.383 | 1.000 | 0.853 | 0.278 | 1.000 |
| MARS | 0.14 | 0.77 (0.69–0.86) | 0.0424 | 0.916 (0.89–0.938) | 0.443 | 0.733 | 0.926 | 0.361 | 0.984 |
| MaxEnt | 0.14 | 0.76 (0.69–0.83) | 0.0355 | 0.358 (0.318–0.399) | 0.044 | 0.967 | 0.323 | 0.075 | 0.994 |
1 Continuous probability score used to delineate presence from absence for consensus predictions (sensitivity = specificity); 2 area under the Receiver Operating Characteristics (ROC) curve; 3 standard error of AUC; 4 ratio of sum of correctly predicted positives and negatives to the sample size; 5 Kohen’s Kappa, a measure of agreement that accounts for agreement due to chance; 6 true positive rate; 7 true negative rate; 8 proportion of positives that are true positives; 9 proportion of negatives that are true negatives.
Figure 1(A) Ensemble prediction of suitable habitat for A. americanum. Hotter colors indicate higher agreement in the number of models predicting suitability of an area. (B–F) Continuous suitability scores for the five modeling algorithms: LR, BRT, RF, MARS, MaxEnt, respectively. A core region in the north-central region of the state shows consensus across all five algorithms. Suitable areas in the southern part of the state are sparser, with lower model agreement.
Figure 2(A) Ensemble prediction of suitable habitat for I. scapularis. Hotter colors indicate higher agreement on habitat suitability across models for a given area. (B–F) Continuous suitability scores for the five modeling algorithms: LR, BRT, RF, MARS, MaxEnt, respectively. Much of the northeastern part of the state is deemed suitable by the majority of the models. Southern areas of predicted suitability show lower consensus.
Figure 3(A) Ensemble prediction of suitable habitat for D. variabilis. Hotter colors indicate higher agreement in habitat suitability. (B–F) Continuous suitability scores for the five modeling algorithms: LR, BRT, RF, MARS, MaxEnt, respectively. Overall there is lower consensus on suitable habitat across the state. However, there is greater model agreement in the southern part of the state for this species than for black-legged or lone star ticks.