| Literature DB >> 35735632 |
William H Kessler1,2, Carrie De Jesus3, Samantha M Wisely1,3, Gregory E Glass1,2.
Abstract
Ensembles of Species Distribution Models (SDMs) represent the geographic ranges of pathogen vectors by combining alternative analytical approaches and merging information on vector occurrences with more extensive environmental data. Biased collection data impact SDMs, regardless of the target species, but no studies have compared the differences in the distributions predicted by the ensemble models when different sampling frameworks are used for the same species. We compared Ensemble SDMs for two important Ixodid tick vectors, Amblyomma americanum and Ixodes scapularis in mainland Florida, USA, when inputs were either convenience samples of ticks, or collections obtained using the standard protocols promulgated by the U.S. Centers for Disease Control and Prevention. The Ensemble SDMs for the convenience samples and standard surveys showed only a slight agreement (Kappa = 0.060, A. americanum; 0.053, I. scapularis). Convenience sample SDMs indicated A. americanum and I. scapularis should be absent from nearly one third (34.5% and 30.9%, respectively) of the state where standard surveys predicted the highest likelihood of occurrence. Ensemble models from standard surveys predicted 81.4% and 72.5% (A. americanum and I. scapularis) of convenience sample sites. Omission errors by standard survey SDMs of the convenience collections were associated almost exclusively with either adjacency to at least one SDM, or errors in geocoding algorithms that failed to correctly locate geographic locations of convenience samples. These errors emphasize commonly overlooked needs to explicitly evaluate and improve data quality for arthropod survey data that are applied to spatial models.Entities:
Keywords: Amblyomma americanum; Florida; Ixodes scapularis; Species Distribution Models SDMs; biased sampling; ensemble models; study design; ticks
Year: 2022 PMID: 35735632 PMCID: PMC9222110 DOI: 10.3390/diseases10020032
Source DB: PubMed Journal: Diseases ISSN: 2079-9721
Environmental variables selected for SDMs (Logistic, BRT MARS, RF, and MAXENT) for Convenience Sampling (Convenience; variables in italics) and Standard Survey (Standard; variables in bold) methods. Amblyomma americanum (top) and Ixodes scapularis (bottom).
|
| |||||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
| |||||
|
| Standard |
| Standard |
| Standard |
| Standard |
| Standard |
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
| ||||||
|
|
|
|
|
|
| ||||
|
|
| ||||||||
|
|
|
|
|
|
|
|
| ||
|
|
|
|
| ||||||
|
|
|
|
|
| |||||
|
|
| ||||||||
|
|
| ||||||||
|
|
|
| |||||||
|
|
|
|
|
|
|
|
| ||
|
|
|
|
|
| |||||
|
|
|
|
|
|
|
| |||
|
|
|
|
| ||||||
|
|
|
| |||||||
|
|
|
|
|
|
|
| |||
|
|
|
|
|
|
| ||||
|
|
|
|
|
| |||||
|
|
|
|
|
|
| ||||
|
|
|
|
|
| |||||
|
|
|
|
| ||||||
|
| |||||||||
|
|
|
|
|
| |||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||||||
|
|
|
| |||||||
|
|
|
|
|
| |||||
|
| |||||||||
|
|
|
| |||||||
|
|
|
|
|
|
|
| |||
|
|
|
|
|
|
|
|
| ||
|
|
|
|
|
|
|
| |||
|
|
|
|
|
|
|
| |||
|
|
|
|
| ||||||
|
|
|
|
|
| |||||
|
|
|
|
|
|
|
| |||
|
|
|
|
|
| |||||
|
|
| ||||||||
|
|
|
| |||||||
|
|
|
|
|
|
|
| |||
|
|
|
|
|
|
|
| |||
|
|
|
|
|
| |||||
|
|
|
|
|
|
|
|
| ||
|
|
|
|
|
|
|
| |||
|
|
|
|
|
| |||||
Figure 1Proportions of collection locations yielding I. scapularis or A. americanum as the number of SDMs identifying tick occurrence increased. Standard surveys used transects without ticks to generate the denominators. Convenience samples used randomized background locations to generate the denominators. I scapularis, convenience (n = 1958 locations); I. scapularis, standard (n = 560 locations); A. americanum, convenience (n = 583 locations); A. americanum standard (n = 560 locations). Standard surveys were more likely to yield ticks than did convenience sampling when there was concordance (SDM ≥ 3) among SDM’s (X2 = 53.60, 11 df, p < 0.0001 A. americanum; X2 = 326.25, 11 df, p < 0.0001 I. scapularis).
Total Suitable Area (sq km) by SDM agreement within the ensemble results for standard surveys (Standard) and convenience samples (Convenience). Numbers in brackets are the percentage of the mainland incorporated by the numbers of SDMs. Ensemble = 0 indicates that none of the SDMs predicted occurrence, while Ensemble = 5 indicates that all the SDMs predicted occurrence.
| Ensemble | Standard | Convenience | Standard | Convenience |
|---|---|---|---|---|
| 0 | 58,013 | 61,104 | 68,320 | 62,014 |
| 1 | 29,542 | 21,441 | 31,125 | 40,238 |
| 2 | 18,919 | 19,154 | 14,065 | 15,313 |
| 3 | 13,179 | 17,216 | 8710 | 12,195 |
| 4 | 13,279 | 11,303 | 10,384 | 7579 |
| 5 | 13,822 | 16,537 | 14,151 | 9415 |
Figure 2Ensemble models of A. americanum occurrence within Florida using standard survey (A), and A. americanum occurrence using convenience sampling (B). Ensemble models of I. scapularis occurrence within Florida using standard survey (C) and I. scapularis using convenience sampling (D). There was only slight agreement of ensemble maps, as measured by Kappa, for A. americanum, (Kappa = 0.016; Maps (A,B)) and for I scapularis (Kappa = 0.012; Maps (C,D)).
A. americanum SDM agreement between standard and convenience samples measured in sq km. Bolded cells are regions where the numbers of SDMs for standard and convenience surveys were in agreement. Off main diagonal elements showed discordance in classification based on survey strategy.
| Agreement | Standard Survey | |||||||
|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | 5 | TOTAL | ||
|
|
| 1339.649 | 5594.45 | 2437.46 | 2532.10 | 4868.60 | 62,013.95 | |
|
| 20,311.57 |
| 3606.80 | 1890.10 | 2451.88 | 3078.75 | 40,237.86 | |
| Convenience |
| 6249.33 | 3184.98 |
| 1010.98 | 1517.70 | 1862.10 | 15,313.23 |
| Survey |
| 4534.92 | 2758.37 | 1235.27 |
| 1232.57 | 1455.91 | 12,195.26 |
|
| 2350.61 | 1608.98 | 1028.64 | 748.37 |
| 935.63 | 7579.23 | |
|
| 1688.32 | 1277.32 | 1111.63 | 1644.53 | 1743.12 |
| 9415.09 | |
| TOTAL | 68,319.60 | 31,124.90 | 14,064.93 | 8709.66 | 10,384.37 | 14,151.16 | 146,754.62 | |
I. scapularis SDM agreement between standard and convenience samples measured in sq km. Bold cells are regions where the numbers of SDMs for standard and convenience samples were in agreement. Off main diagonal elements showed discordance in classification based on survey strategy.
| Agreement | Standard Survey | |||||||
|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | 5 | TOTAL | ||
| 0 |
| 10,627.14 | 6931.97 | 5068.29 | 4267.06 | 4264.80 | 61,104.09 | |
| 1 | 8927.07 |
| 2854.61 | 1456.21 | 1716.07 | 2082.93 | 21,441.03 | |
| Convenience | 2 | 7788.58 | 3826.18 |
| 1522.36 | 1583.34 | 2166.29 | 19,153.89 |
| Survey | 3 | 6195.48 | 4259.18 | 2184.84 |
| 1372.35 | 2038.91 | 17,216.15 |
| 4 | 2634.33 | 2888.80 | 1771.81 | 1140.65 |
| 1481.51 | 11,302.56 | |
| 5 | 2522.66 | 3536.64 | 2909.03 | 2826.40 | 2954.81 |
| 16,536.90 | |
| TOTAL | 58,012.95 | 29,542.08 | 18,919.40 | 13,179.30 | 13,279.09 | 13,821.80 | 146,754.62 | |