| Literature DB >> 35666334 |
Michelle V Evans1,2,3, John M Drake4,5.
Abstract
Many livestock diseases rely on wildlife for the transmission or maintenance of the pathogen, and the wildlife-livestock interface represents a potential site of disease emergence for novel pathogens in livestock. Predicting which pathogen species are most likely to emerge in the future is an important challenge for infectious disease surveillance and intelligence. We used a machine learning approach to conduct a data-driven horizon scan of bacterial associations at the wildlife-livestock interface for cows, sheep, and pigs. Our model identified and ranked from 76 to 189 potential novel bacterial species that might associate with each livestock species. Wildlife reservoirs of known and novel bacteria were shared among all three species, suggesting that targeting surveillance and/or control efforts towards these reservoirs could contribute disproportionately to reducing spillover risk to livestock. By predicting pathogen-host associations at the wildlife-livestock interface, we demonstrate one way to plan for and prevent disease emergence in livestock.Entities:
Keywords: bacteria; livestock; spillover; wildlife reservoirs
Mesh:
Year: 2022 PMID: 35666334 PMCID: PMC9168633 DOI: 10.1007/s10393-022-01599-3
Source DB: PubMed Journal: Ecohealth ISSN: 1612-9202 Impact factor: 4.464
Figure 1Number of known host-bacterium associations at the species level within each paired group of host and bacteria orders. Darker red colors represent more known associations of bacteria of that order in that mammal host order, on a natural log scale, and white represents no known associations.
Performance of Boosted Regression Tree Model on Training and Testing Datasets as Measured by Area Under the Receiving Operator Curve (AUC), True Skill Statistic (TSS), and Boyce Index (BI).
| Training | Testing | |
|---|---|---|
| AUC | 0.9985 | 0.9980 |
| True Skill Statistic | 0.9768 | 0.9598 |
| Boyce Index | 0.9980 | 0.9542 |
TSS and BI were calculated using a threshold of 0.5 to transform continuous predictions to binary values.
Figure 2Importance of top ten covariates, as measured by total gain in AUC due to tree splits including the covariate. Covariates are colored based on whether they were a trait of the bacteria species, host species, or unique to that host-bacterium pair.
Figure 3Partial-dependence plots of top six most important covariates. Rugs along the x-axis represent the distribution of each covariate in our training dataset.
Figure 4Number of known and novel bacterial associations of each bacteria order predicted by our model for B. taurus, O. aries, and S. scrofa domesticus. Colors correspond to novel (green) or known (gray) bacteria species and pie charts illustrate the proportion of known and novel bacteria for each of the three livestock species. Each set of bacteria was defined as those ranked above the lowest ranked known bacterial species for each livestock species (Color figure online).
Top Ten Wildlife Reservoirs of Known and Novel Bacteria Predicted to be Associated with Livestock Species, Ordered in Decreasing Order of Total Shared Bacteria Known to be Found in Each Wildlife Reservoir.
| Host | Novel | Known | Total | Host | Novel | Known | Total | Host | Novel | Known | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 8 | 25 | 33 | 11 | 28 | 39 | 18 | 14 | 32 | |||
| 7 | 24 | 31 | 4 | 30 | 34 | 12 | 16 | 28 | |||
| 5 | 25 | 30 | 8 | 25 | 33 | 10 | 18 | 28 | |||
| 1 | 27 | 28 | 5 | 21 | 26 | 14 | 12 | 26 | |||
| 5 | 23 | 28 | 3 | 21 | 24 | 9 | 16 | 25 | |||
| 3 | 17 | 20 | 5 | 17 | 22 | 9 | 12 | 21 | |||
| 1 | 18 | 19 | 1 | 20 | 21 | 13 | 7 | 20 | |||
| 0 | 18 | 18 | 2 | 18 | 20 | 10 | 10 | 20 | |||
| 3 | 14 | 17 | 2 | 17 | 19 | 6 | 11 | 17 | |||
| 0 | 16 | 16 | 4 | 14 | 18 | 10 | 6 | 16 | |||
Known bacteria species are those currently found in the livestock host and novel bacteria species are those bacteria species ranked above the lowest known positive for that livestock host by our model. All bacterial species are known to associate with that wildlife reservoir (e.g. are not based on model predictions).