| Literature DB >> 31827836 |
Tad A Dallas1,2, Colin J Carlson3, Timothée Poisot4.
Abstract
Predicting disease emergence and outbreak events is a critical task for public health professionals and epidemiologists. Advances in global disease surveillance are increasingly generating datasets that are worth more than their component parts for prediction-oriented work. Here, we use a trait-free approach which leverages information on the global community of human infectious diseases to predict the biogeography of pathogens through time. Our approach takes pairwise dissimilarities between countries' pathogen communities and pathogens' geographical distributions and uses these to predict country-pathogen associations. We compare the success rates of our model for predicting pathogen outbreak, emergence and re-emergence potential as a function of time (e.g. number of years between training and prediction), pathogen type (e.g. virus) and transmission mode (e.g. vector-borne). With only these simple predictors, our model successfully predicts basic network structure up to a decade into the future. We find that while outbreak and re-emergence potential are especially well captured by our simple model, prediction of emergence events remains more elusive, and sudden global emergences like an influenza pandemic are beyond the predictive capacity of the model. However, these stochastic pandemic events are unlikely to be predictable from such coarse data. Together, our model is able to use the information on the existing country-pathogen network to predict pathogen outbreaks fairly well, suggesting the importance in considering information on co-occurring pathogens in a more global view even to estimate outbreak events in a single location or for a single pathogen.Entities:
Keywords: community dissimilarity; community ecology; disease forecasting; emerging infectious disease; pathogen biogeography
Year: 2019 PMID: 31827836 PMCID: PMC6894608 DOI: 10.1098/rsos.190883
Source DB: PubMed Journal: R Soc Open Sci ISSN: 2054-5703 Impact factor: 2.963
Figure 1.The dissimilarity-based model we used considers pathogens (indicated by symbol shape) distributed across a number of countries (a). Using these data, we build the interaction matrix (b) used to calculate pathogen community dissimilarity between two countries (C) and pathogen distributional dissimilarity (P). We then trained models on these two values, plus their product and the year of the sampling event. The model estimates the suitability for a pathogen to occur in a given country by comparing the density of dissimilarity values when the pathogen was present (green shaded region in c) relative to the dissimilarity density of all possible pairwise combinations of country and pathogen (grey density in c).
Figure 2.Pathogen events from previous years increased model predictive accuracy after an initial small decrease, suggesting that 5 years or more of data improves predictions, but accuracy could actually decrease in some data-sparse situations where only 2 or 3 years of data were available. Performance of the null expectation (grey line) was less than our approach, except when the null was given more than 15 years of previous data.
Figure 4.Using a rolling window (), we found that predictive accuracy did not increase as a result of enhanced surveillance and data collection of more recent years. The null expectation (grey line) performed consistently worse than our approach.
Figure 3.Predictive accuracy decreased when attempting to forecast far into the past or future. Models were trained on either the period between 2005 and 2015 (for prediction into the past) or 1990 and 2000 (for prediction into the future). The null expectation (grey line) performed consistently worse than our approach.