| Literature DB >> 29991455 |
Daniel Magee1, Matthew Scotch2.
Abstract
Public health researchers are often tasked with accurately and quickly identifying the location and time when an epidemic originated from a representative sample of nucleotide sequences. In this paper, we investigate multiple approaches to subsampling the sequence set when employing a Bayesian phylogeographic generalized linear model. Our results indicate that near-categorical posterior MCC estimates on the root can be obtained with replicate runs using 25-50% of the sequence data, and that including 90% of sequences does not necessarily entail more accurate inferences. We present the first analysis of predictor signal suppression and show how the ability to detect the influence of predictor variables is limited when sample size predictors are included in the models.Entities:
Keywords: Phylogeography; Selection Bias; Viruses
Mesh:
Year: 2018 PMID: 29991455 PMCID: PMC6123251 DOI: 10.1016/j.meegid.2018.07.003
Source DB: PubMed Journal: Infect Genet Evol ISSN: 1567-1348 Impact factor: 3.342