| Literature DB >> 22618804 |
Michelle Xia1, Paul Gustafson.
Abstract
When estimating the prevalence of a binary trait in a population, the presence of a hidden sub-population that cannot be sampled will lead to nonidentifiability and potentially biased estimation. We propose a Bayesian model of trait prevalence for a weighted sample from the non-hidden portion of the population, by modeling the relationship between prevalence and sampling probability. We studied the behavior of the posterior distribution on population prevalence, with the large-sample limits of posterior distributions obtained in simple analytical forms that give intuitively expected properties. We performed MCMC simulations on finite samples to evaluate the effectiveness of statistical learning. We applied the model and the results to two illustrative datasets arising from weighted sampling. Our work confirms that sensible results can be obtained using Bayesian analysis, despite the nonidentifiability in this situation.Mesh:
Year: 2012 PMID: 22618804 DOI: 10.1002/sim.5374
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373