| Literature DB >> 20052294 |
Adam Glynn1, Jon Wakefield, Mark S Handcock, Thomas S Richardson.
Abstract
In this paper, we illustrate that combining ecological data with subsample data in situations in which a linear model is appropriate provides three main benefits. First, by including the individual level subsample data, the biases associated with linear ecological inference can be eliminated. Second, by supplementing the subsample data with ecological data, the information about parameters will be increased. Third, we can use readily available ecological data to design optimal subsampling schemes, so as to further increase the information about parameters. We present an application of this methodology to the classic problem of estimating the effect of a college degree on wages. We show that combining ecological data with subsample data provides precise estimates of this value, and that optimal subsampling schemes (conditional on the ecological data) can provide good precision with only a fraction of the observations.Year: 2008 PMID: 20052294 PMCID: PMC2801082 DOI: 10.1111/j.1467-985X.2007.00511.x
Source DB: PubMed Journal: J R Stat Soc Ser A Stat Soc ISSN: 0964-1998 Impact factor: 2.483