| Literature DB >> 26308381 |
Abstract
Statistical inference is commonly said to be inapplicable to complete population studies, such as censuses, due to the absence of sampling variability. Nevertheless, in recent years, studies of whole populations, e.g., all cases of a certain cancer in a given country, have become more common, and often report p values and confidence intervals regardless of such concerns. With reference to the social science literature, the current paper explores the circumstances under which statistical inference can be meaningful for such studies. It concludes that its use implicitly requires a target population which is wider than the whole population studied - for example future cases, or a supranational geographic region - and that the validity of such statistical analysis depends on the generalizability of the whole to the target population.Entities:
Year: 2015 PMID: 26308381 PMCID: PMC4549103 DOI: 10.1186/s12982-015-0029-4
Source DB: PubMed Journal: Emerg Themes Epidemiol ISSN: 1742-7622
Fig. 1The classical situation of a sample (small dashed circle) drawn from a population (large solid circle). The darker shading represents units with a characteristic of interest
Consider the following two examples of a binary outcome with complete population coverage
|
|
|
|
| In the election example, is it meaningful to estimate a sampling error for the proportion voting for candidate A? The answer seems to be clearly ‘no’. This is because the purpose of the election is to choose a president, which is done on the basis of the observed proportion of votes cast. Any kind of interval estimate serves no purpose because there is no generalizability beyond the election. |
| In the cancer registry example, is it meaningful to estimate a sampling error for the proportion of cases who are female? Some would say ‘no’ on the basis that it’s a complete population enumeration with no sampling error. Similar examples in the literature show that some authors would say ‘yes’. This implies an attempt to generalize beyond the population observed, but what is this wider target population? Conceivably future cases, or a wider, supranational geographical area, although often this is left unspecified. |
Fig. 2In a whole population study, the sample (dashed circle) has become so large as to coincide with the population (solid circle). The use of confidence intervals, p values, or similar probability statements implies a claim to generalizability to some group beyond the study population: namely the target population, represented by the area outside the two circles. As in Fig. 1, presence or absence of a characteristic is indicated by darker or lighter shading. Authors of whole population studies often do not try to delimit their generalizability. In the figure, such indeterminacy is represented by the target population fading away from the original population: we may not know to what spatial or temporal range the findings may be applicable. Similarly, we may not be able to judge whether the prevalence of the characteristic remains similar in the target population, rather than increasing or decreasing (as it does to the bottom left and top right, respectively, of the figure)
ᅟ
| Publication Year | Reference Number | Population | Assessment of generalizability or external validity |
|---|---|---|---|
| 2015 | [ | Western Australia | ‘We are confident about the generalisability of our analytic findings on associations with stimulant medication use Australia-wide’ |
| 2014 | [ | Western Australia | none |
| 2014 | [ | North East Scotland | ‘Our results could be generalisable to young children across the UK' |
| 2014 | [ | Western Australia | none |
| 2013 | [ | Western Australia | none |
| 2012 | [ | Western Australia | none |
| 2012 | [ | Western Australia | none |
| 2012 | [ | Iceland | none |
| 2011 | [ | Western Australia | none |
| 2010 | [ | Scotland | none |
| 2004 | [ | ‘Top End’ of the Northern Territory of Australia | none |
| 2001 | [ | Isle of Wight, United Kingdom | none |
| 1998 | [ | Isle of Wight, United Kingdom | none |