| Literature DB >> 32536343 |
Jonas Björk1,2, Anton Nilsson3,4, Carl Bonander5, Ulf Strömberg5,6.
Abstract
BACKGROUND: Selection and selection bias are terms that lack consistent definitions and have varying meaning and usage across disciplines. There is also confusion in current definitions between underlying mechanisms that lead to selection and their consequences. Consequences of selection on study validity must be judged on a case-by-case basis depending on research question, study design and analytical decisions. The overall aim of the study was to develop a simple but general framework for classifying various types of selection processes of relevance for epidemiological research.Entities:
Keywords: Bias; Epidemiologic factors; Population characteristics; Public health; Selection bias
Mesh:
Year: 2020 PMID: 32536343 PMCID: PMC7294651 DOI: 10.1186/s12874-020-01015-w
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Fig. 1Selection in exposure vs. selection in population composition
Fig. 2Fundamental selection mechanisms occurring at exposure entry at the population-level, illustrated by causal diagrams with subscripts of the nodes indicating time. a. Unifactorial selection in exposure (E) b. Unifactorial selection in population composition (boxed S) c. Multifactorial selection in exposure (E) d. Multifactorial selection in population composition (boxed S, induced inverse association is marked with a dashed line)
Data from a hypothetical cohort study with multifactorial selection in exposure (E) originating from C (known source) and U (unknown source; Fig. 2C). The relative effect of E on the disease outcome D is constant across strata of C and U (RR = 2.0). U influences the effect of C on D (RR = 1.5 when U = 0 but 4.0 when U = 1). Lack of adjustment for U therefore leads to bias in the estimated exposure effect also when C is adjusted for. Stratification for C may lead to false conclusions regarding the heterogeneity in the exposure effect
| Risk | RR | RR | |||||
|---|---|---|---|---|---|---|---|
| 0 | 0 | 20 | 80 | 20% | |||
| 90 | 810 | 10% | 2.0 | ||||
| 1 | 90 | 210 | 30% | 1.5 | |||
| 105 | 595 | 15% | 2.0 | 1.5 | |||
| 1 | 0 | 40 | 160 | 20% | |||
| 80 | 720 | 10% | 2.0 | ||||
| 1 | 400 | 100 | 80% | 4.0 | |||
| 200 | 300 | 40% | 2.0 | 4.0 | |||
| Totals | 550 | 550 | 50% | ||||
| 475 | 2425 | 16% | 2.32 | ||||
| 0 | 60 | 240 | 20% | ||||
| 170 | 1530 | 10% | 2.0 | ||||
| 1 | 490 | 310 | 61% | ||||
| 305 | 895 | 25% | 2.41 |
1 Relative risk among exposed (E = 1) vs. unexposed (E = 0) across strata of C and U, overall and across strata of C
2 Relative risk among C-positive (C = 1) vs. C-negative (C = 0) across strata of E and U
Fig. 3Selection during exposure at the population-level, illustrated by causal diagrams with subscripts of the nodes indicating time. a. A known determinant (C) does not influence exposure initiation (E) but exposure continuation (E) b.E causes a side effect (A) that influences E. c.E and an unknown determinant (U) cause an index or competing event (boxed S1) that precludes the disease outcome (D) d.E and U cause an early disease event (boxed D) that leads to depletion of susceptibles that precludes later disease events (D) to occur. Induced inverse associations are marked with dashed lines
Fig. 4Selection post-outcome at the population-level, illustrated by causal diagrams with subscripts of the nodes indicating time. a. Selection in exposure (E) caused by disease outcome (D) b. Selection in population composition (boxed S) caused by disease outcome (D) and exposure (E). Induced inverse associations are marked with dashed lines
Commonly described selection processes at the population-level in the epidemiological literature classified according to the proposed framework
| Level | Type of mechanism | Timing of the selection | ||
|---|---|---|---|---|
| (1) | (1.1) | (1.1.1) Self-selection Confounding by indication Healthy worker hire effect | (1.1.2) Side effects causing exposure changes Healthy worker survivor effect | (1.1.3) Subclinical symptoms causing exposure changes |
| (1.2) | (1.2.1) Index event Self-selection | (1.2.2) Competing event Depletion of susceptibles | (1.2.3) Berkson’s fallacy | |
| (2) | (2.1) | (2.1.1) Self-selection | (2.1.2) Non-compliance1 | (2.1.3) Non-compliance1 |
| (2.2) Selection in population composition | (2.2.1) Restriction of source population Study base definition Self-selection Healthy volunteer effect | (2.2.2) Loss to follow-up | (2.2.3) Loss to follow-up Selective participation in a case-control study |
1Non-compliance with treatment (exposure) administration specified in the study protocol