| Literature DB >> 30871502 |
Guillermo Prada-Ramallal1,2, Bahi Takkouche1,2,3, Adolfo Figueiras4,5,6.
Abstract
BACKGROUND: The availability of clinical and therapeutic data drawn from medical records and administrative databases has entailed new opportunities for clinical and epidemiologic research. However, these databases present inherent limitations which may render them prone to new biases. We aimed to conduct a structured review of biases specific to observational clinical studies based on secondary databases, and to propose strategies for the mitigation of those biases.Entities:
Keywords: Administrative claims; Bias; Confounding factors; Electronic health records; Medical record linkage; Medical records; Observational studies; Pharmacoepidemiology
Mesh:
Year: 2019 PMID: 30871502 PMCID: PMC6419460 DOI: 10.1186/s12874-019-0695-y
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Fig. 1Flow chart of the article selection process. * Subgroup 1: Its principal objective was to describe, compare, evaluate, validate or develop a bias-control strategy for a known bias or limitation. † Subgroup 2: Estimated a measurement or identified risk factors for a disease, with the existence of bias being mentioned as a limitation of the study, regardless of whether or not strategies for its control were used. ‡ Subgroup 3: Had characteristics different from those indicated above or was a conference paper with no abstract/full-text available
Fig. 2Publication timeline of the 117 articles included in the review (left Y axis) and the 863 references identified through the automated search (right Y axis) unadjusted and adjusted by the number of indexed citations added to MEDLINE
Fig. 3a Distribution of included articles across medical disciplines. b Timeline of included articles by most prevalent indexed disciplines
Articles that mention the most usual biases described in observational studies of pharmacoepidemiologic databases
| Category/Subcategory | Description of the bias | References ( | Percentage (%) |
|---|---|---|---|
| Confounding | The measure of association between treatment and outcome is distorted by the effect of one or more variables, which are also risk factors for the outcome of interest | [ | 63.2 |
| Confounding by indicationa | The clinical condition that determined the prescription of the treatment is associated with the effect, acting as a confounding factor (e.g. a worse disease status at baseline: confounding by disease severity) | [ | 32.5 |
| Time-dependent confounding | A variable that can vary with time acts as a confounding factor between the current exposure and outcome, and as an intermediary between prior and current exposure | [ | 6.0 |
| Unmeasured/residual confounding | There is not enough information about all the relevant confounding factors known, unknown or difficult to measure (e.g. frailty). If confounding cannot be completely controlled for, the residual confounding effect of some factors remains in the final effect that is observed | [ | 28.2 |
| Healthy user/adherer effect | Access to health care resources is associated with a higher level of education and health-seeking behavior. Furthermore, patients who comply with the treatment during prolonged periods of time tend to be healthier | [ | 5.1 |
| Selection bias | The study sample population is not representative of the target population to which the results will be extrapolated | [ | 47.0 |
| Protopathic bias | The treatment is associated with subclinical disease stages (an early manifestation of the still undiagnosed condition under study gives rise to prescription of the treatment) | [ | 3.4 |
| Losses to follow-up (informative censoring) | The mechanism that triggers discontinuity of the treatment is associated with the risk of observing the outcome of interest | [ | 2.6 |
| Depletion of susceptibles (prevalent user bias) | The inclusion of prevalent instead of incident users entails insufficient verification of the adverse effects that occur at the beginning of treatment (those susceptible to the adverse effect have interrupted the treatment) | [ | 10.3 |
| Missing data | In multivariate analyses, such as regression models, observations that lack one or more of the values of a variable included in the model tend to be eliminated | [ | 17.9 |
| Measurement bias | Data on true exposures, outcomes and other variables are recorded in the form of indicators (observed measures) that do not accurately reflect reality | [ | 46.2 |
| Misclassification bias | The association between treatment and outcome is distorted by systematic errors, due to the way in which the variables of interest are measured in comparison groups | [ | 43.6 |
| Misclassification of exposure | The measure of exposure of a given treatment is not an exact reflection of its real use (e.g. flawed measurement, non-compliance with treatment, inappropriate use of time windows) | [ | 23.9 |
| Misclassification of outcome | Error in the diagnosis (e.g. clinical ambiguity, non-uniform coding) | [ | 28.2 |
| Time-related bias | Follow-up time and exposure status are inadequately taken into account in the study-design or analysis stages | [ | 30.8 |
| Immortal time bias | A period of time (immortal) during which the study event cannot occur is included in the follow-up or is excluded from analysis due to an incorrect definition of the start of follow-up | [ | 25.6 |
| Immeasurable time bias | A period of time (immeasurable) during follow-up is ignored and thus misclassified as unexposed period, since outpatient prescriptions that define exposure cannot occur (e.g. serious chronic diseases that require extensive use of medications and multiple hospitalizations) | [ | 3.4 |
| Time-window bias | The use of time-windows of different lengths between cases and controls to define time-dependent exposures prevents subjects from having the same opportunity time to receive prescriptions | [ | 2.6 |
| Time-lag bias | Comparisons are conducted of treatments given at different stages of the disease, which inherently introduces bias related to disease duration and progression | [ | 0.9 |
aSometimes also referred to as channeling bias
Fig. 4Frequency of the biases mentioned in the included articles stratified by time periods
Main bias-control strategies in observational studies of pharmacoepidemiologic databases
| Category | Control strategies |
|---|---|
| Confounding | |
| Measured confounding | - Multivariate analysis |
| Time-dependent confounding | - G–estimation |
| Unmeasured confounding | - Crossover design |
| Selection bias | |
| Protopathic bias | - Restriction (e.g. restricting the untreated group to a population with the same indication, or restricting the treated group to a population with an indication that is not a subclinical stage of the disease) |
| Losses to follow-up (informative censoring) | - Inclusion of variables that affect censoring and event times in the multivariate regression model |
| Depletion of susceptibles (prevalent user bias) | - New-user design |
| Missing data | - Replacing each absent observation with a mean value based on observed values of the variable or the predicted value based on a regression model |
| Measurement bias | |
| Misclassification bias | - Validation study (exposure/outcome/confounders) + (sensitivity analysis/misclassification control techniques using multivariate regression) |
| Time-related bias | |
| Immortal time bias | - Data analysis with procedures that take into account time-dependent exposure in a cohort |
| Immeasurable time bias | - Data analysis accounting for the time-varying exposable period |
| Time-window bias | - Accounting for duration of treatment in the selection of controls |
| Time-lag bias | - Comparing patients at the same stage of disease |