| Literature DB >> 32133725 |
Kitty J Jager1, Giovanni Tripepi2, Nicholas C Chesnaye1, Friedo W Dekker3, Carmine Zoccali2, Vianda S Stel1.
Abstract
Study quality depends on a number of factors, one of them being internal validity. Such validity can be affected by random and systematic error, the latter also known as bias. Both make it more difficult to assess a correct frequency or the true relationship between exposure and outcome. Where random error can be addressed by increasing the sample size, a systematic error in the design, the conduct or the reporting of a study is more problematic. In this article, we will focus on bias, discuss different types of selection bias (sampling bias, confounding by indication, incidence-prevalence bias, attrition bias, collider stratification bias and publication bias) and information bias (recall bias, interviewer bias, observer bias and lead-time bias), indicate the type of studies where they most frequently occur and provide suggestions for their prevention.Entities:
Keywords: bias; epidemiologic methods; research design; research methodology
Mesh:
Year: 2020 PMID: 32133725 PMCID: PMC7318122 DOI: 10.1111/nep.13706
Source DB: PubMed Journal: Nephrology (Carlton) ISSN: 1320-5358 Impact factor: 2.506
Types of bias related to the study designs where they are most frequently occurring
| Type of bias | Study designs most at risk |
|---|---|
|
| |
| Sampling bias | All study designs (cross‐sectional studies and cohort studies) not using representative samples of the source population, especially those with low response rates (frequently occurring in surveys) |
| Confounding by indication | Non‐randomized intervention studies (cohort studies, case‐control studies, cross‐sectional studies) |
| Incidence‐prevalence bias | Study designs using prevalent patients (cross‐sectional studies, cohort studies not using incident patients, case‐control studies) |
| Attrition bias | Longitudinal studies (randomized controlled trials [RCTs], prospective cohort studies) |
| Collider stratification bias | Studies (especially cohort studies) investigating groups of patients selected on the basis of a collider (restriction) or adjusting for a collider |
| Publication bias | All study designs |
|
| |
| Recall bias | Study designs that use self‐reporting (case‐control studies and retrospective cohort studies) |
| Interviewer bias | All study designs making use of interviews, especially those where the interviewer has information on the outcome status of the respondent (unblinded case–control studies or retrospective cohort studies) |
| Observer bias | Study designs using measurements that are prone to subjectivity, especially those where the observer has information on the exposure status of the patient (unblinded RCTs or observational studies) |
| Lead‐time bias | Cohort studies comparing survival times between screened subjects and those diagnosed on the basis of symptoms; studies comparing survival between patients in different stages of disease |
Figure 1Incidence‐prevalence bias in assessing the mortality in patients diagnosed with severe emphysema in cohorts of incident and prevalent patients. The dark bars represent those who continued smoking after diagnosis and the light bars represent those who quit smoking after diagnosis. In the incident cohort, the observation period starts at diagnosis, whereas in the prevalent cohort it starts 1 year after diagnosis. D denotes death
Figure 2Directed acyclic graph showing how the obesity paradox in patients with end‐stage kidney disease (ESKD) can possibly be explained by collider stratification bias. Measured confounders affecting mortality may include covariates like age and sex. Unmeasured risk factors may include risk factors that can be considered as a common cause for ESKD and mortality (eg, genetic or lifestyle factors) and that often go unmeasured. Restriction to ESKD patients induces collider stratification bias (ESKD being the collider) by introducing a non‐causal association between obesity and the unmeasured risk factors. This non‐causal pathway distorts the obesity‐mortality relationship by introducing confounding by the unmeasured risk factors and may be responsible for the seemingly protective effect of obesity in ESKD
Figure 3Example of a funnel plot. The precision of each study is plotted against its effect estimate. Larger dots represent larger studies. The vertical line is drawn through the overall pooled estimate of effect to detect symmetry or asymmetry. In this plot the right lower side seems emptier which indicates that small studies are missing pointing to some degree of publication bias
Non‐differential and differential misclassification in a hypothetical case‐control study investigating the effect of the intake of high‐caloric beverages on the occurrence of cerebrovascular accidents (CVAs)
| Correct classification | CVA cases | Controls | Odds ratio |
|---|---|---|---|
| High intake | 250 | 100 | 5.0 |
| Low intake | 450 | 900 | Reference |
CVA cases: 20% of 450 with low intake (=90) were misclassified as high intake, whereas 20% of 250 with high intake (=50) were misclassified as low intake. This results in 250 + 90‐50 = 290 with high intake. In addition, 20% of 250 with high intake (=50) were misclassified as low intake, whereas 20% of 450 with low intake (=90) were misclassified as high intake. This results in 450 + 50−90 = 410 with low intake. CVA controls: 20% of 900 with low intake (=180) were misclassified as high intake, whereas 20% of 100 with high intake (=20) were misclassified as low intake. This results in 900−180 + 20 = 740 with low intake. In addition, 20% of 100 with high intake (=20) were misclassified as low intake, whereas 20% of 900 with low intake (=180) were misclassified as high intake. This results in 100−20 + 180 = 260 with high intake.
CVA controls: 50% of 100 with high intake (=50) were misclassified as low intake. This results in 100−50 = 50 with high intake and 900 + 50 = 950 with low intake.
Figure 4Lead‐time bias. Often diseases are diagnosed at the onset of symptoms; sometimes they are diagnosed earlier at screening before causing any symptoms. Should this patient have been screened and diagnosed with the disease in 2014 when still asymptomatic, whereas otherwise he would be diagnosed in 2017 at the onset of symptoms, his survival would have appeared to be 5 years instead of 2 years. The difference of 3 years is called ‘lead‐time’