| Literature DB >> 34392488 |
Chava L Ramspek1, Ewout W Steyerberg2, Richard D Riley3, Frits R Rosendaal4, Olaf M Dekkers4, Friedo W Dekker4, Merel van Diepen4.
Abstract
Etiological research aims to uncover causal effects, whilst prediction research aims to forecast an outcome with the best accuracy. Causal and prediction research usually require different methods, and yet their findings may get conflated when reported and interpreted. The aim of the current study is to quantify the frequency of conflation between etiological and prediction research, to discuss common underlying mistakes and provide recommendations on how to avoid these. Observational cohort studies published in January 2018 in the top-ranked journals of six distinct medical fields (Cardiology, Clinical Epidemiology, Clinical Neurology, General and Internal Medicine, Nephrology and Surgery) were included for the current scoping review. Data on conflation was extracted through signaling questions. In total, 180 studies were included. Overall, 26% (n = 46) contained conflation between etiology and prediction. The frequency of conflation varied across medical field and journal impact factor. From the causal studies 22% was conflated, mainly due to the selection of covariates based on their ability to predict without taking the causal structure into account. Within prediction studies 38% was conflated, the most frequent reason was a causal interpretation of covariates included in a prediction model. Conflation of etiology and prediction is a common methodological error in observational medical research and more frequent in prediction studies. As this may lead to biased estimations and erroneous conclusions, researchers must be careful when designing, interpreting and disseminating their research to ensure this conflation is avoided.Entities:
Keywords: Causality; Epidemiology; Etiology; Methodology; Prediction; Scoping review
Mesh:
Year: 2021 PMID: 34392488 PMCID: PMC8502741 DOI: 10.1007/s10654-021-00794-w
Source DB: PubMed Journal: Eur J Epidemiol ISSN: 0393-2990 Impact factor: 8.082
: Etiology versus prediction key characteristics
| Etiology | Prediction | |
|---|---|---|
| Research question | Objective is to find a causal relation between exposure(s) and outcome(s) | Objective is to predict or diagnose outcome or improve prediction of an outcome in individuals |
| Statistical approach | Controls for confounding or mediation analysis, using knowledge and assumptions of causal structure and pathways | Develop and/or validate a multivariable model that contains variables (predictors) based on their ability to predict/diagnose the outcome and usability in practice |
| Presentation of results | Relative risk or risk difference given the exposure, minimizing bias | Measures of the predictive or diagnostic performance of the multivariable model (e.g. discrimination and calibration) |
| Discussion and interpretation of results | Causal interpretation and/or recognition limitations that preclude causal inference | Proposed use of the prediction model, for example for risk stratification or prediction of prognosis/diagnosis on an individual level, and/or limitations that preclude use (e.g. poor calibration, need for further validation) |
Three examples of observational studies included in this review are shown
| Example 1 [ | Example 2 [ | Example 3 [ | |
|---|---|---|---|
| Research question | |||
| Assessment | The objective seems to be causal | The objective is fitting for a predictor finding study | The objective is to develop a risk score to improve prediction of an outcome |
| Statistical approach | |||
| Assessment | Co-variates selected based on ability to predict the outcome, fitting for a prediction study | Co-variates selected based on ability to predict the outcome, fitting for a prediction study | Co-variates selected based on ability to predict the outcome, fitting for a prediction study |
| Presentation of results | |||
| Assessment | Effect measure given, inviting etiological interpretation | Unclear; the current wording could be found in both a prediction or etiological study | Risk score performance measure given, fitting for prediction study |
| Discussion and interpretation of results | |||
| Assessment | Etiological interpretation. Authors intended to correct for all measured confounders | Etiological interpretation by identifying modifiable risk factors and considering residual confounding | Both predictive and etiological interpretation. The potential causal mechanism between each predictor and the outcome is discussed |
| Overall assessment | Conflated: Mainly etiological study with conflation in methods by selecting ‘confounders’ based only on their predictive ability (p-values) | Conflated: Mainly prediction study, yet a causal conclusion is made from a data-driven predictive model and residual confounding is mentioned as limitation | Conflated: Prediction model development study with causal interpretation of predictors |
Each of these studies contains conflation and quotes from each domain of the article are given that exemplify this. The table should be read vertically
Some quotes are paraphrased slightly to shorten them and abbreviations are written full-out
Fig. 1Number of studies included by medical field and assessment; etiological, prediction, both (dual) or conflated
Fig. 2conflation proportion by journal impact factor and medical field. Each bubble represents 1 included journal, the size of the bubble corresponds to the number of articles included from this journal. Spearman’s correlation coefficient between conflation and impact factor is − 0.13 (p value 0.08)
Characteristics of included studies
| Total N = 180 | Not conflated N = 134 | Conflated = 46 | |
|---|---|---|---|
| Population | |||
| General population | 56 (31%) | 51 (38%) | 5 (11%) |
| Primary care | 12 (7%) | 10 (8%) | 2 (4%) |
| Secondary/tertiary care | 112 (62%) | 73 (55%) | 39 (85%) |
| Median impact factor (IQR) | 9.5 (7.3–12.3) | 10.7 (7.3–14.4) | 8.6 (7.2–11.8) |
| Affiliated epidemiology department | 68 (38%) | 58 (43%) | 10 (22%) |
| Affiliated statistics department | 46 (26%) | 33 (25%) | 13 (28%) |
| Adherence to reporting guideline | 11 (6%) | 10 (8%) | 1 (2%) |
Fig. 3Types of conflation. Type A = 25 studies, type B = 8 studies, type C = 2 studies, type D = 14 studies, type E = 5 studies, type F = 3 studies
Recommendations on how to avoid conflation
| Recommendations for researchers | |
| 1 | Clearly define the research question and consider whether the aim is causal, predictive, diagnostic or descriptive |
| 2 | Be mindful of frequent mistakes that cause conflation between etiology & prediction and distinguish between the two by using appropriate terminology |
| 3 | Consult methodological experts as well as reporting and methodological guidelines (e.g. STROBE, TRIPOD, STARD, REMARK) |
| Recommendations for universities, journals & policy-makers | |
| 1 | Work on improving education on prediction research and the distinction between prediction & etiology, including the promotion of distinct terminology for prediction and etiological research |
| 2 | Promote the use of reporting and methodological guidelines |
| 3 | Include methodological expert as peer-reviewers and/or editors |