| Literature DB >> 35537752 |
Chang Xu1,2,3, Tianqi Yu4, Luis Furuya-Kanamori5, Lifeng Lin6, Liliane Zorzela7, Xiaoqin Zhou8, Hanming Dai8, Yoon Loke9, Sunita Vohra10,11.
Abstract
OBJECTIVES: To investigate the validity of data extraction in systematic reviews of adverse events, the effect of data extraction errors on the results, and to develop a classification framework for data extraction errors to support further methodological research.Entities:
Mesh:
Year: 2022 PMID: 35537752 PMCID: PMC9086856 DOI: 10.1136/bmj-2021-069155
Source DB: PubMed Journal: BMJ ISSN: 0959-8138
Descriptions of the different types of errors during the data extraction
| Type of errors | Description | Real life example |
|---|---|---|
| Numerical error | Extracted numerical values were incorrect, potentially due to typo, calculation error, or extraction of data of another outcome. | The meta-analysis |
| Ambiguous error | Extracted data could not be reproduced from all available sources (unknown whether it is correct or not) owing to ambiguous definitions of the outcomes, while the review authors did not specify how the data were obtained or calculated. In some situations, the outcomes could not be found in the original study and related materials (eg, supplementary file, ClinicalTrials.gov). | Again, from the same meta-analysis, |
| Zero assumption error | This error was a special case of ambiguous error, and generally occurs in safety outcomes. The outcome was not reported in the original study and related materials (eg, supplementary file, ClinicalTrials.gov), while the review authors assumed that no event occurred. | The meta-analysis |
| Mismatching error | The extracted data were incorrectly matched to the intervention and exposure groups, but the numerical values were correct. This error could occur in any cells of the summarised table. | A meta-analysis |
| Misidentification* | Review authors did not correctly identify the eligibility of the included studies for a certain outcome, categorised into three situations: 1) study reported the outcome and related data but was not included in the meta-analysis of the outcome (this does not apply for double-zero studies as classical methods will exclude such studies by default, although other sophisticated methods can include them for meta-analysis); 2) study with the PICOS did not meet the defined criteria, but was included in the meta-analysis (theoretically, driven from situation 1); and 3) duplicated studies were included as different studies within the same outcome. | The meta-analysis |
PICOS=patient or population, intervention, comparison, and outcomes. RCT=randomised controlled trial.
This error is an identification error, and to be strict, not a data extraction error, but it results in errors for the final meta-analytical data and could affect the final pooled effect.
Fig 1Flowchart for selection of articles. RCT=randomised controlled trial
Fig 2Data extraction errors at the meta-analysis level. Bar plot is based on studies with data extraction errors (n=554). Error rate within a meta-analysis is calculated by the number of studies with data extraction errors against the total number of studies within a meta-analysis
Fig 3Data extraction errors at the systematic review level. Bar plot is based on studies with data extraction errors (n=171). Error rate within a systematic review is calculated by the number of meta-analyses with data extraction errors against the total number of meta-analyses within a systematic review
Fig 4Proportion of 1762 studies classified by five types of data extraction error
Fig 5Impact of data extraction errors on results
Examples of changes in the effects and significance when using corrected data
| Example | Original result (with error) | Corrected result (without error) | Difference* | |||||
|---|---|---|---|---|---|---|---|---|
| Effect (95% CI) | P value | Effect (95% CI) | P value | Relative effect (%) | P value of absolute data | |||
|
| ||||||||
| Increased effect (risk ratio) | 2.51 (1.21 to 5.22) | 0.01 | 3.19 (1.34 to 7.59) | 0.01 | 27.17 | 0 | ||
| Decreased effect (odds ratio) | 1.59 (0.63 to 4.02) | 0.33 | 1.17 (0.42 to 3.25) | 0.76 | −26.40 | 0.43 | ||
|
| ||||||||
| Increased effect (odds ratio) | 1.21 (1.05 to 1.40) | 0.01 | 3.24 (1.24 to 8.43) | 0.02 | 167.77 | 0.01 | ||
| Decreased effect (risk ratio) | 0.17 (0.13 to 0.22) | <0.01 | 0.09† (0.04 to 0.17) | <0.01 | −50.00 | 0 | ||
|
| ||||||||
| Benefit to harm (risk ratio) | 0.92 (0.78 to 1.08) | 0.32 | 1.11 (0.93 to 1.32) | 0.25 | 20.70 | −0.07 | ||
| Harm to benefit (odds ratio) | 1.14 (0.95 to 1.35) | 0.14 | 0.94 (0.83 to 1.06) | 0.29 | −17.50 | 0.15 | ||
|
| ||||||||
| Significance to non-significance (odds ratio) | 3.82 (1.27 to 11.45) | 0.02 | 0.93 (0.63 to 1.37) | 0.71 | −75.65 | 0.73 | ||
| Non-significance to significance (odds ratio) | 1.35 (0.75 to 2.44) | 0.32 | 1.65 (1.04 to 2.63) | 0.03 | 22.20 | −0.29 | ||
Difference in effects calculated by: (θcorrected − θoriginal) ÷ θoriginal × 100%, where θ is the estimated pooled effects. †0.09 has been rounded from 0.085.