| Literature DB >> 26576106 |
Kyung Won Kim1, Juneyoung Lee2, Sang Hyun Choi1, Jimi Huh1, Seong Ho Park1.
Abstract
In the field of diagnostic test accuracy (DTA), the use of systematic review and meta-analyses is steadily increasing. By means of objective evaluation of all available primary studies, these two processes generate an evidence-based systematic summary regarding a specific research topic. The methodology for systematic review and meta-analysis in DTA studies differs from that in therapeutic/interventional studies, and its content is still evolving. Here we review the overall process from a practical standpoint, which may serve as a reference for those who implement these methods.Entities:
Keywords: Diagnostic test accuracy; Meta-analysis; Systematic review
Mesh:
Year: 2015 PMID: 26576106 PMCID: PMC4644738 DOI: 10.3348/kjr.2015.16.6.1175
Source DB: PubMed Journal: Korean J Radiol ISSN: 1229-6929 Impact factor: 3.500
Six Steps for Systematic Review and Meta-Analysis
| Steps | Process |
|---|---|
| 1st | |
| The problems to be addressed by the review should be specified in the form of clear, unambiguous, and structured questions before beginning the review. Once the review questions have been set, modifications to the protocol should be allowed only if alternative ways of defining the populations, interventions, outcomes or study designs become apparent. | |
| 2nd | |
| The search for studies should be extensive. Multiple resources (both computerized and printed) should be searched without language restrictions. The study selection criteria should flow directly from the review questions and be specified a priori. Reasons for inclusion and exclusion should be recorded. | |
| 3rd | |
| Study quality assessment is relevant to every step of a review. Question formulation and study selection criteria should describe the minimumally acceptable level of design. Selected studies should be subjected to a more refined quality assessment by the use of general critical appraisal guides and design-based quality checklists. These detailed quality assessments will be used for exploring the heterogeneity and informing decisions regarding the suitability of meta-analysis. In addition, they help in assessing the strength of inferences and making recommendations for future research. | |
| 4th | |
| Data used by systematic reviews are the results of individual studies, and which are collected with the aid of a data management tool. Data should be extracted using a standardized form in order to ensure that all relevant data is collected, to minimize the risk of transcription errors, and to allow the accuracy of the data to be checked. | |
| 5th | |
| Data synthesis consists of tabulation of study characteristics, quality, and effects as well as the use of statistical methods for exploring the differences between studies and combining their effects (meta-analysis). Exploration of the heterogeneity and its sources should be planned in advance. If an overall meta-analysis cannot be done, subgroup meta-analysis may be feasible. | |
| 6th | |
| The issues highlighted in each of the four steps above should be addressed. The risk of publication bias and related biases should be assessed. Exploration for heterogeneity should help to determine whether the overall summary can be trusted, and, if not, the effects observed in high-quality studies should be used for generating inferences. Any recommendations should be graded by reference to the strengths and weaknesses of the evidence. |
PICO Format Structured Research Question
| Frame | Meaning | Example of Search Terms*: |
|---|---|---|
| P (patients/participants/ population) | Specific patients/population to be investigated | #1. (liver OR hepatocellular) AND (cancer OR carcinoma) |
| I (index tests/intervention) | Index tests or intervention being evaluated | #2. magnetic resonance imaging |
| C (comparator/reference tests) | Tests or intervention to be compared | #3. computed tomography |
| O (outcome) | Outcome of interest such as diagnostic accuracy and therapeutic effect | #4. diagnosis OR sensitivity OR specificity OR receiver operating curve OR accuracy |
*Final search terms are #1 AND #2 AND #3 AND #4. CT = computed tomography, HCC = hepatocellular carcinoma, MRI = magnetic resonance imaging
Fig. 1Diagram of study process and frame of research questions.
DOR = diagnostic odds ratio, FN = false negative, FP = false positive, HCC = hepatocellular carcinoma, HSROC = hierarchical summary receiver operating characteristic, SROC = summary receiver operating characteristic, TN = true negative, TP = true positive
Fig. 2Process to select literature according to Preferred Reporting Items of Systematic Reviews and Meta-Analyses guideline.
Sources of Bias and Variations in Studies Evaluating Diagnostic Test Accuracy
| 1. Population-related bias/variations |
| 1) Spectrum effect (or sometimes called spectrum bias) |
| 2) Sample selection bias |
| 2. Bias related with reference standards |
| 1) Bias due to inappropriate reference standard |
| 2) Differential verification bias |
| 3) Partial verification bias |
| 4) Disease progression bias |
| 3. Interpretation-related bias |
| 1) Diagnostic review bias |
| 2) Test review bias |
| 3) Clinical review bias |
| 4) Incorporation bias |
| 4. Analysis-related bias |
| 1) Managing indeterminate results |
| 2) Arbitrary choice of the threshold value |
Detailed explanations are provided in Supplementary Material (in the online-only Data Supplement).
Fig. 3Templates for presenting results of QUADAS-2 assessment for assessing quality of studies.
We can present results of QUADAS-2 assessments in tabular form (A) or in graphics (B). QUADAS = Quality Assessment of Diagnostic Accuracy Studies
Fig. 4Description of reconstructing diagnostic 2-by-2 table.
FN = false negative, FP = false positive, TN = true negative, TP = true positive
Fig. 5Graphs used in meta-analysis of diagnostic test accuracy studies.
A. Coupled forest plots. B. Summary receiver operating curve (SROC) plot. Open circle (o) represents false positive rate (x-coordinate) and sensitivity (y-coordinate) of individual studies. Size of bubbles reflects precision of estimate. C. SROC curve. CI = confidence interval
Fig. 6Funnel plot to assess publication bias.
A. Funnel plot with Egger's regression line. Each dot represents primary study. X-axis shows study result (i.e., diagnostic odds ratio [DOR]) and y-axis represents study size (i.e., standard error of study result). Empty region, to be filled with results of relatively small studies with negative results or small effect size, makes plot asymmetrical. Asymmetry of plot would indicate that such studies may not ever have been published, thus raising possibility of publication bias being presented as review result. B. Deeks funnel plot. In Deeks funnel plot, x-axis is natural logarithm of DOR and y-axis is 1/√ effective sample size (ESS). According to Deeks et al. (35), it is preferred method for meta-analysis of diagnostic test accuracy studies owing to its high statistical power.
Sources of Bias and Variations in Studies Evaluating Diagnostic Test Accuracy (DTA)
| 1. Population-related bias/variations |
| 1) Spectrum effect (sometimes called spectrum bias): The phenomenon that the diagnostic test accuracy may vary in terms of demographic features, disease severity, disease prevalence, and clinical setting. If a study is carried out in a certain spectrum of patients that is not similar to the population in which the test will be used in clinical practice, the results of the study may have limited applicability. |
| 2) Sample selection bias: The selection process determines the composition of the study sample. For example, if an investigator excludes patients whose MRI image is of poor quality, the study would report an overestimated diagnostic accuracy of the MRI. |
| 2. Bias related to reference standards |
| 1) Bias due to inappropriate reference standard: The use of inaccurate reference standards leads to misclassification of the true disease status and makes the index test results inaccurate. This leads to biased (usually underestimated) results of the test accuracy. |
| 2) Differential verification bias: This occurs when part of the index test results is verified by a different reference standard. It is common in radiological studies. For example, if the index test result is positive for disease (e.g., probable liver metastasis seen on MRI), the reference standard of surgery will be performed. In contrast, if the index test is negative for disease (e.g., probable hemangioma rather than liver metastasis), the reference standard would be follow-up. |
| 3) Partial verification bias: This occurs when only a selected sample of patients is verified by the reference standard. For example, bias arises if patients with a positive result on the index test undergo the reference standard test. |
| 4) Disease progression bias: This occurs when there is a sufficient time delay between the application of the index test and of the reference standard, leading to a significant change in the target disease state. |
| 3. Interpretation-related bias |
| 1) Diagnostic review bias: This occurs when the results of the index test are known to reviewers who interpret the reference standard. |
| 2) Test review bias: This occurs when the results of the reference standard are known to reviewers who interpret the index test. |
| 3) Clinical review bias: This occurs when the reviewers use certain clinical information (e.g., symptoms, co-morbidities) to interpret the index test or reference standard. Clinical information may affect the estimates of DTA. |
| 4) Incorporation bias: This occurs when the result of the index test is used in establishing the final diagnosis (i.e., it forms part of the reference standard). It usually overestimates the test accuracy of index tests. |
| 4. Analysis-related bias |
| 1) Managing indeterminate results: A diagnostic test may produce indeterminate results (e.g., lesions that are too small to characterize). If these indeterminate results are removed from the analysis, biased assessment of the test accuracy occurs (generally overestimation of test accuracy). |
| 2) Arbitrary choice of the threshold value: If the choice of the threshold (i.e., cutoff) value for the index test maximizes its sensitivity and specificity, the test accuracy may be overestimated. The test may perform less well at the selected cutoff when evaluated in a new set of patients or population. |