| Literature DB >> 21088686 |
Jason Kung1, Francesco Chiappelli, Olivia O Cajulis, Raisa Avezova, George Kossan, Laura Chew, Carl A Maida.
Abstract
Research synthesis seeks to gather, examine and evaluate systematically research reports that converge toward answering a carefully crafted research question, which states the problem patient population, the intervention under consideration, and the clinical outcome of interest. The product of the process of systematically reviewing the research literature pertinent to the research question thusly stated is the "systematic review".The objective and transparent approach of the systematic review aims to minimize bias. Most systematic reviews yield quantitative analyses of measurable data (e.g., acceptable sampling analysis, meta-analysis). Systematic reviews may also be qualitative, while adhering to accepted standards for gathering, evaluating, and reporting evidence. Systematic reviews provide highly rated recommendations for evidence-based health care; but, systematic reviews are not equally reliable and successful in minimizing bias.Several instruments are available to evaluate the quality of systematic reviews. The 'assessment of multiple systematic reviews' (AMSTAR) was derived from factor analysis of the most relevant items among them. AMSTAR consists of eleven items with good face and content validity for measuring the methodological quality of systematic reviews, has been widely accepted and utilized, and has gained in reliability, reproducibility. AMSTAR does not produce quantifiable assessments of systematic review quality and clinical relevance. In this study, we have revised the AMSTAR instrument, detracting nothing from its content and construct validity, and utilizing the very criteria employed in the development of the original tool, with the aim of yielding an instrument that can quantify the quality of systematic reviews. We present validation data of the revised AMSTAR (R-AMSTAR), and discuss its implications and application in evidence-based health care.Entities:
Year: 2010 PMID: 21088686 PMCID: PMC2948145 DOI: 10.2174/1874210601004020084
Source DB: PubMed Journal: Open Dent J ISSN: 1874-2106
Criteria:
| (A) ‘a priori’ design |
| (B) statement of inclusion criteria |
| (C) PICO/PIPO research question (population, intervention, comparison, prediction, outcome) |
Criteria:
| (A) There should be at least two independent data extractors as stated or implied. | |
| (B) Statement of recognition or awareness of | |
| (C) Disagreements among extractors resolved properly as stated or implied |
Criteria:
| (A) At least two electronic sources should be searched. | |
| (B) The report must include years and databases used (e.g. Central, EMBASE, and MEDLINE). | |
| (C) Key words and/or MESH terms must be stated | |
| (D) In addition to the electronic databases (PubMed, EMBASE, Medline), all searches should be supplemented by consulting current contents, reviews, textbooks, specialized registers, or experts in the particular field of study, and by reviewing the references in the studies found. | |
| (E) Journals were “hand-searched” or “manual searched” (i.e. identifying highly relevant journals and conducting a manual, page-by-page search of their entire contents looking for potentially eligible studies) |
Criteria:
| (A) The authors should state that they searched for reports regardless of their publication type. | |
| (B) The authors should state whether or not they excluded any reports (from the systematic review), based on their publication status, language etc. | |
| (C) “Non-English papers were translated” or readers sufficiently trained in foreign language | |
| (D) No language restriction or recognition of non-English articles |
Criteria:
| (A) Table/list/or figure of included studies, a reference list does not suffice. | |
| (B) Table/list/figure of excluded studies[ | |
| (C) Author satisfactorily/sufficiently stated the reason for exclusion of the seriously considered studies. | |
| (D) Reader is able to retrace the included and the excluded studies anywhere in the article bibliography, reference, or supplemental source |
Criteria:
| (A) In an aggregated form such as a table, data from the original studies should be provided on the participants, interventions | |
| (B) Provide the ranges of | |
| (C) The information provided appears to be complete and accurate (i.e. there is a tolerable range of subjectivity here. Is the reader left wondering? If so, state the needed information and the reasoning). |
Criteria:
| (A) ‘A priori’ methods of assessment should be provided (e.g., for effectiveness studies if the author(s) chose to include only randomized, double-blind, placebo controlled studies, or allocation concealment as inclusion criteria); for other types of studies alternative items will be relevant. | |
| (B) The scientific quality of the included studies | |
| (C) Discussion/recognition/awareness of level of evidence | |
| (D) Quality of evidence should be rated/ranked based on characterized instruments. (Characterized instrument is a created instrument that ranks the level of evidence, e.g. GRADE [Grading of Recommendations Assessment, Development and Evaluation.]) |
Criteria:
| (A) The results of the methodological rigor and scientific quality should be considered in the analysis and the conclusions of the review | |
| (B) The results of the methodological rigor and scientific quality are | |
| (C) To have conclusions integrated/drives towards a clinical consensus statement | |
| (D) This clinical consensus statement drives toward revision or confirmation of clinical practice guidelines |
Criteria:
| (A) Statement of criteria that were used to decide that the studies analyzed were similar enough to be pooled? | |
| (B) For the pooled results, a test should be done to ensure the studies were combinable, to assess their homogeneity (i.e. Chi-squared test for homogeneity, I2). | |
| (C) Is there a recognition of heterogeneity or lack of thereof | |
| (D) If heterogeneity exists a “random effects model” should be used and/or the rationale (i.e. clinical appropriateness) of combining should be taken into consideration (i.e. is it sensible to combine?), or stated explicitly | |
| (E) If homogeneity exists, author should state a rationale or a statistical test |
Criteria:
| (A) Recognition of publication bias or file-drawer effect | |
| (B) An assessment of publication bias should include graphical aids (e.g., funnel plot, other available tests) | |
| (C) Statistical tests (e.g., Egger regression test). |
Criteria:
| (A) Statement of sources of support | |
| (B) No conflict of interest. This is subjective and may require some deduction or searching. | |
| (C) An awareness/statement of support or conflict of interest in the |
Average R-AMSTAR Scores Across 2 Independent Readers for the PTSD Literature
| Report | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 4.00 | 3.50 | 3.50 | 3.00 | 1.50 | 4.00 | 3.50 | 4.00 | 4.00 | 1.00 | 1.50 | |
| 2 | 4.00 | 4.00 | 3.50 | 2.50 | 1.50 | 3.00 | 2.50 | 4.00 | 4.00 | 2.50 | 1.00 | |
| 3 | 4.00 | 4.00 | 4.00 | 2.00 | 1.00 | 2.50 | 3.00 | 4.00 | 4.00 | 1.00 | 1.50 | |
| 4 | 4.00 | 2.50 | 4.00 | 4.00 | 4.00 | 4.00 | 4.00 | 4.00 | 3.50 | 4.00 | 3.50 | |
| 5 | 4.00 | 2.50 | 4.00 | 2.00 | 1.50 | 4.00 | 3.50 | 4.00 | 3.00 | 1.50 | 2.00 | |
| 6 | 4.00 | 1.00 | 3.50 | 2.00 | 2.00 | 2.50 | 3.00 | 2.50 | 2.00 | 1.50 | 1.50 | |
| 7 | 4.00 | 4.00 | 3.50 | 1.50 | 1.00 | 4.00 | 3.00 | 4.00 | 4.00 | 1.50 | 2.50 | |
| 8 | 4.00 | 1.50 | 3.50 | 1.50 | 1.50 | 4.00 | 2.50 | 4.00 | 2.50 | 1.50 | 3.00 | |
| 9 | 4.00 | 2.00 | 3.00 | 1.00 | 2.00 | 3.50 | 4.00 | 4.00 | 3.00 | 1.00 | 2.50 | |
| 10 | 4.00 | 1.00 | 3.50 | 2.00 | 1.00 | 4.00 | 3.00 | 4.00 | 3.00 | 1.50 | 2.00 | |
| 11 | 4.00 | 1.00 | 3.50 | 2.50 | 2.50 | 4.00 | 2.50 | 3.50 | 2.00 | 1.50 | 3.00 | |
| 31.45 | ||||||||||||
| 3.37 |
(p<0.0001, Friedman non-parametric ANOVA equivalent)
Average R-AMSTAR Scores Across 2 Independent Readers for the RA Literature
| Report | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 4.00 | 1.00 | 4.00 | 2.00 | 3.50 | 3.50 | 3.50 | 3.50 | 1.00 | 1.00 | 1.00 | |
| 2 | 3.50 | 2.50 | 4.00 | 4.00 | 3.50 | 4.00 | 3.50 | 2.50 | 3.50 | 1.50 | 1.00 | |
| 3 | 4.00 | 4.00 | 3.50 | 4.00 | 1.50 | 2.50 | 3.50 | 3.50 | 2.50 | 1.50 | 1.00 | |
| 4 | 4.00 | 2.00 | 4.00 | 4.00 | 2.00 | 4.00 | 3.50 | 3.00 | 3.50 | 1.00 | 1.00 | |
| 5 | 3.50 | 4.00 | 4.00 | 3.00 | 2.50 | 4.00 | 4.00 | 4.00 | 2.50 | 1.00 | 2.50 | |
(p=0.001, Friedman non-parametric ANOVA equivalent)
Systematic Review Ranking Based on R-AMSTAR Scores
| PICO | Paper | R-AMSTARa | %ile | Rankb |
|---|---|---|---|---|
| PTSD | 1 | 32.50 | 83 | B |
| 2 | 33.00 | 85 | B | |
| 3 | 32.50 | 83 | B | |
| 4 | 39.00 | 100 | A | |
| 5 | 32.00 | 82 | B | |
| 6 | 25.50 | 65 | D | |
| 7 | 33.00 | 85 | B | |
| 8 | 29.50 | 76 | C | |
| 9 | 30.00 | 77 | C | |
| 5 | 35.00 | 100 | A | |
| 11 | 30.00 | 77 | C | |
| RA | 1 | 28.00 | 80 | B |
| 2 | 33.50 | 96 | A | |
| 3 | 31.50 | 90 | A | |
| 4 | 32.00 | 91 | A |
aThe values listed in the table correspond to the total R-AMSTAR scores listed in Tables 1 & 2 respectively for the PTSD and the RA bibliome
bBased on the criteria of excellence of systematic reviews that resulted in the 11 domains examined by the AMSTAR, the overall score on the R-AMSTAR, which is revised only to the extent that it produces a quantification of the assessments of these domains, reveals the possibility to assign a grade of systematic review quality and clinical relevance, based on the criteria of the top percentile of the scores reflecting an A paper, and so on. The rankings are, for obvious reasons, relative strictly to the systematic reviews examined in response to the specific P.I.C.O. question, and thus pertain to a fixed (rather than random) interpretative model.