| Literature DB >> 24529068 |
Paul G Shekelle1, Aneesa Motala, Breanne Johnsen, Sydne J Newberry.
Abstract
BACKGROUND: Systematic reviews are a cornerstone of evidence-based medicine but are useful only if up-to-date. Methods for detecting signals of when a systematic review needs updating have face validity, but no proposed method has had an assessment of predictive validity performed.Entities:
Mesh:
Year: 2014 PMID: 24529068 PMCID: PMC3937021 DOI: 10.1186/2046-4053-3-13
Source DB: PubMed Journal: Syst Rev ISSN: 2046-4053
Criteria for determining signals for updating
| Still valid | Original conclusion is still valid and this portion of the original report does not need updating. This conclusion was reached if we found no new evidence or only confirmatory evidence and all responding experts assessed the CER conclusion as still valid. |
| Possibly out of date | Original conclusion is possibly out of date and this portion of the original report may need updating. This conclusion was reached if we found some new evidence that might change the CER conclusion, and/or a minority of responding experts assessed the CER conclusion as having new evidence that might change the conclusion. |
| Probably out of date | Original conclusion is probably out of date and this portion of the original report may need updating. This conclusion was reached if we found substantial new evidence that might change the CER conclusion, and/or a majority of responding experts assessed the CER conclusion as having new evidence that might change the conclusion. |
| Out of date | Original conclusion is out of date. This conclusion was reached if we found new evidence that rendered the CER conclusion out of date or no longer applicable. Recognizing that our literature searches were limited, we reserved this category only for situations where a limited search would produce prima facie evidence that a conclusion was out of date, such as the withdrawal of a drug or surgical device from the market, a black box warning from FDA, etc. |
Comparative effectiveness reviews assessed
| High | Yes | |
| High | Yes | |
| High | Yes | |
| High | Yes | |
| Medium | Yes | |
| Medium | Yes | |
| Medium | Yes | |
| Low | Yes | |
| Low | Yes | |
| Medium | No | |
| Low | No |
aAHRQ, Agency for Healthcare Research and Quality; CER, comparative effectiveness review. bUpdate not commissioned pending publication of the PIVOT trial. cUpdate not commissioned or individual patient data meta-analysis had already been commissioned.
Examples of degree of concordance between 2009 prediction and updated conclusion
| Example 1 | |
| Original conclusion (from CER on analgesics for osteoarthritis) | No clear differences between various nonaspirin, nonselective NSAIDs or partially selective NSAIDs with regard to efficacy for pain relief or improvement |
| 2009 surveillance assessment [ | Conclusion still valid |
| Conclusion from 2011 CER update [ | No clear difference in efficacy for pain relief, or withdrawals due to lack of efficacy |
| Concordance | Good |
| Example 2 | |
| Original conclusion (from CER on analgesics for osteoarthritis) | Etoricoxib is associated with fewer gastrointestinal adverse events than nonselective NSAIDs |
| 2009 surveillance assessment [ | Possibly out-of-date |
| Conclusion from 2011 CER update [ | No comparable conclusion, as etoricoxib was not included because it did not gain FDA approval for sale in the United States |
| Concordance | Good |
| Example 3 | |
| Original conclusion (from CER on second-generation antidepressants) | Overall discontinuation rates did not differ significantly between SSRIs as a class and bupropion, mirtazapine, nefazodone, trazodone and venlafaxine. In the case of venlafaxine compared with SSRIs, higher discontinuation rates due to adverse events appeared to be balanced by lower discontinuation rates due to lack of efficacy. |
| 2009 surveillance assessment [ | Conclusion is possibly out-of-date, and this portion may need updating based on new analysis showing lower dropout rate with escitalopram. |
| Conclusion from 2011 CER update [ | Meta-analyses of numerous efficacy trials indicate that overall discontinuation rates are similar. Duloxetine and venlafaxine have a higher rate of discontinuations due to adverse events than SSRIs as a class. Venlafaxine has a lower rate of discontinuations due to lack of efficacy than SSRIs as a class. |
| Concordance | Fair: Escitalopram data did not end up in the conclusions |
| Example 4 | |
| Original conclusion from CER on second-generation antidepressants | Three head-to-head RCTs suggest that no substantial differences exist between fluoxetine and sertraline, fluvoxamine and sertraline, and trazodone and venlafaxine regarding relapse. Twenty-one placebo-controlled trials support the general efficacy and effectiveness of most second-generation antidepressants for preventing relapse or recurrence. No evidence exists for duloxetine. |
| 2009 surveillance assessment [ | Conclusion is possibly out-of-date, and this portion of the CER may need updating to include evidence for duloxetine. |
| Conclusion from 2011 CER update [ | On the basis of results of six efficacy trials and one naturalistic study, no significant differences exist between escitalopram and desvenlafaxine, escitalopram and paroxetine, fluoxetine and sertraline, fluoxetine and venlafaxine, fluvoxamine and sertraline, and trazodone and venlafaxine for preventing relapse or recurrence. |
| Concordance | Fair: No duloxetine evidence ended up being included with regard to this key question |
| Example 5 | |
| Original conclusion (from CER on management of GERD) | Medical therapy with PPIs and surgery (fundoplication) appeared to be similarly effective for improving symptoms and decreasing esophageal acid exposure. |
| 2009 surveillance assessment [ | Conclusion is still valid, and this portion of the CER does not need updating. |
| Conclusion from 2011 CER update [ | The 2005 CER concluded that medical therapy with PPIs and antireflux surgery were similarly effective in improving GERD-related symptoms and decreasing esophageal acid exposure, although some surgical patients required ongoing medical therapy postprocedure. With the addition of long-term follow-up data (7 to 12 years) from two previously reviewed studies and results from two new RCTs, our updated review found that patients who underwent antireflux surgery experienced a greater improvement in heartburn and regurgitation at follow-up than did patients who received medical treatment alone. |
| Concordance | Poor: Update indicates symptoms are better with surgery |
aCER, comparative effectiveness review; FDA, US Food and Drug Administration; GERD, gastroesophageal reflux disease; NSAID, nonsteroidal anti-inflammatory drug; PPI, proton pump inhibitor; RCT, randomized controlled trial; SSRI, selective serotonin reuptake inhibitor.
Summary of concordance of predicted and actual conclusions across nine comparative effectiveness reviews
| Still valid | 83 | 1 | 1 | 85 |
| Possibly out-of-date | 11 | 16 | 0 | 27 |
| Probably out-of-date | 7 | 0 | 0 | 7 |
| Out-of-date | 4 | 4 | 0 | 8 |
| Total | 105 | 21 | 1 | 127 |
aCER, comparative effectiveness review. Not applicable/no matching conclusions/new conclusions = 22.
Comparison of predicted vs. actual priority for updating
| High | August 2010 | High | Some procedures specifically mentioned in the Executive Summary have been withdrawn from the market. New procedures have been introduced. There is a major change in the conclusion about surgery vs. medical therapy. | |
| High | September 2010 | Medium | The new data did not change the overall conclusions very much. The conclusion that MRI and ultrasound may be sufficient to evaluate lesions in women at low risk may be an important new conclusion. | |
| High | April 2012 | High | Major safety concerns leading to substantial changes in black box warnings and practice guidelines | |
| High | January 2011 | High | The updated Executive Summary specifically mentions a number of drugs that have been withdrawn because of safety concerns. | |
| Medium | May 2011 | Medium | There are many new off-label indications and data on effectiveness, but these do not indicate strong effects of these drugs. | |
| Medium | February 2011 | High | New, expensive biologic DMARDs feature prominently in the Executive Summary of the updated report. | |
| Medium | March 2011 | Medium | There are two new drugs: zoledronic acid and denosumab. However, there is no evidence that they are any more effective than existing drugs. There are signals of serious but rare new side effects, in particular subtrochanteric fractures of the hip, but they are not sufficient to change the initial decision to recommend antiresorptive therapy for women with osteoporosis. | |
| Low | January 2011 | Low | No substantive changes in conclusions | |
| Low | December 2010 | Low | No substantive changes in conclusions |
aDMARD, disease-modifying antirheumatic drug; MRI, magnetic resonance imaging.
Predictive validity of priority for updating a systematic review (2009 predicted priority)
| | ||||
|---|---|---|---|---|
| High- 2009 prediction | 3 | 1 | 0 | 4 |
| Medium- 2009 prediction | 1 | 2 | 0 | 3 |
| Low- 2009 prediction | 0 | 0 | 2 | 2 |
| 4 | 3 | 2 | 9 | |
aκ = 0.74.