Literature DB >> 29216903

Evaluation of clinical practice guidelines using the AGREE instrument: comparison between data obtained from AGREE I and AGREE II.

Kanako Seto1, Kunichika Matsumoto1, Takefumi Kitazawa1, Shigeru Fujita1, Shimpei Hanaoka1, Tomonori Hasegawa2.   

Abstract

OBJECTIVE: The Appraisal of Guidelines for Research and Evaluation (AGREE) is a representative, quantitative evaluation tool for evidence-based clinical practice guidelines (CPGs). Recently, AGREE was revised (AGREE II). The continuity of evaluation data obtained from the original version (AGREE I) has not yet been demonstrated. The present study investigated the relationship between data obtained from AGREE I and AGREE II to evaluate the continuity between the two measurement tools.
RESULTS: An evaluation team consisting of three trained librarians evaluated 68 CPGs issued in 2011-2012 in Japan using AGREE I and AGREE II. The correlation coefficients for the six domains were: (1) scope and purpose 0.758; (2) stakeholder involvement 0.708; (3) rigor of development 0.982; (4) clarity of presentation 0.702; (5) applicability 0.919; and (6) editorial independence 0.971. The item "Overall Guideline Assessment" was newly introduced in AGREE II. This global item had a correlation coefficient of 0.628 using the six AGREE I domains, and 0.685 using the 23 items. Our results suggest that data obtained from AGREE I can be transferred to AGREE II, and the "Overall Guideline Assessment" data can be determined with high reliability using a standardized score of the 23 items.

Entities:  

Keywords:  AGREE (Appraisal of Guidelines for Research and Evaluation) instrument; Clinical practice guidelines; Data mapping; Data transfer

Mesh:

Year:  2017        PMID: 29216903      PMCID: PMC5721454          DOI: 10.1186/s13104-017-3041-7

Source DB:  PubMed          Journal:  BMC Res Notes        ISSN: 1756-0500


Introduction

Clinical practice guidelines (CPGs) are “statements that include recommendations intended to optimize patient care that are informed by a systematic review of evidence and an assessment of the benefits and harms of alternative care options.” [1]. CPGs are a representative tool for standardizing medical interventions and improve healthcare quality. In Japan, CPG development, using evidence-based medicine (EBM), began in the late 1990s with government support. Currently, 30–40 CPGs are developed per year, mainly by academic societies. With the spread of CPGs in Japan, infrastructure to promote their use is also being developed. This includes clearing houses and standard manuals for developing CPGs. The Toho University Medical Media Center and the Medical Information Network Distribution Service Guideline Center of the Japan Council for Quality Health Care both operate CPG clearing houses [2, 3]. The Appraisal of Guidelines for Research and Evaluation (AGREE) instrument, developed by the AGREE Enterprise, is a quantitative method for evaluating CPGs. The AGREE instrument determines items that must be satisfied by CPGs, and is expected to facilitate cost effective CPG development and improve CPG quality [4]. In 2010, the original version (AGREE I) was revised and published as AGREE II [5-7]. Several studies evaluated CPGs using the AGREE I or AGREE II [8-10]. However, the continuity of the data obtained from AGREE I and AGREE II has not yet been demonstrated. The AGREE I was widely used and there is large amount of associated data; investigation of the continuity and conversion of data between AGREE I and II is necessary to make full use of AGREE I data. We investigated the continuity of AGREE I and AGREE II data, and the conversion method from AGREE I data to AGREE II data.

Main text

Methods

A team consisting of three experienced librarians evaluated 68 CPGs, based on EBM issued in 2011–2012 using the AGREE I [11] and AGREE II [12]. The evaluated CPGs were all issued in 2011–2012 in Japan. Their contents were checked and judged by expert librarians as to whether they were prepared using EBM methodology, or not. The librarians who evaluated the CPGs have knowledge about the CPG preparation and experience using the AGREE tool. The librarians conducted independent evaluations and did not adjust the result; the results were aggregated into standardized scores. Correlation coefficients were calculated for the domains and items of the two instruments. AGREE I comprised one overall assessment item and six domains: (1) scope and purpose; (2) stakeholder involvement; (3) rigor of development; (4) clarity of presentation; (5) applicability; (6) editorial independence, totaling 23 items. Each item is rated on a 4-point Likert scale (1 = “Strongly Disagree” to 4 = “Strongly Agree”). A standardized score for each domain was calculated according the formula shown below: For example, the scope and purpose domain consists of three items; the sum of the maximum possible score is 3 × 3 × 3 = 27, and the sum of the minimum possible score is 1 × 3 × 3 = 9 [11]. AGREE II is based on AGREE I, incorporating four distinct changes. First, the rating scale was changed from a 4-point to a 7-point Likert scale (1 = “Strongly Disagree” to 7 = “Strongly Agree”). Second, an item was added as a second overall guideline assessment item: “Rate the overall quality of this guideline”. Third, the wording or expression of several items was changed, although the meaning of the items was preserved. Finally, Q7 (AGREE I) “The guideline has been piloted among end users” was removed, and was incorporated in Q19 (AGREE II) “The guideline describes facilitators and barriers to its application” and a new item Q9 (AGREE II) “The strengths and limitations of the body of evidence are clearly described”. Therefore, Q7 (AGREE I) and Q9 (AGREE II) were excluded from analysis in the present study. A comparison of AGREE I and AGREE II items is shown in Table 1.
Table 1

Comparison between the AGREE I and AGREE II

AGREE IAGREE IIChange from AGREE I to AGREE II
DomainNoItemItemNoDomain
1. Scope and purpose1The overall objective(s) of the guideline is (are) specifically describedThe overall objective(s) of the guideline is (are) specifically described11. Scope and purposeNo change
2The clinical question(s) covered by the guideline is (are) specifically describedThe health question(s) covered by the guideline is (are) specifically described2Change in underline part
3The patients to whom the guideline is meant to apply are specifically describedThe population (patients, public, etc.) to whom the guideline is meant to apply is specifically described3Change in underline part
2. Stakeholder involvement4The guideline development group includes individuals from all the relevant professional groupsThe guideline development group includes individuals from all the relevant professional groups42. Stakeholder involvementNo change
5The patients’ views and preferences have been soughtThe views and preferences of the target population (patients, public, etc.) have been sought5Change in underline part
6The target users of the guideline are clearly defined. NoThe target users of the guideline are clearly defined. No6No change
7The guideline has been piloted among end usersDelete item. Incorporated into user guide description of item 19
3. Rigour of development8Systematic methods were used to search for evidenceSystematic methods were used to search for evidence73. Rigour of developmentNo change, renumber to 7
9The criteria for selecting the evidence are clearly describedThe criteria for selecting the evidence are clearly described8No change, renumber to 8
The strengths and limitations of the body of evidence are clearly described9New item
10The methods for formulating the recommendations are clearly describedThe methods for formulating the recommendations are clearly described10No change
11The health benefits, side effects, and risks have been considered in formulating the recommendationsThe health benefits, side effects, and risks have been considered in formulating the recommendations11No change
12There is an explicit link between the recommendations and the supporting evidenceThere is an explicit link between the recommendations and the supporting evidence12No change
13The guideline has been externally reviewed by experts prior to its publicationThe guideline has been externally reviewed by experts prior to its publication13No change
14A procedure for updating the guideline is providedA procedure for updating the guideline is provided14No change
4. Clarity of presentation15The recommendations are specific and unambiguousThe recommendations are specific and unambiguous154. Clarity of presentationNo change
16The different options for management of the condition are clearly presentedThe different options for management of the condition or health issue are clearly presented16Change in underline part
17Key recommendations are easily identifiableKey recommendations are easily identifiable17No change
18The guideline is supported with tools for applicationThe guideline describes facilitators and barriers to its application195. ApplicabilityChange in underline part, renumber to 19, change in domain from #4 clarity of presentation to #5 applicability
5. Applicability19The potential organizational barriers in applying the recommendations have been discussed The guideline provides advice and/or tools on how the recommendations can be put into practice 18Change in underline part, renumber to 18.
20The potential cost implications of applying the recommendations have been consideredThe potential resource implications of applying the recommendations have been considered20Change in underline part
21The guideline presents key review criteria for monitoring and/ or audit purposesThe guideline presents monitoring and/ or auditing criteria 21Change in underline part
6. Editorial independence22The guideline is editorially independent from the funding body The views of the funding body have not influenced the content of the guideline 226. Editorial independenceChange in underline part
23Conflicts of interest of guideline development members have been recorded Competing interests of guideline development group members have been recorded and addressed 23Change in underline part
Overall guideline assessmentRate the overall quality of this guideline1Overall guideline assessmentNew item
1I would recommend this guideline for useI would recommend this guideline for use2Renumber to 2
Comparison between the AGREE I and AGREE II As there was no item in AGREE I that corresponded with the new overall guideline assessment item in AGREE II, we attempted to calculate this value using two approaches. First, we calculated the average of the standardized score using results of the six AGREE I domains. Second, we calculated the standardized score using the results of the 23 AGREE I items. We examined the correlation between “Overall Guideline Assessment” in the AGREE II and the results of the two approaches described above. We used t-tests to compare standardized scores, and calculated correlation coefficients for each AGREE I and AGREE II item and domain. p values < 0.05 were indicated statistical significance. All analyses were performed using SPSS, version 20.0 (IBM SPSS Statistics for Windows, Version 20.0. Armonk, NY: IBM Corp.).

Results

The results of the AGREE I and AGREE II evaluations are shown in Fig. 1. Correlation coefficients are shown in Table 2. High correlations were observed in all domains: scope and purpose = 0.758; stakeholder involvement = 0.756; rigor of development = 0.992; clarity of presentation = 0.865; applicability = 0.938; and editorial independence = 0.938. The correlation coefficients of each item ranged from 0.708 to 0.982.
Fig. 1

Evaluation of clinical practice guidelines, published between 2011 and 2012, using the AGREE I and AGREE II (n = 68). t test; *p < 0.05, **p < 0.01

Table 2

Correlation between the AGREE I and AGREE II domains

AGREE IAGREE II
123456TotalAverage123456TotalOverall guideline assessment
AGREE I
 1. Scope and purpose1.0000.1440.366**0.162− 0.0770.1720.371**0.332**0.758**0.1970.365**0.100− 0.0510.1560.368**0.328**
 2. Stakeholder involvement1.0000.1210.352**0.106− 0.0590.302*0.308*0.1130.708**0.1070.2290.248*− 0.0790.2330.114
 3. Rigour of development1.0000.428**0.243*0.491**0.938**0.849**0.260*0.0130.982**0.483**0.263*0.491**0.943**0.736**
 4. Clarity of presentation1.0000.1220.0690.524**0.502**0.243*0.337**0.397**0.702**0.333**0.1060.497**0.339**
 5. Applicability1.0000.1460.423**0.505**− 0.0420.0070.1710.1060.919**0.1410.361**0.011
 6. Editorial independence1.0000.561**0.647**− 0.0080.1670.484**0.0790.1810.971**0.575**0.464**
 Total1.0000.968**0.268*0.278*0.905**0.524**0.473**0.561**0.978**0.685**
 Average1.0000.2110.302*0.810**0.468**0.552**0.646**0.937**0.628**
AGREE II
 1. Scope and purpose1.0000.1740.276*0.251*− 0.0040.0010.303*0.183
 2. Stakeholder involvement1.0000.1610.289**0.1290.1590.316**0.247*
 3. Rigour of development1.0000.501**0.1820.479**0.938**0.771**
 4. Clarity of presentation1.0000.1550.0920.570**0.492**
 5. Applicability1.0000.1890.403**0.033
 6. Editorial independence1.0000.575**0.445**
 Total1.0000.746**
 Overall guideline assessment1.000

* p < 0.05, ** p < 0.01

Evaluation of clinical practice guidelines, published between 2011 and 2012, using the AGREE I and AGREE II (n = 68). t test; *p < 0.05, **p < 0.01 Correlation between the AGREE I and AGREE II domains * p < 0.05, ** p < 0.01 Correlation coefficients for the 22 items ranged from 0.694 to 0.995; 16 items had a correlation coefficient of 0.9 or more, three items were 0.8–0.9, and three items were 0.6–0.8. A high overall correlation was observed for all items (Additional file 1: Table S1). The newly-introduced overall assessment item “Overall Guideline Assessment” (AGREE II) should be assessed based on AGREE I data. The six AGREE I domains had a correlation coefficient of 0.628, when 23 items were used it was 0.685, suggesting a higher related value could be gained using the latter (Table 2).

Discussion

Since its publication in 2003, the high popularity of the AGREE instrument has produced a large amount of evaluation data. With the revision of the AGREE instrument, the relationship between data obtained from AGREE I and AGREE II, and data conversion from the AGREE I to the AGREE II are high research agenda priorities for investigating time trend analyses of CPG quality. For the 68 CPGs issued in 2011–2012, our results demonstrated that AGREE I and AGREE II were highly correlated at both the domain and item levels, and the newly introduced overall rating item “Overall Guideline Assessment” could be calculated more precisely using the 23 AGREE I items, rather than domain-level data. Increasing attention is being directed to safety and quality issues, and CPGs based on EBM are a representative method for standardizing and improving the quality and safety of healthcare procedures. The AGREE instrument is widely used to measure CPG quality. Our results suggest that the AGREE instrument can still be used as a measurement tool, which exhibits high consistency, although it has now been revised (AGREE II). It enables long-term, comprehensive CPG evaluation. The Japanese government has promoted CPG preparation since 1996. Our study may help evaluate the underlying policy guidelines.

Conclusion

Data obtained from AGREE I can be transferred to the AGREE II, and the data for “Overall Guideline Assessment” can be calculated with high reliability using a standardized score of the 23 items.

Limitations

Our evaluation team did not include any researchers or clinicians. However, the expert librarians had extensive knowledge about CPG preparation and had experience evaluating CPGs using the AGREE measure.
  7 in total

1.  Development and validation of an international appraisal instrument for assessing the quality of clinical practice guidelines: the AGREE project.

Authors: 
Journal:  Qual Saf Health Care       Date:  2003-02

2.  International assessment of the quality of clinical practice guidelines in oncology using the Appraisal of Guidelines and Research and Evaluation Instrument.

Authors:  J S Burgers; B Fervers; M Haugh; M Brouwers; G Browman; T Philip; F A Cluzeau
Journal:  J Clin Oncol       Date:  2004-05-15       Impact factor: 44.544

3.  AGREE II: advancing guideline development, reporting, and evaluation in health care.

Authors:  Melissa C Brouwers; Michelle E Kho; George P Browman; Jako S Burgers; Francoise Cluzeau; Gene Feder; Béatrice Fervers; Ian D Graham; Jeremy Grimshaw; Steven E Hanna; Peter Littlejohns; Julie Makarski; Louise Zitzelsberger
Journal:  Prev Med       Date:  2010-08-20       Impact factor: 4.018

Review 4.  Guidelines for the treatment of pneumonia and urinary tract infections: evaluation of methodological quality using the Appraisal of Guidelines, Research and Evaluation II instrument.

Authors:  O Henig; D Yahav; L Leibovici; M Paul
Journal:  Clin Microbiol Infect       Date:  2013-08-30       Impact factor: 8.067

5.  Development of the AGREE II, part 2: assessment of validity of items and tools to support application.

Authors:  Melissa C Brouwers; Michelle E Kho; George P Browman; Jako S Burgers; Françoise Cluzeau; Gene Feder; Béatrice Fervers; Ian D Graham; Steven E Hanna; Julie Makarski
Journal:  CMAJ       Date:  2010-05-31       Impact factor: 8.262

6.  Development of the AGREE II, part 1: performance, usefulness and areas for improvement.

Authors:  Melissa C Brouwers; Michelle E Kho; George P Browman; Jako S Burgers; Francoise Cluzeau; Gene Feder; Béatrice Fervers; Ian D Graham; Steven E Hanna; Julie Makarski
Journal:  CMAJ       Date:  2010-05-31       Impact factor: 8.262

7.  A Systematic Critical Appraisal of Clinical Practice Guidelines in Juvenile Idiopathic Arthritis Using the Appraisal of Guidelines for Research and Evaluation II (AGREE II) Instrument.

Authors:  Christine A M Smith; Karine Toupin-April; Jeffrey W Jutai; Ciarán M Duffy; Prinon Rahman; Sabrina Cavallo; Lucie Brosseau
Journal:  PLoS One       Date:  2015-09-10       Impact factor: 3.240

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.