| Literature DB >> 15743522 |
Joy C MacDermid1, Dina Brooks, Sherra Solway, Sharon Switzer-McIntyre, Lucie Brosseau, Ian D Graham.
Abstract
BACKGROUND: The AGREE instrument has been validated for evaluating Clinical Practice Guidelines (CPG) pertaining to medical care. This study evaluated the reliability and validity of physical therapists using the AGREE to assess quality of CPGs relevant to physical therapy practice.Entities:
Mesh:
Year: 2005 PMID: 15743522 PMCID: PMC555572 DOI: 10.1186/1472-6963-5-18
Source DB: PubMed Journal: BMC Health Serv Res ISSN: 1472-6963 Impact factor: 2.655
Demographic description of physical therapy evaluators
| 40 (8) | 96 | 4 | 9 | 58 | 32 | 1 | |
| 39 (9) | 86 | 14 | 19 | 71 | 10 | 0 | |
| 39 (7) | 100 | 0 | 7 | 79 | 14 | 0 | |
| 41 (9) | 100 | 0 | 0 | 11* | 84 | 5 | |
* these two participants had significant research experience and were close to completion of Masters degree
Inter-rater reliability of AGREE instrument – individual items
| 0.34 | 0.51 | 0.50 | 0.66 | 0.58 | 0.73 | 0.46 | 0.84 | |
| 0.28 | 0.44 | |||||||
| 0.45 | 0.62 | 0.23 | 0.37 | 0.22 | 0.63 | |||
| 0.58 | 0.74 | 0.65 | 0.79 | 0.75 | 0.86 | 0.67 | 0.92 | |
| 0.57 | 0.73 | 0.37 | 0.54 | 0.55 | 0.71 | 0.45 | 0.83 | |
| 0.43 | 0.60 | 0.62 | 0.76 | 0.69 | 0.82 | 0.55 | 0.88 | |
| 0.48 | 0.65 | 0.36 | 0.53 | 0.36 | 0.77 | |||
| 0.88 | 0.93 | 0.77 | 0.87 | 0.78 | 0.88 | 0.83 | 0.97 | |
| 0.72 | 0.84 | 0.84 | 0.91 | 0.69 | 0.82 | 0.72 | 0.94 | |
| 0.61 | 0.76 | 0.54 | 0.70 | 0.66 | 0.80 | 0.62 | 0.91 | |
| 0.45 | 0.63 | 0.32 | 0.49 | 0.39 | 0.79 | |||
| 0.61 | 0.75 | 0.46 | 0.63 | 0.55 | 0.71 | 0.63 | 0.91 | |
| 0.74 | 0.85 | 0.66 | 0.79 | 0.56 | 0.72 | 0.64 | 0.92 | |
| 0.57 | 0.73 | 0.61 | 0.76 | 0.70 | 0.82 | 0.65 | 0.92 | |
| 0.31 | 0.47 | 0.30 | 0.46 | 0.38 | 0.79 | |||
| 0.39 | 0.56 | 0.49 | 0.66 | 0.46 | 0.63 | 0.43 | 0.82 | |
| 0.31 | 0.48 | 0.31 | 0.48 | 0.41 | 0.58 | 0.35 | 0.76 | |
| 0.32 | 0.48 | 0.60 | 0.75 | 0.42 | 0.59 | 0.52 | 0.87 | |
| 0.53 | 0.69 | 0.50 | 0.67 | 0.45 | 0.62 | 0.43 | 0.82 | |
| 0.48 | 0.65 | 0.40 | 0.57 | 0.49 | 0.66 | 0.43 | 0.82 | |
| 0.34 | 0.51 | 0.37 | 0.54 | 0.28 | 0.70 | |||
| 0.53 | 0.69 | 0.42 | 0.60 | 0.26 | 0.68 | |||
| 0.58 | 0.73 | 0.53 | 0.69 | 0.72 | 0.84 | 0.40 | 0.80 | |
ICC – Intraclass correlation coefficient
Results statistically significant at the p < 0.05 level except where indicated by bold.
Inter-rater reliability of AGREE instrument domain scores
| 0.37 (0.11–0.59) | 0.54 (0.19–0.74) | 0.71 (0.54–0.83) | 0.83 (0.70–0.90) | 0.81 (0.68–0.89) | 0.89 (0.81–0.94) | 0.41 (0.15–0.62) | 0.58 (0.26–0.76) | 0.65 (0.46–0.79) | 0.79 (0.63–0.88) | 0.60 (0.39–0.75) | 0.75 (0.56–0.86) | |
| 0.35 (0.10–0.56) | 0.52 (0.18–0.72) | 0.59 (0.39–0.74) | 0.74 (0.56–0.85) | 0.65 (0.47–0.78) | 0.79 (0.64–0.88) | 0.51 (0.29–0.69) | 0.68 (0.45–0.81) | 0.43 (0.19–0.63) | 0.61 (0.32–0.77) | 0.32 (0.06–0.54) | 0.49 (0.11–0.70) | |
| 0.17 (0.14–0.46) | 0.73 (0.54–0.85) | 0.84 (0.70–0.92) | 0.77 (0.61–0.87) | 0.87 (0.76–0.93) | 0.47 (0.18–0.69) | 0.64 (0.31–0.81) | 0.47 (0.19–0.68) | 0.64 (0.32–0.81) | 0.59 (0.34–0.77) | 0.75 (0.51–0.87) | ||
| 0.40 (0.25–0.58) | 0.80 (0.66–0.89) | 0.67 (0.54–0.80) | 0.93 (0.87–0.96) | 0.79 (0.68–0.88) | 0.96 (0.93–0.98) | 0.55 (0.40–0.72) | 0.88 (0.80–0.94) | 0.50 (0.35–0.68) | 0.86 (0.76–0.93) | 0.35 (0.19–0.54) | 0.76 (0.59–0.88) | |
CI – confidence interval; ICC – intraclass correlation coefficient
Results statistically significant at the p < 0.5 level except where indicated by bold.
Intraclass correlations for each AGREE instrument domain as a function of the number of raters
| 0.37 | 0.54 | 0.71 | 0.83 | 0.81 | 0.89 | 0.41 | 0.58 | 0.65 | 0.79 | 0.60 | 0.75 | |
| 0.38 | 0.64 | 0.71 | 0.88 | 0.82 | 0.93 | 0.54 | 0.78 | 0.66 | 0.85 | 0.38 | 0.65 | |
| 0.41 | 0.73 | 0.64 | 0.88 | 0.77 | 0.93 | 0.51 | 0.80 | 0.54 | 0.82 | 0.35 | 0.69 | |
| 0.42 | 0.78 | 0.69 | 0.92 | 0.79 | 0.95 | 0.51 | 0.84 | 0.57 | 0.87 | 0.38 | 0.76 | |
| 0.40 | 0.80 | 0.67 | 0.93 | 0.79 | 0.96 | 0.55 | 0.88 | 0.51 | 0.86 | 0.35 | 0.76 | |
ICC = Intraclass correlation coefficient
Agreement on whether a CPG would be recommended or not
| Pair of Raters | Kappa (unweighted) | Kappa (quadratic weights) |
| Generalists | 0.20 | 0.34 |
| Specialists | 0.25 | 0.22 |
| Researchers | 0.39 | 0.47 |
A Kappa was calculated on the final overall rating question whether or not a CPG should be using with the data dichotomized as YES (strongly recommend or recommend with provisos) or NO (Would not recommend or unsure) or by using quadratic weighting to compare the strength of recommendation (Strongly, with provisos, would not, unsure).
Results of factor analysis (principal components with varimax rotation)
| Item Mean | Std. Deviation | Components | ||||
| 1 | 2 | 3 | 4 | |||
| Scope and Purpose | ||||||
| Q1 | 3.4003 | .72614 | .339 | .271 | .217 | |
| Q2 | 2.9681 | .68948 | .213 | .129 | .296 | |
| Q3 | 3.3961 | .51941 | .393 | .016 | .430 | |
| Stakeholder Involvement | ||||||
| Q4 | 2.6814 | 1.07262 | .336 | .422 | .093 | |
| Q5 | 1.7517 | .89553 | .179 | .173 | .058 | |
| Q6 | 2.8331 | .89717 | .195 | .294 | .259 | |
| Q7 | 1.6664 | .75710 | .255 | .290 | .175 | |
| Rigour of development | ||||||
| Q8 | 2.7344 | 1.21287 | .096 | .247 | -.087 | |
| Q9 | 2.6533 | 1.20139 | .112 | .229 | -.123 | |
| Q10 | 2.6314 | 1.05247 | .132 | .384 | .066 | |
| Q11 | 2.9492 | .76769 | .059 | .339 | .302 | |
| Q12 | 2.9811 | 1.00409 | -.058 | .223 | .514 | |
| Q13 | 2.3556 | 1.16306 | .422 | .273 | .213 | |
| Q14 | 1.9425 | .99370 | .323 | .090 | .044 | |
| Clarity and presentation | ||||||
| Q15 | 3.2250 | .73262 | .132 | .200 | .149 | |
| Q16 | 3.2269 | .77070 | .022 | .241 | .322 | |
| Q17 | 3.1631 | .71880 | .093 | .453 | .400 | |
| Q18 | 2.3342 | .98180 | .091 | .449 | .273 | |
| Applicability | ||||||
| Q19 | 2.2072 | .94797 | .158 | .231 | .005 | |
| Q20 | 1.9500 | .83211 | .172 | .176 | -.073 | |
| Q21 | 2.2336 | .80322 | -.017 | .299 | .237 | |
| Editorial Independence | ||||||
| Q22 | 2.1453 | .90158 | .451 | .108 | .045 | |
| Q23 | 1.7575 | .92802 | .178 | .305 | -.436 | |
This table presents the results of the final 4 factor solution to factor analysis. Bolded cells shown the factor for which each item loaded most strongly. Results are grouped according to the Domains of the AGREE.
Hypothesis test: rigour of development is greater where panel is known to have methodology expertise
| Rater | Expert Panel | No or Uncertain Expertise | p |
| Generalist #1 | 79 | 58 | 0.002 |
| Generalist #2 | 79 | 57 | 0.001 |
| Specialist #1 | 73 | 58 | 0.015 |
| Specialist #2 | 75 | 65 | 0.09 |
| Researcher #1 | 72 | 57 | 0.02 |
| Research #2 | 80 | 61 | 0.014 |
| All 6 appraisers combined | 78 | 59 | <0.001 |
This table contains the scores for the Agree Domain on Rigour of Development. CPGs were classified as having an expert panel if 1) there were more than 3 authors listed (or an agency) and 2) there was an experienced CPG methodologist clearly identified as a panel member or if one of the study investigators was recognized as such. All others were classified as "No or Uncertain Expertise". The p value for the independent samples t-test is shown.
Correlation of domain scores with overall assessment of AGREE
| Domain | Correlation with Overall Rating |
| Stakeholder Involvement | 0.59 |
| Scope and Purpose | 0.52 |
| Rigour of Development | 0.64 |
| Clarity and Presentation | 0.62 |
| Applicability | 0.49 |
| Editorial Independence | 0.38 |
Kendall's Tau-b correlations were conducted between the mean rating of the over assessment of the CPG across all raters as compared to the mean of each Domain score. As hypothesized the correlation was highest with Rigour of development.