| Literature DB >> 32962692 |
Maarten P M Debets1, Renée A Scheepers2, Benjamin C M Boerebach3, Onyebuchi A Arah4,5,6,7, Kiki M J M H Lombarts3.
Abstract
BACKGROUND: Medical faculty's teaching performance is often measured using residents' feedback, collected by questionnaires. Researchers extensively studied the psychometric qualities of resulting ratings. However, these studies rarely consider the number of response categories and its consequences for residents' ratings of faculty's teaching performance. We compared the variability of residents' ratings measured by five- and seven-point response scales.Entities:
Keywords: Faculty; Five-point scale; Performance ratings; Residents; Response categories; Seven-point scale; Teaching performance; Variability
Mesh:
Year: 2020 PMID: 32962692 PMCID: PMC7510269 DOI: 10.1186/s12909-020-02244-9
Source DB: PubMed Journal: BMC Med Educ ISSN: 1472-6920 Impact factor: 2.463
Identical five- and seven-point questionnaire items compared in this study
| 1. | |
| 2. | |
| 3. | |
| 4. | |
| 5. | |
| 6. | |
| 7. | |
| 8. | |
| 9. | |
| 10. | |
| 11. | |
| 12. | |
| Total score (TS)a |
aMean average of the 12 identical items
Characteristics of residents and their ratings using the five-and seven-point questionnaires
| Questionnaire | 5 pt | 7 pt |
|---|---|---|
| Number of training programs | 10 | 8 |
| Number of residents | 119 | 206 |
| Number of measurement periods that residents participated ina | 150 | 241 |
| Median number of ratings per resident per measurement period | 7 | 7 |
| Number of ratings | 1264 | 2115 |
| Male residents (%) | 38 | 41.6 |
| Ratings by year of residency training (%) | ||
| 1 | 12.4 | 34.1 |
| 2 | 34.9 | 17.1 |
| 3 | 12.5 | 4.4 |
| 4 | 22.9 | 24.3 |
| ≥5 | 17.3 | 20.1 |
| Number of faculty | 175 | 254 |
| Number of faculty measurement periodsb | 273 | 354 |
| Median number of ratings per faculty per measurement period | 3 | 5 |
| Number of faculty evaluated by ≥3 residentsc | 205 | 275 |
| Median number of resident ratings (from faculty rated by ≥3 residents) | 4 | 6 |
a, bA measurement period is a four- to six-week data collection period. Some residents and faculty participated in more than one measurement period from January 2013 to January 2017
cAggregated faculty scores require three or more residents’ ratings to be reliable
Means, standard deviations and IQRs of residents’ ratings and faculty scores of five-and seven-point questionnaire items
| Residents’ ratings | Aggregated faculty scores | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5 pt | 7 pt | 5 pt | 7 pt | 5 pt | 7 pt | 5 pt | 7 pt | 5 pt | 7 pt | 5 pt | 7 pt | |||||
| Item | M (R) | M | Δ M | SD (R) | SD | Δ SD | IQR | IQR | M (R) | M | Δ M | SD (R) | SD | Δ SD | IQR | IQR |
| 1. | 3.98 (5.47) | 5.46 | .00 | .86 (1.29) | 1.21 | .08a | 4.00–5.00 | 5.00–6.00 | 3.98 (5.47) | 5.49 | .02 | .59 (.88) | .78 | .10 | 3.67–4.33 | 5.00–6.00 |
| 2. | 3.95 (5.42) | 5.45 | .03 | .90 (1.35) | 1.22 | .13 | 4.00–5.00 | 5.00–6.00 | 3.95 (5.43) | 4.46 | .03 | .61 (.92) | .79 | .13 | 3.67–4.33 | 5.00–6.00 |
| 3. | 3.95 (5.43) | 5.44 | .01 | .90 (1.35) | 1.21 | .14 | 4.00–5.00 | 5.00–6.00 | 3.94 (5.41) | 5.47 | .06 | .61 (.92) | .80 | .11 | 3.67–4.33 | 5.00–6.00 |
| 4. | 3.74 (5.12) | 5.10 | .01 | .91 (1.37) | 1.23 | .14a, b | 3.00–4.00 | 4.00–6.00 | 3.71 (5.08) | 5.14 | .06 | .61 (.91) | .83 | .08 | 3.33–4.13 | 4.67–5.75 |
| 5. | 4.00 (5.50) | 5.56 | .07 | .90 (1.35) | 1.15 | .20 | 4.00–5.00 | 5.00–6.00 | 3.98 (5.47) | 5.58 | .11 | .61 (.91) | .79 | .12 | 3.67–4.40 | 5.17–6.17 |
| 6. | 4.09 (5.63) | 5.78 | .15* | .95 (1.43) | 1.30 | .13 | 4.00–5.00 | 5.00–7.00 | 4.13 (5.69) | 5.77 | .08 | .62 (.93) | .89 | .04 | 3.75–4.57 | 5.43–6.40 |
| 7. | 4.29 (5.93) | 6.00 | .06 | .89 (1.33) | 1.21 | .12a | 4.00–5.00 | 6.00–7.00 | 4.35 (6.02) | 6.00 | .03 | .58 (.86) | .80 | .06 | 4.00–4.70 | 5.67–6.60 |
| 8. | 4.34 (6.00) | 6.12 | .12 | .82 (1.22) | 1.16 | .07a, b | 4.00–5.00 | 6.00–7.00 | 4.38 (6.07) | 6.12 | .06 | .51 (.75) | .76 | .00 | 4.20–4.75 | 6.00–6.62 |
| 9. | 3.97 (5.45) | 5.40 | .05 | .84 (1.26) | 1.21 | .05a, b | 4.00–4.00 | 5.00–6.00 | 3.95 (5.42) | 5.45 | .03 | .54 (.82) | .80 | .01 | 3.67–4.33 | 5.00–6.00 |
| 10. | 3.96 (5.44) | 5.41 | .02 | .85 (1.28) | 1.22 | .06a | 4.00–5.00 | 5.00–6.00 | 3.96 (5.44) | 5.46 | .02 | .56 (.85) | .80 | .05 | 3.67–4.33 | 5.00–6.00 |
| 11. | 4.03 (5.55) | 5.76 | .21* | .79 (1.19) | 1.08 | .11 | 4.00–5.00 | 5.00–6.00 | 4.06 (5.59) | 5.75 | .16 | .47 (.70) | .65 | .05 | 3.80–4.33 | 5.50–6.62 |
| 12. | 4.06 (5.59) | 5.84 | .25* | .87 (1.30) | 1.14 | .15 | 4.00–5.00 | 5.00–7.00 | 4.10 (5.66) | 5.84 | .18 | .54 (.81) | .70 | .11 | 3.90–4.40 | 5.57–6.29 |
| TS. | 4.03 (5.54) | 5.61 | .07 | .71 (1.06) | 0.99 | .07 | 3.75–4.50 | 5.17–6.25 | 4.04 (5.56) | 5.63 | .06 | .49 (.73) | .68 | .05 | 3.85–4.37 | 5.33–6.10 |
p < .05 after Bonferroni correction
aEqual variances could not be assumed
bAfter applying weights to the analysis, equal variances could not be assumed
Percentages for similar response categories and percentages below and above the neutral response category for the five- and seven-point questionnaire items
| Totally disagree | Neutral | Totally agree | Below | Above | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Item | 5 pt | 7 pt | 5 pt | 7 pt | 5 pt | 7 pt | 5 pt | 7 pt | 5 pt | 7 pt |
| 1. | 1.3 | .7 | 15.5 | 13.4 | 27.1* | 18.5* | 6.3 | 6.2 | 78.2 | 80.4 |
| 2. | 1.9* | .8* | 15.3 | 13.9 | 26.7* | 17.6* | 7.4 | 6.1 | 77.3 | 79.9 |
| 3. | 1.7* | .6* | 17.6* | 14.0* | 28.4* | 19.0* | 6.8 | 6.1 | 75.6* | 79.9* |
| 4. | 1.5 | .9 | 27.5* | 21.8* | 20.5* | 12.0* | 8.5 | 8.4 | 64.0* | 69.8* |
| 5. | 2.0* | .7* | 15.5* | 10.0* | 29.9* | 19.0* | 6.3 | 4.9 | 78.2* | 85.2* |
| 6. | 2.8* | 1.5* | 11.6* | 5.9* | 37.5* | 31.8* | 7.1 | 7.1 | 81.3* | 87.0* |
| 7. | 2.3 | 1.3 | 8.0* | 4.5* | 48.5* | 39.7* | 4.7 | 5.2 | 87.3* | 90.3* |
| 8. | 1.3 | 1.0 | 6.6* | 3.8* | 49.3 | 45.5 | 3.9 | 4.3 | 89.5 | 91.8 |
| 9. | 1.3 | .7 | 15.1 | 12.6 | 24.9* | 16.9* | 6.0 | 7.4 | 78.9 | 80.0 |
| 10 | 1.4 | .9 | 17.0* | 13.0* | 25.6* | 17.0* | 5.7 | 7.0 | 77.3 | 80.0 |
| 11. | 1.3* | .5* | 13.6* | 7.2* | 26.3 | 24.2 | 4.1 | 3.8 | 82.3* | 88.9* |
| 12. | 1.6 | .7 | 12.7* | 7.1* | 31.8 | 29.9 | 5.8 | 4.5 | 81.5* | 88.3* |
* Percentages of both scales differ with p < 0.05 (Bonferroni correction included)