| Literature DB >> 22193944 |
Alberto Alves de Lima1, Diego Conde, Juan Costabel, Juan Corso, Cees Van der Vleuten.
Abstract
Reliability estimations of workplace-based assessments with the mini-CEX are typically based on real-life data. Estimations are based on the assumption of local independence: the object of the measurement should not be influenced by the measurement itself and samples should be completely independent. This is difficult to achieve. Furthermore, the variance caused by the case/patient or by assessor is completely confounded. We have no idea how much each of these factors contribute to the noise in the measurement. The aim of this study was to use a controlled setup that overcomes these difficulties and to estimate the reproducibility of the mini-CEX. Three encounters were videotaped from 21 residents. The patients were the same for all residents. Each encounter was assessed by 3 assessors who assessed all encounters for all residents. This delivered a fully crossed (all random) two-facet generalizability design. A quarter of the total variance was associated with universe score variance (28%). The largest source of variance was the general error term (34%) followed by the main effect of assessors (18%). Generalizability coefficients indicated that an approximate sample of 9 encounters was needed assuming a single different assessor per encounter and assuming different cases per encounter (the usual situation in real practice), 4 encounters when 2 raters were used and 3 encounters when 3 raters are used. Unexplained general error and the leniency/stringency of assessors are the major causes for unreliability in mini-CEX. To optimize reliability rater training might have an effect.Entities:
Mesh:
Year: 2011 PMID: 22193944 PMCID: PMC3569586 DOI: 10.1007/s10459-011-9343-y
Source DB: PubMed Journal: Adv Health Sci Educ Theory Pract ISSN: 1382-4996 Impact factor: 3.853
Means scores and standard deviations for the 3 cases split up per assessors, the overall scores for all assessors per case and the total score for all cases
| Medical interviewing skills | Physical examination skills | Humanistic qualities/professionalism | Clinical judgement | Counselling skills | Organization/efficiency | Global clinical competence | |||
|---|---|---|---|---|---|---|---|---|---|
| Case 1 | |||||||||
| A 1 | M | 6.61 | 6.9 | 7.61 | 6.09 | 6.14 | 6.14 | 6.04 | |
| SD | 1.85 | 1.25 | 0.58 | 2.09 | 2.05 | 1.79 | 1.68 | ||
| A 2 | M | 6.14 | 6 | 6.19 | 6.3 | 6.38 | 6.23 | 6.57 | |
| SD | 1.45 | 0.5 | 1.5 | 1.34 | 1.85 | 1.6 | 1.59 | ||
| A 3 | M | 5.95 | 6.11 | 6.04 | 2.96 | 5.71 | 6 | 5.42 | |
| SD | 1.85 | 1.28 | 1.8 | 1.72 | 2.14 | 1.78 | 1.8 | ||
| O | M | 6.24 | 6.25 | 6.62 | 6.06 | 6.08 | 6.13 | 6.02 | |
| SD | 1.79 | 1.22 | 1.54 | 1.73 | 2.01 | 1.7 | 1.73 | ||
| Case 2 | |||||||||
| A 1 | M | 7.33 | 7 | 7.66 | 6.52 | 6.9 | 6.57 | 6.52 | |
| SD | 1.15 | 1.22 | 0.57 | 2.33 | 1.17 | 1.2 | 1.32 | ||
| A 2 | M | 6.09 | 5.95 | 5.71 | 5.8 | 5.9 | 5.95 | 6.19 | |
| SD | 1.13 | 0.38 | 1 | 0.81 | 1.41 | 0.97 | 1.12 | ||
| A 3 | M | 4.9 | 6.23 | 5.76 | 5.04 | 5 | 5.66 | 4.8 | |
| SD | 1.48 | 1.3 | 1.3 | 1.43 | 1.3 | 1.31 | 1.24 | ||
| O | M | 6.11 | 6.4 | 6.38 | 5.79 | 5.81 | 6.06 | 5.84 | |
| SD | 1.59 | 1.12 | 1.34 | 1.73 | 1.65 | 1.21 | 1.42 | ||
| Case 3 | |||||||||
| A 1 | M | 6.95 | 7.47 | 7.61 | 6.42 | 6.09 | 6.42 | 6.61 | |
| SD | 1.96 | 0.87 | 0.49 | 1.85 | 2.34 | 1.69 | 1.62 | ||
| A 2 | M | 5.71 | 6 | 5.71 | 5.9 | 5.33 | 5.76 | 5.71 | |
| SD | 0.84 | 0 | 0.9 | 0.53 | 1.35 | 0.53 | 0.78 | ||
| A 3 | M | 5.52 | 5.61 | 6.14 | 5.38 | 4.71 | 5.23 | 5.04 | |
| SD | 1.36 | 0.97 | 1.15 | 1.49 | 1.84 | 1.67 | 1.65 | ||
| O | M | 6.06 | 6.37 | 6.49 | 5.81 | 5.38 | 5.81 | 5.79 | |
| SD | 1.57 | 1.09 | 1.2 | 1.63 | 1.94 | 1.46 | 1.53 | ||
| All cases | |||||||||
| Total score | M | 6.14 | 6.34 | 6.5 | 5.92 | 5.8 | 6 | 5.88 | |
| SD | 1.62 | 1.14 | 1.37 | 1.64 | 1.84 | 1.47 | 1.56 | ||
A1 assessor 1, A2 assessor 2, A3 assessor 3, O overall score, M mean, SD standard deviation
Estimated variance components, standard errors, and relative size of variance components
| Source of variance | Explanation | Estimated variance components | Standard error | % Of total variance |
|---|---|---|---|---|
| Vr | Systematic variability of residents | 0.48431 | 0.19709 | 28 |
| Vc | Systematic variability of cases (case difficulty) | 0.00000 | 0.02150 | 0 |
| Va | Systematic variability of assessors (leniency/stringency) | 0.30925 | 0.24682 | 18 |
| Vcr | Variability of residents across cases | 0.15974 | 0.08302 | 9 |
| Var | Assessor variability for some residents | 0.12108 | 0.07523 | 7 |
| Vca | Assessor variability for some cases | 0.07113 | 0.05726 | 4 |
| Vrca | General error term | 0.58305 | 0.09106 | 34 |
| ∑ | 1.72855 |
Reliability estimates as a function of the number of cases and assessors for the situation where residents are given different cases with different assessors
| Number of cases | One assessor per case | Two assessors per case | Three assessors per case |
|---|---|---|---|
| 1 | 0.33 | 0.49 | 0.59 |
| 2 | 0.49 | 0.66 | 0.74 |
| 3 | 0.59 | 0.74 | 0.81 |
| 4 | 0.66 | 0.80 | 0.85 |
| 5 | 0.71 | 0.83 | 0.88 |
| 7 | 0.77 | 0.87 | 0.91 |
| 9 | 0.81 | 0.90 | 0.93 |
| 11 | 0.84 | 0.91 | 0.94 |
| 13 | 0.86 | 0.93 | 0.95 |
| 15 | 0.88 | 0.94 | 0.96 |
Reliability estimates as a function of the number of cases and assessors for the situation where residents are given different cases but with the same assessors
| Number of cases | One assessor for all cases | The same two assessors for all cases | The same three assessors for all cases |
|---|---|---|---|
| 1 | 0.36 | 0.49 | 0.56 |
| 2 | 0.50 | 0.64 | 0.70 |
| 3 | 0.58 | 0.71 | 0.77 |
| 4 | 0.63 | 0.75 | 0.80 |
| 5 | 0.66 | 0.78 | 0.83 |
| 7 | 0.70 | 0.81 | 0.86 |
| 9 | 0.73 | 0.83 | 0.87 |
| 11 | 0.75 | 0.85 | 0.88 |
| 13 | 0.76 | 0.85 | 0.89 |
| 15 | 0.77 | 0.86 | 0.90 |