| Literature DB >> 34007675 |
Michael J Peeters1, M Kenneth Cor2, Sarah E Petite1, Michelle N Schroeder1.
Abstract
OBJECTIVES: Performance-based assessments, including objective structured clinical examinations (OSCEs), are essential learning assessments within pharmacy education. Because important educational decisions can follow from performance-based assessment results, pharmacy colleges/schools should demonstrate acceptable rigor in validation of their learning assessments. Though G-Theory has rarely been reported in pharmacy education, it would behoove pharmacy educators to, using G-Theory, produce evidence demonstrating reliability as a part of their OSCE validation process. This investigation demonstrates the use of G-Theory to describes reliability for an OSCE, as well as to show methods for enhancement of the OSCE's reliability. INNOVATION: To evaluate practice-readiness in the semester before final-year rotations, third-year PharmD students took an OSCE. This OSCE included 14 stations over three weeks. Each week had four or five stations; one or two stations were scored by faculty-raters while three stations required students' written responses. All stations were scored 1-4. For G-Theory analyses, we used G_Strings and then mGENOVA. CRITICAL ANALYSIS: Ninety-seven students completed the OSCE; stations were scored independently. First, univariate G-Theory design of students crossed with stations nested in weeks (p × s:w) was used. The total-score g-coefficient (reliability) for this OSCE was 0.72. Variance components for test parameters were identified. Of note, students accounted for only some OSCE score variation. Second, a multivariate G-Theory design of students crossed with stations (p• × s°) was used. This further analysis revealed which week(s) were weakest for the reliability of test-scores from this learning assessment. Moreover, decision-studies showed how reliability could change depending on the number of stations each week. For a g-coefficient >0.80, seven stations per week were needed. Additionally, targets for improvements were identified. IMPLICATIONS: In test validation, evidence of reliability is vital for the inference of generalization; G-Theory provided this for our OSCE. Results indicated that the reliability of scores was mediocre and could be improved with more stations. Revision of problematic stations could help reliability as well. Within this need for more stations, one practical insight was to administer those stations over multiple weeks/occasions (instead of all stations in one occasion). © Individual authors.Entities:
Keywords: generalizability theory; objective structured clinical examination; reliability; validation
Year: 2021 PMID: 34007675 PMCID: PMC8102968 DOI: 10.24926/iip.v12i1.2110
Source DB: PubMed Journal: Innov Pharm ISSN: 2155-0417
Variance Component Estimates from a Practice OSCE of 3rd-year PharmD Students using a p × (s: w) G-Theory design
Variance from difference in ability of students | 0.13 (64%) | |
Variance from difficulty of stations for all students | 0.015 (7%) | |
Variance from difficulty in stations from each week | 0.009 (4%) | |
Variance from some students (but not all) finding a week more difficult than other weeks | 0.001 (1%) | |
Variance from some students finding some stations more difficult than others (context specificity) | 0.049 (24%) | |
| 0.204 |
This also includes residual error
Estimated G-Coefficients for Stations in 3rd-year PharmD OSCE
1 | 2 | 3 | 4 | 5 | 6 | 7 |
| |
|---|---|---|---|---|---|---|---|---|
Week 1 |
| |||||||
Week 2 |
| |||||||
Week 3 |
| |||||||
| ||||||||
Figure 1.Graph of Estimated G-Coefficients with Increased Stations in Each of Three Weeks
Note: A threshold of 0.80 is often considered acceptable for high-stakes testing[5]