BACKGROUND: Large-scale performance-based examinations are logistically complex and costly to run. Scores based solely on performance-based stations require extended testing time to achieve acceptable generalizability. PURPOSE: Combining scores from performance-based formats and written formats may improve test generalizability. METHODS: Data from 718 test-takers on the standardized patient-based portion of the National Board of Chiropractic Examiners Part IV examination were analyzed using a generalizability framework. RESULTS: The written and performance scores displayed a moderate universe score correlation indicating each component provided unique information about related clinical skills and the examinee's ability to manage a case. Combining the two scores provided a summary score that was more generalizable than either of the two measures individually. CONCLUSION: These results provide guidance for improving the generalizability of the standardized patient-based stations by combining the scores on the performance and written components using appropriate weighting procedures.
BACKGROUND: Large-scale performance-based examinations are logistically complex and costly to run. Scores based solely on performance-based stations require extended testing time to achieve acceptable generalizability. PURPOSE: Combining scores from performance-based formats and written formats may improve test generalizability. METHODS: Data from 718 test-takers on the standardized patient-based portion of the National Board of Chiropractic Examiners Part IV examination were analyzed using a generalizability framework. RESULTS: The written and performance scores displayed a moderate universe score correlation indicating each component provided unique information about related clinical skills and the examinee's ability to manage a case. Combining the two scores provided a summary score that was more generalizable than either of the two measures individually. CONCLUSION: These results provide guidance for improving the generalizability of the standardized patient-based stations by combining the scores on the performance and written components using appropriate weighting procedures.