Literature DB >> 34907964

Differential Item Functioning Analysis of United States Medical Licensing Examination Step 1 Items.

Jonathan D Rubright1, Michael Jodoin2, Stephanie Woodward3, Michael A Barone4.   

Abstract

PURPOSE: Previous studies have examined and identified demographic group score differences on United States Medical Licensing Examination (USMLE) Step examinations. It is necessary to explore potential etiologies of such differences to ensure fairness of examination use. Although score differences are largely explained by preceding academic variables, one potential concern is that item-level bias may be associated with remaining group score differences. The purpose of this 2019-2020 study was to statistically identify and qualitatively review USMLE Step 1 exam questions (items) using differential item functioning (DIF) methodology.
METHOD: Logistic regression DIF was used to identify and classify the effect size of DIF on Step 1 items meeting minimum sample size criteria. After using DIF to flag items statistically, subject matter expert (SME) review was used to identify potential reasons why items may have performed differently between racial and gender groups, including characteristics such as content, format, wording, context, or stimulus materials. USMLE SMEs reviewed items to identify the group difference they believed was present, if any; articulate a rationale behind the group difference; and determine whether that rationale would be considered construct relevant or construct irrelevant.
RESULTS: All identified DIF rationales were relevant to the constructs being assessed and therefore did not reflect item bias. Where SME-generated rationales aligned with statistical differences (flags), they favored self-identified women on items tagged to women's health content categories and were judged to be construct relevant.
CONCLUSIONS: This study did not find evidence to support the hypothesis that group-level performance differences beyond those explained by prior academic performance variables are driven by item-level bias. Health professions examination programs have an obligation to assess for group differences, and when present, investigate to what extent, if any, measurement bias plays a role.
Copyright © 2021 by the Association of American Medical Colleges.

Entities:  

Mesh:

Year:  2022        PMID: 34907964     DOI: 10.1097/ACM.0000000000004567

Source DB:  PubMed          Journal:  Acad Med        ISSN: 1040-2446            Impact factor:   6.893


  1 in total

1.  HIGH-STAKES KNOWLEDGE ASSESSMENT AT ABFM: WHAT WE HAVE LEARNED AND HOW IT IS USEFUL.

Authors:  Warren P Newton; Thomas R O'Neill; Ting Wang
Journal:  Ann Fam Med       Date:  2022 Mar-Apr       Impact factor: 5.166

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.