| Literature DB >> 31455049 |
Dario Cecilio-Fernandes1,2, André Bremers3, Carlos Fernando Collares4, Wybe Nieuwland5, Cees van der Vleuten4, René A Tio4,6.
Abstract
PURPOSE: Assessment in different languages should measure the same construct. However, item characteristics, such as item flaws and content, may favor one test-taker group over another. This is known as item bias. Although some studies have focused on item bias, little is known about item bias and its association with items characteristics. Therefore, this study investigated the association between item characteristics and bias.Entities:
Keywords: Bias; Educational measurement; Medical education
Mesh:
Year: 2019 PMID: 31455049 PMCID: PMC6715902 DOI: 10.3946/kjme.2019.130
Source DB: PubMed Journal: Korean J Med Educ ISSN: 2005-727X
Mean, SD, Minimum and Maximum of Measurement, Infit, Outfit, and Error for Items and Person
| Test | Category | Items | Person | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Measure | Infit | Outfit | Error | Measure | Infit | Outfit | Error | ||
| Test 1 (September) | Mean±SD | 0.00±1.39 | 1.00±0.13 | 0.92±0.30 | 0.09±0.03 | -1.93±1.14 | 0.99±0.13 | 0.92±0.38 | 0.23±0.07 |
| Minimum | -3.78 | 0.73 | 0.40 | 0.06 | -5.54 | 0.68 | 0.18 | 0.72 | |
| Maximum | 2.93 | 1.58 | 2.06 | 0.24 | 0.78 | 1.79 | 5.83 | 0.17 | |
| Test 2 (December) | Mean±SD | 0.00±1.37 | 1.00±0.12 | 0.96±0.26 | 0.08±0.04 | -1.41±0.93 | 0.99±0.12 | 0.96±0.28 | 0.20±0.04 |
| Minimum | -3.42 | 0.73 | 0.49 | 0.06 | -4.27 | 0.69 | 0.36 | 0.17 | |
| Maximum | 3.87 | 1.53 | 1.75 | 0.32 | 0.83 | 1.54 | 4.21 | 0.43 | |
| Test 3 (February) | Mean±SD | 0.00±1.28 | 1.00±0.14 | 0.97±0.28 | 0.08±0.03 | -1.31±0.91 | 0.99±0.10 | 0.97±0.27 | 0.19±0.03 |
| Minimum | -3.99 | 0.71 | 0.48 | 0.06 | -3.72 | 0.73 | 0.34 | 0.16 | |
| Maximum | 3.73 | 1.61 | 1.90 | 0.29 | 1.28 | 1.42 | 3.13 | 0.36 | |
| Test 4 (May) | Mean±SD | 0.00±1.28 | 1.00±0.11 | 1.00±0.22 | 0.08±0.03 | -1.10±0.86 | 0.99±0.11 | 1.00±0.25 | 0.19±0.03 |
| Minimum | -3.38 | 0.74 | 0.62 | 0.06 | -3.74 | 0.70 | 0.39 | 0.16 | |
| Maximum | 3.52 | 1.32 | 1.85 | 0.25 | 0.92 | 1.65 | 3.31 | 0.36 | |
SD: Standard deviation.
Number of Items That Presented Differential Item Function Favoring the National or International Track Divided by the Following Categories: Negligible, Moderate, and Larger
| Test | Category | Size | |||
|---|---|---|---|---|---|
| Negligible (%) | Moderate (%) | Larger (%) | Total (%) | ||
| Test 1 (September) | International | 4 (2) | 7 (3.5) | 25 (12.5) | 36 (18) |
| National | 16 (8) | 8 (4) | 8 (4) | 32 (16) | |
| Test 2 (December) | International | 9 (4.5) | 4 (2) | 25 (12.5) | 38 (19) |
| National | 21 (10,5) | 3 (1.5) | 9 (4.5) | 33 (16.5) | |
| Test 3 (February) | International | 4 (2) | 8 (4) | 24 (12) | 36 (18) |
| National | 17 (8.5) | 3 (1.5) | 11 (5.5) | 31 (15.5) | |
| Test 4 (May) | International | 8 (4) | 4 (2) | 24 (12) | 36 (18) |
| National | 17 (8.5) | 4 (2) | 9 (4.5) | 30 (15) | |
| Total | International | 25 (3.12) | 23 (2.87) | 98 (12.25) | 146 (18.5) |
| National | 71 (8.8) | 18 (2.25) | 37 (4.62) | 126 (15.75) | |
Distribution of Items in the 17 Categories of the Progress Test
| Category | Items | Total of items | |||||
|---|---|---|---|---|---|---|---|
| No DIF | DIF favoring: international | DIF favoring: national | |||||
| No. of items (%) | Min/max size | No. of items (%) | Min/max size | No. of items (%) | Min/max size | ||
| Respiratory system | 46 (74.4) | 0.01/1.9 | 12 (19.7) | 0.13/1.21 | 3 (4.9) | 0.34/1.04 | 61 |
| Blood & immune system | 27 (67.5) | 0/2.4 | 4 (10) | 0.02/1.34 | 9 (22.5) | 0.02/0.97 | 40 |
| Musculoskeletal system | 30 (61.2) | 0.03/2.36 | 11 (22.4) | 0.22/1.59 | 8 (16.3) | 0.21/0.85 | 49 |
| Mental health care | 23 (50) | 0.02/1.95 | 7 (15.2) | 0.13/1.54 | 16 (34.8) | 0.03/1.25 | 46 |
| Reproductive system, pregnancy, childbirth & puerperium | 25 (58.1) | 0.19/2.56 | 11 (25.6) | 0.28/1.3 | 7 (16.3) | 0.08/0.23 | 43 |
| Cardiovascular system | 39 (65) | 0.02/2.3 | 16 (26.7) | 0.54/2.08 | 5 (8.3) | 0.13/0.89 | 60 |
| Hormones & metabolism, endocrine system | 25 (64.1) | 0.03/2.24 | 13 (33.3) | 0.38/2.4 | 1 (2.6) | 1.33/1.33 | 39 |
| Dermis & connective tissue | 23 (60.5) | 0.11/2.18 | 8 (21.1) | 0.45/1.81 | 7 (18.4) | 0.10/1.01 | 38 |
| Personal and social aspects | 21 (44.7) | 0.01/2.31 | 6 (12.8) | 0.07/1.02 | 20 (42.6) | 0.11/1.85 | 47 |
| Digestive/gastrointestinal system, nutritional disorders | 33 (68.8) | 0.01/2.56 | 12 (25) | 0.24/1.06 | 3 (6.3) | 0.01/1.08 | 48 |
| Nervous system & senses | 46 (83.6) | 0.05/2.56 | 6 (10.9) | 0.29/1.12 | 3 (5.5) | 0.13/0.7 | 55 |
| Kidneys & urinary system | 52 (68.4) | 0.01/2.56 | 14 (18.4) | 0.17/1.61 | 10 (13.2) | 0.02/0.69 | 76 |
| Molecular & cellular aspects | 24 (68.6) | 0.03/1.87 | 8 (22.9) | 0.03/1.5 | 3 (8.6) | 0.36/0.62 | 35 |
| Epistemology, methodology & applied biostatistics | 23 (65.7) | 0.17/1.97 | 3 (8.6) | 0.85/1.09 | 9 (25.7) | 0.21/1.04 | 35 |
| Stages of life | 18 (66.7) | 0.07/1.29 | 4 (14.8) | 0.43/1.22 | 5 (18.5) | 0.06/1.35 | 27 |
| Knowledge of skills | 44 (74.6) | 0/2.5 | 8 (13.6) | 0.62/1.95 | 7 (11.9) | 0.04/1.92 | 59 |
| Preventive medicine | 11 (45.8) | 0.05/0.96 | 3 (12.5) | 0.08/2.22 | 10 (41.7) | 0.02/0.86 | 24 |
DIF: Differential item function, Min: Minimum, Max: Maximum.
Number of Items Favoring the National or International Track Divided by Item Flaws
| Variable | Category | No DIF | DIF favoring: international | DIF favoring: national | Total | |||
|---|---|---|---|---|---|---|---|---|
| No. of items (%) | Min/max size | No. of items (%) | Min/max size | No. of items (%) | Min/max size | |||
| Logical clues | No | 507 (65.3) | 0.00/2.56 | 145 (18.7) | 0.02/2.4 | 125 (16.1) | 0.01/1.92 | 777 |
| Yes | 3 (60) | 0.23/1.47 | 1 (20) | 0.80/0.80 | 1 (20) | 1.06/1.06 | 5 | |
| Greater detail in correct option | No | 504 (65.3) | 0.00/2.56 | 143 (18.5) | 0.02/2.4 | 125 (16.2) | 0.01/1.92 | 772 |
| Yes | 6 (60) | 0.04/2.36 | 3 (30) | 0.91/2.08 | 1 (10) | 0.69/0.69 | 10 | |
| Implausible distractors | No | 475 (64.9) | 0.00/2.56 | 140 (19.1) | 0.02/2.40 | 117 (16) | 0.01/1.92 | 732 |
| Yes | 35 (70) | 0.01/1.87 | 6 (12) | 0.55/0.92 | 9 (18) | 0.13/1.25 | 50 | |
| Unfocused stem | No | 500 (65) | 0.00/2.56 | 145 (18.9) | 0.02/2.40 | 124 (16.1) | 0.01/1.92 | 769 |
| Yes | 10 (76.9) | 0.20/1.95 | 1 (7.7) | 1.08/1.08 | 2 (15.4) | 0.25/1.1 | 13 | |
| No correct or more than one correct answer | No | 504 (65.8) | 0.00/2.56 | 142 (18.5) | 0.02/2.40 | 120 (15.7) | 0.01/1.92 | 766 |
| Yes | 6 (37.5) | 0.01/0.80 | 4 (25) | 0.56/1.22 | 6 (37.5) | 0.03/1.06 | 16 | |
| Unnecessary information | No | 502 (65.1) | 0.00/2.56 | 145 (18.8) | 0.02/2.40 | 124 (16.1) | 0.01/1.92 | 771 |
| Yes | 8 (72.7) | 0.11/1.95 | 1 (9.1) | 1.08/1.08 | 2 (18.2) | 0.06/0.69 | 11 | |
| Unbalance in distractors | No | 493 (65.1) | 0.00/2.56 | 141 (18.6) | 0.02/2.40 | 123 (16.3) | 0.01/1.92 | 757 |
| Yes | 17 (68) | 0.01/1.91 | 5 (20) | 0.47/1.21 | 3 (12) | 0.02/0.41 | 25 | |
| Negative items | No | 496 (64.8) | 0.00/2.56 | 144 (18.8) | 0.02/2.40 | 125 (16.3) | 0.01/1.92 | 765 |
| Yes | 14 (82.4) | 0.02/1.31 | 2 (11.8) | 0.35/0.49 | 1 (5.9) | 0.31/0.31 | 17 | |
DIF: Differential item function, Min: Minimum, Max: Maximum.
Number of Items Favoring the National or International Track Divided by the Number of Non- and Case-Based Questions
| Case-based questions | No DIF | DIF favoring: international | DIF favoring: national | Total | |||
|---|---|---|---|---|---|---|---|
| No. of items (%) | Min/max size | No. of items (%) | Min/max size | No. of items (%) | Min/max size | ||
| No | 314 (59.7) | 0/2.56 | 120 (22.8) | 0.2/2.4 | 92 (17.5) | 0.01/1.85 | 526 |
| Yes | 196 (76.6) | 0/2.56 | 26 (10.2) | 0.07/2.08 | 34 (13.3) | 0.02/1.92 | 256 |
DIF: Differential item function, Min: Minimum, Max: Maximum.