| Literature DB >> 30237646 |
Eva A O Zijlmans1, L Andries van der Ark2, Jesper Tijmstra1, Klaas Sijtsma1.
Abstract
Reliability is usually estimated for a test score, but it can also be estimated for item scores. Item-score reliability can be useful to assess the item's contribution to the test score's reliability, for identifying unreliable scores in aberrant item-score patterns in person-fit analysis, and for selecting the most reliable item from a test to use as a single-item measure. Four methods were discussed for estimating item-score reliability: the Molenaar-Sijtsma method (method MS), Guttman's method λ6 , the latent class reliability coefficient (method LCRC), and the correction for attenuation (method CA). A simulation study was used to compare the methods with respect to median bias, variability (interquartile range [IQR]), and percentage of outliers. The simulation study consisted of six conditions: standard, polytomous items, unequal α parameters, two-dimensional data, long test, and small sample size. Methods MS and CA were the most accurate. Method LCRC showed almost unbiased results, but large variability. Method λ6 consistently underestimated item-score reliabilty, but showed a smaller IQR than the other methods.Entities:
Keywords: Guttman’s method λ6; correction for attenuation; item-score reliability; latent class reliability coefficient; method MS
Year: 2018 PMID: 30237646 PMCID: PMC6140096 DOI: 10.1177/0146621618758290
Source DB: PubMed Journal: Appl Psychol Meas ISSN: 0146-6216
Marginal Cumulative Probabilities for Four Artificial Items With Three Ordered Item Scores.
| Item | ||||
|---|---|---|---|---|
| 1 | 2 | 3 | 4 | |
|
| 1.00 | 1.00 | 1.00 | 1.00 |
|
| .97 | .94 | .93 | .86 |
|
| .53 | .32 | .85 | .72 |
Matrix With Joint Cumulative Probabilities and Marginal Cumulative Probabilities .
|
|
|
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|---|---|---|
| .32 | .53 | .72 | .85 | .86 | .93 | .94 | .97 | ||
|
| .32 | NA | .20 | .27 | .29 | .30 | .31 | NA | .32 |
|
| .53 | .20 | NA | .41 | .47 | .48 | .50 | .51 | NA |
|
| .72 | .27 | .41 | NA | .64 | NA | .68 | .68 | .70 |
|
| .85 | .29 | .47 | .64 | NA | .76 | NA | .81 | .84 |
|
| .86 | .30 | .48 | NA | .76 | NA | .81 | .81 | .84 |
|
| .93 | .31 | .50 | .68 | NA | .81 | NA | .88 | .91 |
|
| .94 | NA | .51 | .68 | .81 | .81 | .88 | NA | .91 |
|
| .97 | .32 | NA | .70 | .84 | .84 | .91 | .91 | NA |
Note. NA = not available.
Item Parameters of the Multidimensional Graded Response Model for the Simulation Design.
| Item | Design | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Standard | Polytomous | Unequal | Two dimensions | |||||||||
|
|
|
|
|
|
|
|
|
|
|
|
| |
| 1 | 1 | −1.5 | 1 | −3 | −2 | −1 | 0 | 0.5 | −1.5 | 1 | 0 | −1.5 |
| 2 | 1 | −0.9 | 1 | −2.4 | −1.4 | −0.4 | 0.6 | 2 | −0.9 | 0 | 1 | −0.9 |
| 3 | 1 | −0.3 | 1 | −1.8 | −0.8 | 0.2 | 1.2 | 0.5 | −0.3 | 1 | 0 | −0.3 |
| 4 | 1 | 0.3 | 1 | −1.2 | −0.2 | 0.8 | 1.8 | 2 | 0.3 | 0 | 1 | 0.3 |
| 5 | 1 | 0.9 | 1 | −0.6 | 0.4 | 1.4 | 2.4 | 0.5 | 0.9 | 1 | 0 | 0.9 |
| 6 | 1 | 1.5 | 1 | 0 | 1 | 2 | 3 | 2 | 1.5 | 0 | 1 | 1.5 |
Note. = item discrimination, = item location.
Figure 1.Difference , where represents an estimate of methods MS, , LCRC, and CA, for six different conditions (see Table 3 for the specifications of the conditions).
Note. The bold horizontal line represents the median bias. The numbers in the boxplots represent the percentage outliers in that condition. MS = Molenaar–Sijtsma method; = Guttman’s method ; LCRC = latent class reliability coefficient; CA = correction for attenuation.
Estimated Item Indices for the Transitive Reasoning Data Set.
| Item | Item-score reliability | Item indices | ||||||
|---|---|---|---|---|---|---|---|---|
| Item | Method MS | Method | Method CA | Item-rest correlation | Item-factor loading | Item scalability | Item discrimination | |
|
| 0.97 |
| 0.28 | 0.21 | 0.26 |
| 0.28 |
|
|
| 0.81 | 0.01 | 0.13 | 0.05 | 0.13 | −0.04 | 0.08 | −0.05 |
|
| 0.97 |
|
|
|
|
|
|
|
|
| 0.78 | 0.05 | 0.13 | 0.02 | 0.08 | −0.10 | 0.05 | −0.20 |
|
| 0.84 | 0.18 | 0.23 |
| 0.29 |
| 0.18 |
|
|
| 0.94 |
| 0.20 | 0.17 | 0.23 |
| 0.21 |
|
|
| 0.64 | 0.03 | 0.05 | 0.00 | −0.04 | −0.06 | −0.03 | −0.01 |
|
| 0.88 |
|
| 0.26 | 0.28 |
| 0.19 |
|
|
| 0.80 | 0.05 | 0.06 | 0.07 | 0.15 | 0.34 | 0.09 | 0.64 |
|
| 0.30 | 0.00 | 0.10 | 0.10 | 0.18 | 0.48 | 0.17 |
|
|
| 0.52 | 0.00 | 0.17 | 0.14 | 0.21 | 0.61 | 0.14 |
|
|
| 0.48 | 0.00 | 0.07 | 0.06 | −0.17 | −0.29 | −0.14 | −0.50 |
Note. Bold-faced values are above the heuristic rule for that item index. MS = Molenaar–Sijtsma method; CA = correction for attenuation.
Parameters of Latent Class Models Having Two and Three Classes.
| Two-class model | Three-class model | ||
|---|---|---|---|
| Class weights | Response probabilities | Class weights | Response probabilities |
|
|
|
|
|
|
|
|
|
|
|
|
| ||