| Literature DB >> 30542214 |
Eva A O Zijlmans1, Jesper Tijmstra1, L Andries van der Ark2, Klaas Sijtsma1.
Abstract
Reliability is usually estimated for a total score, but it can also be estimated for item scores. Item-score reliability can be useful to assess the repeatability of an individual item score in a group. Three methods to estimate item-score reliability are discussed, known as method MS, method λ 6 , and method CA. The item-score reliability methods are compared with four well-known and widely accepted item indices, which are the item-rest correlation, the item-factor loading, the item scalability, and the item discrimination. Realistic values for item-score reliability in empirical-data sets are monitored to obtain an impression of the values to be expected in other empirical-data sets. The relation between the three item-score reliability methods and the four well-known item indices are investigated. Tentatively, a minimum value for the item-score reliability methods to be used in item analysis is recommended.Entities:
Keywords: Coefficient λ6; correction for attenuation; item discrimination; item scalability; item-factor loading; item-rest correlation; item-score reliability
Year: 2017 PMID: 30542214 PMCID: PMC6236637 DOI: 10.1177/0013164417728358
Source DB: PubMed Journal: Educ Psychol Meas ISSN: 0013-1644 Impact factor: 2.821
Overview of the Data Sets
| Data set | Attribute |
|
| Percentage missingness | Recoded items | Reference | |
|---|---|---|---|---|---|---|---|
| 1 VER | Verbal intelligence by means of verbal analogies | 990 | 32 | 2 | 0 | — |
|
| 2 BAL | Intelligence by balance scale problem-solving | 484 | 25 | 2 | 0 | — |
|
| 3 CRY | Tendency to cry | 705 | 23 | 2 | 0 | — |
|
| 4 IND | Inductive reasoning | 484 | 43 | 2 | 1.24 | — |
|
| 5 RAK | Word comprehension | 1641 | 60 | 2 | 0 | — |
|
| 6 TRA | Transitive reasoning | 425 | 12 | 2 | 0 | — |
|
| 7 COP | Strategies for coping with industrial malodor | 828 | 17 | 4 | 0 | — |
|
| 8 WIL | Willingness to participate in labor union action | 496 | 24 | 5 | 0 | — |
|
| 9 SEN | Sensation seeking tendency | 441 | 13 | 7 | 0 | — |
|
| 10 DS14 | Type D personality | 541 | 14 | 5 | 0.13 | 1 - 3 |
|
| 1 - 3 - 4 | |||||||
| 11 TMA | Taylor Manifext Anxiety Scale | 5,410 | 50 | 2 | 0.97 | 9 -12 -18 - 20 - 29 |
|
| 32 - 38 - 50 | |||||||
| 12 LON | Loneliness | 7,440 | 11 | 3 | 0.58 | 1 - 4 - 7 - 8 -11 |
|
| 13 SAT | Satisfaction with life | 7,423 | 4 | 5 | 0.43 | — |
|
|
| |||||||
| 14 SES | Rosenberg Self-Esteem Scale | 47,974 | 10 | 4 | 0.43 | 3 - 5 - 8 - 9 -10 |
|
| 15 ACL | Personality traits | 433 | 218 | 6 | 0 | — |
|
| 16 HEX | HEXACO Personality Inventory | 22,786 | 240 | 8 | <0.01 | — |
|
Overview of the Empirical-Data Sets Arranged by Number of Items and Number of Item Scores.
| No. of items | Maximum performance | Typical behavior | ||
|---|---|---|---|---|
| No. of Item Scores | No. of Item Scores | |||
| 2 | >2 | 2 | >2 | |
| ≤10 | SAT SES ACL HEX | |||
| 10 < | TRA | COP SEN DS14 LON | ||
| ≥20 | VER BAL IND RAK | CRY TMA | WIL | |
Note. See the Appendix for the descriptions of the data sets.
Figure 1.Scatter plots for the three data clusters comparing the item-score reliability estimates for methods MS, , and CA.
Note. id. coeff. = identity coefficient; cor = correlation between two method estimates. See the Appendix for a description of the data sets.
Figure 2.Scatter plots for the three data clusters comparing the item-score reliability methods with the item-rest correlation (IR-corr.).
Note. cor = correlation between two method estimates. See the Appendix for a description of the data sets.
Figure 3.Scatter plots for the three data clusters comparing the item-score reliability methods with the item-factor loading (FL).
Note. cor = correlation between two method estimates. See the Appendix for a description of the data sets.
Figure 4.Scatter plots for the three data clusters comparing the item-score reliability methods with the coefficient (-coeff.).
Note. cor = correlation between two method estimates. See the Appendix for a description of the data sets.
Figure 5.Scatter plots for the three data clusters comparing the item-score reliability methods with the discrimination parameter (DiscrPar).
Note. cor = correlation between two method estimates. See the Appendix for a description of the data sets.
Figure 6Scatter plots for the three data clusters comparing the item-rest correlation (IR-corr.), item-factor loading (FL), the coefficient (-coeff.), and the discrimination parameter (DiscrPar).
Note. cor = correlation between two method estimates. See the Appendix for a description of the data sets.
Estimates of the Three Item-Score Reliability Methods Based on the Cutoff Values of the Other Four Item Indices obtained Using a Bivariate Regression Analysis.
| Method MS | Method | Method CA | |
|---|---|---|---|
| Item-rest correlation | .20 | .24 | .17 |
| Item-factor loading | .18 | .20 | .15 |
|
| .28 | .33 | .28 |
| Item discrimination | .22 | .25 | .20 |