Comparing Traditional and IRT Scoring of Forced-Choice Tests.

Pedro M Hontangas¹, Jimmy de la Torre², Vicente Ponsoda³, Iwin Leenen⁴, Daniel Morillo³, Francisco J Abad³.

Abstract

This article explores how traditional scores obtained from different forced-choice (FC) formats relate to their true scores and item response theory (IRT) estimates. Three FC formats are considered. From a block of items, respondents are asked to (a) pick the item that describes them most (PICK), (b) choose the two items that describe them the most and the least (MOLE), or (c) rank all the items in order of how well they describe the respondent (RANK). The multi-unidimensional pairwise-preference (MUPP) model, extended to more than two items per block and to different FC formats, is applied to obtain the responses to each item block. Traditional and IRT (i.e., expected a posteriori) scores are computed from each data set and compared. The aim is to clarify the conditions under which simpler traditional scoring procedures for FC formats may be used in place of the more appropriate IRT estimates for the purpose of inter-individual comparisons. Six independent variables are considered: response format, number of items per block, correlation between the dimensions, item discrimination level, sign-heterogeneity of the item difficulty parameters, and variability of the item difficulty parameters. Results show that the RANK response format outperforms the other formats for both the IRT estimates and traditional scores, although it is only slightly better than the MOLE format. The highest correlations between true and traditional scores are found when the test has a large number of blocks, the dimensions assessed are independent, items have high discrimination and highly dispersed location parameters, and the test contains blocks formed by positive and negative items.
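To make the three response formats concrete, the following is a minimal Python sketch of how PICK, MOLE, and RANK responses relate to one another for a single block, and how a simple sum score per dimension could be formed from each. The point values (1/0 for PICK, 2/1/0 for MOLE, rank position for RANK), the function names, and the item-to-dimension mapping are illustrative assumptions, not the scoring rules used in the article, and reverse-keying of negative items is omitted.

```python
from collections import defaultdict


def derive_responses(ranking):
    """Derive the three FC responses from one full ranking of a block.

    `ranking` lists item ids from most to least descriptive.
    """
    return {
        "PICK": ranking[0],                  # only the most descriptive item
        "MOLE": (ranking[0], ranking[-1]),   # most and least descriptive items
        "RANK": tuple(ranking),              # the complete ordering
    }


def traditional_scores(ranking, item_dim, fmt="RANK"):
    """Sum per-dimension points for one block (assumed point values)."""
    n = len(ranking)
    if fmt == "RANK":
        # Top-ranked item gets n - 1 points, bottom-ranked item gets 0.
        points = {item: n - 1 - pos for pos, item in enumerate(ranking)}
    elif fmt == "MOLE":
        # "Most" = 2, "least" = 0, every unselected item = 1.
        points = {item: 1 for item in ranking}
        points[ranking[0]] = 2
        points[ranking[-1]] = 0
    elif fmt == "PICK":
        # Picked item = 1, all others = 0.
        points = {item: 0 for item in ranking}
        points[ranking[0]] = 1
    else:
        raise ValueError(f"unknown format: {fmt}")

    scores = defaultdict(int)
    for item, value in points.items():
        scores[item_dim[item]] += value
    return dict(scores)


# Hypothetical block of four items, each measuring a different dimension.
item_dim = {"A": "d1", "B": "d2", "C": "d3", "D": "d4"}
ranking = ["B", "D", "C", "A"]

for fmt in ("PICK", "MOLE", "RANK"):
    print(fmt, derive_responses(ranking)[fmt],
          traditional_scores(ranking, item_dim, fmt))
```

Under these assumed point values, the RANK scores of a block always sum to n(n - 1)/2 whatever the respondent answers, which is the ipsative property named in the keywords. For blocks of three items, the MOLE response already determines the full ranking, so the two formats differ only for larger blocks.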

Keywords: EAP; GGUM; MUPP; faking; forced choice; ipsative data; multi-unidimensional pairwise-preference; personality assessment; traditional scoring; unfolding model

Year: 2015 PMID: 29881030 PMCID: PMC5978493 DOI: 10.1177/0146621615585851

Source DB: PubMed Journal: Appl Psychol Meas ISSN: 0146-6216

4 in total

1. Validation of an ipsative personality measure (DISCUS).

Authors: M Martinussen; A M Richardsen; H W Vårum
Journal: Scand J Psychol Date: 2001-12

2. Forced-choice assessments of personality for selection: evaluating issues of normative assessment and faking resistance.

Authors: Eric D Heggestad; Morgan Morrison; Charlie L Reeve; Rodney A McCloy
Journal: J Appl Psychol Date: 2006-01

3. A critical review of the validity and rationale of the forced-choice technique.

Authors: R M W Travers
Journal: Psychol Bull Date: 1951-01 Impact factor: 17.737

4. How IRT can solve problems of ipsative data in forced-choice questionnaires.

Authors: Anna Brown; Alberto Maydeu-Olivares
Journal: Psychol Methods Date: 2012-11-12

10 in total

1. A Bayesian Random Block Item Response Theory Model for Forced-Choice Formats.

Authors: HyeSun Lee; Weldon Z Smith
Journal: Educ Psychol Meas Date: 2019-08-27 Impact factor: 2.821

2. Fit Indices for Measurement Invariance Tests in the Thurstonian IRT Model.

Authors: HyeSun Lee; Weldon Z Smith
Journal: Appl Psychol Meas Date: 2019-12-26

3. GGUM-RANK Statement and Person Parameter Estimation With Multidimensional Forced Choice Triplets.

Authors: Philseok Lee; Seang-Hwane Joo; Stephen Stark; Oleksandr S Chernyshenko
Journal: Appl Psychol Meas Date: 2018-04-23

4. Computerized Adaptive Testing for Ipsative Tests with Multidimensional Pairwise-Comparison Items: Algorithm Development and Applications.

Authors: Xue-Lan Qiu; Jimmy de la Torre; Sage Ro; Wen-Chung Wang
Journal: Appl Psychol Meas Date: 2022-04-14

5. On the Information Obtainable from Comparative Judgments.

6. Integration of the Forced-Choice Questionnaire and the Likert Scale: A Simulation Study.

7. Controlling for Response Biases in Self-Report Scales: Forced-Choice vs. Psychometric Modeling of Likert Items.

8. On the Statistical and Practical Limitations of Thurstonian IRT Models.

9. Modeling Faking in the Multidimensional Forced-Choice Format: The Faking Mixture Model.

10. Bayesian paired comparison with the bpcs package.