Literature DB >> 29881030

Comparing Traditional and IRT Scoring of Forced-Choice Tests.

Pedro M Hontangas1, Jimmy de la Torre2, Vicente Ponsoda3, Iwin Leenen4, Daniel Morillo3, Francisco J Abad3.   

Abstract

This article explores how traditional scores obtained from different forced-choice (FC) formats relate to their true scores and item response theory (IRT) estimates. Three FC formats are considered from a block of items, and respondents are asked to (a) pick the item that describes them most (PICK), (b) choose the two items that describe them the most and the least (MOLE), or (c) rank all the items in the order of their descriptiveness of the respondents (RANK). The multi-unidimensional pairwise-preference (MUPP) model, which is extended to more than two items per block and different FC formats, is applied to obtain the responses to each item block. Traditional and IRT (i.e., expected a posteriori) scores are computed from each data set and compared. The aim is to clarify the conditions under which simpler traditional scoring procedures for FC formats may be used in place of the more appropriate IRT estimates for the purpose of inter-individual comparisons. Six independent variables are considered: response format, number of items per block, correlation between the dimensions, item discrimination level, and sign-heterogeneity and variability of item difficulty parameters. Results show that the RANK response format outperforms the other formats for both the IRT estimates and traditional scores, although it is only slightly better than the MOLE format. The highest correlations between true and traditional scores are found when the test has a large number of blocks, dimensions assessed are independent, items have high discrimination and highly dispersed location parameters, and the test contains blocks formed by positive and negative items.

Entities:  

Keywords:  EAP; GGUM; MUPP; faking; forced choice; ipsative data; multi-unidimensional pairwise-preference; personality assessment; traditional scoring; unfolding model

Year:  2015        PMID: 29881030      PMCID: PMC5978493          DOI: 10.1177/0146621615585851

Source DB:  PubMed          Journal:  Appl Psychol Meas        ISSN: 0146-6216


  4 in total

1.  Validation of an ipsative personality measure (DISCUS).

Authors:  M Martinussen; A M Richardsen; H W Vårum
Journal:  Scand J Psychol       Date:  2001-12

2.  Forced-choice assessments of personality for selection: evaluating issues of normative assessment and faking resistance.

Authors:  Eric D Heggestad; Morgan Morrison; Charlie L Reeve; Rodney A McCloy
Journal:  J Appl Psychol       Date:  2006-01

3.  A critical review of the validity and rationale of the forced-choice technique.

Authors:  R M W TRAVERS
Journal:  Psychol Bull       Date:  1951-01       Impact factor: 17.737

4.  How IRT can solve problems of ipsative data in forced-choice questionnaires.

Authors:  Anna Brown; Alberto Maydeu-Olivares
Journal:  Psychol Methods       Date:  2012-11-12
  4 in total
  10 in total

1.  A Bayesian Random Block Item Response Theory Model for Forced-Choice Formats.

Authors:  HyeSun Lee; Weldon Z Smith
Journal:  Educ Psychol Meas       Date:  2019-08-27       Impact factor: 2.821

2.  Fit Indices for Measurement Invariance Tests in the Thurstonian IRT Model.

Authors:  HyeSun Lee; Weldon Z Smith
Journal:  Appl Psychol Meas       Date:  2019-12-26

3.  GGUM-RANK Statement and Person Parameter Estimation With Multidimensional Forced Choice Triplets.

Authors:  Philseok Lee; Seang-Hwane Joo; Stephen Stark; Oleksandr S Chernyshenko
Journal:  Appl Psychol Meas       Date:  2018-04-23

4.  Computerized Adaptive Testing for Ipsative Tests with Multidimensional Pairwise-Comparison Items: Algorithm Development and Applications.

Authors:  Xue-Lan Qiu; Jimmy de la Torre; Sage Ro; Wen-Chung Wang
Journal:  Appl Psychol Meas       Date:  2022-04-14

5.  On the Information Obtainable from Comparative Judgments.

Authors:  Paul-Christian Bürkner
Journal:  Psychometrika       Date:  2022-02-08       Impact factor: 2.500

6.  Integration of the Forced-Choice Questionnaire and the Likert Scale: A Simulation Study.

Authors:  Yue Xiao; Hongyun Liu; Hui Li
Journal:  Front Psychol       Date:  2017-05-18

7.  Controlling for Response Biases in Self-Report Scales: Forced-Choice vs. Psychometric Modeling of Likert Items.

Authors:  Rodrigo Schames Kreitchmann; Francisco J Abad; Vicente Ponsoda; Maria Dolores Nieto; Daniel Morillo
Journal:  Front Psychol       Date:  2019-10-15

8.  On the Statistical and Practical Limitations of Thurstonian IRT Models.

Authors:  Paul-Christian Bürkner; Niklas Schulte; Heinz Holling
Journal:  Educ Psychol Meas       Date:  2019-02-22       Impact factor: 2.821

9.  Modeling Faking in the Multidimensional Forced-Choice Format: The Faking Mixture Model.

Authors:  Susanne Frick
Journal:  Psychometrika       Date:  2021-12-20       Impact factor: 2.290

10.  Bayesian paired comparison with the bpcs package.

Authors:  David Issa Mattos; Érika Martins Silva Ramos
Journal:  Behav Res Methods       Date:  2021-11-30
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.