Scoring best-worst data in unbalanced many-item designs, with applications to crowdsourcing semantic judgments.

Geoff Hollis.

Abstract

Best-worst scaling is a judgment format in which participants are presented with a set of items and have to choose the superior and inferior items in the set. Best-worst scaling generates a large quantity of information per judgment because each judgment allows for inferences about the rank value of all unjudged items. This property of best-worst scaling makes it a promising judgment format for research in psychology and natural language processing concerned with estimating the semantic properties of tens of thousands of words. A variety of different scoring algorithms have been devised in the previous literature on best-worst scaling. However, due to problems of computational efficiency, these scoring algorithms cannot be applied efficiently to cases in which thousands of items need to be scored. New algorithms are presented here for converting responses from best-worst scaling into item scores for thousands of items (many-item scoring problems). These scoring algorithms are validated through simulation and empirical experiments, and considerations related to noise, the underlying distribution of true values, and trial design are identified that can affect the relative quality of the derived item scores. The newly introduced scoring algorithms consistently outperformed scoring algorithms used in the previous literature on scoring many-item best-worst data.
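To make the judgment format concrete, the following minimal sketch shows two things: how a single best-worst judgment implies an ordering for most item pairs in a trial, and how a simple counting score (times chosen best minus times chosen worst, divided by times shown) turns judgments into item scores. This counting score is a common baseline and is not one of the new algorithms the abstract refers to; the function names, the trial tuple layout, and the example words are all assumptions made for illustration.

```python
from collections import defaultdict


def implied_pairs(items, best, worst):
    """Return the (higher, lower) pairs implied by one best-worst judgment.

    The best item outranks every other item, and every other item
    outranks the worst item. For N items this gives 2N - 3 pairs.
    """
    pairs = {(best, other) for other in items if other != best}
    pairs |= {(other, worst) for other in items if other not in (best, worst)}
    return pairs


def best_worst_counts(trials):
    """Score each item as (times best - times worst) / times shown.

    Each trial is a hypothetical (items, best, worst) tuple.
    """
    best_count = defaultdict(int)
    worst_count = defaultdict(int)
    shown_count = defaultdict(int)
    for items, best, worst in trials:
        if best not in items or worst not in items or best == worst:
            raise ValueError("best and worst must be distinct items from the trial")
        for item in items:
            shown_count[item] += 1
        best_count[best] += 1
        worst_count[worst] += 1
    return {
        item: (best_count[item] - worst_count[item]) / shown_count[item]
        for item in shown_count
    }


# Made-up trials: which word is most and least positive in each set of four?
trials = [
    (("joy", "table", "grief", "walk"), "joy", "grief"),
    (("joy", "walk", "mud", "grief"), "joy", "grief"),
    (("table", "walk", "mud", "joy"), "joy", "mud"),
]
scores = best_worst_counts(trials)
pairs = implied_pairs(*trials[0])
```

In a four-item trial, one judgment fixes five of the six possible pairwise orderings; only the relation between the two unchosen items stays unknown. The counting score treats every trial independently, so it does not exploit information about which opponents an item faced.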

Keywords:  Best-worst scaling; Human judgment; Rank judgment; Semantics; Tournament scoring

Year:  2018        PMID: 28550657     DOI: 10.3758/s13428-017-0898-2

Source DB:  PubMed          Journal:  Behav Res Methods        ISSN: 1554-351X


  4 in total

1.  Governance of forest resource use in western Nepal: Current state and community preferences.

Authors:  Manoj Bhatta; Kerstin K Zander; Stephen T Garnett
Journal:  Ambio       Date:  2022-01-15       Impact factor: 5.129

2.  Specificity ratings for Italian data.

Authors:  Marianna Marcella Bolognesi; Tommaso Caselli
Journal:  Behav Res Methods       Date:  2022-09-26

3.  Best-worst scaling improves measurement of first impressions.

Authors:  Nichola Burton; Michael Burton; Dan Rigby; Clare A M Sutherland; Gillian Rhodes
Journal:  Cogn Res Princ Implic       Date:  2019-09-23

4.  Beyond Likert ratings: Improving the robustness of developmental research measurement using best-worst scaling.

Authors:  Nichola Burton; Michael Burton; Carmen Fisher; Patricia González Peña; Gillian Rhodes; Louise Ewing
Journal:  Behav Res Methods       Date:  2021-04-05
