| Literature DB >> 25850617 |
Pieter C Schoonees1, Michel van de Velden2, Patrick J F Groenen2.
Abstract
Dual scaling (DS) is a multivariate exploratory method equivalent to correspondence analysis when analysing contingency tables. However, for the analysis of rating data, different proposals appear in the DS and correspondence analysis literature. It is shown here that a peculiarity of the DS method can be exploited to detect differences in response styles. Response styles occur when respondents use rating scales differently for reasons not related to the questions, often biasing results. A spline-based constrained version of DS is devised which can detect the presence of four prominent types of response styles, and is extended to allow for multiple response styles. An alternating nonnegative least squares algorithm is devised for estimating the parameters. The new method is appraised both by simulation studies and an empirical application.Entities:
Keywords: -means; correspondence analysis; dual scaling; nonnegative least squares; response style; splines
Mesh:
Year: 2015 PMID: 25850617 PMCID: PMC4644217 DOI: 10.1007/s11336-015-9458-9
Source DB: PubMed Journal: Psychometrika ISSN: 0033-3123 Impact factor: 2.500
Fig. 1Examples of (inverse) response style functions mapping the true item content scale (vertical axis) into the observed measurement scale (horizontal axis).
Curvature properties of the four response styles.
| Response style | Lower curvature | Upper curvature |
|---|---|---|
| No response style | None | None |
| Acquiescence | Convex | Convex |
| Disacquiescence | Concave | Concave |
| Extreme responding | Concave | Convex |
| Midpoint responding | Convex | Concave |
Fig. 2The three I-spline basis functions for quadratic monotone splines with a single interior knot .
Fig. 3Classifying response styles graphically using the curvature properties of monotone quadratic splines.
Fig. 4Response styles used in the simulation study. Each curve represents a different style.
Average adjusted Rand index for 50 simulations at the different parameter settings.
|
|
| |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |||||||||||||
| RS% |
| 20 | 30 | 10 | 20 | 30 | 10 | 20 | 30 | 10 | 20 | 30 | 10 | 20 | 30 | 10 | 20 | 30 |
|
| ||||||||||||||||||
| 20 | 0.28 | 0.40 | 0.61 | 0.30 | 0.42 | 0.62 | 0.29 | 0.40 | 0.62 | 0.31 | 0.48 | 0.74 | 0.29 | 0.48 | 0.80 | 0.30 | 0.48 | 0.80 |
| 50 | 0.59 | 0.80 | 0.90 | 0.57 | 0.80 | 0.91 | 0.58 | 0.80 | 0.91 | 0.62 | 0.85 | 0.93 | 0.64 | 0.86 | 0.94 | 0.62 | 0.85 | 0.94 |
| 80 | 0.73 | 0.90 | 0.93 | 0.72 | 0.89 | 0.95 | 0.75 | 0.89 | 0.95 | 0.75 | 0.91 | 0.96 | 0.76 | 0.90 | 0.96 | 0.76 | 0.91 | 0.96 |
|
| ||||||||||||||||||
| 20 | 0.16 | 0.22 | 0.33 | 0.16 | 0.22 | 0.34 | 0.16 | 0.21 | 0.34 | 0.17 | 0.24 | 0.35 | 0.17 | 0.25 | 0.36 | 0.18 | 0.25 | 0.36 |
| 50 | 0.42 | 0.65 | 0.82 | 0.42 | 0.65 | 0.81 | 0.42 | 0.65 | 0.82 | 0.44 | 0.67 | 0.86 | 0.44 | 0.66 | 0.84 | 0.44 | 0.66 | 0.85 |
| 80 | 0.70 | 0.85 | 0.93 | 0.70 | 0.86 | 0.93 | 0.71 | 0.86 | 0.93 | 0.73 | 0.88 | 0.94 | 0.73 | 0.88 | 0.95 | 0.73 | 0.88 | 0.95 |
Average hit rates for 50 simulations at the different parameter settings.
|
|
| |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |||||||||||||
| RS% |
| 20 | 30 | 10 | 20 | 30 | 10 | 20 | 30 | 10 | 20 | 30 | 10 | 20 | 30 | 10 | 20 | 30 |
|
| ||||||||||||||||||
| 20 | 0.66 | 0.76 | 0.87 | 0.67 | 0.77 | 0.87 | 0.67 | 0.76 | 0.88 | 0.69 | 0.81 | 0.92 | 0.67 | 0.81 | 0.94 | 0.68 | 0.81 | 0.94 |
| 50 | 0.84 | 0.93 | 0.97 | 0.83 | 0.93 | 0.97 | 0.84 | 0.93 | 0.97 | 0.85 | 0.95 | 0.98 | 0.86 | 0.95 | 0.98 | 0.86 | 0.95 | 0.98 |
| 80 | 0.87 | 0.96 | 0.97 | 0.87 | 0.95 | 0.98 | 0.89 | 0.95 | 0.98 | 0.88 | 0.96 | 0.98 | 0.89 | 0.96 | 0.99 | 0.89 | 0.96 | 0.98 |
|
| ||||||||||||||||||
| 20 | 0.50 | 0.56 | 0.68 | 0.50 | 0.57 | 0.70 | 0.49 | 0.56 | 0.69 | 0.51 | 0.60 | 0.70 | 0.50 | 0.61 | 0.71 | 0.52 | 0.61 | 0.72 |
| 50 | 0.72 | 0.86 | 0.93 | 0.71 | 0.86 | 0.93 | 0.71 | 0.86 | 0.93 | 0.73 | 0.87 | 0.95 | 0.74 | 0.87 | 0.94 | 0.74 | 0.87 | 0.94 |
| 80 | 0.84 | 0.93 | 0.97 | 0.84 | 0.94 | 0.97 | 0.85 | 0.94 | 0.97 | 0.86 | 0.95 | 0.98 | 0.86 | 0.95 | 0.98 | 0.86 | 0.95 | 0.98 |
Fig. 5The effect of response styles on the underlying uncorrelated objects: estimated Pearson correlations before and after contamination, as well as after cleaning the data. The number of rating categories is for (a)–(c) and for (d)–(f), with items in all cases.
Fig. 6An example of the correlation structure imposed by the Clayton copula’s, in terms of Kendall’s .
Average proportional improvement in the RMSE of the cleaned over the contaminated data.
|
|
| |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |||||||||||||
| RS% |
| 0.75 | 1.0 | 0.5 | 0.75 | 1.0 | 0.5 | 0.75 | 1.0 | 0.5 | 0.75 | 1.0 | 0.5 | 0.75 | 1.0 | 0.5 | 0.75 | 1.0 |
|
| ||||||||||||||||||
| 20 | 0.08 | 0.09 | 0.33 | 0.05 | 0.03 | 0.35 |
| 0.01 | 0.36 | 0.67 | 0.71 | 0.45 | 0.63 | 0.77 | 0.44 | 0.69 | 0.74 | 0.48 |
| 50 |
|
|
|
|
|
|
|
|
| 0.64 | 0.70 | 0.83 | 0.70 | 0.70 | 0.86 | 0.64 | 0.69 | 0.87 |
| 80 |
|
|
|
|
|
|
|
|
| 0.60 | 0.65 | 0.81 | 0.64 | 0.66 | 0.79 | 0.61 | 0.66 | 0.8 |
|
| ||||||||||||||||||
| 20 | 0.09 | 0.19 | 0.50 | 0.14 | 0.19 | 0.55 | 0.14 | 0.15 | 0.54 | 0.75 | 0.85 | 0.47 | 0.70 | 0.82 | 0.48 | 0.70 | 0.79 | 0.49 |
| 50 | 0.12 | 0.15 | 0.18 | 0.12 | 0.14 | 0.21 | 0.13 | 0.14 | 0.26 | 0.71 | 0.75 | 0.93 | 0.70 | 0.76 | 0.94 | 0.70 | 0.76 | 0.92 |
| 80 | 0.10 | 0.12 | 0.07 | 0.07 | 0.11 | 0.12 | 0.08 | 0.11 | 0.10 | 0.70 | 0.72 | 0.85 | 0.68 | 0.72 | 0.85 | 0.68 | 0.72 | 0.85 |
A two-sample Wilcoxon test for no difference in RMSE against the alternative hypothesis that the cleaned data significantly reduces the RMSE shows significant improvements () for all tests except those shown in italic print.
Average proportional improvement in the RMSE when comparing the principal component loadings between the cleaned and contaminated data.
|
|
| |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |||||||||||||
| r |
| 20 | 30 | 10 | 20 | 30 | 10 | 20 | 30 | 10 | 20 | 30 | 10 | 20 | 30 | 10 | 20 | 30 |
|
| ||||||||||||||||||
| 2 | 0.13 |
|
| 0.11 |
|
| 0.13 |
|
| 0.61 | 0.59 | 0.42 | 0.60 | 0.63 | 0.44 | 0.60 | 0.62 | 0.43 |
| 3 | 0.06 |
|
| 0.08 |
|
| 0.07 |
|
| 0.54 | 0.41 | 0.17 | 0.59 | 0.46 | 0.18 | 0.60 | 0.46 | 0.20 |
| 4 | 0.04 |
|
| 0.06 |
|
| 0.04 |
|
| 0.50 | 0.27 | 0.09 | 0.50 | 0.32 | 0.11 | 0.53 | 0.31 | 0.09 |
|
| ||||||||||||||||||
| 2 | 0.10 |
|
| 0.13 | 0.00 |
| 0.10 | 0.00 |
| 0.64 | 0.50 | 0.20 | 0.73 | 0.54 | 0.21 | 0.66 | 0.53 | 0.20 |
| 3 | 0.08 | 0.01 | 0.00 | 0.08 | 0.02 | 0.00 | 0.09 | 0.01 |
| 0.61 | 0.30 | 0.11 | 0.63 | 0.28 | 0.10 | 0.66 | 0.32 | 0.08 |
| 4 | 0.06 | 0.03 | 0.01 | 0.07 | 0.03 | 0.02 | 0.08 | 0.03 | 0.01 | 0.55 | 0.19 | 0.10 | 0.60 | 0.24 | 0.10 | 0.61 | 0.21 | 0.09 |
Fig. 7Scree plot for the sensory data.
Fig. 8The estimated response mappings for (top left) to 8 (bottom right) groups, respectively. The area of the bubbles are proportional to how often that particular rating is used. The group sizes are also shown in a legend. Groups are labelled sequentially; the legend should be read vertically and then horizontally.
Fig. 9Relative aggregate frequencies of rating scale use in the identified groups when .
The Kullback–Leibler divergence between the groups when , based on the rating scale use per group.
| Group | I | II | III | IV | V |
|---|---|---|---|---|---|
| I | – | 0.158 | 0.009 | 0.187 | 0.234 |
| II | 0.161 | – | 0.138 | 0.699 | 0.701 |
| III | 0.008 | 0.134 | – | 0.224 | 0.297 |
| IV | 0.166 | 0.606 | 0.202 | – | 0.053 |
| V | 0.231 | 0.680 | 0.317 | 0.065 | – |
The distributions of the groups in the rows are treated as the respective reference distributions, .
Fig. 10a–d Relative frequencies of rating scale use for the chosen solution ; and e–h Variability of rating scale use within these groups, with each line representing a single individual.
Fig. 11a Optimal scores assigned to the response style groups, from rating 1 (left) to rating 9 (right). b Curvature plot similar to Figure 3 for the four groups, with the axes now transformed to obtain a more symmetrical plot. The ellipse in the centre is an approximate 95 % confidence ellipse for no response style.
Fig. 12Optimal scores for each of the seven questions, separated by product and with similar items depicted by the same colours.