| Literature DB >> 29339996 |
Catarina Leandro1,2, Lurdes Ávila-Carvalho3, Elena Sierra-Palmeiro2, Marta Bobo-Arce2.
Abstract
This study aimed to analyse the quality of difficulty judging in rhythmic gymnastics, at different levels of performance. The sample consisted of 1152 difficulty scores concerning 288 individual routines, performed in the World Championships in 2013. The data were analysed using the mean absolute judge deviation from the final difficulty score, a Cronbach's alpha coefficient and intra-class correlations, for consistency and reliability assessment. For validity assessment, mean deviations of judges' difficulty scores, the Kendall's coefficient of concordance W and ANOVA eta-squared values were calculated. Overall, the results in terms of consistency (Cronbach's alpha mostly above 0.90) and reliability (intra-class correlations for single and average measures above 0.70 and 0.90, respectively) were satisfactory, in the first and third parts of the ranking on all apparatus. The medium level gymnasts, those in the second part of the ranking, had inferior reliability indices and highest score dispersion. In this part, the minimum of corrected item-total correlation of individual judges was 0.55, with most values well below, and the matrix for between-judge correlations identified remarkable inferior correlations. These findings suggest that the quality of difficulty judging in rhythmic gymnastics may be compromised at certain levels of performance. In future, special attention should be paid to the judging analysis of the medium level gymnasts, as well as the Code of Points applicability at this level.Entities:
Keywords: bias; evaluation; reliability; rhythmic gymnastics; validity
Year: 2017 PMID: 29339996 PMCID: PMC5765796 DOI: 10.1515/hukin-2017-0099
Source DB: PubMed Journal: J Hum Kinet ISSN: 1640-5544 Impact factor: 2.193
Statistics of D scores and the performance of individual judges
| Apparatus | Mean ± SD | Dev. max. | Ab. dev. max. | R min. | C | |
|---|---|---|---|---|---|---|
| 1st part of the ranking | Hoop | 8.25 ± 0.53 | -0.15 | 0.29 | 0.71 | 0.91 |
| Ball | 8.34 ± 0.49 | -0.03 | 0.20 | 0.76 | 0.91 | |
| Ribbon | 7.98 ± 0.60 | -0.12 | 0.26 | 0.75 | 0.92 | |
| Clubs | 8.21 ± 0.55 | -0.20 | 0.29 | 0.76 | 0.91 | |
| 2nd part of the ranking | Hoop | 6.61 ± 0.45 | -0.13 | 0.41 | 0.16 | 0.65 |
| Ball | 6.85 ± 0.60 | 0.26 | 0.41 | 0.47 | 0.79 | |
| Ribbon | 6.52 ± 0.63 | -0.08 | 0.38 | 0.55 | 0.77 | |
| Clubs | 6.68 ± 0.48 | -0.16 | 0.42 | 0.24 | 0.59 | |
| 3rd part of the ranking | Hoop | 4.58 ± 1.31 | 0.38 | 0.56 | 0.82 | 0.94 |
| Ball | 4.64 ± 1.33 | -0.22 | 0.36 | 0.89 | 0.96 | |
| Ribbon | 4.36 ± 1.38 | -0.10 | 0.32 | 0.92 | 0.97 | |
| Clubs | 4.56 ± 1.39 | 0.33 | 0.44 | 0.88 | 0.95 | |
Minimum (min) and maximum (max) values, mean and standard deviation (SD), Dev. max: maximal judge average deviation from D score, Ab. Dev. Max.: maximum of average absolute deviation from D score; R min: minimum of corrected item-total correlation of individual judges; Cα: Cronbach’s alpha coefficient.
Figure 1The eta-squared (η2) values of repeated measures ANOVA of D-scores in all apparatuses clustered according to the position of the gymnast in the final ranking (1st part, 2nd part and 3rd part)
Correlation Matrix for between - judges correlation
| 1st part of the ranking | 2nd part of the ranking | 3rd part of the ranking | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Judge | 2 | 3 | 4 | 2 | 3 | 4 | 2 | 3 | 4 | |
| 1 | 0.72 | 0.76 | 0.85 | 0.42 | 0.15 | 0.73 | 0.86 | 0.87 | 0.75 | |
| Hoop | 2 | 0.48 | 0.80 | 0.33 | 0.30 | 0.85 | 0.81 | |||
| 3 | 0.74 | -0.14 | 0.78 | |||||||
| 1 | 0.71 | 0.80 | 0.68 | 0.71 | 0.49 | 0.47 | 0.86 | 0.83 | 0.89 | |
| Ball | 2 | 0.74 | 0.79 | 0.34 | 0.55 | 0.89 | 0.79 | |||
| 3 | 0.61 | 0.38 | 0.86 | |||||||
| 1 | 0.64 | 0.79 | 0.69 | 0.38 | 0.44 | 0.40 | 0.90 | 0.91 | 0.93 | |
| Ribbon | 2 | 0.74 | 0.82 | 0.45 | 0.32 | 0.87 | 0.93 | |||
| 3 | 0.87 | 0.61 | 0.92 | |||||||
| 1 | 0.74 | 0.83 | 0.69 | 0.23 | 0.19 | 0.12 | 0.83 | 0.83 | 0.86 | |
| Clubs | 2 | 0.70 | 0.71 | 0.34 | 0.36 | 0.88 | 0.83 | |||
| 3 | 0.69 | 0.34 | 0.84 | |||||||
correlation is significant at the 0.01 level (2 - tailed);
correlation is significant at the 0.05 level (2 - tailed)
Overall measures of inter-judge reliability
| ICC Single | ICC Average | Kendall’s W | P (W) | ||
|---|---|---|---|---|---|
| Apparatus | |||||
| 1st part of the ranking | Hoop | 0.70 | 0.90 | 0.133 | 0.023* |
| Ball | 0.73 | 0.91 | 0.013 | 0.821 | |
| Ribbon | 0.76 | 0.92 | 0.053 | 0.283 | |
| Clubs | 0.88 | 0.89 | 0.177 | 0.005* | |
| 2nd part of the ranking | Hoop | 0.33 | 0.66 | 0.021 | 0.671 |
| Ball | 0.46 | 0.77 | 0.109 | 0.049* | |
| Ribbon | 0.47 | 0.78 | 0.002 | 0.982 | |
| Clubs | 0.26 | 0.58 | 0.071 | 0.162 | |
| 3rd part of the ranking | Hoop | 0.80 | 0.94 | 0.112 | 0.045* |
| Ball | 0.84 | 0.95 | 0.088 | 0.096 | |
| Ribbon | 0.91 | 0.97 | 0.028 | 0.564 | |
| Clubs | 0.83 | 0.95 | 0.064 | 0.203 | |
ICC single (average): intra-class correlation for single (average) scores; p(w): p value of Kendall’s W