| Literature DB >> 29312033 |
Abstract
Researchers and practitioners often use standardized vocabulary tests such as the Peabody Picture Vocabulary Test-4 (PPVT-4; Dunn and Dunn, 2007) and its companion, the Expressive Vocabulary Test-2 (EVT-2; Williams, 2007), to assess English vocabulary skills as an indicator of children's school readiness. Despite their psychometric excellence in the norm sample, issues arise when standardized vocabulary tests are used to asses children from culturally, linguistically and ethnically diverse backgrounds (e.g., Spanish-speaking English language learners) or delayed in some manner. One of the biggest challenges is establishing the appropriateness of these measures with non-English or non-standard English speaking children as often they score one to two standard deviations below expected levels (e.g., Lonigan et al., 2013). This study re-examines the issues in analyzing the PPVT-4 and EVT-2 scores in a sample of 4-to-5-year-old low SES Hispanic preschool children who were part of a larger randomized clinical trial on the effects of a supplemental English shared-reading vocabulary curriculum (Pollard-Durodola et al., 2016). It was found that data exhibited strong floor effects and the presence of floor effects made it difficult to differentiate the invention group and the control group on their vocabulary growth in the intervention. A simulation study is then presented under the multilevel structural equation modeling (MSEM) framework and results revealed that in regular multilevel data analysis, ignoring floor effects in the outcome variables led to biased results in parameter estimates, standard error estimates, and significance tests. Our findings suggest caution in analyzing and interpreting scores of ethnically and culturally diverse children on standardized vocabulary tests (e.g., floor effects). It is recommended appropriate analytical methods that take into account floor effects in outcome variables should be considered.Entities:
Keywords: ethnically and culturally diverse children; floor effects; intervention effects; standardized vocabulary tests (the PPVT-4; the EVT-2)
Year: 2017 PMID: 29312033 PMCID: PMC5732956 DOI: 10.3389/fpsyg.2017.02146
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Distribution of scores and corresponding descriptive for all of the participants.
| Extremely high | 130+ | 0 | 0 | 0 | 0 |
| Moderately high | 115–129 | 0 | 0 | 0 | 0 |
| High average | 100–114 | 0 | 0 | 2 | 1 |
| Low average | 85–99 | 15 | 6 | 17 | 7 |
| Moderately low | 70–84 | 84 | 34 | 74 | 31 |
| Extremely low | ≤69 | 150 | 60 | 145 | 61 |
Figure 1Distribution of standardized vocabulary test scores of a sample of low SES Hispanic preschool children.
Pretest and posttest scores for intervention and comparison groups.
| 249 | 136 | 113 | 0.05, | 234 | 129 | 105 | 0.53, | |
| 63.81 | 63.86 | 63.75 | 72.70 | 73.16 | 72.13 | |||
| 15.42 | 14.76 | 16.24 | 14.63 | 14.12 | 15.29 | |||
| 238 | 132 | 106 | 0.17, | 232 | 127 | 105 | −0.30, | |
| 55.63 | 55.87 | 55.33 | 64.08 | 63.65 | 64.60 | |||
| 24.01 | 23.59 | 24.64 | 24.01 | 25.09 | 22.75 | |||
PPVT-4, Peabody Picture Vocabulary Test (4th ed.); EVT-2, Expressive Vocabulary Test (2nd ed.).
Results of hierarchical linear model to the low SES Hispanic Latino preschool children with or without modeling floor effects.
| Intercept | 35.97 | 35.97 | 36.93 | 1.62 | 1.62 | 0.51 |
| ( | (13.46) | (14.54) | (15.42) | (3.10) | (3.13) | (2.84) |
| Level-1 Pretest (γ10) | 0.98 | 0.98 | 0.98 | 0.83 | 0.83 | 0.82 |
| ( | (0.01) | (0.01) | (0.01) | (0.03) | (0.03) | (0.04) |
| Level-1 Age (γ20) | −0.01 | −0.01 | −0.01 | −0.002 | −0.002 | −0.01 |
| ( | (0.02) | (0.02) | (0.02) | (0.05) | (0.04) | (0.04) |
| Level-1 Gender | −0.004 | −0.004 | −0.004 | −0.45 | −0.05 | −0.03 |
| ( | (0.02) | (0.02) | (0.02) | (0.04) | (0.04) | (0.04) |
| Level-1 Bilingual | −0.04 | −0.04 | −0.04 | −0.05 | −0.05 | −0.05 |
| ( | (0.03) | (0.02) | (0.03) | (0.06) | (0.06) | (0.06) |
| Level-1 Ethnicity | 0.003 | 0.003 | 0.003 | −0.02 | −0.02 | −0.02 |
| ( | (0.02) | (0.01) | (0.01) | (0.04) | (0.04) | (0.05) |
| Level-1 Attendance (γ60) | 0.02 | 0.02 | 0.02 | 0.01 | 0.01 | 0.01 |
| ( | (0.02) | (0.02) | (0.02) | (0.04) | (0.04) | (0.05) |
| Level-1 | 0.000 | 0.000 | 0.000 | 0.05 | 0.05 | 0.04 |
| ( | (0.02) | (0.02) | (0.02) | (0.05) | (0.04) | (0.05) |
| Level-1 | 0.001 | 0.001 | 0.002 | −0.01 | −0.01 | −0.01 |
| ( | (0.02) | (0.02) | (0.02) | (0.05) | (0.04) | (0.05) |
| Level-2 School district | −0.70 | −0.70 | −0.73 | −0.58 | −0.58 | −0.54 |
| ( | (0.46) | (0.37) | (0.39) | (0.39) | (0.25) | (0.23) |
| Level-2 Teacher's primary language | −0.59 | −0.59 | −0.60 | −0.35 | −0.35 | −0.31 |
| ( | (0.28) | (0.23) | (0.24) | (0.22) | (0.23) | (0.22) |
| Level-2 Years of teaching (γ04) | −0.81 | −0.81 | −0.81 | −0.44 | −0.44 | −0.39 |
| ( | (0.40) | (0.40) | (0.41) | (0.32) | (0.40) | (0.40) |
| Level-2 Years of teaching in PreK (γ05) | −0.22 | −0.22 | −0.23 | −0.52 | −0.52 | −0.57 |
| ( | (0.34) | (0.41) | (0.42) | (0.29) | (0.31) | (0.33) |
| Level-2 University reading credits (γ06) | 1.11 | 1.11 | 1.14 | 0.86 | 0.86 | 0.87 |
| ( | (0.40) | (0.37) | (0.39) | (0.30) | (0.34) | (0.32) |
| Level-2 Professional development (γ07) | 0.03 | 0.03 | 0.06 | 0.63 | 0.63 | 0.57 |
| ( | (0.45) | (0.39) | (0.40) | (0.37) | (0.27) | (0.26) |
| Level-1 Residual Variance (σ2) | 0.06 | 0.06 | 0.06 | 0.29 | 0.29 | 0.32 |
| ( | (0.01) | (0.01) | (0.01) | (0.04) | (0.05) | (0.05) |
| Level-2 Residual Variance (τ 00) | 0.23 | 0.23 | 0.19 | 0.46 | 0.46 | 0.48 |
| ( | (0.39) | (0.34) | (0.37) | (0.25) | (0.20) | (0.20) |
PPVT-4, Peabody Picture Vocabulary Test (4th ed.); EVT-2, Expressive Vocabulary Test (2nd ed.).
The significance level is set at p < 0.05 (two-tailed).
The reference group for gender is female (coded 0).
The reference group for bilingual is non-bilingual (coded 0).
The reference group for ethnicity is Native American (coded 0).
The reference group for school district is school district A (coded 0).
The reference group for teachers' primary language is English (coded 0).
Bold and italic values indicated a contrast of significant effects to non-significant effects when floor effects were addressed regarding the intervention effects which is the target research interest in the empirical study.
Floor proportions with different floor thresholds.
| 0 | No floor | 0.86 (1.60) | (−4.62, 5.60) |
| 5 | −1.50 | 1.04 (1.47) | (−1.50, 5.61) |
| 10 | −0.95 | 1.08 (1.41) | (−0.95, 5.61) |
| 15 | −0.60 | 1.12 (1.35) | (−0.60, 5.61) |
| 20 | −0.25 | 1.18 (1.28) | (−0.25, 5.61) |
| 25 | 0.03 | 1.25 (1.21) | (0.03, 5.61) |
Figure 2Distribution of six simulated data sets showing different proportions of floor data (from 0 to 25% floor data).
Relative bias in parameter estimates comparing three comparative methods.
| 0 | −0.00 | −0.00 | −0.00 | −0.01 | −0.01 | −0.01 |
| 5 | −0.05 | −0.05 | 0.00 | −0.05 | −0.05 | 0.00 |
| 10 | −0.10 | −0.10 | 0.00 | −0.10 | −0.10 | 0.00 |
| 15 | −0.15 | −0.15 | 0.00 | −0.14 | −0.14 | 0.00 |
| 20 | −0.20 | −0.20 | 0.00 | −0.20 | −0.20 | 0.00 |
| 25 | −0.26 | −0.26 | 0.00 | −0.26 | −0.26 | 0.00 |
Relative bias in standard error estimates comparing three comparative methods.
| 0 | 0.00 | −0.01 | −0.01 | −0.03 | −0.04 | −0.04 |
| 5 | −0.07 | −0.09 | −0.03 | −0.06 | −0.09 | −0.03 |
| 10 | −0.14 | −0.14 | −0.05 | −0.10 | −0.44 | 0.00 |
| 15 | −0.16 | −0.13 | −0.03 | −0.16 | −0.18 | 0.01 |
| 20 | −0.21 | −0.15 | −0.02 | −0.21 | −0.23 | −0.01 |
| 25 | −0.27 | −0.17 | −0.02 | −0.27 | −0.27 | −0.01 |