| Literature DB >> 34748214 |
Ying Xu1, Joseph Aubele1, Valery Vigil1, Andres S Bustamante1, Young-Suk Kim1, Mark Warschauer1.
Abstract
Dialogic reading, when children are read a storybook and engaged in relevant conversation, is a powerful strategy for fostering language development. With the development of artificial intelligence, conversational agents can engage children in elements of dialogic reading. This study examined whether a conversational agent can improve children's story comprehension and engagement, as compared to an adult reading partner. Using a 2 (dialogic reading or non-dialogic reading) × 2 (agent or human) factorial design, a total of 117 three- to six-year-olds (50% Female, 37% White, 31% Asian, 21% multi-ethnic) were randomly assigned into one of the four conditions. Results revealed that a conversational agent can replicate the benefits of dialogic reading with a human partner by enhancing children's narrative-relevant vocalizations, reducing irrelevant vocalizations, and improving story comprehension.Entities:
Mesh:
Year: 2021 PMID: 34748214 PMCID: PMC9299009 DOI: 10.1111/cdev.13708
Source DB: PubMed Journal: Child Dev ISSN: 0009-3920
Background information by condition
| Full sample | Agent DR | Agent non‐DR | Human DR | Human non‐DR | ANOVA/ | |
|---|---|---|---|---|---|---|
| Age | 57.63 (9.53) | 59.97 (9.05) | 57.59 (10.41) | 58.29 (8.82) | 53.92 (9.42) |
|
| EOWPVT | 69.18 (17.22) | 70.58 (17.43) | 70.70 (19.71) | 66.71 (17.09) | 68.77 (14.82) |
|
| Predominant home language |
| |||||
| English | 78.63% | 75.76% | 85.19% | 80.65% | 73.08 | |
| Other | 21.37% | 24.24% | 14.81% | 19.35% | 26.92% | |
| Female | 49.57% | 57.58% | 48.15% | 48.39% | 42.31% |
|
| Race |
| |||||
| White | 36.75% | 33.33% | 44.44% | 32.26% | 38.46% | |
| Asian | 30.77% | 27.27% | 33.33% | 41.94% | 19.23% | |
| Hispanic | 6.84% | 12.12% | 3.70% | 0.00% | 11.54% | |
| Black | 0.85% | 0.00% | 0.00% | 3.23% | 0.00% | |
| Two or more | 21.37% | 24.24% | 11.11% | 22.58% | 26.92% | |
| Other | 1.71% | 0.00% | 3.70% | 0.00% | 3.85% | |
| Decline | 0.85% | 3.03% | 3.70% | 0.00% | 00.00% | |
| Regular conversational agent usage |
| |||||
| Yes | 43.59% | 45.45% | 40.74% | 48.39% | 38.46% | |
| No | 55.56% | 54.55% | 59.26% | 51.61% | 57.69% | |
| Decline | 0.85% | 0.00% | 0.00% | 0.00% | 3.85% | |
|
| 117 | 33 | 27 | 31 | 26 | |
Standard deviation in parentheses.
Abbreviations: DR, dialogic reading; EOWPVT, Expressive One Word Picture Vocabulary Test.
FIGURE 1Study procedure of Human‐dialogic reading (DR) condition (left) and Agent‐DR condition (right). Note: A child participant in the Human DR condition (left) and another participant in the Agent DR condition (right; note the agent device in the lower right corner)
Descriptive statistics of outcome measures for full sample
| Comp | Mem | Inf | Seq | Global | RV | IRV | PE |
|
| Range | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Comprehension | 1 | 10.4 | 4.39 | (0, 20) | |||||||
| Event memorization (Mem) | .83 | 1 | 3.07 | 2.12 | (0, 8) | ||||||
| Inference making (Inf) | .81 | .55 | 1 | 3.20 | 1.74 | (0, 6) | |||||
| Sequence understanding (Seq) | .82 | .48 | .51 | 1 | 4.16 | 2.14 | (0, 7) | ||||
| Global engagement | .18 | .07 | .20 | .18 | 1 | 3.04 | 0.19 | (2.32, 3.71) | |||
| Relevant vocalization (RV) | .23 | .18 | .22 | .18 | .46 | 1 | 0.11 | 0.09 | (0, 0.34) | ||
| Irrelevant vocalization (IRV) | −.21 | −.12 | −.24 | −.17 | −.21 | .10 | 1 | 0.02 | 0.04 | (0, 0.22) | |
| Positive expression (PE) | −.02 | −.07 | .03 | .00 | .53 | .41 | .05 | 1 | 0.11 | 0.17 | (0, 1) |
| Visual attention | .02 | .02 | .03 | −.01 | .16 | −.31 | −.27 | −.01 | 0.75 | 0.16 | (0.27, 0.98) |
Coefficients are Pearson correlations.
p < .1.
p < .05
p < .01
p < .001.
Observed outcome measures by condition
| Agent DR | Agent non‐DR | Human DR | Human non‐DR | |
|---|---|---|---|---|
| Story comprehension | ||||
|
| 11.6 (4.64) | 8.67 (4.90) | 11.2 (5.32) | 9.88 (4.48) |
| Median [min, max] | 12 [2, 20] | 7 [1, 17] | 11 [0, 20] | 9.50 [2, 19] |
| Event memorization | ||||
|
| 3.42 (2.17) | 2.33 (1.86) | 3.61 (2.22) | 2.73 (2.03) |
| Median [min, max] | 3 [0, 7] | 2 [0, 7] | 3 [0, 8] | 3 [0, 7] |
| Inference making | ||||
|
| 3.58 (1.56) | 2.74 (1.83) | 3.42 (1.80) | 2.92 (1.74) |
| Median [min, max] | 4 [1, 6] | 3 [0, 6] | 4 [0, 6] | 3 [0, 6] |
| Sequence understanding | ||||
|
| 4.61 (1.84) | 3.59 (2.37) | 4.13 (2.26) | 4.23 (2.07) |
| Median [min, max] | 4 [1, 7] | 4 [0, 7] | 4 [0, 7] | 4 [0, 7] |
| Global engagement | ||||
|
| 3.09 (0.16) | 2.96 (0.17) | 3.03 (0.20) | 3.05 (0.21) |
| Median [min, max] | 3.06 [2.76, 3.52] | 2.98 [2.53, 3.26] | 3.07 [2.32, 3.35] | 3.04 [2.54, 3.71] |
| Relevant vocalization | ||||
|
| 0.13 (0.04) | 0.03 (0.06) | 0.18 (0.07) | 0.09 (0.11) |
| Median [min, max] | 0.12 [0, 0.24] | 0 [0, 0.23] | 0.16 [0.02, 0.32] | 0.04 [0, 0.34] |
| Irrelevant vocalization | ||||
|
| 0.01 (0.01) | 0.01 (0.02) | 0.02 (0.04) | 0.04 (0.06) |
| Median [min, max] | 0 [0, 0.06] | 0 [0, 0.08] | 0.01 [0, 0.15] | 0.01 [0, 0.22] |
| Positive expression | ||||
|
| 0.14 (0.21) | 0.10 (0.16) | 0.10 (0.12) | 0.09 (0.15) |
| Median [min, max] | 0.05 [0, 1] | 0.01 [0, 0.55] | 0.04 [0, 0.39] | 0.04 [0, 0.69] |
| Visual attention | ||||
|
| 0.74 (0.13) | 0.79 (0.15) | 0.70 (0.17) | 0.76 (0.17) |
| Median [min, max] | 0.75 [0.43, 0.96] | 0.83 [0.44, 0.98] | 0.74 [0.27, 0.93] | 0.82 [0.48, 0.98] |
Abbreviation: DR, dialogic reading.
Regression results and estimated marginal means of story comprehension scales by condition
|
Regression
|
Marginal means by condition
| ||||||
|---|---|---|---|---|---|---|---|
| DR | Agent | DR × Agent | Agent DR | Agent non‐DR | Human DR | Human non‐DR | |
| Story comprehension | |||||||
| Main |
| −0.14 (.12) | |||||
| Int. |
| −0.27 (.18) | 0.22 (.24) | 0.22 (.14)a | −0.39 (.14)b | 0.26 (.14)a | −0.12 (.15)ab |
| Event memorization | |||||||
| Main |
|
| |||||
| Int. |
| −0.21 (.21) | −0.03 (.27) | 0.07 (.16)ac | −0.45 (.16)b | 0.31 (.15)ac | −0.24 (.17)ab |
| Inference making | |||||||
| Main |
| −0.10 (.15) | |||||
| Int. | 0.26 (.23) | −0.23 (.23) | 0.22(.30) | 0.15 (.18)a | −0.33 (.18)b | 0.16 (.17)a | −0.10 (.19)a |
| Sequence understanding | |||||||
| Main |
| −0.02 (.15) | |||||
| Int. | 0.14 (.23) | −0.23 (.24) | 0.37 (.31) | 0.32 (.19)a | −0.19 (.21)b | 0.17 (.21)a | 0.04 (.24)a |
All coefficients and estimated marginal means are standardized. “Main” refers to the regression model that includes two experimental factors as predictors (dialogic reading as DR and reading partner as Agent). “Int.” refers to the interaction model that includes the two experimental factors as well as the interaction term between them (DR × Agent). For all regression models, covariates included age, expressive vocabulary, and prior usage of agents. Experimenter fixed effects included to adjust any potential confounding introduced by the experimenters. Regression coefficients with p values less than .1 are in bold. Pairwise comparisons with Tukey adjustments were conducted to examine the significant differences between the estimated marginal means of each two conditions. Means in the same row that do not share subscripts differ at p < .05.
p < .1.
p < .05
p < .01
p < .001.
Regression analysis of the condition effects and interaction effects on story comprehension
| Model 1 | Model 2 | Model 3 | |
|---|---|---|---|
| DR |
|
|
|
| Agent | −.24 (.19) | −.26 (.18) | −.23 (.20) |
| DR × Agent | .22 (.24) | .20 (.24) | .19 (.25) |
| DR × Expressive Vocab | .23 (.21) | — | |
| Agent × Expressive Vocab | .10 (.19) | — | |
| DR × Agent × Expressive Vocab | −.14 (.26) | — | |
| DR × Age | — |
| |
| Agent × Age | — | .00 (.18) | |
| DR × Agent × Age | — | −.17 (.27) | |
|
| .63 | .63 | .64 |
Standardized coefficients reported. Standard errors in parentheses. Regression coefficients with p values less than .1 are in in bold. For all regression models, covariates included age, expressive vocabulary, and prior usage of agents. Experimenter fixed effects included to adjust any potential confounding introduced by the experimenters.
p < .1.
p < .05.
Regression results and estimated marginal means of engagement scales by condition
|
Regression
|
Marginal means by condition
| ||||||
|---|---|---|---|---|---|---|---|
| DR | Agent | DR × Agent | Agent DR | Agent non‐DR | Human DR | Human non‐DR | |
| Global engagement | |||||||
| Main |
| 0.00 (.19) | |||||
| Int. | 0.06 (.28) | −0.36 (.28) |
| 0.25 (.22)a | −0.45 (.22)b | −0.03 (.21)a | −0.09 (.23)a |
| Relevant vocalization | |||||||
| Main |
|
| |||||
| Int. |
|
| 0.08 (.29) | 0.53 (.18)ac | −0.62 (.20)b | 1.04 (.20)c | −0.03 (.22)a |
| Irrelevant vocalization | |||||||
| Main | −0.25 (.19) |
| |||||
| Int. |
|
| 0.45 (.36) | −0.02 (.22)a | 0.03 (.25)a | 0.42 (.24)ab | 0.92 (.27)b |
| Positive expression | |||||||
| Main | .23 (.20) | 0.22 (.19) | |||||
| Int. | .18 (.29) | 0.17 (.29) | 0.09 (.38) | 0.35 (.23)a | 0.08 (.26)a | 0.09 (.26)a | −0.09 (.29)a |
| Visual attention | |||||||
| Main |
| 0.20 (.19) | |||||
| Int. | −0.40 (.28) | 0.17 (.28) | 0.06 (.37) | −0.36 (.23)a | −0.02 (.26)a | −0.59 (.25)a | −0.19 (.28)a |
All coefficients and estimated marginal means are standardized. “Main” refers to the regression model that includes two experimental factors as predictors (dialogic reading as DR and reading partner as Agent). “Int.” refers to the interaction model that includes the two experimental factors as well as the interaction term between them (DR × Agent). For all regression models, Covariates including age, expressive vocabulary, and prior usage of agents. Experimenter fixed effects included to adjust any potential confounding introduced by the experimenters. Regression coefficients with p values less than .1 are in bold. Pairwise comparisons with Tukey adjustments were conducted to examine the significant differences between the estimated marginal means of each two conditions. Means in the same row that do not share subscripts differ at p < .05.
p < .1
p < .05
p < .01
p < .001.
FIGURE 2Structural equation modeling analysis of reading condition, vocalizations, and story comprehension. Note: Solid lines are statistically significant paths, dashed lines are marginally significant paths, and dotted lines are non‐significant paths. DR, dialogic reading. † p < .10; *p < .05; **p < .01; ***p < .001
Results of structural equation modeling
| Dependent variable | Independent variable | Coefficient |
|
| |
|---|---|---|---|---|---|
| Direct paths | |||||
| Comprehension | ← | Agent non‐DR |
| (.18) | .06 |
| Comprehension | ← | Human non‐DR | .05 | (.19) | .83 |
| Comprehension | ← | Human DR | .05 | (.17) | .73 |
| Comprehension | ← | Relevant Voc. |
| (.07) | <.05 |
| Comprehension | ← | Irrelevant Voc. |
| (.07) | <.05 |
| Comprehension | ← | Expressive Vocab |
| (.07) | <.001 |
| Comprehension | ← | Age |
| (.07) | <.001 |
| Comprehension | ← | Prior CA use | −.19 | (.13) | .13 |
| Relevant Voc. | ← | Agent non‐DR |
| (.21) | <.001 |
| Relevant Voc. | ← | Human non‐DR | − | (.23) | .10 |
| Relevant Voc. | ← | Human DR |
| (.20) | <.01 |
| Relevant Voc. | ← | Expressive Vocab | .13 | (.09) | .13 |
| Relevant Voc. | ← | Age | −.13 | (.09) | .16 |
| Relevant Voc. | ← | Prior CA use | .16 | (.15) | .31 |
| Irrelevant Voc. | ← | Agent non‐DR | .02 | (.24) | .95 |
| Irrelevant Voc. | ← | Human non‐DR |
| (.26) | <.001 |
| Irrelevant Voc. | ← | Human DR |
| (.23) | .06 |
| Irrelevant Voc. | ← | Expressive Vocab | −.09 | (.10) | .34 |
| Irrelevant Voc. | ← | Age | .05 | (.11) | .66 |
| Irrelevant Voc. | ← | Prior CA use |
| (.18) | .05 |
| Indirect paths | |||||
| Comprehension | ← Relevant Voc. | ← Agent non‐DR | − | (.09) | <.05 |
| Comprehension | ← Irrelevant Voc. | ← Agent non‐DR | −.00 | (.04) | .95 |
| Comprehension | ← Relevant Voc. | ← Human non‐DR | −.07 | (.05) | .17 |
| Comprehension | ← Irrelevant Voc. | ← Human non‐DR |
| (.08) | <.05 |
| Comprehension | ← Relevant Voc. | ← Human DR |
| (.05) | .07 |
| Comprehension | ← Irrelevant Voc. | ← Human DR | −.07 | (.05) | .14 |
Standardized coefficient presented. Coefficients with p values less than .1 are in bold. Standard errors in parentheses.
Abbreviations: CA, conversational agent; DR, dialogic reading.
p < .1.
p < .05
p < .01
p < .001.