| Literature DB >> 31231542 |
Abstract
Experiences of failure can provide valuable opportunities to learn, however, the typical classroom does not tend to function from an orientation of learning from failure. Rather, educators aim to teach accurate information as efficiently as possible, with the main goal for students to be able to produce correct knowledge when called for, in the classroom and beyond. Alternatively, teaching for failure requires instructional designs that function out of a different paradigm altogether. Failures can occur during activities like problem solving, problem posing, idea generation, comparing/contrasting cases, or inventing formalisms or pattern-based rules. We present findings from a study done in fourth-grade classes on environmental sustainability that used a design allowing for failures to occur during collaboration. These center on dialogs that included "micro-failures," where we could address how students deal with failure during the process of learning. Our design drew from "productive failure," where students are given opportunities to fail at producing canonical concepts before receiving explicit instruction, and unscripted collaborative learning, where students engage in collaboration without being directed in specific dialogic moves. By focusing on failures during an unscripted collaborative process, our work achieved two goals: (1) We singled out occurrences of failure by analyzing students' dialogs when they encountered impasses and identified several behaviors that differentially related to learning; (2) We explored how the form of task design influences the collaborative learning process around failure occurrences, showing the potential benefits of more structured tasks.Entities:
Keywords: Education; Social sciences
Year: 2019 PMID: 31231542 PMCID: PMC6544644 DOI: 10.1038/s41539-019-0045-1
Source DB: PubMed Journal: NPJ Sci Learn ISSN: 2056-7936
Fig. 1Preparation for future collaboration (PFC) phases and corresponding cognitive mechanisms. The PFC instructional design includes three phases as shown at the top. Arrows point to the cognitive learning mechanisms that are engaged during each phase. Note that multiple mechanisms are invoked in the first and second phases. The two instructional phases of Productive Failure (PF) are shown below the mechanisms. The first phase of PF typically involves group work, thus, invoking the first three cognitive mechanisms. By separating exploration and generation into (1) individual exploration and generation and (2) peer collaboration, as is done in the PFC design, we see a more nuanced representation of how each mechanism aligns to the different phases of learning activity. In both PFC and PF, the mechanism of knowledge assembly and consolidation occurs during direct instruction
Coding scheme used to identify behaviors in failure episodes
| Behavior code | Description | Example from discourse ( |
|---|---|---|
| 02 Ques/expl–partners | Students are questioning and/or explaining to one another without clear argument/debate (could be for clarification of answers, reasons for answers, or any questions related to the task) | Dyad A S1: This one refuse. S2: Refuse? S1: Ya you can use metal, so you do not need to throw away. S2: Oh yea, do not need to throw away because it is easy to wash. |
| 03 Ques/expl–teacher | Students are explaining or elaborating on their ideas, or providing reasons for answers after being prompted by the teacher. | Dyad B T: [You say] best to reduce food. Why? S1: Because later when you eat, uh… T: Yes, ok. S1: All the food left already, uh… later you, you- S2: Put it in the fridge. |
| 04 Argue–consensus | Students are engaged in an argumentation type of dialog, meaning that they are challenging one another or engaged in debate, and eventually the students come to consensus | Dyad C S1: Why do you think it is recycle? I want reuse because [ S2: Because… inside is the pen right? Er, no. The one is pen and paper? … I choose recycle. S1: You can reuse, you can keep reusing, like using the pen refiller. I think it is reuse. How about you? S2: Ya, I think it is reuse. |
| 05 Initiate teacher help | Students (either partner) initiate teacher help during a failure episode | “Teacher, can you help us?” “Teacher, Teacher! Do not know what to write.” |
| 06 Quick choice–no consensus | Either partner in a dyad quickly names/selects an answer to a question item with very little to no dialog and does not necessarily engage the other to come to consensus | Dyad D[ S1: We use tetra packs, we recycle! Of course. S2: Reduce. S1: No, not reduce! Recycle. [ |
| 07 Argue–no consensus | Students are engaged in an argumentation type of dialog, however, they do not ever come to consensus on their disagreement | Dyad E[ S1: Refuse. T: Yeah, you agree. S1: Ah no, recycle. S2: Yeah, because they… T: You must ask him if he agrees. S1: Refuse, eh no, recycle. Reuse. S2: You ask me, gotta ask me first! S1: Use less! Reduce! [ |
| 08 Quick argue–one dominator | Students engage in a very brief argument, however, one partner generally dominates the interaction and names/selects an answer | Dyad F S1: Reduce or, reduce? S2: I chose recycle. S1: How to recycle bottle? S2: Just put reuse. I think we should put… S1: I have not say my answer. |
| 09 Ignore–move forward | Students (either partner) do not engage around the failure moment but simply move past it | [ |
| 10 Distract | Students (either partner) begin to go off-task and may engage in joking, teasing, singing, talking to other students (non-partners), or otherwise engage in off-task talk | Dyad G S1: Recycle! S2: Refuse. Recycle. S1: Of course la Auntie. You dunno, I am Tommy [the turtle] speaking. S2: Wah, he call me Auntie. |
Notes: [1] Dyads D and E are examples where more than one code was used. [2] Italics are used to indicate the type of extended dialogue that was removed to save space
This table illustrates the coding scheme used to identify the nine categories of dialogic behaviors that occurred during all micro-failure episodes. It includes the code, description of the code to the right, and real excerpts from the data in the far right column. Dyads are labeled by letters to differentiate the excerpts by code. S1 and S2 refer to the two students in each dyad; they were not the same students across the dyads. Some utterances were removed for the sake of saving space and replaced with their function in the dialog using italics
Means/SDs of number of coded episodes and behaviors across conditions
| Separated by condition | |||
|---|---|---|---|
| Behaviors | Select | Generate | Total |
| 01 Failure episodes | 5.40/3.20 | 3.80/2.49 | 4.60/2.91 |
| 02 Ques/expl–partners | 1.70/1.49 | 1.40/1.71 | 1.55/1.57 |
| 03 Ques/expl–teacher | 1.50/3.14 | 0.90/1.52 | 1.20/2.42 |
| 04 Argue–consensus | 1.00/1.05 | 0.00/0.00 | 0.50/0.89 |
| 05 Initiate teacher help | 0.20/0.42 | 0.30/0.67 | 0.25/0.55 |
| 06 Quick choice–no consensus | 0.20/0.42 | 0.30/0.67 | 0.25/0.55 |
| 07 Argue–no consensus | 1.00/1.49 | 0.60/1.26 | 0.80/1.36 |
| 08 Quick argue–one dominator | 0.60/0.84 | 0.70/0.82 | 0.65/0.81 |
| 09 Ignore–move forward | 0.40/0.70 | 0.06/1.07 | 0.50/0.89 |
| 10 Distract | 0.50/0.97 | 0.70/1.25 | 0.60/1.10 |
The means and standard deviations (SDs) of the number of micro-failure episodes and the number of occurrences of the nine categories of behaviors are shown. Number 1 in the “Behaviors” column refers to the average number of micro-failures that occurred. Numbers 2–10 refer to the average number of behaviors observed by category. “Select” and “Generate” refer to the two conditions being compared. The “Total” column on the far right refers to the overall mean by category, collapsing across conditions
Correlations of dialogic behaviors and posttest scores
| Posttest scores | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Posttest scores | 1.00 | 0.21 | 0.13 | 0.24 | 0.34 | 0.26 | −0.17 | −0.09 | −0.08 | −0.19 | −0.02 |
| 01 Num failures | – | 1.00 | 0.31 | 0.48a | 0.25 | 0.51b | 0.05 | 0.34 | −0.10 | 0.39a | 0.31 |
| 02 Ques/exp peer | – | – | 1.00 | −0.04 | 0.35 | 0.07 | −0.11 | −0.19 | −0.03 | −0.12 | −0.25 |
| 03 Ques/exp teacher | – | – | – | 1.00 | −0.16 | 0.76b | 0.04 | 0.09 | 0.04 | 0.11 | 0.04 |
| 04 Argue/consens | – | – | – | – | 1.00 | 0.0 | −0.04 | −0.20 | −0.14 | −0.18 | −0.17 |
| 05 Call on teacher | – | – | – | – | – | 1.00 | −0.24 | 0.07 | −0.24 | 0.10 | 0.18 |
| 06 Quick choice | – | – | – | – | – | – | 1.00 | 0.13 | 0.40 | 0.26 | −0.05 |
| 07 Argue/no consens | – | – | – | – | – | – | – | 1.00 | −0.03 | 0.83b | 0.81b |
| 08 Dom student | – | – | – | – | – | – | – | – | 1.00 | 0.05 | −0.21 |
| 09 Ignore | – | – | – | – | – | – | – | – | – | 1.00 | 0.73b |
| 10 Distract | – | – | – | – | – | – | – | – | – | – | 1.00 |
aCorrelation is significant at the 0.05 level (two-tailed)
bCorrelation is significant at the 0.01 level (two-tailed)
We correlated the dyad posttest scores with the number of micro-failure episodes and the number of occurrences of each of the nine dialogic behaviors in each dyad. This analysis was done collapsing across the conditions for the whole data set. Kendall’s tau two-tailed tests were used to calculate correlations because the data was not normally distributed. Following across the first row within the table, note that posttest scores were not significantly correlated with the number of failures or any one behavior. However, also note that all correlations were small to moderate in magnitude, except for the Distract variable, which was near zero. The behaviors that showed small to moderate correlations with posttest scores were used in further analyses
Correlations of behaviors by category and posttests
| 1 | 2 | 3 | 4 | 5 | |
|---|---|---|---|---|---|
| 1 Posttest score | 1.00 | 0.28 | −0.15 | 0.24 | −0.31a |
| 2 Num of functional behaviors | – | 1.00 | −0.14 | 0.69b | −0.63b |
| 3 Num of dysfunctional behaviors | – | – | 1.00 | −0.62b | 0.73b |
| 4 Prop of functional | – | – | – | 1.00 | −0.85b |
| 5 Prop of dysfunctional | – | – | – | – | 1.00 |
aCorrelation is significant at the 0.05 level (one-tailed)
bCorrelation is significant at the 0.01 level (one-tailed)
The nine dialogic behaviors were collapsed into two categories. We included all behaviors that were positively correlated with posttest scores in a broader category named, “Functional,” while we included all behaviors that were negatively correlated with posttest scores in a broader category named, “Dysfunctional.” (See Table 3 for correlations of the nine behaviors and posttest scores). Correlations were calculated collapsing across conditions using Kendall’s tau due to non-normal distributions of the data. One-tailed tests were used since each broad category was hypothesized to have a unidirectional relationship to posttest scores (e.g., Functional/positive relationship, Dysfunctional/negative relationship). This table displays the correlations between the number of Functional and Dysfunctional behaviors in each dyad with posttest scores and the proportion of Functional and Dysfunctional behaviors with posttest scores. Owing to the high variation in behaviors, we calculated proportions by dividing the number of behavior occurrences in each broad category (Functional, Dysfunctional) by the total number of occurrences in order to observe a more sensitive measure of their relationships to posttest scores. Following across the first row within the table, note the significant correlation between the proportion of Dysfunctional behaviors and posttest scores
Means/SDs of behaviors by category and posttest
| Conditions | ||
|---|---|---|
| Measure | Select | Generate |
| Posttesta | 8.62/– | 6.83/– |
| Behaviorsb | ||
| Functional | ||
| Percentage | 62.15/36.79 | 38.02/39.93 |
| Mean rank | 10.40 | 8.38 |
| Dysfunctional | ||
| Percentage | 29.45/26.46 | 43.93/36.18 |
| Mean rank | 8.60 | 10.63 |
aPosttest means are adjusted for covariates
bPercentage of behaviors and Mann–Whitney mean ranks are shown
The means and standard deviations (SDs) of the prevalence of the broader categories of behaviors (Functional, Dysfunctional) are shown along with the adjusted means of posttest scores in each condition. The percentages of each type of behavior relative to the total number of occurrences and Mann–Whitney mean ranks are displayed. We used the Mann–Whitney U-test to calculate differences across conditions due to non-normal distributions. Posttest means were adjusted for covariates. “Select” and “Generate” refer to the two conditions being compared. Note that there was a higher percentage of Functional behaviors compared to Dysfunctional behaviors in the Select condition. The opposite holds true in the Generate condition, which shows a higher percentage of Dysfunctional behaviors
Fig. 2Excerpts contrasting high- and low-scoring dyads. The figure displays a side-by-side comparison of the dialogic utterances, moves (function of language), and corresponding cognitive mechanisms for the highest-scoring (Hi-Sel) dyad against the lowest-scoring (Lo-Gen) dyad relative to the whole data set. The Hi-Sel dyad was from the Select condition; the Lo-Gen dyad was from the Generate condition. Two episodes of micro-failures are shown for each dyad. S1 and S2 in the excerpts refer to the dialogic utterances of student 1 and student 2 of the Hi-Sel dyad. S3 and S4 refer to the utterances of student 3 and student 4 of the Lo-Gen dyad. Utterances were separated by speaker turns. The Hi-Sel dyad engaged in longer and more substantive interactions during micro-failures, which invoked a greater number and diversity of cognitive mechanisms. This comparison supports the finding that Select dyads interacted in ways that were helpful to learning while Generate dyads interacted in ways that hindered learning. This also mirrors the quantitative results showing a significant difference in learning outcomes, with students from the Select condition scoring higher on average than students from the Generate condition
Fig. 3Example illustrating development of codes. The figure shows an episode of a micro-failure. The episode begins with the start of a question item where the students are hedging. Hedging indicated entering into a micro-failure. Here students A and B are discussing which “R” goes best with the item of leftover food. The episode ends when the students start to discuss a new item. Utterances were separated by speaker turns. In order to develop our coding scheme, we recorded the functional behavior of each utterance, as shown in column 3. Related utterances were then collapsed into broader dialogic categories, as shown in the far right column. In this example, the students initially engaged in the broader categories of (a) questioning and explaining without debate, and (b) arguing to consensus