| Literature DB >> 28821538 |
Aakanksha Angra1, Stephanie M Gardner2.
Abstract
Undergraduate biology education reform aims to engage students in scientific practices such as experimental design, experimentation, and data analysis and communication. Graphs are ubiquitous in the biological sciences, and creating effective graphical representations involves quantitative and disciplinary concepts and skills. Past studies document student difficulties with graphing within the contexts of classroom or national assessments without evaluating student reasoning. Operating under the metarepresentational competence framework, we conducted think-aloud interviews to reveal differences in reasoning and graph quality between undergraduate biology students, graduate students, and professors in a pen-and-paper graphing task. All professors planned and thought about data before graph construction. When reflecting on their graphs, professors and graduate students focused on the function of graphs and experimental design, while most undergraduate students relied on intuition and data provided in the task. Most undergraduate students meticulously plotted all data with scaled axes, while professors and some graduate students transformed the data, aligned the graph with the research question, and reflected on statistics and sample size. Differences in reasoning and approaches taken in graph choice and construction corroborate and extend previous findings and provide rich targets for undergraduate and graduate instruction.Entities:
Mesh:
Year: 2017 PMID: 28821538 PMCID: PMC5589433 DOI: 10.1187/cbe.16-08-0245
Source DB: PubMed Journal: CBE Life Sci Educ ISSN: 1931-7913 Impact factor: 3.325
Criteria for evaluating graph attributes explaining the components of graph mechanics, data form, graph choice, and aesthetics
| Categories used to describe graphs qualitatively | Category descriptions | Citations |
|---|---|---|
| Graph mechanics | Title: a title should be descriptive for the graph. Axes labels: both the Units: should be appropriate and descriptive for the type of data displayed. Scale: should be appropriate for the data displayed such that the increments are clear and easy to understand. |
|
| Data form | 1. Graph should show a clear distinction between raw and manipulated data plotted. |
|
| Graph choice | Graph type: graph type should be appropriate for both the independent and dependent variables. Alignment: graph should align with the original intended purpose. Take-home message: graph type allows reader to draw appropriate conclusions from the data in the graph. |
|
| Aesthetics and visuospatial aspects | The graph should be pleasing to the eye such that the data plotted occupy sufficient room in the Cartesian plane. Sound construction and mechanistic properties enable the reader to extract meaning from the graph. |
|
Categories in the MRC and their definitions and connections to this study
| Categories in the MRC | Definitionsa | Connection to this study |
|---|---|---|
Invention | The underlying skills and abilities that allow students to conceive novel representations | Competency with graph choice, construction, and knowledge of variables is vital for conjuring new graphical representations ( In the think-aloud interviews, participants were asked to construct a graph from raw data. |
Critique | Critical knowledge that is essential for assessing the quality of representations | Assessing the strengths and weaknesses of various graphs exposes students’ critical knowledge ( Although the interviewer did not explicitly probe the participants to critique their graphs, we wanted to see whether participants spontaneously generated a critique. |
Functioning | Providing reasoning for understanding the purpose of different representations, their usage, and limitations | Functioning unearths students’ reasoning for understanding the purpose of different types of graphs and the usage being dependent on the type of data present ( In the think-aloud interviews, students were asked to articulate their graph choice. |
Learning/reflection | Strategies for fostering understanding of representations | Reflection, reveals students’ awareness of their own understanding of graphs and gaps in their knowledge ( Several times in the think-aloud interviews, participants were probed to reflect on their graph choice and construction. |
See diSessa and Sherin (2000).
FIGURE 1.A comparison of the amount of time spent in each interview phase by undergraduate students, graduate students, and professors that summarizes the time spent during the planning, construction, and reflection phases for professors (P, n = 5), graduate students (GS, n = 8), undergraduates with research experience (UGR, n = 5), and undergraduates without research experience (UGNR, n = 10). An independent-samples t test shows there was a significant difference in the amount of time spent reflecting between GS and UGR (*, p < 0.05) and GS and UGNR (**, p < 0.01).
FIGURE 2.Summary of graph-construction reasoning findings showing the presence of themes in each of the three interview phases by professors (P, N = 5), graduate students (GS, N = 8), undergraduates with research experience (UGR, N = 5), and undergraduates without research experience (UGNR, N = 10). “X” denotes the presence of a theme by one participant; “” indicates the presence of a theme by multiple participants. Because invention involves graph construction and participants were explicitly asked to reflect on graph choice, these themes are blacked out. Refer to Figures 2, 3, and 4 for themes that appeared for each participant. Small n in the table is a subset of the total sample (N) of the participant group.
FIGURE 3.Graph exemplars from all participant groups using the bacteria scenario. (A complete summary of the graphs constructed can be found in Figures 1–8 of the Supplemental Material.)
FIGURE 4.Visual summary of graph-construction reasoning, graphing behavior, and graph attribute findings with the reasoning behind graph choice and construction, graphing behaviors, and graph attributes along the novice to expert continuum.
Planning phase: summary of the themes, definitions, and participant examples
| Categories in MRC | Themes | Participant examples |
|---|---|---|
| Function | Purpose: this is when the participant explicitly states that the purpose of the graph is to align with the purpose of the task. | P2: So the question is how temperature affects growth of bacteria. P4: We might be interested in taking a particular time point that we think is key and looking at the data for the groups at that time point, or we might sort of go the whole nine yards and [make] 5 separate plots. UGR2: Okay so we’re measuring how one bacteria type grows at two different temperatures, so we have the two different temperatures and there are three tubes for each temperature, and we have different times so you can see how it grows. |
| Graph choice: the participant is explicitly stating graph choice (i.e., bar, line, scatter) based on the data provided in the table. Participants may also interject their personal feelings or rely on their past experiences when contemplating between different graph types, their usage, and limitations. | GS4: I could make a scatter plot and have different symbols for different temperatures and they have three replicates for each. UGR1: Okay I’m going to make a bar graph for the sake of comparison here. Actually I might want to change my mind about what I’m doing here. I think I’m going to change to a type of line graph. I don’t think the bars are going to be the best comparison for showing a time course of a single plant. UGNR4: I’m going to make a line graph to compare two different types of data in the same graph … I think it’s going to show best patterns of each. | |
| Invention | Graph construction: when the participant either verbalizes variables in the table to the axes on the graph or the data or explains how they are visualizing the data on the graph. | P1: So what I’m going to do is because the dependent measure is number of cells, I’m going to put that on the GS3: So definitely the independent variable is time, as they call it, the UGR1: So it will be plants with 15 ml of water per day and the bar with the lines will be for the 5 ml treatment group. UGNR9: So generally when you have time you want to put that on the |
| Data type: the participant is explicitly making decisions about whether or not to plot raw data or plot manipulated data (i.e., average) or the number of graphs to use to properly convey the data. | P5: Let’s start with the 15 ml [treatment]. What I would do is, since we have 3 points per time point I will try to get the average of the three. GS1: I would probably pull the replicates [together], although the math for this would be pretty bad in my head, and I would have to draw fake error bars. UGNR1: Well I might be able to make this into two graphs because it’ll be easier to see maybe ... or we could do the average of the three tubes. | |
| Learning or reflection | Data table: this is when the participant is making sense of the data provided in the data table as evidenced by summarizing the data and/or the variables presented. | P5: Number of leaves, 2 different amounts of water and you have time on axis and you have for each amount of water from plants. Ok. So now I should create a graph of that. GS6: I’m looking at the time, the number of cells, three test tubes, temperature needs to go there, and so it’s at 22 degrees and 10 degrees Celsius, and as the time progresses we will see whether there is any growth of bacteria or not so at 22 degrees I see there is growth and at 10 degrees there is not as much. UGR1: Okay so measurements of the number of leaves are taken every thirty hours for up to 120 hours. Looks like they have three plants in each treatment group. UGNR1: So we are doing this at 22 degrees Celsius and 10 degrees Celsius. |
GS, graduate student; P, professor; UGNR, undergraduate student who did not have research experience; UGR, undergraduate student who did have research experience.
Construction phase: summary of the themes, definitions, and participant examples
| Categories in MRC | Themes | Participant examples |
|---|---|---|
| Function | Graph choice: participant is explicitly stating graph choice (i.e., bar, line, scatter) based on the data provided in the table. Participants may also interject their personal feelings or rely on their past experiences when contemplating between different graph types, their usage, and limitations. | GS1: Oh that’s a good point, whether or not I can connect them, because [with] the [variable] time line can be discrete. I’m not sure. I think since its cell growth over time that should be fine [to do] so (connects points on the graph). UGR2: I’m using the line graph because it shows the trend the easiest, because it goes straight and up a little. UGR3: … did I say line or bar? I’m doing lines. I’m doing a line chart now I changed my mind. |
| Invention | Statistics: participant is talking about either descriptive or inferential statistics. | P5: [the trend] is almost linear and [this is] because there is some error [in the data] which I didn’t calculate (sketches error bars on each data point). P2: You do need a bigger sample size, but [I will estimate] the error [bar] for each one [treatment]. (adds error bars and labels lines as either 10C or 22C). GS7: I think what I’m going to do is take average of three tubes and make a bar for each time point at each temperature. I’m plotting to show the standard deviation from the average value. UGR4: … you can create a trendline for each dataset, so basically out of 15 ml and 5 ml, you can do the line of best fit, where you try and roughly go through as many of the points as possible. UGNR7: This graph looks like it’s not going to be linear, but I’ll make a line of best fit for each [tube] just so you can tell where it’s going. |
| Data type: participant is explicitly making decisions about whether or not to plot raw data or plot manipulated data (i.e., average) and the number of graphs to use to properly convey the data. | P1: I’m collapsing across tubes, so I’m giving total [number of cells], or I could do mean [number of cells]. GS5: There are three tubes within each temperature group, so I will do the average—calculate the mean of the number of cells for the same time point for all three tubes. And for each time point I can have the mean and standard deviation. UGR3: Okay well I’m going to make two charts then if that’s the case. I’ll make one the cell count at 22 degrees Celsius, and I’ll make another one for cell count at 10 degrees Celsius with the same axes. UGR4: Because we have three plants, which is like three trials for each, I’m going to average the number of leaves at each time for each plant for each amount of water. UGNR6: I’m thinking maybe I could do like an average number of plants that would require doing calculations. There’s fifteen milliliters of water a day. I’m just going to go ahead and do averages. | |
| Learning/reflection | Evaluation: participant is talking either about the general graphing habits, future directions, or take-home message. | P2: You do need a bigger sample size, but [I will estimate] the error [bar] for each one [treatment] (adds error bars and labels lines as either 10C or 22C). GS8: This is the most horrible graph ever because it’s not even clear what the data mean. It might be easy for me to understand what I’ve done but it’s not easy. If I gave it to you, I’m sure you would not understand it, if it was out of context. UGR4: You can see really clearly that they [lines] are increasing at the same rate but throughout the entire experiment, the 5 ml produces less leaves. UGNR3: I did this wrong … I should have put ml on the |
| Technology: participant is mentioning the habitual use of graph-making software to reflect on elements of the current graph construction. | GS3: So if I read the problem and use Excel, I can just put linear regression lines and the r2 values, both are greater than 0.8 or something (draws 2 linear regression lines through points and labels lines with r2 > 0.8). UGR1: So I feel like if I was doing this in Excel, I would make each plant its own representation symbol or its own color to better represent that. Have like a uniform structure to this but a different representation. UGR4: … if you are in Excel, you can [get] the equation for the trend line and it will tell you that y equals some function of x. From that, you can see the mathematical relationship behind the number of leaves that you have. | |
| Critique | Aesthetics: participant is using elements of graph design (i.e., gestalt principles and color) to critique the constructed graph. | UGR2: I guess I will graph the other [tubes] too and we can just imagine that they are different colors. UGNR1: I’d use different colors for the ones at 22 [degrees Celsius] and the ones at 10 [degrees Celsius] and then you can show that in the legend … . But the legend is black so I guess I’ll just graph the points at different lines. They will all be the same color. |
| Sample size: participant is critiquing the small sample size presented in the data table. | P4: With 3 plants in each, I guess you could put a standard error on that [data point]; P2: You do need a bigger sample size. UGNR6: I’ll draw the dotted line that represents the five milliliters of water per day, which is also approximately a linear line but if there was more data it could possibly be curving off to give a constant average, at least if you want any of those. |
GS, graduate student; P, professor; UGNR, undergraduate student who did not have research experience; UGR, undergraduate student who did have research experience.
Reflection phase: summary of the themes, definitions, and participant examples
| Categories in MRC | Themes | Participant examples |
|---|---|---|
| Function | Purpose: this is when the participant explicitly states that the purpose of the graph is to align with the purpose of the task. | P2: Well you want to see the effect of temperature on growth. Here (pointing to the graph), you can easily see the two treatments, [and the] two levels of temperatures that were used while they changed over time. GS4: My question was how temperature affects the growth of bacteria, so here I can see the difference between these two lines is how much difference the temperature had on the growth. |
| Time: participant is using phrases like “change over time” or “flow over time” to justify choosing a line graph. | P5: I would say that [usually] [when you have the variable] time, a line graph is used. GS1: I would be able to show how the cell number changed over time. UGR1: Things that are measuring changes over time I think lines show trends there better than my initial thought of a bar graph. | |
| Variables: participant explains variables in the data table using the words “independent” or “dependent.” | P5: So because we have independent variable, time and dependent variable, number of leaves and we have two—in this case, two different conditions of, uh, amount of water that a second variable and we can just show it as two different lines. GS1: I was trying to decide whether or not time was going to be a continuous variable. I ended up thinking it would be, even though it might not be because of the distinct chosen time points. | |
| Invention | Statistics: participant is talking about either descriptive or inferential statistics. | P2: Of course we know that as more time passes bacteria grow faster, but there could be an interaction between time and temperature [not depicted by the data plotted]. GS3: … in the beginning I was thinking [of] putting the standard deviations but I decided to [plot] the data first [and] I think that putting a linear regression is very easy to use and read. UGR4: You can’t compare the number of leaves for 15 ml at 120 hours with the 5 ml at 30 hours because that’s just not a fair comparison. You have to show them linearly and in some kind of relationship. UGNR6: A best fit line is like when you have points that almost make a linear line but they’re a little bit off which could be due to experimental error. So you draw a line that best represents all the data so it doesn’t go minimum and a maximum so it kind of evens it out if you have some equal number of points below the best fit line and above, so it makes an average between the line. |
| Learning/reflection | Evaluation: participant is talking about the general graphing habits, future directions, or take-home message. | P5: (Pointing to the graph) If this [was] 4 different plants instead of time points then I probably would have [made] a bar graph, [to accommodate for] more categories. GS8: If I were to do any other type of bar graph or something, I’m not very sure how to do that by myself. Maybe if I were to do it in Excel then, yeah. The truth is, I don’t really know what type of data to use for a bar graph. UGR4: One of the scales in the experiment was the passing of time. You can’t use a bar graph or pie chart to show the passing of time, because you’re going to want to show it like linearly along some kind of axis, so that means you’re going to have to find some way to put the data points sequentially according to the time it happened, in order to compare them accurately. UGNR1: This is the most common type of graph that I make so I thought of this kind first. |
| Data table: this is when the participant is making sense of the data provided in the data table as evidenced by summarizing the data and or the variables presented. | GS1: … since the two variables have the same cell number over time, things that are being studied could both be displayed on the same graph which would help to visualize by looking at one time point, [which is] why I chose the line [graph]. UGR1: The way this chart is presented, at first I thought it was a comparison because plant 1,2, and 3 is redundant, but that’s just in my treatment group so I misread that. UGNR3: Because in order to plot time versus number of leaves, you’d have to do a scatter plot of sorts. In retrospect, I should have made two graphs and separated them out into 5 and 15 ml. UGNR1: Because that’s what I thought about when I first looked at this chart and it does show the number of cells. | |
| Critique | Aesthetics: participant is using elements of graph design (e.g., gestalt principles and color) to critique the constructed graph. | GS8: I know that if I were to make this graph in Excel, I could put in a lot of colors and make sense out of it. UGR1: Ideally, this would be a little bit more visually appealing with different colors and evenly spaced dots and lines. |
GS, graduate student; P, professor; UGNR, undergraduate student who did not have research experience; UGR, undergraduate student who did have research experience.