| Literature DB >> 31074696 |
Carl Procko1,2, Steven Morrison2, Courtney Dunar2, Sara Mills2, Brianna Maldonado2, Carlee Cockrum2, Nathan Emmanuel Peters2, Shao-Shan Carol Huang3, Joanne Chory1,4.
Abstract
Next-generation sequencing (NGS)-based methods are revolutionizing biology. Their prevalence requires biologists to be increasingly knowledgeable about computational methods to manage the enormous scale of data. As such, early introduction to NGS analysis and conceptual connection to wet-lab experiments is crucial for training young scientists. However, significant challenges impede the introduction of these methods into the undergraduate classroom, including the need for specialized computer programs and knowledge of computer coding. Here, we describe a semester-long, course-based undergraduate research experience at a liberal arts college combining RNA-sequencing (RNA-seq) analysis with student-driven, wet-lab experiments to investigate plant responses to light. Students derived hypotheses based on analysis of RNA-seq data and designed follow-up studies of gene expression and plant growth. Our assessments indicate that students acquired knowledge of big data analysis and computer coding; however, earlier exposure to computational methods may be beneficial. Our course requires minimal prior knowledge of plant biology, is easy to replicate, and can be modified to a shorter, directed-inquiry module. This framework promotes exploration of the links between gene expression and phenotype using examples that are clear and tractable and improves computational skills and bioinformatics self-efficacy to prepare students for the "big data" era of modern biology.Entities:
Mesh:
Year: 2019 PMID: 31074696 PMCID: PMC6755220 DOI: 10.1187/cbe.18-08-0161
Source DB: PubMed Journal: CBE Life Sci Educ ISSN: 1931-7913 Impact factor: 3.325
Schedule of class activities
| Week(s) | Class activities | Graded assessment |
|---|---|---|
| 1 | Lecture: The scientific method in the context of “big data” | |
| 2 | Lecture: Sanger sequencing; NGS technologies (Illumina); traditional gene expression analysis vs. RNA-seq; introduction to plants and plant anatomy | |
| Journal club: “What became of the Neanderthals? An introduction to NGS” ( | ||
| Wet lab: Plant culture techniques and growth medium preparation | ||
| 3 | Lecture: Introduction to | |
| Guest lecture: NGS in the clinic and genetic counseling. | ||
| Journal club: “How does gene expression change during human embryo development? Visualizing RNA-seq results” ( | ||
| Computer lab: Introduction to CyVerse; aligning sequencing reads | ||
| Wet lab: Preparing light chambers; growing plants to test effect of light environment. | ||
| 4 and 5 | Computer lab: Introduction to R; differential gene expression analysis | |
| Wet lab: Scoring the effect of light on plant phenotype | ||
| 6 | Lecture: Performing qRT-PCR | R script files |
| Guest lecture: NGS applications in research (ChIP-Seq) | ||
| Computer lab: GO analysis and making sense of the data | ||
| Discussion: Formulating hypotheses and project selection | ||
| 7 | Peer review and finalization of research proposals | |
| 8–12 | Wet lab: Supervised independent research | Literature review and written proposal; theory exam |
| 13 | Completion of research and poster preparation | |
| 14 | Poster feedback; peer review; finalization; printing | |
| 15 | Departmental poster presentation | Poster presentation |
| 16 | No class | Gene expression analysis with R assignment; lab notebooks |
FIGURE 1.Summary of the schedule of class activities.
FIGURE 2.Student analysis of gene expression and phenotype of shade-treated Arabidopsis seedlings. (A) Flowchart of RNA-seq analysis and follow-up experimentation performed by students. Steps in red were completed using the Discovery Environment in CyVerse. (B) Student photograph showing the phenotype of 10-day-old Arabidopsis seedlings grown in white light or shade (5 days white light + 5 days low R:FR light). Note increased elongation growth of the hypocotyl (hyp) in shade. (C) Student worksheets were used to correlate the hypocotyl phenotype with increased expression of genes involved with auxin growth hormone signaling. Shown is cDNA sequencing reads from white-light-treated and shade-treated Arabidopsis seedlings, aligned over the YUC2 gene (boxes, exons). YUC2 codes for an enzyme involved with auxin biosynthesis (Mashiguchi ). Students noted the increased number of reads in the shade-treated plants.
Learning outcomes and associated assessment tools
| Learning outcome | Graded assessment |
|---|---|
| 1. Design and conduct an independent research project. | Literature review and written proposal; participation and lab performance; lab notebook |
| 2. Demonstrate a command of the scientific literature associated with research topic. | Literature review and written proposal |
| 3. Show mastery of techniques related to research. | Participation and lab performance; lab notebook |
| 4. Articulate scientific information orally and in writing. | Literature review and written proposal; poster presentation |
| 5. Explain NGS: what it is, how it is performed, and its various applications. | Techniques and theory exam |
| 6. Analyze an RNA-seq data set to find differentially expressed genes and formulate hypotheses in light of the relevant primary literature. | R script and bioinformatics analyses; gene expression with R assignment |
| 7. Demonstrate an understanding of basic plant anatomy and growth responses to environmental stimuli. | Literature review and written proposal; poster presentation |
| 8. Demonstrate an ability to independently plan, execute, and document phenotypic and/or molecular–genetic experiments with plants to evaluate gene expression hypotheses. | Participation and lab performance; lab notebook |
Student scores
| Activity | % of overall grade | Mean scorea |
|---|---|---|
| Participation and lab performance | 20 | 88.6 |
| Literature review and written proposal | 20 | 85.6 |
| Poster presentation | 20 | 95.4 |
| R script and bioinformatics analyses | 10 | 95.4 |
| Lab notebook | 10 | 87.5 |
| Gene expression with R assignment | 10 | 96.7 |
| Techniques and theory exam | 10 | 88.2 |
aMean score represents the mean across all students in the class, as a value out of 100: >90 represents a letter grade of “A” (excellence), 80–90 represents a “B” (satisfactory), and 70–80 represents a “C.”
Mean pre- and postclass scores for bioinformatics self-efficacy
| Item | Preclassa | Postclassa | |
|---|---|---|---|
| I am comfortable with statistical analyses in biology. | 2.625 | 3.875 | 0.029 |
| I am knowledgeable in computer programming. | 2.25 | 3.375 | 0.026 |
| I can use bioinformatics tools to answer biological questions. | 2.5 | 4.125 | 0.00093 |
| Molecular genetics and genomics excite me! | 3.875 | 4.3125 | 0.19 |
| I am likely to use R in the future for data analysis. | NAc | 3.0625 | NAc |
aStudents anonymously self-reported levels of agreement to each item statement. The response format was: strongly disagree (coded 1), disagree (2), neutral (3), agree (4), and strongly agree (5). Mean scores are shown.
bSignificance was determined by Wilcoxon-Mann-Whitney test.
cNA, not applicable. This question was asked only at the completion of the class when all students had acquired some knowledge of R coding.
Examples of postclass student comments
| Class component | Student comments |
|---|---|
| General/computer lab | Positive: “Best course at USD; learned the most.”Negative: “We went through R too fast and it was overwhelming.” |
| Wet lab | Positive: “I enjoyed the research that we did and the fact that it is the first time being done at USD.”Negative: “Research requirement … requires too much class time as well as time out of class.” |