| Literature DB >> 29216185 |
Rebecca F Alford1, Andrew Leaver-Fay2, Lynda Gonzales3, Erin L Dolan3,4, Jeffrey J Gray1,5.
Abstract
Computational biology is an interdisciplinary field, and many computational biology research projects involve distributed teams of scientists. To accomplish their work, these teams must overcome both disciplinary and geographic barriers. Introducing new training paradigms is one way to facilitate research progress in computational biology. Here, we describe a new undergraduate program in biomolecular structure prediction and design in which students conduct research at labs located at geographically-distributed institutions while remaining connected through an online community. This 10-week summer program begins with one week of training on computational biology methods development, transitions to eight weeks of research, and culminates in one week at the Rosetta annual conference. To date, two cohorts of students have participated, tackling research topics including vaccine design, enzyme design, protein-based materials, glycoprotein modeling, crowd-sourced science, RNA processing, hydrogen bond networks, and amyloid formation. Students in the program report outcomes comparable to students who participate in similar in-person programs. These outcomes include the development of a sense of community and increases in their scientific self-efficacy, scientific identity, and science values, all predictors of continuing in a science research career. Furthermore, the program attracted students from diverse backgrounds, which demonstrates the potential of this approach to broaden the participation of young scientists from backgrounds traditionally underrepresented in computational biology.Entities:
Mesh:
Year: 2017 PMID: 29216185 PMCID: PMC5720517 DOI: 10.1371/journal.pcbi.1005837
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Overview of Rosetta Boot Camp lecture topics.
| Day | Lecture Topic | Learning Objectives |
|---|---|---|
| Monday | Introduction to computational protein structure prediction and design | -- |
| Introduction to the C++ programming language | 1.a.i, 1.a.ii | |
| Tuesday | Utility, numeric, basic, and core Rosetta3 libraries | 2.a.i, 2.a.ii, 2.a.iii |
| Core Rosetta3 libraries | 2.a.i, 2.a.ii, 2.a.iii | |
| Wednesday | Writing protocols in RosettaScripts | 2.e.i, 2.e.ii, 2.e.iii. 2.e.iv, 3.e.i, 3.e.ii, 3.e.iii, 3.e.iv, 3.e.v |
| Const correctness in C++ | 2.d.iv, 2.d.v | |
| Thursday | Common Rosetta modeling protocols | 2.c.i, 2.c.ii, 2.c.iii, 2.c.iv, 2.c.v, 2.c.vi, 2.c.vii |
| Controlling flexibility during modeling | 3.f.ii.4 | |
| Friday | Adding code to Rosetta | 3.f.i, 3.f.ii, 3.f.iii, 3.f.iv, 3.f.v, 3.f.vi, 3.f.vii, 3.f.viii, 2.b.i, 2.b.ii, 2.b.iii |
Overview of Rosetta Boot Camp lab activities.
| Day | Lab Activities | Learning Objectives |
|---|---|---|
| Monday | Version control and branching with Git | 1.c.i, 1.c.ii, 1.c.iii, 1.c.iv, 1.c.v, 1.c.vi, 1.c.vii |
| Writing your first Rosetta C++ modeling protocol | 2.d.i, 2.d.ii, 2.d.iii, 2.e, 2.f.i, 2.f.ii.1, 2.f.ii.2, 3.c, 3.a.i, 3.a.iii, 3.a.v | |
| Tuesday | Writing unit tests for C++ classes | 3.a.ii, 3.a.iv, 3.b.i, 3.b.ii. 3.b.iii, 3.b.iv, 3.b.v, 3.b.vi |
| Kinematic control with the FoldTree | 2.f.ii.3, 3.d | |
| Wednesday | Writing a protocol in RosettaScripts | 3.e.i, 3.e.ii, 3.e.iii, 3.e.iv, 3.e.v |
| Packaging protocols in a Mover subclass | 1.d.i, 1.d.ii, 1.d.iii, 1.d.iv | |
| Thursday | Unix primer and scripting with bash, sed, and awk | 1.a.iii, 3.d |
| Loop modeling with CCD | 2.f.ii.4 | |
| Friday | Extra time to complete remaining labs | -- |
*Each lab builds on the previous lab marked with an asterisk toward development of a complex modeling protocol.
Fig 1Overview of the “Build your own Rosetta protocol” lab.
During the evenings, students worked on a lab activity designed to guide them through the process of writing a Rosetta protocol that takes advantage of different sampling strategies. On Day 1, students outlined a basic Rosetta executable that perturbed structures and then recovered from the perturbation using side-chain packing and whole-structure minimization. On Day 2, students used the FoldTree to restrict the propagation of structural perturbations by partitioning the structure by its secondary structure. On Day 3, students wrapped their protocol in a Mover class that could be hooked into the job distribution system and our XML-based scripting language, RosettaScripts. On Day 4, students applied the CCD method to close loops opened by their perturbations. Day 5 was unstructured time for students to complete their labs. CCD, cyclic-coordinate-descent.
Intern projects from the summer 2015 and summer 2016 cohorts.
| Cohort | Project | Principal Investigator | Institution | Location |
|---|---|---|---|---|
| 2015 | Redesigning HIV broadly neutralizing antibody PGT 121 to maintain stability and increase binding potency | Bill Schief | Scripps Research Institute | La Jolla, CA |
| 2015 | Encoding covariation into re-design of PDZ domains: Is sequence tolerance context-independent? | Tanja Kortemme | University of California at San Francisco | San Francisco, CA |
| 2015 | Quantification of local contact densities at protein-small molecule and protein-protein interfaces | Justin Siegel | University of California at Davis | Davis, CA |
| 2015 | Stepwise redesign: Application for designing atomic resolution RNA | Rhiju Das | Stanford University | Stanford, CA |
| 2015 | Marburg virus antibody modeling using comparative modeling | Jens Meiler | Vanderbilt University | Nashville, TN |
| 2015 | Carbohydrate and protein effects on antibody-receptor binding | Jeffrey Gray | Johns Hopkins University | Baltimore, MD |
| 2015 | Scoring sequence for modeled folding conformation in InteractiveROSETTA using a Hidden Markov Model based on sequence-structure motifs | Chris Bystroff | Rensaleer Polytechnic Institute | Troy, NY |
| 2015 | Analyzing the molecular interactions of the α-GID/α4β2 receptor complex: An evaluation for drug design | Richard Bonneau | New York University | New York, NY |
| 2016 | Iteratively building hydrogen bond networks at protein-protein interfaces | Brian Kuhlman | University of North Carolina at Chapel Hill | Chapel Hill, NC |
| 2016 | Ligand Holes: Screening for better fitting ligands | John Karanicolas | University of Kansas | Lawrence, KS |
| 2016 | Improving player onboarding in citizen science games with three-star systems | Seth Cooper | Northeastern University | Boston, MA |
| 2016 | Computational design of auto-inhibited chemotherapeutic enzyme using Rosetta | Sagar Khare | Rutgers University | New Brunswick, NJ |
| 2016 | Structure-based prediction of non-histone human deacetylase (HDAC) 2 substrates | Ora Schueler-Furman | Hebrew University | Jerusalem, Israel |
| 2016 | Modeling cancerous mutations in CCCTC binding factor “Core” | Richard Bonneau | New York University | New York, NY |
| 2016 | Predicting glycoforms of Mucin 1 in cancer cells and identifying their binding forms | Jeffrey Gray | Johns Hopkins University | Baltimore, MD |
| 2016 | Computational design of co-assembling multi-component protein crystals in the F222 space group | David Baker | University of Washington | Seattle, WA |
Fig 2Comparison between the Rosetta REU and two other life science REU programs.
We surveyed students at the completion of the program on four outcomes: sense of community, scientific self-efficacy, scientific identity, and values alignment. Here, these data are compared to the survey results of two other life sciences REU programs. REU, Research Experience for Undergraduates.