| Literature DB >> 26902426 |
Jeff Anderson-Lee1, Eli Fisker1, Vineet Kosaraju2, Michelle Wu3, Justin Kong4, Jeehyung Lee4, Minjae Lee4, Mathew Zada1, Adrien Treuille5, Rhiju Das6.
Abstract
Designing RNAs that form specific secondary structures is enabling better understanding and control of living systems through RNA-guided silencing, genome editing and protein organization. Little is known, however, about which RNA secondary structures might be tractable for downstream sequence design, increasing the time and expense of design efforts due to inefficient secondary structure choices. Here, we present insights into specific structural features that increase the difficulty of finding sequences that fold into a target RNA secondary structure, summarizing the design efforts of tens of thousands of human participants and three automated algorithms (RNAInverse, INFO-RNA and RNA-SSD) in the Eterna massive open laboratory. Subsequent tests through three independent RNA design algorithms (NUPACK, DSS-Opt and MODENA) confirmed the hypothesized importance of several features in determining design difficulty, including sequence length, mean stem length, symmetry and specific difficult-to-design motifs such as zigzags. Based on these results, we have compiled an Eterna100 benchmark of 100 secondary structure design challenges that span a large range in design difficulty to help test future efforts. Our in silico results suggest new routes for improving computational RNA design methods and for extending these insights to assess "designability" of single RNA structures, as well as of switches for in vitro and in vivo applications.Entities:
Keywords: RNA design; RNA secondary structure; benchmark; citizen science; inverse folding
Mesh:
Substances:
Year: 2016 PMID: 26902426 PMCID: PMC4833017 DOI: 10.1016/j.jmb.2015.11.013
Source DB: PubMed Journal: J Mol Biol ISSN: 0022-2836 Impact factor: 5.469