Benjamin M David1, Ryan M Wyllie1, Ramdane Harouaka2, Paul A Jensen1,3,4. 1. Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL, 61801. 2. Biotechnology and Bioengineering Department, Sandia National Laboratories, Livermore, CA, 94550. 3. Department of Microbiology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801. 4. Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801.
Abstract
MOTIVATION: The goal of oligonucleotide (oligo) design is to select oligos that optimize a set of design criteria. Oligo design problems are combinatorial in nature and require computationally intensive models to evaluate design criteria. Even relatively small problems can be intractable for brute-force approaches that test every possible combination of oligos, so heuristic approaches must be used to find near-optimal solutions. RESULTS: We present a general reinforcement learning framework, called OligoRL, to solve oligo design problems with complex constraints. OligoRL allows "black-box" design criteria and can be adapted to solve many oligo design problems. We highlight the flexibility of OligoRL by building tools to solve three distinct design problems: 1.) finding pools of random DNA barcodes that lack restriction enzyme recognition sequences (CutFreeRL); 2.) compressing large, non-degenerate oligo pools into smaller degenerate ones (OligoCompressor); and 3.) finding Not-So-Random hexamer primer pools that avoid rRNA and other unwanted transcripts during RNA-seq library preparation (NSR-RL). OligoRL demonstrates how reinforcement learning offers a general solution for complex oligo design problems. AVAILABILITY: OligoRL and all simulation codes are available as a Julia package at http://jensenlab.net/tools and archived at https://archive.softwareheritage.org/browse/origin/directory/?origin\_url=https://github.com/bmdavid2/OligoRL. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: The goal of oligonucleotide (oligo) design is to select oligos that optimize a set of design criteria. Oligo design problems are combinatorial in nature and require computationally intensive models to evaluate design criteria. Even relatively small problems can be intractable for brute-force approaches that test every possible combination of oligos, so heuristic approaches must be used to find near-optimal solutions. RESULTS: We present a general reinforcement learning framework, called OligoRL, to solve oligo design problems with complex constraints. OligoRL allows "black-box" design criteria and can be adapted to solve many oligo design problems. We highlight the flexibility of OligoRL by building tools to solve three distinct design problems: 1.) finding pools of random DNA barcodes that lack restriction enzyme recognition sequences (CutFreeRL); 2.) compressing large, non-degenerate oligo pools into smaller degenerate ones (OligoCompressor); and 3.) finding Not-So-Random hexamer primer pools that avoid rRNA and other unwanted transcripts during RNA-seq library preparation (NSR-RL). OligoRL demonstrates how reinforcement learning offers a general solution for complex oligo design problems. AVAILABILITY: OligoRL and all simulation codes are available as a Julia package at http://jensenlab.net/tools and archived at https://archive.softwareheritage.org/browse/origin/directory/?origin\_url=https://github.com/bmdavid2/OligoRL. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Christopher D Armour; John C Castle; Ronghua Chen; Tomas Babak; Patrick Loerch; Stuart Jackson; Jyoti K Shah; John Dey; Carol A Rohl; Jason M Johnson; Christopher K Raymond Journal: Nat Methods Date: 2009-08-09 Impact factor: 28.547
Authors: Marissa Vignali; Christopher D Armour; Jingyang Chen; Robert Morrison; John C Castle; Matthew C Biery; Heather Bouzek; Wonjong Moon; Tomas Babak; Michal Fried; Christopher K Raymond; Patrick E Duffy Journal: J Clin Invest Date: 2011-02-07 Impact factor: 14.808
Authors: Y Mizuno; P Carninci; Y Okazaki; M Tateno; J Kawai; H Amanuma; M Muramatsu; Y Hayashizaki Journal: Nucleic Acids Res Date: 1999-03-01 Impact factor: 16.971
Authors: Michaela Hendling; Stephan Pabinger; Konrad Peters; Noa Wolff; Rick Conzemius; Ivan Barišic Journal: Nucleic Acids Res Date: 2018-07-02 Impact factor: 16.971