Literature DB >> 32910644

A Unified Dynamic Programming Framework for the Analysis of Interacting Nucleic Acid Strands: Enhanced Models, Scalability, and Speed.

Mark E Fornace1, Nicholas J Porubsky1, Niles A Pierce2,3,4.   

Abstract

Dynamic programming algorithms within the NUPACK software suite enable analysis of nucleic acid sequences over complex and test tube ensembles containing arbitrary numbers of interacting strand species, serving the needs of researchers in molecular programming, nucleic acid nanotechnology, synthetic biology, and across the life sciences. Here, to enhance the underlying physical model, ensure scalability for large calculations, and achieve dramatic speedups when calculating diverse physical quantities over complex and test tube ensembles, we introduce a unified dynamic programming framework that combines three ingredients: (1) recursions that specify the dependencies between subproblems and incorporate the details of the structural ensemble and the free energy model, (2) evaluation algebras that define the mathematical form of each subproblem, (3) operation orders that specify the computational trajectory through the dependency graph of subproblems. The physical model is enhanced using new recursions that operate over the complex ensemble including coaxial and dangle stacking subensembles. The recursions are coded generically and then compiled with a quantity-specific evaluation algebra and operation order to generate an executable for each physical quantity: partition function, equilibrium base-pairing probabilities, MFE energy and proxy structure, suboptimal proxy structures, and Boltzmann sampled structures. For large complexes (e.g., 30 000 nt), scalability is achieved for partition function calculations using an overflow-safe evaluation algebra, and for equilibrium base-pairing probabilities using a backtrack-free operation order. A new blockwise operation order that treats subcomplex blocks for the complex species in a test tube ensemble enables dramatic speedups (e.g., 20-120× ) using vectorization and caching. With these performance enhancements, equilibrium analysis of substantial test tube ensembles can be performed in ≤ 1 min on a single computational core (e.g., partition function and equilibrium concentration for all complex species of up to six strands formed from two strand species of 300 nt each, or for all complex species of up to two strands formed from 80 strand species of 100 nt each). A new sampling algorithm simultaneously samples multiple structures from the complex ensemble to yield speedups of an order of magnitude or more as the number of structures increases above ≈103. These advances are available within the NUPACK 4.0 code base (www.nupack.org) which can be flexibly scripted using the all-new NUPACK Python module.

Keywords:  DNA; RNA; base-pairing; base-pairing probability; coaxial and dangle stacking subensembles; complex ensemble; concentration; dynamic programming algorithm; equilibrium; evaluation algebra; free energy model; minimum free energy; operation order; partition function; recursion; secondary structure; structure sampling; test tube ensemble

Mesh:

Substances:

Year:  2020        PMID: 32910644     DOI: 10.1021/acssynbio.9b00523

Source DB:  PubMed          Journal:  ACS Synth Biol        ISSN: 2161-5063            Impact factor:   5.110


  5 in total

1.  The stability and number of nucleating interactions determine DNA hybridization rates in the absence of secondary structure.

Authors:  Sophie Hertel; Richard E Spinney; Stephanie Y Xu; Thomas E Ouldridge; Richard G Morris; Lawrence K Lee
Journal:  Nucleic Acids Res       Date:  2022-08-12       Impact factor: 19.160

2.  Scaling up DNA digital data storage by efficiently predicting DNA hybridisation using deep learning.

Authors:  David Buterez
Journal:  Sci Rep       Date:  2021-10-15       Impact factor: 4.379

3.  Cellular Computational Logic Using Toehold Switches.

Authors:  Seungdo Choi; Geonhu Lee; Jongmin Kim
Journal:  Int J Mol Sci       Date:  2022-04-12       Impact factor: 6.208

4.  DNA circuits compatible encoder and demultiplexer based on a single biomolecular platform with DNA strands as outputs.

Authors:  Tianci Xie; Yuhan Deng; Jiarui Zhang; Zhen Zhang; Zhe Hu; Tongbo Wu
Journal:  Nucleic Acids Res       Date:  2022-08-26       Impact factor: 19.160

5.  Programmable siRNA pro-drugs that activate RNAi activity in response to specific cellular RNA biomarkers.

Authors:  Si-Ping Han; Lisa Scherer; Matt Gethers; Ane M Salvador; Marwa Ben Haj Salah; Rebecca Mancusi; Sahil Sagar; Robin Hu; Julia DeRogatis; Ya-Huei Kuo; Guido Marcucci; Saumya Das; John J Rossi; William A Goddard
Journal:  Mol Ther Nucleic Acids       Date:  2022-01-03       Impact factor: 10.183

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.