Literature DB >> 26713687

Efficient Design of Compact Unstructured RNA Libraries Covering All k-mers.

Yaron Orenstein1, Bonnie Berger1,2.   

Abstract

Current microarray technologies to determine RNA structure or measure protein-RNA interactions rely on single-stranded, unstructured RNA probes on a chip covering together all k-mers. Since space on the array is limited, the problem is to efficiently design a compact library of unstructured ℓ-long RNA probes, where each k-mer is covered at least p times. Ray et al. designed such a library for specific values of k, ℓ, and p using ad-hoc rules. To our knowledge, there is no general method to date to solve this problem. Here, we address the problem of finding a minimum-size covering of all k-mers by ℓ-long sequences with the desired properties for any value of k, ℓ, and p. As we prove that the problem is NP-hard, we give two solutions: the first is a greedy algorithm with a logarithmic approximation ratio; the second, a heuristic greedy approach based on random walks in de Bruijn graphs. The heuristic algorithm works well in practice and produces a library of unstructured RNA probes that is only ∼1.1-times greater in size compared to the theoretical lower bound. We present results for typical values of k and probe lengths ℓ and show that our algorithm generates a library that is significantly smaller than the library of Ray et al.; moreover, we show that our algorithm outperforms naive methods. Our approach can be generalized and extended to generate RNA or DNA oligo libraries with other desired properties. The software is freely available online.

Entities:  

Keywords:  RNA secondary structure; de Bruijn graph; microarray library design

Year:  2015        PMID: 26713687      PMCID: PMC4752187          DOI: 10.1089/cmb.2015.0179

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  23 in total

1.  Compact, universal DNA microarrays to comprehensively determine transcription-factor binding site specificities.

Authors:  Michael F Berger; Anthony A Philippakis; Aaron M Qureshi; Fangxue S He; Preston W Estep; Martha L Bulyk
Journal:  Nat Biotechnol       Date:  2006-09-24       Impact factor: 54.908

2.  Design of compact, universal DNA microarrays for protein binding microarray experiments.

Authors:  Anthony A Philippakis; Aaron M Qureshi; Michael F Berger; Martha L Bulyk
Journal:  J Comput Biol       Date:  2008-09       Impact factor: 1.479

3.  Rapid determination of RNA accessible sites by surface plasmon resonance detection of hybridization to DNA arrays.

Authors:  Joshua B Mandir; Matthew R Lockett; Margaret F Phillips; Hatim T Allawi; Victor I Lyamichev; Lloyd M Smith
Journal:  Anal Chem       Date:  2009-11-01       Impact factor: 6.986

4.  Cross-linking, ligation, and sequencing of hybrids reveals RNA-RNA interactions in yeast.

Authors:  Grzegorz Kudla; Sander Granneman; Daniela Hahn; Jean D Beggs; David Tollervey
Journal:  Proc Natl Acad Sci U S A       Date:  2011-05-24       Impact factor: 11.205

Review 5.  A census of human RNA-binding proteins.

Authors:  Stefanie Gerstberger; Markus Hafner; Thomas Tuschl
Journal:  Nat Rev Genet       Date:  2014-11-04       Impact factor: 53.242

6.  Free energy minimization to predict RNA secondary structures and computational RNA design.

Authors:  Alexander Churkin; Lina Weinbrand; Danny Barash
Journal:  Methods Mol Biol       Date:  2015

Review 7.  Context-dependent control of alternative splicing by RNA-binding proteins.

Authors:  Xiang-Dong Fu; Manuel Ares
Journal:  Nat Rev Genet       Date:  2014-08-12       Impact factor: 53.242

8.  A compendium of RNA-binding motifs for decoding gene regulation.

Authors:  Debashish Ray; Hilal Kazan; Kate B Cook; Matthew T Weirauch; Hamed S Najafabadi; Xiao Li; Serge Gueroussov; Mihai Albu; Hong Zheng; Ally Yang; Hong Na; Manuel Irimia; Leah H Matzat; Ryan K Dale; Sarah A Smith; Christopher A Yarosh; Seth M Kelly; Behnam Nabet; Desirea Mecenas; Weimin Li; Rakesh S Laishram; Mei Qiao; Howard D Lipshitz; Fabio Piano; Anita H Corbett; Russ P Carstens; Brendan J Frey; Richard A Anderson; Kristen W Lynch; Luiz O F Penalva; Elissa P Lei; Andrew G Fraser; Benjamin J Blencowe; Quaid D Morris; Timothy R Hughes
Journal:  Nature       Date:  2013-07-11       Impact factor: 49.962

9.  RNA Bind-n-Seq: quantitative assessment of the sequence and structural binding specificity of RNA binding proteins.

Authors:  Nicole Lambert; Alex Robertson; Mohini Jangi; Sean McGeary; Phillip A Sharp; Christopher B Burge
Journal:  Mol Cell       Date:  2014-05-15       Impact factor: 17.970

10.  ViennaRNA Package 2.0.

Authors:  Ronny Lorenz; Stephan H Bernhart; Christian Höner Zu Siederdissen; Hakim Tafer; Christoph Flamm; Peter F Stadler; Ivo L Hofacker
Journal:  Algorithms Mol Biol       Date:  2011-11-24       Impact factor: 1.405

View more
  2 in total

1.  Optimized Sequence Library Design for Efficient In Vitro Interaction Mapping.

Authors:  Yaron Orenstein; Robert Puccinelli; Ryan Kim; Polly Fordyce; Bonnie Berger
Journal:  Cell Syst       Date:  2017-09-27       Impact factor: 10.304

2.  Joker de Bruijn: Covering k-Mers Using Joker Characters.

Authors:  Yaron Orenstein; Yun William Yu; Bonnie Berger
Journal:  J Comput Biol       Date:  2018-08-17       Impact factor: 1.479

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.