Literature DB >> 31954797

Identification of novel RNA design candidates by clustering the extended RNA-As-Graphs library.

Swati Jain1, Qiyao Zhu2, Amiel S P Paz3, Tamar Schlick4.   

Abstract

BACKGROUND: We re-evaluate our RNA-As-Graphs clustering approach, using our expanded graph library and new RNA structures, to identify potential RNA-like topologies for design. Our coarse-grained approach represents RNA secondary structures as tree and dual graphs, with vertices and edges corresponding to RNA helices and loops. The graph theoretical framework facilitates graph enumeration, partitioning, and clustering approaches to study RNA structure and its applications.
METHODS: Clustering graph topologies based on features derived from graph Laplacian matrices and known RNA structures allows us to classify topologies into 'existing' or hypothetical, and the latter into, 'RNA-like' or 'non RNA-like' topologies. Here we update our list of existing tree graph topologies and RAG-3D database of atomic fragments to include newly determined RNA structures. We then use linear and quadratic regression, optionally with dimensionality reduction, to derive graph features and apply several clustering algorithms on our tree-graph library and recently expanded dual-graph library to classify them into the three groups.
RESULTS: The unsupervised PAM and K-means clustering approaches correctly classify 72-77% of all existing graph topologies and 75-82% of newly added ones as RNA-like. For supervised k-NN clustering, the cross-validation accuracy ranges from 57 to 81%.
CONCLUSIONS: Using linear regression with unsupervised clustering, or quadratic regression with supervised clustering, provides better accuracies than supervised/linear clustering. All accuracies are better than random, especially for newly added existing topologies, thus lending credibility to our approach. GENERAL SIGNIFICANCE: Our updated RAG-3D database and motif classification by clustering present new RNA substructures and RNA-like motifs as novel design candidates.
Copyright © 2020 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Graph clustering; RAG-3D database; RNA design; RNA-like motifs; Tree and dual graph topologies

Mesh:

Substances:

Year:  2020        PMID: 31954797      PMCID: PMC7078028          DOI: 10.1016/j.bbagen.2020.129534

Source DB:  PubMed          Journal:  Biochim Biophys Acta Gen Subj        ISSN: 0304-4165            Impact factor:   3.770


  27 in total

1.  Structural genomics of RNA.

Authors:  J A Doudna
Journal:  Nat Struct Biol       Date:  2000-11

2.  A computational proposal for designing structured RNA pools for in vitro selection of RNAs.

Authors:  Namhee Kim; Hin Hark Gan; Tamar Schlick
Journal:  RNA       Date:  2007-02-23       Impact factor: 4.942

3.  Using sequence signatures and kink-turn motifs in knowledge-based statistical potentials for RNA structure prediction.

Authors:  Cigdem Sevim Bayrak; Namhee Kim; Tamar Schlick
Journal:  Nucleic Acids Res       Date:  2017-05-19       Impact factor: 16.971

Review 4.  Therapeutic applications of DNA and RNA aptamers.

Authors:  Kristina W Thiel; Paloma H Giangrande
Journal:  Oligonucleotides       Date:  2009-09

5.  Opportunities and Challenges in RNA Structural Modeling and Design.

Authors:  Tamar Schlick; Anna Marie Pyle
Journal:  Biophys J       Date:  2017-02-02       Impact factor: 4.033

6.  An extended dual graph library and partitioning algorithm applicable to pseudoknotted RNA structures.

Authors:  Swati Jain; Sera Saju; Louis Petingi; Tamar Schlick
Journal:  Methods       Date:  2019-03-27       Impact factor: 3.608

Review 7.  Gene regulation by non-coding RNAs.

Authors:  Veena S Patil; Rui Zhou; Tariq M Rana
Journal:  Crit Rev Biochem Mol Biol       Date:  2013-10-28       Impact factor: 8.250

8.  Mechanisms of RNA catalysis.

Authors:  David M J Lilley
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2011-10-27       Impact factor: 6.237

9.  RNA graph partitioning for the discovery of RNA modularity: a novel application of graph partition algorithm to biology.

Authors:  Namhee Kim; Zhe Zheng; Shereef Elmetwaly; Tamar Schlick
Journal:  PLoS One       Date:  2014-09-04       Impact factor: 3.240

10.  Dual Graph Partitioning Highlights a Small Group of Pseudoknot-Containing RNA Submotifs.

Authors:  Swati Jain; Cigdem S Bayrak; Louis Petingi; Tamar Schlick
Journal:  Genes (Basel)       Date:  2018-07-25       Impact factor: 4.096

View more
  2 in total

1.  Biomolecular Modeling and Simulation: A Prospering Multidisciplinary Field.

Authors:  Tamar Schlick; Stephanie Portillo-Ledesma; Christopher G Myers; Lauren Beljak; Justin Chen; Sami Dakhel; Daniel Darling; Sayak Ghosh; Joseph Hall; Mikaeel Jan; Emily Liang; Sera Saju; Mackenzie Vohr; Chris Wu; Yifan Xu; Eva Xue
Journal:  Annu Rev Biophys       Date:  2021-02-19       Impact factor: 12.981

2.  A Fiedler Vector Scoring Approach for Novel RNA Motif Selection.

Authors:  Qiyao Zhu; Tamar Schlick
Journal:  J Phys Chem B       Date:  2021-01-20       Impact factor: 2.991

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.