Literature DB >> 19908358

RNA structural segmentation.

Ivan Dotú1, William A Lorenz, Pascal VAN Hentenryck, Peter Clote.   

Abstract

We describe several dynamic programming segmentation algorithms to segment RNA secondary and tertiary structures into distinct domains. For this purpose, we consider fitness functions that variously depend on (i) base pairing probabilities in the Boltzmann low energy ensemble of structures, (ii) contact maps inferred from 3-dimensional structures, and (iii) Voronoi tessellation computed from 3-dimensional structures. Segmentation algorithms include a direct dynamic programming method, previously discovered by Bellman and by Finkelstein and Roytberg, as well as two novel algorithms - a parametric algorithm to compute the optimal segmentation into k classes, for each value k, and an algorithm that simultaneously computes the optimal segmentation of all subsegments. Since many non-coding RNA gene finders scan the genome by a moving window method, reporting high-scoring windows, we apply structural segmentation to determine the most likely 5' and 3' boundaries of precursor microRNAs. When tested on all precursor microRNAs of length at most 100 nt from the Rfam database, benchmarking studies indicate that segmentation determines the 5' boundary with discrepancy (absolute value of difference between predicted and real boundaries) having mean -0.640 (stdev 15.196) and the 3' boundary with discrepancy having mean -0.266 (stdev. 17.415). This yields a sensitivity of 0.911 and positive predictive value of 0.906 for determination of exact boundaries of precursor microRNAs within a window of approximately 900 nt. Additionally, by comparing the manual segmentation of Jaeger et al. with our optimal structural segmentation of 16S and 16S-like rRNA of E. coli, rat mitochondria, Halobacterium volcanii, and Chlamydomonas reinhardii chloroplast into 4 segments, we establish the usefulness of (automated) structural segmentation in decomposing large RNA structures into distinct domains.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 19908358     DOI: 10.1142/9789814295291_0008

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  4 in total

1.  A domain-based model for predicting large and complex pseudoknotted structures.

Authors:  Song Cao; Shi-Jie Chen
Journal:  RNA Biol       Date:  2012-02-01       Impact factor: 4.652

2.  Optimizing RNA structures by sequence extensions using RNAcop.

Authors:  Nikolai Hecker; Mikkel Christensen-Dalsgaard; Stefan E Seemann; Jakob H Havgaard; Peter F Stadler; Ivo L Hofacker; Henrik Nielsen; Jan Gorodkin
Journal:  Nucleic Acids Res       Date:  2015-08-17       Impact factor: 16.971

3.  Multiple Sequence Alignments Enhance Boundary Definition of RNA Structures.

Authors:  Radhakrishnan Sabarinathan; Christian Anthon; Jan Gorodkin; Stefan E Seemann
Journal:  Genes (Basel)       Date:  2018-12-04       Impact factor: 4.096

4.  RNAsnp: efficient detection of local RNA secondary structure changes induced by SNPs.

Authors:  Radhakrishnan Sabarinathan; Hakim Tafer; Stefan E Seemann; Ivo L Hofacker; Peter F Stadler; Jan Gorodkin
Journal:  Hum Mutat       Date:  2013-04       Impact factor: 4.878

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.