| Literature DB >> 31172257 |
Thomas J X Li1, Christie S Burris2, Christian M Reidys3,4.
Abstract
In this paper we analyze the length-spectrum of blocks in [Formula: see text]-structures. [Formula: see text]-structures are a class of RNA pseudoknot structures that play a key role in the context of polynomial time RNA folding. A [Formula: see text]-structure is constructed by nesting and concatenating specific building components having topological genus at most [Formula: see text]. A block is a substructure enclosed by crossing maximal arcs with respect to the partial order induced by nesting. We show that, in uniformly generated [Formula: see text]-structures, there is a significant gap in this length-spectrum, i.e., there asymptotically almost surely exists a unique longest block of length at least [Formula: see text] and that with high probability any other block has finite length. For fixed [Formula: see text], we prove that the length of the complement of the longest block converges to a discrete limit law, and that the distribution of short blocks of given length tends to a negative binomial distribution in the limit of long sequences. We refine this analysis to the length spectrum of blocks of specific pseudoknot types, such as H-type and kissing hairpins. Our results generalize the rainbow spectrum on secondary structures by the first and third authors and are being put into context with the structural prediction of long non-coding RNAs.Entities:
Keywords: Block; Fatgraph; Length-spectrum; Pseudoknot; RNA structure; Singularity analysis
Mesh:
Substances:
Year: 2019 PMID: 31172257 DOI: 10.1007/s00285-019-01379-8
Source DB: PubMed Journal: J Math Biol ISSN: 0303-6812 Impact factor: 2.259