| Literature DB >> 33615214 |
Marco Pietrosanto1, Marta Adinolfi1, Andrea Guarracino1, Fabrizio Ferrè2, Gabriele Ausiello1, Ilio Vitale3, Manuela Helmer-Citterich1.
Abstract
Structural characterization of RNAs is a dynamic field, offering many modelling possibilities. RNA secondary structure models are usually characterized by an encoding that depicts structural information of the molecule through string representations or graphs. In this work, we provide a generalization of the BEAR encoding (a context-aware structural encoding we previously developed) by expanding the set of alignments used for the construction of substitution matrices and then applying it to secondary structure encodings ranging from fine-grained to more coarse-grained representations. We also introduce a re-interpretation of the Shannon Information applied on RNA alignments, proposing a new scoring metric, the Relative Information Gain (RIG). The RIG score is available for any position in an alignment, showing how different levels of detail encoded in the RNA representation can contribute differently to convey structural information. The approaches presented in this study can be used alongside state-of-the-art tools to synergistically gain insights into the structural elements that RNAs and RNA families are composed of. This additional information could potentially contribute to their improvement or increase the degree of confidence in the secondary structure of families and any set of aligned RNAs.Entities:
Year: 2021 PMID: 33615214 PMCID: PMC7884220 DOI: 10.1093/nargab/lqab007
Source DB: PubMed Journal: NAR Genom Bioinform ISSN: 2631-9268