| Literature DB >> 26805815 |
Kun Yan1, Yasir Arfat2, Dijie Li3, Fan Zhao4, Zhihao Chen5, Chong Yin6, Yulong Sun7, Lifang Hu8, Tuanmin Yang9, Airong Qian10.
Abstract
Long noncoding RNAs (lncRNAs), which form a diverse class of RNAs, remain the least understood type of noncoding RNAs in terms of their nature and identification. Emerging evidence has revealed that a small number of newly discovered lncRNAs perform important and complex biological functions such as dosage compensation, chromatin regulation, genomic imprinting, and nuclear organization. However, understanding the wide range of functions of lncRNAs related to various processes of cellular networks remains a great experimental challenge. Structural versatility is critical for RNAs to perform various functions and provides new insights into probing the functions of lncRNAs. In recent years, the computational method of RNA structure prediction has been developed to analyze the structure of lncRNAs. This novel methodology has provided basic but indispensable information for the rapid, large-scale and in-depth research of lncRNAs. This review focuses on mainstream RNA structure prediction methods at the secondary and tertiary levels to offer an additional approach to investigating the functions of lncRNAs.Entities:
Keywords: function; lncRNAs; secondary structure; structure prediction; tertiary structure
Mesh:
Substances:
Year: 2016 PMID: 26805815 PMCID: PMC4730372 DOI: 10.3390/ijms17010132
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1The graphical abstract of this review.
Figure 2Mechanisms of lncRNA action in transcriptional regulation. (a) Transcription of the lncRNA SRG1 inhibits the expression of the SER3 gene by interfering with the binding of RNA polymerase II to DNA; (b) The expression of the p15 antisense RNA, the lncRNA of a tumor suppressor gene, results in the silencing of the p15 gene through the induction of heterochromatin formation, which persisted after the p15 antisense RNA was turned off; (c) lncRNA binds to the major DHFR promoter and IIB, a general transcriptional factor, to form a stable and specific complex to dissociate the preinitiation complex from the major DHFR promoter; (d) As a response to stress, the RNA-binding protein TLS, under allosteric modulation via lncRNA upstream of CCND1, binds to chromatin-binding protein (CBP) and inhibits CBP/P300 HAT activities on CCND1; (e) The lncRNA Evf2, a crucial co-enhancer of regulatory proteins involved in transcription, cooperates with the Dlx2 protein to activate the Dlx5/6 enhancer in a target gene; (f) In response to heat shock, the lncRNA HSR1 (heat shock RNA-1) promotes the trimerization of HSF1 (heat-shock transcription factor 1), and consequently the translation factor EIF interacts with HSR1 and HSF1 to forms a complex to facilitate the expression of heat-shock protein (HSP); (g) NFAT is nuclear factor of activated T cells. The lncRNA NRON (noncoding repressor of NFAT) may form a complex with importin proteins to regulate the subcellular localization of NFAT. The knockdown of NRON increases the expression and activity of NFAT; (h) The lncRNA metastasis-associated lung adenocarcinoma transcript 1(MALAT1) has been shown to be abnormally expressed in many human cancers. The nascent MALAT1 transcript is cleaved by RNase P to produce the 3′ end of the mature MALAT1 transcript and the 5′ end of the small RNA; (i) Several studies have elucidated that some lncRNAs can act as microRNA sponges to competitively bind to microRNAs and decrease microRNA-induced tumorsphere differentiation.
Comparison of the various major methods to predict RNA secondary structure.
| Method | Principle | Advantages | Limitations |
|---|---|---|---|
| Foldalign [ | Sankoff, dynamic programming algorithm | time complexity decreased | length of sequence shorter than 300 nt; low speed and efficiency |
| Dynalign [ | Sankoff, dynamic programming algorithm | suboptimal secondary structures accessible; constrained information added | Pseudoknots not predictable |
| MARNA [ | folding sequences using minimum free energy; proceedings structural alignment | individual parameters freely set | total length of sequences shorter than 10,000 nt |
| Mfold [ | Zuker’s dynamic programming algorithm based on minimum free energy model | priori knowledge specified; structure of circular RNA sequence predictable; some values related to structure artificially made | only structure of single stranded RNA predictable |
| Alifold/RNAfold [ | minimum free energy model and multiple sequence alignment | containing incorrect characters; single stranded RNA and several stranded RNAs predictable; base pairing of G and U acceptable | when predicting several stranded RNAs, only producing consensus structure instead of the secondary structure of each sequence; when predicting single sequence, its length requirement is less than 300 nt; total length of sequence not to exceed 10K nt when predicting consensus structure |
| RNAshapes [ | abstract shapes approach | single stranded RNA, sequence files and multi-sequence files predictable; redundant suboptimal structures avoided | does not consider folding kinetics; minimum free energy prediction may be incorrect |
| RNAstructure [ | dynamic programming algorithm and Sankoff | number of suboptimal structures limited; structures constrained by experimental data | only AGCU predictable |
Various methods for predicting RNA tertiary structure.
| Method | Principles | Advantages | Limitations |
|---|---|---|---|
| FARNA [ | coarse-grained models, minimum free energy | better computational efficiency | small RNA molecules (<40 nt) |
| NAST [ | coarse-grained models, knowledge-based energy function | relatively high modeling speed; constraint models | computational complexity |
| iFoldRNA [ | discrete molecular dynamics | rapid conformational sampling ability | small RNA molecules (<50 nt) |
| BARNACLE [ | probabilistic model, sampling of RNA conformations in continuous space | efficient sampling of 3D RNA conformations on a short length scale | small RNA molecules (<50 nt); sample difficulty |
| CG Model [ | molecular dynamics based on a new statistical coarse-grained potential | high computational efficiency | small RNA molecules or those with simple topology |
| RNA2D3D [ | base-pairing structure of RNA molecules | can predict pseudoknots | obtaining reasonable RNA tertiary structure to be solved |
| Vfold Model [ | physics-based method | statistical mechanical calculations for the conformational entropy of RNA tertiary structures | does not consider the sequence-dependent tertiary contacts |
| RSIM [ | fragment assembly | the reduction in the size of conformational space sampled; reasonable base-pairing constraints | RNA molecules with pseudoknot structures not automatically predictable |
| 3dRNA [ | hierarchical approach to construct RNA tertiary structure | highest prediction accuracy | ability to model larger RNA molecules or those with complex topology |