Literature DB >> 31178900

Coding or Noncoding, the Converging Concepts of RNAs.

Jing Li1, Changning Liu1.   

Abstract

Technological advances over the past decade have unraveled the remarkable complexity of RNA. The identification of small peptides encoded by long non-coding RNAs (lncRNAs) as well as regulatory functions mediated by non-coding regions of mRNAs have further complicated our understanding of the multifaceted functions of RNA. In this review, we summarize current evidence pointing to dual roles of RNA molecules defined by their coding and non-coding potentials. We also discuss how the emerging roles of RNA transform our understanding of gene expression and evolution.

Entities:  

Keywords:  coding potential; long noncoding RNA; messenger RNA; micropeptide; ribosome profiling

Year:  2019        PMID: 31178900      PMCID: PMC6538810          DOI: 10.3389/fgene.2019.00496

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


Introduction

Benefiting from the advances in science and technology, our understanding of the complexity of organisms is constantly increasing. The “central dogma” of molecular biology states that genetic information is typically processed from DNA to RNA to protein, and this decides cellular and organismal phenotype (Crick, 1970). In the past, RNAs, except for infrastructural RNAs (such as rRNAs and tRNAs), were commonly considered as an intermediate between DNA and proteins. However, over recent decades, the rapid development of high-throughput sequencing technologies has revealed the pervasive transcription of eukaryotic genomes (Okazaki et al., 2002; Carninci et al., 2005; Kapranov et al., 2007; Lander, 2011), thus revealing RNA-mediated gene regulation. The fact that most regulatory RNAs function without involvement in protein translation led us to re-examine the roles of RNAs in the development and evolution of higher organisms. In higher organisms, only a small fraction of genetic transcripts (less than 3%) have the capability to encode proteins, despite pervasive transcription across genomes. This raises the question of whether the remaining non-protein-coding transcripts are transcriptional “noise” or contain more genetic information. Large-scale projects for the systematic annotation and functional characterization of genes (such as ENCODE and FANTOM) have reported that at least 80% of mammalian genomic DNA is actively transcribed and elaborately regulated, with the vast majority of this considered to be noncoding RNA (ncRNA) genes (Consortium, 2012; Hon et al., 2017). The numbers of ncRNA genes vary between species, and interestingly, the complexity of an organism is highly associated with the abundance of ncRNA genes but not protein-coding genes, implying the potential importance of ncRNAs (Rubin et al., 2000; Stover et al., 2000; Mattick, 2001; Venter et al., 2001; Kapusta and Feschotte, 2014). Among these, lncRNAs that are defined as transcripts longer than 200 nucleotides with low/no protein-coding potential, represent a considerable proportion. Long non-coding RNAs can regulate gene expression in various ways, including epigenetic, transcriptional, post-transcriptional, translational and protein location effects. Corresponding to functional diversity, the modes of action of lncRNAs are also quite varied. lncRNAs can recruit epigenetic factors to modify chromatin state (Rinn and Chang, 2012), assemble transcriptional machinery to trigger the initiation of transcription (Bonasio and Shiekhattar, 2014), or act as a structural organizer to participate in the formation of subcellular organelles (Naganuma and Hirose, 2013). Additionally, lncRNAs can complementarily bind with other forms of RNA molecules to modulate gene expression at transcriptional, post-transcriptional and translational levels, for example as a moderator of mRNA activity or a decoy/sponge for miRNA (Poliseno et al., 2010; Gong and Maquat, 2011; Bonasio and Shiekhattar, 2014; Tay et al., 2014; Yoon et al., 2014). Moreover, lncRNAs couple with proteins through particular structures to act as a location transferor, or to modulate enzyme activities (Wang and Chang, 2011). Based on the “noncoding” definition, the modes of action of lncRNAs mentioned above are exerted primarily through ncRNAs. Intriguingly, recent bioinformatics analyses of large-scale data from ribosome-protected RNA fragments (ribosome profiling or ribo-profiling) have revealed that a considerably large part of these transcripts tends to contain sORFs and binds with ribosomes (Aspden et al., 2014; Ruiz-Orera et al., 2014; Anderson et al., 2015; Mackowiak et al., 2015; Olexiouk et al., 2016), suggesting that the coding potential of lncRNAs has been vastly underestimated. Several functional experiments have demonstrated that some lncRNAs can encode small peptides (named “micropeptides” with a length less than 100 aa) that are involved in various biological processes, although this is rare (Hubé and Francastel, 2018). In addition, certain coding transcripts, such as TP53 mRNA, could also function as RNA, without translation to proteins, to regulate significant biological processes (Candeias, 2011; Kloc et al., 2011). Therefore, it seems reasonable to presume that the demarcation of RNA depending on its coding or noncoding status is somewhat blurred, and partially intertwined. That is, RNA roles are likely not tightly constrained (such as RNA functioning only as mRNA or ncRNA), but rather converge and overlap: lncRNAs can function by encoding small peptides, while mRNAs can use their special structural features, such as the 3′ UTR or 5′ UTR, to function (Figure 1).
FIGURE 1

The interchangeable roles between coding and long noncoding RNAs. Traditionally, RNAs could be divided into two categories in accordance with their coding potential, that is, coding RNAs and noncoding RNAs. Coding RNAs generally refers to mRNA that encodes protein ① to act as various components including enzymes, cell structures, and signal transductors. Noncoding RNAs act as cellular regulators without encoding proteins ③. However, it appears that the boundaries blur between coding RNA and noncoding RNA as some coding mRNAs can function without translating to protein via the formation of RNA secondary structure primarily derived from the UTR ②; some lncRNAs can bind with ribosomes, and encode peptides to modulate cellular activities ④.

The interchangeable roles between coding and long noncoding RNAs. Traditionally, RNAs could be divided into two categories in accordance with their coding potential, that is, coding RNAs and noncoding RNAs. Coding RNAs generally refers to mRNA that encodes protein ① to act as various components including enzymes, cell structures, and signal transductors. Noncoding RNAs act as cellular regulators without encoding proteins ③. However, it appears that the boundaries blur between coding RNA and noncoding RNA as some coding mRNAs can function without translating to protein via the formation of RNA secondary structure primarily derived from the UTR ②; some lncRNAs can bind with ribosomes, and encode peptides to modulate cellular activities ④. In the present article, we will review current studies of the bilateral functionality of lncRNAs and mRNAs in terms of their coding potential, as well as the advancement of high-throughput techniques that would facilitate a deeper recognition of functional diversity of RNAs. This review will highlight the cases that illuminate the contrapositive roles between lncRNAs and mRNAs, and briefly discuss the biological significance of these discoveries for gene expression and evolution.

Long Noncoding Rnas Encode Small Peptides/Proteins With Regulatory Functions

Peptides/Proteins Encoded by Regular Long Noncoding RNAs

The original definition of lncRNAs concerns their low/non-coding potential. However, with accumulating evidence from bioinformatics and ribosome transcriptome profiling, lncRNAs have been shown to display strong ribosomal associations in many species, varying from plant to animal, indicating a potential coding capacity in lncRNA sORFs (Kageyama et al., 2011; Nam et al., 2016; Yeasmin et al., 2018). In recent years, several micropeptides derived from lncRNAs have been shown to be functional. We have summarized these micropeptides in Table 1.
Table 1

Peptides encoded by lncRNAs in plants and animals.

OriginMicropeptidesGeneFunctionSize (AAs)References
PlantsEarly nodulin 40 (Enod 40)Enod 40Nodule organogenesis12; 24Rohrig et al., 2002
PlantsPOLARIS (PLS)POLARISLeaf morphogenesis36Chilley et al., 2006
PlantsROTUNDIFOLIA (ROT4)ROT4Leaf morphogenesis53Narita et al., 2004
PlantsROT18/ DLV1DEVIL1 (DVL1)Plant organogenesis51Wen et al., 2004
PlantsKiss of death (KOD)Kiss of death (KOD)Programmed cell death regulation25Blanvillain et al., 2011
Plants and animalsBrick1 (Brk)Brick1 (Brk)Leaf morphogenesis84Frank and Smith, 2002
PoaceaeZm401p10; Zm908p11Zm401Pollen development89; 97Ma et al., 2008
MammalsDWORFLOC100507537Enhance muscle performance34Nelson et al., 2016
VertebratesToddlerLOC100506013Promotes cell migration; activator of APJ/Apelin receptor signaling54Pauli et al., 2014
VertebratesMyomixerLOC101929726Functionally involve in controlling muscle performance84Bi et al., 2017
Fruit flyPripolished rice (pri)Epidermal morphogenesis in embryogenesis11; 32Kondo et al., 2007
Fruit flyMOTS-c12s rRNAInsulin sensitivity and metabolic Homeostasis.16Lee et al., 2015
Fruit flyPgc polypeptidepgcPositivephosphorylation of transcription elongation factor b (P-TEFb)71Hanyu-Nakamura et al., 2008
Fruit flysarcolamban (Scl)pncr003:2LCalcium transport and muscle contraction,28; 29Magny et al., 2013
Fruit FlyTarsal-less/talLP10384Morphogenesis, including tissue morphogenesis and pattern formation11Galindo et al., 2007
MammalsPINT87aaLINC-PINTInhibit the transcriptional elongation of multiple oncogenes87Zhang et al., 2018b
MammalsMitoregulin (Mtln)LINC00116Mitochondrial respiration, ROS, and Ca2+ retention capacity56Stein et al., 2018
MammalsMRI-2C7orf49Non-homologous end-joining DNA repair69Slavoff et al., 2014
From humans to zebrafishAGD3AGU1Human stem Cell differentiation63Kikuchi et al., 2009
MammalsNoBodyLINC01420/ LOC550643mRNA turnover and nonsense-mediated decay (NMD)67D’Lima et al., 2017
MammalsMinionLOC101929726Muscle development84Zhang et al., 2017
Human and mouseSPARLINC00961Regulate muscle regeneration90Matsumoto et al., 2017
PrimatesHOXB-AS3 peptideHOXB-AS3Suppresses colon cancer growth, PKM splicing and subsequent metabolic reprogramming53Huang et al., 2017
Different speciesHumanin16s rRNAProgram cell death24Hashimoto et al., 2001
HumanSRAPSRADiverse roles in both normal biological processes and pathological changes224; 236Emberley et al., 2003; Chooniedass-Kothari et al., 2006
Mouse and humanMyoregulin (MLN)LINC00948 in human; 2310015B20Rik in mouseInhibit the SERCA activity and regulate Ca2+ flow in muscle46Anderson et al., 2015
Peptides encoded by lncRNAs in plants and animals. Steroid receptor RNA activator is a prototypic example of lncRNAs with both coding and noncoding products (Lanz et al., 1999; Mattick, 2003; Hubé et al., 2006, 2011). SRA was initially identified as a noncoding gene with multiple RNA isoforms, which is critical in many biological processes, such as acting as a co-activator of nuclear receptors and a regulator of steroid receptor-dependent gene expression (Hubé et al., 2006, 2011; Cooper et al., 2011). Interestingly, SRA can also encode for a conserved SRAP, which, in turn, represses the transcriptional regulatory activity of the SRA1 gene by interacting with a specific SRA stem-loop (Emberley et al., 2003; Chooniedass-Kothari et al., 2006; Hubé et al., 2011). The transmissible functionalities between the coding and noncoding SRA gene are caused by alternative splicing (AS) of introns/extrons (Colley and Leedman, 2011), suggesting the significance of AS events in the generation of bifunctional RNA. Of note, among the small number of already-known functional micropeptides, a few are muscle-specific, and have been implicated in the regulation of the activities of SERCA (Anderson et al., 2016; Nelson et al., 2016; Matsumoto et al., 2017). For example, MLN, a 46-aa micropeptide specifically expressed in skeletal-muscle, is encoded by a lncRNA (LINC00948 in human and 2310015B20Rik in mouse); it can directly interact with SERCA to decrease the affinity of this ATPase for Ca2+ and inhibit Ca2+ entry into the SR (Anderson et al., 2015). The Scl micropeptide is encoded by the noncoding pncr003:2L gene, and can affect Ca2+ traffic in cardiac muscle in the fly; the mutation of this gene triggers an arrhythmic phenotype (Magny et al., 2013). The MOTS-C micropeptide can regulate insulin sensitivity and metabolic homeostasis in the mitochondria of muscle cells, and derives from mitochondrial 12S rRNA (Slavoff et al., 2014; Lee et al., 2015). In the above example, the Scl peptides and their respective regulatory functions in the heart are quite conserved between species, including the fly and humans (Magny et al., 2013). These results indicate that several sORFs embedded in the noncoding region of the genome seem to undergo a relatively stricter natural selection than adjacent sequences, raising the question of whether these sORFs have a capability to sprout into a new gene in situ or to be integrated as a component into new genes elsewhere during evolution. The tal gene in Drosophila is of vital importance in tarsal morphogenesis in the fly leg, and stage-and position-specific expression have been reported in embryonic development. Although tal is regarded as a noncoding gene as none of its ORFs are over 100 aa, deeper analysis has found that the functionality of tal is predominantly dependent on the ORF regions (Manak et al., 2006). There are five ORFs in the tal gene, four of which contain a similar and conserved 7 aa motif that determines the functionality of the gene, with the shortest peptide of only 11 aa. Phylogenetic analysis revealed that these tal-like peptides are conserved in metazoans and represent a new class of eukaryotic genes. The discovery of these mini-peptides further expands the possible scope and function of lncRNA-encoded peptides that are hidden in currently sequenced genomes and the transcriptome (Galindo et al., 2007).

Peptides/Proteins Encoded by Circular RNAs

Circular RNAs pertain to a sub-category of specialized lncRNAs, which are primarily produced by backsplicing the 3′ end to the 5′ end of exons in the same transcript (often a coding gene) via the spliceosome, thereby forming lncRNAs in a circular shape (Ashwal-Fluss et al., 2014; Zhang X.O. et al., 2014; Starke et al., 2015). Through bioinformatics analysis and high-throughput sequencing, many circRNAs have been identified in multiple species (Sanger et al., 1976; Capel et al., 1993; Danan et al., 2012; Memczak et al., 2013; Jeck and Sharpless, 2014; Wang et al., 2014). However, understanding of circRNA function is still very limited. The reported biological activities of circRNAs include acting as a sponge for microRNAs (Hansen et al., 2013; Memczak et al., 2013), as a competitor during pre-mRNA splicing (Ashwal-Fluss et al., 2014), and as a transcriptional regulator in the nucleus (Li et al., 2015). The majority of circRNAs are chimeric lncRNAs derived from mRNA transcripts and likely in part encompass the exons of protein-coding genes. This poses the question of whether circRNAs have protein coding capabilities. In fact, many studies have demonstrated that circRNAs have coding capabilities both in vitro and in vivo in terms of cap-independent translation (Chen and Sarnow, 1995; Li and Lytton, 1999; Guo et al., 2014; Jeck and Sharpless, 2014; Abe et al., 2015; Wang and Wang, 2015; Pamudurti et al., 2017). Moreover, some functional protein products are encoded by circRNAs (such as circ-FBXW7; circ-Mbl, circ-ZNF609 and circ-SHPRH) (Rybak-Wolf et al., 2015; Pamudurti et al., 2017; Yang et al., 2018; Zhang et al., 2018a). circ-ZNF609 was initially screened out in a functional genetic screen, and is differentially expressed during myogenesis (Legnini et al., 2017). This circRNA contains an ORF covering almost all ORF regions of the host gene, but has a small variation at the splice junction. Its protein product lacks the zinc-finger domain compared with its linear counterpart, with an obvious impact on myoblast proliferation. Interestingly, heat shock could significantly activate the translation of circ-ZNF609; suggesting a possible regulatory role of circRNA translation under specific stimuli (Legnini et al., 2017). circ-Mbl was first detected in the lodge of the second exon of the splicing factor muscleblind (MBL/MBNL1) in flies and humans, with a function of competing with pre-mRNA splicing (Ashwal-Fluss et al., 2014). Recently, through a bioinformatics analysis of ribosome foot-printing datasets, Pamudurti and co-workers revealed that circ-Mbl could encode a peptide in the fly head, as detected through MS. Both circ-Mbl1 RNA and its protein-related product reside in the synaptosome and can be regulated by the 4E-BP and the transcription factor in forkhead family – FOXO, suggesting that this circRNA translation might be distinctively important in the brain (Pamudurti et al., 2017). The observation that circRNAs generate proteins can be traced back to much earlier studies in Archaea, where circularized introns produce a site-specific endonuclease (Dalgaard et al., 1993). However, to date, direct experimental evidence for circRNA translation to peptides is still scarce; as a result, it is even tougher to understand the function of their translated products. Considering that most circRNAs stem from coding transcripts and contain complete exons, it is possibly assumed that the circRNAs and their coding-products might provide uncharacterized modes of regulation of gene and protein expression (Pamudurti et al., 2017). Therefore, it is important to further investigate the possible functions associated with circRNA coding.

Large-Scale Approaches for the Identification of Potential sORFs

To date, hundreds of thousands of lncRNAs have been discovered in various species, and there is a desire to study their relevant functional mechanisms (Okazaki et al., 2002; Liu et al., 2005; Kapranov et al., 2007; Ponting et al., 2009; Ulitsky and Bartel, 2013; Volders et al., 2013). However, it is unpractical to identify lncRNAs and predict their functions using only traditional technical approaches, irrespective of the requirement for intensive validation of the exact mechanisms underlying lncRNA activities. The same is true for the identification of lncRNA coding capacity. Therefore, new large-scale technologic approaches based on computational analysis of transcriptome data and proteomics data have been developed, all of which are mutually reinforcing and cross-validated. A ribo-seq technique has been recently developed and is widely used to measure the full coding potential of RNA transcripts on a genome scale through deep sequencing of ribosome-protected RNA fragments (Ingolia et al., 2009). By identifying the precise ribosomal positions of RNAs, ribo-seq can plot the potential on-going events of translation in the cytosol, which is useful in identifying potentially functional micropeptides (Ingolia et al., 2011; Ingolia, 2016). With the advent of Ribo-Seq, thousands of translated sORFs were discovered in lncRNAs (Ingolia et al., 2011; Bazzini et al., 2014; Ruiz-Orera et al., 2014; Ji et al., 2015), with a few functional peptides, such as MLN (Anderson et al., 2015) and HOXB-AS3 (Huang et al., 2017). However, the proportion of coding lncRNAs estimated by various ribosome-profiling studies differ widely (Guttman et al., 2013; Ingolia et al., 2014), resulting from false positive and distinct prediction thresholds. Therefore, MS has emerged as a complementary method. Mass spectrometry demonstrates excellent performance in detecting and characterizing the products of proteins/peptides in a complex biological sample. The detection of lncRNA-encoded peptides is the most direct evidence for lncRNA coding potential. However, to date, the proportion of coding lncRNAs detected by MS-based proteomes is small compared with that in ribo-seq results (Verheggen et al., 2017). The main weakness attributed to this approach is that MS-based proteomics is obviously impacted by the length and concentration of the detected samples. Therefore, specialized methods have been developed to circumvent these detection limitations. Short translation products at low abundance can surmount the threshold of MS detection through the use of peptidomics approaches (Schulz-Knappe et al., 2005) and enrichment protocols (Mustafa et al., 2015). Both of the above techniques have their respective advantages and shortcomings; therefore, “proteogenomics” has been developed (Nesvizhskii, 2014; Menschaert and Fenyö, 2017; Ruggles et al., 2017). In proteogenomics, proteomics data are systematically integrated and analyzed with genomics and transcriptomic data generated from DNA-sequencing, RNA-sequencing and ribosome-profiling. The predicted sequences of proteins/peptides are tracked back to the genome and transcripts to identify the gene expression patterns and actual translational events. The significance of proteogenomics studies lies in improving genome annotation, and reasonably applying multi-omics data to explore complex and profound mechanisms in biological activities and complex diseases (Zhang B. et al., 2014; Zhang et al., 2016; Mertins et al., 2016).

Noncoding RNA Regulatory Functions Embedded in mRNAs

3′ UTR Regulatory Roles of mRNAs

Based on current research results, the noncoding regulatory functions discovered in mRNAs are mainly present in the 3′ UTRs, which were previously supposed to be the vital regulative elements for mRNA self-stability and location. Compared with highly conserved coding regions that have to undergo strictly selected pressure, the 3′ UTR displays more flexibility and plasticity between species. Its size varies from a few to hundreds of nucleotides, and likely has a close relationship with biological complexity (Chen et al., 2012; Mayr, 2016). Moreover, for an RNA molecule, other than the impact on base pairing, the changes in sequence are most likely to induce corresponding changes in structure, resulting in information transmitted from RNA to protein (Berkovits and Mayr, 2015). By comprehensively estimating up-to-date cases where mRNAs regulate biologic activities without translating to protein, we found that the 3′ UTR of mRNA plays a large role as an effectors. Increasing evidence has demonstrated that the 3′ UTRs of mRNAs are actively involved in repressing the occurrence and progression of cancer cells, such as the 3′ UTRs of α-tropomyosin mRNA, prohibitin mRNA and ribonucleotide reductase mRNA (Rastinejad et al., 1993; Fan et al., 1996; Manjeshwar et al., 2003). These studies demonstrate that the 3′ UTRs of some mRNAs can antagonize tumor development, likely through RNA interactions with regulatory factors involved in cellular growth in a post-transcriptional pattern. Indeed, the 3′ UTR can recruit RNA-binding proteins, as in the case of CD47 mRNA. CD47 mRNA has two isoforms of the 3′ UTR, long (CD47-LU) and short (CD47-SU), and only the CD47-LU, which is AU-rich, can interact with the RNA-binding protein TIS11B to form a membraneless organelle with a specific biochemical and biophysical environment which is separate from the cytosol (Ma and Mayr, 2018). However, the most prevalent mode of action of the 3′ UTR is as ceRNAs, such as in the cases of CCR2 mRNA and Ube3a1 RNA, which confer to the function of lncRNAs (Valluy et al., 2015; Hu et al., 2017).

Noncoding Regulatory Roles of mRNA Not Involving the 3′ UTR

Other than the 3′ UTR, the 5′ UTR and ORF can also be involved in RNA-mediated regulatory function, although recent reports of this phenomenon are scare. Two mRNAs, TP53 mRNA and HIST1H1C mRNA, are recognized as being involved in ORF-mediated regulation. TP53 protein is a tumor suppressor implicated in many processes during tumor occurrence and development. However, a triple synonymous mutant (TriMp53) in codons led to a misshapen structure, resulting in loss of the IRES activity of p47 (one isoform of p53) and an abrogated affinity of hnRNPC, but with better binding to Mdm2, which is an E3 ubiquitin-protein ligase in mediating p53/TP53 ubiquitination, and an augmented ability of p53 to activate apoptosis. These facts indicate that TP53 has intricate regulatory roles at both the RNA and protein levels, suggesting that the functions of the RNA and protein molecules are closely intertwined (Candeias, 2011). HIST1H1C mRNA participates in regulating telomere length homeostasis. Aside from the protein-related product, a 15-nt long region in the ORF region (nt334–nt348) is attributed to HIST1H1C-mRNA-mediated biological activity, through complementation with the terminal stem-loop sequence of the P6b region of hTR, in a base-pairing pattern. These results extend the functional potency of mRNA ORF regions in a non-traditional and noncoding direction (Ivanyi-Nagy et al., 2018). In terms of the 5′ UTR, there are only two examples. VEGF is a key regulator of angiogenesis during embryonal and cancerous development, and this regulatory function is closely correlated with the 5′ UTR. vegf mRNA has an unusually long 5′ UTR of 1,038 nucleotides, and contains two IRES, resulting in an intricate regulation of VEGF expression. In addition, the presence of the 5′ UTR of vegf mRNA alone in tumor cells could promote the expression of anti-apoptotic genes but repress pro-apoptotic genes, suggesting an anti-apoptotic role of the vegf 5′ UTR, and demonstrating its potential as a target for cancer treatment. To the best of our knowledge, the 5′ UTR of vegf mRNA represents the only example of an mRNA UTR which can promote tumor progression (Akiri et al., 1998; Huez et al., 1998; Masuda et al., 2008). The c-myc P0 transcript is an isoform transcript from the promoter 0 (P0) of the c-myc gene, which has an extra ˜639-nucleotide extension of the 5′ UTR when compared with two major isoforms (P1 and P2) of c-myc mRNA. Ectopic expression of the 5′ UTR of the c-myc P0 transcript alone in HeLa cells results in significantly increased expression of the c-Myc1 (p67) and c-Myc2 (p64) proteins as well as incremental apoptosis sensitivity, but decreased tumorigenicity, all of which are likely attributable to competitive regulation of gene expression in the c-myc locus. These results demonstrate that the 5′ UTR potentially functions in trans to perform gene regulation (Blume et al., 2003).

Perspective

In recent years, researchers have begun to pay close attention to the development of bifunctional RNAs, and have discussed the evolved roles of RNAs with multiple functions (Dinger et al., 2011; Ulveling et al., 2011; Kageyama et al., 2011; Hubé and Francastel, 2018). In the early stages of such research, researchers discovered individual gene on a case-by-case basis. However, in the last ten years, rapid advances in large-scale detection and identification techniques (such as ribo-seq and MS-based proteomics) have facilitated multi-faceted investigations of genomes and vital processes, thus shedding light on the complex activities of various RNA molecules. Bifunctional RNAs raise questions about the concept of a gene, in terms of whether RNA, both coding and noncoding, is an independent gene type or a convergence of coding and noncoding genes which occurred during evolution. In this review, we intend to not only investigate the current status of bifunctional RNAs as reported in recent years, but also discuss the potential pervasiveness of bifunctional RNAs from a global perspective in terms of large-scale data. With recent estimates of ribosome profiling, small peptides encoded by lncRNAs have significantly expanded the extent and diversity of the proteome, and predictions suggested that a large fraction of the annotated lncRNAs in various eukaryotic organisms would be translated with sORFs (Aspden et al., 2014; Ruiz-Orera et al., 2014; Mackowiak et al., 2015; Olexiouk et al., 2016). Proteogenomic evidence has confirmed that many small peptides which stem from regions of lncRNA genes are expressed differentially in different cell types and during different developmental/disease stages, although their functions are somewhat enigmatic (Nesvizhskii, 2014; Zhu et al., 2018). However, other studies have revealed that mRNAs could also be involved in cellular regulatory processes in a coding-independent manner (Nam et al., 2016). The results from a large-scale RNA structure analysis revealed that the secondary structures of mRNA have an essential regulatory effect on its maturation and stability, even for the evolutionarily conserved RNA silencing pathways of eukaryotes, suggesting that mRNAs partially retain the functionality of structure that exists in many RNA molecules (Katz and Burge, 2003; Li et al., 2012; Taggart et al., 2012). All these facts indicate that the coding potential and biological roles of mRNAs and lncRNAs could be switched in some cases, implying a conceptual blurring between coding and noncoding genes. Many lncRNAs share similar features with classical mRNAs, such as transcription by polymerase II with a 5′-cap and 3′-polyadenylated tail, and frequent accumulation in the cytoplasm (van Heesch et al., 2014). Therefore, when associated with ribosomes, sORFs embedded in lncRNAs have a significant chance to be translated to peptides. The peptides derived from lncRNAs have a relatively shorter chain length and weaker conservation across different species, and this is consistent with the original lncRNAs which often have few introns, a low expression level and weak phylogenetic conservation (Cabili et al., 2011; Derrien et al., 2012; Kutter et al., 2012; Necsulea et al., 2014). From the perspective of proteins driving evolution, these peptides are likely considered to be an important source for new protein (Ruiz-Orera et al., 2014). Previously reported experimental evidence indicates that noncoding RNAs expressed at low levels could contribute to the birth of novel protein coding genes (Levine et al., 2006; Cai et al., 2008; Reinhardt et al., 2013). Given that several lncRNA-derived peptides have been demonstrated to play essential roles in many biological activities, it is worth investigating the putative significance of the generation of these lncRNA-derived peptides in gene evolution, expression and regulation. However, in view of the huge quantity, diversified mechanisms of action, and intricate functions of lncRNAs, it is inappropriate to regard lncRNAs just as a pool for evolved peptides. In terms of RNA alone, its roles are diverse, including the potential to be retro-transcribed into DNA, or to act as an enzyme to participate in complex biochemical processes (Cech, 1986). Moreover, random RNA sequences can inoculate structurally complex and highly active RNA ligases, suggesting that randomness can produce functionality (Ekland et al., 1995). Therefore, it is very likely that RNA molecules alone comprise abundant genetic information, such as particular structural features and ultra-conservative sequence elements, which could regulate the timing and place of gene expression during cellular differentiation and development. In recent decades, because of the addition of the huge family of noncoding genes, RNAs have provoked great interest for their mysterious roles in organisms. lncRNA-encoded peptides expand the horizon of functional mechanisms for these bio-macromolecules. To date, thousands of peptide products have been identified in human cells, with limited understanding of their function. The current review has summarized the recently discovered micropeptides implicated in various biological processes. We also discussed the potential noncoding roles of mRNAs as a regulator. The continued discovery and functional characterization of bifunctional RNAs will provide new insights into important cellular processes and organismal evolution.

Author Contributions

Both authors have made a substantial, direct and intellectual contribution to the work, and approved it for publication.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  137 in total

1.  Non-coding RNAs: the architects of eukaryotic complexity.

Authors:  J S Mattick
Journal:  EMBO Rep       Date:  2001-11       Impact factor: 8.807

2.  A rescue factor abolishing neuronal cell death by a wide spectrum of familial Alzheimer's disease genes and Abeta.

Authors:  Y Hashimoto; T Niikura; H Tajima; T Yasukawa; H Sudo; Y Ito; Y Kita; M Kawasumi; K Kouyama; M Doyu; G Sobue; T Koide; S Tsuji; J Lang; K Kurokawa; I Nishimoto
Journal:  Proc Natl Acad Sci U S A       Date:  2001-05-22       Impact factor: 11.205

3.  Viroids are single-stranded covalently closed circular RNA molecules existing as highly base-paired rod-like structures.

Authors:  H L Sanger; G Klotz; D Riesner; H J Gross; A K Kleinschmidt
Journal:  Proc Natl Acad Sci U S A       Date:  1976-11       Impact factor: 11.205

4.  Soybean ENOD40 encodes two peptides that bind to sucrose synthase.

Authors:  Horst Rohrig; Jurgen Schmidt; Edvins Miklashevichs; Jeff Schell; Michael John
Journal:  Proc Natl Acad Sci U S A       Date:  2002-02-12       Impact factor: 11.205

5.  Comparative genomics of the eukaryotes.

Authors:  G M Rubin; M D Yandell; J R Wortman; G L Gabor Miklos; C R Nelson; I K Hariharan; M E Fortini; P W Li; R Apweiler; W Fleischmann; J M Cherry; S Henikoff; M P Skupski; S Misra; M Ashburner; E Birney; M S Boguski; T Brody; P Brokstein; S E Celniker; S A Chervitz; D Coates; A Cravchik; A Gabrielian; R F Galle; W M Gelbart; R A George; L S Goldstein; F Gong; P Guan; N L Harris; B A Hay; R A Hoskins; J Li; Z Li; R O Hynes; S J Jones; P M Kuehl; B Lemaitre; J T Littleton; D K Morrison; C Mungall; P H O'Farrell; O K Pickeral; C Shue; L B Vosshall; J Zhang; Q Zhao; X H Zheng; S Lewis
Journal:  Science       Date:  2000-03-24       Impact factor: 47.728

6.  A steroid receptor coactivator, SRA, functions as an RNA and is present in an SRC-1 complex.

Authors:  R B Lanz; N J McKenna; S A Onate; U Albrecht; J Wong; S Y Tsai; M J Tsai; B W O'Malley
Journal:  Cell       Date:  1999-04-02       Impact factor: 41.582

7.  Complete genome sequence of Pseudomonas aeruginosa PAO1, an opportunistic pathogen.

Authors:  C K Stover; X Q Pham; A L Erwin; S D Mizoguchi; P Warrener; M J Hickey; F S Brinkman; W O Hufnagle; D J Kowalik; M Lagrou; R L Garber; L Goltry; E Tolentino; S Westbrock-Wadman; Y Yuan; L L Brody; S N Coulter; K R Folger; A Kas; K Larbig; R Lim; K Smith; D Spencer; G K Wong; Z Wu; I T Paulsen; J Reizer; M H Saier; R E Hancock; S Lory; M V Olson
Journal:  Nature       Date:  2000-08-31       Impact factor: 49.962

8.  A small, novel protein highly conserved in plants and animals promotes the polarized growth and division of maize leaf epidermal cells.

Authors:  Mary J Frank; Laurie G Smith
Journal:  Curr Biol       Date:  2002-05-14       Impact factor: 10.834

9.  A circularized sodium-calcium exchanger exon 2 transcript.

Authors:  X F Li; J Lytton
Journal:  J Biol Chem       Date:  1999-03-19       Impact factor: 5.157

10.  The sequence of the human genome.

Authors:  J C Venter; M D Adams; E W Myers; P W Li; R J Mural; G G Sutton; H O Smith; M Yandell; C A Evans; R A Holt; J D Gocayne; P Amanatides; R M Ballew; D H Huson; J R Wortman; Q Zhang; C D Kodira; X H Zheng; L Chen; M Skupski; G Subramanian; P D Thomas; J Zhang; G L Gabor Miklos; C Nelson; S Broder; A G Clark; J Nadeau; V A McKusick; N Zinder; A J Levine; R J Roberts; M Simon; C Slayman; M Hunkapiller; R Bolanos; A Delcher; I Dew; D Fasulo; M Flanigan; L Florea; A Halpern; S Hannenhalli; S Kravitz; S Levy; C Mobarry; K Reinert; K Remington; J Abu-Threideh; E Beasley; K Biddick; V Bonazzi; R Brandon; M Cargill; I Chandramouliswaran; R Charlab; K Chaturvedi; Z Deng; V Di Francesco; P Dunn; K Eilbeck; C Evangelista; A E Gabrielian; W Gan; W Ge; F Gong; Z Gu; P Guan; T J Heiman; M E Higgins; R R Ji; Z Ke; K A Ketchum; Z Lai; Y Lei; Z Li; J Li; Y Liang; X Lin; F Lu; G V Merkulov; N Milshina; H M Moore; A K Naik; V A Narayan; B Neelam; D Nusskern; D B Rusch; S Salzberg; W Shao; B Shue; J Sun; Z Wang; A Wang; X Wang; J Wang; M Wei; R Wides; C Xiao; C Yan; A Yao; J Ye; M Zhan; W Zhang; H Zhang; Q Zhao; L Zheng; F Zhong; W Zhong; S Zhu; S Zhao; D Gilbert; S Baumhueter; G Spier; C Carter; A Cravchik; T Woodage; F Ali; H An; A Awe; D Baldwin; H Baden; M Barnstead; I Barrow; K Beeson; D Busam; A Carver; A Center; M L Cheng; L Curry; S Danaher; L Davenport; R Desilets; S Dietz; K Dodson; L Doup; S Ferriera; N Garg; A Gluecksmann; B Hart; J Haynes; C Haynes; C Heiner; S Hladun; D Hostin; J Houck; T Howland; C Ibegwam; J Johnson; F Kalush; L Kline; S Koduru; A Love; F Mann; D May; S McCawley; T McIntosh; I McMullen; M Moy; L Moy; B Murphy; K Nelson; C Pfannkoch; E Pratts; V Puri; H Qureshi; M Reardon; R Rodriguez; Y H Rogers; D Romblad; B Ruhfel; R Scott; C Sitter; M Smallwood; E Stewart; R Strong; E Suh; R Thomas; N N Tint; S Tse; C Vech; G Wang; J Wetter; S Williams; M Williams; S Windsor; E Winn-Deen; K Wolfe; J Zaveri; K Zaveri; J F Abril; R Guigó; M J Campbell; K V Sjolander; B Karlak; A Kejariwal; H Mi; B Lazareva; T Hatton; A Narechania; K Diemer; A Muruganujan; N Guo; S Sato; V Bafna; S Istrail; R Lippert; R Schwartz; B Walenz; S Yooseph; D Allen; A Basu; J Baxendale; L Blick; M Caminha; J Carnes-Stine; P Caulk; Y H Chiang; M Coyne; C Dahlke; A Deslattes Mays; M Dombroski; M Donnelly; D Ely; S Esparham; C Fosler; H Gire; S Glanowski; K Glasser; A Glodek; M Gorokhov; K Graham; B Gropman; M Harris; J Heil; S Henderson; J Hoover; D Jennings; C Jordan; J Jordan; J Kasha; L Kagan; C Kraft; A Levitsky; M Lewis; X Liu; J Lopez; D Ma; W Majoros; J McDaniel; S Murphy; M Newman; T Nguyen; N Nguyen; M Nodell; S Pan; J Peck; M Peterson; W Rowe; R Sanders; J Scott; M Simpson; T Smith; A Sprague; T Stockwell; R Turner; E Venter; M Wang; M Wen; D Wu; M Wu; A Xia; A Zandieh; X Zhu
Journal:  Science       Date:  2001-02-16       Impact factor: 47.728

View more
  40 in total

Review 1.  A simple guide to de novo transcriptome assembly and annotation.

Authors:  Venket Raghavan; Louis Kraft; Fantin Mesny; Linda Rigerte
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

2.  A novel algorithm for ranking RNA structure candidates.

Authors:  Anastacia Wienecke; Alain Laederach
Journal:  Biophys J       Date:  2021-12-10       Impact factor: 4.033

3.  Exploring the Peptide Potential of Genomes.

Authors:  Chris Papadopoulos; Nicolas Chevrollier; Anne Lopes
Journal:  Methods Mol Biol       Date:  2022

Review 4.  Significance of lncRNA abundance to function.

Authors:  Ioannis Grammatikakis; Ashish Lal
Journal:  Mamm Genome       Date:  2021-08-18       Impact factor: 2.957

5.  Characterizing miRNA-lncRNA Interplay.

Authors:  Dimitra Karagkouni; Anna Karavangeli; Maria D Paraskevopoulou; Artemis G Hatzigeorgiou
Journal:  Methods Mol Biol       Date:  2021

6.  Genomic Landscapes of Noncoding RNAs Regulating VEGFA and VEGFC Expression in Endothelial Cells.

Authors:  Isidore Mushimiyimana; Vanesa Tomas Bosch; Nihay Laham-Karam; Minna U Kaikkonen; Henri Niskanen; Nicholas L Downes; Pierre R Moreau; Kiley Hartigan; Seppo Ylä-Herttuala
Journal:  Mol Cell Biol       Date:  2021-06-23       Impact factor: 4.272

Review 7.  Modern epigenetics methods in biological research.

Authors:  Yuanyuan Li
Journal:  Methods       Date:  2020-07-06       Impact factor: 3.608

Review 8.  Brain Long Noncoding RNAs: Multitask Regulators of Neuronal Differentiation and Function.

Authors:  Sarva Keihani; Verena Kluever; Eugenio F Fornasiero
Journal:  Molecules       Date:  2021-06-28       Impact factor: 4.411

Review 9.  Impact of Genetic Variation in Gene Regulatory Sequences: A Population Genomics Perspective.

Authors:  Manas Joshi; Adamandia Kapopoulou; Stefan Laurent
Journal:  Front Genet       Date:  2021-07-02       Impact factor: 4.599

Review 10.  Long Non-coding RNAs: Mechanisms, Experimental, and Computational Approaches in Identification, Characterization, and Their Biomarker Potential in Cancer.

Authors:  Anshika Chowdhary; Venkata Satagopam; Reinhard Schneider
Journal:  Front Genet       Date:  2021-07-01       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.