Literature DB >> 23696037

On the classification of long non-coding RNAs.

Lina Ma1, Vladimir B Bajic, Zhang Zhang.   

Abstract

Long non-coding RNAs (lncRNAs) have been found to perform various functions in a wide variety of important biological processes. To make easier interpretation of lncRNA functionality and conduct deep mining on these transcribed sequences, it is convenient to classify lncRNAs into different groups. Here, we summarize classification methods of lncRNAs according to their four major features, namely, genomic location and context, effect exerted on DNA sequences, mechanism of functioning and their targeting mechanism. In combination with the presently available function annotations, we explore potential relationships between different classification categories, and generalize and compare biological features of different lncRNAs within each category. Finally, we present our view on potential further studies. We believe that the classifications of lncRNAs as indicated above are of fundamental importance for lncRNA studies, helpful for further investigation of specific lncRNAs, for formulation of new hypothesis based on different features of lncRNA and for exploration of the underlying lncRNA functional mechanisms.

Entities:  

Keywords:  RNA transcripts; lncRNA; lncRNA classification; long non-coding RNA

Mesh:

Substances:

Year:  2013        PMID: 23696037      PMCID: PMC4111732          DOI: 10.4161/rna.24604

Source DB:  PubMed          Journal:  RNA Biol        ISSN: 1547-6286            Impact factor:   4.652


Introduction

In contrast to a small proportion of the mammalian genome (e.g., human, mouse) that are transcribed into mRNAs, the vast majority of the genome is transcribed into what was previously regarded as “dark matter”—non-coding RNAs (ncRNAs) that do not encode information about proteins.- Among these ncRNAs, long ncRNAs (lncRNAs) represent the most prevalent and functionally diverse class.- There is no definition of lncRNA that is based on biological argumentation and widely accepted in the community. The most commonly used definition is based on the threshold of 200 nucleotides (nt) of the RNA length.,, It conventionally divides ncRNAs into lncRNAs that have more than 200 nt in length and the remaining ones that are considered “small” RNAs. Small ncRNAs include many different RNAs, such as microRNAs (miRNAs), small nucleolar RNAs (snoRNAs), piwiRNAs (piRNAs) (Taft et al., 2010). Undoubtedly, the definition of lncRNA merely based on length is arbitrary. One attempt to distinguish lncRNAs from small ncRNAs, based more on the biological argumentation is proposed by Amaral et al. defining lncRNAs as those ncRNAs that function either as primary or spliced transcripts, independent of extant known classes of small ncRNAs. Therefore, there are some lncRNAs that do not exceed the arbitrary threshold in length (such as BC1and snaR, which are less than or close to 200 nt but included in lncRNAdb). LncRNAs are observed in a large diversity of species, including animals,, plants, yeast, prokaryotes and even viruses. However, lncRNAs are poorly conserved among different species when compared with the well-studied RNAs (such as mRNAs, miRNAs, snoRNAs),,, invoking uncertainty about whether a given lncRNA is functional at all due to poor interspecies conservation, or it conveys functional species-specific characteristics. In addition, lncRNAs are usually low expressed,,, making them to look like more as transcriptional noise. Despite this, a lot of evidence has accumulated showing that lncRNAs play a significant role in a wide variety of important biological processes,, including transcription,, splicing,, translation,, protein localization,, cellular structure integrity,, imprinting,- cell cycle, and apoptosis,, stem cell pluripotency and reprogramming and heat shock response., It has been suggested that lncRNAs may regulate cancer progression and development of many other human diseases. Moreover, a considerable number of lncRNAs are 3′ polyadenylated, 5′ capped, multi-exonic,, and exhibit transcriptional activation activity similar to that of mRNAs.,, As a consequence, all these functions and biological features of lncRNAs make them interesting and important research topic. Recent advances in experimental and computational technologies make it feasible to conduct deep mining on more and more transcribed sequences.,,- At present, there are 73,370 lncRNA entries from 1,239 organisms according to NONCODE v3.0 (a database of literature documented lncRNAs). Conversely, among all these lncRNAs, only a small proportion (less than 200 according to LncRNAdb, a database of lncRNA annotation) has been functionally annotated. To better understand their functional significance, it helps to classify lncRNAs into different groups that are useful for exploring their underlying mechanisms of actions, for formulating new hypotheses and for providing insights in differences of such major classes of lncRNAs. Here, we summarize classification methods of lncRNAs according to their different features as discussed in what follows, including their (1) genome location and context, (2) exerted effect on DNA sequences, (3) mechanism of functioning and (4) targeting mechanism. Finally, we provide our perspectives on potential further studies.

Genomic Location and Context

Intergenic lncRNAs and intronic lncRNAs

There is a large number of non-coding regions (accounting for 98–99% in the human genome) interspersed between coding regions., Since lncRNAs are located and transcribed from different genomic locations, those transcribed from intergenic regions are named intergenic lncRNAs (Fig. 1A) and, in contrast, those transcribed entirely from introns of protein-coding genes are named intronic lncRNAs (Fig. 1B).

Figure 1. Genomic location and context of lncRNAs. Protein-coding genes and their exons are represented by blue color, while lncRNAs and their exons are represented by red color. Panels are mainly based on lncRNA location annotation of GENCODE. (A) Intergenic lncRNA, transcribed intergenically from both strands. (B) Intronic lncRNA, transcribed entirely from introns of protein-coding genes. (C) Sense lncRNA, transcribed from the sense strand of protein-coding genes and contain exons from protein-coding genes, overlapping with part of protein-coding genes or covering the entire sequence of a protein-coding gene through an intron. (D) Antisense lncRNA, transcribed from the antisense strand of protein-coding genes, overlapping with exonic or intronic regions or covering the entire protein-coding sequence through an intron.

Figure 1. Genomic location and context of lncRNAs. Protein-coding genes and their exons are represented by blue color, while lncRNAs and their exons are represented by red color. Panels are mainly based on lncRNA location annotation of GENCODE. (A) Intergenic lncRNA, transcribed intergenically from both strands. (B) Intronic lncRNA, transcribed entirely from introns of protein-coding genes. (C) Sense lncRNA, transcribed from the sense strand of protein-coding genes and contain exons from protein-coding genes, overlapping with part of protein-coding genes or covering the entire sequence of a protein-coding gene through an intron. (D) Antisense lncRNA, transcribed from the antisense strand of protein-coding genes, overlapping with exonic or intronic regions or covering the entire protein-coding sequence through an intron. It is suggested that intergenic lncRNAs and intronic lncRNAs are most likely regulated through different transcription activation mechanisms and may have different poly(A) modifications and manifest activities in different cellular locations. However, only a small portion of intronic lncRNAs has been explored regarding their function. In contrast, there is a large number of long intergenic non-coding RNAs (lincRNAs) that function through different types of mechanisms: cis or trans transcriptional regulation (described below), translational control, splicing regulation, other post-transcriptional regulation, etc. (see Table 1). Also, lincRNAs have been extensively studied about their expression feature and conservation among species.

Table 1. Functional mechanisms and genomic locations of lncRNAs

Function mechanism Gene symbol
Transcriptional regulationa  
 CisAird, Alpha 250/ Alpha 280d, ANRILd, Beta-globin transcriptsb, Beta-MHC antisense transcriptsd, CAR Intergenic 10d, CCND1 associated ncRNAsb, COLDAIRe, COOLAIRd, DHFR upstream transcriptsd, Emx2osd, Evf2d, fbp1+ promoter RNAsb, GAL10-ncRNAd, H19b, H19 antisensec, H19 upstream conserved 1 and 2b, H19 ICR ncRNAs, HOTAIRM1b, HOTTIPd, Hoxa11asd, ICR1c, Kcnq1ot1d, Khps1ad, L1PA16b, LINoCRb, MEG3b, Mistralb, Msx1asd, Nespasd, ncR-Uparb, PHO5 lncRNAd, PHO84 antisensed, pRNAb, PWR1d, RTLd, SRG1b,c, TEA ncRNAsc, TIR1axutd, TPO1axutd, Tsixb, Xistb
   
 Trans7SKb, B2 SINE RNAb, GAS5b, HOTAIRd, Jpxb, LXRBSVc, PR antisense transcriptsd, VL30 RNAs
   
 UnclearAdapt33b, antiPeg11d, Gtl2-asd, HOXA3asd, HOXA6asd, linc1242b, linc1257b, linc1368b, linc1547b, linc1582b, linc1609b, linc1610b, lincRNA-p21b, lincRNA-RoR b, Malat1-asb, MEG9b, NDM29e, NEAT1b, PANDAb, PCAT-1 b, Rianb, SatIII transcriptsb, SNHG3c, SRAc, Tmevpg1b, TncRNAb, TUG1b
   
Translational regulation BC1b, BC200b, Gadd7, SNHG1b, SNHG6d, snaRb, Zeb2NATd
   
Splicing regulation MIATb, LUSTd, Malat1b, SAFd, VL30 RNAs, Zeb2NATd
   
Other post-transcriptional regulation 21Ab, 1/2-sbsRNA1 c, At4 b, BACE1ASd, CDR1 ASd, Dio3os b, E2F4 antisensed, Emx2os d, Gadd7, H19 b, HULCb, HSUR1 and HSUR2, IPS1b, KRASP1b, Linc-MD1 b, psvA antisense RNAd, PTENP1b, tie-1asd, WT1-ASd
   
Other functional mechanisms 7SLb, Beta 2.7 RNA, Centromeric α-satellite RNAb, ENOD40c, EBER1 and EBER2 RNAs, G22b, L1PA16b, hsr omega transcriptsb, meiRNA, Maternal RNA templatesc,d, Maternal somatic nucleus RNAsc,d, MER11Cb, NRONd, rncs-1b, roX1 and roX2b, sfRNA, TERRA, TERCb, VAI and VAII RNAs, VegT RNAc, Xlsirts, Y RNAsb

Note: lncRNAs listed are collected from the database of lncRNAdb and published papers. aCis, lncRNAs that regulate expression of genes in close genomic proximity; Trans, lncRNAs that regulate expression of distant genes; Unclear, lncRNAs that regulate gene expression at transcriptional level, either in cis or trans. bLincRNAs. cSense lncRNAs. dAntisense lncRNAs. eIntronic lncRNAs.

Note: lncRNAs listed are collected from the database of lncRNAdb and published papers. aCis, lncRNAs that regulate expression of genes in close genomic proximity; Trans, lncRNAs that regulate expression of distant genes; Unclear, lncRNAs that regulate gene expression at transcriptional level, either in cis or trans. bLincRNAs. cSense lncRNAs. dAntisense lncRNAs. eIntronic lncRNAs. It is found that lincRNAs are transcriptionally activated similarly to mRNAs,,,, as they are more conserved than introns,, and antisense transcripts, more tissue-specifically expressed than protein-coding genes,, and more stable than intronic lncRNAs. “K4-K36” domain (with histone H3K4 trimethylation at their 5′ end and histone H3K36 trimethylation in the body of the gene), an indicator of active transcription in protein-coding genes, is found to prevalently exist in transcriptionally active lincRNAs.,,, Approximately 70% of lincRNAs with “K4-K36” domain show evidence of RNA transcription, which is similar to the proportion (~72%) of protein-coding genes. Most importantly, nearly 70% of the transcription active domains (K4-K36 domain) of lincRNAs in human are conserved in the orthologous region of mouse, which is comparable to the corresponding proportion (80%) of protein-coding genes. In addition, lincRNAs are found to be conserved across multiple vertebrate species. The above evidences (active transcription, a degree of domain conservation, tissue-specific expression, stability) strongly indicate the functional importance of lincRNAs. In reality, lincRNAs are found to perform important functions in many cellular processes, from embryonic stem cell pluripotency to cell proliferation and cancer progression.,,,

Sense and antisense lncRNAs

Sense lncRNAs are transcribed from the sense strand of protein-coding genes, containing exons from protein-coding genes. They may overlap with part of protein-coding genes, or cover the entire sequence of a protein-coding gene (Fig. 1C). Antisense lncRNAs, to the contrary, are transcribed from the antisense strand of protein-coding genes. According to GENCODE (a database of manually curated lncRNAs) annotation, antisense lncRNAs may appear in three scenarios: (1) transcripts from the antisense strand of protein-coding genes overlap an exon of a sense gene through lncRNAs’ exons, (2) transcripts from the intron of a sense gene do not have exon-exon overlap with this sense gene and (3) transcripts cover the entire sequence of a sense gene through an intron (Fig. 1D). Sense and antisense lncRNAs are proved to be genuine transcripts by strand-specific assay or sequencing,, qRT-PCR validation and by sequencing 5′ and 3′ ends of full-length cDNA and, thus, they do not represent truncated CDSs or transcriptional noise. Most of the lncRNAs that come from protein-coding genes or the antisense of the protein-coding genes can be obtained using CAGE (cap-analysis gene expression) and oligo-dT guided reverse transcription, suggesting that they also possess mRNA-like features of 3′ polyadenylation and 5′ capping. Moreover, sense and antisense lncRNAs can also be multi-exonic., Many antisense lncRNAs function through different types of mechanisms (similar to lincRNAs) (Table 1). It has been found that as many as 87% coding transcripts have antisense partners in the mouse genome and ~32% of the human lncRNAs are antisense to coding genes, suggesting that antisense regulation is likely to be commonly utilized. However, in comparison with lincRNAs and antisense lncRNAs, sense lncRNAs have been less explored for their functions (Table 1). It is suggested that most lncRNAs tend not to have protein-coding potential. Intriguingly, some sense lncRNAs are special in the sense that they can function as both RNA and protein-coding gene. For instance, SRA (steroid receptor RNA activator) can translate into protein, and the RNA sequence can also act as a scaffold for several co-activator and repressor proteins to form complexes that regulate gene transcription; ENOD40 (early nodulin 40) can translate into proteins and is also needed for correct subcellular localization of RNP particles in legume plants. These findings have challenged our understanding of gene classification and, in the meantime, broadened the known roles of lncRNA. Such special relationship between the sense lncRNAs and protein-coding genes may provide novel insights into the evolution of gene function. Present studies are mainly focused on lincRNAs and antisense lncRNAs (especially lincRNAs), though less is known about intronic lncRNAs and sense lncRNAs. Some biological features of lncRNA are quite prominent, such as the high conservation of lincRNAs among mammals.,, Also, another evidence, albeit not extensive, may indicate the special feature of the sense lncRNAs’ coding potential., Additionally, as antisense lncRNAs and sense lncRNAs are correlated differently with coding genes, they are most likely to exert different effects on gene locus or mRNAs. Therefore, genomic location and context can be used for classification of lncRNA, though a classification of lncRNA using exclusively genomic localization and context may not be fully adequate.

Effects Exerted on DNA Sequences

It has been found that lncRNAs are predominately localized in nucleus and chromatin, suggesting that lncRNAs may have a significant impact on DNA sequences. Also, a large proportion of lncRNAs are involved in transcriptional regulation (~42% of the 182 assessed entries according to lncRNAdb; Table 1). Therefore, it is meaningful to classify lncRNAs based on their effects exerted on DNA sequences: cis-lncRNAs (cis-acting lncRNAs) that regulate the expression of genes in close genomic proximity and trans-lncRNAs (trans-acting lncRNAs) that regulate the expression of distant genes (Fig. 2A).

Figure 2. Functional mechanisms of lncRNAs. LncRNA is represented by a letter “L” and a number appended. (A) Transcriptional regulation: We listed examples of cis-lncRNA (L1 and L3) and trans-lncRNA (L2). L1 is transcribed from the promoter region of gene A and its binding to promoter of gene A blocks the binding of transcription factors, thus affecting transcription initiation of gene A. L3 functions to modify chromatin protein in its vicinity through recruiting the complex of PRC2. L2 influences transcription of gene B from a distant region through interaction with transcription factor or RNA polymerase. Therefore, L1 and L2 also function through transcriptional interference, whereas L3 functions through chromatin modification. (B) Post-transcriptional regulation: L1, L2 and L3 all influence gene splicing. Specifically, L1 binds to intronic area to inhibit binding of splicing factor, L2 functions to modulate the pool of modified (such as phosphorylation) splicing factor and L3 binds to splicing factor to block spliceosomal complex formation. L4 interacts with translational factors to inhibit translation. L5 and L6 are two examples of ceRNAs, which interact directly or indirectly with miRNAs. L5 binds to miRNA and, thus, inhibits the binding of miRNA to the 3′ UTR of target mRNA. L6 binds to the 3′ UTR of target mRNA, which also blocks the binding of miRNA to the target gene. L7 serves as natural antisense inhibitor to promote degradation of mRNA. (C) Other functional mechanisms. L1 is involved in protein transportation and L2 binds to Dicer to influence RNA interference.

Figure 2. Functional mechanisms of lncRNAs. LncRNA is represented by a letter “L” and a number appended. (A) Transcriptional regulation: We listed examples of cis-lncRNA (L1 and L3) and trans-lncRNA (L2). L1 is transcribed from the promoter region of gene A and its binding to promoter of gene A blocks the binding of transcription factors, thus affecting transcription initiation of gene A. L3 functions to modify chromatin protein in its vicinity through recruiting the complex of PRC2. L2 influences transcription of gene B from a distant region through interaction with transcription factor or RNA polymerase. Therefore, L1 and L2 also function through transcriptional interference, whereas L3 functions through chromatin modification. (B) Post-transcriptional regulation: L1, L2 and L3 all influence gene splicing. Specifically, L1 binds to intronic area to inhibit binding of splicing factor, L2 functions to modulate the pool of modified (such as phosphorylation) splicing factor and L3 binds to splicing factor to block spliceosomal complex formation. L4 interacts with translational factors to inhibit translation. L5 and L6 are two examples of ceRNAs, which interact directly or indirectly with miRNAs. L5 binds to miRNA and, thus, inhibits the binding of miRNA to the 3′ UTR of target mRNA. L6 binds to the 3′ UTR of target mRNA, which also blocks the binding of miRNA to the target gene. L7 serves as natural antisense inhibitor to promote degradation of mRNA. (C) Other functional mechanisms. L1 is involved in protein transportation and L2 binds to Dicer to influence RNA interference.

Cis-lncRNAs

It seems plausible that cis-lncRNAs function through transcriptional interference or chromatin modification, ranging from yeast [e.g., SRG1 (regulatory gene 1) RNAs,] to plants [e.g., COLDAIR (cold assisted intronic noncoding RNA) ] to mammals [e.g., DHFR (dihydrofolate reductase) upstream transcripts,, Xist (X inactive-specific transcript)] (detailed below). Regarding the mechanism of transcriptional interference, lncRNAs may influence the transcription activity of target genes through promoter binding to block PIC (preinitiation complex) formation, or by interacting with transcription factors,, (Fig. 2A). Such cis-lncRNAs may be transcribed from genes’ promoter regions. For instance, DHFR upstream transcripts, ~0.8–7.3 kb lncRNAs from the promoter region of DHFR, can form stable triplex structures with the promoter of DHFR, and interact with TFIIB to efficiently dissociate PIC. Likewise, SRG1 RNAs, ~0.4–1.9 kb lncRNAs from the promoter region of SER3 (Ser3p), are found to cover the promoter of SER3 coding gene to prevent transcription factor binding to the promoter and, thus, repress the expression of SER3., Also, the transcription of lncRNAs in promoter regions may induce chromatin remodeling and, thus, activate the downstream protein-coding genes’ transcription, such as fbp1 promoter RNAs. Cis-lncRNAs that function through chromatin modification often recruit chromatin modification complexes, e.g., PRC (polycomb repressive complex) or Rpd3S HDAC (Rpd3 small histone deacetylase complexes). The most studied chromatin modification complex is PRC and one well-known example is Xist (a 19 kb lncRNA in human), which binds to PRC2 to induce H3K27me3 modification and, thus, leads to transcriptional silencing of genes on the X chromosome. Similar examples can be found in MEG3 (maternally expressed 3) (a ~1.6 kb lncRNA) and COLDAIR (a ~1.1 kb lncRNA). Also, there are other chromatin modification complexes that are recruited by lncRNAs. GAL10-ncRNA (Gal10p-noncoding RNA) (a ~4 kb lncRNA) has been reported to recruit the Rpd3S HDAC complex, resulting in a decrease in some histone 3 acetylation to repress the expression of GAL1. Although the above-mentioned examples of PRC and Rpd3S HDAC are all implicated in negative regulation of gene expression, interaction of lncRNA with chromatin modification complexes is also involved in positive regulation. A case example is HOTTIP (HOXA transcript at the distal tip) (a ~3.8 kb lncRNA in human) that recruits a MLL chromatin modifying complex to maintain a domain of active chromatin over the 5′ end of HOXA (homeobox A cluster) gene cluster.

Trans-lncRNAs

Although it may be easier for lncRNAs to influence genes in their immediate vicinity probably based on sequence complimentary to the locus from which they are transcribed, lncRNAs can also function in trans-acting mode to target distant gene loci. For instance, HOTAIR (HOX antisense intergenic RNA), a ~2.2 kb lncRNA that is transcribed from the HOXC (homeobox C cluster) gene locus in chromosome 12, can be transported by the Suz-Twelve protein to regulate the homologous target sites at HOXD (homeobox D cluster) gene locus in chromosome 224. Also, HOTAIR are found to bind to many other genomic loci that tend to possess specific DNA motifs and influence gene expression by recruiting chromatin modification complexes., Therefore, unlike cis-lncRNAs, trans-lncRNAs may function independently of sequence complementary to target gene locus. In addition to chromatin modification complexes,, they may bind to transcription elongation factors, or RNA polymerases to affect transcription. It is reported that 7SK RNA (a ~330 bp lncRNA) functions as a central scaffold to coordinate protein-protein interactions in 7SK snRNP (small nuclear ribonucleoproteins), which comprises transcription elongation factor—P-TEFb (positive transcription elongation factor b). This activity consequently leads to repression of transcription elongation at many gene loci., Another lncRNA, B2 SINE RNA, has been found to stably bind to polymerase II complex to block its activity during heat shock response.

Mechanisms of Functioning

While only a small number of lncRNAs has been well documented, it is believed that lncRNAs are involved in a wide variety of cellular molecular functions. According to their mechanisms of functioning, lncRNAs roughly fall into three groups that affect transcriptional regulation, post-transcriptional regulation or other functions (Fig. 2).

Transcriptional regulation

As mentioned above, there is a large number of lncRNAs that regulate gene transcription through transcriptional interference (e.g., DHFR upstream transcripts, SRG1 RNAs, 7SK snRNA, B2 SINE RNA) and chromatin remodeling (e.g., fbp1 (fructose-1,6-bisphosphatase-1) promoter RNAs, Xist, MEG3, GAL10-ncRNA, HOTAIR, HOTTIP and COLDAIR). Therefore, lncRNAs responsible for transcription regulation can be sub-divided according to the mechanism of their functioning: (1) transcriptional interference and (2) chromatin remodeling (Fig. 2A). Besides, there are other related functional mechanisms, for example, the regulation effect: a set of lncRNAs transcribed from enhancers are termed eRNAs (enhancer RNAs) as they positively regulate genes’ transcription, such as ncRNA-a1 (activating long ncRNA 1), Evf-2 (embryonic ventral forebrain-2) RNA, Alpha-250/Alpha-280.

Post-transcriptional regulation

There are two common post-transcriptional regulation mechanisms that lncRNAs get involved in, namely, splicing regulation and translational control (Fig. 2B). LncRNAs that influence mRNA splicing may function through binding to or modulating splicing factors, or directly hybridizing with mRNA sequences to block splicing., MIAT (myocardial infarction associated transcript), a ~9–10 kb lncRNA, contains strong intron branch point sequences (UACUAAC repeats) and is able to bind to SF1 (splicing factor 1) to inhibit splicing and spliceosomal complex formation. Malat1 (metastasis-associated lung adenocarcinoma transcript 1), a ~7 kb lncRNA, can bind to SR splicing factor [serine–arginine (SR)-rich splicing factor] and regulate its distribution in nuclear speckle domains. Also, it is suggested that Malat1 may modulate the pools of phosphorylated SR and, thus, influence alternative splicing of pre-mRNAs. Additionally, other splicing regulation mechanisms may exist. LUST (LUCA-15-specific transcript), a ~1.4–2.4 kb lncRNA, is the antisense transcript of RBM5 (RNA binding motif protein 5) and is hypothesized to regulate the expression of RBM5 splice variants through masking a sense-strand regulatory sequence. LncRNAs that participate in translational control may function through binding to translation factors, or ribosome., There are two lncRNAs, BC1 (brain cytoplasmic RNA 1) and BC200 (200 nt brain cytoplasmic RNA), which can bind eIF4A (eukaryotic translation initiation factor 4A), PABP (poly(A)-binding protein) and other factors, to repress translation initiation by blocking assembly of the required complex., snaR (small NF90-associated RNAs), a cytoplasmic lncRNAs, can bind to ribosome, presumably influencing translation of mRNAs. Gadd7 (growth arrested DNA-damage inducible gene 7), which is associated with active translation, is hypothesized to bind to ribosome. It should be noted that in some cases translation and splicing are associated with each other. Zeb2 (zinc finger E-box-binding homeobox 2) translation requires retention of an intron. Zeb2NAT (Zeb2 natural antisense transcript) (a lncRNA that is more than 1.2 kb), which overlaps the 5′ splicing site of an intron, can inhibit splicing of the intron to allow translation of Zeb2. Aside from splicing regulation and translational control, there are other post-transcriptional regulation mechanisms utilized by lncRNAs. The findings of siRNA (small interfering RNA) mechanism, and competing endogenous RNAs- have opened up new aspects of post-transcriptional regulation and recent studies suggest that lncRNAs are also implicated in these processes, displaying direct siRNA mechanisms, or interacting with miRNAs.- LncRNAs may function as natural antisense inhibitors to promote degradation of mRNA, (Fig. 2B). It has been found that 21A, a ~300 bp lncRNA, which shows high sequence homology to CENP-F (centromere protein F) intronic portions, can reduce CENP-F expression at both mRNA and protein level through antisense inhibitor. 1/2-sbsRNA1 (half-STAU1-binding site RNA1), a ~0.7 kb lncRNA, has been found to bind to mRNAs’ 3′ UTR through Alu elements, and reduces mRNA abundance. Moreover, there are many lncRNAs that interact directly or indirectly with miRNAs to stabilize target mRNAs- (Fig. 2B). These lncRNAs are called ceRNAs (competing endogenous RNAs).- For instance, linc-MD1 (long intergenic ncRNA that is associated with muscle differentiation), a ~0.5 kb lncRNA, acts as sponge/target mimic of miR-133 and miR-135 to regulate the expression of two transcription factors: MAML1 (mastermind-like protein 1) and MEF2C (myocyte-specific enhancer factor 2C), which activate the expression of muscle-specific genes. Similar examples can be found in IPS1 (induced by phosphate starvation 1) RNA and HULC (highly up-reglated in liver cancer) RNA. Some pseudogenes can function as a sponge/target mimic for miRNAs to stabilize their homologous mRNAs, such as KRASP1 and PTENP1, which are pseudogenes of KRAS (V-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog) and PTEN (phosphatase and tensin homolog), respectively., In addition, some antisense lncRNAs may bind to mRNA to mask the binding sites of miRNA and thus stabilize mRNA. BACE1AS (BACE1 antisense RNA), a ~2 kb lncRNA, has been reported to be activated in Alzheimer disease to form an RNA duplex with BACE1 (β-secretase 1) mRNA, which may mask the binding site for miR-485-5p and, thus, prevent translational repression of BACE1 mRNA by miRNA.

Other mechanisms of lncRNA functioning

In addition to transcriptional regulation and post-transcriptional regulation, lncRNAs may function through other mechanisms (Fig. 2C), such as protein localization,, telomere replication, RNA interference, beyond transcription and translation regulation, etc. Considering the limitation of the currently available knowledge, it is difficult to categorize these lncRNAs into some more “stable” groups and, thus, we roughly place them into lncRNAs with “other functional mechanisms.” For instance, meiRNA, a ~0.5 kb lncRNA, is required for nuclear localization of Mei2. ENOD40 RNA is required for correct subcellular localization of RNP particles in legume plants. TERC (telomerase RNA component), which is part of telomerase reverse transcriptase, acts as template to extend telomere during DNA replication in eukaryote. In ciliated protozoa, maternal RNA templates have been found to guide reproducible rearrangement during the transition from germline nucleus to somatic nucleus. In addition, lncRNAs may be involved in RNA interference by regulation of Dicer1. It is suggested that rncs-1 (RNA noncoding and starvation upregulated) reduces Dicer-generated siRNA and affects levels of Dicer-regulated genes.

Targeting Mechanisms of lncRNAs

According to their mode of action, lncRNAs may also be classified based on their targeting mechanisms, mainly associated with the following categories: (1) signal: show cell type-specific expression and respond to diverse stimuli, such as Xist, COLDAIR; (2) decoy: bind and titrate away a protein target, but does not exert any additional functions, such as DHFR upstream transcripts, PANDA; (3) guide: bind proteins and then direct the localization of ribonucleoprotein complex to specific targets, such as Xist, HOTAIR;, (4) scaffold: serve as central platforms to bring together multiple proteins to form ribonucleoprotein complexes, such as HOTAIR,, 7SL., Alternatively, lncRNAs can be grouped based on the types of interactions they make with their targets: RNA-RNA pairings, RNA-DNA hybrids, RNA structure mediated interactions and protein linkers. However, a single targeting archetype in referring to mode of action may not be sufficient to fully describe one lncRNA since one lncRNA may contain multiple archetypes. There are many lncRNAs that are induced by endogenous and exogenous signals to express, and they also can possess the binding sites of chromatin modification complexes (such as PRC) to repress or activate expression of a set of genes, e.g., Xist, Air, COLDAIR, HOTTIP, HOTAIR,, lincRNA-p21. Therefore, these lncRNAs operate in a dual mode as both signal and guide. Some signal lncRNAs may also function as decoys that bind and titrate away a protein target, e.g., PANDA. More complexly, some lncRNAs have more than two archetypes, such as HOTAIR that functions according to three archetypes: as anatomic signal, guiding the chromatin-modifying complexes to the target gene, and as a scaffold for PRC2 and LSD1., Albeit, there are multiple groups of lncRNAs based on different classification methods as mentioned above, different groups appear to be linked closely to one another. LncRNAs that function via transcriptional interference mechanism may target gene loci through RNA-DNA hybrids and operate as the archetypes of decoy, e.g., DHFR upstream transcripts; lncRNAs that affect chromatin modification may target gene loci through RNA structure mediated interaction, which, thus, acts as the archetypes of signal, guide and scaffold, e.g., HOTAIR;, ceRNAs that directly or indirectly regulate mRNA level target miRNAs or mRNAs through RNA-RNA pairing, may also operate as the archetype of decoy, for example, KRASP1 and PTENP1; lncRNAs that regulate splicing may function through RNA-RNA pairing to influence their targets and operate as the archetype of decoy (e.g., Malat1).

Perspectives

Here, we summarized classification methods of lncRNAs according to their four major characteristics (genomic location and context, effects on DNA sequences, functional mechanisms and targeting mechanisms). Based on function annotations presently available, we explored potential relationships between different classification categories and investigated biological features of different lncRNAs within each category. Although lncRNA could be described in terms of other features, such classifications of lncRNAs as summarized here are of fundamental importance for lncRNA studies, helpful for further investigation of specific groups of lncRNA, for generation of new hypothesis based on different lncRNA groups and for exploration of lncRNA underlying functional mechanisms.

Classification of lncRNAs

When studying lncRNAs, it is straightforward to investigate their functional features by classifying them into different groups. However, classification of lncRNAs is highly dependent on the current existing knowledge, thus requiring frequent validation of the classification system, exploring new classification systems and, when necessary, abandoning old ones. One possible example is the definition of lncRNA that is longer than 200 nt. We investigated the length of lncRNAs and compared their distribution between human and mouse (data from NONCODE V3.0). Clearly, lncRNAs can be further divided into different groups based on their length distribution (Fig. 3): small-lncRNA (200~950 nt), medium-lncRNA (950~4,800 nt), large-lncRNA (4,800 nt~). In human, the majority of lncRNAs are small-lncRNAs (58%), whereas the majority of lncRNAs in mouse are medium-lncRNAs (78%). In addition, human genome contains more small-lncRNAs and large-lncRNAs than mouse, but less medium-lncRNAs. However, the number of manually annotated lncRNAs by GENCODE released recently is less than half of NONCODE V3.0,, suggesting the uncertainty of the comparative results. While the differences could be due to the annotation levels, if proved to reflect the genuine situation, such differences may imply distinct evolutionary process of lncRNAs between human and mouse and need further evolutionary analysis.

Figure 3. Length distribution of lncRNAs in human and mouse. LncRNAs are divided into three groups based on their length distribution: small-lncRNA (200~950 nt), medium-lncRNA (950~4,800 nt) and large-lncRNA (4,800 nt~). Density distributions of lncRNA length are shown in (A) and percentages of three lncRNA groups are depicted in (B).

Figure 3. Length distribution of lncRNAs in human and mouse. LncRNAs are divided into three groups based on their length distribution: small-lncRNA (200~950 nt), medium-lncRNA (950~4,800 nt) and large-lncRNA (4,800 nt~). Density distributions of lncRNA length are shown in (A) and percentages of three lncRNA groups are depicted in (B).

Investigation of specific groups of lncRNAs

Classification of lncRNAs is useful for focusing on specific groups. Currently, lincRNAs that recruit chromatin modification complexes have attracted a lot of attention. It is reported that lincRNAs tend to bind to specific chromatin-modifying complexes, so that lincRNA-associated chromatin modification may represent a specific regulation mechanism. Additionally, lincRNAs recruiting the same chromatin-modifying complex may diverge greatly between different species. Therefore, lincRNAs may affect specific variety of target genes in different biological processes, such as PCAT-1, which regulates a variety of prostate cancer implicated genes through recruiting PRC2. These findings shed lights on the functional significance of the lincRNA group (that functions through recruiting PRC2) and stimulate further work on other chromatin-modifying complexes. Notably, there is a large number of antisense lncRNAs, which function through recruiting chromatin modification complexes to regulate transcription activity according to available evidences (Table 1). It has been suggested that most of the PRC2 binding transcripts are antisense and sense lncRNAs rather than lincRNAs. Therefore, it would be desirable for future studies to explore more chromatin complex binding features of the antisense and sense lncRNAs.

Regulation network of lncRNA and small ncRNA

The discovery of lncRNAs and their regulatory roles challenges the initially miRNA-centered regulatory networks, which may help comprehensively understand gene regulation by both small ncRNA and lncRNA and, accordingly, provide new insights into complex processes of gene regulation. As mentioned above, ceRNAs, a group of lncRNAs interacting with miRNAs, can impose an additional level on post-transcriptional regulation.- In addition to those lncRNAs that act as sponge/target mimic of miRNAs,- it should be noted that more than 50% of the sense and antisense transcript pairs may be composed of coding and antisense non-coding transcripts, and the formed RNA duplex may also influence the interaction between miRNAs and their target mRNAs., According to our previous integrative analysis of miRNAs and mRNAs in NSCLC (non-small cell lung carcinoma), we found that most of the target mRNAs do not vary significantly, in spite of the dramatic increase or decrease of their miRNAs, indicating that gene regulation network may be more sophisticated. Considering their similar background in gene expression and function overlap between lncRNA and small ncRNA, it is attractive to investigate whether there is a certain evolutionary association between the two components. It is found that some lncRNAs contain miRNAs in their gene locus.- H19 (gene comes from colon pH19), a ~2.3 kb lncRNA, contains mir-675 in its exon and also serves as a precursor of this miRNA in addition to its transcription regulation activity. Also, lncRNA LOC554202 contains mir-31 in its intron, and both miR-31 and the host lncRNA are found to be lowly expressed in triple-negative breast cancer. It has been found that five lncRNAs, namely, MEG3, MEG8 (maternally expressed 3), MEG9 (maternally expressed 3), antiPeg11 (antisense transcript to Peg11/Rtl1) and Rian (RNA imprinted and accumulated in nucleus), contain a lot of miRNAs and snoRNAs and function through transcriptional regulation; interestingly, these five lncRNAs and their endogenous small ncRNA may target the same gene, but it remains unclear whether those five lncRNAs can serve as precursors of their corresponding miRNAs. In addition, it is suggested that some lncRNAs in human may be preferentially post-processed into snoRNAs., Taken together, future studies focused on this aspect may bring new unexpected insights into the evolutionary relationships between small ncRNA and lncRNA.
  102 in total

1.  A starvation-induced noncoding RNA modulates expression of Dicer-regulated genes.

Authors:  Sabine Hellwig; Brenda L Bass
Journal:  Proc Natl Acad Sci U S A       Date:  2008-08-22       Impact factor: 11.205

Review 2.  Non-coding RNAs: regulators of disease.

Authors:  Ryan J Taft; Ken C Pang; Timothy R Mercer; Marcel Dinger; John S Mattick
Journal:  J Pathol       Date:  2010-01       Impact factor: 7.996

Review 3.  Long noncoding RNAs in cell biology.

Authors:  Michael B Clark; John S Mattick
Journal:  Semin Cell Dev Biol       Date:  2011-01-20       Impact factor: 7.727

4.  A natural antisense transcript regulates Zeb2/Sip1 gene expression during Snail1-induced epithelial-mesenchymal transition.

Authors:  Manuel Beltran; Isabel Puig; Cristina Peña; José Miguel García; Ana Belén Alvarez; Raúl Peña; Félix Bonilla; Antonio García de Herreros
Journal:  Genes Dev       Date:  2008-03-15       Impact factor: 11.361

5.  Most "dark matter" transcripts are associated with known genes.

Authors:  Harm van Bakel; Corey Nislow; Benjamin J Blencowe; Timothy R Hughes
Journal:  PLoS Biol       Date:  2010-05-18       Impact factor: 8.029

6.  Expression of a noncoding RNA is elevated in Alzheimer's disease and drives rapid feed-forward regulation of beta-secretase.

Authors:  Mohammad Ali Faghihi; Farzaneh Modarresi; Ahmad M Khalil; Douglas E Wood; Barbara G Sahagan; Todd E Morgan; Caleb E Finch; Georges St Laurent; Paul J Kenny; Claes Wahlestedt
Journal:  Nat Med       Date:  2008-06-29       Impact factor: 53.440

7.  Hydrogen peroxide induces the expression of adapt15, a novel RNA associated with polysomes in hamster HA-1 cells.

Authors:  D R Crawford; G P Schools; S L Salmon; K J Davies
Journal:  Arch Biochem Biophys       Date:  1996-01-15       Impact factor: 4.013

8.  The transcriptional landscape of the mammalian genome.

Authors:  P Carninci; T Kasukawa; S Katayama; J Gough; M C Frith; N Maeda; R Oyama; T Ravasi; B Lenhard; C Wells; R Kodzius; K Shimokawa; V B Bajic; S E Brenner; S Batalov; A R R Forrest; M Zavolan; M J Davis; L G Wilming; V Aidinis; J E Allen; A Ambesi-Impiombato; R Apweiler; R N Aturaliya; T L Bailey; M Bansal; L Baxter; K W Beisel; T Bersano; H Bono; A M Chalk; K P Chiu; V Choudhary; A Christoffels; D R Clutterbuck; M L Crowe; E Dalla; B P Dalrymple; B de Bono; G Della Gatta; D di Bernardo; T Down; P Engstrom; M Fagiolini; G Faulkner; C F Fletcher; T Fukushima; M Furuno; S Futaki; M Gariboldi; P Georgii-Hemming; T R Gingeras; T Gojobori; R E Green; S Gustincich; M Harbers; Y Hayashi; T K Hensch; N Hirokawa; D Hill; L Huminiecki; M Iacono; K Ikeo; A Iwama; T Ishikawa; M Jakt; A Kanapin; M Katoh; Y Kawasawa; J Kelso; H Kitamura; H Kitano; G Kollias; S P T Krishnan; A Kruger; S K Kummerfeld; I V Kurochkin; L F Lareau; D Lazarevic; L Lipovich; J Liu; S Liuni; S McWilliam; M Madan Babu; M Madera; L Marchionni; H Matsuda; S Matsuzawa; H Miki; F Mignone; S Miyake; K Morris; S Mottagui-Tabar; N Mulder; N Nakano; H Nakauchi; P Ng; R Nilsson; S Nishiguchi; S Nishikawa; F Nori; O Ohara; Y Okazaki; V Orlando; K C Pang; W J Pavan; G Pavesi; G Pesole; N Petrovsky; S Piazza; J Reed; J F Reid; B Z Ring; M Ringwald; B Rost; Y Ruan; S L Salzberg; A Sandelin; C Schneider; C Schönbach; K Sekiguchi; C A M Semple; S Seno; L Sessa; Y Sheng; Y Shibata; H Shimada; K Shimada; D Silva; B Sinclair; S Sperling; E Stupka; K Sugiura; R Sultana; Y Takenaka; K Taki; K Tammoja; S L Tan; S Tang; M S Taylor; J Tegner; S A Teichmann; H R Ueda; E van Nimwegen; R Verardo; C L Wei; K Yagi; H Yamanishi; E Zabarovsky; S Zhu; A Zimmer; W Hide; C Bult; S M Grimmond; R D Teasdale; E T Liu; V Brusic; J Quackenbush; C Wahlestedt; J S Mattick; D A Hume; C Kai; D Sasaki; Y Tomaru; S Fukuda; M Kanamori-Katayama; M Suzuki; J Aoki; T Arakawa; J Iida; K Imamura; M Itoh; T Kato; H Kawaji; N Kawagashira; T Kawashima; M Kojima; S Kondo; H Konno; K Nakano; N Ninomiya; T Nishio; M Okada; C Plessy; K Shibata; T Shiraki; S Suzuki; M Tagami; K Waki; A Watahiki; Y Okamura-Oho; H Suzuki; J Kawai; Y Hayashizaki
Journal:  Science       Date:  2005-09-02       Impact factor: 47.728

9.  Transcriptome sequencing across a prostate cancer cohort identifies PCAT-1, an unannotated lincRNA implicated in disease progression.

Authors:  John R Prensner; Matthew K Iyer; O Alejandro Balbin; Saravana M Dhanasekaran; Qi Cao; J Chad Brenner; Bharathi Laxman; Irfan A Asangani; Catherine S Grasso; Hal D Kominsky; Xuhong Cao; Xiaojun Jing; Xiaoju Wang; Javed Siddiqui; John T Wei; Daniel Robinson; Hari K Iyer; Nallasivam Palanisamy; Christopher A Maher; Arul M Chinnaiyan
Journal:  Nat Biotechnol       Date:  2011-07-31       Impact factor: 54.908

Review 10.  Steroid receptor RNA activator (SRA1): unusual bifaceted gene products with suspected relevance to breast cancer.

Authors:  Etienne Leygue
Journal:  Nucl Recept Signal       Date:  2007-08-03
View more
  411 in total

1.  Upregulation of the long noncoding RNA TUG1 promotes proliferation and migration of esophageal squamous cell carcinoma.

Authors:  Youtao Xu; Jie Wang; Mantang Qiu; Lei Xu; Ming Li; Feng Jiang; Rong Yin; Lin Xu
Journal:  Tumour Biol       Date:  2014-10-31

2.  Up-regulation of SUMO1 pseudogene 3 (SUMO1P3) in gastric cancer and its clinical association.

Authors:  Danyi Mei; Haojun Song; Kai Wang; Yichao Lou; Weiliang Sun; Zhong Liu; Xiaoyun Ding; Junming Guo
Journal:  Med Oncol       Date:  2013-09-01       Impact factor: 3.064

Review 3.  Multifaceted Roles of Long Non-coding RNAs in Head and Neck Cancer.

Authors:  Leslie Duncan; Chloe Shay; Yong Teng
Journal:  Adv Exp Med Biol       Date:  2021       Impact factor: 2.622

Review 4.  Non-coding RNAs in hepatocellular carcinoma: molecular functions and pathological implications.

Authors:  Chun-Ming Wong; Felice Ho-Ching Tsang; Irene Oi-Lin Ng
Journal:  Nat Rev Gastroenterol Hepatol       Date:  2018-01-10       Impact factor: 46.802

5.  Chronic Sleep Restriction Induces Cognitive Deficits and Cortical Beta-Amyloid Deposition in Mice via BACE1-Antisense Activation.

Authors:  Hong-Yi Zhao; Hui-Juan Wu; Jia-Lin He; Jian-Hua Zhuang; Zhen-Yu Liu; Liu-Qing Huang; Zhong-Xin Zhao
Journal:  CNS Neurosci Ther       Date:  2017-02-01       Impact factor: 5.243

Review 6.  Minireview: Long noncoding RNAs: new "links" between gene expression and cellular outcomes in endocrinology.

Authors:  Miao Sun; W Lee Kraus
Journal:  Mol Endocrinol       Date:  2013-07-24

Review 7.  Long non-coding RNA HOTAIR in carcinogenesis and metastasis.

Authors:  Jinsong Zhang; Peijing Zhang; Li Wang; Hai-long Piao; Li Ma
Journal:  Acta Biochim Biophys Sin (Shanghai)       Date:  2013-10-27       Impact factor: 3.848

Review 8.  Long noncoding RNAs: a new regulatory code in osteoarthritis.

Authors:  Xiao Cen; Xin-Qi Huang; Wen-Tian Sun; Qing Liu; Jun Liu
Journal:  Am J Transl Res       Date:  2017-11-15       Impact factor: 4.060

Review 9.  Long noncoding RNAs in lipid metabolism.

Authors:  Coen van Solingen; Kaitlyn R Scacalossi; Kathryn J Moore
Journal:  Curr Opin Lipidol       Date:  2018-06       Impact factor: 4.776

10.  Pseudogene integrator complex subunit 6 pseudogene 1 (INTS6P1) as a novel plasma-based biomarker for hepatocellular carcinoma screening.

Authors:  Ka Yin Lui; Hao-Ran Peng; Jin-Rong Lin; Chun-Hui Qiu; Hu-An Chen; Rong-Dang Fu; Chang-Jie Cai; Min-Qiang Lu
Journal:  Tumour Biol       Date:  2015-08-20
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.