| Literature DB >> 24106460 |
Carl Ernst1, Cynthia C Morton.
Abstract
Long non-coding (lnc) RNAs are defined as non-protein coding RNAs distinct from housekeeping RNAs such as tRNAs, rRNAs, and snRNAs, and independent from small RNAs with specific molecular processing machinery such as micro- or piwi-RNAs. Recent studies of lncRNAs across different species have revealed a diverse population of RNA molecules of differing size and function. RNA sequencing studies suggest transcription throughout the genome, so there is a need to understand how sequence relates to functional and structural relationships amongst RNA molecules. Our synthesis of recent studies suggests that neither size, presence of a poly-A tail, splicing, direction of transcription, nor strand specificity are of importance to lncRNA function. Rather, relative genomic position in relation to a target is fundamentally important. In this review, we describe issues of key importance in functional assessment of lncRNA and how this might apply to lncRNAs important in neurodevelopment.Entities:
Keywords: epigenetics; gene regulation; neurodevelopment; non-coding RNA
Year: 2013 PMID: 24106460 PMCID: PMC3788346 DOI: 10.3389/fncel.2013.00168
Source DB: PubMed Journal: Front Cell Neurosci ISSN: 1662-5102 Impact factor: 5.505
Some examples of categorization of non-coding RNA.
| Category | Description | Example | Reference |
|---|---|---|---|
| Intronic | Expressed from the intron of target | DMD lncRNA | |
| H3K4me3 | Has a methylated H3K4 promoter | lincRNA-p21 | |
| Antisense | Expressed from the non-coding strand and acts on the complementary target | BACE1-AS | |
| Enhancer | Expressed to enhance expression at a locus at some distance from target | p53 eRNAs | |
| Promoter | Acting on and expressed from the promoter of target | DBE-T | |
| Intergenic | Expressed at some distance from coding genes | lincRNA00299 | |
| Acting at some distance from target | Evf2 | ||
| Acting on an adjacent target | AIRN | ||
| Small | Less than 200 bp in size | microRNA 137 | Ripke et al. (2011) |
| Long | Greater than 200 bp in size | Fendrr | |
| 5-UTR | Expressed near the 5′UTR of target | 5′UTR ELK-1 |
Processing and positional diversity of lncRNA (in order described in text).
| lncRNA [Ref] | Species | Description of all structural properties reported |
|---|---|---|
| COLDAIR ( | 1100 base RNA expressed from the sense strand relative to target, has no poly-A tail, expressed from an intron of the target gene | |
| IRT1 ( | Expressed from promoter of target, 1.4 kb in length, not spliced and is transcribed from the same strand as the gene it regulates | |
| NeST ( | Encoded on the antisense strand, contains six exons spread over a 45 kb region, transcript is 914 bases | |
| Braveheart ( | 590 base RNA with three exons, 33% confined to the nucleus | |
| NeST ( | Encoded on antisense strand, contains six exons spread over a 45 kb region, primary transcript is 914 bases | |
| DBE-T ( | 9.5 kb is one major product, transcribed from same strand as target genes, transcript contains one of many targets, nuclear and chromatin associated | |
| HOTTIP ( | 3,764-nucleotide, spliced and polyadenylated intergenic RNA, ~330 base product, regulates gene cluster | |
| ANRIL ( | ~126 kb transcript, spliced, 19 exons with an ~1.1 kb transcript, 13 isoforms transcribed in the antisense orientation of gene cluster, overlaps one target gene | |
| lincMD1 ( | Three exons and two introns in 14 kb of genomic space, spliced product of 521 bases, accumulates as cytoplasmic poly-A+ RNA, transcribed on same strand in same orientation as microRNAs for which it acts as a decoy | |
| TINCR ( | Three exons, 3.7 kb transcript predominantly cytoplasmically expressed, over 100 different targets dispersed through genome | |
| UCHL1-AS ( | Four exons spanning 70 kb, overlaps the first 73 bases of | |
| 1/2sbsRNA1 ( | Present in cytoplasm, poly-A+, two alternative transcripts consist of 688 nucleotides, multiple targets throughout genome |
Protein:RNA interacting domains.
| CXC (redox-like), CXXC (redox), or C-X(6)-X (zinc finger or ring finger) motifs refer to the cysteine residue (C) with any amino acid (X) in between. These Cys residues may be active, meaning they can use their highly active sulfhydryl (SH) group to form a covalent bond with the OH group on the RNA sugar ring. These motifs can also interact with Ser, Thr, or Tyr amino acid residues to form S–S or S–O bonds on other proteins. An example of the cysteine RNA interacting domain are the Enhancer of zeste-related proteins with conserved X(6)-C-X(3)-C-X-C motifs. |
| WD domains refer to peptide domains with rich repeats of tryptophan (W; hydrophobic) and aspartic acid (D; negatively charged) that are present in a large range of proteins. WD domains are non-catalytic and are thought to form a platform for the interaction of different cellular partners. |