| Literature DB >> 23319885 |
Handong Ma1, Yun Hao, Xinran Dong, Qingtian Gong, Jingqi Chen, Jifeng Zhang, Weidong Tian.
Abstract
The central dogma of gene expression considers RNA as the carrier of genetic information from DNA to protein. However, it has become more and more clear that RNA plays more important roles than simply being the information carrier. Recently, whole genome transcriptomic analyses have identified large numbers of dynamically expressed long noncoding RNAs (lncRNAs), many of which are involved in a variety of biological functions. Even so, the functions and molecular mechanisms of most lncRNAs still remain elusive. Therefore, it is necessary to develop computational methods to predict the function of lncRNAs in order to accelerate the study of lncRNAs. Here, we review the recent progress in the identification of lncRNAs, the molecular functions and mechanisms of lncRNAs, and the computational methods for predicting the function of lncRNAs.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23319885 PMCID: PMC3540756 DOI: 10.1100/2012/541786
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Figure 1Workflow of lncRNA identification from RNA-Seq.
Machine-learning methods for identifying lncRNAs.
| Method | Features | Algorithm | References |
|---|---|---|---|
| Peptide length | |||
| Amino acid composition | |||
| Hydrophobicity | |||
| CONC | Secondary structure content | SVM | [ |
| Percentage of residues exposed to solvent | |||
| Sequence compositional entropy | |||
| Number of homologs obtained by PSI-BLAST | |||
| Alignment entropy | |||
|
| |||
| ORF prediction quality | |||
| CPC | Number of homologs obtained by BLASTX | SVM | [ |
| Alignment quality | |||
| Segment distribution | |||
|
| |||
| Lu et al. | RNA-seq experiments | Naïve Bayes | [ |
Function classification of lncRNAs.
| Archetype | lncRNA name | Length | Target | Function |
| References |
|---|---|---|---|---|---|---|
| Signal | KCNQ1ot1, Air, Xist | 91 kb, 108 kb, ~17 kb | G9a, PRC, YY1 | Transcriptional silencing of multiple genes; X inactivation (XCI) |
| [ |
| HOTAIR, Frigidair, HOTTIP, | 2.2 kb, N.A., 3.7 kb | LSD1-CoREST | Signals of anatomic position, |
| [ | |
| lincRNA-p21, PANDA | 3 kb; 1.5 kb | hnRNP-K | p53 targets in response to DNA damage |
| [ | |
| lincRNA-RoR | 2.6 kb | Oct4, Sox2, Nanog | Pluripotency-associated | N.A.b | [ | |
| COOLAIR, COLDAIR | Multiple spliced: 400 bp/750 bp; ~1.1 kb | FLC, PRC2 | Combinatorial transcriptional regulation | N.A. | [ | |
| eRNA | Various sizes | MLL-WDR5, TFsa | Promotes mRNA synthesis |
| [ | |
| Gas5 | ~7 kb | Glucocorticoid receptor | Represses the glucocorticoid receptor | N.A. | [ | |
| 1/2-sbsRNAs | N.A.c | SMD | Formation of STAU1 binding sites | N.A. | [ | |
|
| ||||||
| Decoys | DHFR-Minor | 7.3, 5.0, 1.4, and 0.8 kb | TFIIB | Inhibits assembly of the preinitiation complex | N.A. | [ |
| TERRA | Various sizes | Telomerase | Regulation and protection of chromosome ends | N.A. | [ | |
| PANDA | 1.5 kb | NF-YA | Inhibits expression of apoptotic genes |
| [ | |
|
| ~3.9 kb | PTEN | Sequestration of miRNAs | N.A. | [ | |
| MALAT1 | ~7 kb | SR splicing factors | Alters pattern of alternative splicing | N.A. | [ | |
|
| ||||||
| Guides | Xist | ~17 kb | PRC2, YY1 | Inactives X chromosome |
| [ |
| Air, COLDAIR | 108 kb, | G9a, PRC2 | Silences transcription, affects histone acetylation and methylation states |
| [ | |
| HOTTIP | ~3.8 kb | MLL-WDR5 | Chromosomal looping, chromatin modifications |
| [ | |
| HOTAIR | 2.2 kb | LSD1-CoREST | Alters and regulates epigenetic states |
| [ | |
| Jpx | Multiple isoforms | polycomb complexa | Activation of Xist RNA on the inactive X |
| [ | |
| lincRNA-p21 | 3 kb | hnRNP-Ka | p53 targets in response to DNA damage |
| [ | |
|
| ||||||
| Scaffold | TERC | Various sizes | TERT | Telomerase catalytic activity |
| [ |
| HOTAIR | 2.2 kb | PRC2, LSD1, CoREST, REST | Demethylates histone H3 on K4 to antagonize gene activation |
| [ | |
| ANRIL | Multiple spliced: 3.9 kb/34.8 kb | PRC1, PRC2 | Contributes to the functions of both PRC1 and PRC2 proteins |
| [ | |
| Alpha Satellite Repeat LncRNA | N.A. | SUMO-HP1 | Molecular scaffold for the targeting and local accumulation of HP1 | N.A. | [ | |
aNot yet understood.
bNot clearly referred as cis-action.
cNo length data available in all six databases listed in Table 3.
List of lncRNA databases.
| Tools | Source | Description | Reference |
|---|---|---|---|
| lncRNAdb |
| Contain comprehensive list of lncRNAs in eukaryotes, and mRNAs with regulatory roles | [ |
| NONCODE |
| Integrative annotation of noncoding RNA (73,372 lncRNAs) | [ |
| LNCipedia |
| 21 488 annotated human lncRNA transcripts with secondary structure information, protein coding potential, and microRNA binding sites | [ |
| fRNAdb |
| A large collection of noncoding transcripts including annotated/unannotated sequences from H-inv database, NONCODE, and RNAdb | [ |
| NRED |
| Noncoding RNA Expression Database | [ |