| Literature DB >> 26072035 |
Xingli Guo, Lin Gao, Yu Wang, David K Y Chiu, Tong Wang, Yue Deng.
Abstract
Long noncoding RNAs (lncRNAs), generally longer than 200 nucleotides and with poor protein coding potential, are usually considered collectively as a heterogeneous class of RNAs. Recently, an increasing number of studies have shown that lncRNAs can involve in various critical biological processes and a number of complex human diseases. Not only the primary sequences of many lncRNAs are directly interrelated to a specific functional role, strong evidence suggests that their secondary structures are even more interrelated to their known functions. As functional molecules, lncRNAs have become more and more relevant to many researchers. Here, we review recent, state-of-the-art advances in the three levels (the primary sequence, the secondary structure and the function annotation) of the lncRNA research, as well as computational methods for lncRNA data analysis.Entities:
Keywords: function annotation; identification; lncRNA; protein-coding potential; secondary structure prediction
Mesh:
Substances:
Year: 2015 PMID: 26072035 PMCID: PMC5863772 DOI: 10.1093/bfgp/elv022
Source DB: PubMed Journal: Brief Funct Genomics ISSN: 2041-2649 Impact factor: 4.241
Tools for calculating the protein coding potential
| Name | Features | Classifier | Reference |
|---|---|---|---|
|
CSF Coding Substitution Frequency | Log-likelihood ratio of coding versus noncoding based on empirical frequencies of all codon substitutions in combination with evolutionary information for several organisms | Heuristic strategy | [ |
| PhyloCSF | A rigorous reformulation of CSF features | Statistical models; | [ |
|
CPC Coding Potential Calculator | Three features based on identified ORF and three features based on the output of parsing protein database | SVM classifier; | [ |
|
CPAT Coding-Potential Assessment Tool | Four features based on ORF | Logistic regression model; | [ |
|
CNCI Coding–Noncoding Index | Five features based on usage frequency of adjoining nucleotide triplets | SVM classifier; | [ |
|
CONC Coding or Noncoding | Nine features based on sequence composition, secondary structure and alignment with proteins | SVM classifier | [ |
Figure 1A framework of computational methods for lncRNA function prediction. (A colour version of this figure is available online at: http://bfg.oxfordjournals.org)
The databases for lncRNAs
| Database | Web site | Reference | |
|---|---|---|---|
| Annotation databases | NONCODE v 4.0 |
| [ |
| lncRNAdb |
| [ | |
| LNCipedia |
| [ | |
| lncRNome |
| [ | |
| fRNAdb |
| [ | |
| lncRNAtor |
| [ | |
| lncRNAMap |
| [ | |
| PLncDB |
| [ | |
| Interaction databases | ChIPBase |
| [ |
| NPInter |
| [ | |
| miRcode |
| [ | |
| DIANA-LncBase |
| [ | |
| StarBase v2.0 |
| [ | |
| lncRNA2Target |
| [ | |
| lncRNADisease |
| [ | |
| Specific databases | lnCeDB |
| [ |
| NRED |
| [ | |
| Linc2go |
| [ | |
| lncRNASNP |
| [ |