Literature DB >> 26362104

Identification of microRNA precursor with the degenerate K-tuple or Kmer strategy.

Bin Liu1, Longyun Fang2, Shanyi Wang3, Xiaolong Wang4, Hongtao Li5, Kuo-Chen Chou6.   

Abstract

The microRNA (miRNA), a small non-coding RNA molecule, plays an important role in transcriptional and post-transcriptional regulation of gene expression. Its abnormal expression, however, has been observed in many cancers and other disease states, implying that the miRNA molecules are also deeply involved in these diseases, particularly in carcinogenesis. Therefore, it is important for both basic research and miRNA-based therapy to discriminate the real pre-miRNAs from the false ones (such as hairpin sequences with similar stem-loops). Most existing methods in this regard were based on the strategy in which RNA samples were formulated by a vector formed by their Kmer components. But the length of Kmers must be very short; otherwise, the vector's dimension would be extremely large, leading to the "high-dimension disaster" or overfitting problem. Inspired by the concept of "degenerate energy levels" in quantum mechanics, we introduced the "degenerate Kmer" (deKmer) to represent RNA samples. By doing so, not only we can accommodate long-range coupling effects but also we can avoid the high-dimension problem. Rigorous jackknife tests and cross-species experiments indicated that our approach is very promising. It has not escaped our notice that the deKmer approach can also be applied to many other areas of computational biology. A user-friendly web-server for the new predictor has been established at http://bioinformatics.hitsz.edu.cn/miRNA-deKmer/, by which users can easily get their desired results.
Copyright © 2015 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Degenerate Kmer; False pre-miRNA; Long-range effect; MicroRNA precursor; True pre-miRNA; deKmer web-server

Mesh:

Substances:

Year:  2015        PMID: 26362104     DOI: 10.1016/j.jtbi.2015.08.025

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  51 in total

1.  Prediction of Protein-Protein Interaction Sites with Machine-Learning-Based Data-Cleaning and Post-Filtering Procedures.

Authors:  Guang-Hui Liu; Hong-Bin Shen; Dong-Jun Yu
Journal:  J Membr Biol       Date:  2015-11-12       Impact factor: 1.843

2.  Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier.

Authors:  Xiaotong Guo; Fulin Liu; Ying Ju; Zhen Wang; Chunyu Wang
Journal:  Sci Rep       Date:  2016-06-21       Impact factor: 4.379

3.  NmSEER V2.0: a prediction tool for 2'-O-methylation sites based on random forest and multi-encoding combination.

Authors:  Yiran Zhou; Qinghua Cui; Yuan Zhou
Journal:  BMC Bioinformatics       Date:  2019-12-24       Impact factor: 3.169

4.  GlyinsRNA: a webserver for predicting glycosylation sites on small RNAs.

Authors:  Chunmei Cui; Xiaobin Wu; Yuan Zhou
Journal:  RNA Biol       Date:  2021-09-24       Impact factor: 4.766

5.  RFAthM6A: a new tool for predicting m6A sites in Arabidopsis thaliana.

Authors:  Xiaofeng Wang; Renxiang Yan
Journal:  Plant Mol Biol       Date:  2018-01-16       Impact factor: 4.076

6.  CNNLSTMac4CPred: A Hybrid Model for N4-Acetylcytidine Prediction.

Authors:  Guiyang Zhang; Wei Luo; Jianyi Lyu; Zu-Guo Yu; Guohua Huang
Journal:  Interdiscip Sci       Date:  2022-02-01       Impact factor: 2.233

7.  iACP: a sequence-based tool for identifying anticancer peptides.

Authors:  Wei Chen; Hui Ding; Pengmian Feng; Hao Lin; Kuo-Chen Chou
Journal:  Oncotarget       Date:  2016-03-29

8.  Comparative analysis and prediction of nucleosome positioning using integrative feature representation and machine learning algorithms.

Authors:  Guo-Sheng Han; Qi Li; Ying Li
Journal:  BMC Bioinformatics       Date:  2021-06-02       Impact factor: 3.307

9.  Combined sequence and sequence-structure based methods for analyzing FGF23, CYP24A1 and VDR genes.

Authors:  Selvaraman Nagamani; Kh Dhanachandra Singh; Karthikeyan Muthusamy
Journal:  Meta Gene       Date:  2016-03-31

10.  DephosSite: a machine learning approach for discovering phosphotase-specific dephosphorylation sites.

Authors:  Xiaofeng Wang; Renxiang Yan; Jiangning Song
Journal:  Sci Rep       Date:  2016-03-22       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.