Literature DB >> 25645238

iMiRNA-PseDPC: microRNA precursor identification with a pseudo distance-pair composition approach.

Bin Liu1,2,3, Longyun Fang1, Fule Liu1, Xiaolong Wang1,2, Kuo-Chen Chou3,4.   

Abstract

A microRNA (miRNA) is a small non-coding RNA molecule, functioning in transcriptional and post-transcriptional regulation of gene expression. The human genome may encode over 1000 miRNAs. Albeit poorly characterized, miRNAs are widely deemed as important regulators of biological processes. Aberrant expression of miRNAs has been observed in many cancers and other disease states, indicating that they are deeply implicated with these diseases, particularly in carcinogenesis. Therefore, it is important for both basic research and miRNA-based therapy to discriminate the real pre-miRNAs from the false ones (such as hairpin sequences with similar stem-loops). Particularly, with the avalanche of RNA sequences generated in the post-genomic age, it is highly desired to develop computational sequence-based methods for effectively identifying the human pre-miRNAs. Here, we propose a predictor called "iMiRNA-PseDPC", in which the RNA sequences are formulated by a novel feature vector called "pseudo distance-pair composition" (PseDPC) with 10 types of structure statuses. Rigorous cross-validations on a much larger and more stringent newly constructed benchmark data-set showed that our approach has remarkably outperformed the existing ones in either prediction accuracy or efficiency, indicating the new predictor is quite promising or at least may become a complementary tool to the existing predictors in this area. For the convenience of most experimental scientists, a user-friendly web server for the new predictor has been established at http://bioinformatics.hitsz.edu.cn/iMiRNA-PseDPC/, by which users can easily get their desired results without the need to go through the mathematical details. It is anticipated that the new predictor may become a useful high throughput tool for genome analysis particularly in dealing with large-scale data.

Entities:  

Keywords:  Chou’s PseAAC approach; free energy; iMiRNA-PseDPC; large-scale analysis; local structure status; pre-miRNA

Mesh:

Substances:

Year:  2015        PMID: 25645238     DOI: 10.1080/07391102.2015.1014422

Source DB:  PubMed          Journal:  J Biomol Struct Dyn        ISSN: 0739-1102


  46 in total

1.  iRSpot-GAEnsC: identifing recombination spots via ensemble classifier and extending the concept of Chou's PseAAC to formulate DNA samples.

Authors:  Muhammad Kabir; Maqsood Hayat
Journal:  Mol Genet Genomics       Date:  2015-08-30       Impact factor: 3.291

2.  repRNA: a web server for generating various feature vectors of RNA sequences.

Authors:  Bin Liu; Fule Liu; Longyun Fang; Xiaolong Wang; Kuo-Chen Chou
Journal:  Mol Genet Genomics       Date:  2015-06-18       Impact factor: 3.291

3.  The Helitron family classification using SVM based on Fourier transform features applied on an unbalanced dataset.

Authors:  Rabeb Touati; Afef Elloumi Oueslati; Imen Messaoudi; Zied Lachiri
Journal:  Med Biol Eng Comput       Date:  2019-08-17       Impact factor: 2.602

4.  Protein remote homology detection by combining Chou's distance-pair pseudo amino acid composition and principal component analysis.

Authors:  Bin Liu; Junjie Chen; Xiaolong Wang
Journal:  Mol Genet Genomics       Date:  2015-04-21       Impact factor: 3.291

5.  Prediction of Protein Submitochondrial Locations by Incorporating Dipeptide Composition into Chou's General Pseudo Amino Acid Composition.

Authors:  Khurshid Ahmad; Muhammad Waris; Maqsood Hayat
Journal:  J Membr Biol       Date:  2016-01-08       Impact factor: 1.843

6.  Prediction of protein-protein interactions with clustered amino acids and weighted sparse representation.

Authors:  Qiaoying Huang; Zhuhong You; Xiaofeng Zhang; Yong Zhou
Journal:  Int J Mol Sci       Date:  2015-05-13       Impact factor: 5.923

7.  Constructing lncRNA functional similarity network based on lncRNA-disease associations and disease semantic similarity.

Authors:  Xing Chen; Chenggang Clarence Yan; Cai Luo; Wen Ji; Yongdong Zhang; Qionghai Dai
Journal:  Sci Rep       Date:  2015-06-10       Impact factor: 4.379

8.  Predicting cancerlectins by the optimal g-gap dipeptides.

Authors:  Hao Lin; Wei-Xin Liu; Jiao He; Xin-Hui Liu; Hui Ding; Wei Chen
Journal:  Sci Rep       Date:  2015-12-09       Impact factor: 4.379

9.  Combined sequence and sequence-structure based methods for analyzing FGF23, CYP24A1 and VDR genes.

Authors:  Selvaraman Nagamani; Kh Dhanachandra Singh; Karthikeyan Muthusamy
Journal:  Meta Gene       Date:  2016-03-31

10.  DephosSite: a machine learning approach for discovering phosphotase-specific dephosphorylation sites.

Authors:  Xiaofeng Wang; Renxiang Yan; Jiangning Song
Journal:  Sci Rep       Date:  2016-03-22       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.