Literature DB >> 25715848

miRNA-dis: microRNA precursor identification based on distance structure status pairs.

Bin Liu1, Longyun Fang, Junjie Chen, Fule Liu, Xiaolong Wang.   

Abstract

MicroRNA precursor identification is an important task in bioinformatics. Support Vector Machine (SVM) is one of the most effective machine learning methods used in this field. The performance of SVM-based methods depends on the vector representations of RNAs. However, the discriminative power of the existing feature vectors is limited, and many methods lack an interpretable model for analysis of characteristic sequence features. Prior studies have demonstrated that sequence or structure order effects were relevant for discrimination, but little work has explored how to use this kind of information for human pre-microRNA identification. In this study, in order to incorporate the structure-order information into the prediction, a method called "miRNA-dis" was proposed, in which the feature vector was constructed by the occurrence frequency of the "distance structure status pair" or just the "distance-pair". Rigorous cross-validations on a much larger and more stringent newly constructed benchmark dataset showed that the miRNA-dis outperformed some state-of-the-art predictors in this area. Remarkably, miRNA-dis trained with human data can correctly predict 87.02% of the 4022 pre-miRNAs from 11 different species ranging from animals, plants and viruses. miRNA-dis would be a useful high throughput tool for large-scale analysis of microRNA precursors. In addition, the learnt model can be easily analyzed in terms of discriminative features, and some interesting patterns were discovered, which could reflect the characteristics of microRNAs. A user-friendly web-server of miRNA-dis was constructed, which is freely accessible to the public at the web-site on http://bioinformatics.hitsz.edu.cn/miRNA-dis/.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25715848     DOI: 10.1039/c5mb00050e

Source DB:  PubMed          Journal:  Mol Biosyst        ISSN: 1742-2051


  24 in total

1.  repRNA: a web server for generating various feature vectors of RNA sequences.

Authors:  Bin Liu; Fule Liu; Longyun Fang; Xiaolong Wang; Kuo-Chen Chou
Journal:  Mol Genet Genomics       Date:  2015-06-18       Impact factor: 3.291

2.  Protein remote homology detection by combining Chou's distance-pair pseudo amino acid composition and principal component analysis.

Authors:  Bin Liu; Junjie Chen; Xiaolong Wang
Journal:  Mol Genet Genomics       Date:  2015-04-21       Impact factor: 3.291

3.  Prediction of Protein Submitochondrial Locations by Incorporating Dipeptide Composition into Chou's General Pseudo Amino Acid Composition.

Authors:  Khurshid Ahmad; Muhammad Waris; Maqsood Hayat
Journal:  J Membr Biol       Date:  2016-01-08       Impact factor: 1.843

4.  Prediction of protein-protein interactions with clustered amino acids and weighted sparse representation.

Authors:  Qiaoying Huang; Zhuhong You; Xiaofeng Zhang; Yong Zhou
Journal:  Int J Mol Sci       Date:  2015-05-13       Impact factor: 5.923

5.  Constructing lncRNA functional similarity network based on lncRNA-disease associations and disease semantic similarity.

Authors:  Xing Chen; Chenggang Clarence Yan; Cai Luo; Wen Ji; Yongdong Zhang; Qionghai Dai
Journal:  Sci Rep       Date:  2015-06-10       Impact factor: 4.379

6.  iMiRNA-SSF: Improving the Identification of MicroRNA Precursors by Combining Negative Sets with Different Distributions.

Authors:  Junjie Chen; Xiaolong Wang; Bin Liu
Journal:  Sci Rep       Date:  2016-01-12       Impact factor: 4.379

Review 7.  Survey of Natural Language Processing Techniques in Bioinformatics.

Authors:  Zhiqiang Zeng; Hua Shi; Yun Wu; Zhiling Hong
Journal:  Comput Math Methods Med       Date:  2015-10-07       Impact factor: 2.238

8.  Predicting cancerlectins by the optimal g-gap dipeptides.

Authors:  Hao Lin; Wei-Xin Liu; Jiao He; Xin-Hui Liu; Hui Ding; Wei Chen
Journal:  Sci Rep       Date:  2015-12-09       Impact factor: 4.379

9.  Improving classification of mature microRNA by solving class imbalance problem.

Authors:  Ying Wang; Xiaoye Li; Bairui Tao
Journal:  Sci Rep       Date:  2016-05-16       Impact factor: 4.379

10.  Recombination spot identification Based on gapped k-mers.

Authors:  Rong Wang; Yong Xu; Bin Liu
Journal:  Sci Rep       Date:  2016-03-31       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.