Literature DB >> 17267435

De novo SVM classification of precursor microRNAs from genomic pseudo hairpins using global and intrinsic folding measures.

Kwang Loong Stanley Ng1, Santosh K Mishra.   

Abstract

MOTIVATION: MicroRNAs (miRNAs) are small ncRNAs participating in diverse cellular and physiological processes through the post-transcriptional gene regulatory pathway. Critically associated with the miRNAs biogenesis, the hairpin structure is a necessary feature for the computational classification of novel precursor miRNAs (pre-miRs). Though many of the abundant genomic inverted repeats (pseudo hairpins) can be filtered computationally, novel species-specific pre-miRs are likely to remain elusive.
RESULTS: miPred is a de novo Support Vector Machine (SVM) classifier for identifying pre-miRs without relying on phylogenetic conservation. To achieve significantly higher sensitivity and specificity than existing (quasi) de novo predictors, it employs a Gaussian Radial Basis Function kernel (RBF) as a similarity measure for 29 global and intrinsic hairpin folding attributes. They characterize a pre-miR at the dinucleotide sequence, hairpin folding, non-linear statistical thermodynamics and topological levels. Trained on 200 human pre-miRs and 400 pseudo hairpins, miPred achieves 93.50% (5-fold cross-validation accuracy) and 0.9833 (ROC score). Tested on the remaining 123 human pre-miRs and 246 pseudo hairpins, it reports 84.55% (sensitivity), 97.97% (specificity) and 93.50% (accuracy). Validated onto 1918 pre-miRs across 40 non-human species and 3836 pseudo hairpins, it yields 87.65% (92.08%), 97.75% (97.42%) and 94.38% (95.64%) for the mean (overall) sensitivity, specificity and accuracy. Notably, A.mellifera, A.geoffroyi, C.familiaris, E.Barr, H. Simplex virus, H.cytomegalovirus, O.aries, P.patens, R.lymphocryptovirus, Simian virus and Z.mays are unambiguously classified with 100.00% (sensitivity) and >93.75% (specificity). AVAILABILITY: Data sets, raw statistical results and source codes are available at http://web.bii.a-star.edu.sg/~stanley/Publications

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17267435     DOI: 10.1093/bioinformatics/btm026

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  82 in total

Review 1.  Genome-wide approaches in the study of microRNA biology.

Authors:  Melissa L Wilbert; Gene W Yeo
Journal:  Wiley Interdiscip Rev Syst Biol Med       Date:  2010-12-31

2.  Using RNA inverse folding to identify IRES-like structural subdomains.

Authors:  Ivan Dotu; Gloria Lozano; Peter Clote; Encarnacion Martinez-Salas
Journal:  RNA Biol       Date:  2013-11-04       Impact factor: 4.652

3.  Effective classification of microRNA precursors using feature mining and AdaBoost algorithms.

Authors:  Ling Zhong; Jason T L Wang; Dongrong Wen; Virginie Aris; Patricia Soteropoulos; Bruce A Shapiro
Journal:  OMICS       Date:  2013-06-29

Review 4.  Computational approaches for microRNA studies: a review.

Authors:  Li Li; Jianzhen Xu; Deyin Yang; Xiaorong Tan; Hongfei Wang
Journal:  Mamm Genome       Date:  2009-12-15       Impact factor: 2.957

5.  RNAmutants: a web server to explore the mutational landscape of RNA secondary structures.

Authors:  Jerome Waldispühl; Srinivas Devadas; Bonnie Berger; Peter Clote
Journal:  Nucleic Acids Res       Date:  2009-06-16       Impact factor: 16.971

6.  Application of support vector machine modeling for prediction of common diseases: the case of diabetes and pre-diabetes.

Authors:  Wei Yu; Tiebin Liu; Rodolfo Valdez; Marta Gwinn; Muin J Khoury
Journal:  BMC Med Inform Decis Mak       Date:  2010-03-22       Impact factor: 2.796

7.  Genome-wide searching with base-pairing kernel functions for noncoding RNAs: computational and expression analysis of snoRNA families in Caenorhabditis elegans.

Authors:  Kensuke Morita; Yutaka Saito; Kengo Sato; Kotaro Oka; Kohji Hotta; Yasubumi Sakakibara
Journal:  Nucleic Acids Res       Date:  2009-01-07       Impact factor: 16.971

8.  Identification and analysis of miRNAs in human breast cancer and teratoma samples using deep sequencing.

Authors:  Sanne Nygaard; Anders Jacobsen; Morten Lindow; Jens Eriksen; Eva Balslev; Henrik Flyger; Niels Tolstrup; Søren Møller; Anders Krogh; Thomas Litman
Journal:  BMC Med Genomics       Date:  2009-06-09       Impact factor: 3.063

9.  MicroRNA prediction with a novel ranking algorithm based on random walks.

Authors:  Yunpen Xu; Xuefeng Zhou; Weixiong Zhang
Journal:  Bioinformatics       Date:  2008-07-01       Impact factor: 6.937

10.  Identification and characterization of novel amphioxus microRNAs by Solexa sequencing.

Authors:  Xi Chen; Qibin Li; Jin Wang; Xing Guo; Xiangrui Jiang; Zhiji Ren; Chunyue Weng; Guoxun Sun; Xiuqiang Wang; Yaping Liu; Lijia Ma; Jun-Yuan Chen; Jun Wang; Ke Zen; Junfeng Zhang; Chen-Yu Zhang
Journal:  Genome Biol       Date:  2009-07-17       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.