Literature DB >> 25795417

miRBoost: boosting support vector machines for microRNA precursor classification.

Van Du T Tran1, Sebastien Tempel2, Benjamin Zerath3, Farida Zehraoui3, Fariza Tahi4.   

Abstract

Identification of microRNAs (miRNAs) is an important step toward understanding post-transcriptional gene regulation and miRNA-related pathology. Difficulties in identifying miRNAs through experimental techniques combined with the huge amount of data from new sequencing technologies have made in silico discrimination of bona fide miRNA precursors from non-miRNA hairpin-like structures an important topic in bioinformatics. Among various techniques developed for this classification problem, machine learning approaches have proved to be the most promising. However these approaches require the use of training data, which is problematic due to an imbalance in the number of miRNAs (positive data) and non-miRNAs (negative data), which leads to a degradation of their performance. In order to address this issue, we present an ensemble method that uses a boosting technique with support vector machine components to deal with imbalanced training data. Classification is performed following a feature selection on 187 novel and existing features. The algorithm, miRBoost, performed better in comparison with state-of-the-art methods on imbalanced human and cross-species data. It also showed the highest ability among the tested methods for discovering novel miRNA precursors. In addition, miRBoost was over 1400 times faster than the second most accurate tool tested and was significantly faster than most of the other tools. miRBoost thus provides a good compromise between prediction efficiency and execution time, making it highly suitable for use in genome-wide miRNA precursor prediction. The software miRBoost is available on our web server http://EvryRNA.ibisc.univ-evry.fr.
© 2015 Tran et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

Entities:  

Keywords:  boosting; classification; imbalanced data; microRNA prediction; support vector machine (SVM)

Mesh:

Substances:

Year:  2015        PMID: 25795417      PMCID: PMC4408786          DOI: 10.1261/rna.043612.113

Source DB:  PubMed          Journal:  RNA        ISSN: 1355-8382            Impact factor:   4.942


  43 in total

1.  Combining multi-species genomic data for microRNA identification using a Naive Bayes classifier.

Authors:  Malik Yousef; Michael Nebozhyn; Hagit Shatkay; Stathis Kanterakis; Louise C Showe; Michael K Showe
Journal:  Bioinformatics       Date:  2006-03-16       Impact factor: 6.937

2.  Prediction of novel pre-microRNAs with high accuracy through boosting and SVM.

Authors:  Yuanwei Zhang; Yifan Yang; Huan Zhang; Xiaohua Jiang; Bo Xu; Yu Xue; Yunxia Cao; Qian Zhai; Yong Zhai; Mingqing Xu; Howard J Cooke; Qinghua Shi
Journal:  Bioinformatics       Date:  2011-03-23       Impact factor: 6.937

3.  MiRNA recognition with the yasMiR system: the quest for further improvements.

Authors:  Daniel Pasailă; Andrei Sucilă; Irina Mohorianu; Stefan Panţiru; Liviu Ciortuz
Journal:  Adv Exp Med Biol       Date:  2011       Impact factor: 2.622

4.  mirExplorer: detecting microRNAs from genome and next generation sequencing data using the AdaBoost method with transition probability matrix and combined features.

Authors:  Dao-Gang Guan; Jian-You Liao; Zhen-Hua Qu; Ying Zhang; Liang-Hu Qu
Journal:  RNA Biol       Date:  2011-09-01       Impact factor: 4.652

5.  A microRNA polycistron as a potential human oncogene.

Authors:  Lin He; J Michael Thomson; Michael T Hemann; Eva Hernando-Monge; David Mu; Summer Goodson; Scott Powers; Carlos Cordon-Cardo; Scott W Lowe; Gregory J Hannon; Scott M Hammond
Journal:  Nature       Date:  2005-06-09       Impact factor: 49.962

6.  Mutations in the seed region of human miR-96 are responsible for nonsyndromic progressive hearing loss.

Authors:  Angeles Mencía; Silvia Modamio-Høybjør; Nick Redshaw; Matías Morín; Fernando Mayo-Merino; Leticia Olavarrieta; Luis A Aguirre; Ignacio del Castillo; Karen P Steel; Tamas Dalmay; Felipe Moreno; Miguel Angel Moreno-Pelayo
Journal:  Nat Genet       Date:  2009-04-12       Impact factor: 38.330

7.  Heterogeneous ensemble approach with discriminative features and modified-SMOTEbagging for pre-miRNA classification.

Authors:  Supatcha Lertampaiporn; Chinae Thammarongtham; Chakarida Nukoolkit; Boonserm Kaewkamnerdpong; Marasri Ruengjitchatchawalya
Journal:  Nucleic Acids Res       Date:  2012-09-24       Impact factor: 16.971

8.  MiRPara: a SVM-based software tool for prediction of most probable microRNA coding regions in genome scale sequences.

Authors:  Yonggan Wu; Bo Wei; Haizhou Liu; Tianxian Li; Simon Rayner
Journal:  BMC Bioinformatics       Date:  2011-04-19       Impact factor: 3.169

9.  ncRNAclassifier: a tool for detection and classification of transposable element sequences in RNA hairpins.

Authors:  Sébastien Tempel; Nicolas Pollet; Fariza Tahi
Journal:  BMC Bioinformatics       Date:  2012-09-25       Impact factor: 3.169

10.  MiRFinder: an improved approach and software implementation for genome-wide fast microRNA precursor scans.

Authors:  Ting-Hua Huang; Bin Fan; Max F Rothschild; Zhi-Liang Hu; Kui Li; Shu-Hong Zhao
Journal:  BMC Bioinformatics       Date:  2007-09-17       Impact factor: 3.169

View more
  7 in total

1.  Sparse support vector machines with L0 approximation for ultra-high dimensional omics data.

Authors:  Zhenqiu Liu; David Elashoff; Steven Piantadosi
Journal:  Artif Intell Med       Date:  2019-04-30       Impact factor: 5.326

2.  CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks.

Authors:  Huiqing Wang; Yue Ma; Chunlin Dong; Chun Li; Jingjing Wang; Dan Liu
Journal:  Front Genet       Date:  2019-10-11       Impact factor: 4.599

3.  miRNAFold: a web server for fast miRNA precursor prediction in genomes.

Authors:  Christophe Tav; Sébastien Tempel; Laurent Poligny; Fariza Tahi
Journal:  Nucleic Acids Res       Date:  2016-05-29       Impact factor: 16.971

4.  IpiRId: Integrative approach for piRNA prediction using genomic and epigenomic data.

Authors:  Anouar Boucheham; Vivien Sommard; Farida Zehraoui; Adnane Boualem; Mohamed Batouche; Abdelhafid Bendahmane; David Israeli; Fariza Tahi
Journal:  PLoS One       Date:  2017-06-16       Impact factor: 3.240

5.  Novel and Haplotype Specific MicroRNAs Encoded by the Major Histocompatibility Complex.

Authors:  P M Clark; N Chitnis; M Shieh; M Kamoun; F B Johnson; D Monos
Journal:  Sci Rep       Date:  2018-03-01       Impact factor: 4.379

6.  The associations between Toll-like receptor 4 gene polymorphisms and hepatitis C virus infection: a systematic review and meta-analysis.

Authors:  Narttaya Chaiwiang; Teera Poyomtip
Journal:  Biosci Rep       Date:  2019-02-26       Impact factor: 3.840

7.  miRWoods: Enhanced precursor detection and stacked random forests for the sensitive detection of microRNAs.

Authors:  Jimmy Bell; Maureen Larson; Michelle Kutzler; Massimo Bionaz; Christiane V Löhr; David Hendrix
Journal:  PLoS Comput Biol       Date:  2019-10-09       Impact factor: 4.475

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.