Literature DB >> 16945945

PSoL: a positive sample only learning algorithm for finding non-coding RNA genes.

Chunlin Wang1, Chris Ding, Richard F Meraz, Stephen R Holbrook.   

Abstract

MOTIVATION: Small non-coding RNA (ncRNA) genes play important regulatory roles in a variety of cellular processes. However, detection of ncRNA genes is a great challenge to both experimental and computational approaches. In this study, we describe a new approach called positive sample only learning (PSoL) to predict ncRNA genes in the Escherichia coli genome. Although PSoL is a machine learning method for classification, it requires no negative training data, which, in general, is hard to define properly and affects the performance of machine learning dramatically. In addition, using the support vector machine (SVM) as the core learning algorithm, PSoL can integrate many different kinds of information to improve the accuracy of prediction. Besides the application of PSoL for predicting ncRNAs, PSoL is applicable to many other bioinformatics problems as well.
RESULTS: The PSoL method is assessed by 5-fold cross-validation experiments which show that PSoL can achieve about 80% accuracy in recovery of known ncRNAs. We compared PSoL predictions with five previously published results. The PSoL method has the highest percentage of predictions overlapping with those from other methods.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16945945     DOI: 10.1093/bioinformatics/btl441

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  29 in total

1.  Integrating statistical predictions and experimental verifications for enhancing protein-chemical interaction predictions in virtual screening.

Authors:  Nobuyoshi Nagamine; Takayuki Shirakawa; Yusuke Minato; Kentaro Torii; Hiroki Kobayashi; Masaya Imoto; Yasubumi Sakakibara
Journal:  PLoS Comput Biol       Date:  2009-06-05       Impact factor: 4.475

Review 2.  Computational methods in noncoding RNA research.

Authors:  Ariane Machado-Lima; Hernando A del Portillo; Alan Mitchell Durham
Journal:  J Math Biol       Date:  2007-09-04       Impact factor: 2.259

3.  Integrating microRNA target predictions for the discovery of gene regulatory networks: a semi-supervised ensemble learning approach.

Authors:  Gianvito Pio; Donato Malerba; Domenica D'Elia; Michelangelo Ceci
Journal:  BMC Bioinformatics       Date:  2014-01-10       Impact factor: 3.169

4.  Identification of non-coding RNAs with a new composite feature in the Hybrid Random Forest Ensemble algorithm.

Authors:  Supatcha Lertampaiporn; Chinae Thammarongtham; Chakarida Nukoolkit; Boonserm Kaewkamnerdpong; Marasri Ruengjitchatchawalya
Journal:  Nucleic Acids Res       Date:  2014-04-25       Impact factor: 16.971

5.  Machine learning-based differential network analysis: a study of stress-responsive transcriptomes in Arabidopsis.

Authors:  Chuang Ma; Mingming Xin; Kenneth A Feldmann; Xiangfeng Wang
Journal:  Plant Cell       Date:  2014-02-11       Impact factor: 11.277

6.  Learning gene regulatory networks from only positive and unlabeled data.

Authors:  Luigi Cerulo; Charles Elkan; Michele Ceccarelli
Journal:  BMC Bioinformatics       Date:  2010-05-05       Impact factor: 3.169

7.  Identification of type 2 diabetes-associated combination of SNPs using support vector machine.

Authors:  Hyo-Jeong Ban; Jee Yeon Heo; Kyung-Soo Oh; Keun-Joon Park
Journal:  BMC Genet       Date:  2010-04-23       Impact factor: 2.797

8.  Computational identification of protein methylation sites through bi-profile Bayes feature extraction.

Authors:  Jianlin Shao; Dong Xu; Sau-Na Tsai; Yifei Wang; Sai-Ming Ngai
Journal:  PLoS One       Date:  2009-03-17       Impact factor: 3.240

9.  De novo computational prediction of non-coding RNA genes in prokaryotic genomes.

Authors:  Thao T Tran; Fengfeng Zhou; Sarah Marshburn; Mark Stead; Sidney R Kushner; Ying Xu
Journal:  Bioinformatics       Date:  2009-09-10       Impact factor: 6.937

10.  Prevalence of transcription promoters within archaeal operons and coding sequences.

Authors:  Tie Koide; David J Reiss; J Christopher Bare; Wyming Lee Pang; Marc T Facciotti; Amy K Schmid; Min Pan; Bruz Marzolf; Phu T Van; Fang-Yin Lo; Abhishek Pratap; Eric W Deutsch; Amelia Peterson; Dan Martin; Nitin S Baliga
Journal:  Mol Syst Biol       Date:  2009-06-16       Impact factor: 11.429

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.