Literature DB >> 27302911

A computational approach for prediction of donor splice sites with improved accuracy.

Prabina Kumar Meher1, Tanmaya Kumar Sahu2, A R Rao3, S D Wahi4.   

Abstract

Identification of splice sites is important due to their key role in predicting the exon-intron structure of protein coding genes. Though several approaches have been developed for the prediction of splice sites, further improvement in the prediction accuracy will help predict gene structure more accurately. This paper presents a computational approach for prediction of donor splice sites with higher accuracy. In this approach, true and false splice sites were first encoded into numeric vectors and then used as input in artificial neural network (ANN), support vector machine (SVM) and random forest (RF) for prediction. ANN and SVM were found to perform equally and better than RF, while tested on HS3D and NN269 datasets. Further, the performance of ANN, SVM and RF were analyzed by using an independent test set of 50 genes and found that the prediction accuracy of ANN was higher than that of SVM and RF. All the predictors achieved higher accuracy while compared with the existing methods like NNsplice, MEM, MDD, WMM, MM1, FSPLICE, GeneID and ASSP, using the independent test set. We have also developed an online prediction server (PreDOSS) available at http://cabgrid.res.in:8080/predoss, for prediction of donor splice sites using the proposed approach.
Copyright © 2016 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Conditional error; Di-nucleotide dependency; Machine learning; PreDOSS; Sequence encoding

Mesh:

Substances:

Year:  2016        PMID: 27302911     DOI: 10.1016/j.jtbi.2016.06.013

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  1 in total

1.  Improved recognition of splice sites in A. thaliana by incorporating secondary structure information into sequence-derived features: a computational study.

Authors:  Prabina Kumar Meher; Subhrajit Satpathy
Journal:  3 Biotech       Date:  2021-10-31       Impact factor: 2.406

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.