Literature DB >> 20546757

A classification-based prediction model of messenger RNA polyadenylation sites.

Guoli Ji1, Xiaohui Wu, Yingjia Shen, Jiangyin Huang, Qingshun Quinn Li.   

Abstract

Messenger RNA polyadenylation is one of the essential processing steps during eukaryotic gene expression. The site of polyadenylation [(poly(A) site] marks the end of a transcript, which is also the end of a gene. A computation program that is able to recognize poly(A) sites would not only prove useful for genome annotation in finding genes ends, but also for predicting alternative poly(A) sites. Features that define the poly(A) sites can now be extracted from the poly(A) site datasets to build such predictive models. Using methods, including K-gram pattern, Z-curve, position-specific scoring matrix and first-order inhomogeneous Markov sub-model, numerous features were generated and placed in an original feature space. To select the most useful features, attribute selection algorithms, such as information gain and entropy, were employed. A training model was then built based on the Bayesian network to determine a subset of the optimal features. Test models corresponding to the training models were built to predict poly(A) sites in Arabidopsis and rice. Thus, a prediction model, termed Poly(A) site classifier, or PAC, was constructed. The uniqueness of the model lies in its structure in that each sub-model can be replaced or expanded, while feature generation, selection and classification are all independent processes. Its modular design makes it easily adaptable to different species or datasets. The algorithm's high specificity and sensitivity were demonstrated by testing several datasets and, at the best combinations, they both reached 95%. The software package may be used for genome annotation and optimizing transgene structure. Copyright 2010 Elsevier Ltd. All rights reserved.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20546757     DOI: 10.1016/j.jtbi.2010.05.015

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  10 in total

1.  In silico prediction of mRNA poly(A) sites in Chlamydomonas reinhardtii.

Authors:  Xiaohui Wu; Guoli Ji; Yong Zeng
Journal:  Mol Genet Genomics       Date:  2012-10-30       Impact factor: 3.291

2.  Dragon PolyA Spotter: predictor of poly(A) motifs within human genomic DNA sequences.

Authors:  Manal Kalkatawi; Farania Rangkuti; Michael Schramm; Boris R Jankovic; Allan Kamau; Rajesh Chowdhary; John A C Archer; Vladimir B Bajic
Journal:  Bioinformatics       Date:  2011-11-15       Impact factor: 6.937

3.  Predict and Analyze Protein Glycation Sites with the mRMR and IFS Methods.

Authors:  Yan Liu; Wenxiang Gu; Wenyi Zhang; Jianan Wang
Journal:  Biomed Res Int       Date:  2015-04-15       Impact factor: 3.411

4.  Experimental Verification and Evolutionary Origin of 5'-UTR Polyadenylation Sites in Arabidopsis thaliana.

Authors:  Yingdong Zhu; Jack C Vaughn
Journal:  Front Plant Sci       Date:  2018-07-05       Impact factor: 5.753

5.  Some remarks on protein attribute prediction and pseudo amino acid composition.

Authors:  Kuo-Chen Chou
Journal:  J Theor Biol       Date:  2010-12-17       Impact factor: 2.691

Review 6.  Advances in the Bioinformatics Knowledge of mRNA Polyadenylation in Baculovirus Genes.

Authors:  Iván Gabriel Peros; Carolina Susana Cerrudo; Marcela Gabriela Pilloff; Mariano Nicolás Belaich; Mario Enrique Lozano; Pablo Daniel Ghiringhelli
Journal:  Viruses       Date:  2020-12-06       Impact factor: 5.048

7.  Poly(A) motif prediction using spectral latent features from human DNA sequences.

Authors:  Bo Xie; Boris R Jankovic; Vladimir B Bajic; Le Song; Xin Gao
Journal:  Bioinformatics       Date:  2013-07-01       Impact factor: 6.937

8.  A multispecies polyadenylation site model.

Authors:  Eric S Ho; Samuel I Gunderson; Siobain Duffy
Journal:  BMC Bioinformatics       Date:  2013-01-21       Impact factor: 3.169

9.  Motif types, motif locations and base composition patterns around the RNA polyadenylation site in microorganisms, plants and animals.

Authors:  Xiu-Qing Li; Donglei Du
Journal:  BMC Evol Biol       Date:  2014-07-23       Impact factor: 3.260

10.  RNA polyadenylation sites on the genomes of microorganisms, animals, and plants.

Authors:  Xiu-Qing Li; Donglei Du
Journal:  PLoS One       Date:  2013-11-18       Impact factor: 3.240

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.