Literature DB >> 14571388

Using feature generation and feature selection for accurate prediction of translation initiation sites.

Fanfan Zeng1, Roland H C Yap, Limsoon Wong.   

Abstract

Correct prediction of the translation initiation site (TIS) is an important issue in genomic research. We show that feature generation together with correlation based feature selection can be used with a variety of machine learning algorithms to give highly accurate translation initiation site prediction. Only very few features are needed and the results achieve comparable accuracy to the best existing approaches. Our approach has the advantage that it does not require one to devise a special prediction method; rather standard machine learning classifiers are shown to give very good performance on the selected features. The raw and generated features which we have found to be important are the following: positions -3 and -1 in the sequence; upstream k-grams for k=3, 4, and 5; stop-codon frequency; downstream in-frame 3-gram; and the distance of ATG to the beginning of the sequence. The best result, with an overall accuracy of 90%, is obtained by selecting only seven features from this set. The same features retrained with the use of a scanning model achieves an overall accuracy of 94% on this dataset.

Entities:  

Mesh:

Substances:

Year:  2002        PMID: 14571388

Source DB:  PubMed          Journal:  Genome Inform        ISSN: 0919-9454


  7 in total

1.  Improvement in the prediction of the translation initiation site through balancing methods, inclusion of acquired knowledge and addition of features to sequences of mRNA.

Authors:  Lívia Márcia Silva; Felipe Carvalho de Souza Teixeira; José Miguel Ortega; Luis Enrique Zárate; Cristiane Neri Nobre
Journal:  BMC Genomics       Date:  2011-12-22       Impact factor: 3.969

2.  Automatic single-trial discrimination of mental arithmetic, mental singing and the no-control state from prefrontal activity: toward a three-state NIRS-BCI.

Authors:  Sarah D Power; Azadeh Kushki; Tom Chau
Journal:  BMC Res Notes       Date:  2012-03-13

3.  Feature selection for the prediction of translation initiation sites.

Authors:  Guo Liang Li; Tze Yun Leong
Journal:  Genomics Proteomics Bioinformatics       Date:  2005-05       Impact factor: 7.691

Review 4.  From shallow to deep: some lessons learned from application of machine learning for recognition of functional genomic elements in human genome.

Authors:  Boris Jankovic; Takashi Gojobori
Journal:  Hum Genomics       Date:  2022-02-18       Impact factor: 4.639

5.  Dragon TIS Spotter: an Arabidopsis-derived predictor of translation initiation sites in plants.

Authors:  Arturo Magana-Mora; Haitham Ashoor; Boris R Jankovic; Allan Kamau; Karim Awara; Rajesh Chowdhary; John A C Archer; Vladimir B Bajic
Journal:  Bioinformatics       Date:  2012-10-30       Impact factor: 6.937

6.  Bioinformatic analyses of mammalian 5'-UTR sequence properties of mRNAs predicts alternative translation initiation sites.

Authors:  Jill L Wegrzyn; Thomas M Drudge; Faramarz Valafar; Vivian Hook
Journal:  BMC Bioinformatics       Date:  2008-05-08       Impact factor: 3.169

7.  Stepwise approach for combining many sources of evidence for site-recognition in genomic sequences.

Authors:  Javier Pérez-Rodríguez; Nicolás García-Pedrajas
Journal:  BMC Bioinformatics       Date:  2016-03-05       Impact factor: 3.169

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.