Literature DB >> 17509616

Prediction of protein coding regions by the 3-base periodicity analysis of a DNA sequence.

Changchuan Yin1, Stephen S-T Yau.   

Abstract

With the exponential growth of genomic sequences, there is an increasing demand to accurately identify protein coding regions (exons) from genomic sequences. Despite many progresses being made in the identification of protein coding regions by computational methods during the last two decades, the performances and efficiencies of the prediction methods still need to be improved. In addition, it is indispensable to develop different prediction methods since combining different methods may greatly improve the prediction accuracy. A new method to predict protein coding regions is developed in this paper based on the fact that most of exon sequences have a 3-base periodicity, while intron sequences do not have this unique feature. The method computes the 3-base periodicity and the background noise of the stepwise DNA segments of the target DNA sequences using nucleotide distributions in the three codon positions of the DNA sequences. Exon and intron sequences can be identified from trends of the ratio of the 3-base periodicity to the background noise in the DNA sequences. Case studies on genes from different organisms show that this method is an effective approach for exon prediction.

Mesh:

Year:  2007        PMID: 17509616     DOI: 10.1016/j.jtbi.2007.03.038

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  28 in total

1.  SNR of DNA sequences mapped by general affine transformations of the indicator sequences.

Authors:  Jianfeng Shao; Xiaohua Yan; Shuo Shao
Journal:  J Math Biol       Date:  2012-07-21       Impact factor: 2.259

2.  Periodic power spectrum with applications in detection of latent periodicities in DNA sequences.

Authors:  Changchuan Yin; Jiasong Wang
Journal:  J Math Biol       Date:  2016-03-04       Impact factor: 2.259

3.  Design of high-performance parallelized gene predictors in MATLAB.

Authors:  Sylvain Robert Rivard; Jean-Gabriel Mailloux; Rachid Beguenane; Hung Tien Bui
Journal:  BMC Res Notes       Date:  2012-04-10

4.  Sequence Maneuverer: tool for sequence extraction from genomes.

Authors:  Tayyaba Yasmin; Inayat Ur Rehman; Adnan Ahmad Ansari; Khurrum Liaqat; Muhammad Irfan Khan
Journal:  Bioinformation       Date:  2012-12-19

5.  Patterns of nucleotides that flank substitutions in human orthologous genes.

Authors:  Lei Ma; Tingting Zhang; Zhuoran Huang; Xiaoqian Jiang; Shiheng Tao
Journal:  BMC Genomics       Date:  2010-07-05       Impact factor: 3.969

6.  Visualization of the protein-coding regions with a self adaptive spectral rotation approach.

Authors:  Bo Chen; Ping Ji
Journal:  Nucleic Acids Res       Date:  2010-10-14       Impact factor: 16.971

7.  The 3-base periodicity and codon usage of coding sequences are correlated with gene expression at the level of transcription elongation.

Authors:  Edoardo Trotta
Journal:  PLoS One       Date:  2011-06-28       Impact factor: 3.240

8.  Hierarchical structure of cascade of primary and secondary periodicities in Fourier power spectrum of alphoid higher order repeats.

Authors:  Vladimir Paar; Nenad Pavin; Ivan Basar; Marija Rosandić; Matko Gluncić; Nils Paar
Journal:  BMC Bioinformatics       Date:  2008-11-03       Impact factor: 3.169

9.  Categorical spectral analysis of periodicity in human and viral genomes.

Authors:  Elizabeth D Howe; Jun S Song
Journal:  Nucleic Acids Res       Date:  2012-12-14       Impact factor: 16.971

10.  [Chromosomal localization and molecular organization of human genomic fragment containing TNF/LT locus in transgenic mice].

Authors:  A R Galimov; A A Kruglov; N L Bol'sheva; O Iu Iurkevich; D Ia Lipin'sh; I A Mufazalov; D V Kuprash; S A Nedospasov
Journal:  Mol Biol (Mosk)       Date:  2008 Jul-Aug
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.