Literature DB >> 3134115

Statistical method for predicting protein coding regions in nucleic acid sequences.

G Fichant1, C Gautier.   

Abstract

Protein coding regions of a genome fragment can be mathematically predicted by studying variations in the statistical properties or by searching the signals characteristic of the junctions between the coding and non-coding regions. We propose here a new statistical method using correspondence analysis. This method does not use any reference codon set but takes into account the codon usage homogeneity along the studied genome fragment. Comparison with previously published methods especially the 'codon usage method' of Staden has been made, and two examples are presented here. Applications to analysis of prokaryotic operon and eukaryotic split genes are also discussed. Use of the method has also shown two structures not previously described: i) in the human prt gene, a strong triplet structure exists in a non-coding region; ii) in the human tp-a codon usage is not uniform between the different exons.

Entities:  

Mesh:

Substances:

Year:  1987        PMID: 3134115     DOI: 10.1093/bioinformatics/3.4.287

Source DB:  PubMed          Journal:  Comput Appl Biosci        ISSN: 0266-7061


  11 in total

1.  Use and misuse of correspondence analysis in codon usage studies.

Authors:  Guy Perrière; Jean Thioulouse
Journal:  Nucleic Acids Res       Date:  2002-10-15       Impact factor: 16.971

Review 2.  Assessment of protein coding measures.

Authors:  J W Fickett; C S Tung
Journal:  Nucleic Acids Res       Date:  1992-12-25       Impact factor: 16.971

3.  Metagenomic Classification Using an Abstraction Augmented Markov Model.

Authors:  Xiujun Sylvia Zhu; Monnie McGee
Journal:  J Comput Biol       Date:  2015-11-30       Impact factor: 1.479

4.  Chaos game representation of gene structure.

Authors:  H J Jeffrey
Journal:  Nucleic Acids Res       Date:  1990-04-25       Impact factor: 16.971

5.  A frameshift error detection algorithm for DNA sequencing projects.

Authors:  G A Fichant; Y Quentin
Journal:  Nucleic Acids Res       Date:  1995-08-11       Impact factor: 16.971

6.  NRSub: a non-redundant database for Bacillus subtilis.

Authors:  G Perrière; I Moszer; T Gojobori
Journal:  Nucleic Acids Res       Date:  1996-01-01       Impact factor: 16.971

7.  Intrinsic and extrinsic approaches for detecting genes in a bacterial genome.

Authors:  M Borodovsky; K E Rudd; E V Koonin
Journal:  Nucleic Acids Res       Date:  1994-11-11       Impact factor: 16.971

8.  A novel hierarchical clustering algorithm for gene sequences.

Authors:  Dan Wei; Qingshan Jiang; Yanjie Wei; Shengrui Wang
Journal:  BMC Bioinformatics       Date:  2012-07-23       Impact factor: 3.169

9.  Integrating overlapping structures and background information of words significantly improves biological sequence comparison.

Authors:  Qi Dai; Lihua Li; Xiaoqing Liu; Yuhua Yao; Fukun Zhao; Michael Zhang
Journal:  PLoS One       Date:  2011-11-10       Impact factor: 3.240

10.  Comparison study on k-word statistical measures for protein: from sequence to 'sequence space'.

Authors:  Qi Dai; Tianming Wang
Journal:  BMC Bioinformatics       Date:  2008-09-23       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.