Literature DB >> 15871459

Gene recognition based on nucleotide distribution of ORFs in a hyper-thermophilic crenarchaeon, Aeropyrum pernix K1.

Feng-Biao Guo1, Ju Wang, Chun-Ting Zhang.   

Abstract

The 2694 ORFs originally annotated as potential genes in the genome of Aeropyrum pernix can be categorized into three clusters (A, B, C), according to their nucleotide composition at three codon positions. Coding potential was found to be responsible for the phenomenon of three clusters in a 9-dimensional space derived from the nucleotide composition of ORFs: ORFs assigned to cluster A are coding ones, while those assigned to clusters B and C are non-coding ORFs. A "codingness" index called the AZ score is defined based on a clustering method used to recognize protein-coding genes in the A. pernix genome. The criterion for a coding or non-coding ORF is based on the AZ score. ORFs with AZ > 0 or AZ < 0 are coding or non-coding, respectively. Consequently, 620 out of 632 ORFs with putative functions based on the original annotation are contained in cluster A, which have positive AZ scores. In addition, all 29 ORFs encoding putative or conserved proteins newly added in RefSeq annotation also have positive AZ scores. Accordingly, the number of re-recognized protein-coding genes in the A. pernix genome is 1610, which is significantly less than 2694 in the original annotation and also much less than 1841 in the RefSeq annotation curated by NCBI staff. Annotation information of re-recognized genes and their AZ scores are available at: http://tubic.tju.edu.cn/Aper/.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15871459     DOI: 10.1093/dnares/11.6.361

Source DB:  PubMed          Journal:  DNA Res        ISSN: 1340-2838            Impact factor:   4.458


  9 in total

1.  Proteome driven re-evaluation and functional annotation of the Streptococcus pyogenes SF370 genome.

Authors:  Akira Okamoto; Keiko Yamada
Journal:  BMC Microbiol       Date:  2011-11-10       Impact factor: 3.605

2.  An integrative method for identifying the over-annotated protein-coding genes in microbial genomes.

Authors:  Jia-Feng Yu; Ke Xiao; Dong-Ke Jiang; Jing Guo; Ji-Hua Wang; Xiao Sun
Journal:  DNA Res       Date:  2011-09-08       Impact factor: 4.458

3.  Re-annotation of protein-coding genes in 10 complete genomes of Neisseriaceae family by combining similarity-based and composition-based methods.

Authors:  Feng-Biao Guo; Lifeng Xiong; Jade L L Teng; Kwok-Yung Yuen; Susanna K P Lau; Patrick C Y Woo
Journal:  DNA Res       Date:  2013-04-09       Impact factor: 4.458

4.  Re-annotation of protein-coding genes in the genome of saccharomyces cerevisiae based on support vector machines.

Authors:  Dan Lin; Xin Yin; Xianlong Wang; Peng Zhou; Feng-Biao Guo
Journal:  PLoS One       Date:  2013-07-10       Impact factor: 3.240

5.  MED: a new non-supervised gene prediction algorithm for bacterial and archaeal genomes.

Authors:  Huaiqiu Zhu; Gang-Qing Hu; Yi-Fan Yang; Jin Wang; Zhen-Su She
Journal:  BMC Bioinformatics       Date:  2007-03-16       Impact factor: 3.169

6.  Separate base usages of genes located on the leading and lagging strands in Chlamydia muridarum revealed by the Z curve method.

Authors:  Feng-Biao Guo; Xiu-Juan Yu
Journal:  BMC Genomics       Date:  2007-10-10       Impact factor: 3.969

7.  Recognition of Protein-coding Genes Based on Z-curve Algorithms.

Authors:  Feng -Biao Guo; Yan Lin; Ling -Ling Chen
Journal:  Curr Genomics       Date:  2014-04       Impact factor: 2.236

8.  A Brief Review: The Z-curve Theory and its Application in Genome Analysis.

Authors:  Ren Zhang; Chun-Ting Zhang
Journal:  Curr Genomics       Date:  2014-04       Impact factor: 2.236

9.  Two Family B DNA Polymerases From Aeropyrum pernix, Based on Revised Translational Frames.

Authors:  Katsuya Daimon; Sonoko Ishino; Namiko Imai; Sachiyo Nagumo; Takeshi Yamagami; Hiroaki Matsukawa; Yoshizumi Ishino
Journal:  Front Mol Biosci       Date:  2018-04-16
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.