
[Classification of triplet periodicity in DNA sequences of genes taken from KEGG databank].

F E Frenkel', E V Korotkov.   

Abstract

We classified 472,288 regions of triplet periodicity found in 578,868 genes from release 29 of the KEGG databank. We introduce the concept of a triplet periodicity class and a measure of similarity between classes. In total, 2520 classes were created, containing 94% of the triplet periodicity regions found. In 92% of the regions assigned to classes, triplet periodicity is linked to the reading frame in the same way. In the remaining regions, the reading frame of the gene is shifted relative to the reading frame common to the majority of genes in the class. These regions were translated into hypothetical amino acid sequences using the reading frame defined by the triplet periodicity class. BLAST searches showed that 2660 of these hypothetical amino acid sequences have statistically significant similarity to proteins in the UniProt databank. We suppose that the 8% of classified triplet periodicity regions with a shifted frame arose through reading frame shift mutations. The classes of triplet periodicity can be used to identify coding regions of genes and to search for mutations caused by reading frame shifts.
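The abstract does not define the periodicity matrix or the similarity measure, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a region can be summarized as a 3x4 matrix of base frequencies per codon position, and it compares a region with a class matrix using a plain L1 distance chosen purely for illustration. The names `periodicity_matrix`, `distance`, and `best_phase` are hypothetical.

```python
BASES = "ACGT"

def periodicity_matrix(seq, phase=0):
    """Return a 3x4 matrix of base frequencies for each codon position.

    `phase` is the offset (0, 1 or 2) of the first codon position
    relative to the start of `seq`.
    """
    counts = [[0] * 4 for _ in range(3)]
    for i, base in enumerate(seq.upper()):
        if base in BASES:  # skip ambiguous symbols such as N
            counts[(i - phase) % 3][BASES.index(base)] += 1
    matrix = []
    for row in counts:
        total = sum(row)
        matrix.append([c / total if total else 0.0 for c in row])
    return matrix

def distance(m1, m2):
    """Illustrative L1 distance between two 3x4 matrices (an assumption)."""
    return sum(abs(a - b) for r1, r2 in zip(m1, m2) for a, b in zip(r1, r2))

def best_phase(seq, class_matrix):
    """Return the phase (0, 1 or 2) at which `seq` best matches the class."""
    return min(
        range(3),
        key=lambda p: distance(periodicity_matrix(seq, p), class_matrix),
    )

# Toy class built from a perfectly periodic sequence.
class_matrix = periodicity_matrix("ATG" * 30)
in_frame = "ATG" * 30
shifted = "G" + "ATG" * 30  # one extra base moves the periodicity by one position
```

In this toy setup, a region whose best phase differs from the annotated reading frame of its gene would be a candidate for the kind of frame shift the abstract describes.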

Year:  2008        PMID: 18856072

Source DB:  PubMed          Journal:  Mol Biol (Mosk)        ISSN: 0026-8984


  2 in total

1.  Prediction of Sphingosine protein-coding regions with a self adaptive spectral rotation method.

Authors:  Zhongwei Li; Yanan Guan; Xiang Yuan; Pan Zheng; Hu Zhu
Journal:  PLoS One       Date:  2019-04-03       Impact factor: 3.240

2.  Search for potential reading frameshifts in cds from Arabidopsis thaliana and other genomes.

Authors:  Y M Suvorova; M A Korotkova; K G Skryabin; E V Korotkov
Journal:  DNA Res       Date:  2019-04-01       Impact factor: 4.458
