Literature DB >> 9330906

Differences in dinucleotide frequencies of human, yeast, and Escherichia coli genes.

H Nakashima1, K Nishikawa, T Ooi.   

Abstract

Nucleotide sequences coding proteins in human, yeast and Escherichia coli genes were analyzed in terms of dinucleotide occurrences. Every gene is plotted as a point in the dinucleotide space, which is spanned by 16 axes corresponding to the 16 components of the dinucleotide. The metric unit in the space is defined using the log-odds ratio of dinucleotide occurrences in a gene. The distribution of points showed that genes from the same organism are clustered in the space. The clusters of human and E. coli are completely separated, and the yeast cluster sits between, implying that individual genes are classified into the three sources from their location. In fact, they could be identified with accuracy of 90%, using the DNA data alone. Even genes encoding homologous proteins belonging to the same protein superfamily were discriminated by the DNA data, and were correctly identified into their sources with the same accuracy as above. DNA sequences of non-coding regions, including human introns, as well as human genes of GC-rich and GC-poor types, were also analyzed in the same manner. The most significant finding is that human genomic DNA sequences, including genes and introns together, exhibit the largest deviation of dinucleotide occurrence from the random expectation. Possible origins for this phenomenon are discussed.

Entities:  

Mesh:

Year:  1997        PMID: 9330906     DOI: 10.1093/dnares/4.3.185

Source DB:  PubMed          Journal:  DNA Res        ISSN: 1340-2838            Impact factor:   4.458


  10 in total

1.  Capturing whole-genome characteristics in short sequences using a naïve Bayesian classifier.

Authors:  R Sandberg; G Winberg; C I Bränden; A Kaske; I Ernberg; J Cöster
Journal:  Genome Res       Date:  2001-08       Impact factor: 9.043

Review 2.  An overview of dinucleotide and codon usage in all viruses.

Authors:  Diego Simón; Juan Cristina; Héctor Musto
Journal:  Arch Virol       Date:  2022-04-25       Impact factor: 2.574

3.  Comparative analysis of machine learning algorithms on the microbial strain-specific AMP prediction.

Authors:  Boris Vishnepolsky; Maya Grigolava; Grigol Managadze; Andrei Gabrielian; Alex Rosenthal; Darrell E Hurt; Michael Tartakovsky; Malak Pirtskhalava
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

4.  Circuit assemblages derived from net dinucleotide values provide a succinct identity for the HIV-1 genome and each of its genes.

Authors:  Dorothy M Lang
Journal:  Virus Genes       Date:  2007-11-07       Impact factor: 2.332

5.  Oligonucleotide frequencies of barcoding loci can discriminate species across kingdoms.

Authors:  Antariksh Tyagi; Sumit K Bag; Virendra Shukla; Sribash Roy; Rakesh Tuli
Journal:  PLoS One       Date:  2010-08-20       Impact factor: 3.240

6.  Variable-order sequence modeling improves bacterial strain discrimination for Ion Torrent DNA reads.

Authors:  Thomas M Poulsen; Martin Frith
Journal:  BMC Bioinformatics       Date:  2017-06-12       Impact factor: 3.169

7.  Folate Deficiency Triggers the Abnormal Segregation of a Region With Large Cluster of CG-Rich Trinucleotide Repeats on Human Chromosome 2.

Authors:  Lorenza Garribba; Ivan Vogel; Mads Lerdrup; Marisa M Gonçalves Dinis; Liqun Ren; Ying Liu
Journal:  Front Genet       Date:  2021-07-01       Impact factor: 4.599

8.  Large scale comparative codon-pair context analysis unveils general rules that fine-tune evolution of mRNA primary structure.

Authors:  Gabriela Moura; Miguel Pinheiro; Joel Arrais; Ana Cristina Gomes; Laura Carreto; Adelaide Freitas; José L Oliveira; Manuel A S Santos
Journal:  PLoS One       Date:  2007-09-05       Impact factor: 3.240

9.  Detecting horizontally transferred and essential genes based on dinucleotide relative abundance.

Authors:  Robert H Baran; Hanseok Ko
Journal:  DNA Res       Date:  2008-09-16       Impact factor: 4.458

10.  In silico identification of pseudo-exon activation events in personal genome and transcriptome data.

Authors:  Narumi Sakaguchi; Mikita Suyama
Journal:  RNA Biol       Date:  2020-08-30       Impact factor: 4.652

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.