Literature DB >> 1741388

Over- and under-representation of short oligonucleotides in DNA sequences.

C Burge1, A M Campbell, S Karlin.   

Abstract

Strand-symmetric relative abundance functionals for di-, tri-, and tetranucleotides are introduced and applied to sequences encompassing a broad phylogenetic range to discern tendencies and anomalies in the occurrences of these short oligonucleotides within and between genomic sequences. For dinucleotides, TA is almost universally under-represented, with the exception of vertebrate mitochondrial genomes, and CG is strongly under-represented in vertebrates and in mitochondrial genomes. The traditional methylation/deamination/mutation hypothesis for the rarity of CG does not adequately account for the observed deficiencies in certain sequences, notably the mitochondrial genomes, yeast, and Neurospora crassa, which lack the standard CpG methylase. Homodinucleotides (AA.TT, CC.GG) and larger homooligonucleotides are over-represented in many organisms, perhaps due to polymerase slippage events. For trinucleotides, GCA.TGC tends to be under-represented in phage, human viral, and eukaryotic sequences, and CTA.TAG is strongly under-represented in many prokaryotic, eukaryotic, and viral sequences. The CCA.TGG triplet is ubiquitously over-represented in human viral and eukaryotic sequences. Among the tetranucleotides, several four-base-pair palindromes tend to be under-represented in phage sequences, probably as a means of restriction avoidance. The tetranucleotide CTAG is observed to be rare in virtually all bacterial genomes and some phage genomes. Explanations for these over- and under-representations in terms of DNA/RNA structures and regulatory mechanisms are considered.

Entities:  

Mesh:

Substances:

Year:  1992        PMID: 1741388      PMCID: PMC48449          DOI: 10.1073/pnas.89.4.1358

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  21 in total

1.  Recombination in the lambda repressor gene: evidence that very short patch (VSP) mismatch correction restores a specific sequence.

Authors:  M Lieb
Journal:  Mol Gen Genet       Date:  1985

2.  Evolution of the genome and the genetic code: selection at the dinucleotide level by methylation and polyribonucleotide cleavage.

Authors:  E Beutler; T Gelbart; J H Han; J A Koziol; B Beutler
Journal:  Proc Natl Acad Sci U S A       Date:  1989-01       Impact factor: 11.205

3.  Three-dimensional crystal structures of Escherichia coli met repressor with and without corepressor.

Authors:  J B Rafferty; W S Somers; I Saint-Girons; S E Phillips
Journal:  Nature       Date:  1989-10-26       Impact factor: 49.962

4.  Crystal structure of trp repressor/operator complex at atomic resolution.

Authors:  Z Otwinowski; R W Schevitz; R G Zhang; C L Lawson; A Joachimiak; R Q Marmorstein; B F Luisi; P B Sigler
Journal:  Nature       Date:  1988-09-22       Impact factor: 49.962

Review 5.  Compositional patterns in vertebrate genomes: conservation and change in evolution.

Authors:  G Bernardi; D Mouchiroud; C Gautier; G Bernardi
Journal:  J Mol Evol       Date:  1988 Dec-1989 Feb       Impact factor: 2.395

6.  Deviations from expected frequencies of CpG dinucleotides in herpesvirus DNAs may be diagnostic of differences in the states of their latent genomes.

Authors:  R W Honess; U A Gompels; B G Barrell; M Craxton; K R Cameron; R Staden; Y N Chang; G S Hayward
Journal:  J Gen Virol       Date:  1989-04       Impact factor: 3.891

7.  Genome inhomogeneity is determined mainly by WW and SS dinucleotides.

Authors:  C G Kozhukhin; P A Pevzner
Journal:  Comput Appl Biosci       Date:  1991-01

Review 8.  Premeiotic instability of repeated sequences in Neurospora crassa.

Authors:  E U Selker
Journal:  Annu Rev Genet       Date:  1990       Impact factor: 16.830

9.  Universal rule for coding sequence construction: TA/CG deficiency-TG/CT excess.

Authors:  S Ohno
Journal:  Proc Natl Acad Sci U S A       Date:  1988-12       Impact factor: 11.205

View more
  145 in total

Review 1.  The influence of base sequence on the immunostimulatory properties of DNA.

Authors:  D S Pisetsky
Journal:  Immunol Res       Date:  1999       Impact factor: 2.829

Review 2.  The role of immunostimulatory CpG-DNA in septic shock.

Authors:  H Wagner; G B Lipford; H Häcker
Journal:  Springer Semin Immunopathol       Date:  2000

3.  A computational approach to identify genes for functional RNAs in genomic sequences.

Authors:  R J Carter; I Dubchak; S R Holbrook
Journal:  Nucleic Acids Res       Date:  2001-10-01       Impact factor: 16.971

4.  Evolutionary implications of microbial genome tetranucleotide frequency biases.

Authors:  David T Pride; Richard J Meinersmann; Trudy M Wassenaar; Martin J Blaser
Journal:  Genome Res       Date:  2003-02       Impact factor: 9.043

5.  Primer-template interactions during DNA amplification fingerprinting with single arbitrary oligonucleotides.

Authors:  G Caetano-Anollés; B J Bassam; P M Gresshoff
Journal:  Mol Gen Genet       Date:  1992-11

6.  An in vitro strategy for the selective isolation of anomalous DNA from prokaryotic genomes.

Authors:  M W J van Passel; A Bart; R J A Waaijer; A C M Luyf; A H C van Kampen; A van der Ende
Journal:  Nucleic Acids Res       Date:  2004-08-10       Impact factor: 16.971

7.  Codon usage bias from tRNA's point of view: redundancy, specialization, and efficient decoding for translation optimization.

Authors:  Eduardo P C Rocha
Journal:  Genome Res       Date:  2004-10-12       Impact factor: 9.043

8.  Overlapping codes within protein-coding sequences.

Authors:  Shalev Itzkovitz; Eran Hodis; Eran Segal
Journal:  Genome Res       Date:  2010-09-14       Impact factor: 9.043

9.  Synthesis of signals for de novo DNA methylation in Neurospora crassa.

Authors:  Hisashi Tamaru; Eric U Selker
Journal:  Mol Cell Biol       Date:  2003-04       Impact factor: 4.272

10.  Compositional heterogeneity of the Escherichia coli genome: a role for VSP repair?

Authors:  G Gutiérrez; J Casadesús; J L Oliver; A Marín
Journal:  J Mol Evol       Date:  1994-10       Impact factor: 2.395

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.