Literature DB >> 8932382

Frequent oligonucleotides and peptides of the Haemophilus influenzae genome.

S Karlin1, J Mrázek, A M Campbell.   

Abstract

The complete Haemophilus influenzae genome (1.83 Mb, Rd strain) provides opportunities for characterizing global genomic inhomogeneities and for detecting important sequence signals. Along these lines, new methods for identifying frequent words (oligonucleotides and/or peptides) and their distributions are applied to the H.influenzae genome with some comparisons and contrasts made with frequent words of other bacterial genomes. Three major classes of frequent oligonucleotides stand out: (i) oligos related to the familiar uptake signal sequences (USSs), AAGTGCGGT (USS+) and its inverted complement (USS-), (ii) multiple tetranucleotide iterations and (iii) intergenic dyad sequences (ISDs) found as AAGCCCACCCTAC and its dyad form. The USS+ and USS- occur in almost equal counts, are remarkably evenly spaced around the genome, and appear predominantly in the same reading frame of protein coding domains (USS+ translated to Ser-Ala-Val, USS- translated to Thr-Ala-Leu). These observations suggest that USSs contribute to global genomic functions, for example, in replication and/or repair processes, or as membrane attachment sites, or as sequences helping to pack DNA. The long tetranucleotide iterations, virtually unique to H.influenzae (i.e., unknown in other prokaryotes), through polymerase slippage during replication and/or homologous recombination may produce subpopulations expressing alternative proteins. The 13 bp frequent IDS words, invariably intergenic, occur mostly in clusters and provide potential for complex secondary structures suggesting that these sequences may be important signals for regulating the activity of their flanking genes. The frequent oligopeptides of H.influenzae are principally of two kinds--those induced by oligonucleotide frequent words (USSs, tetranucleotide iterations), and those associated with ATP or GTP binding sites that are generally composed of three motifs: the A-box which contributes to delineating the binding pocket; the B-box which functions in hydrolysis; and the C-box whose function is unknown. The A-box occurs fairly universally in prokaryotes and eukaryotes. The B- and C-motifs appear to be specialized to various functional groups (e.g., transport, recombination, chaperone activity). Other putative motifs correspond to homologs of Escherichia coli motifs, for example, are associated with proteins of transcriptional processing, aminoacyl-tRNA synthetases and proteins functioning in electron transfer.

Entities:  

Mesh:

Substances:

Year:  1996        PMID: 8932382      PMCID: PMC146255          DOI: 10.1093/nar/24.21.4263

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  28 in total

1.  DNA repair and the evolution of transformation in Haemophilus influenzae.

Authors:  J A Mongold
Journal:  Genetics       Date:  1992-12       Impact factor: 4.562

2.  First and second moment of counts of words in random texts generated by Markov chains.

Authors:  J Kleffe; M Borodovsky
Journal:  Comput Appl Biosci       Date:  1992-10

Review 3.  Statistical methods and insights for protein and DNA sequences.

Authors:  S Karlin; P Bucher; V Brendel; S F Altschul
Journal:  Annu Rev Biophys Biophys Chem       Date:  1991

4.  Significant dispersed recurrent DNA sequences in the Escherichia coli genome. Several new groups.

Authors:  B E Blaisdell; K E Rudd; A Matin; S Karlin
Journal:  J Mol Biol       Date:  1993-02-20       Impact factor: 5.469

Review 5.  Adaptive evolution of highly mutable loci in pathogenic bacteria.

Authors:  E R Moxon; P B Rainey; M A Nowak; R E Lenski
Journal:  Curr Biol       Date:  1994-01-01       Impact factor: 10.834

Review 6.  Bacterial gene transfer by natural genetic transformation in the environment.

Authors:  M G Lorenz; W Wackernagel
Journal:  Microbiol Rev       Date:  1994-09

Review 7.  Organization of the bacterial chromosome.

Authors:  S Krawiec; M Riley
Journal:  Microbiol Rev       Date:  1990-12

8.  Cloning and expression in Escherichia coli of opc, the gene for an unusual class 5 outer membrane protein from Neisseria meningitidis (meningococci/surface antigen).

Authors:  A J Olyhoek; J Sarkari; M Bopp; G Morelli; M Achtman
Journal:  Microb Pathog       Date:  1991-10       Impact factor: 3.738

9.  Over- and under-representation of short oligonucleotides in DNA sequences.

Authors:  C Burge; A M Campbell; S Karlin
Journal:  Proc Natl Acad Sci U S A       Date:  1992-02-15       Impact factor: 11.205

10.  The role of a repetitive DNA motif (5'-CAAT-3') in the variable expression of the Haemophilus influenzae lipopolysaccharide epitope alpha Gal(1-4)beta Gal.

Authors:  N J High; M E Deadman; E R Moxon
Journal:  Mol Microbiol       Date:  1993-09       Impact factor: 3.501

View more
  29 in total

1.  Predicted highly expressed genes of diverse prokaryotic genomes.

Authors:  S Karlin; J Mrázek
Journal:  J Bacteriol       Date:  2000-09       Impact factor: 3.490

2.  Isolation of regulated genes of the cyanobacterium Synechocystis sp. strain PCC 6803 by differential display.

Authors:  D Bhaya; D Vaulot; P Amin; A W Takahashi; A R Grossman
Journal:  J Bacteriol       Date:  2000-10       Impact factor: 3.490

3.  Biased distribution of DNA uptake sequences towards genome maintenance genes.

Authors:  Tonje Davidsen; Einar A Rødland; Karin Lagesen; Erling Seeberg; Torbjørn Rognes; Tone Tønjum
Journal:  Nucleic Acids Res       Date:  2004-02-11       Impact factor: 16.971

4.  Bacterial DNA uptake sequences can accumulate by molecular drive alone.

Authors:  H Maughan; L A Wilson; R J Redfield
Journal:  Genetics       Date:  2010-07-13       Impact factor: 4.562

Review 5.  Statistical signals in bioinformatics.

Authors:  Samuel Karlin
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-12       Impact factor: 11.205

6.  Distinctive features of large complex virus genomes and proteomes.

Authors:  Jan Mrázek; Samuel Karlin
Journal:  Proc Natl Acad Sci U S A       Date:  2007-03-09       Impact factor: 11.205

7.  Simple sequence repeats in prokaryotic genomes.

Authors:  Jan Mrázek; Xiangxue Guo; Apurva Shah
Journal:  Proc Natl Acad Sci U S A       Date:  2007-05-07       Impact factor: 11.205

8.  Long simple sequence repeats in host-adapted pathogens localize near genes encoding antigens, housekeeping genes, and pseudogenes.

Authors:  Xiangxue Guo; Jan Mrázek
Journal:  J Mol Evol       Date:  2008-10-17       Impact factor: 2.395

9.  Evolutionary stability of DNA uptake signal sequences in the Pasteurellaceae.

Authors:  M Bakkali; T-Y Chen; H C Lee; R J Redfield
Journal:  Proc Natl Acad Sci U S A       Date:  2004-03-19       Impact factor: 11.205

10.  Initial proteome analysis of model microorganism Haemophilus influenzae strain Rd KW20.

Authors:  Eugene Kolker; Samuel Purvine; Michael Y Galperin; Serg Stolyar; David R Goodlett; Alexey I Nesvizhskii; Andrew Keller; Tao Xie; Jimmy K Eng; Eugene Yi; Leroy Hood; Alex F Picone; Tim Cherny; Brian C Tjaden; Andrew F Siegel; Thomas J Reilly; Kira S Makarova; Bernhard O Palsson; Arnold L Smith
Journal:  J Bacteriol       Date:  2003-08       Impact factor: 3.490

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.