| Literature DB >> 16120222 |
Etienne Larsabal1, Antoine Danchin.
Abstract
BACKGROUND: The genomes of prokaryotes and lower eukaryotes display a very strong 11 bp periodic bias in the distribution of their nucleotides. This bias is present throughout a given genome, both in coding and non-coding sequences. Until now this bias remained of unknown origin.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16120222 PMCID: PMC1242344 DOI: 10.1186/1471-2105-6-206
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Deconvoluted correlation function of A following A in the genome of H. pylori. The correlation function has been treated so as to hide the most intense component of period 3 bp due to the presence of genes in the genome of H. pylori. After treatment, the function reveals a prevalent short-range component of period 11 bp. This component represents the prevalent short-range bias of period 11 bp in the distribution of nucleotides in the genome of H. pylori.
Figure 2Diagrammatic structure of class A flexible patterns. Class A flexible patterns belong to the category of flexible patterns. Here "flexible" means that there is limited variation in the exact position of their conserved nucleotides. This is shown in the figure by the green arrows, indicating that the position of the conserved nucleotides may vary from one occurrence of the pattern to the next. The particular class of flexible patterns depicted here in the standard 5'-3' orientation is composed of two sets of conserved nucleotides. First, the patterns are shaped by a skeleton of regularly repeated Ts or TTs every 10 bp to 11.5 bp, spanning a maximum of 50 bp. These are called "skeleton nucleotides" and are symbolized by the black and dark grey Ts. The peripheral repeats of the skeleton, in dark grey, are sometimes absent from a given occurrence. The Ts of the central part, spanning 20 bp on average, are always present. Furthermore, class A flexible patterns are composed of a set of "inner nucleotides". These conserved nucleotides are represented here in dark blue. They can be any nucleotide but never Ts. They are located between the Ts of the skeleton and in the central part only.
Figure 3A few identified class A flexible patterns. Ten related yet distinct class A flexible patterns common to different genomes have been identified so far. Their structures share common features, which are characteristic of class A flexible patterns. Peripheral repeats of the skeleton nucleotides of the patterns have not been represented here. Skeleton nucleotides are shown in black. Inner nucleotides are shown in dark blue.
Distribution of class A flexible patterns in genomes.
| X | X | X | Archaea; Crenarchaeota; Thermoprotei; Desulfurococcales | ||||||||
| X | X | X | X | Archaea; Crenarchaeota; Thermoprotei; Sulfolobales | |||||||
| X | X | X | Archaea; Crenarchaeota; Thermoprotei; Sulfolobales | ||||||||
| X | X | Archaea; Crenarchaeota; Thermoprotei; Thermoproteales | |||||||||
| X | X | X | Archaea; Euryarchaeota; Archaeoglobi; Archaeoglobales | ||||||||
| X | X | X | X | X | X | Archaea; Euryarchaeota; Methanosarcinales | |||||
| X | Archaea; Euryarchaeota; Halobacteriales | ||||||||||
| X | X | Archaea; Euryarchaeota; Methanobacteriales | |||||||||
| X | X | X | Archaea; Euryarchaeota; Methanococcales | ||||||||
| X | X | X | Archaea; Euryarchaeota; Thermococcales | ||||||||
| X | X | X | X | Archaea; Euryarchaeota; Thermococcales | |||||||
| X | X | X | Archaea; Euryarchaeota; Thermococcales | ||||||||
| X | X | Archaea; Euryarchaeota; Thermoplasmatales | |||||||||
| X | X | X | Bacteria; Actinobacteria; Actinomycetales | ||||||||
| X | X | X | Bacteria; Aquificae; Aquificales | ||||||||
| X | X | Bacteria; Chlorobi; Chlorobiales | |||||||||
| Bacteria; Cyanobacteria; Chroococcales | |||||||||||
| X | X | Bacteria; Deinococcus-Thermus; Deinococcales | |||||||||
| X | X | Bacteria; Firmicutes; Bacillales | |||||||||
| X | Bacteria; Firmicutes; Bacillales | ||||||||||
| X | X | Bacteria; Firmicutes; Bacillales | |||||||||
| X | X | Bacteria; Firmicutes; Clostridia; Thermoanaerobacteriales | |||||||||
| X | Bacteria; Firmicutes; Lactobacillales | ||||||||||
| X | Bacteria; Planctomycetes; Planctomycetales | ||||||||||
| X | X | Bacteria; Proteobacteria | |||||||||
| X | Bacteria; Proteobacteria; Alphaproteobacteria; Caulobacteriales | ||||||||||
| X | X | Bacteria; Proteobacteria; Alphaproteobacteria; Rhizobiales | |||||||||
| X | Bacteria; Proteobacteria; Alphaproteobacteria; Rhizobiales | ||||||||||
| X | X | X | Bacteria; Proteobacteria; Alphaproteobacteria; Rickettsialles | ||||||||
| X | X | X | X | X | Bacteria; Proteobacteria; Alphaproteobacteria; Rickettsialles | ||||||
| X | X | Bacteria; Proteobacteria; Betaproteobacteria; Burkholderiales | |||||||||
| X | Bacteria; Proteobacteria; Betaproteobacteria; Neisseriales | ||||||||||
| X | X | Bacteria; Proteobacteria; Epsilonproteobacteria; Campylobacterales | |||||||||
| X | X | X | X | Bacteria; Proteobacteria; Epsilonproteobacteria; Campylobacterales | |||||||
| X | X | X | Bacteria; Proteobacteria; Epsilonproteobacteria; Campylobacterales | ||||||||
| X | X | X | Bacteria; Proteobacteria; Epsilonproteobacteria; Campylobacterales | ||||||||
| X | X | Bacteria; Proteobacteria; Gammaproteobacteria; Alteromonadales | |||||||||
| X | Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales | ||||||||||
| X | Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales | ||||||||||
| X | Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales | ||||||||||
| X | Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales | ||||||||||
| X | Bacteria; Proteobacteria; Gammaproteobacteria; Legionellales | ||||||||||
| X | X | X | X | X | Bacteria; Proteobacteria; Gammaproteobacteria; Pasteurellales | ||||||
| X | X | Bacteria; Proteobacteria; Gammaproteobacteria; Pseudomonadales | |||||||||
| X | X | X | X | Bacteria; Proteobacteria; Gammaproteobacteria; Pseudomonadales | |||||||
| X | X | X | Bacteria; Proteobacteria; Gammaproteobacteria; Vibrionales | ||||||||
| X | X | X | X | X | Bacteria; Proteobacteria; Gammaproteobacteria; Xanthomonadales | ||||||
| X | Bacteria; Spirochaetes; Spirochaetales | ||||||||||
| X | X | Bacteria; Thermotogae; Thermotogales | |||||||||
| Eukaryota; Alveolata; Apicomplexa | |||||||||||
| X | X | Eukaryota; Fungi; Ascomycota | |||||||||
| Eukaryota; Fungi; Microsporidia | |||||||||||
| X | X | X | Eukaryota; Metazoa; Nematoda | ||||||||
| X | Virus; Enterobacteria phage T4 | ||||||||||
| X | Virus; Fusellovirus | ||||||||||
| Virus; Human herpesvirus 4 |
1. AxxxxTxxxxAxxxxTTxxxxxAxxxxTxxxxA
2. GxxxxTTxxxCxxxT
3. TTxxxGxxxTTxxxxGxxxxTT
4. TxxxxAGxxxTTxxxxxxxxT
5. TxxxxxxxxxxTxxxGAxxxTT
6. CxxxxxTTxxxCxxxxxxT
7. TxxxGCxGxT
8. TxxCxGxCxTT
9. GxxxxxTxxxxxAxxxxxT
10. TTTxxxCAxxxxxT
Figure 4The distribution of three types of class A flexible patterns is correlated to specific phylogenetic groups of organisms. We identified five class A flexible patterns distributed in many different organisms. Three of them, displayed here, show a distribution which can be related to the phylogeny.
The variations defining the five class A flexible patterns found in the genome of H. pylori
|p463pt|The variations defining the five class A flexible patterns found in the genome ofH. pylori
Figure 5The treated correlation function of G(G-). This correlation function of nucleotide A following A reveals biases generated by the part of the genome of H. pylori that do not contain occurrences of class A flexible patterns.
Figure 6The treated correlation function of G(G+). This correlation function of nucleotide A following A reveals biases in the genome of H. pylori which are generated by the occurrences of class A flexible patterns in its genome.
Figure 7Accessibility of class A flexible patterns. There are two opposed sides from which nucleotides composing this occurrence of this given class A flexible pattern are accessible. The dinucleotides are visible through major grooves only from the upper side and hence fully accessible from this side only. Hence a given occurrence of a given class A flexible pattern in a genome is only accessible from one side of the DNA molecule.