| Literature DB >> 24904647 |
Mario López-Pérez1, Ana-Belen Martin-Cuadrado1, Francisco Rodriguez-Valera1.
Abstract
Different strains of the same prokaryotic species, even very similar ones, vary in large regions of their genomes. This flexible genome represents a huge reservoir of diversity that allows prokaryotes to exploit their environment efficiently. Most of the flexible genome is concentrated in genomic islands, some of which are present in all the strains and coding for similar functions but containing different genes. These replacement genomic islands are typically involved in exposed cellular structures, and their diversity has been connected to their recognition as targets by prokaryotic viruses (phages). We have compared genomes of closely related aquatic microbes from different origins and found examples of recent replacement of some of these flexible genomic islands. In all cases, that include Gram positive and negative bacteria and one archaeon, the replaced regions boundaries contain tell-tale peaks of increased, mostly synonymous, nucleotide substitutions. They tended to be sharper at the boundary closest to the origin of replication of the island. We will present the hypothesis that replacement flexible genomic islands are often exchanged by homologous recombination between different clonal frames. These recombination events are possibly selected due to the immediate reward provided by a change in the phage sensitivity spectrum.Entities:
Keywords: SNP; aquatic bacteria; genomic diversity; genomic island; homologous recombination; phage predation
Year: 2014 PMID: 24904647 PMCID: PMC4033161 DOI: 10.3389/fgene.2014.00147
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Comparison of fGIs found in different clonal frames of the same species.
| 11 | 615 | U8 | 98.4 | 30913 | 3.5 | 93 | 15 | 0.034 | 1.165 | 0.029 | FlgH | |||
| 7 | AMC35 | B565 | 97.8 | 19600 | 2.2 | 47 | 20 | 0.000 | 0.111 | 0.001 | DNA-binding protein | |||
| AMC35 | AER397 | 98.1 | 21516 | 2.4 | 47 | 37 | 0.000 | 0.100 | 0.001 | DNA-binding protein | ||||
| 9 | HTCC 1002 | HTCC 1062 | 97.6 | 13762 | 5.2 | 38 | 74 | 0.002 | 0.481 | 0.005 | PntB | |||
| 20 | O395 | O1 | 99.3 | 13592 | 2.2 | 29 | 10 | 0.000 | 0.100 | 0.001 | Dfp | |||
| 10 | MO6-24/O | CMCP6 | 98.9 | 34037 | 5.1 | 89 | 26 | 0.004 | 0.108 | 0.005 | lipoprotein | |||
| 25 | CNX481 | CNT857 | 98.2 | 2084 | 3.3 | 56 | 16 | – | – | – | Intergenic region | |||
| 11 | DE | DE1 | 98.6 | 23225 | 2.6 | 154 | 27 | 0.012 | 1.132 | 0.011 | UDP-GlcA epimerase | |||
| DE | AD1000-35-C2 | – | – | – | 57 | – | 0.003 | 0.302 | 0.010 | UDP-GlcA epimerase | ||||
| 14 | NCGM2.S1 | PAO1 | 98.6 | 53064 | 3.9 | 107 | 9 | 0.215 | 3.032 | 0.070 | FlgL | |||
| DK2 | M18 | 99.3 | 25709 | 2 | 108 | 11 | 0.203 | 2.987 | 0.060 | FlgL | ||||
| 4 | HSBQ001 | C23 | 98.7 | 34453 | 5.5 | 37 | 91 | 0.205 | 2.156 | |||||
SNP in a 500 bp window [(#SNPs/genome length) × 500];
Highest value of the previous parameter in the fGI
Data from López-Pérez et al. (2013);
draft genome; PntB, beta subunit of the NAD (P) transhydrogenase; Dfp, phosphopantothenoylcysteine decarboxylase; UDP-GlcA epimerase, UDP-glucuronic acid epimerase. OS, Origin side; TS, Terminus side.
Figure 1. The plot above the genomes indicates the number of SNPs in a 500 bp window. The average number of SNPs in the genome is indicated by a red dot line. The region enlarged in the gray box indicates the rate of non-synonymous (dN), synonymous (dS) substitutions and the dN/dS ratios for the gene which accumulates the largest number of SNPs that encodes a PntB: beta subunit of the NAD (P) transhydrogenase (marked with a yellow asterisk). The blue circle indicates the position of the gene cluster in the Candidatus Pelagibacter ubique HTCC1062 genome. The origin of replication (oriC) has been manually identified, while the terminus (Ter) is just located oposite (not identified). The red arrow shows the orientation in which the gene cluster is represented. Yellow lightning indicates the side of the island where the accumulation of SNPs is highest. OS, Origin Side; TS, Terminus Side.
Figure 2Identical gene cluster involved in the biosynthesis of the O-chain in aquatic/pathogenic bacteria. The plots above the lower genome indicate the number of SNPs in a 500 bp window. The average number of SNPs in the genome is indicated by a red dot line. Yellow star indicates position of the gene which has the highest number of SNPs and between brackets is shown the rate of non-synonymous (dN), synonymous (dS) substitutions and the dN/dS ratios. The colored circles show the position of the gene cluster in the genome. Location in the genome and site of maximum SNP numbers indicated like in Figure 1.
Figure 3Different versions of the same replacement fGI between pairs of strains of the same species. Designation of the fGI compared in each example are indicated to the left. Representation of the graphic is like in Figures 1, 2.
d.
| 615 | U8 | Flagellum | 0.0064 | 0.0410 | 0.1560 | |
| DE | DE1 | Flagellum | 0.0058 | 0.0590 | 0.0980 | |
| HTCC 1002 | HTCC 1062 | O-chain | 0.0100 | 0.0860 | 0.1160 | |
| O395 | O1 | O-chain | 0.0016 | 0.0300 | 0.0360 | |
| MO6-24/O | CMCP6 | O-chain | 0.0017 | 0.0300 | 0.0400 | |
| DK2 | M18 | Flagellum | 0.0005 | 0.0979 | 0.0049 | |
| HSBQ001 | C23 | S-layer | 0.0097 | 0.1025 | 0.0944 | |
fGI: flexible Genomic Island.