| Literature DB >> 26516092 |
Koji Yahara1, Xavier Didelot2, Keith A Jolley3, Ichizo Kobayashi4, Martin C J Maiden3, Samuel K Sheppard5, Daniel Falush6.
Abstract
Recombination enhances the adaptive potential of organisms by allowing genetic variants to be tested on multiple genomic backgrounds. Its distribution in the genome can provide insight into the evolutionary forces that underlie traits, such as the emergence of pathogenicity. Here, we examined landscapes of realized homologous recombination of 500 genomes from ten bacterial species and found all species have "hot" regions with elevated rates relative to the genome average. We examined the size, gene content, and chromosomal features associated with these regions and the correlations between closely related species. The recombination landscape is variable and evolves rapidly. For example in Salmonella, only short regions of around 1 kb in length are hot whereas in the closely related species Escherichia coli, some hot regions exceed 100 kb, spanning many genes. Only Streptococcus pyogenes shows evidence for the positive correlation between GC content and recombination that has been reported for several eukaryotes. Genes with function related to the cell surface/membrane are often found in recombination hot regions but E. coli is the only species where genes annotated as "virulence associated" are consistently hotter. There is also evidence that some genes with "housekeeping" functions tend to be overrepresented in cold regions. For example, ribosomal proteins showed low recombination in all of the species. Among specific genes, transferrin-binding proteins are recombination hot in all three of the species in which they were found, and are subject to interspecies recombination.Entities:
Keywords: pathogenicity; population genomics; recombination; selection
Mesh:
Year: 2015 PMID: 26516092 PMCID: PMC4866539 DOI: 10.1093/molbev/msv237
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
FLandscapes of homologous recombination in bacterial species. Left: For each species, values of the per-site statistic (H) reflecting relative intensity of recombination at a site (nucleotide) are plotted along the reference genome of each species (supplementary table S1, Supplementary Material online). Some regions devoid of points indicate absence of SNPs for calculation of H because the alignment was not obtained in the regions. Locations of some recombination hot regions which are mentioned in the text or table 1 are indicated by letter. Right: Distance-dependence of the per-site statistic is shown in which x axis is distance between SNPs (i, j) and y axis is mean magnitude of the absolute difference of the H (normalized D) and H.
Notable Recombination Hot Genes.
| Species | Recombination Hot Gene | Notes | |
|---|---|---|---|
| Genes in the | Known recombination hot regions | ||
| Outer membrane protein | The fimbrial gene cluster safABCD locating in the centisome 7 genomic island (SCI) ( | ||
| Fimbrial assembly chaperone | |||
| An outer membrane usher protein | |||
| Ferrichrome outer membrane transporter | A ferrichrome uptake system | ||
| Iron-hydroxamate transporter ATP-binding subunit | |||
| Iron-hydroxamate transporter substrate-binding subunit | |||
| Iron-hydroxamate transporter permease subunit | |||
| Thiol-disulfide interchange protein | Virulence factor ( | ||
| Transferrin-binding protein A | See the main text | ||
| Transferrin-binding protein B | |||
| HI1217 | Haem-utilization protein (hup) | Involving acquisition of heme that is not synthesized in | |
| Heme-hemopexin utilization protein A | |||
| Heme-hemopexin utilization protein B | |||
| Major outer membrane protein, PorB | Multiple roles during infection ( | ||
| Transferrin-binding protein A | See the main text | ||
| Transferrin-binding protein B | |||
| PL1861 | A predicted pilus-associated protein | The highest peak in | |
| Lactoferrin-binding protein A | Vaccine candidates as the iron-limitation-inducible outer membrane proteins ( | ||
| Lactoferrin-binding protein B | |||
| Outer membrane protein porin PorA | The two variable regions defining subtypes of this species and eliciting specific bactericidal antibodies in human, which makes it a major component of a number of meningococcal vaccines ( | ||
| Transferrin-binding protein A | See the main text | ||
| Transferrin-binding protein B | |||
| Flagellar hook protein FlgE | A novel recombination hot region which was not reported in | ||
| S-layer proteins on the cell surface | See the main text | ||
| Adhesin | Facilitating bacterial binding to a host ( | ||
| Adhesin | Interaction with the extracellular matrix of a host ( | ||
| SP_0346 | Capsular polysaccharide biosynthesis protein Cps4A | Capsule biosynthesis gene cluster ( | |
| SP_0347 | Capsular polysaccharide biosynthesis protein Cps4B | ||
| SP_0348 | Capsular polysaccharide biosynthesis protein Cps4C | ||
| Dihydrofolate reductase | Recombination hot ( | ||
| 16S rRNA adenine dimethyltransferase | Its inactivation causes antibiotic resistance ( | ||
FRelation in intensity of recombination between closely related species. Each dot indicates an one-to-one orthologous gene shared between the species. X and Y axis indicate average values of H per orthologous gene in each species.
FBroad-scale relation between GC content and H. Each dot corresponds to a gene. Y axis is average H per gene. Correlation coefficient (r) is indicated.
FRelationship between average nucleotide diversity and H per gene for the virulence genes and other genes in each species. Correlation coefficient (r) is indicated. The regression lines compare levels of recombination in the virulence genes and other genes after controlling for the effect of nucleotide diversity.
F(a) Relative intensity of recombination in each functional category in each species. Each cell indicates median of H. The rows are sorted by average of the medians of each category across the species (in the most right column). Cells circled by the black rectangles mean presence of a recombination hot gene in the categories. The numbers in the left indicate average number of genes in each category. Gray cells indicate absence of genes in a category of a species. (b) Low level of recombination in genes of ribosomal proteins compared with others across the ten species. Each orange x-mark (ribosomal) or black dot (others) corresponds to median of H of a functional category of a species in figure 5a. The regression lines show low level of recombination in genes of ribosomal proteins after controlling for the effect of nucleotide diversity.