| Literature DB >> 32665274 |
Roman Kogay1, Yuri I Wolf2, Eugene V Koonin3, Olga Zhaxybayeva4,5.
Abstract
Gene transfer agents (GTAs) are virus-like elements integrated into bacterial genomes, particularly, those of Alphaproteobacteria The GTAs can be induced under conditions of nutritional stress, incorporate random fragments of bacterial DNA into miniphage particles, lyse the host cells, and infect neighboring bacteria, thus enhancing horizontal gene transfer. We show that GTA genes evolve under conditions of pronounced positive selection for the reduction of the energy cost of protein production as shown by comparison of the amino acid compositions with those of both homologous viral genes and host genes. The energy saving in GTA genes is comparable to or even more pronounced than that in the genes encoding the most abundant, essential bacterial proteins. In cases in which viruses acquire genes from GTAs, the bias in amino acid composition disappears in the course of evolution, showing that reduction of the energy cost of protein production is an important factor of evolution of GTAs but not bacterial viruses. These findings strongly suggest that GTAs represent bacterial adaptations rather than selfish, virus-like elements. Because GTA production kills the host cell and does not propagate the GTA genome, it appears likely that the GTAs are retained in the course of evolution via kin or group selection. Therefore, we hypothesize that GTAs facilitate the survival of bacterial populations under energy-limiting conditions through the spread of metabolic and transport capabilities via horizontal gene transfer and increases in nutrient availability resulting from the altruistic suicide of GTA-producing cells.IMPORTANCE Kin selection and group selection remain controversial topics in evolutionary biology. We argue that these types of selection are likely to operate in bacterial populations by showing that bacterial gene transfer agents (GTAs), but not related viruses, evolve under conditions of positive selection for the reduction of the energy cost of GTA particle production. We hypothesize that GTAs are dedicated devices mediating the survival of bacteria under conditions of nutrient limitation. The benefits conferred by GTAs under nutritional stress conditions appear to include horizontal dissemination of genes that could provide bacteria with enhanced capabilities for nutrient utilization and increases of nutrient availability occurring through the lysis of GTA-producing bacteria.Entities:
Keywords: GTA; alphaproteobacteria; bacteriophages; energy saving; gene transfer agents; metabolic efficiency; nutrient depletion; positive selection; virus exaptation
Mesh:
Substances:
Year: 2020 PMID: 32665274 PMCID: PMC7360931 DOI: 10.1128/mBio.01206-20
Source DB: PubMed Journal: mBio Impact factor: 7.867
FIG 1GC1, GC2, and GC3 content of GTA regions and their immediate neighborhoods and all protein-coding genes in 212 alphaproteobacterial genomes. The neighborhoods immediately upstream and downstream of a GTA region consist of 17 genes each. Box plots represent median values bounded by the first and third quartiles. Whiskers show the values that lie in the range of the 1.5 × interquartile rule. Dots outside the whiskers represent the outliers.
FIG 2Carbon content (A) and biosynthetic cost (B) of proteins encoded by GTA genes in 212 alphaproteobacterial genomes and their viral homologs. Box plots represent median values that are bounded by the first and third quartiles. Whiskers show the values that lie in the range of the 1.5 × interquartile rule. Dots outside the whiskers represent the outliers. The number of data points in each box plot is listed in Table S1.
Change in the carbon content between viral homologs of the GTA proteins and their closest GTA ancestral node
| GTA | Virus name | Change in the no. of carbons per side chain of an amino acid | Alignment | |
|---|---|---|---|---|
| +0.605 | <0.001 | 193 | ||
| +0.394 | 0.001 | 147 | ||
| +0.167 | 0.179 | 147 | ||
| +0.222 | 0.048 | 147 | ||
| +0.229 | 0.042 | 147 | ||
| +0.184 | 0.115 | 147 | ||
| +0.244 | 0.083 | 147 | ||
| +0.191 | 0.107 | 147 | ||
| +0.105 | 0.271 | 123 | ||
| +0.123 | 0.081 | 228 | ||
| +0.048 | 0.226 | 304 | ||
| +0.027 | 0.383 | 304 | ||
| +0.005 | 0.447 | 304 | ||
| +0.019 | 0.388 | 304 | ||
| +0.191 | 0.108 | 166 | ||
| +0.147 | <0.001 | 1,369 | ||
| +0.143 | <0.001 | 1,369 | ||
| Cumulative across 7 genes | +0.163 | <0.001 | 2,530 | |
FIG 3The number of carbons (A) and number of high-energy phosphates (B) in proteins encoded by all protein-coding genes in 212 genomes, by highly expressed genes, and by GTA genes. Box plots represent median values that are bounded by the first and third quartiles. Whiskers show the values that lie in the range of the 1.5 × interquartile rule. Dots outside the whiskers represent outliers.
FIG 4Carbon content of GTA proteins for four orders of the class Alphaproteobacteria. For each GTA protein, the heat map visualizes the number of carbons per side chain in amino acid averaged across taxonomic order. The numbers are shown either as raw values (A) or as values normalized by the carbon content of proteins encoded by 26 single-copy genes (B). The asterisks mark GTA proteins with significantly lower numbers of carbons per amino acid in the Sphingomonadales order than in the other three orders combined (α of 0.01; Mann-Whitney U test, all P values < 0.01). Box plots summarize the distribution of carbon content within each alphaproteobacterial order averaged across the examined GTA genes. Median values are bounded by the first and third quartiles. Whiskers show the values that lie in the range of the 1.5 × interquartile rule, and dots outside the whiskers represent the outliers. The phylogenetic tree is the reference alphaproteobacterial phylogeny (see Materials and Methods for details), in which branches are collapsed at the taxonomic rank of order. Numbers at the tree nodes represent bootstrap support values. Scale bar, number of substitutions per site.
Contribution of positively selected sites to the reduction of carbon utilization in GTA proteins of Sphingomonadales
| GTA | No. of sites | Avg change in no. of | No. of sites that |
|---|---|---|---|
| g2 | 13 | −0.22 | 6 |
| g3 | 33 | −0.72 | 22 |
| g4 | 29 | −0.42 | 13 |
| g5 | 12 | −0.39 | 8 |
| g6 | 11 | +0.16 | 5 |
| g9 | 29 | −0.68 | 16 |
| g12 | 23 | −0.52 | 13 |
| g13 | 31 | −0.44 | 16 |
| g15 | 27 | −0.55 | 15 |