| Literature DB >> 27734920 |
Evgeni Bolotin1, Ruth Hershberg1.
Abstract
Gene loss is a major contributor to the evolution of bacterial gene content. Gene loss may occur as a result of shifts in environment leading to changes in the intensity and/or directionality of selection applied for the maintenance of specific genes. Gene loss may also occur in a more neutral manner, when gene functions are lost that were not subject to strong selection to be maintained, irrespective of changes to environment. Here, we used a pangenome-based approach to investigate patterns of gene loss across 15 bacterial species. We demonstrate that gene loss tends to occur mostly within a pool of genes that are less constrained within species, even in those strains from which they are not lost, and less conserved across bacterial species. Our results indicate that shifts in selection, resulting from shifts in environment are not required to explain the majority of gene loss events occurring within a diverse collection of bacterial species. Caution should therefore be taken when attributing differences in gene content to differences in environment.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27734920 PMCID: PMC5062063 DOI: 10.1038/srep35168
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Pangenome data summary of the analyzed species.
| Species Name | Analyzed genomes | Core pangenes | Near core pangenes | Single loss pangenes | Multiple loss pangenes |
|---|---|---|---|---|---|
| 13 | 3165 | 630 | 512 | 118 | |
| 10 | 2270 | 591 | 479 | 112 | |
| 13 | 1630 | 145 | 119 | 26 | |
| 13 | 1012 | 360 | 322 | 38 | |
| 15 | 1471 | 415 | 298 | 117 | |
| 60 | 1456 | 1625 | 639 | 986 | |
| 12 | 1010 | 186 | 130 | 56 | |
| 10 | 1026 | 354 | 300 | 54 | |
| 26 | 2174 | 357 | 296 | 61 | |
| 20 | 2665 | 746 | 527 | 219 | |
| 14 | 1270 | 199 | 133 | 66 | |
| 13 | 4332 | 779 | 652 | 127 | |
| 21 | 1202 | 292 | 188 | 104 | |
| 17 | 1056 | 473 | 357 | 116 | |
| 12 | 2396 | 639 | 507 | 132 |
Figure 1Gene loss occurs mostly within a pool of less conserved and constrained pangenes.
Depicted are boxplots representing levels of conservation (A), constraint (B), or codon bias (C) applied on core pangenes (red), single-loss pangenes (green) and multiple loss pangenes (blue). Whisker length for each boxplot represent 1.5 IQR. Statistical significance of differences between the gene-loss groups according to a non-paired, one-sided Mann-Whitney-Wilcoxon test is denoted by: (***) for P ≤ 0.001, (**) for P ≤ 0.01, (*) for P ≤ 0.05, and (ns) for P > 0.05. (A) Conservation-the number of additional species in which pangenes are found. (the Y-axis is presented in logarithmic scale). Genes lost multiple times tend to be conserved in less species than genes lost once, which are in turn less conserved than core genes that were never lost. (B) constraint within the species the pangenes belong to, as measured using the dN/dS metric. Lower dN/dS values indicate higher constraint. Multiple loss pangenes tend to be less constrained within their species than single loss genes that are in turn less constrained than genes that were never lost. (C) Codon Bias, as measured using the ENC’ metric. Lower ENC’ values indicate higher codon usage bias. Core genes that were never lost tend to be more biased in their codon usage than genes that were lost once. Genes that were lost once tend to be more codon biased than genes lost multiple times.