| Literature DB >> 22436996 |
Abstract
Horizontal gene transfer (HGT), the movement of genetic material from one species to another, is a common phenomenon in prokaryotic evolution. Although the rate of HGT is known to vary among genes, our understanding of the cause of this variation, currently summarized by two rules, is far from complete. The first rule states that informational genes, which are involved in DNA replication, transcription, and translation, have lower transferabilities than operational genes. The second rule asserts that protein interactivity negatively impacts gene transferability. Here, we hypothesize that high expression hampers HGT, because the fitness cost of an HGT to the recipient, arising from the 1) energy expenditure in transcription and translation, 2) cytotoxic protein misfolding, 3) reduction in cellular translational efficiency, 4) detrimental protein misinteraction, and 5) disturbance of the optimal protein concentration or cell physiology, increases with the expression level of the transferred gene. To test this hypothesis, we examined laboratory and natural HGTs to Escherichia coli. We observed lower transferabilities of more highly expressed genes, even after controlling the confounding factors from the two established rules and the genic GC content. Furthermore, expression level predicts gene transferability better than all other factors examined. We also confirmed the significant negative impact of gene expression on the rate of HGTs to 127 of 133 genomes of eubacteria and archaebacteria. Together, these findings establish the gene expression level as a major determinant of horizontal gene transferability. They also suggest that most successful HGTs are initially slightly deleterious, fixed because of their negligibly low costs rather than high benefits to the recipient.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22436996 PMCID: PMC3342876 DOI: 10.1093/gbe/evs030
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FRecently transferred genes to Escherichia coli have lower expressions than the resident genes in E. coli. (A) A phylogeny of E. coli K12 and related strains and species that was used to identify the HGTs (Lercher and Pal 2008) analyzed here. Only those HGTs that occurred in branches 1–4 are considered recent HGTs to E. coli K12. All other genes in E. coli K12 are considered resident genes. The shaded clade is the family Enterobacteriaceae mentioned in Materials and Methods. (B) Microarray expression levels of horizontally acquired genes are lower than those of resident genes in E. coli. The numbers of genes analyzed are indicated inside bars. (C) Percentile ranks of microarray expression levels of horizontally acquired genes and resident genes in E. coli. Percentile ranks range from 0 for the gene with the lowest expression to 100 for the gene with the highest expression. (D) Percentile ranks of CAIs of horizontally acquired genes and resident genes in E. coli, calculated using the codon usage pattern of E. coli. Percentile ranks range from 0 for the gene with the lowest CAI to 100 for the gene with the highest CAI. (E) Percentile ranks of CAIs of transferred genes, calculated using the codon usage patterns of their respective likely donors. The percentile rank of a transferred gene ranges from 0 for the lowest CAI to 100 for the highest CAI in its donor genome. In (B–E), median values are presented, with the error bars indicating 25% and 75% quartiles. In (B), the P value is from the Mann–Whitney U test. In (C–E), P values show the probabilities that the median percentile ranks are lower than 50 (or higher than 50 for P*), determined by bootstrapping the genes 10,000 times.
FNontransferable genes have higher expressions than transferable genes in laboratory HGTs to Escherichia coli. (A) Median microarray expression levels of transferable and nontransferable genes in donor species. The numbers of genes used are indicated inside the bars. (B) Median CAIs of transferable and nontransferable genes in donor species. (C) Median CAIs of transferable and nontransferable genes in donor species when informational genes are separated from operational genes. (D) Median CAI percentile ranks of transferable and nontransferable genes from all 14 donor species. The CAI percentile rank of a gene is based on the rank of its CAI relative to those of all genes in the same donor genome. In all panels, error bars show 25% and 75% quartiles in the sample. All P values are from the Mann–Whitney U test. In (B) and (C), *P < 0.05, **P < 0.01, and ***P < 0.001.
Relative Contributions of Protein Function (Informational vs. Operational), Complexity (Number of Protein Interaction Partners), GC%, and Expression Level on Gene Transferability in Natural HGTs to Escherichia coli
| Factors Considered | Rank Correlations with Gene Transferability | Multiple Linear Regression | ||||
| Correlation | Partial Correlation | RCVE | ||||
| Expression level | −0.283 (−0.499) | 0.0001 (0.0001) | −0.195 | 0.0001 | 0.337 | 0.0001 |
| Number of protein interactions | −0.250 (−0.432) | 0.0001 (0.0001) | −0.148 | 0.0001 | 0.191 | 0.0001 |
| Informational/operational | −0.001 (−0.001) | 0.9976 (0.9999) | 0.014 | 0.5280 | 0.002 | 0.5281 |
| GC% | −0.147 (−0.259) | 0.0001 (0.0001) | −0.055 | 0.0120 | 0.026 | 0.0121 |
| Total | 0.105 | |||||
Recently acquired genes by HGT have a score of 1, and resident genes have a score of 0.
The regression is transferability = a(expression level) + b(number of protein interaction partners) + c(informational/operational score) + d(GC%) + e.
Partial correlation between transferability and the focal factor, after the simultaneous controls of the other three factors.
Relative contribution of the focal factor to the total variance explained by the linear model. For details, see main text.
Probability that the null hypothesis of no contribution of the factor to transferability is correct.
Rank–Biserial correlation coefficient.
Informational genes have a score of 1, and operational genes have a score of 0.
Phi correlation coefficient.
Variance of gene transferability explained by the linear model.
FRecently horizontally acquired genes have lower expressions than resident genes in most recipient species. (A) Median CAI percentile ranks of horizontally acquired genes in 133 recipient species examined. The percentile rank of a horizontally acquired gene relative to all other genes in the recipient genome ranges from 0 for the lowest CAI to 100 for the highest CAI. All are significantly different from 50, except those indicated with “#.” Statistical significance was determined by bootstrapping the genes 10,000 times. Class names are indicated, with the numbers of genomes examined shown in parentheses. N.D.*: not defined by taxonomic classes. Underlined class names indicate archaebacteria, while the rest belong to eubacteria. Information about the individual genomes analyzed here is provided in supplementary table S1 (Supplementary Material online). (B) Median expression levels of horizontally acquired genes (light gray) and resident genes (dark gray) in recipient species with publicly available microarray gene expression data. Error bars show 25% and 75% quartiles. The microarray data of the eight species came from different sources and the gene expression levels of different species are not comparable. P values are from the Mann–Whitney U test. Arrows connect the same genomes in the two panels.