Literature DB >> 24586046

Differential codon adaptation between dsDNA and ssDNA phages in Escherichia coli.

Shivapriya Chithambaram1, Ramanandan Prabhakaran1, Xuhua Xia2.   

Abstract

Because phages use their host translation machinery, their codon usage should evolve toward that of highly expressed host genes. We used two indices to measure codon adaptation of phages to their host, rRSCU (the correlation in relative synonymous codon usage [RSCU] between phages and their host) and Codon Adaptation Index (CAI) computed with highly expressed host genes as the reference set (because phage translation depends on host translation machinery). These indices used for this purpose are appropriate only when hosts exhibit little mutation bias, so only phages parasitizing Escherichia coli were included in the analysis. For double-stranded DNA (dsDNA) phages, both r(RSCU) and CAI decrease with increasing number of transfer RNA genes encoded by the phage genome. r(RSCU) is greater for dsDNA phages than for single-stranded DNA (ssDNA) phages, and the low r(RSCU) values are mainly due to poor concordance in RSCU values for Y-ending codons between ssDNA phages and the E. coli host, consistent with the predicted effect of C→T mutation bias in the ssDNA phages. Strong C→T mutation bias would improve codon adaptation in codon families (e.g., Gly) where U-ending codons are favored over C-ending codons ("U-friendly" codon families) by highly expressed host genes but decrease codon adaptation in other codon families where highly expressed host genes favor C-ending codons against U-ending codons ("U-hostile" codon families). It is remarkable that ssDNA phages with increasing C→T mutation bias also increased the usage of codons in the "U-friendly" codon families, thereby achieving CAI values almost as large as those of dsDNA phages. This represents a new type of codon adaptation.
© The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  Escherichia coli; bacteriophage; codon adaptation; deamination; mutation bias; phage-host coevolution

Mesh:

Substances:

Year:  2014        PMID: 24586046      PMCID: PMC4032129          DOI: 10.1093/molbev/msu087

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


Introduction

Efficient production of proteins is essential for survival and reproduction and strongly affects the fitness of a genotype, especially in unicellular organisms and viruses where rapid replication is essential for propagating the genotype into future generations. Efficient translation depends on the efficiency of the three subprocesses of translation, that is, initiation, elongation, and termination. Codon–anticodon adaptation directly impacts elongation efficiency. Ever since the empirical documentation of the correlation between codon usage and transfer RNA (tRNA) abundance (Ikemura 1981), codon–anticodon adaptation has been well documented in bacterial and fungal genomes (Ikemura 1981, 1992; Gouy and Gautier 1982; Xia 1998) as well as in mitochondrial genomes in vertebrates (Xia 2005; Xia et al. 2007) and fungi (Carullo and Xia 2008; Xia 2008). In short, differential tRNA availability almost invariably leads to biased codon usage, with most frequently used codons corresponding to the most abundant tRNA species. Optimizing codon usage according to host codon usage has been shown to increase the production of viral proteins (Haas et al. 1996; Ngumbela et al. 2008) or transgenic genes (Hernan et al. 1992; Kleber-Janke and Becker 2000; Koresawa et al. 2000). Studies on codon–anticodon adaptation have progressed in theoretical elaboration (Bulmer 1987, 1991; Xia 1998, 2008; Higgs and Ran 2008; Jia and Higgs 2008; Palidwor et al. 2010), in critical tests of alternative theoretical predictions (Xia 1996, 2005; Carullo and Xia 2008; van Weringh et al. 2011), and in formulation and implementation of codon bias indices such as relative synonymous codon usage (RSCU, Sharp et al. 1986), effective number of codons (Nc, Wright 1990; Sun et al. 2013), and Codon Adaptation Index (CAI, Sharp and Li 1987; Xia 2007). Although a recent study has questioned the relationship between codon usage and protein production (Kudla et al. 2009), its conclusion has been found to be unwarranted (Tuller et al. 2010). Bacteriophage needs to have efficient translation to survive among alternative phage genotypes. Because phages depend mainly on the translation machinery of their host for protein translation, their codon adaptation is shaped by mutation and selection of the host tRNA pool (Grosjean et al. 1978; Gouy 1987; Kunisawa et al. 1998; Sahu et al. 2005; Carbone 2008; Lucks et al. 2008). Although some studies have suggested that extrinsic factors such as temperature (Sau and Deb 2009) and host diversity (Sau et al. 2007) may also affect phage codon usage, such factors should act indirectly through mutation and selection. To study factors contributing to phage codon adaptation, we first use two codon usage indices, rRSCU (correlation of RSCU values between the host and the phage) and CAI, to measure phage codon adaptation. As explained in the next section, these indices are appropriate measures of phage codon adaptation when the host exhibits little nucleotide bias indicating little mutation bias. We then derive testable predictions on factors that contribute to phage codon adaptation.

Two Codon Usage Indices to Measure Phage Codon Adaptation

Assuming that the codon usage of highly expressed host genes are well adapted to their own translation machinery, we expect the phage genes to evolve a codon usage pattern similar to that of highly expressed host genes (Sharp et al. 1984). This suggests that concordance in codon usage between the host and the phage may be used as a proxy of phage codon adaptation. A simple measure of such concordance could be the correlation between host RSCU and phage RSCU, referred to hereafter as rRSCU. rRSCU as a measure of phage codon adaptation has two problems. First, it can be increased not only by selection for codon adaptation but also by biased mutation. For example, strongly AT-biased mutations shared by both the host and the phage will lead to a high rRSCU. Such a high rRSCU cannot be equated to a high degree of codon adaptation because adaptation, by definition, arises in response to selection. There is, however, one special case where rRSCU can be reasonably used as a proxy of phage codon adaptation and that is when we study phages parasitizing the same host and when the host has roughly equal nucleotide frequencies indicating unbiased mutations. Escherichia coli is approximately such a host species. Its genomic nucleotide frequencies are roughly equal, being 0.2462, 0.2541, 0.2537, and 0.2460 for nucleotides A, C, G and T, respectively. This indicates that mutations in E. coli do not lead to strong codon usage bias, in contrast to AT-biased or GC-biased mutations in many other bacterial species that can cause strong codon usage bias without any selection (Muto and Osawa 1987). Increasing the rate of unbiased mutations will lead to more randomized RSCU values and smaller rRSCU values. The benefit of using a host with equal genomic nucleotide frequencies (presumably resulting from unbiased mutation) is that the effect of tRNA-mediated selection is often unequivocally detectable. Table 1 illustrates E. coli codon usage of four codon families in which tRNA-mediated selection favors A-, G-, C-, and U-ending codons, respectively. The most frequently used codon in each codon family matches the tRNA species with the highest gene copy numbers (table 1). For example, there are four tRNAGlu/UUC genes forming Watson–Crick base pair with Glu codon GAA but no tRNAGlu/CUC. As tRNA gene copy number is well correlated with experimentally measured tRNA abundance (Percudani et al. 1997), tRNA-mediated selection therefore should favor GAA, which is true (table 1). What is remarkable is that this association between major codon and tRNA abundance is visible when tRNA-mediated selection favors A-, G-, C-, and U-ending codons, respectively (table 1). If the E. coli genome had experienced strong AT-biased mutation, then tRNA-mediated selection for C-ending or G-ending codons may be invisible (i.e., A-ending and T-ending codons may still be the most frequently observed in spite of tRNA-mediated selection favoring C-ending and G-ending codons when AT-biased mutation dominates over the tRNA-mediated selection). For this reason, phages studied here are all E. coli phages.
Table 1.

The Effect of tRNA-Mediated Selection in Escherichia coli, Whose Genomic Sequence Has Equal Nucleotide Frequencies, Presumably Resulting from Little Mutation Bias.

AACodonNatRNAbCF
GluGAA4,6834A-ending
GAG1,4590
PheUUC2,2292C-ending
UUU8720
Leu4cCUA541
CUG5,6984G-ending
CUC5411
CUU3570
Arg4cCGA340
CGG331
CGC1,5300
CGU2,9953U-ending

Note.—CF, codon favored by tRNA.

aNumber of codons in highly expressed E. coli genes compiled in the EMBOSS package (Rice et al. 2000).

bNumber of E. coli tRNA genes with anticodon forming Watson–Crick pairing with the associated codon. Nucleotide A at the first anticodon position is mostly modified to inosine.

cLeu and Arg are coded by a four-codon subfamily and a two-codon subfamily. Leu4 and Arg4 refer to their respective four-codon subfamily.

The Effect of tRNA-Mediated Selection in Escherichia coli, Whose Genomic Sequence Has Equal Nucleotide Frequencies, Presumably Resulting from Little Mutation Bias. Note.—CF, codon favored by tRNA. aNumber of codons in highly expressed E. coli genes compiled in the EMBOSS package (Rice et al. 2000). bNumber of E. coli tRNA genes with anticodon forming Watson–Crick pairing with the associated codon. Nucleotide A at the first anticodon position is mostly modified to inosine. cLeu and Arg are coded by a four-codon subfamily and a two-codon subfamily. Leu4 and Arg4 refer to their respective four-codon subfamily. The second problem with rRSCU is that it does not capture all aspects of codon adaptation. This is illustrated in table 2, which shows fictitious codon count and RSCU of highly expressed host genes and two phage genes (PG1 and PG2). RSCU values for codons in PG1 and PG2 are exactly the same, so rRSCU for PG1 and PG2 will also be the same. However, PG2 is expected to be translated more efficiently than PG1 for the following reason. We notice that highly expressed host genes strongly avoid UUU in the Phe codon family (table 2), suggesting that UUU cannot be translated efficiently by the host translation machinery. Given this, PG2 as a whole should be translated faster than PG1 because PG2 has only 90 “bad” UUU codons, whereas PG1 has 180 “bad” UUU codons. In this case, the Gly codon family is “U-friendly” because an increased number of U-ending codons will in fact improve translation. In contrast, the Phe codon family is “U-hostile” because increasing the number of U-ending codons will reduce translation efficiency. A single-stranded DNA (ssDNA) phage that cannot avoid high C→T mutations can nonetheless evolve codon adaptation by reducing the usage of codons in U-hostile codon families and increase the usage of codons in U-friendly codon families as PG2 does (table 2). This kind of adaptation is invisible to rRSCU but can be detected by CAI. We use the mean CAI value, computed from all genes in a phage genome with highly expressed host genes as a reference set, as an alternative measure of phage codon adaptation. The reason for using highly expressed host genes is that phage translation depends on host translation machinery, that is, efficient translation elongation of phage mRNA depends on whether the phage mRNA would overuse codons preferred by highly expressed host genes.
Table 2.

Fictitious Codon Usage for Highly Expressed Host Genes (HOST) and Two Phage Genes (PG1 and PG2).

AACodonCount
RSCU
HOSTPG1PG2HOSTPG1PG2
GlyGGA40050750.888911
GGG30030450.66670.60.6
GGC10020300.22220.40.4
GGU1,0001001502.222222
PheUUC2,00020101.81820.20.2
UUU200180900.18181.81.8

Note.—rRSCU between HOST and PG1 is identical to that between HOST and PG2, but PG2 will have higher CAI than PG1 when CAI is computed with HOST as the reference set of genes.

Fictitious Codon Usage for Highly Expressed Host Genes (HOST) and Two Phage Genes (PG1 and PG2). Note.—rRSCU between HOST and PG1 is identical to that between HOST and PG2, but PG2 will have higher CAI than PG1 when CAI is computed with HOST as the reference set of genes. Phages are essentially a mosaic of genes sampled from a pool of frolicking phage genomes. For example, although many related tailed phages have nearly identical genome organization such as “DNA packaging-head-tail-tail fiber-lysis-lysogeny-DNA replication-transcription regulation” (Desiere et al. 2001), essentially any function in a phage can be fulfilled by one of many distinct genes with homologous function but little sequence similarity (Brussow and Kutter 2005). In other words, horizontal gene transfer is rampant in phage, so that individual genes in each phage could differ dramatically in evolutionary history and different codon usage. Consequently, a mean/median CAI may not be representative of all genes in a phage genome. For this reason, we have added standard deviation of CAI values in the supplementary files S1-S3, Supplementary Material online, to show that the among-gene difference in CAI is actually quite small.

Effect of Phage-Encoded tRNA Genes on Phage Codon Usage

Some phage genomes are long known to encode tRNA genes (Chattopadhyay and Ghosh 1988; Mandal and Ghosh 1988), for example, Enterobacteria phage WV8 carries 20 tRNA genes on its genome. Phage-encoded tRNAs tend to have anticodons decoding codons overused in the phage genes but rarely used in host genes (Kunisawa 1992, 2000; Bailly-Bechet et al. 2007; Enav et al. 2012). Such phage-encoded tRNAs would alter host tRNA pool, render the phage less dependent on the host tRNAs, and reduce the need (selection pressure) for the phage genes to evolve toward a codon usage pattern similar to that of the host genes. In other words, such tRNA genes would tend to reduce rRSCU and CAI and need to be taken into consideration in studying phage codon adaptation, especially in characterizing the difference between double-stranded DNA (dsDNA) and ssDNA phages because the latter do not encode tRNA genes in their genomes.

Effect of C→T Mutation Bias on Codon Usage of ssDNA Phages

Mutation rate differs much between ssDNA and dsDNA phages. Although dsDNA is well protected against mutation agents, ssDNA is subject to a high rate of DNA decay, especially spontaneous deamination leading to C→T mutations, the rate of which is about 100 times higher in ssDNA than in dsDNA (Frederico et al. 1990). Oxidative deamination leading to high C→U/T transitional mutation rates has been reported in ssDNA phage M13 (Kreutzer and Essigmann 1998). The high mutation rate of ssDNA phages relative to dsDNA phages impact strongly on genomic GC content (Xia and Yuen 2005) and codon usage bias (Cardinale and Duffy 2011). For this reason, one would predict that, given the same tRNA-mediated selection for codon usage bias, dsDNA phages would achieve better codon adaptation than ssDNA phages.

Coevolution Time and Maximum rRSCU

We have predicted that tRNA-mediated selection will increase rRSCU and that increased mutation rate will decrease rRSCU in E. coli phage. However, testing these predictions is confounded by coevolution time between phages and their host. Suppose a group of phages, given sufficient coevolution time with E. coli, would reach a maximum rRSCU. When we sample these phage lineages, some may have coevolved sufficiently long to have reached the maximum rRSCU, whereas others may be far from reaching the maximum because they may have invaded E. coli only recently. Thus, both dsDNA and ssDNA phages may have some of their members with low rRSCU values, but we predict that the maximum rRSCU value should be much greater for dsDNA phages than for ssDNA phages. In short, we predict that 1) for dsDNA phages, rRSCU should decrease with the number of tRNA genes encoded by the phage genome, with phage-encoded tRNAs likely decoding codons overused by phage mRNAs but rarely used by host mRNAs, 2) rRSCU should be greater for dsDNA phages than ssDNA phages when the effect of phage-encoded tRNA genes has been taken into consideration, and maximum rRSCU should in particular be much greater for dsDNA phages than for ssDNA phages, and 3) ssDNA phages with a strong C→T mutation bias may evolve to increase the usage of codons in U-friendly codon families and reduce the usage of codons in U-hostile codon families. We report results confirming these predictions.

Results

Twenty-two dsDNA phage species encode tRNA genes in their genomes (13 from Myoviridae, 4 from Podoviridae, and 5 from Siphoviridae; supplementary file S1, Supplementary Material online), whereas none of the ssDNA phage genomes carry tRNA genes. Before making comparisons in codon usage between dsDNA and ssDNA phages, it is important to test if phage-encoded tRNA genes can affect codon usage. The presence of an effect implies that the fair comparison should only be carried out between ssDNA phages and those dsDNA phages that do not carry tRNA genes.

Effect of Phage-Encoded tRNA on Codon Adaptation in dsDNA Phage

We have reasoned before that phage-encoded tRNA genes may reduce rRSCU, especially if these tRNAs tend to decode codons overused in the phage genes but underused in host genes. There is indeed a highly significant (P < 0.0001) negative relationship between rRSCU and the number of tRNA genes encoded in the phage genome (fig. 1). The use of an exponential decay to fit the negative relationship is based on the rationale that, if the number of tRNA genes in the phage approaches infinity, then the codon usage of the phage would approach complete independence of the host tRNA pool, with rRSCU approaching zero. A significant (P = 0.0260) negative relationship is also observed between CAI and the number of tRNA genes encoded in the phage genome.
F

Codon adaptation of the phage genes, measured by rRSCU, decreases with increasing number of tRNA genes encoded in phage genomes.

Codon adaptation of the phage genes, measured by rRSCU, decreases with increasing number of tRNA genes encoded in phage genomes. What tRNA genes would benefit dsDNA phages that carry them? Translation of codons that are overused in phage genes but decoded by few host tRNAs would benefit from having extra cognate tRNAs from the phage genomes. Take R-ending codon, for example (where R stands for purine). If the host tRNA pool favors G-ending codon, but A-ending codon is overused by phage genes, then it is beneficial for the phage to carry tRNA genes with a wobble U to decode the overused A-ending codons. Similarly, if the host has few tRNAs decoding G-ending codons and uses few G-ending codons, but the phage uses many more G-ending codons, then it would be beneficial for phage tRNAs to have a wobble C to decode its relatively more frequently used G-ending codons. Three general rules can be derived from the results in table 3, which shows the R-ending codon usage of highly expressed E. coli genes and two dsDNA phages each carrying a set of tRNA genes. First, if phage codon usage bias is the same as that of E. coli (e.g., GAR, AAR, and AGR codons for amino acids E, K, and R, respectively), then the phage-encoded tRNAs will decode the most frequently used codon. Second, if phage codon usage bias is opposite to that of the host (e.g., GGR, UUR, CCR, and UCR codons for amino acids G, L, P, and S, respectively), then the phage-encoded tRNAs will decode the codon overused in the phage but underused in the host. Third, if phage genes use the two R-ending codons roughly equally (e.g., CAR codons for amino acid Q), then the phage may carry tRNAs for both codons. Although only two phage species are included in table 3, the three rules are shared among other phage species with phage-encoded tRNAs.
Table 3.

Number of A- or G-Ending Codons (Ncod), RSCU, and Number of tRNA Genes (NtRNA) for Escherichia coli and Two Phage Species (WV8 and bV_EcoS_AKFV33).

AACodonE. colia
WV8
bV_EcoS_AKFV33
NcodRSCUNtRNANcodRSCUNtRNANcodRSCUNtRNA
EGAA4,6831.52541,1251.25911,4891.3651
EGAG1,4590.4756620.7416920.635
GGGA1180.06812450.5841
GGGG2670.15411500.357
KAAA4,1291.59551,2621.19511,5511.3641
KAAG1,0500.4068510.80517230.6361
LCUA540.03312330.74515441.3351
LCUG5,6983.42733181.0174331.063
LUUA2100.77417181.4531
LUUG3331.22712700.547
PCCA4740.56414082.03214281.5581
PCCG2,5092.9831620.3091540.561
QCAA5500.35524811.05815931.061
QCAG2,5481.64524280.94215260.941
RAGA211.23584381.58113171.4611
RAGG130.76511160.4191170.539
SUCA1890.26114981.641
SUCG2750.3801380.125
TACA1810.16014471.0021
TACG5260.46511640.368
VGUA1,3290.80557651.5081
VGUG1,7841.0802310.455

Note.—See text for reasons of including only R-ending codons.

aFrom highly expressed E. coli genes, as compiled in the EMBOSS distribution (Rice et al. 2000).

Number of A- or G-Ending Codons (Ncod), RSCU, and Number of tRNA Genes (NtRNA) for Escherichia coli and Two Phage Species (WV8 and bV_EcoS_AKFV33). Note.—See text for reasons of including only R-ending codons. aFrom highly expressed E. coli genes, as compiled in the EMBOSS distribution (Rice et al. 2000). The three rules are generally consistent with the interpretation that phage-encoded tRNAs facilitate translation of phage mRNAs. Similar findings, but less complete, have also been reported in previous studies on T4-like phages (Kunisawa 1992; Bailly-Bechet et al. 2007; Enav et al. 2012). They are also consistent with previous experiments in which alteration of E. coli tRNA pool is associated with changed translation efficiency of transgenes (Kleber-Janke and Becker 2000). One may note that table 3 includes only R-ending codons. Can we extend the pattern to Y-ending codons (where Y stands for pyrimidine)? Suppose that the host overuses C-ending codons, with many tRNAs with a wobble G, but the phage overuses U-ending codons. Should we not predict that phage genomes should encode tRNAs with a wobble A to decode its overused U-ending codons? However, this prediction cannot be tested because a tRNA with wobble A would interfere with translation. That is, once such a tRNA is in the P-site, it interferes with the tRNA at the A-site (Lim 1994). Thus, Y-ending codons are decoded by either tRNAs with a wobble G or tRNA with a wobble A-derived inosine. This was overlooked in a previous study on tRNAs encoded in bacteriophage T4 (Kunisawa 1992).

Difference in rRSCU between dsDNA and ssDNA Phages

Given the significant effect of phage-encoded tRNA on rRSCU (fig. 1 and table 3), all phage genomes with encoded tRNA genes were excluded in all comparisons between dsDNA phages and ssDNA phages because none of the ssDNA phage genomes encode tRNA genes. This leaves 38 dsDNA phages and 11 ssDNA phages for further comparisons in rRSCU. rRSCU is significantly greater for dsDNA phages than for ssDNA phages (0.5917 for the former and 0.3273 for the latter, t = 3.6533, DF = 47, P = 0.0008, table 4). To test if it is the C→T-biased mutation that is chiefly responsible for the reduced rRSCU values for the ssDNA phages, we computed the rRSCU values separately for the R-ending codons and Y-ending codons (table 5). The rRSCU values for the R-ending codons (rRSCU.R) are significantly greater than those for the Y-ending codons (rRSCU.Y), with the mean being 0.5217 for rRSCU.R and 0.1074 for rRSCU.Y (table 5). The difference is highly significant (paired-sample t-test: t = 17.2872, DF = 10, P < 0.0001), assuming data independence.
Table 4.

Mean and Distribution of rRSCU Values for Various dsDNA and ssDNA Phage Families.

TypePhage FamilynMinimumMaximumAverageSD
dsDNAMyoviridae90.34370.92070.69530.2359
Podoviridae120.25530.80340.42160.1859
Siphoviridae160.24120.89550.66000.2355
Tectiviridae10.60840.60840.6084NA
ssDNAInoviridae40.27000.39220.34490.0524
Microviridae70.27570.37090.31730.0409

Note.—NA, not applicable.

Table 5.

Contrasting rRSCU Values for R-Ending Codons and for Y-Ending Codons (designated by rRSCU.R and rRSCU.Y, respectively).

FamilyACCNrRSCU.RrRSCU.Y
MicroviridaeNC_0013300.65040.0854
MicroviridaeNC_0014200.45300.0332
MicroviridaeNC_0078560.46520.0447
MicroviridaeNC_0078170.41680.0200
MicroviridaeNC_0014220.44970.0843
MicroviridaeNC_0128680.60090.1118
MicroviridaeNC_0078210.60300.1158
InoviridaeNC_0013320.54750.1709
InoviridaeNC_0019540.47530.2154
InoviridaeNC_0020140.58920.2105
InoviridaeNC_0032870.48760.0894
Mean0.52170.1074
Mean and Distribution of rRSCU Values for Various dsDNA and ssDNA Phage Families. Note.—NA, not applicable. Contrasting rRSCU Values for R-Ending Codons and for Y-Ending Codons (designated by rRSCU.R and rRSCU.Y, respectively). Because some phages may not have enough time coevolving with their host, their rRSCU may not have reached the maximum possible. For example, if a dsDNA phage has recently switched to a host with a different codon usage pattern, then we would not expect it to have a high rRSCU value because codon adaptation takes time to evolve. However, given enough time, we expect dsDNA phages to reach a higher rRSCU than ssDNA phages whose mutation rate is higher than that of dsDNA phages. The mean and distribution of rRSCU values for the dsDNA and ssDNA phage (table 4) is consistent with this interpretation. The maximum rRSCU observed is only 0.3922 for ssDNA phages but 0.9207 for dsDNA phages (Enterobacteria phage Mu in Myoviridae). The mean and standard variation of rRSCU values for ssDNA phage is 0.3273 and 0.0450, respectively, so that the probability of having an rRSCU value as large as 0.5 is less than 0.0001 for ssDNA phages. When a phage species has a small rRSCU value, it could be due to weakened selection (e.g., the phage carries a large number of its own tRNA genes), strong mutation pressure disrupting codon adaptation, or insufficient coevolution time. Given that the three dsDNA phage families and the two ssDNA phage families all have multiple phage lineages parasitizing E. coli, we may assume that the phages should have coevolved with E. coli for sufficiently long time for codon adaptation to reach a mutation-selection equilibrium. Also, the comparison above between the dsDNA and ssDNA phages excluded phages with phage-encoded tRNA genes, so all these phages should have experienced roughly the same host tRNA-mediated selection. The most plausible explanation for the difference in rRSCU between the dsDNA and ssDNA phages is the higher mutation pressure in ssDNA phages that disrupt codon adaptation.

Effect of Life Cycle (Temperate vs. Virulent) on rRSCU in dsDNA Phages

dsDNA phages differ in their life cycles, some being temperate with a lysogenic phage and some are virulent with only lytic phase, although lysogenic phages can become lytic through mutations at lysogenic conversion genes (van Vliet et al. 1978; Brussow and Kutter 2005). Temperate phages are expected to have better concordance in codon usage with the host (i.e., higher rRSCU values) than lytic phages for two reasons. First, a prophage and its lysogen share the same mutation spectrum as the host DNA. Second, they have increased chance of recombining with or acquiring host genes or gene segments. For example, phage λ and phage µ carry a piece of host genome when they switch from the lysogenic phase to the lytic phase. The expectation is borne out by empirical data (table 6), with rRSCU significantly greater in temperate phages than in virulent phages with two-sample t-tests (DF = 7, t = 11.5914, P < 0.0001 for Myoviridae; DF = 9, t = 5.7328, P = 0.0003 for Podoviridae; DF = 12, t = 10.4545, P < 0.0001 for Siphoviridae). A two-way analysis of variance accounts for 91.24% of total variance in rRSCU, with rRSCU differing highly significantly between temperate and virulent phages (F = 280.9918, DFmodel = 1, DFerror = 28, P < 0.0001), significantly among the three dsDNA phage families (F = 5.095, DF = 2, P = 0.0130), but with no significant interaction (F = 0.2101, DF = 2, P = 0.81175).
Table 6.

Effect of Life Cycle of dsDNA Phages on Codon Usage Concordance between Phage and Host, Measured by rRSCU.

PhageFamPhageNameAccessionLifeCyclerRSCU
MyoviridaeEnterobacteria phage MuNC_000929Temperate0.9207
MyoviridaeEnterobacteria phage P2NC_001895Temperate0.9011
MyoviridaeEnterobacteria phage P4NC_001609Temperate0.8287
MyoviridaeEnterobacteria phage SfVNC_003444Temperate0.8750
MyoviridaeEscherichia phage D108NC_013594Temperate0.9207
MyoviridaeEnterobacteria phage JSENC_012740Virulent0.4789
MyoviridaeEnterobacteria phage Phi1NC_009821Virulent0.4971
MyoviridaeEnterobacteria phage phiEcoM-GJ1NC_010106Virulent0.3437
MyoviridaeEnterobacteria phage RB49NC_005066Virulent0.4917

PodoviridaeEscherichia phage phiV10NC_007804Temperate0.7308
PodoviridaeStx2 converting phage INC_003525Temperate0.8034
PodoviridaeEnterobacteria phage 13aNC_011045Virulent0.3181
PodoviridaeEnterobacteria phage EcoDS1NC_011042Virulent0.4021
PodoviridaeEnterobacteria phage K1-5NC_008152Virulent0.2629
PodoviridaeEnterobacteria phage K1ENC_007637Virulent0.2553
PodoviridaeEnterobacteria phage K1FNC_007456Virulent0.2553
PodoviridaeEnterobacteria phage N4NC_008720Virulent0.2661
PodoviridaeEnterobacteria phage T3NC_003298Virulent0.5306
PodoviridaeEnterobacteria phage T7NC_001604Virulent0.3274
PodoviridaeEnterobacteria phage BA14NC_011040Virulent0.4504

SiphoviridaeEnterobacteria phage BP-4795NC_004813Temperate0.8049
SiphoviridaeEnterobacteria phage cdtINC_009514Temperate0.8307
SiphoviridaeEnterobacteria phage HK022NC_002166Temperate0.7416
SiphoviridaeEnterobacteria phage HK97NC_002167Temperate0.7303
SiphoviridaeEnterobacteria phage lambdaNC_001416Temperate0.8520
SiphoviridaeEnterobacteria phage N15NC_001901Temperate0.8955
SiphoviridaeEscherichia Stx1 converting bacteriophageNC_004913Temperate0.8108
SiphoviridaeStx2-converting phage 1717NC_011357Temperate0.8335
SiphoviridaeEnterobacteria phage SSL-2009aNC_012223Temperate0.7853
SiphoviridaeEnterobacteria phage EPS7NC_010583Virulent0.2583
SiphoviridaeEnterobacteria phage JK06NC_007291Virulent0.2565
SiphoviridaeEnterobacteria phage RTPNC_007603Virulent0.2412
SiphoviridaeEnterobacteria phage T1NC_005833Virulent0.4637
SiphoviridaeEnterobacteria phage TLSNC_009540Virulent0.4734

Note.—The phages are organized by phage families (PhageFam) and then by life cycle (LifeCycle: temperate or virulent) within each phage family.

Effect of Life Cycle of dsDNA Phages on Codon Usage Concordance between Phage and Host, Measured by rRSCU. Note.—The phages are organized by phage families (PhageFam) and then by life cycle (LifeCycle: temperate or virulent) within each phage family.

A New Type of Codon Adaptation Mediated by C→T-Biased Mutation

Some ssDNA phages have strong C→T mutations as measured by SKEWTC defined as where NT and NC are the count of nucleotides T and C, respectively. SKEWTC is expected to increase with increased C→T mutation rate and result in overuse of U-ending codons. For example, Enterobacteria phage Ike (NC_002014, Inoviridae) has a SKEWTC value of 0.2893, with U-ending codons being the most frequent in all Y-ending or N-ending codon families. The effect of biased mutation on codon usage has also been shown for several other ssDNA phages (Cardinale and Duffy 2011). This bias in favor of U-ending codons interferes with codon adaptation because E. coli translation machinery does not favor U-ending codons in most codon families. Highly expressed E. coli genes, as compiled in the EMBOSS distribution (Rice et al. 2000) or in Ran and Higgs (2012), have U-ending codons being the most frequent in four codon families, that is, Gly, Arg4 (the CGN codon subfamily for Arg), Ser4 (the UCN codon subfamily for Ser), and Val. Take the Val (GUN) codon family, for example. The RSCU values for GUA, GUC, GUG, and GUU are 0.8047, 0.4989, 1.0802, and 1.6161, respectively, based on the EMBOSS distribution (Rice et al. 2000). Such a codon family is “U-friendly” because U-ending codons are preferred and C→T-biased mutation will consequently improve translation elongation. In contrast, the other codon families containing U-ending codons have C-ending codons more frequent than U-ending codons based on the highly expressed E. coli protein-coding genes. These codon families will be designated as U-hostile. T-biased mutation in ssDNA phages would enhance codon adaptation in the four U-friendly codon families but would go against codon adaptation in the U-hostile codon families. What can ssDNA phages do to increase their translation elongation efficiency in face of the C→T mutation? One obvious solution to the problem is illustrated in table 2 with codon frequencies of two codon families (Gly and Phe) from two fictitious phage genes (designated as PG1 and PG2, respectively) and from the host. We can infer U-friendliness of the host translation machinery based on codon usage of host genes. The Gly codon family is U-friendly, with the host machinery strongly preferring U-ending codons. The Phe codon family is U-hostile with host translation machinery strongly favoring C-ending codons (table 2). The total number of codons for the two genes is the same and equal to 400, and the RSCU for each codon is also identical for two genes (table 2). Thus, rRSCU between PG1 and host would be exactly the same as that between PG2 and host. However, we note that the PG2 could be translated more efficiently than PG1 because the former has only 90 “bad” UUU codons, whereas the latter has 180. This differential translation elongation efficiency is not reflected by RSCU but is by CAI. For example, with the data in table 2 and assuming no other codons except for those listed in table 2, we have CAI being 0.2577 for PG1 but 0.3686 for PG2 when host codon frequencies are used as the reference set. The example illustrated above suggested that E. coli ssDNA phages with strong C→T mutation bias can improve their translation elongation efficiency by overusing the codons in the four U-friendly codon families and decreasing the codons in the U-hostile codon families. This leads to the prediction that the summed frequencies of codons in the four U-friendly codon families, designated as F4, should increase with SKEWTC. That is, when U-ending codons are increased by U-biased mutations, these U-ending codons should be more concentrated in the four U-friendly codon families. This prediction is strongly supported by data from the 11 ssDNA E. coli phages (fig. 2), with the correlation between F4 and SKEWTC3 = 0.707 (P = 0.0151). Furthermore, F4 is significantly and positively correlated with mean CAI from the 11 ssDNA phages (r = 0.6595, P = 0.0273). The result in figure 2 is consistent with the interpretation that increased C→T mutation drives the increased use of codons in the four U-friendly codon families. Thus, although the ssDNA phages cannot fight against the C→T mutation, they have evolved to minimize the disruptive effect of this biased mutation on codon adaptation by coding more amino acids in the four U-friendly codon families.
F

Positive association between SKEWTC, defined as (NT – NC)/(NT + NC) where Ni is the number of nucleotide i in a phage genome, and F4, the percentage of codons in four codon families (Gly, Arg4, Ser4, and Val) in which highly expressed E. coli genes prefer U-ending codons against C-ending codons. Results are from 11 ssDNA E. coli phages. We noted that, because U-rich codons will increase, and C-rich codons decrease, with increasing C→T mutation bias, only Gly codon family should be used for testing the predicted positive correlation, which would lead to r = 0.6837 and P = 0.02036.

Positive association between SKEWTC, defined as (NT – NC)/(NT + NC) where Ni is the number of nucleotide i in a phage genome, and F4, the percentage of codons in four codon families (Gly, Arg4, Ser4, and Val) in which highly expressed E. coli genes prefer U-ending codons against C-ending codons. Results are from 11 ssDNA E. coli phages. We noted that, because U-rich codons will increase, and C-rich codons decrease, with increasing C→T mutation bias, only Gly codon family should be used for testing the predicted positive correlation, which would lead to r = 0.6837 and P = 0.02036. The usage of Ser codons for Enterobacteria phage Ike (NC_002014, Inoviridae) illustrates this special codon adaptation well. Ser is coded by the four-codon UCN and the two-codon AGY codon subfamilies. In the AGY codon subfamily, highly expressed E. coli genes prefer AGC against AGU, suggesting that AGU is a “bad” codon. C→T mutations will lead to many “bad” AGU codons if Ser is largely encoded by the AGY subfamily. In contrast, in the UCN subfamily, highly expressed E. coli genes strongly prefer UCU against other synonymous codons, suggesting that UCU is a “good” codon. C→T mutations will lead to many “good” UCU codons if Ser is largely encoded by the UCN subfamily. In this conceptual framework, it is easy to understand that 88.4% of Ser codons in Enterobacteria phage Ike belong to the UCN subfamily. Because of this adaptive trick, the mean CAI value for ssDNA phages is almost as large as that for dsDNA phages (0.4768 for dsDNA phages and 0.4743 for ssDNA phages, excluding the 22 phages with phage-encoded tRNA genes), with no statistically significant difference. The type of codon adaptation outlined earlier, that is, by switching codon usage from U-hostile codon families to U-friendly codon families, implies increased nonsynonymous substitution with increased C→T mutation. A simple way to check this is to test the change of UUC and CCN frequencies with increased C→T mutation rate. We used TC skew at the third codon position (SKEWTC3) to measure C→T mutation and checked how the frequencies of UUN and CCN codons would change SKEWTC3. The frequency of UUN codons increases (P = 0.0008, fig. 3) and that of CCN codons decreases (P = 0.0320, fig. 3), with increasing SKEWTC3, consistent with the expectation. However, the sharp increase in UUN codons and the relatively slow decrease in CCN codons (fig. 3) suggest that the increase in UUN codon is not entirely due to the decrease of CCN codons. Similar response of nonsynonymous mutation rate to directional mutation pressure has also been documented in several other studies (Sueoka 1961; Lobry 2004; Urbina et al. 2006).
F

UUN codons increases, and CCN codons decreases, with C→T mutation measured by TC skew at the third codon position (SKEWTC3), but at different extent.

UUN codons increases, and CCN codons decreases, with C→T mutation measured by TC skew at the third codon position (SKEWTC3), but at different extent. The results above suggest to us that our empirical test of the new type of codon adaptation in figure 2 is incorrect. For example, the Val codon family (coded by GUN) is U-friendly and its usage increases with C→T mutation bias, thus supporting the prediction from the hypothesized new type of codon adaptation. However, the increase may have nothing to do with codon adaptation but may be simply due to the increase of all U-containing codons and the decrease of C-containing codons with increasing C→T mutation bias. Thus, only codon families that do not contain C or U at the first and second codon positions are relevant to test the prediction of a positive association between the usage of U-friendly codon families and SKEWTC3. Among the U-friendly codon families, only the Gly codon family (coded by GGN) fulfills this criterion. The hypothesis is still supported as the percentage of Gly codons increased with SKEWTC3 (r = 0.6837, P = 0.0204).

Discussion

Studying codon adaptation in bacteriophage is important not only in understanding the biology of translation but also in practical applications. Several phages have been used to remove infectious biofilms (Azeredo and Sutherland 2008; Gladstone et al. 2012), to deliver vaccines (Clark and March 2004), or to treat human infections (Sau et al. 2005; Ranjan et al. 2007; Sau 2007; Skurnik et al. 2007; Goodridge 2010; Timms et al. 2010; Abedon et al. 2011), especially those caused by bacterial pathogens that have developed resistance to antibiotics. However, many of these phages do not have optimal codon usage for efficient replication. Studying codon adaptation in phages contributes to the theoretical foundation for re-engineering more efficient phages for therapeutic or industrial purposes (Skiena 2001). A database has been created to facilitate the study of phage codon adaptation to their hosts (Hilterbrand et al. 2012).

Phage-Encoded tRNA Affects Phage Codon Usage

We found that the number of tRNA genes carried by dsDNA phage genomes reduced the need for the phages to evolve a codon usage pattern similar to that of their hosts and that these phage-encoded tRNA facilitate the translation of overused phage codons, especially when the host provides few tRNAs for these phage codons (fig. 1 and table 3). Several viral species have been found to alter host tRNA pool to favor the translation of the viral genes. HIV-1 viruses selectively enrich rare host tRNAs to decode A-ending codons overused in HIV-1 genes but rarely used by host genes (van Weringh et al. 2011), and such selective enrichment has also been found in vaccinia and influenza A viruses (Pavon-Eternod et al. 2013). Translation efficiency is sensitive to the change of tRNA pool (Kleber-Janke and Becker 2000). A gain/loss of a tRNAMet/UAU gene has resulted significant change in AUA codon frequencies, in both bivalve mitochondria and tunicate mitochondria (Xia et al. 2007; Xia 2012). All these findings on the association of tRNA pool and codon usage suggest that translation efficiency of a target gene can not only be improved by optimizing the codon usage of the target gene but also by modifying the tRNA pool where the target gene is translated. This latter approach has the advantage over the former because the former sometimes will alter the structure of the mRNA leading to reduced translation initiation efficiency (Kudla et al. 2009). Phage-encoded tRNA genes provide phages with the opportunity to parasitize hosts with different codon usage and may therefore increase their host diversity (Sau et al. 2007). However, existing data do not allow the characterization of phage-encoded tRNA and host diversity because few phage species have their host diversity characterized. One way to characterize host diversity is by subjecting phages to a diverse array of hosts and checking for lytic activities (Villegas et al. 2009). Unfortunately, few such studies have been carried out.

Mutation Plays a Significant Role in Phage Codon Adaptation

The rate of spontaneous deamination leading to C→T mutation is about 100 times higher in ssDNA than in dsDNA (Frederico et al. 1990), and such high mutation rate mediated by oxidative deamination has been reported in a ssDNA phage M13 (Kreutzer and Essigmann 1998). These high C→T mutations prevent ssDNA phages from evolving a codon usage pattern as close to that of the host as dsDNA phages. This is substantiated by the observation that rRSCU for R-ending codons are significantly greater than rRSCU for Y-ending codons in ssDNA phages (table 5). Although our result is consistent with the mutation hypothesis, the lack of selection for Y-ending codons may also play a role in the poor concordance in RSCU for Y-ending codons between ssDNA phages and E. coli. A previous study (Xia 2008) strongly suggests that tRNAs with a wobble G are equally efficient in decoding C-ending and U-ending codons. This implies that C→T mutations will not be counterchecked by selection, leaving the ratio of U-ending to C-ending codons entirely to the mercy of mutation bias.

A New Type of Codon Adaptation in ssDNA Phage in Response to the C→T Mutation Pressure

The C→T mutation pressure has driven ssDNA phages to evolve a previously unknown type of codon adaptation by biased usage of codon families. That is, they overuse U-friendly codon families in which C→T-biased mutations improve codon adaptation and avoid U-hostile codon families in which the biased mutation hampers codon adaptation (fig. 2). We have illustrated this adaption strategy with the codon usage in the Ser codon family for Enterobacteria phage Ike (NC_002014, Inoviridae) with a strong SKEWTC indicating a strong C→T mutation bias. This simple strategy allows the protein-coding genes in ssDNA phages to have CAI values comparable to those of dsDNA phages. We have noticed an analogous codon adaptation in the six-codon Leu, Arg, and Ser compound codon families in the yeast, Saccharomyces cerevisiae, in which the number of tRNA genes differ much between the four-codon subfamily and the two-codon subfamily. The yeast genome has 17 tRNALeu genes for the two-codon UUR subfamily but only four tRNALeu genes for the four-codon CUN codon family. The UUR codons account for 84% of Leu codons in highly expressed yeast genes compiled in the EMBOSS distribution (Rice et al. 2000). A similar pattern is observed for the Arg codon family. There are 16 tRNASer genes for the four-codon UCN subfamily and only two for the two-codon AGY codon subfamily. As expected, the UCN codons account for 89% of all Ser codons in highly expressed yeast genes. In short, whenever possible, selection for increased translation efficiency would drive protein-coding genes to maximize the use of codons that have many tRNAs to decode them. Our study can be advanced in two ways. First, it should take into consideration the role of translation initiation in addition to translation elongation. Genes with poor translation initiation are not expected to increase their protein production with optimized codon usage. It is only genes with efficient translation initiation that are expected to increase protein production with improved codon–anticodon adaptation (Tuller et al. 2010). Second, the existing phage genomic sequences still do not allow the construction of a sufficiently large phylogeny for phylogeny-based comparisons (Felsenstein 1985; Xia 2013), mainly due to 1) the rapid evolution of phage genomes, especially ssDNA phage genomes, and 2) few homologous genes identifiable among phage species parasitizing E. coli. However, one could argue that, given the rapid evolutionary erosion of coancestry among these phage lineages, the data from different phage lineages may indeed be considered nearly independent. Phages are essentially a mosaic of genes sampled from a pool of frolicking phage genomes. For example, although a number of “related” tailed phages have nearly identical genome organization at function level such as “DNA packaging-head-tail-tail fiber-lysis-lysogeny-DNA replication-transcription regulation” (Desiere et al. 2001), essentially any function in a phage can be fulfilled by one of many distinct genes with “homologous” function but little sequence homology (Brussow and Kutter 2005). In other words, horizontal gene transfer is so rampant that, coupled with rapid evolution, phylogenetic reconstruction based on sequence homology is nearly impossible. For example, a large number of phages have DNA polymerase, but these DNA polymerases apparently belong to a number of nonhomologous classes. Supplementary files S1-S3, Supplementary Material online, list all E. coli phage genes that share functional similarity but not necessarily sequence similarity, so that future researchers can add to it with newly sequenced phage genomes. The difficulty in building a reliable phage tree also prevents an interesting question to be addressed. The loss/gain of tRNA genes may be related to host tRNA pool. Take AAR (Lys) codon family, for example. If a phage species overusing AAA codons originally parasitizes a host overusing AAG codons and having abundant tRNALys/CUU but rare tRNALys/UUU, then the phage would benefit from retaining a tRNALys/UUU gene decoding its overused AAA codons. If the phage subsequently switched to a host overusing AAA codons and having abundant tRNALys/UUU, then the phage-encoded tRNALys/UUU gene would be of little value and would be prone to gene loss. Addressing such a question would be straightforward if one can build a reliable phage tree, so that the gain/loss of tRNA genes can be mapped onto the tree.

Materials and Methods

Genomic Data and Processing

The genome sequences of 469 dsDNA phages, 41 ssDNA phages, and their corresponding bacterial hosts were downloaded from GenBank, of which 71 have E. coli specified as their host in the “/HOST” tag in “FEATURES” table, including 60 dsDNA phages and 11 ssDNA phages. All phage genomes were searched for encoded tRNAs by using tRNAscan-SE Search Server (Schattner et al. 2005). The complete compilation with phage name, phage family, phage accession, phage genome length, genomic GC%, number of coding sequences (CDSs) in each phage genome, genomic TC skew defined as (NT − NC)/(NT + NC) where NC and NT are the genomic counts of nucleotides C and T, number of tRNA genes encoded in each phage genome, rRSCU, and CAI were included in a supplementary file S1, Supplementary Material online. Escherichia coli has many strains sequenced, but the “/Host” tag in most annotated viral genomes gives only species name (i.e., E. coli), with no strain-specific information. For this reason, the host GC% and RSCU are computed from the average of all E. coli genomes (The difference among E. coli strains is minimal.). The mean E. coli genome length is 5,024,514 nt, mean number of CDSs is 4,692.2, and mean genomic GC% is 50.68. The genomic accession numbers of all E. coli strains used to compute the average statistics are also included in the supplementary file S1, Supplementary Material online. The classification of phages into temperate and virulent categories is based on three publications (Lima-Mendez et al. 2007; Deschavanne et al. 2010; McNair et al. 2012).

Indices of Codon Adaptation

CDSs and tRNA genes in each phage and host genomes were extracted and RSCU computed by using DAMBE (Xia 2013). rRSCU (correlation between host and phage RSCU values) is taken as a measure of phage codon adaptation to the host translation machinery, with justifications outlined in the Introduction. Single-codon families such as the Met (coded by AUG) and Trp (coded by UGG) were excluded from computing rRSCU because the RSCU value is 1 for the two codons regardless of codon usage. CAI was computed with the improved implementation (Xia 2007) and highly expressed E. coli genes as the reference gene set. Throughout the text, the codon usage of highly expressed E. coli genes refers to the codon usage table compiled and distributed with the EMBOSS package (Rice et al. 2000). The median CAI for protein-coding genes for each phage is used as an alternative measure of phage codon adaptation. We did not use Nc (Wright 1990; Sun et al. 2013) as a measure of codon adaptation for the following reason. For an E. coli phage, selection by the host tRNA pool is expected to increase rRSCU and CAI. In contrast, mutation, biased or not, will decrease rRSCU and CAI. The effect of mutation and tRNA-mediated selection on Nc is more difficult to distinguish. In general, tRNA-mediated selection will decrease Nc, but biased mutation will also decrease Nc. For this reason, Nc is not good for measuring codon adaptation in E. coli phages.

Supplementary Material

Supplementary files S1–S3 are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).
  72 in total

Review 1.  Biotechnological challenges of phage therapy.

Authors:  Mikael Skurnik; Maria Pajunen; Saija Kiljunen
Journal:  Biotechnol Lett       Date:  2007-03-16       Impact factor: 2.461

2.  Coevolution of codon usage and tRNA genes leads to alternative stable states of biased codon usage.

Authors:  Paul G Higgs; Wenqi Ran
Journal:  Mol Biol Evol       Date:  2008-08-06       Impact factor: 16.240

3.  An extensive study of mutation and selection on the wobble nucleotide in tRNA anticodons in fungal mitochondrial genomes.

Authors:  Malisa Carullo; Xuhua Xia
Journal:  J Mol Evol       Date:  2008-04-10       Impact factor: 2.395

4.  A sensitive genetic assay for the detection of cytosine deamination: determination of rate constants and the activation energy.

Authors:  L A Frederico; T A Kunkel; B R Shaw
Journal:  Biochemistry       Date:  1990-03-13       Impact factor: 3.162

5.  Characterization of the phage-specific transfer RNA molecules coded by cholera phage phi 149.

Authors:  N Mandal; R K Ghosh
Journal:  Virology       Date:  1988-10       Impact factor: 3.616

6.  The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications.

Authors:  P M Sharp; W H Li
Journal:  Nucleic Acids Res       Date:  1987-02-11       Impact factor: 16.971

7.  The use of genomic signature distance between bacteriophages and their hosts displays evolutionary relationships and phage growth cycle determination.

Authors:  Patrick Deschavanne; Michael S DuBow; Christophe Regeard
Journal:  Virol J       Date:  2010-07-17       Impact factor: 4.099

8.  A general model of codon bias due to GC mutational bias.

Authors:  Gareth A Palidwor; Theodore J Perkins; Xuhua Xia
Journal:  PLoS One       Date:  2010-10-27       Impact factor: 3.240

9.  Genome landscapes and bacteriophage codon usage.

Authors:  Julius B Lucks; David R Nelson; Grzegorz R Kudla; Joshua B Plotkin
Journal:  PLoS Comput Biol       Date:  2008-02-29       Impact factor: 4.475

10.  The cost of wobble translation in fungal mitochondrial genomes: integration of two traditional hypotheses.

Authors:  Xuhua Xia
Journal:  BMC Evol Biol       Date:  2008-07-19       Impact factor: 3.260

View more
  16 in total

1.  Dissimilation of synonymous codon usage bias in virus-host coevolution due to translational selection.

Authors:  Feng Chen; Peng Wu; Shuyun Deng; Heng Zhang; Yutong Hou; Zheng Hu; Jianzhi Zhang; Xiaoshu Chen; Jian-Rong Yang
Journal:  Nat Ecol Evol       Date:  2020-03-02       Impact factor: 15.460

2.  Escherichia coli and Staphylococcus phages: effect of translation initiation efficiency on differential codon adaptation mediated by virulent and temperate lifestyles.

Authors:  Ramanandan Prabhakaran; Shivapriya Chithambaram; Xuhua Xia
Journal:  J Gen Virol       Date:  2015-01-22       Impact factor: 3.891

3.  Bacteriophage evolution differs by host, lifestyle and genome.

Authors:  Travis N Mavrich; Graham F Hatfull
Journal:  Nat Microbiol       Date:  2017-07-10       Impact factor: 17.745

Review 4.  Bioinformatics and Drug Discovery.

Authors:  Xuhua Xia
Journal:  Curr Top Med Chem       Date:  2017       Impact factor: 3.295

5.  Isolation and Characterization of a Shewanella Phage-Host System from the Gut of the Tunicate, Ciona intestinalis.

Authors:  Brittany Leigh; Charlotte Karrer; John P Cannon; Mya Breitbart; Larry J Dishaw
Journal:  Viruses       Date:  2017-03-22       Impact factor: 5.048

6.  Evolutionary interpretations of mycobacteriophage biodiversity and host-range through the analysis of codon usage bias.

Authors:  Lauren A Esposito; Swati Gupta; Fraida Streiter; Ashley Prasad; John J Dennehy
Journal:  Microb Genom       Date:  2016-10-21

7.  The Role of +4U as an Extended Translation Termination Signal in Bacteria.

Authors:  Yulong Wei; Xuhua Xia
Journal:  Genetics       Date:  2016-11-30       Impact factor: 4.562

8.  The Evolution of Molecular Compatibility between Bacteriophage ΦX174 and its Host.

Authors:  Alexander Kula; Joseph Saelens; Jennifer Cox; Alyxandria M Schubert; Michael Travisano; Catherine Putonti
Journal:  Sci Rep       Date:  2018-05-29       Impact factor: 4.379

9.  A major controversy in codon-anticodon adaptation resolved by a new codon usage index.

Authors:  Xuhua Xia
Journal:  Genetics       Date:  2014-12-05       Impact factor: 4.562

10.  Coevolution between Stop Codon Usage and Release Factors in Bacterial Species.

Authors:  Yulong Wei; Juan Wang; Xuhua Xia
Journal:  Mol Biol Evol       Date:  2016-06-13       Impact factor: 16.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.