Literature DB >> 16963497

Codon usage bias and tRNA over-expression in Buchnera aphidicola after aromatic amino acid nutritional stress on its host Acyrthosiphon pisum.

Hubert Charles1, Federica Calevro, José Vinuelas, Jean-Michel Fayard, Yvan Rahbe.   

Abstract

Codon usage bias and relative abundances of tRNA isoacceptors were analysed in the obligate intracellular symbiotic bacterium, Buchnera aphidicola from the aphid Acyrthosiphon pisum, using a dedicated 35mer oligonucleotide microarray. Buchnera is archetypal of organisms living with minimal metabolic requirements and presents a reduced genome with high-evolutionary rate. Codonusage in Buchnera has been overcome by the high mutational bias towards AT bases. However, several lines of evidence for codon usage selection are given here. A significant correlation was found between tRNA relative abundances and codon composition of Buchnera genes. A significant codon usage bias was found for the choice of rare codons in Buchnera: C-ending codons are preferred in highly expressed genes, whereas G-ending codons are avoided. This bias is not explained by GC skew in the bacteria and might correspond to a selection for perfect matching between codon-anticodon pairs for some essential amino acids in Buchnera proteins. Nutritional stress applied to the aphid host induced a significant overexpression of most of the tRNA isoacceptors in bacteria. Although, molecular regulation of the tRNA operons in Buchnera was not investigated, a correlation between relative expression levels and organization in transcription unit was found in the genome of Buchnera.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16963497      PMCID: PMC1636365          DOI: 10.1093/nar/gkl597

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Associated with agricultural aphid pests, Buchnera aphidicola is one of the best-known symbiotic intracellular bacteria of insects. Aphids rely on Buchnera to support the rapid growth and reproductive potential that make them such versatile and notorious pests. Buchnera furnishes some vitamins and most essential amino acids that the aphid host cannot synthesize or find in sufficient quantities in plant phloem sap (1,2). The Buchnera genome presents all the characteristics of a vertically transmitted intracellular bacterium (3): (i) a small size of 400–600 kb, depending on host species (4); (ii) a highly biased base composition towards A and T (5); and (iii) a high-evolutionary rate due to drastic bottlenecks and the absence of recombination, occurring in the population dynamics of the bacteria following transmission through host generations (6). During its intracellular evolution, Buchnera has conserved most of the genes encoding vital physiological functions for the symbiotic entity. As an example, enzymes of the essential amino acid biosynthesis pathways were conserved whereas most of the genes encoding proteins involved in non-essential amino acid biosynthesis (those the aphids would produce) are deleted. DNA replication and repair mechanisms have been partly deleted, as compared to its free-living relative Escherichia coli. The same observations can be made regarding the transcription machinery (i.e. lack of many transcriptional regulators, as well as recognizable terminator and promoter sequences). Such genes and structures have been lost during the genome shrinkage of Buchnera, probably early in the intracellular life style of the bacteria (7,8). As generally in all bacteria, protein translation machinery is also highly conserved in Buchnera. Hence, Koonin (9), based on comparative genomics, determined a ‘consensus’ prokaryotic gene set composed of ∼60 proteins, primarily those involved in translation functions. Genome comparison between Buchnera and E.coli, using the KEGG database (), reveals that all of the 55 ribosomal proteins, the 12 translation factors and the 21 tRNA amino-acyl transferases found in E.coli are also present in Buchnera. It is noticeable that rRNA genes are present as a single copy in Buchnera, whereas seven copies are present in E.coli. This point is generally interpreted as a property of slow-growing organisms (10,11). The tRNA set of Buchnera is composed of 32 isoacceptors and is almost minimal, being only surpassed in compactness by 8 organisms of the TIGR CMR (), all of them from small genome Mollicutes (Mycoplasma), and the euryarcheota Methanopyrus kandleri. Mycoplasma mobile is the only bacterium with a putative remnant set of 28 tRNA genes in its genome (11), including just two tRNA-Met. When analysing the codon distribution of the tRNA genes present in the Buchnera genomes (Table 1), it seems clear that the occurrence of genes is almost optimized to fulfil a minima the standard wobble rule (encoded anticodons starting with G or U, i.e. codons ending with C or A). The charging of proline and isoleucine through a single tRNA implies, for these two amino acids (and arginine), that a part of the tRNA population is modified with non-standard bases to ensure a proper recognition of the three non-perfect match codons. For arginine and proline, it is noteworthy that the encoded anticodons do not follow the general rule, and that the tRNAs matching the CGC and CCC codons were avoided, in contrast to all the other C-terminated codons, which have their matching tRNAs. This situation is not unique among small genomes, as many other bacteria displayed very similar sets of tRNA-Arg, including Wolbachia pipientis, Wiggelsworthia glossinidia brevipalpis and Blochmania floridanus.
Table 1

Codon usage and tRNA isoacceptors in the genome of B.aphidicola

TCAG
codaAARCSUbNcodAARCSUNcodAARCSUNcodAARCSUN
TTTTPhe1.838662TCTSer2.565821TATTyr1.735882TGTCys1.641874
TTC0.17814TCC0.25570TAC0.27910TGCc0.36409
TTALeu3.8712 029TCA1.493388TAASTOP2.27433TGASTOP Trp0.3465
TTG0.621932TCG0.21468TAG0.3975TGG11701
CCTTLeu0.732251CCTPro1.892678CATHis1.753470CGTArg22384
CTC0.1320CCC0.29404CAC0.25504CGC0.29342
CTA0.561727CCA1.522144CAAGln1.745226CGAArg0.951128
CTG0.12370CCG0.3423CAG0.26790CGG0.09102
AATTIle1.6511 901ACTThr1.823900AATAsn1.7211 599AGTSer1.232801
ATC0.251783ACC0.26551AAC0.281864AGC0.26596
ATAIle1.118013ACA1.713674AAALys1.8417 059AGAArg2.52962
ATGMet14023ACG0.21453AAG0.161514AGG0.18211
GGTTVal1.84102GCTAla1.763688GATAsp1.767153GGTGly1.674244
GTC0.28640GCC0.25524GAC0.24966GGC0.3782
GTA1.613665GCA1.723618GAAGlu1.829417GGA1.784529
GTG0.32726GCG0.27572GAG0.18942GGG0.25643

aPerfect match codons are bold-underlined (codons with a corresponding perfect match anticodon-harbouring isoacceptor tRNA in Buchnera).

bRSCU, relative synonymous codon usage (ΣRSCU = n, the degeneracy level for each amino acid); boldfaced figures are prefered codons (RCSU > 1).

cBU330 described as pseudo-tRNA (GCA anticodon) in the automated annotation of TIGR-CMR.

The frequency of alternative synonymous codons varies among bacterial species and often also among genes from a single genome. Factors influencing codon usage in bacteria are the GC skew (i.e. differential mutation rate between lagging and leading strand) (12), horizontal gene transfer between bacteria exhibiting different codon usage bias (13) and selection for translation efficiency. The latter hypothesis was formulated first in E.coli. Ikemura (14,15) showed that tRNA abundances are correlated with codon usage: highly expressed isoacceptors are those corresponding to the more frequently used codons (rule 1). This correlation has also been found in a few other bacteria [for a review see (11)]. In contrast, selection for codon usage is often inefficient in bacteria with extreme GC content (16–18). In the case of Buchnera, several authors have claimed the absence of selection for codon usage (19,20). As each isoacceptor recognizes several codons, additional rules are needed to fully explain codon choice in bacteria [listed in (21): rule 2: modified uridines such as thiolated uridine and 5-carboxymethyluridine at the wobble position produce a preference for A over G at the codon third position; rule 3: an inosine at the anticodon wobble position produces a preference for T or C over A at the third position; rule 4: in two-codon sets of the (A/T)-(A/T)-pyrimidine type, C rather than T at the third position promotes an optimal interaction strength between codon and anticodon (the ‘good choice’ codon rule of Grosjean et al. (22) and Grantham et al. (23)]. Regulation of rRNA and tRNA expression is complex in bacteria. In E.coli, rRNA and tRNA operons are negatively regulated by the free nontranslating ribosomes (24). However, the regulation mechanism is unclear and probably occurs during the four following steps: transcription, processing of precursors, degradation of precursors and degradation of mature tRNA (25). Recently, it has been shown that tRNA levels corresponding to major isoacceptors increase while those corresponding to minor isoacceptors decrease when the bacterial growth rate increases (26,27). Dittmar et al. (25) also demonstrated, in Bacillus subtilis, that the relative abundances of tRNA were higher when the bacteria were placed in nutritional conditions producing high growth rates. In Buchnera, the 32 tRNA genes are organized into 18 groups of contiguous sequences (dashed lines in the Table 2). Two of them include the rRNA genes rrs (BU243) and rrf-rrl (BU490-BU491), and the associated tRNAs are highly expressed. As a comparison, E.coli K-12 has 88 tRNA genes organized in 40 transcription units, of which 7 are associated with rRNA and are highly expressed. Global transcriptional regulators are known to be involved in the regulation of tRNA expression (28). Among them, the histone-like proteins FIS and H-NS are preponderant. Such regulators are conserved in the genome of Buchnera but their role in tRNA regulation has not yet been demonstrated.
Table 2

Chromosome location and isoacceptor tRNA expression in B. aphidicola from A.pisum reared on YF0 and control medium (AP3)

LabelsanameSg-AMFCUp/down
BU017tRNA-Phe-GAA12.53−0.1750.8850
BU041tRNA-Thr-GGU10.84−0.0300.9790
BU042tRNA-Gly-GCC12.610.6461.565+
BU043tRNA-Tyr-GUA15.400.2441.1840
BU044tRNA-Thr-UGU16.090.4061.325+*
BU068tRNA-Lys-UUU11.760.1541.1130
BU069tRNA-Val-GAC13.440.2311.1730
BU071tRNA-Ala-GGC14.25−0.0280.9800
BU111tRNA-Val-UAC11.850.1761.1300
BU244tRNA-Ile-GAU28.080.4901.405+
BU245tRNA-Ala-UGC15.540.0261.0180
BU249tRNA-Asp-GUC12.770.0131.0090
BU329tRNA-Leu-GAG15.950.8511.804+**
BU330tRNA-Cys-PseudoGCA15.310.2581.200+*
BU331tRNA-Ser-GCU10.720.9341.911+
BU379tRNA-Leu-UAA20.41−0.2760.825
BU405tRNA-Ser-GGA16.430.3731.295+
BU406tRNA-Arg-ACG12.770.1151.0830
BU412tRNA-Gln-UUG15.270.2971.228+
BU413tRNA-Leu-UAG15.190.5101.424+*
BU414tRNA-Met-CAU10.990.0721.0510
BU457tRNA-Met-CAU11.290.1931.1430
BU485tRNA-Arg-CCG10.630.4041.324+
BU492tRNA-Glu-UUC20.28−0.4560.728
BU540tRNA-Ser-UGA21.730.9671.955+
BU557tRNA-Asn-GUU25.57−0.0160.9880
BU558tRNA-Met-CAU10.110.4171.335+
BU575tRNA-Gly-UCC14.260.2711.207+
BU593tRNA-Pro-UGG19.590.9231.896+
BU594tRNA-His-GUG8.780.1791.1320
BU595tRNA-Arg-UCU10.000.2261.1690*
BU601tRNA-Trp-CCA10.180.2461.1860

Expression levels (g-A) are given as g-normalized A values with , Logratio are and fold changes (FC) are 2M. + and − are up- and down-regulated genes with fold changes greater than 1.2 or smaller than 0.83. Genes marked with an asterisk show M value statistically not equal to 0 (**P-value < 0.05, *P-value < 0.1). Dashed lines in the table separate non-contiguous genes on the chromosome.

aThe three numerical digits represent chromosome locations (gene numbering from the origin of replication).

This work addresses the following two questions: (i) Is codon usage only driven by mutational bias in Buchnera, as is generally claimed? and (ii) To what extent is the regulation of tRNA expression functional in the degenerate genome of Buchnera? tRNA relative abundances were measured with an oligonucleotide microarray dedicated to the Buchnera genome when the aphid Acyrthosiphon pisum diet is depleted of tyrosine and phenylalanine, two important amino acids for which Buchnera furnishes the phenolic nucleus to its host when supply is low. An analysis of the different rules for assigning optimal codons in Buchnera is presented.

MATERIALS AND METHODS

Aphids and artificial diets

A long-established parthenogenetic clone (LL01) of A.pisum (Harris) was maintained at 21°C with a 16 h light photoperiod. Alate viviparous adults, reared at low density on Vicia faba seedlings for 2 days, were allowed to lay progeny on young plants. Neonate aphids (aged 0–12 h) were used for all the experiments. All experiments were initiated by transferring 2-day-old aphids from plants to artificial diets. Two diet formulations were used in this study, differing only in their amino acid composition. Diet AP3 was nutritionally optimized, and based on the total amino acid profiles of whole aphid tissues, as described by Febvay et al. (29). The YF0 diet shared the omission of both phenylalanine and tyrosine from the AP3 complete diet, and was designed to stimulate the biosynthetic activities of Buchnera for the important classes of essential amino acids harbouring the phenolic nucleus. For the microarray experiments, Buchnera were purified from the aphids as described by Charles and Ishikawa (30).

Microarray experiments

Protocols for RNA isolation and labelling, microarray manufacturing (including probe design), hybridization reaction and washing, microarray scanning and statistical analyses of data are available on the SITRANS Database under the project name ‘faromat’ (31). Details of RNA preparation are provided in Calevro et al. (32). tRNA relative and g-normalized expressions (see further) are given in Table 2. Microarray data are also available in the Array Express database (accession no. E-TABM-83). The complete physiological and metabolic analysis has been submitted elsewhere. The procedure is described briefly below. Oligonucleotide probe sequences (35 bases) were defined using the software ROSO (33). RNA was purified using the Trizol method and indirectly labelled by incorporating aminoallyl-dUTP into reverse transcript cDNA. Cy3 and Cy5 fluorescent dyes were then coupled with the targets in a dye swap experimental design including 2 × 3 microarrays (Cy3-YF0/Cy5-AP3 and Cy3-AP3/Cy5-YF0). QMT aldheyde slides (Interchim, Montluçon, France) were used and hybridizations were performed manually and automatically using a Ventana Discovery automated station. Manual hybridization took place under a glass cover slip (24 × 60 mm) in a humidified slide chamber (Proteigene, Saint Marcel, France) and incubated at 50°C overnight for 16 h. The slides were washed twice in 2× SSC/0.1% SDS buffer for 10 min at room temperature and four times in 0.1× SSC for 1 min at room temperature, rinsed in distilled water and dried by centrifugation. In the case of automatic hybridization, target solution was inserted under an oil droplet (mineral oil LCS) at 45°C for 8 h followed by several washes of variable stringencies in a Ventana Medical Systems hybridization apparatus (Ventana Inc., Tucson, AZ). Scanning was performed with a GeneTAC LSIV scanner (Genomic Solutions, Huntingdon, UK). Signal intensity values for each spot (pixel median) were recorded and quality analysis was performed with the GenePix 4.0 image analysis software (Axon Instruments, Foster City, CA). Normalization and statistical analysis of microarray data were performed with the Bioconductor libraries () of the R software (). Dye and slide normalizations were performed on an invariant set of genes that was determined a posteriori following the non-parametric methods of Tseng et al. (34). A printTip Loess normalization was then applied using the R library ‘marray’ developed by Dudoit et al. (35). Statistical tests applied on microarray data are modified t-test (36) and ANOVA analysis using the R library ‘maanova’ (37). Non-parametric tests (i.e. simulated Pearson's Chi-squared, median and Wilcoxon tests) were performed with the R software. Microarray technique usually does not allow absolute expression measurements because the hybridization reaction between the probes and their targets is highly variable. However, in bacteria, the use of genomic DNA (g-normalization) offers the facility to calibrate absolute gene expressions (38). This calibration was used here to estimate the expressions of the 32 tRNA isoacceptors of Buchnera, all sharing similar thermodynamic properties (i.e. length and GC content). Genomic DNA was classically purified with phenol/chloroform/isoamyl alcohol (25:24:1) protocol, followed by an RNase A treatement (Sigma-Aldrich, St Quentin Fallavier, France). The labelling was performed using the Nick Translation kit (Amersham) with 5 µg genomic DNA and incorporating the dUTP-Cy3 fluorochrome (Amersham). Non-incorporated fluorescent dyes were eliminated by purification on AutoseqTM G-50 columns (Amersham). Obtained targets have been independently hybridized on four additional slides to estimate the relative hybridization rate of each Buchnera gene probe. As no gene repeat occurs in Buchnera, genomic DNA fluorescence signals (gFg) should be homogeneous over the 617 Buchnera genes. Assuming this homogeneity, we calculated a calibration coefficient for each gene . g-Normalized tRNA expressions were then expressed as percentage values within the group of tRNA genes.

Expression datasets and codon usage indices

Highly and poorly expressed gene sets (n = 50 for each, Supplementary Table 1) were selected based on expression data obtained with the Buchnera microarray on AP3 diet. The highly expressed genes included 19 ribosomal proteins, 8 flagellar genes, tuf, mopA, mopB and several other genes coding for metabolic enzymes. Codon usage indices (RSCU and Nc) were calculated using the codonw software developed by J. Peden (). RSCU (relative synonymous codon usage) were computed by dividing the observed frequency for one codon by the frequency expected if all synonyms for that amino acid were used equally. Thus, RSCU values close to unity indicate a lack of bias. Differential RSCU were the ratio between RSCU obtained with the highly expressed genes versus RSCU of the poorly expressed genes. Comparisons were made using codon effectives (Nc, see below) and Pearson's Chi-squared tests. Nc is the effective number of codons used by a gene, and varies between 20, when only one codon is used for each amino acid, and 61, when codons are randomly used for each amino acid. Expected values of Nc can be estimated under the assumption that codon bias is only due to mutational bias (GC3s) using the following formula:

RESULTS

Isoacceptors and codon usage

Boldface and underlined codons in Table 1 represent the perfect match codons for the 32 tRNA isoacceptors in Buchnera. The Buchnera tRNA repertory and comparison with other small genome bacteria have been described previously in Introduction. Codon usage is presented in Table 1 with the RSCU computed for each codon. It is very clear from the table that preferred codons in Buchnera are those ending with A or T. Codon usage in Buchnera is hence mainly governed by the mutational AT bias and most often the ‘good choice’ rule is violated.

tRNA expression and isoacceptor abundances

Unexpectedly, the expression of genes encoding tRNAs could only be quantified using manual hybridization. Figure 1C and D shows that when automated hybridization was performed tRNAs were systematically over-labelled with red fluorescence. The hypothesis of a direct interaction between Cy5 and tRNA can be rejected because when hybridization was performed manually (i.e. under a glass cover slip in a hybridization chamber, Material and Methods), the dye swap was observed for the tRNA genes (Figure 1A and B). We hypothesize that this artefact, which is specific to tRNA genes, is caused by an interaction between tRNAs and the mineral oil (LCS) used in Ventana automated hybridization. Indeed, the fluorescence of the oil at the same wavelength as Cy5 is described in the technical notes of the manufacturer. To explain this measurement artefact, secondary structures of tRNA molecules might be involved. However, examination of dye swap responses on the other Buchnera genes harbouring secondary structures (such as rRNA) did not show the same over-labelling problem (data not shown). As tRNAs are GC-rich sequences, the influence of GC content was tested, but again no significant effect was observed (data not shown). Specific properties of tRNAs, such as their link to amino acid, were suspected to be responsible for the artefact, but this possibility was not explored further here. Unexplained dye bias was also reported by Dittmar et al. (25) in the fluorophore-labelling reaction for some tRNA species.
Figure 1

Normalized MA-plots for four slides in a dye swap experiment. (A and B) Manual hybridization; (C and D) automated hybridization. represents raw mean expression value. represents YF0 versus AP3 logratio in (A) and (C) plots, whereas reverse hybridizations are found in (B) and (D) plots. tRNA genes are highlighted with red (over-expressed) and green (repressed) colours on graphs. Dye swap is clearly visible with manual hybridization, whereas in automated hybridization tRNA are always over-labelled in red (Cy5 fluorescence).

g-Normalized expression levels for the 32 tRNA isoacceptors, corresponding to the control nutritional conditions for the host aphids (AP3 diet), are presented in Table 2 (‘g-A’ column). Figure 2 represents the correlation between g-normalized tRNA relative abundances and codon frequencies for the 50 most expressed genes of Buchnera. A significant correlation was observed (r = 0.38, P = 0.02), moreover removing the AT-rich codon (AAA) returns an higher correlation (r = 0.61, P = 4 × 10−4). Similar results were obtained with the whole set of Buchnera genes (data not shown). In Buchnera, the abundances of isoacceptors tRNA are therefore adjusted to their matching codon frequencies. These results confirm that rule 1 is respected in Buchnera. However, the expression of the , which pairs the overused codons AAA (representing about 12% of all the Buchnera codons), has not been increased in the bacteria.
Figure 2

Plot of g-normalized tRNA relative abundances (expressed as percentage of total tRNA) versus matching codon compositions (i.e. the sums of codons associated to the corresponding tRNA species) of the 50 most expressed genes in Buchnera. Solid line, complete regression; dashed line, regression without the AAA-Lysine codon.

Codon usage bias and codon–anticodon pairing rules

Differential RSCU values were computed using the two sets of highly and poorly expressed genes in Buchnera (Table 3, column Hi/Lo). Significant differences in codon effectives (Chi-squared test) were found for the following amino acids: leucine, isoleucine, valine, tyrosine, histidine, asparagine and arginine. For these amino acids (except Ile and Val), the highly expressed genes prefer C-ending codons as compared to the low expression group. It is also noteworthy that for all of the 16 C-ended codons in Table 3, the differential RSCU is above or equal to 1. This distribution is therefore highly biased (median test, P = 10−5) and is partly explained by the optimal codon choice rule 4 for the amino acids Phe, Tyr, Asn, Asp and His. These C-ending codons are optimal for a majority of microorganisms (21). Although rare in Buchnera due to the AT bias, they could be selected for high expression due to their optimal pairing capacities.
Table 3

tRNA g-normalized expression and differential RSCU

TCAG
codaAAExpbHi/LocodAAExpHi/LocodAAExpHi/LocodAAExpHi/Lo
TTTTPhe12.530.97cTCTSer0.98TATTyr*0.94TGTCys1
TTC1.35TCC16.431.25TAC15.41.42TGCc15.311
TTALeu*20.410.97TCA21.730.99TAASTOP0.92TGASTOP1.35
TTG0.77TCG0.56TAG1.13TGGTrp10.181
CCTTLeu*1.25CCTPro0.9CATHis*0.93CGTArg*12.771.36
CTC15.951.55CCC1.11CAC8.781.6CGC1.35
CTA15.191.1CCA19.591.1CAAGln15.271.01CGAArg*0.87
CTG1.1CCG1.08CAG0.93CGG10.630.5
AATTIle*1.06ACTThr1.03AATAsn*0.96AGTSer1.04
ATC28.081.04ACC10.841.1AAC25.571.28AGC10.721.34
ATAIle*0.89ACA16.091.01AAALys11.761AGAArg*100.87
ATGMet10.811ACG0.66AAG1AGG0.33
GGTTVal*0.98GCTAla0.91GATAsp0.97GGTGly0.99
GTC13.441GCC14.251.26GAC12.771.23GGC12.611.19
GTA11.850.93GCA15.541.04GAAGlu20.281.01GGA20.280.98
GTG1.71GCG1.08GAG0.95GGG0.96

aPerfect match codons are bold-underlined (codons with a corresponding perfect match anticodon-harbouring isoacceptor tRNA in Buchnera).

bExpression measurement (g-normalized fluorescence) of the tRNA isoacceptor.

cDifferential ratio of RSCU between the high and low expressed genes in Buchnera (boldface values are extreme values and asterisks are amino acids associated with significant Chi-squared test, α = 0.05).

Furthermore, the ATA Ile codon (requiring modified tRNA) is more rejected in the highly expressed genes. A similar rejection is observed for the arginine CGA codon which might be explained by the cost of the base modification required on the corresponding two tRNAs (rule 3). Finally, G-ending codons are often rejected in highly expressed genes as compared to poorly expressed genes (with the exception of Leu, Ser, Thr and Val). This might be explained by rule 2 involving thiolated or carboxymethylated urididine facing the codon third position; however, the preference for A-ending codons is not clearly observable in our dataset. The plot of the effective number of codons (Nc) versus GC content in the third position of synonymous codons (GC3s) is also well adapted to visualize codon usage bias (Figure 3). Gene distribution is nearly symmetrical around the expected curve (dashed line in Figure 3) revealing that codon usage is mainly directed by GC3s composition. However, the distribution of highly expressed genes is slightly biased toward small Nc values (codon usage bias) and higher GC3s, comforting the hypothesis of C-ending preferred codons, revealed by the differential RSCU analysis.
Figure 3

Effective number of codons used (Nc) plotted against GC3s (gc3) for 571 Buchnera genes. Dashed line represents the expected value of Nc if bias is only due to GC content; left panel: all Buchnera genes; right panel: open squares are the 50 genes with the highest expression rates and open triangles are the 50 genes with the lowest expression rates. Note that the square distribution is slightly decentred in the bottom and right area of the graph.

Codon usage bias and GC skew

GC skew is high in Buchnera () and a bias distribution of highly expressed genes on the leading strand might explain the observed codon usage bias (12). However, as the leading strand is more G+T rich, the C-ending codon preference in Buchnera should be attenuated (rather than accentuated) by strand distribution bias. Moreover, no significant differences were found when ratios of gene numbers located on leading strand versus lagging strand were compared for the two expression groups. Ratios were equal to 1.33 and 1.52 for the highly and poorly expressed genes, respectively. A more biased distribution, with numerous highly expressed genes located on the leading strand, might have been expected. However, this distribution is consistent with the work of Rocha and Danchin (39) showing, in different bacteria, that distribution of genes between the two replicating strands is driven by gene essentiality and not by gene expressivity. To quantify GC skew influence on codon usage in Buchnera, the two datasets (highly and poorly expressed) were split following the strand localization of each gene, and differential RSCU analyses were performed within both strands. The C-ending preference was enhanced in the CT-rich lagging strand and attenuated in the AG-rich leading strand. However, the tendency was still significant in both groups (data not shown). The rejection of G-ending codons (except the GTG-Val) was also observed for the two strands. The results presented here are contradictory with the work of Sharp et al. (11) who studied the strength of codon usage selection in 80 bacterial taxa. Based on a set of putative highly expressed genes mostly composed of ribosomal proteins, the authors observe that T-ending codons are unexpectedly preferred in Buchnera sp. and explain the bias by the GC skew in the bacteria. Two factors were probably misleading in this interpretation: (i) ribosomal protein genes are not the only and the highest expressed genes in Buchnera and (ii) no strand distribution asymmetry is observed with the highly expressed genes of Buchnera, whereas the essential ribosomal proteins are mainly located on the leading strand.

Regulation of tRNA gene expression

Depletion of the aromatic amino acids tyrosine and phenylalanine in aphid artificial diet results in up-regulation of most tRNA genes of Buchnera (Table 2, M column). Fold change measurements reveal positive values for 26 tRNA out of 32. This proportion is highly significant (median test, P-value = 2 × 10−4). If genes with fold changes close to 1 (i.e. >0.83 or <1.2) are grouped in an ‘invariant class’, the proportion of up-regulated genes remains significant (P-value = 2 × 10−3). The metabolic analysis of the experiment is not detailed in this paper. However, we can mention that six other genes related to tRNA metabolism were differentially expressed in the YF0 conditions. The genes glnS (Glutaminyl-tRNA synthetase), gltX (Glutamyl-tRNA synthetase) and glyS (Glycyl-tRNA synthetase beta chain) were up-regulated in YF0 condition, whereas pheT (Phenylalanyl-tRNA synthetase beta chain) was significantly down-regulated on the aromatic-depleted medium. Two genes involved in tRNA modification, trmE and trmD, were both significantly induced in the YF0 condition. Several ribosomal proteins were also regulated in the experiment. The TrmE protein is involved in the biosynthesis of the hypermodified nucleoside 5-methylaminomethyl-2-thiouridine, found in the wobble position on some tRNAs. Translation regulation efficiency might therefore be regulated via the modification of the wobble position of isoacceptor tRNAs. Overexpression of tRNA synthase genes is consistent with tRNA overexpression. Moreover, in the context of limiting aromatic amino acids, the specific repression of the pheT gene encoding the Phenylalanyl-tRNA synthetase might appear as a regulatory mechanism to increase the flux of the free phenylalanine exportation toward the host rather than incorporating this free amino acid for its own protein biosynthesis (‘altruistic’ response of endosymbiont). However, the latter two hypotheses are as yet speculative, and the mechanisms underlying this regulatory response are completely unknown. Expression regulation of genes encoding tRNA isoacceptors is usually determined by their association to the different transcription units (e.g. rRNA operons). In E.coli, genome location of tRNA genes explains ∼15% of their expression variation at a given growth rate (40). In Buchnera, tRNA genes are located on 18 different contiguous chromosome locations (pTU, putative transcription units) visualized by dashed lines in Table 2. Although not significant (P = 0.12),between-pTU variance of M values is always bigger than within-pTU variance (data not shown). To obtain more robust data, the M column was reduced to the up/down column of Table 2, and a contingency table was built between the two qualitative variables, ‘pTU’ and ‘up/down’. A Pearson's Chi-squared test was then applied with simulated P-values to prevent bias of small effectives. A significant dependency was observed (P-value = 0.05) between the two variables, indicating that there is a tendency for tRNA belonging to the same transcription unit, and most probably controlled by the same promoter, to be regulated similarly.

DISCUSSION

To our knowledge, the results presented here are the first analysis of tRNA gene expression in Buchnera, the previously published microarray work having been developed on a cDNA-chip lacking this set of short genes (41). Also, this is the first time that real sets of high/low expression genes have been used in the same species, characterized by extreme GC content composition, to analyse the link between expressivity and codon usage biases. In Buchnera, previous studies either analysed E.coli codon adaptation of orthologous genes to detect high expression (42) or used limited sets of genes from different species to infer those of the BuchneraAPS set (20). With only 32 molecular species, Buchnera set of tRNAs is almost minimal. Further reduction may have been possible, as was occurred in some mollicutes (2 tRNA-Met, 1 tRNA-Ala or 2 tRNA-Arg in M.mobile). Two reductions in tRNA numbers, observed in Buchnera, seem very widespread in small genomes, namely the charging of proline and isoleucine through a single tRNA. This implies, for these two amino acids (and for arginine), that a part of the tRNA population is hypermodified to ensure proper codon–anticodon recognition. Hence, all the enzymes involved in the hypermodification of the tRNAs (rnpA, pth, trmU, cca, trmD, gidA, trmE and truA) are conserved in Buchnera, as well as in other intracellular endosymbionts, whereas they are not all essential in E.coli (43). Owing to reduction in tRNA sets, we propose that this set of genes is trully essential in small genomes, confirming their inclusion in the minimal cell gene set proposed by Gil et al. (43). In bacteria, translational control of gene expression is partly regulated by changes in the charging of tRNA due to differential expression of single isoacceptors (44). Indeed during amino acid limitation, the concentration of the charged tRNAs corresponding to the most abundant codons tends to zero whereas concentrations of the other charged-tRNA will be unchanged. The theory of selective tRNA charging is the basis of the attenuation system described in E.coli (45), but in Buchnera, most if not all attenuation systems have been lost, and the reduction of tRNA species might be an adaptation to the metabolic constraints of the intracellular environment characterized by extreme limitations of several amino acids: reducing tRNA species and limiting codon usage would reduce the waste of free amino acids for charging useless tRNAs. The first question we addressed in this study concerned the forces driving codon usage in Buchnera. Mutational bias has a major influence on codon choice in the bacteria and is independent of gene expressivity. However, a significant correlation between codon composition of Buchnera genes and tRNA production was observed with the exception of the isoacceptor . For the latter, adjustment of its expression has been overcome by the effect of mutational bias increasing drastically the frequencies of the corresponding lysine codon in proteins. These results indicate that growth rate is probably not the main factor governing evolution of Buchnera. Indeed, Buchnera's environment is highly constrained by the metabolic demand from the aphid host. In that context, Rocha and Danchin (46) suggested that the higher energy cost and limited availability of G and C over A and T/U could explain the AT-enrichment of all obligate intracellular organisms including bacteria, viruses and plasmids. Such metabolic constraints are not restricted to the bacteria and occur on the whole symbiocosme (e.g. the associated organisms). Hence, the AT-enrichment of the pea-aphid genome observed by Sabater-Munoz et al. (47) might also be explained by the strict intracellular feeding of the insect. In Buchnera, a significant codon usage bias was also observed for the choice of rare codons: C-ending codons are preferred for the majority of amino acids in highly expressed genes. There is experimental evidence that rare codons can reduce the rate of translation in E.coli (48,49). In organisms with efficient selection for translational efficiency, one might expect rejection of rare codons in highly expressed genes. It has been demonstrated in Drosophila that accuracy of translation is increased when using perfect match codon–anticodons pairs (50). Conservation of C-ending rare codons within essential or highly expressed genes might correspond to selection for translation efficiency of functionally important protein amino acids in Buchnera (C. Rispe, personal communication). To test this hypothesis, we analysed the codon composition of the groEL Buchnera gene (one of the most highly expressed in the bacteria) and no clear evidence for selection was found. Of the 28 C-ending codons of the protein 3 were functionally important and highly conserved codons (51). When compared to the proportion of all the C-ending codon possibilities within the gene, the distribution was not significantly biased. Surprisingly, more C-ending codons accumulated within the positively selected regions of the protein described by Fares et al. (52) (P-value = 0.06). However, the distributions were not significantly biased in some other Buchnera species (e.g. those from the aphids Schizaphis graminum and Baizongia pistacea) and further studies are needed to analyse whether this codon bias is functional (selected) or whether it is a remnant trait derived from ancestral bacteria. The second question was directed towards the tRNA transcriptional regulation capacities of the bacteria. We have shown here that Buchnera is able to respond to the variation in nutritional demand of the aphid host, even though most specific transcriptional regulators are absent from its genome. Overexpression of tRNA genes is generally correlated with increase of metabolic activities and cell division in bacteria. In our experiment, the biological parameters of aphids feeding on the depleted diet are good. Indeed, only larval growth was slightly affected (−20%), but neither survival nor fecundity is affected by the lack of phenylalanine and tyrosine (Supplementary Figure 1). These results are consistent with a global increased compensatory metabolism of Buchnera upon phenolic amino acid deficiency in the aphid diet and the correlation between differential gene expression and transcription unit organization in the chromosome of Buchnera may indicate a functional regulation activity in the bacteria. In conclusion, this paper addresses the question of determinants that shape the evolution of codon usage, as well as tRNA isoacceptors abundance and diversity. Translation speed is optimized when tRNA abundances are adjusted to fit to codon usage of highly expressed genes for the highest growth rates in E.coli (53). However, as reviewed in Bailly-Bechet et al. (54), the efficiency of translation is probably not the only force driving codon usage and tRNA evolution. Metabolism limitation, proper folding of the nascent protein, horizontally transferred genes and local recycling of rare tRNAs are other factors that might influence selection of the translational machinery. These factors might be even more crucial for slow growers and intracellular bacteria (55). In the aphid-Buchnera association, the strong metabolic constraints affecting the two partners (i.e. both are intracellular feeders) have probably shaped the evolution of the translation machinery, explaining both the codon usage and the genome AT-enrichment.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.
  50 in total

Review 1.  Lateral gene transfer and the nature of bacterial innovation.

Authors:  H Ochman; J G Lawrence; E A Groisman
Journal:  Nature       Date:  2000-05-18       Impact factor: 49.962

2.  Gene essentiality determines chromosome organisation in bacteria.

Authors:  Eduardo P C Rocha; Antoine Danchin
Journal:  Nucleic Acids Res       Date:  2003-11-15       Impact factor: 16.971

3.  SITRANS: a Web Information System for Microarray Experiments.

Authors:  Frédérique Laforest; Anne Tchounikine; Tarak Chaari; Hubert Charles; Federica Calevro
Journal:  Stud Health Technol Inform       Date:  2005

4.  Growth rate-optimised tRNA abundance and codon usage.

Authors:  O G Berg; C G Kurland
Journal:  J Mol Biol       Date:  1997-07-25       Impact factor: 5.469

5.  Accelerated evolution and Muller's rachet in endosymbiotic bacteria.

Authors:  N A Moran
Journal:  Proc Natl Acad Sci U S A       Date:  1996-04-02       Impact factor: 11.205

Review 6.  Attenuation in the control of expression of bacterial operons.

Authors:  C Yanofsky
Journal:  Nature       Date:  1981-02-26       Impact factor: 49.962

7.  Co-variation of tRNA abundance and codon usage in Escherichia coli at different growth rates.

Authors:  H Dong; L Nilsson; C G Kurland
Journal:  J Mol Biol       Date:  1996-08-02       Impact factor: 5.469

8.  Consequences of reductive evolution for gene expression in an obligate endosymbiont.

Authors:  Jennifer L Wilcox; Helen E Dunbar; Russell D Wolfinger; Nancy A Moran
Journal:  Mol Microbiol       Date:  2003-06       Impact factor: 3.501

9.  Synonymous codon usage in Drosophila melanogaster: natural selection and translational accuracy.

Authors:  H Akashi
Journal:  Genetics       Date:  1994-03       Impact factor: 4.562

10.  The genomic pattern of tDNA operon expression in E. coli.

Authors:  David H Ardell; Leif A Kirsebom
Journal:  PLoS Comput Biol       Date:  2005-06-24       Impact factor: 4.475

View more
  11 in total

1.  Different levels of transcriptional regulation due to trophic constraints in the reduced genome of Buchnera aphidicola APS.

Authors:  Nancie Reymond; Federica Calevro; José Viñuelas; Nicolas Morin; Yvan Rahbé; Gérard Febvay; Christian Laugier; Angela Douglas; Jean-Michel Fayard; Hubert Charles
Journal:  Appl Environ Microbiol       Date:  2006-10-13       Impact factor: 4.792

2.  Comparative analysis of nuclear tRNA genes of Nasonia vitripennis and other arthropods, and relationships to codon usage bias.

Authors:  S K Behura; M Stanke; C A Desjardins; J H Werren; D W Severson
Journal:  Insect Mol Biol       Date:  2010-02       Impact factor: 3.585

3.  Deep learning based on biologically interpretable genome representation predicts two types of human adaptation of SARS-CoV-2 variants.

Authors:  Jing Li; Ya-Nan Wu; Sen Zhang; Xiao-Ping Kang; Tao Jiang
Journal:  Brief Bioinform       Date:  2022-05-13       Impact factor: 13.994

4.  Translational selection is ubiquitous in prokaryotes.

Authors:  Fran Supek; Nives Skunca; Jelena Repar; Kristian Vlahovicek; Tomislav Smuc
Journal:  PLoS Genet       Date:  2010-06-24       Impact factor: 5.917

5.  Isolation of the Buchnera aphidicola flagellum basal body complexes from the Buchnera membrane.

Authors:  Matthew J Schepers; James N Yelland; Nancy A Moran; David W Taylor
Journal:  PLoS One       Date:  2021-05-10       Impact factor: 3.240

6.  Altered tRNA characteristics and 3' maturation in bacterial symbionts with reduced genomes.

Authors:  Allison K Hansen; Nancy A Moran
Journal:  Nucleic Acids Res       Date:  2012-06-11       Impact factor: 16.971

7.  Multimodal dynamic response of the Buchnera aphidicola pLeu plasmid to variations in leucine demand of its host, the pea aphid Acyrthosiphon pisum.

Authors:  José Viñuelas; Gérard Febvay; Gabrielle Duport; Stefano Colella; Jean-Michel Fayard; Hubert Charles; Yvan Rahbé; Federica Calevro
Journal:  Mol Microbiol       Date:  2011-07-28       Impact factor: 3.501

8.  Conservation of the links between gene transcription and chromosomal organization in the highly reduced genome of Buchnera aphidicola.

Authors:  José Viñuelas; Federica Calevro; Didier Remond; Jacques Bernillon; Yvan Rahbé; Gérard Febvay; Jean-Michel Fayard; Hubert Charles
Journal:  BMC Genomics       Date:  2007-06-04       Impact factor: 3.969

9.  Generalized substitution of isoencoding codons shortens the duration of papillomavirus L1 protein expression in transiently gene-transfected keratinocytes due to cell differentiation.

Authors:  Wenyi Gu; Jianmin Ding; Xiao Wang; Rachel L de Kluyver; Nicholas A Saunders; Ian H Frazer; Kong-Nan Zhao
Journal:  Nucleic Acids Res       Date:  2007-07-09       Impact factor: 16.971

10.  The Rare Codon AGA Is Involved in Regulation of Pyoluteorin Biosynthesis in Pseudomonas protegens Pf-5.

Authors:  Qing Yan; Benjamin Philmus; Cedar Hesse; Max Kohen; Jeff H Chang; Joyce E Loper
Journal:  Front Microbiol       Date:  2016-04-19       Impact factor: 5.640

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.