Literature DB >> 22014033

Mapping codon usage of the translation initiation region in porcine reproductive and respiratory syndrome virus genome.

Jun-hong Su1, Xiao-xia Ma, Ya-li He, Ji-dong Li, Xu-sheng Ma, Yong-xi Dou, Xue-nong Luo, Xue-peng Cai.   

Abstract

BACKGROUND: Porcine reproductive and respitatory syndrome virus (PRRSV) is a recently emerged pathogen and severely affects swine populations worldwide. The replication of PRRSV is tightly controlled by viral gene expression and the codon usage of translation initiation region within each gene could potentially regulate the translation rate. Therefore, a better understanding of the codon usage pattern of the initiation translation region would shed light on the regulation of PRRSV gene expression.
RESULTS: In this study, the codon usage in the translation initiation region and in the whole coding sequence was compared in PRRSV ORF1a and ORFs2-7. To investigate the potential role of codon usage in affecting the translation initiation rate, we established a codon usage model for PRRSV translation initiation region. We observed that some non-preferential codons are preferentially used in the translation initiation region in particular ORFs. Although some positions vary with codons, they intend to use codons with negative CUB. Furthermore, our model of codon usage showed that the conserved pattern of CUB is not directly consensus with the conserved sequence, but shaped under the translation selection.
CONCLUSIONS: The non-variation pattern with negative CUB in the PRRSV translation initiation region scanned by ribosomes is considered the rate-limiting step in the translation process.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 22014033      PMCID: PMC3219751          DOI: 10.1186/1743-422X-8-476

Source DB:  PubMed          Journal:  Virol J        ISSN: 1743-422X            Impact factor:   4.099


Introduction

Porcine reproductive and respiratory syndrome virus (PRRSV) infection causes serious disease in swine populations with a series of clinical consequences, such as high mortality, reproductive failure, post-weaning pneumonia and growth reduction [1,2]. Based on its serological characteristics, PRRSV has two main serotypes, which named the Northern American isolate (US) and the European isolate (EU), respectively [3-7]. PRRSV is an enveloped, single-stranded positive-sense RNA virus with a genome size of about 15.4kb and classified into the order Nidovirales of family Arteriviridae [8,9]. The PRRSV genome contains ORF1a, encoding papain-like cysteine protease, ORF1b, encoding RNA dependent RNA polymerase, ORF2-6, encoding envelop proteins, and ORF7, encoding the nucleocapsid protein [10-13]. Despite a well-organization of the ORFs within the single RNA genome, viral proteins are in fact encoded from subgenomic RNAs that are likely generated through a discontinuous transcription mechanism [12,14]. Therefore, each subgenomic RNA could be translated at different translation rates that are regulated by codon usage bias (CUB). Because the faster a polypeptide chain is completed, the more rapid the ribosomes return to initiate and complete another polypeptide chain. The relationship between the efficiency of translation initiation and the level of gene expression has been well-established in many species [15-19]. Moreover, when the distance between the initiation codon and the non-preferential site is less than 50-60 positions (codons), the ribosomes can be blocked at the non-preferential positions to shape a queue of ribosomes [20]. It is generally considered that the alternative synonymous codons are not used with equal frequencies among organisms, and the codon usage pattern plays a role in genes expressed at higher levels [21-30]. Jacques and Dreyfus proposed that the translation initiation site is a rate-limiting factor for gene expression [31]. Nevertheless, a regulatory relationship, which is thought to be mediated by preferential codons, between CUB and translation efficiency for individual genes is challengeable [32,33]. This suggested that a heterogonous gene is not necessarily expressed at a low level simply because its codons are infrequently translated by the host cell. There is a codon bias with respect to intragenic codon bias in the initial sequences of genes for which major proteins are strikingly different from their downstream codon bias. It is found that the translational initiation region plays an important role in regulating the translational efficiency and the pattern of synonymous codon usage varies in different regions along a coding sequence [34,35]. This indicated that the alternative synonymous codon usage might be related with gene function, protein structure and translation efficiency. In this study, we focus on the pattern of CUB in the translation initiation region of PRRSV as well as the characteristics of the synonymous codon usage at each position in the target region, since the interest in the pattern of CUB has been aroused by its potential relevance to the translational efficiency of PRRSV subgenomic RNAs. And the frequency of non-preferential codons usage in the target region is investigated in order to evaluate the role of translation selection on the formation of negative CUB pattern.

2. Materials and methods

2.1. Sequences data and the synonymous codon usage value

The 13 complete RNA sequences of PRRSV were downloaded from the National Center for Biotechnology Information (NCBI) http://www.ncbi.nlm.nih.gov/Genbank/ and the synonymous codon usage values (SCUV) for this virus were reported previously [30]. Multiple alignment analyses were performed with the Clustal W (1.7) method of DNAStar software (7.0) for windows. The translation initiation regions (the 1st to the 50th residue) of ORF1a, ORF2, ORF3, ORF4, ORF5, ORF6 and ORF7 were used as targets for alignment analysis respectively.

2.2. The calculation of codon usage bias

To calculate CUB, it is supposed that statistically equal and random usage of all available synonymous codons was the "neutral point" (RSCU0 = 1.00) for the development of serotype-specific codon usage [19]. CUB: More simply, CUB is the average value of difference between RSCUand RSCUat each position of the target region. n represents all codons appearing in this position. When all RSCU values according to a particular position in the target region are RSCUis equal to zero. It means that there are few preferential or non-preferential codons existing at this position. In contrast, when CUB value is much more deviation than RSCU, codons with CUB are preferentially chosen at a particular position.

2.3. Analysis of codon usage characteristic of the translation initiation region

We analyzed the codon usage characteristics of the translation initiation region depending on R values, where the R value, computed as the ration , represents the relative abundance for a particular codon in the translation initiation region. ni represents the total number of a particular codon within the 1st to ith amino acids, Nrepresents the total number of corresponding amino acid in the 1st to ith amino acid ones, n is the total number of a certain codon within the whole coding sequence, and N is the total number of corresponding amino acids within the whole coding sequence. When R value is equal to 1.00, it means that the frequency of this codon in the target region is equal to the frequency of this codon in the whole coding sequence; when R value is lower than zero, it implies that the frequency of this codon in the target region is lower than that of the whole coding sequence; when R value is higher than zero, it suggests that the frequency of this codon is higher than that of the whole coding sequence.

2.4. Aanalysis of characteristics of positions with negative CUB in the target regions

To substantiate the characteristics of codon usage for positions with negative CUB in the target regions, we analyzed the target positions depending on the data, (i) the variations of codons and amino acids, (ii) R values for codons of the target positions.

3. Results

3.1. Multiple alignment analysis

The consensus amino acid sequence is based on the comparison of the strains in previous study [30]. The positions of amino acid conservation are listed in Table 1. The conservation of amino acid usage in translation region was analyzed. For ORF1a, 94% of amino acids in the target region of US serotype were invariant; 70% in the target region of EU serotype were conserved. For ORF2, 78% of amino acids were invariant in US serotype; 60% were invariant in EU serotype. Non-conserved amino acids scattered into the target regions of both US and EU serotypes. For ORF3, 74% of amino acids were invariant in US serotype; 60% were invariant in EU serotype, the most conserved amino acids tended to exist in the C' termination of the target regions of both US and EU serotypes. For ORF4, 76% of amino acids were invariant in US serotype; 72% were invariant in EU serotype. Non-conserved amino acids scattered in the flank of the target regions of both US and EU serotypes. For ORF5, 72% of amino acids were invariant in US serotype; 66% were invariant in EU serotype. Non-conserved amino acids scattered into the target regions of both US and EU serotypes. For ORF6, 96% of amino acids were invariant in US serotype; 82% were invariant in EU serotype, and non-conserved amino acids had a tendency to exist in the N' termination. For ORF7, 90% of amino acids were invariant in US serotype; 76% were invariant in EU serotype, and conserved amino acids scattered into the target region compared with that of US serotype. The various extents of the conserved amino acids encoded by ORFs of PRRSV suggested that these residues played an important role in virus biology.
Table 1

The positions of invariant amino acids in the translation initiation region

ORFSerotypeThe position of amino acid conservation in the translation initiation region
ORF1aUSThe 2nd to 17th, 19th to 34th, 36th to 41st, 43rd to 50th

EUThe 3rd, 6th to 13th, 15th to 18th, 20th to 23rd, 25th to 28th, 30th to 32nd, 34th, 35th, 39th to 41st, 44th, 46th, 48th to 50th

ORF2USThe 2nd to 4th, 6th, 8th, 11th to 13th, 15th to 22nd, 25th to 31st, 33rd to 41st, 43rd to 44th, 46th to 49th

EUThe 2nd to 4th, 7th, 12th to 13th, 15th, 18th, 20th, 22nd, 24th to 27th, 32nd to 37th, 40th, 41st, 43rd to 49th

ORF3USThe 4th, 5th, 7th, 9th to 12th, 14th, 16th to 19th, 21st, 22nd, 24th to 26th, 29th, 31st, 33rd to 47th, 49th, 50th

EUThe 2nd, 4th, 15th, 18th, 20th, 24th to 26th, 28th, 31st to 50th

ORF4USThe 2nd, 6th to 8th, 10th to 12th, 14th, 17th to 31st, 33rd, 34th, 36th to 41st, 44th,46th to 50th

EUThe 3rd, 4th, 6th, 7th, 9th, 12th, 13th, 17th to 32nd, 34th, 36th to 39th, 41st, 42nd, 44th, 46th to 48th, 50th

ORF5USThe 2nd, 6th to 8th, 10th, 12th, 14th, 15th, 18th to 23th, 26th to 28th, 30th to 34th, 36th, 39th to 46th, 48th to 50th

EUThe 3rd, 4th, 6th, 7th, 14th to 16th, 18th, 19th, 21st, 24th, 26th to 28th, 30th to 34th, 36th, 39th to 46th, 48th to 50th

ORF6USThe 2nd to 9th, 11th to 15th, 17th to 50th

EUThe 2nd, 4th, 5th, 7th, 8th, 15th to 22nd, 24th to 50th

ORF7USThe 2nd to 10th, 12th to 14th, 16th to 45th, 47th, 50th

EUThe 2nd, 3rd, 5th, 6th, 8th to 10th, 12th, 15th to 21st, 23rd to 28th, 30th, 31st, 33rd, 35th to 38th, 42nd to 50th
The positions of invariant amino acids in the translation initiation region

3.2. Characteristics of codon usage bias in the target regions

The bars of all positions in the translation initiation region represented the CUB degree (Figure 1). Although different invariant degrees of the amino acids exist in the target regions between US and EU serotypes, the similar patterns of codon usage are present in the target regions of both US and EU serotypes (Table 2). For ORF1a, 58% of positions possess the similar pattern of codon usage in the target regions of both serotypes. Although the two target regions corresponding to both the US and EU serotypes have a significant difference to the conservation in obvious amino acids, a large size of the similar patterns of codon usage exist in the target region and the most positions possessed the positive codon usage bais (Figure 1A). For ORF2, 34% of positions have the similar pattern of codon usage, and the positions in the N-terminal fragment had a tendency to choose low codon bias. It was also observed that the number of the positions with the negative codon usage bias for US serotype was more than that of EU serotype (Figure 1B). For ORF3, 62% of positions have the similar pattern of codon usage (Figure 1C). For ORF4, 72% positions contain the similar pattern of codon usage (Figure 1D). For ORF5, 40% of positions have the similar pattern of codon usage, and these positions with the similar pattern of codon usage do not appear to exist near the N' termination (Figure 1E). For ORF6, 26% of positions which contain the similar pattern of codon usage do not exist near the N' termination (Figure 1F). For ORF7, 44% of positions have the similar pattern of codon usage, and the most positions with low codon usage bias tend to exist near the N-terminal fragment (Figure 1G).
Figure 1

The CUB degree of translation initiation region in PRRSV ORFs, the white bar represents US serotype while the gray represents EU. A, ORF1a; B, ORF2; C, ORF3; D, ORF4; E, ORF5; F, ORF6; G, ORF7.

Table 2

The similar pattern of codon usage in the target regions in both US and EU serotypes

ORFsThe positions corresponding to similar codon usage pattern in the target region
ORF1athe 2nd to 4th, 6th to 8th, 10th, 11th, 14th, 15th, 17th, 20th to 24th, 26th, 29th to 36th, 39th, 44th, 45th, 48th
ORF2the 3rd to 5th, 14th, 17th, 21st, 23rd, 24th, 28th, 31st, 34th, 36th, 37th, 41st, 44th, 46th, 49th, 50th
ORF3the 2nd, 6th, 9th to 15th, 18th to 20th, 23rd to 26th, 28th to 31st, 33rd to 36th, 38th, 39th, 41st, 44th, 45th, 47th to 49th
ORF4the 2nd to 10th, 12th, 13th, 15th to 18th, 20th, 22nd to 25th, 27th to 29th, 33rd, 34th, 36th, 37th, 39th, 40th, 43rd, 45th, 46th, 47th to 50th
ORF5the 7th, 11th, 13th, 15th to 17th, 20th, 21st, 25th to 31st, 33rd, 37th, 39th, 41st, 50th
ORF6the 11th, 17th, 18th, 20th, 23rd, 25th, 33rd, 38th, 41st, 43rd, 44th, 49th
ORF7the 2nd, 5th, 6th, 8th, 9th, 15th, 18th, 21st to 23rd, 26th to 31st, 36th, 38th to 40th, 46th
The CUB degree of translation initiation region in PRRSV ORFs, the white bar represents US serotype while the gray represents EU. A, ORF1a; B, ORF2; C, ORF3; D, ORF4; E, ORF5; F, ORF6; G, ORF7. The similar pattern of codon usage in the target regions in both US and EU serotypes The various extents of the conserved pattern of codon usage for their positions in PRRSV ORFs suggest that CUB associated with these positions might modulate the corresponding gene expression.

3.3. The rate of codon usage frequency in the translation initiation region to that of the whole coding sequence

The R value for each codon was calculated and listed in Table 3. A higher R value indicated more preferential usage in the translation initiation site than that of the whole coding sequence. CUBvalue for each codon was listed in Table 4. Depending on the data from Table 3, 4 and comparison with the whole coding sequence of PRRSV, for ORF1a, the codons with negative CUB, namely GCA (Ala), GCG (Ala), CAA (Gln), AGU (Ser), ACA (Thr) and ACG (Thr), were more preferentially chosen in the target region for both serotypes; for ORF2, the codons, namely UGU (Cys), AUA (Ile), AAA (Lys), CCG (Pro), AGU (Ser) and UCG (Ser), were more preferentially used; for ORF3, the codons, namely UGU (Cys), AGC (Ser) and ACG (Thr), were more preferentially chosen; for ORF4, the codons, namely GAC (Asp), UUC (Phe), AGU (Ser) and UCG (Ser), were more preferentially chosen; for ORF5, the codons, namely UGU (Cys), CCG (Pro), UCG (Ser) and ACG (Thr), were more preferentially chosen; for ORF6, the codons, namely CAA (Gln), AUA (Ile) and CUA (Leu), were more preferentially used; for ORF7, the codons, namely GGA (Gly) and AAA (Lys), were more preferentially chosen. Due to these non-preferential codons, ribosomes might be stalled by them to regulate the efficiency of gene translation.
Table 3

Preferentially used codons in the target region in US and EU serotypes of PRRSV

ORF1aORF2ORF3

CodonUSEUUSEUUSEU
aGCAb1.69b1.4300.910.150.64
GCC1.000.981.422.1600.91
aGCGb2.25b1.3400b2.170
GCU00.761.030.832.941.88
aAGA000.5b1.3700
AGG1.220003.720
aCGAb2.4200000
CGC000002.05
CGG3.075.633.791.1400
aCGU000b1.070.47b2.90
AAC0.430.901.8600.121.01
aAAUb1.720.680.400b1.351.00
aGAC0b1.560000
GAU2.310.3701.260.380.25
UGC1.081.621.2500.360.18
aUGU0.920.50b1.01b1.51b1.44b1.43
aCAAb1.89b1.080.500.8600
CAG0.170.9201.2102.15
aGAA0.550.2800b1.500
GAG1.301.3801.140.271.30
aGGA00.390b2.4600
GGC1.401.633.1301.513.82
GGG2.001.1600.2900
GGU0.150.481.401.252.580.42
aCAC000b1.5800.72
CAU0000.211.411.32
aAUAb8.260b2.55b3.1900
AUC000002.58
AUU00.59004.310
aCUAb2.320.240.3200b4.42
CUC1.901.87001.840.77
CUG0.92000.651.390.43
CUU1.011.340.660.2401.02
aUUA0.280.220.64b3.5400
UUG00.801.992.040.741.29
aAAA00b1.27b2.6400
AAG000000
aUUCb1.010.480.17b1.090.990.95
UUU1.011.461.140.781.021.07
CCA00.981.501.0404.51
CCC2.270.11000.390
aCCG0b2.43b1.36b1.07b3.790
CCU1.010.8103.2100.24
aAGC0.42b1.5600.43b2.02b1.26
aAGUb6.36b8.28b4.94b4.130b1.87
UCA1.891.680.280.4101.17
UCC0.210.980.090.600.830.59
aUCG00b1.58b1.700b1.30
UCU2.791.331.160.632.211.22
aACAb1.97b1.48b6.170b1.240.70
ACC0.890.580000.70
aACGb2.22b1.490b1.83b1.74b2.11
aACU0b1.06000.960.82
UAC1.33001.361.322.03
aUAU0.370.30b1.770.5300.11
aGUA0b2.670b4.7900
GUC0.771.773.780.330.110.33
GUG1.710.3102.041.880
GUU0.760.532.9601.084.14

ORF4ORF5ORF6ORF7

CodonUSEUUSEUUSEUUSEU

aGCAb1.250.16000b1.600b1.09
GCC0.550.961.913.080.651.691.221.75
aGCGb1.27b1.090.850.57b1.60000
GCU0.981.201.3502.240.211.500.74
aAGA000b3.6900b2.780.71
AGG0000.15000.172.19
aCGA00b7.830b4.580.9900.33
CGC00002.295.7900.75
CGG000000.2300.29
aCGU000b1.800000
AAC3.671.361.341.4200.311.091.42
aAAU00.5600.200.290.860.890.83
aGACb1.38b1.390.580.18b1.330.640.500
GAU0.500.6902.650.670.860.750
UGC0.640.670.820.990.441.381.921.00
aUGUb1.410.99b1.13b1.02b1.330.2400.29
aCAA0.42b1.140.251.00b1.50b1.380.640.78
CAG1.460.95200.3301.131.07
aGAA0.500000.170.4500
GAG0.192.141.490.290000
aGGA000000.84b1.31b1.14
GGC0.6301.631.980.790.771.050.89
GGG001.030.541.871.970.630.85
GGU4.003.330.17000.561.351.11
aCAC000b2.900.78b2.0500
CAU01.481.9401.46000
aAUA000b2.58b1.12b1.860b3.29
AUC1.261.241.070.121.190.901.460
AUU0.731.201.340.370.800.0800
aCUA0b2.26b1.370b1.66b1.3500
CUC1.521.150.6000.190.7400
CUG01.140.700.991.191.481.461.78
CUU1.191.810.620.771.270.9400
aUUA00.230.5500.770.1600
UUG1.500.191.692.160.730.840.221.05
aAAAb1.210.20b1.341.0000.31b1.08b1.21
AAG0.691.820.230.382.431.320.980.86
aUUCb1.28b1.18b1.880.991.000.4200
UUU0.760.120.840.980.871.4900
CCA3.83000.763.0002.251.85
CCC0.401.670000.7900.59
aCCG00.43b4.25b1.2900b2.000.33
CCU00.240.17001.6500.50
aAGCb1.340b1.690.570.98b3.101.00b2.47
aAGUb3.67b2.5700b4.2800b2.51
UCA4.310.64002.20002.99
UCC0.7800.321.170.7901.351.26
aUCGb1.91b1.95b1.58b1.48b2.20000
UCU0.470.861.451.782.020.3100
aACA000b1.320.17b1.760b1.63
ACC1.781.410.980.670.830.5600
aACG0b1.69b6.79b2.13b2.35000
aACU0.780.2100.530.180.4300
UAC001.801.110.671.0500
aUAU000.480.90b1.500.5500
aGUA00000.980.2700
GUC0.921.040.340001.722.96
GUG0.520.102.7902.102.2900
GUU2.251.77000000

a presented the non-preferential codon.

b presented that the non-preferential codon was more preferentially chosen in the translation initiation region than that of the whole coding sequence.

Table 4

Synonymous codon usage bias for the whole coding sequence of PRRSV

aAACodonbCUBijAAaCodonbCUBij
AlaGCA-0.143LeuCUA-0.490
GCC0.309CUC0.164
GCG-0.350CUG0.377
GCU0.184CUU0.096
ArgAGA-0.107UUA-0.670
AGG0.092UUG0.522
CGA-0.258LysAAA-0.003
CGC0.411AAG0.003
CGG0.012PheUUC-0.083
CGU-0.149UUU0.083
AsnAAC0.040ProCCA0.047
AAU-0.04CCC0.059
AspGAC-0.045CCG-0.198
GAU0.045CCU0.092
CysUGC0.001SerAGC-0.128
UGU-0.001AGU-0.194
GlnCAA-0.018UCA0.075
CAG0.018UCC0.418
GluGAA-0.116UCG-0.317
GAG0.116UCU0.145
GlyGGA-0.441ThrACA-0.031
GGC0.369ACC0.439
GGG0.016ACG-0.347
GGU0.056ACU-0.061
HisCAC-0.153TyrUAC0.174
CAU0.153UAU-0.174
IleAUA-0.234ValGUA-0.635
AUC0.166GUC0.110
AUU0.068GUG0.406
GUU0.118

a mean amino acid

bThe CUBvalue was calculated following by the equation: CUB, and RSCUvalue came from the previous study [30].

Preferentially used codons in the target region in US and EU serotypes of PRRSV a presented the non-preferential codon. b presented that the non-preferential codon was more preferentially chosen in the translation initiation region than that of the whole coding sequence. Synonymous codon usage bias for the whole coding sequence of PRRSV a mean amino acid bThe CUBvalue was calculated following by the equation: CUB, and RSCUvalue came from the previous study [30].

3.4. The characteristics of codon usage for the target positions

The positions with negative CUB do not always use the codons with negative CUB, and the R value for the codons with negative CUB vary compared with R = 1.00. However, some target positions contain the codons with negative CUB and R values > 1.00, suggesting that some new characteristics might influence the translation efficiency of the corresponding coding sequence. In translation initiation region of ORF1a, the non-preferential codons (R value > 1.00) are preferentially used in the 4th (US and EU serotypes), 9th (US), 12th (EU), 19th (US), the 22nd (US and EU), 27th (US), 31st (US and EU) and 40th (US), while some non-preferential codons, which have R value < 1.00 or R value > 1.00, exist in the 16th (EU) and 30th (US and EU) positions. For ORF2, the non-preferential codons are more preferentially used in the 7th (EU), 8th (EU), 9th (EU), 11th (US), 20th (EU), 27th (EU), 30th (US), 33rd (US), 40th (EU), 43rd (EU), 44th (US) and 48th (EU) positions, while some non-preferential codons with R value > 1.00 or R value < 1.00 exist in the 12th (US) position. For ORF3, non-preferential codons (R value > 1.00) exist in the 4th (US), 13th (US and EU), 17th (EU), 26th (US and EU), 31st (US and EU), 32nd (EU) and 37th (US) positions, while the non-preferential codons with R value > 1.00 or R value < 1.00 are used in the 5th (EU), 6th (US and EU), 7th (US), 11th (US and EU),16th (US) and 43rd (EU) positions. For ORF4, the non-preferential codons with R value > 1.00 are used in the 3rd (US and EU), 7th (US and EU), 20th (US and EU), 27th (US and EU), 28th (US and EU), 29th (US and EU), 38th (EU),40th (US and EU), 41st (US), 44th (EU) and 49th (US and EU), while some non-preferential nodons with R value > 1.00 or R value < 1.00 are used in the 31st (EU) position. For ORF5, the non-preferential nodons with R value > 1.00 are used in the 9th (EU), 12th (US), 14th (EU), 22nd (US) 23rd (US), 32nd (US), 36th (US), 39th (EU), 40th (EU), 44th (EU), 48th (EU) and 49th (EU), while non-preferential codon with R value > 1.00 or R value > 1.00 are used in the 8th (US), 24th (US), 46th (US) and 47th (US) positions. For ORF6, the non-preferential codons (R value > 1.00) are used in the 3rd (US), 4th (EU), 7th (US), 13th (US), 14th (EU), 15th (EU), 19th (EU), 21st (US), 22nd (EU), 24th (EU), 26th (EU), 27th (US), 30th (EU), 31st (EU), 32nd (US), 37th (US), 40th (US), 45th (EU) and 48th (EU) positions, while some non-preferential codon (R value < 1.00 or R value > 1.00) are used in the 2nd (EU), 5th (US and EU), 46th (US) and 50th (US) positions. For ORF7, the non-preferential codons (R value > 1.00) are chosen in the 11th (US), 32nd (US), 40th (US and EU), 41st (US), 43rd (EU), 44th (EU), 48th (US) and 50th (US) positions, while some non-preferential codon (R value > 1.00 or R value < 1.00) are used in the 3rd (US), 24th (EU), 25th (EU) and 35th (US) positions. The rest positions with negative CUB do not arise from the existence of non-preferential codons but contain some preferential codons (CUB > 0), implying that these positions do not affect the efficiency of gene translation. The degeneracy of the genetic code enables the same amino acid sequences to be encoded and translated in different ways. However, the synonymous codon usage is not purely random.

4. Discussion

RNA virus possesses high mutation rates and therefore virus populations exist as dynamic and complex mutant distributions [36-41]. However, the redundant intensity of mutation has deleterious effects on the viral fitness. Thus, the robustness of viral sequences can perform a reduced sensitivity to perturbations affecting phenotypic expression. The balance between the high mutations and the robustness produce a dynamic population pool, termed as 'quasispecis' [36,42]. As to comparative genomics, it is generally accepted that sequences with a crucial function are conserved among different but related organisms [43-45]. In addition, Akashi found that the frequency of preferential codons is significantly higher at the conserved amino acid positions than that at the non-conserved amino acid positions among different Drosophila species, suggesting that translation selection favors the conserved pattern of synonymous codon usage to enhance the accuracy of gene expression [46]. A lot of experimental data have shown that rates of chain elongation during translation of proteins are not uniform [47]. Non-uniform character of distribution of codons with different usage frequencies along mRNA is assumed to be a main factor to modulate the translation rate. Extensive studies have been carried out previously on the determination of the translation rates and the overall level of gene expression for certain individual codons [48-52]. From this research, we observed that the conserved pattern of codon usage did not simply follow the corresponding positions in the conserved sequence fragment, suggesting that the conservation of codon usage within a gene sequence have an important function in modulating its translational rate. The positions with the conserved positive CUB enhance the accuracy and efficiency of their gene translation. It has been observed that preferential codons can reduce the frequency of amino acid misincorporations, resulting in an approximately 10-fold increase of protein products over non-preferential codons for the same amino acid [53]. However, the positions with negative CUB in the translation initiation region of each PRRSV subgenomic RNA are not ignored. Because these positions are likely to regulate the translation initiation rate to generate the target product with high activity. Lithwich and Margalit reported that CUB is most highly associated with protein expression and is most conserved [26]. Once a significant number of gene sequences have been obtained, it will be taken into consideration that biased codon usage can regulate the expression levels of individual genes by modulating the rates of polypeptide elongation [21,54-58]. Komar pointed out that although preferential codons enable the corresponding gene to be translated efficiently, the non-preferential codons replaced by the corresponding preferential codons can regulate the gene expression to perform the precise protein folding [59]. Lavner and Kotlar indicated that translation selection may shape codon bias pattern, not only to increase translation efficiency by favoring preferential codons in highly expressed genes, but also to decrease translation rate by favoring non-optimal codons in lowly expressed ones [60]. A relationship between the translation efficiency and CUB have been reported that it can lead to link between the protein folding by modulating the translational rate and the synonymous codon usage bias [47,61-65]. The nucleotide sequences around the N-terminal region of the protein appear to be particularly sensitive to the presence of rare codons [66,67]. Our data showed that some positions in the translation initiation regions of ORFs tended to preferentially choose non-preferential codons which were more preferentially used in these regions than the whole coding sequences. This phenomenon suggested that the determinant of the invariant pattern of codon usage is not only correlated with the conserved sequence, but also dependent of the translation selection. As codon usage pattern comprised of preferential and non-preferential codons contributes to different translation rates, it is possible to change the local translation rates of a gene by suitable selection of its synonymous codons. A gene sequence with non-preferential codons intends to encode turns, loops and domain linkers within its protein structure through the limited step to the translation rate [47,63,64,68]. Taken together, under the translation selection, the conserved non-preferential codons in the translation initiation regions of PRRSV may affect the translation efficiency so as to maintain the normal biological functions of their target products. Komar and Jaenicke indicated that the non-preferential coodns play an important role in maintaining the normal function or activity of CAT product [68]. It shows the importance of non-preferential codons to the formation of the target products. As non-preferential codons or even one aggregating near the translation initiation codon can decrease translation rate arising from the limitation of availability of tRNAs depending on the host cell [69], the view that non-preferential codons probably have a negative effect on gene expression can be explained by the 'minor codon modulator hypothesis' [70]. When the tRNA concentration of minor codons becomes extremely limited, ribosomes of the host cell block at the minor codons to inhibite the ribosome from entering into the initiation site effectively, thereby resulting in a decrease in the translation rate. Moreover, the non-preferential codons locating at the translation initiation region modulate the number of ribosomes that are sequestered by an mRNA if the rates of elongation at these codons were so sufficiently slow that stalled ribosomes could block access to the initiation signals [19,71]. In summary, the conserved non-preferential codons in the translation initiation region have a high relationship with the regulation of gene expression. And the conserved codons with negative CUB are preferentially used in the initial region, which may be explained by the minor codon modulator hypothesis and the translation selection. These codons within this critical region might play a negative role in regulation of gene expression.

List of abbreviations

PRRSV: Porcine reproductive and respitatory syndrome virus; SCUV: synonymous codon usage values; CUB: codon usage bias; US: Northern American isolate; EU: European isolate.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

JHS and XXM carried out the molecular genetic studies, participated in the sequence alignment and drafted the manuscript. YLH, JDL and XSM participated in the sequence alignment. YXD and XNL participated in the design of the study and performed the statistical analysis. XPC conceived of the study, and participated in its design and coordination and helped to draft the manuscript. All authors read and approved the final manuscript.
  69 in total

1.  The positive relationship between codon usage bias and translation initiation AUG context in Saccharomyces cerevisiae.

Authors:  H Miyasaka
Journal:  Yeast       Date:  1999-06-15       Impact factor: 3.239

Review 2.  Forces that influence the evolution of codon bias.

Authors:  Paul M Sharp; Laura R Emery; Kai Zeng
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2010-04-27       Impact factor: 6.237

3.  Codon bias as a factor in regulating expression via translation rate in the human genome.

Authors:  Yizhar Lavner; Daniel Kotlar
Journal:  Gene       Date:  2004-12-24       Impact factor: 3.688

4.  Characteristics of codon usage bias in two regions downstream of the initiation codons of foot-and-mouth disease virus.

Authors:  Jian-hua Zhou; Jie Zhang; Yao-zhong Ding; Hao-tai Chen; Li-na Ma; Yong-sheng Liu
Journal:  Biosystems       Date:  2010-04-14       Impact factor: 1.973

5.  Codon usage in bacteria: correlation with gene expressivity.

Authors:  M Gouy; C Gautier
Journal:  Nucleic Acids Res       Date:  1982-11-25       Impact factor: 16.971

6.  Comparative genomics of foot-and-mouth disease virus.

Authors:  C Carrillo; E R Tulman; G Delhon; Z Lu; A Carreno; A Vagnozzi; G F Kutish; D L Rock
Journal:  J Virol       Date:  2005-05       Impact factor: 5.103

7.  RNA viruses as complex adaptive systems.

Authors:  Santiago F Elena; Rafael Sanjuán
Journal:  Biosystems       Date:  2005-02-23       Impact factor: 1.973

8.  The characteristics of the synonymous codon usage in enterovirus 71 virus and the effects of host on the virus in codon usage pattern.

Authors:  Yong-sheng Liu; Jian-hua Zhou; Hao-tai Chen; Li-na Ma; Zygmunt Pejsak; Yao-zhong Ding; Jie Zhang
Journal:  Infect Genet Evol       Date:  2011-03-05       Impact factor: 3.342

9.  Analysis of synonymous codon usage in porcine reproductive and respiratory syndrome virus.

Authors:  Yong-sheng Liu; Jian-hua Zhou; Hao-tai Chen; Li-na Ma; Yao-zhong Ding; Meng Wang; Jie Zhang
Journal:  Infect Genet Evol       Date:  2010-05-10       Impact factor: 3.342

10.  Quasispecies theory in the context of population genetics.

Authors:  Claus O Wilke
Journal:  BMC Evol Biol       Date:  2005-08-17       Impact factor: 3.260

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.