Literature DB >> 17768673

Mutation pressure shapes codon usage in the GC-Rich genome of foot-and-mouth disease virus.

Jincheng Zhong1, Yanmin Li, Sheng Zhao, Shenggang Liu, Zhidong Zhang.   

Abstract

Foot-and-mouth disease (FMD) is economically the most important viral-induced livestock disease worldwide. In this study, we report the results of a survey of codon usage bias of FMD virus (FMDV) representing all seven serotypes (A, O, C, Asia 1, SAT 1, SAT 2, and SAT 3). Correspondence analysis, a commonly used multivariate statistical approach, was carried out to analyze synonymous codon usage bias. The analysis showed that the overall extent of codon usage bias in FMDV is low. Furthermore, the good correlation between the frequency of G + C at the synonymous third position of sense codons (GC3S) content at silent sites of each sequence and codon usage bias suggested that mutation pressure rather than natural (translational) selection is the most important determinant of the codon bias observed. In addition, other factors, such as the lengths of open reading frame (ORF) and the hydrophobicity of genes also influence the codon usage variation among the genomes of FMDV in a minor way. The result of phylogenetic analyses based on the relative synonymous codon usage (RSCU) values indicated a few obvious phylogenetic incongruities, which suggest that more FMDV genome diversity may exist in nature than is currently indicated. Our work might give some clues to the features of FMDV genome and some evolutionary information of this virus.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17768673      PMCID: PMC7089325          DOI: 10.1007/s11262-007-0159-z

Source DB:  PubMed          Journal:  Virus Genes        ISSN: 0920-8569            Impact factor:   2.332


Introduction

The phenomenon of synonymous codon usage bias exists in a wide range of paradigms from prokaryote to eukaryote. Due to different genomes having their own characteristic patterns of synonymous codon usage [1], it has not been easy to provide a satisfactory explanation for the particular pattern found in a given genome. Compositional constraints and translational selection are thought to be the main factors accounting for codon usage variation among genes in different organisms such as Escherichia coli, Bacillus subtilis, Saccharomyces cerevisiae, Dictyostelium discoideum [2], Drosophila melanogaster [3] and Caenorhabditis elegans [4]. However, in some prokaryotes with extremely high A + T or G + C contents [2] and human [5], mutation bias is the major factor accounting for the variation in codon usage. In contrast, it was reported that translational selection at silent sites played the most important role in shaping codon usage in Zea mays [6] and Arabidopsis thaliana [7]. Recently, codon usage was suggested to be related to gene function [8] and protein secondary structure [9]. Codon usage information has also been analyzed for viruses, such as human immunodeficiency virus (HIV) [10], nucleopolyhedroviruses [11], cauliflower mosaic virus (CMV) [12], human RNA viruses [13], H5N1 virus [14], and hepatitis A virus [15]. For example, HIV has a marked codon usage bias, due to its strong preference for the A nucleotide [10]. Rubella virus has a genomic G + C content of 0.70 [16]. Codon usage in Epstein-Barr virus (a DNA virus) may have an influence on regulation of latent versus productive infection [17]. In contrast, in nucleopolyhedroviruses codon usage appears to be simply a consequence of uneven base composition [11]. These published studies are mostly restricted to particular groups of viruses and have usually addressed phylogenetic questions [11, 18–20]. Moreover, mutational pressure rather than translational selection is the most important determinant of the codon bias in some RNA viruses [11, 13, 14, 21, 22]. However, a recent study showed that the G + C compositional constraint is the main factor that determines the codon usage bias in iridovirus genomes [23]. Clearly, studies of the synonymous codon usage in viruses can reveal information about the molecular evolution of individual genes and such information would be relevant in understanding the regulation of viral gene expression and also to vaccine design where the efficient expression of viral proteins may be required to generate immunity [24]. Foot-and-mouth disease (FMD) is a highly contagious disease of cattle and other cloven-hoofed animals. Apart from the influence on animal health and welfare, the economic impact of an outbreak of FMD can be of great importance for a country’s export trade. The 2001 European outbreak of FMD which mainly affected the UK is estimated to have cost of 6000 million Euros [25]. Effective vaccines and stringent control measures have enabled FMD eradication in most developed countries, which maintain unvaccinated, seronegative herds in compliance with strict international trade policies. However, Outbreaks with devastating economic consequences still occur in many developing regions of Asia, Africa and South America, posing a serious problem for commercial trade with FMD-free countries. FMD is caused by FMD virus (FMDV), a small, non-enveloped virus that contains a single stranded positive-sense RNA genome of about 8500 nucleotides from the Aphthovirus genus of the Picornaviridae family. The capsid is made of four proteins (VP1, VP2, VP3 and VP4) and there are seven serotypes (A, O, C, Asia I, SAT 1, SAT 2, and SAT 3) that can be further divided into many genotypes according to nucleotide difference in the capsid proteins [26]. Although genome sequences of FMDV have been published and some studies have been performed on them in recent years [27-30], little codon usage analysis is available. Such information might give some clues to the features of virus biology and some evolutionary information of this virus, and also it is of interest to understand the factors that shape codon usage in this species. In the present study, the codon usage bias was analyzed in these seven serotypes of FMDV genomes. The key evolutionary determinants of codon usage bias in these viruses were also investigated.

Materials and methods

FMDV genome sequences

A total of 40 FMDV genomes were used in this study (Table 1), including 23 Type O genomes, four type A, six type C, two type Asia I , one Type SAT1, three Type SAT2, and one Type SAT3. The complete sequences these FMDV genomes were obtained from EMBI (http://www.ebi.ac.uk/cgi/) and NCBI (http://www.ncbi.nlm.nih.gov/). Serial number (SN), length of each genome, EMBI or GenBank accession numbers are listed in Table 1. FMDV genomes SN 6, 9, 13 and 19 have been reported to be from cattle. FMDV genomes SN 14, 23 were from pigs and FMDV genomes SN 15,16 and 17 have been reported to pig-adapted isolates carrying a deletion in 3A (codons93–102). Alterations or deletion in this region are associated with the reduced ability of the virus to cause FMD in cattle. A program based on Perl has been developed to extract the annotated ORF sequences from each genome.
Table 1

Genomes examined, strain, length and accession numbersa

SNVirusStrainLength (bp)Accession No.
1FMDV-O1 Campos/Brazil/588168 AJ320488
2FMDV-OAkesu/588147 AF511039
3FMDV-OOMIII (artificially attenuated from strain Akesu/58)8083 AY359854
4FMDV-OCHA/1/99 (Tibet)8173 AF506822
5FMDV-OTibet/CHA/998183 AJ539138
6FMDV-OFRA/1/20018234 AJ633821
7FMDV-O1 Kaufbeuren/FRG/66 7804 X00871
8FMDV-OHKN/20028104 AY317098
9FMDV-OJPN/2000 (Miyazaki, Japan)7822 AB079061
10FMDV-OSAR/19/20008184 AJ539140
11FMDV-OSKR/20007813 AF377945
12FMDV-OSKR/20008182 AJ539139
13FMDV-OSKR/2000 (Chungju county; cattle7799 AY312587
14FMDV-O SKR/2002 (pigs)7803 AY312589
15FMDV-OTau-YuanTW97 (Taiwan, 1997)7739 AF154271
16FMDV-OChu-Pei (Taiwan) (pig strain)7733 AF026168
17FMDV-OYunlin/Taiwan/978134 AF308157
18FMDV-OTAW/2/99 (TC)8183 AJ539136
19FMDV-OTAW/2/99 (BOV)8183 AJ539137
20FMDV-ONY00 (China?)7731 AY333431
21FMDV-OHLJOC12/03(China)7767 DQ119643
22FMDV-Olz8104 DQ248888
23FMDV-OUKG/35/20018183 AJ539141
24FMDV-A10Argentina/61 (A61)7107 X00429
25FMDV-A12119/Kent/UK/32 (ab variant)7712 M10975
26FMDV-A22Azerbaijan/USSR/657820 X74812
27FMDV-C3 Argentina/858161 AJ007572
28FMDV-C1 Santa Pau/Spain/70 (C-S8c1)8115 AJ133357
29FMDV-C1 Santa Pau/Spain/70 (rp99)8115 AJ133358
30FMDV-C1 Santa Pau/Spain/70 (rp146)8115 AJ133359
31FMDV-C1 Santa Pau/Spain/70 (MARLS clone)8115 AF274010
32FMDV-Asia1YNBS/China/588163 AY390432
33FMDV-Asia1IND/63/728167 AY304994
34FMDV-SAT2Kenya/3/57 (KEN/3/57)7774 AJ251473
35FMDV-SAT2ZIM/7/838173 AF540910
36FMDV-SAT1SAT1–1bech8173NC_011451
37FMDV-SAT2SAT2–3kenya 11/608203NC_003992
38FMDV-SAT3SAT3–2sa57/598170NC_011452
39FMDV-AA10 Holland8161NC_011450
40FMDV-CFrom cell culture8115NC_002554

aSequence numbers 1 to 35 are from EMBI; Sequence numbers 36 to 40 are from NCBI

Genomes examined, strain, length and accession numbersa aSequence numbers 1 to 35 are from EMBI; Sequence numbers 36 to 40 are from NCBI

Synonymous codon usage measures

In order to examine synonymous codon usage without the confounding influence of amino acid composition of different ORF samples, relative synonymous codon usage values (RSCU) of different codons in each ORF sample was calculated as described previously [31]. Additionally, the ‘Effective Number of Codons’ (ENC) was used to quantify the codon usage bias of a ORF [32], which is the best overall estimator of absolute synonymous codon usage bias (Comeron and Aguade, 1998) [51]. The reported value of ENC is always between 20 (when only one codon is used for each amino acid) and 61 (when all codons are used equally) [32]. GC3S, the frequency of the nucleotide G + C at the synonymous third codon position (excluding Met, Trp and the termination codons), was also used to calculate the extent of base composition bias. Similarly, GC1S and GC2S are the frequencies of the nucleotide G + C at the synonymous first and second position, respectively. In oder to examine the relationship between codon usage variation and compositional constraints, the GC1S, GC2S and GC3S of each selected ORF have been calculated.

Correspondence analysis

Correspondence analysis (COA) was used to investigate the major trend in codon usage variation among ORFs. In order to minimize the effect of amino acid composition on codon usage, each ORF is represented as a 59-dimensional vector. Each dimension corresponds to the RSCU value of one sense codon (excluding Met, Trp and the stop codons). The axis of a correspondence analysis identifies the source of the variation among a set of multivariate data points. This method has been successfully used to investigate the variation of RSCU values among ORFs [33-38].

Statistical analysis

Correlation analysis was carried out using the Spearman’s rank correlation analysis method. Cluster analysis was done by Hierarchical cluster method and the distances between selected sequences were calculated by the Euclidean distance method.

Analysis tools

The RSCU, GC3s, ENC, G + C, GRAVY, Length value, and COA were calculated using the program CodonW version 1.4 (http://codonw.sourceforge.net). The correlation analysis and Cluster analysis were carried out by using the multianalysis software SPSS version 13.0 (http://spss.com).

Results

Synonymous codon usage in FMD

In order to better understand the synonymous codon usage variation among FMDV isolates, the average codon usage was analyzed for these 40 FMDV genomes. As shown in Table 2 (see http://222.210.17.171/yak/FMDV/ Table 2.doc for details)., the codons ending in G or C are favored and the global pattern of codon usage is very similar among all FMDV genome examined, which indicates that there have not been significant compositional changes since these species diverged from their last common ancestor.
Table 2

Synonymous codon usage in different Serotypes of FMDVa

FMDV Serotypes
OACSAT1SAT2SAT1Asia 1
AACodonNo.RSCUNo.RSCUNo.RSCUNo.RSCUNo.RSCUNo.RSCUNo.RSCU
PheUUU45.220.8251.250.9543.170.8441.000.7539.330.7129.000.5447.500.89
UUC65.171.1857.001.0560.331.1668.001.2572.671.2979.001.4659.001.11
LeuUUA2.390.073.000.090.330.011.000.032.670.092.000.061.500.05
UUG37.871.1442.751.3232.831.0334.001.0032.331.0231.000.9133.001.02
CUU37.041.1142.001.3042.671.3326.000.7628.670.9036.001.0645.001.39
CUC62.701.8858.751.8262.831.9774.002.1764.002.0064.001.8860.001.86
CUA5.650.176.250.198.830.284.000.126.330.208.000.247.500.23
CUG54.131.6341.251.2844.331.3866.001.9358.001.8163.001.8547.001.45
IleAUU36.961.0843.751.2339.831.1637.001.1029.000.8729.000.8542.001.24
AUC57.221.6855.501.5657.671.6861.001.8165.001.9568.002.0053.501.57
AUA8.170.247.750.225.670.163.000.096.000.185.000.156.500.19
ValGUU40.040.8942.250.9544.670.9844.000.9738.000.8446.000.9844.500.99
GUC49.651.1047.751.0749.331.0861.001.3558.671.2957.001.2246.001.02
GUA13.300.3014.000.3212.500.2711.000.2412.330.2711.000.2412.000.27
GUG76.701.7174.501.6776.831.6865.001.4473.331.6173.001.5678.501.74
SerUCU13.480.6615.750.7319.670.9518.000.8316.670.7615.000.6918.000.86
UCC30.831.5235.501.6430.171.4631.001.4339.671.8034.001.5730.501.45
UCA21.261.0523.501.0921.831.0619.000.8817.670.8017.000.7822.501.07
UCG17.390.8619.500.9016.500.8025.001.1521.670.9823.001.0618.500.88
ProCCU35.041.0735.251.1132.331.0041.001.2737.001.1633.001.0336.001.12
CCC40.221.2237.751.1938.671.2040.001.2437.331.1744.001.3838.501.20
CCA25.870.7926.500.8429.500.9129.000.9034.671.0928.000.8827.000.84
CCG30.220.9227.500.8728.670.8919.000.5918.330.5823.000.7227.500.85
ThrACU39.000.9439.000.9341.170.9842.001.0139.000.9434.000.8441.000.97
ACC71.611.7465.501.5662.501.4964.001.5467.001.6255.001.3764.501.53
ACA29.700.7239.750.9437.170.8941.000.9933.670.8247.001.1736.000.86
ACG24.910.6024.250.5827.330.6519.000.4625.670.6225.000.6227.500.65
AlaGCU45.300.9143.250.9149.000.9644.000.9635.670.7741.000.8951.501.03
GCC71.391.4470.501.4874.171.4664.001.4070.671.5263.001.3670.501.41
GCA49.170.9952.251.1050.330.9954.001.1848.331.0454.001.1747.000.94
GCG33.000.6624.250.5130.170.5921.000.4631.670.6827.000.5831.000.62
TyrUAU9.390.2310.250.2513.670.3414.000.3212.330.2914.000.3411.500.29
UAC71.261.7773.751.7668.171.6773.001.6873.671.7168.001.6670.001.72
HisCAU5.430.186.750.217.500.2211.000.3411.000.3410.000.307.500.23
CAC56.261.8257.001.7960.831.7854.001.6654.001.6656.001.7058.001.77
GlnCAA28.740.7433.000.8335.500.9140.001.0532.330.8638.000.9731.500.82
CAG49.351.2646.751.1742.831.1036.000.9543.001.1440.001.0345.501.19
AsnAAU15.740.3113.250.2716.330.3320.000.3714.000.278.000.1615.500.32
AAC86.171.6986.751.7384.171.6787.001.6388.331.7392.001.8483.501.69
LysAAA61.170.8662.250.8659.830.8568.000.9262.670.8557.000.7960.000.86
AAG81.571.1482.751.1481.001.1580.001.0885.001.1587.001.2180.501.15
AspGAU30.830.4338.000.5234.170.4835.000.4939.670.5339.000.5335.500.50
GAC112.431.57107.751.48108.331.52107.001.51110.671.47109.001.47107.501.51
GluGAA43.480.6640.750.6345.170.7051.000.7642.670.6654.000.8143.500.67
GAG87.611.3488.251.3783.001.3084.001.2487.331.3580.001.1986.501.33
CysUGU14.700.8516.500.9915.170.9411.000.6517.670.9616.000.8213.000.81
UGC20.091.1517.001.0217.331.0723.001.3519.001.0423.001.1819.501.20
ArgCGU15.830.9116.250.9114.330.8122.001.3222.001.2817.000.9415.500.86
CGC26.261.5128.001.5630.831.7524.001.4428.001.6237.002.0432.001.78
CGA5.570.324.250.244.670.2610.000.606.330.379.000.504.500.25
CGG13.960.8013.250.7416.170.927.000.428.330.488.000.4415.500.86
SerAGU13.520.6713.750.6413.830.6710.000.4616.000.7317.000.7814.500.69
AGC25.431.2521.751.0121.671.0527.001.2520.670.9324.001.1122.001.05
ArgAGA27.911.6031.251.7525.171.4328.001.6823.671.3723.001.2726.001.44
AGG15.090.8614.500.8114.830.849.000.5415.330.8915.000.8315.000.83
GlyGGU43.221.1035.750.9134.830.8936.000.8939.000.9842.001.0839.000.99
GGC41.651.0649.251.2554.331.3950.001.2343.331.1043.001.1149.501.26
GGA39.171.0039.000.9934.170.8742.001.0442.331.0745.001.1636.500.93
GGG33.300.8534.250.8733.670.8634.000.8433.670.8525.000.6532.500.83

aNote: AA, amino acids; No., number of codons; RSCU, cumulative relative synonymous codon usage

Synonymous codon usage in different Serotypes of FMDVa aNote: AA, amino acids; No., number of codons; RSCU, cumulative relative synonymous codon usage

Compositional properties of coding sequences

In order to investigate if these 40 FMD viruses’ coding sequences examined display similar compositional features, mean ENC and GC3S, were calculated and summarized in Table 3. The values of ENC among these FMDV genomes examined are very similar, which vary from 50.02 to 52.79 with a mean value of 51.58 and S.D. of 0.67. All the ENC values of these strains are more than 50. The data suggest the homogeneity of synonymous codon usage among FMDV genomes examined. The concept is further supported by the GC3S values for each FMDV strain, which range from 61.00 to 68.00% with a mean of 63.80% and S.D. of 0.01. Therefore, taken together with published data of codon usage bias among some RNA viruses [5, 13, 39–41], we could conclude that codon usage bias in FMDV genomes is less biased, and there is no significant variation of synonymous codon usage among FMDV seven serotypes.
Table 3

The values of ENC, GC3S and ORF length for these 40 FMDV strains

SNENCGC3S ORF length (bp)SNENCGC3S ORF length (bp)
150.590.6469992150.330.6466999
252.790.62669992250.020.6816969
352.740.62969542351.370.6346999
451.30.63269992452.060.6257002
551.250.63369932551.070.6366999
651.360.63369992652.380.6077011
750.870.63869992752.070.6246987
850.590.67269692852.440.6286984
951.320.63469992952.440.6296984
1051.340.63369963052.50.6296984
1151.850.62869993152.370.636984
1251.80.63269963252.070.6216990
1351.760.63269993351.470.6466993
1451.50.63769993451.230.6347008
1551.170.67369693550.40.6537008
1651.090.67569633651.150.6237020
1751.460.67269693751.430.6397008
1851.230.63669993851.290.6357008
1951.310.63569993951.870.6276999
2051.490.63369964052.370.636984
The values of ENC, GC3S and ORF length for these 40 FMDV strains

Correspondence analysis on codon usage

In order to investigate the variation of RSCU values among ORFs, correspondence analysis (COA) was implemented on these 40 FMDV genomes examined as a single-dataset-based on the RSCU value of each strain’ ORFs. As mentioned, the axis of a correspondence analysis identifies the source of the variation among a set of multivariate data points. The four largest trends in codon usage among these ORFs were observed: the first axis accounts for 31.20% of all variation among genomes, whereas the next three axes accounts for 16.50%, 12.40%, and 8.10%, respectively.

Effect of mutational bias on codon usage

In order to investigate if the evolution of codon usage bias is controlled by mutation pressure or by natural selection, firstly, G + C content at the first and second codon positions (G + C12) was compared with that at synonymous third codon positions (G + C3S) (Fig. 1). A highly significant correlation was observed (r = 0.432, P < 0.05), indicating that patterns of base composition are most likely the result of mutation pressure, and not natural selection, since the effects are present at all codon positions. Second, for each strain, actual codon bias was plotted against both G + C3S and the expected ENC value if codon usage bias is solely due to biased base composition (i.e., G + C content). Result showed that the actual codon usage indices are close to the values expected from their G + C composition, although all are slightly lower (Fig. 2). Thirdly, we plotted the first and second axis values in COA and GC3S values of each strains (Fig. 3A,B). The patterns of codon usage in different strains also appear to be closely related to the GC content on the third codon position. Linear regression analysis has been implemented to each virus genome to find some correlation between synonymous codon usage and nucleotide compositions of the ORFs. We also found that axis 1 coordinates are correlated with GC3S and GC (r = 0.843, P < 0.01; r = 0.813, P < 0.01), while there are significant correlations between axis 2 value and GC3S(r = 0.355, P < 0.05). Taken together, these analyses indicate that most of the codon usage bias among these FMDV genomes is directly related to the nucleotide composition. Furthermore mutational bias is the major factor responsible for the variation of synonymous codon usage among ORFs in these virus genomes.
Fig. 1

Correlation between GC content at first and second codon positions (GC12) with that at synonymous third codon positions (GC3S). ORF, open reading frame

Fig. 2

Distribution of the codon usage index, ENC, and GC content at synonymous third codon positions (GC3S). The curve indicates the expected codon usage if GC compositional constraints alone account for codon usage bias. ENC: effective number of codons; ORF, open reading frame

Fig. 3

Correlation between the first (A), second axis (B) values in COA and GC3S values of each FMDV strains. COA: Correspondence analysis; GC3S: the frequency of the nucleotide G + C at the synonymous third codon position (excluding Met, Trp and the termination codons); ORF, open reading frame

Correlation between GC content at first and second codon positions (GC12) with that at synonymous third codon positions (GC3S). ORF, open reading frame Distribution of the codon usage index, ENC, and GC content at synonymous third codon positions (GC3S). The curve indicates the expected codon usage if GC compositional constraints alone account for codon usage bias. ENC: effective number of codons; ORF, open reading frame Correlation between the first (A), second axis (B) values in COA and GC3S values of each FMDV strains. COA: Correspondence analysis; GC3S: the frequency of the nucleotide G + C at the synonymous third codon position (excluding Met, Trp and the termination codons); ORF, open reading frame

Effect of other factors on codon usage

Generally, mutational bias and natural selection, such as, ORF length and the hydrophobicity of each protein are thought to be the factors accounting for the codon usage variation among ORFs in different organisms. However, whether there is any selection pressure that also contributes to the codon usage variation among these virus ORFs and which selection pressure determines the codon usage variation remained to be understood. Therefore, we performed a linear regression analysis on axis 1, axis 2 and axis 3 between the hydrophobicity of each protein and ORF length. It was found that axis 2 coordinates are also significantly correlated with the hydrophobicity of each protein (r = 0.659, P < 0.01), while axis 1 and axis 3 coordinates are also significantly correlated with the ORF length (r = −0.564, P < 0.01; r = 0.610, P < 0.01) respectively, indicating that the hydrophobicity of each protein and ORF length are also critical in affecting these viruses’ codon usage, although they were less important than that of the mutational bias.

Cluster

Based on the RSCU variation of these 40 FMDV strains examined, a cluster tree was generated by using a hierarchical cluster method. As shown in Fig. 4, these 40 FMDV strains examined were divided into three main lineages (I, II and III).
Fig. 4

Cluster tree based on the relative synonymous codon usage (RSCU) values of 40 FMDV strains examined. The cluster tree was generated by using Hierarchical cluster method

Cluster tree based on the relative synonymous codon usage (RSCU) values of 40 FMDV strains examined. The cluster tree was generated by using Hierarchical cluster method Lineage I mainly contained strains of A, O, C and Asia 1, and branched to give eight sublineages (I1—I8). Five C [27–30, 39] strains were clustered into sublineage I1 and another C [26] was clustered into sublineage I5 separately. Sublineage I2 was composed of the three strains A [23, 24, 38], but another A [25] was clustered into sublineage I8 separately. Two O [1, 6] strains were clustered into sublineage I3. Sublineage I4 was composed of the two O strains [2], while 14 A strains [3–5, 8–13, 17–20, 22] and one Asia 1 strain [31] were clustered into sublineage I6. In the end, other Asia 1strain [32] was clustered into sublineage I7 separately. Lineage II comprised five strains [33–36, 37] from SAT1, SAT2 and SAT3. Lineage III included five strains [7, 14–16, 21] from O.

Discussion

Codon usage bias in FMDV was investigated in the present study. Up to now, it is unclear how FMDV serotypes might affect codon choice in FMD viruses and we used RSCU [42], ENC [32], COA [43, 44] and GC3S, to measure the synonymous codon usage bias in order to minimize the effects of serotypes on codon bias, which have been successfully used to analyze the variation of codon usage among different viruses species [11, 13, 14, 22, 45, 46]. The analysis revealed that codon usage bias is low in most cases. As a case in point, the values of ENC vary from 50.02 to 52.79 (S.D. = 0.67) and the GC3S values range from 61.00 to 68.00% (S.D. of 0.02). The average ENC value of 51.53 among 40 strains can be compared to those seen in other organisms such as H5N1 virus , severe acute respiratory syndrome Coronavirus (SARSCoV) and Porcine adenovirus where mean values of 50.91, 48.99, and 38.97, respectively, have been reported [14, 22, 47]. In the case of human RNA viruses, the average ENC value also probably lies close to 45 since the distribution of values for [13] individual virus ranges uniformly from just under 38.5 to 58.3 [13]. One possible explanation about why FMDV had had a lower codon usage bias than other RNA viruses [13, 14, 22, 47] is that a low bias is advantageous to viruses that need to replicate efficiently in vertebrate cells, with potentially distinct codon preferences. A general mutational bias, which affects the whole genome would, certainly account for the majority of the codon usage variation. The genome base compositions affected the codon usage in Entamoeba histolytica genome [48]. Although the Chlamydomonas reinhardtii genome [49] had high GC contents, there was a little evidence that the genome composition shaped the codon usages in this genome. C. elegans showed a weak, but statistically significantly negative correlation between ‘G + C’ content and gene expression levels [50]. In human RNA viruses, H5N1 virus and SARS Coronavirus, mutation pressure rather than natural (translational) selection is the most important determinant of the codon bias. In this study, the general association between codon usage bias and base composition suggests that mutational pressure, rather than natural (translational) selection is supported by the highly significant correlation between GC12 and GC3S (r = 0.432, P < 0.05), and the result of ENC-plot (Fig. 2). The fact that GC content varies in a similar way at all codon positions is usually assumed to be the result of mutational bias. A general mutational bias, which affects the whole genome would certainly account for the majority of the codon usage variation. A similar pattern of codon usage has been reported amongst some RNA viruses [13]. Since mutation rates in RNA viruses are much higher than those in DNA viruses [40], it is understandable that mutation pressure is the determinant source of codon usage bias in the 40 FMDV strains included in this study. Therefore, mutational bias is the major factor responsible for the variation of synonymous codon usage among ORFs in these virus genomes. In Drosophila [51] genome, longer genes had lower codon usage bias. But, the longer genes had higher expression level and higher codon usage bias in S.penumoniaes genome [52]. In some virus, such as nucleopolyhedroviruses [11], H5N1 virus [14], SARS Coronavirus [22], adenoviruses [47], ORF length has no effect on the variations of synonymous codon usage. Those indicated that different genomes had different ORF lengths which accommodated their particular genome’s best requirements, and there were not universal rules about ORF length and codon usage in all genomes. In this study, the ORF length had played a critical role in affecting FMDV codon usage. The mechanisms that lead this is not clear, which is needed a more comprehensive analysis. It was reported that codon choices were influenced the hydropathy level of each protein in Chlamydia trachomatis, and Thermotoga maritime [53, 54]. In this study, codon usage is significantly positively correlated with the hydrophobicity of each FMDV. The link with hydropathy and codon usage may be caused by the fact that the expressed sequences are hydrophilic just because they accomplish their function in the aqueous media of the cell. Up to date, phylogenetic analyses have been performed largely on FMDV sequences from the 1D coding regions. These analyses have permitted the discrimination among serotypically related FMDV strains [55]. Another analysis of the complete FMDV genomes indicated phylogenetic incongruities between different genomic regions which were suggestive of interserotypic recombination [56]. In this study, phylogenetic analyses based on the RSCU values of the 40 FMDV strains examined were carried out using a hierarchical cluster method. The result indicated complex phylogenetic relationships also exist between different FMDV isolates as determined by Carrillo et al. 2005 [56]. For instance, Five O strains (HKN/2002, Tau-Yuan TW97 (Taiwan, 1997), Chu-Pei (Taiwan) (pig strain), Yunlin/Taiwan/97, and lz were clustered into Lineage III; One C strain (Argentina/85) was clustered into sublineageI5 and one Asia 1 strain (YNBS/China/58) was clustered into sublineageI6. Although they all belong to EURO-SA topotype [55], five C in sublineage I1 are sub-serotype C1 and one C in sublineage I5 is sub-serotype C3. The three A strains in Sublineage I2 was EURO-SA topotype and A strains sublineageI8 is Asia topotype, O stains in sublineageI3 is EURO-SA topotype, while 14 O strains in sublineage I6 belong to ME-SA topotype, PanAsia stains, were clustered into. O strains in Lineage III are Cathay topotype. All five SAT strains were clustered into Lineage II. Taken together, just as stated by other researchers [30, 56–58], these results suggest that FMDV sequences may undergo intertypic recombination, which conceivably undergo complex recombination events and the result of phylogenetic analyses based on the RSCU values fail to display serotype-specific phylogenetic relationships. These observations raise interesting questions about FMDV genome evolution in nature and the relative contribution of recombination to the generation of FMDV genetic and population diversity. As we know FMD is highly contagious, affects all cloven-hoofed animals, and is caused by FMDV that exists as antigenically diverse serotypes and intra-typical variants (subtypes); Some published results has shown that the overall extent of codon usage bias in RNA viruses is low and there is a little variation in bias between genes or genomes [13, 14, 21, 22]. Our analysis revealed that although there are a few, variations in codon usage bias among different FMD viruses, codon usage bias in FMDV is low. Due to lack of data and politic factors, in this article, it is impossible to obtain information on the virus isolation background, vaccination etc, but clearly, a more comprehensive analysis is needed to reveal more information about codon usage bias variation within and among FMD viruses and what other factors are responsible, including the influence of factors, such as cell tropism, principal host species, method of transmission, and viral genetic structure. Such information would then allow us to more precisely judge the relative importance of mutation pressure versus natural selection in determining base composition and codon usage in these pathogens. The recent European epizootic of FMD has made us aware of the great economic losses to be endured because no effective preventive and control measures are available for FMD [59]. Up to our knowledge, our work is the first report of the codon usage analysis on FMDV, which has provided a basic understanding of the mechanisms for codon usage bias and the processes governing the evolution of FMDV
  56 in total

1.  Translational selection shapes codon usage in the GC-rich genome of Chlamydomonas reinhardtii.

Authors:  H Naya; H Romero; N Carels; A Zavala; H Musto
Journal:  FEBS Lett       Date:  2001-07-20       Impact factor: 4.124

2.  Third position codon composition suggests two classes of genes within the Cauliflower mosaic virus genome.

Authors:  S M Leisner; D A Neher
Journal:  J Theor Biol       Date:  2002-07-21       Impact factor: 2.691

3.  The strength of translational selection for codon usage varies in the three replicons of Sinorhizobium meliloti.

Authors:  Lucía Peixoto; Alejandro Zavala; Héctor Romero; Héctor Musto
Journal:  Gene       Date:  2003-11-27       Impact factor: 3.688

4.  Identification and characterization of foot-and-mouth disease virus O1 Burgwedel/1987 as an intertypic recombinant.

Authors:  O Krebs; O Marquardt
Journal:  J Gen Virol       Date:  1992-03       Impact factor: 3.891

5.  What drives codon choices in human genes?

Authors:  S Karlin; J Mrázek
Journal:  J Mol Biol       Date:  1996-10-04       Impact factor: 5.469

6.  Efficient creation of sequencing libraries from blunt-ended restriction enzyme fragments.

Authors:  S Hassard; G Ward
Journal:  Biotechniques       Date:  1995-03       Impact factor: 1.993

7.  Genome comparison of a novel foot-and-mouth disease virus with other FMDV strains.

Authors:  Qian Feng; Huan Yu; Yingying Liu; Chengqiang He; Jingsong Hu; Huachun Sang; Neizheng Ding; Mingxiao Ding; Yin-Wan Wendy Fung; Lok-Ting Lau; Albert Cheung-Hoi Yu; Jianguo Chen
Journal:  Biochem Biophys Res Commun       Date:  2004-10-08       Impact factor: 3.575

Review 8.  Foot-and-mouth disease virus.

Authors:  Esteban Domingo; Eric Baranowski; Cristina Escarmís; Francisco Sobrino
Journal:  Comp Immunol Microbiol Infect Dis       Date:  2002-10       Impact factor: 2.268

Review 9.  Molecular epidemiology of foot-and-mouth disease virus.

Authors:  N J Knowles; A R Samuel
Journal:  Virus Res       Date:  2003-01       Impact factor: 3.303

10.  Analysis of synonymous codon usage in SARS Coronavirus and other viruses in the Nidovirales.

Authors:  Wanjun Gu; Tong Zhou; Jianmin Ma; Xiao Sun; Zuhong Lu
Journal:  Virus Res       Date:  2004-05       Impact factor: 3.303

View more
  27 in total

1.  The effects of the codon usage and translation speed on protein folding of 3D(pol) of foot-and-mouth disease virus.

Authors:  Xiao-Xia Ma; Yu-Ping Feng; Jun-Lin Liu; Bing Ma; Li Chen; Yong-Qing Zhao; Peng-Hui Guo; Jun-Zhen Guo; Zhong-Ren Ma; Jie Zhang
Journal:  Vet Res Commun       Date:  2013-05-29       Impact factor: 2.459

2.  Compare the differences of synonymous codon usage between the two species within cardiovirus.

Authors:  Wen-qian Liu; Jie Zhang; Yi-qiang Zhang; Jian-hua Zhou; Hao-tai Chen; Li-na Ma; Yao-zhong Ding; Yongsheng Liu
Journal:  Virol J       Date:  2011-06-27       Impact factor: 4.099

3.  Comparative study of codon substitution patterns in foot-and-mouth disease virus (serotype O).

Authors:  Insung Ahn; Se Eun Bae; Hyeon Seok Son
Journal:  Exp Mol Med       Date:  2011-10-31       Impact factor: 8.718

4.  Analysis of codon usage in Newcastle disease virus.

Authors:  Meng Wang; Yong-Sheng Liu; Jian-Hua Zhou; Hao-Tai Chen; Li-Na Ma; Yao-Zhong Ding; Wen-Qian Liu; Yuan-Xing Gu; Jie Zhang
Journal:  Virus Genes       Date:  2011-01-20       Impact factor: 2.332

5.  Analysis of synonymous codon usage in hepatitis A virus.

Authors:  Yiqiang Zhang; Yongsheng Liu; Wenqian Liu; Jianhua Zhou; Haotai Chen; Yin Wang; Lina Ma; Yaozhong Ding; Jie Zhang
Journal:  Virol J       Date:  2011-04-16       Impact factor: 4.099

6.  Complete coding sequence characterization and comparative analysis of the putative novel human rhinovirus (HRV) species C and B.

Authors:  Piyada Linsuwanon; Sunchai Payungporn; Kamol Suwannakarn; Thaweesak Chieochansin; Apiradee Theamboonlers; Yong Poovorawan
Journal:  Virol J       Date:  2011-01-07       Impact factor: 4.099

7.  Analysis of codon usage and nucleotide composition bias in polioviruses.

Authors:  Jie Zhang; Meng Wang; Wen-qian Liu; Jian-hua Zhou; Hao-tai Chen; Li-na Ma; Yao-zhong Ding; Yuan-xing Gu; Yong-sheng Liu
Journal:  Virol J       Date:  2011-03-30       Impact factor: 4.099

8.  Comparison of correspondence analysis methods for synonymous codon usage in bacteria.

Authors:  Haruo Suzuki; Celeste J Brown; Larry J Forney; Eva M Top
Journal:  DNA Res       Date:  2008-10-21       Impact factor: 4.458

9.  Pandemic influenza A virus codon usage revisited: biases, adaptation and implications for vaccine strain development.

Authors:  Natalia Goñi; Andrés Iriarte; Victoria Comas; Martín Soñora; Pilar Moreno; Gonzalo Moratorio; Héctor Musto; Juan Cristina
Journal:  Virol J       Date:  2012-11-08       Impact factor: 4.099

10.  Synonymous codon usage in TTSuV2: analysis and comparison with TTSuV1.

Authors:  Zhicheng Zhang; Wei Dai; Dingzhen Dai
Journal:  PLoS One       Date:  2013-11-26       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.