Literature DB >> 20438864

Analysis of synonymous codon usage in porcine reproductive and respiratory syndrome virus.

Yong-sheng Liu1, Jian-hua Zhou, Hao-tai Chen, Li-na Ma, Yao-zhong Ding, Meng Wang, Jie Zhang.   

Abstract

In this study, we calculated the relative synonymous codon usage (RSCU) values and codon usage bias (CUB) values to implement a comparative analysis of codon usage pattern of open reading frames (ORFs) which belong to the two main genotypes of porcine reproductive and respiratory syndrome virus (PRRSV). By analysis of synonymous codon usage values in each ORF of PRRSV, the optimal codons for most amino acids were all C or G-ended codons except GAU for Asp, CAU for His, UUU for Phe and CCU for Pro. The synonymous codon usage patterns in different ORFs of PRRSV were different and genetically conserved. Among them, ORF1a, ORF4, ORF5 and ORF7 could cluster these strains into the two main serotypes (EU and US). Due to mutational pressure, compositional constraint played an important role in shaping the synonymous codon usage pattern in different ORFs, and the synonymous codon usage diversity in ORFs was correlated with gene function. The degree of CUB for some particular amino acids under strong selection pressure probably served as a potential genetic marker for each ORF in PRRSV. However, gene length and translational selection in nature had no effect on the synonymous codon usage pattern in PRRSV. These conclusions could not only offer an insight into the synonymous codon usage pattern and differentiation of gene function, but also assist in understanding the discrepancy of evolution among ORFs in PRRSV. Copyright 2010 Elsevier B.V. All rights reserved.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20438864      PMCID: PMC7106271          DOI: 10.1016/j.meegid.2010.04.010

Source DB:  PubMed          Journal:  Infect Genet Evol        ISSN: 1567-1348            Impact factor:   3.342


Introduction

It is well known that the genetic code chooses 64 codons to represent 20 standard amino acids and stop signals. These alternative codons for the same amino acid are termed as synonymous codons. Although synonymous mutations tend to occur in the third base position, the cases can be interchanged without altering the primary sequence of the protein product. Some reports indicate that synonymous codons are not chosen equally and randomly both within and between genomes (Dittmar et al., 2006, Grantham et al., 1980, Lloyd and Sharp, 1992, Martin et al., 1989, Xie et al., 1998). In general, translation selection in nature and compositional constraints under the mutational pressure are thought to be the two major factors accounting for codon usage variation among genomes in various organisms (Gu et al., 2004, Karlin and Mrázek, 1996, Lesnik et al., 2000, Sharp et al., 1986, Zhong et al., 2007, Zhou et al., 2005, Zhou et al., 2006). In some RNA viruses, compared with translation selection in nature, mutation pressure plays an important role in synonymous codon usage pattern (Gu et al., 2004, Levin and Whittome, 2000, Jenkins and Holmes, 2003). Porcine reproductive and respiratory syndrome virus (PRRSV) is an enveloped, single-stranded positive-sense RNA virus which is classified into the order Nidovirales of family Arteriviridae (Benfield et al., 1992, Cavanagh, 1997). Based on serological characteristics, there are the two main serotypes of PRRSV, namely the Northern American isolate (US) and the European isolate (EU) (Bautista et al., 1993, Collins et al., 1992, Meng et al., 1995, Wensvoort et al., 1991). In addition to differences between both serotypes of the viruses, there is obvious genetic variation within PRRSVs, as confirmed by measure of the nucleotide and amino acid sequences of the viruses. The PRRSV genome contains ORF1a, encoding papain-like cysteine protease, ORF1b, encoding RNA dependent RNA polymerase, ORF2-6, encoding envelop proteins, and ORF7, encoding the nucleocapsid protein (Conzelmann et al., 1993, Meulenberg et al., 1993). The PRRSV can infect swine population and lead to a series of clinical signs, including high mortality, reproductive failure, post-weaning pneumonia and growth reduction (Keffaber, 1989, Loula, 1991). It is reported that PRRSV could give rise to prolonged viremia and enable its replication in macrophages, and lead to persistent infections (Plagemann and Moennig, 1992). However, little information about codon usage pattern of PRRSV genome including the relative synonymous codon usage (RSCU) and codon usage bias (CUB) in the process of their evolution are available. In this study, the key genetic determinants of codon usage index in PRRSV were examined.

Materials and methods

Sequence data

The 13 complete RNA sequences of PRRSV were downloaded from the National Center for Biotechnology Information (NCBI) (http://www.ncbi.nlm.nih.gov/Genbank/) and detailed information about the viruses were listed in Table 1 . The nucleotide content of ORFs of each PRRSV strain were analyzed by biosoftware DNAStar 7.0 for windows.
Table 1

PRRSV isolates and genome sequences included in this study.

No.StrainSerotypeIsolationLength (bp)Accession no.
116244B (US field strain)USUSA15,428NC_001961
2SY0608 (US field strain)USChina15,335EU144079
3ATCC VR-2332 (US prototype)USUSA15,411U87392
4EuroPRRSV (EU field strain)EUUSA15,047AY366525
5Lelystad (EU field strain)EUNetherlands15,111M96262
6Ingelvac MLV (US vaccine strain)USUSA15,412EF484033
7HB-2(sh)/2002 (US field strain)USChina15,398AY262352
8Prime Pac (US vaccine starin)USUSA15,521DQ779791
9KNU-07 (EU field strain)EUSouth Korea15,038FJ349261
1001CB1 (EU field strain)EUThailand14,943DQ864705
11BJEU06-1 (EU field strain)EUChina15,059GU047344
12HKEU16 (EU field strain)EUChina: Hong Kong15,074EU076704
13NMEU 09-1 (EU filed strain)EUChina15,068GU047345
PRRSV isolates and genome sequences included in this study.

The calculation of the relative synonymous codon usage (RSCU)

To investigate the characteristics of synonymous codon usage without the confounding influence of amino acid composition among different sequences, the relative synonymous codon usage (RSCU) values among different codons in each ORF was calculated. The RSCU value of the ith codon for the jth amino acid was calculated according to the published equation (Sharp and Li, 1986). RSCU:where g is the observed number of the ith codon for jth amino acid which has n type of synonymous codons. The codons with RSCU values more than 1.0 have positive CUB, while the values <1.0 have relative negative CUB. When RSCU value is equal to 1.0, it means that this codon is chosen equally and randomly.

Codon usage bias (CUB) calculations

To calculate CUB, it is supposed that statistically equal and random usage of all available synonymous codons was the “neutral point” (RSCU0  = 1.00) for the development of serotype-specific codon usage (Zhou et al., 2010). CUB: More simply, CUB is the average value of difference between RSCU and RSCU0 at each position of the target region. n represents all codons appearing in this position. When all RSCU values according to a particular position in the target region are RSCU0, CUB is equal to 0. It means that there are few preferential or non-preferential codons existing at this position. In contrast, when CUB value is much more deviation than RSCU0, codons with CUB are preferentially chosen at a particular position.

Principal component analysis

Principal component analysis, which was a commonly used multivariate statistical method (Jolliffe, 2002, Mardia et al., 1979), was carried out to analyze the major trend in codon usage pattern among different strains. Each strain was represented as a 59 dimensional vector, and each dimension corresponded to the RSCU value of each sense codon, which only included several synonymous codons for a particular amino acid, excluding the codon of AUG, UGG and three stop codons. We set up a two-dimensional coordinate system, which was made up of the first principal component and the second principal component . This two-dimensional coordinate would report the genetic relationship among all strains.

Correlation analysis

Correlation analysis of PRRSV was used to identify the relationship between nucleotide composition and synonymous codon usage pattern (Ewens and Grant, 2001). This analysis was implemented based on the Spearman's rank correlation analysis way. All statistical processes were carried out by with statistical software SPSS11.5 for windows.

Results

Synonymous codon usage in PRRSV

The nucleotide contents in each ORF of 13 PRRSV isolates were analyzed and the comparison among the values of A3%, U3%, C3% and G3% indicated that the content of A3 was always lowest and the rest fluctuated similarly. The (C3  + G3)% in ORFs of PRRSV fluctuates from 38.13% to 59.68% (Table 2 ). The overall RSCU values of 59 sense codons in PRRSV were listed in Table 3 , respectively. Most optimal codons among strains represent C-ended or G-ended codons, however, there was an interesting findings that four codons of GAU, CAU, UUU, CCU and their corresponding amino acids of Asp, His, Phe, Pro were chosen preferentially. Most of A-ended codons, excluding CCA and UCA with little slight preferential usage for Pro and Ser were used weakly (Table 3). These results suggested that compositional limitation often played an integral role in the codon usage pattern of PRRSV.
Table 2

Identified ORFs (length > 250 bps) in the PRRSV (13 isolates) genome.

ORFNo.A%A3%U%U3%C%C3%G%G3%(C + G)%(C3 + G3)%
1a120.4516.3524.9628.1327.2432.1627.3623.3454.5955.51
220.4216.0025.0429.3526.9831.2527.5723.3754.5554.63
320.4616.1825.0428.3627.2432.0227.2623.4154.5055.44
420.1116.6426.8332.0227.5531.1025.5020.2253.0551.33
520.8417.1725.0928.1927.0231.6827.0522.9554.0754.63
620.4716.1325.0828.4127.2132.0227.2523.4154.4555.44
720.7617.0425.7029.5926.5530.5126.9922.8453.5353.35
820.8317.2225.0628.327.0531.7627.0623.0754.1154.84
920.0416.6026.8132.2527.6731.0725.4820.0753.1551.13
1019.9716.6326.6631.5727.7831.4825.5920.3153.3751.79
1119.7616.7126.9131.8227.7231.4825.6119.9727.7251.45
1219.9016.2926.9132.3127.5631.4525.6319.9353.1951.39
1319.6916.2326.6531.5527.7131.3425.9420.8653.6552.20



1b123.4217.4623.5826.1627.6734.7225.3421.6453.0056.36
223.5217.2323.7728.1027.2132.8325.5021.8252.7154.66
323.4617.6223.8226.7527.3734.1125.3521.5152.7255.62
423.6219.5424.2628.3926.9130.4525.2021.6052.1152.05
523.7718.2924.0927.6127.1233.4725.0220.6252.1454.09
623.6217.9023.8226.9627.3733.8225.1921.3052.5655.13
723.3818.0524.4128.726.8732.2225.3421.0152.2153.24
823.9218.2124.0827.6427.1933.5424.8220.5952.0154.13
923.6518.6123.7826.6827.4832.5525.0922.1552.5754.70
1023.8619.8823.8827.5027.3531.3124.9121.2952.2552.60
1123.6819.3024.0227.4827.2131.4325.0921.7752.3053.20
1223.7719.4623.9527.2227.2531.5925.0221.7252.2853.31
1323.6819.6023.6326.0927.3932.1527.3922.1452.6954.30



2121.2722.9732.4335.3122.9620.0023.3521.7046.3041.70
220.8821.2732.2234.8923.2220.8523.2222.9746.4343.82
320.8822.1232.3035.3122.9619.5723.8722.9746.8242.55
419.6018.8829.4728.3226.9329.6124.0023.1750.9352.78
519.5819.6532.1735.0423.2220.5125.0324.7848.2545.29
620.8822.1232.6836.1722.9619.5723.4822.1246.4341.70
720.7522.1232.1733.6123.6122.5523.4821.7047.0844.25
819.5819.6532.1735.0423.2220.5125.0324.7848.2545.29
920.5320.1730.0028.3226.4029.6123.0721.8849.4751.50
1019.4718.8829.6027.4626.9330.4724.0023.1750.9353.64
1119.2018.1029.7328.4427.0730.1724.0023.2751.0753.44
1220.2721.4530.0029.1826.8029.1822.9320.1749.7349.00
1320.0021.0330.2728.3226.1329.6123.6021.0349.7353.00



3120.0016.7329.2828.9726.0131.8324.7122.4450.7254.28
220.5216.0429.2830.8626.1433.3324.0519.7550.2053.08
320.5217.5529.1528.9726.1432.2424.1821.2250.3353.46
422.0619.9228.3227.0926.8231.8722.8121.1149.6252.98
519.0815.5128.8928.9726.6733.0625.3622.4452.0355.51
620.7817.1423.9229.3829.2831.8326.0121.6349.9353.46
718.9014.4029.0030.4526.9033.3325.2021.8152.1055.14
819.0815.5128.8928.9726.6733.0625.3622.4452.0355.51
923.1820.1527.5726.4826.9433.5922.3119.7649.2553.35
1021.1817.3927.5726.4827.4432.0123.8124.1151.2556.12
1121.3218.3629.2028.5727.1331.0222.3522.0449.4853.06
1222.0920.3228.2927.2327.5232.1122.0920.3249.6152.43
1321.1918.3628.1726.1227.5233.8723.1321.6350.6555.51



4119.9313.3731.6635.4624.7729.6523.6521.5148.4251.16
220.3013.9531.6630.2324.7736.0423.2819.7648.0455.81
320.3013.3723.2835.4631.6629.6524.7721.5148.0451.16
422.2814.7728.8031.8126.6330.6822.2822.7248.9153.40
519.1812.2031.1033.7225.7031.9724.0222.0949.7254.06
620.1113.3723.6534.8831.2830.2324.9521.5148.6051.74
720.4815.1130.3530.8126.6334.8822.5319.1849.1654.06
819.1812.2031.1033.7225.7031.9724.0222.0949.7254.06
922.6415.3427.1726.728.2634.6521.9223.2950.1857.95
1021.3814.7728.4430.6826.9931.8123.1922.7250.1854.54
1121.2114.2829.7330.9526.7032.1422.3522.6149.0554.76
1220.8312.9429.5532.9426.5229.4123.1124.7049.6254.11
1321.0213.6029.5531.3626.3330.7623.1124.2649.4355.02



5120.2313.5429.6830.7224.0534.3726.0421.3550.0855.72
220.0711.9729.3530.2024.7133.3325.8724.4750.5857.81
320.4013.5425.8730.7229.3534.3724.3821.3550.2555.72
420.4615.1029.5430.224.2635.9325.7418.7550.0054.68
520.0713.5430.0230.223.2232.2926.7023.9549.9256.25
620.4013.5429.3530.7224.3834.3725.8721.3550.2555.72
719.9012.5030.5132.2923.0531.2526.5323.9549.5955.20
820.0713.5430.0230.2023.2232.2926.7023.9549.9256.25
919.6413.5428.2226.5625.5837.5026.5722.3952.1559.89
1020.1317.1827.7225.5225.9139.5826.2417.7052.1557.29
1121.6217.7028.2228.1225.2535.9324.9218.0050.1754.16
1219.4713.9828.8829.5324.5935.2327.0622.0051.6556.47
1321.7816.3129.5431.5722.7731.5725.9120.5248.6852.10



6122.2917.8526.2923.2125.9034.5225.5224.4051.4358.92
222.6719.6426.4824.4025.7132.7325.1423.2150.8655.95
322.6719.6426.4824.4025.7132.7325.1423.2150.8655.95
423.3727.3825.4823.2127.0133.9224.1415.4751.1549.40
521.5217.8527.0526.1924.9530.9526.4825.0051.4355.95
622.6719.6426.4824.4025.5233.3325.3322.6150.8655.95
722.2919.0426.4823.8025.5233.3325.7123.8051.2457.14
821.5217.8527.0526.1924.9530.9526.4825.0051.4355.95
922.0324.4026.0524.4026.2532.1425.6719.0451.9251.19
1023.3726.1926.6325.5925.8631.5424.1416.6650.0048.21
1124.5214.2825.2930.9526.6332.1423.5622.6150.1954.76
1221.8422.7525.8620.9526.6337.1225.6719.1652.3056.28
1322.8023.9527.0126.9425.1030.5325.1018.5650.1949.10



7130.1119.0019.0923.9625.2723.9625.5433.0550.8157.02
230.1119.8319.3521.4824.7324.7925.8133.8850.5458.67
330.1119.0019.3524.7925.0023.1425.5433.0550.5456.19
427.6520.1619.9025.8026.1022.4826.3631.4552.4554.03
530.3819.0019.0923.1425.8125.6124.7332.2350.5457.85
630.1119.0019.3524.7925.0023.1425.5433.0550.5456.19
729.0319.0018.2819.8325.8126.4426.8834.7152.6959.82
830.3819.0019.0923.1425.8125.6124.7332.2350.5457.85
928.1720.1621.1925.8025.3223.3825.3230.6450.6554.03
1027.3919.3519.9025.0026.6124.1920.1031.4552.7155.64
1127.3921.1319.9026.8226.6123.5726.1028.4552.7152.03
1226.3618.6920.1629.2627.1325.2026.3631.7053.4956.91
1327.1317.7420.9328.2225.3220.9626.6133.0651.9454.03
Table 3

Synonymous codon usage in PRRSV ORFs.

AAaCodonRSCUbAAaCodonRSCUb
AlaGCA0.857LeuCUA0.510
GCC1.309CUC1.164
GCG0.650CUG1.377
GCU1.184CUU1.096



ArgAGA0.893UUA0.330
AGG1.092UUG1.522
CGA0.742LysAAA0.997
CGC1.411AAG1.003
CGG1.012PhecUUC0.917
CGU0.851UUU1.083



AsnAAC1.040ProcCCA1.047
AAU0.960CCC1.059



AspcGAC0.955CCG0.802
GAU1.045CCU1.092



CysUGC1.001SerAGC0.872
UGU0.999AGU0.806



GlnCAA0.982UCA1.075
CAG1.018UCC1.418



GluGAA0.884UCG0.683
GAG1.116UCU1.145



GlyGGA0.559ThrACA0.969
GGC1.369ACC1.439
GGG1.016ACG0.653
GGU1.056ACU0.939



HiscCAC0.847TyrUAC1.174
CAU1.153UAU0.826



IleAUA0.766ValGUA0.365
AUC1.166GUC1.110
AUU1.068GUG1.406
GUU1.118

The preferentially used codons for each amino acid are described in bold.

AA is the abbreviation of amino acid.

RSCU values are mean values.

The preferentially used codon is U-end codon.

Identified ORFs (length > 250 bps) in the PRRSV (13 isolates) genome. Synonymous codon usage in PRRSV ORFs. The preferentially used codons for each amino acid are described in bold. AA is the abbreviation of amino acid. RSCU values are mean values. The preferentially used codon is U-end codon.

Genetic relationship based on synonymous codon usage in PRRSV

Principal component analysis was carried out for the identified ORFs of all samples. The method detected one major trend in the first axis which can account for 14.59% of the total synonymous codon usage variation, and another major trend in the second axis for 10.81% of the total variation. A plot of the and the of each ORF in PRRSV is shown in Fig. S1. It appeared to be a little complex with some overlapping plots of ORF1a to ORF7. The plots of ORF1a and ORF1b which can produce viral nonstructural proteins aggregated highly, however, the plots of the rest scattered to different extents (Fig. S1). It was clear that the plots of ORF4-7 were far from each other obviously, implying that the functions of differential viral products played a role in codon usage pattern. The ORF1a, ORF4, ORF5 and ORF7 could distinguish US and EU serotypes obviously, and these plots reflected codon usage pattern corresponding to each ORF (Fig. S2). It is probably been suggested during the evolution of PRRSV, the function of viral proteins enabled PRRSV genome to shape the characteristic of the US and the EU serotypes of PRRSV. The ORF1a could reflect the conserved pattern of synonymous codon usage between the two serotypes compared with ORF4, 5 and 7. This phenomenon reflected that synonymous codon usage pattern of ORF1a was relatively conserved due to the function of nonstructural protein of PRRSV, in contrast, due to the functions of structural proteins, the codon usage patterns of ORF4, 5 and 7 for structural proteins were relative variable. However, the scattered plots of ORF3 reflected that the two main serotypes had a common characteristic of synonymous codon usage pattern, namely, the plots failed to distinguish the two virus serotypes obviously. It suggested that the viral product encoded by ORF3 of different serotypes of PRRSV had a slight ability to identify between US and EU serotypes.

Compositional properties of all ORFs of PRRSV

To analyze if the different viruses’ ORFs displayed similar compositional characteristics, all and values for strains were calculated by principle component analysis, and all the obtained positions content of A, U, C, G, and (C + G) (A%, U%, C%, G%, (C + G)%), and the third position content of A, U, C, G and (C3  + G3) (A3%, U3%, C3%, G3%, (C3  + G3)), were measured and listed in Table 2. By analysis of overlay scatter-plot, the content of each nucleotide at the synonymous third position of sense codons did not fluctuate following the total content of corresponding nucleotide, especially A3%, C3%, G3% (Fig. S3). It is implied that the pattern of synonymous codon usage may not be directly and simply correlated to nucleotide content, but to selection pressure (e.g. mutation pressure and gene function). However, it was observed that the relationship between (C3  + G3)% and (C + G)% did not reveal some special features, namely it did not reflect the real variation of C3% or G3%. Thus, the variation of (C3  + G3)% may have a slight correlation with the codon usage pattern of PRRSV. To analyze codon usage pattern regulated by natural selection or mutation pressure, the A%, U%, C%, G% and (C + G)% were compared with A3%, U3%, C3%, G3% and (C3  + G3)%, respectively. An interesting and complex correlation was observed. In detail, it was apparent that positive correlation existed among the nucleotide contents (i.e., A% and A3%) (p  < 0.01), however, there were three different correlation degrees including positive correlation, negative correlation or non-correlation among the nucleotides. Further, A3% has no correlation with C% or G%, suggesting nucleotide constraint can influence codon usage of the nucleotide A at the synonymous codon; the fluctuation of U3% can be affected by A%, C% and G%, respectively, and C3% is not related to G%, they does not indicate some special feature; however, G3% has a strong positive correlation with A% and non-correlation with C%, indicating (C + G)% and (C3  + G3)% may not reflect some true feature of synonymous codon usage as well (Table 4 ). In addition, the and values were compared with nucleotide composition of each sample (Table 5 ). Although the codon usage patterns in different strains appeared to be related to (C3  + G3)% to a slight extent, correlation analysis has been carried out to find significant correlation between nucleotide compositions and synonymous codon usage to a certain extent. In addition, there is no obvious relationship between codon usage indices ( values) and the length of different ORFs, for example there is no significant difference among codon usage indices for ORF1a, ORF1b and ORF2 (p  > 0.05), implying that the different length of genes did not account for codon usage variation of ORFs in PRRSV. Taken together, these analyses indicate that nucleotide compositions play a role in the pattern of codon usage. Furthermore, mutational pressure is the main factor responsible for the variation of synonymous codon usage among samples.
Table 4

Summary of correlation analysis between the A, U, C, G contents and A3, U3, C3, G3 contents in all selected samples.

A3%U3%C3%G3%(C3 + G3)%
A%r = 0.378**r = −0.612**r = −0.433**r = 0.738**r = 0.249**
U%r = −0.201*r = 0.635**r = 0.170NSr = −0.613**r = −0.458**
C%r = −0.124NSr = −0.125NSr = 0.368**r = −0.189NSr = 0.239*
G%r = −0.131NSr = −0.192NSr = 0.108NSr = 0.332**r = 0.306**
(C + G)%r = −0.006NSr = −0.367**r = 0.198*r = 0.126NSr = 0.375**

NS means non-significant (p > 0.05).

Means 0.01 < p < 0.05.

Means p < 0.01.

Table 5

Summary of correlation analysis between the first two axes in principle and nucleotide contents in samples.

Base compositionsf1 (15.083%)f2 (11.171%)
A3%r = −0.449**r = 0.441**
U3%r = 0.529**r = −0.219**
C3%r = 0.519**r = 0.256**
G3%r = −0.695**r = −0.454**
(C3 + G3)%r = −0.128NSr = −0.194**

NS means non-significant.

Means p < 0.01.

Summary of correlation analysis between the A, U, C, G contents and A3, U3, C3, G3 contents in all selected samples. NS means non-significant (p > 0.05). Means 0.01 < p < 0.05. Means p < 0.01. Summary of correlation analysis between the first two axes in principle and nucleotide contents in samples. NS means non-significant. Means p < 0.01.

Qualitative evaluation of codon usage bias in ORFs of PRRSV

There was a seemingly random variation in CUB between amino acids and ORFs. There were several synonymous codons with strong discrepancy for codon usage in each ORF. In details, as for ORF2, GGU for Gly, UUG for Leu, CCA for Pro are chosen preferentially; in ORF3, GAC for Asp, GGG for Gly, GUU for Val are chosen preferentially; in ORF4, AGG for Arg, CAU for His, CCC for Pro are used preferentially; in ORF5, CAA for Gln, AAA for Lys, CCG for Pro, AGC for Ser are used preferentially; in ORF6, GAA for Glu, CAC for Thr, ACA for Thr are used preferentially; as for ORF7, CGC for Arg, UGC for Cys, CUG for Leu, AAG for Lys, AGU for Ser, ACU for Thr, GUC for Val. In ORF1a–1b, there is no preferential condon (Fig. S4). These results may suggest that with the development of evolution of PRRSV, the discrepancy of synonymous codon usage was formed by accumulation of mutation.

Relationship between amino acids and codon usage pattern in PRRSV

In order to analyze whether the evolution of CUB was controlled by mutation pressure or by translational selection in nature, the CUB data had been calculated based on data listed in Table 3. This table displayed a numerical representation of the translational machinery. The distribution of CUB values is illustrated in Fig. S5. The transition from maximum-negative to maximum-positive values was smooth and there was no obvious or unambiguous border between the so-called dominant and prohibited codons, namely, all possible codons were used. The result indicated that translational selection in nature has no effect on the pattern of synonymous codon usage and the evolutionary pattern of PRRSV.

Discussion

Generally, previous reports indicates that many viruses including foot-and-mouth disease viruses, influenza A virus subtype H5N1, severe acute respiratory syndrome Coronavirus (SARSCoV) and human bocavirus, preferentially use C and G-ended codons (Zhao et al., 2008, Zhong et al., 2007, Zhou et al., 2005). It is unclear how the EU and US serotypes of PRRSV influence codon usage pattern to date. In this study, it is revealed that preferentially used codons are C, G and U-ended codons for the PRRSV, especially, four amino acids (Cys, His, Phe and Pro) preferentially used U-ended codons. One possible explanation for why PRRSV has three types of optimal codons, compared with other RNA viruses mentioned above was that more types of optimal codon is advantageous to PRRSV which need to replicate efficiently in host cells with potentially distinct codon preferences. Although PRRSV could not been classified into US and EU serotypes by the RSCU values of all ORFs, some reports on the genetic analysis of the ORF5 could help us understand the genetic relationships among different isolates (Cha et al., 2006, Chen et al., 2006, Mateu et al., 2006). It is known that ORF5 which encodes the major envelope glycoprotein (GP5) is one of the main immunogenic proteins of PRRSV and is the leading target for the development of the genetic engineering vaccines against PRRS. However, due to striking genetic and antigenic variability among different PRRSV isolates, GP5 of a strain has a negative effect on the cross-protection efficiency against heterologous PRRSV strains (Kim and Yoon, 2008, Meng, 2000). The synonymous codon usage pattern of ORF1a or ORF5 has the ability to distinguish between US and EU serotypes. To some degree, it seems that codon usage variation of ORF1a has a better ability to distinguish strain types than that of ORF5, and the latter has a remarkable characteristic of genetic variability within both homologous serotype and heterologous serotypes. However, the synonymous codon usage pattern of ORF3 does not group strains into US or EU serotype obviously, possibly suggesting that GP3 contains a common biological function in both serotypes. The result is consistent with some viewpoints about GP3 of PRRSV (Meulenberg et al., 1995). One possible explanation is that biological function of different viral products is developed by mutational pressure, rather than natural selection. It is also reported that there was a relationship between geographical location of sample origin and the genetic diversity of PRRSV (Stadejek et al., 2006). In this study, although synonymous codon usage variation fails to reflect that a geographical factor, the index has an obvious effect on codon usage pattern. This result is in agreement with the report by Pesch et al. (2005). As for RNA viruses, the major factor of shaping codon usage patterns appears to be mutation pressure rather than natural selection (Zhao et al., 2008, Zhong et al., 2007, Zhou et al., 2005). To reveal the main force driving codon usage variation, it was found that codon usage bias was strong correlated with overall genomic (G + C) content, implying that composition constraint under mutational pressure rather than natural selection for specific coding triplets (Shackelton et al., 2006). Naya et al. (2001) found that in Chlamydomonas reinhardtii genome with high (G + C) content, there was no evidence that composition constraint played a role in shaping codon usage pattern. The results indicated that there were two main forces, namely, natural selection and mutational pressure, existing in the process of evolution. In this study, each base composition at the third position of synonymous codon was always correlated to the other base composition (Table 4). The fact that (G + C) content varies in a similar way at all codon positions is usually assumed to be the result of mutational bias. The fact that (G + C) content varies in a similar way at all codon positions is usually assumed to be the result of mutational pressure. A general mutational pressure, which affects the whole genome would certainly account for the majority of the codon usage has been reported among some RNA viruses. Since mutation rates of RNA viruses are much higher than those DNA viruses (Jenkins and Holmes, 2003), it is understandable that mutation pressure plays a key role in shaping the synonymous codon usage pattern in different ORFs of the 13 PRRSV strains included in this study. In addition, the general association between codon usage indices and composition constraint shows that mutational pressure plays an important role in determining codon usage variation of PRRSV, which is supported by the highly significant correlation between codon usage indices and A3%, U3%, G3%, C3% (Table 5). Although the codon usage indices is strong correlated with composition constraint (C3  + G3)% has stronger correlation with values than , implying that it has no real correlation with codon usage indices. The fluctuation of (C3  + G3)% does not reflect C3% and G3% and (C3  + G3)% may not always present the degree of composition constraint, so the fluctuation of each nucleotide composition needs to be analyzed further. Sequence analysis indicated PRRSV achieve evolution through random mutation and intragenic recombination (Kapur et al., 1996, Meng et al., 1995, Murtaugh et al., 1995, Nelsen et al., 1999). Taken together, mutational pressure is one of the main factors responsible for the variation of synonymous codon usage among ORF coding sequences in PRRSV. The codon usage bias for synonymous codon in different ORFs of PRRSV shows different feature in the study. Among these ORFs, each includes some amino acids with strong selective discrepancy and some synonymous codons for a particular amino acid has been used preferentially. It seems that the codon usage discrepancy for some synonymous codon may serve as a potential genetic marker. Nielsen et al. (2001) examined the alternative of synonymous and non-synonymous amino acids in ORF1 of PRRSV and pointed out that there was a stronger selective pressure for amino acid conservation during spread in host animal. Storgaard et al. (1999) found that non-synonymous nucleotide mutations in ORF7 of PRRSV were not stronger than ORF5. These findings suggest that gene function and mutational pressure likely affect codon usage variation. In addition, the effect of natural translation selection in shaping synonymous codon usage is not observed in this study and this is consistent with those previous reports (Biro, 2008, Gu et al., 2004, Levin and Whittome, 2000, Zhou et al., 2005). These results probably assist us in understanding various factors influencing evolution of PRRSV.
  42 in total

1.  Ribosome traffic in E. coli and regulation of gene expression.

Authors:  T Lesnik; J Solomovici; A Deana; R Ehrlich; C Reiss
Journal:  J Theor Biol       Date:  2000-01-21       Impact factor: 2.691

2.  Translational selection shapes codon usage in the GC-rich genome of Chlamydomonas reinhardtii.

Authors:  H Naya; H Romero; N Carels; A Zavala; H Musto
Journal:  FEBS Lett       Date:  2001-07-20       Impact factor: 4.124

3.  The relationship between synonymous codon usage and protein structure.

Authors:  T Xie; D Ding; X Tao; D Dafu
Journal:  FEBS Lett       Date:  1998-08-28       Impact factor: 4.124

Review 4.  Codon catalog usage and the genome hypothesis.

Authors:  R Grantham; C Gautier; M Gouy; R Mercier; A Pavé
Journal:  Nucleic Acids Res       Date:  1980-01-11       Impact factor: 16.971

5.  Isolation of swine infertility and respiratory syndrome virus (isolate ATCC VR-2332) in North America and experimental reproduction of the disease in gnotobiotic pigs.

Authors:  J E Collins; D A Benfield; W T Christianson; L Harris; J C Hennings; D P Shaw; S M Goyal; S McCullough; R B Morrison; H S Joo
Journal:  J Vet Diagn Invest       Date:  1992-04       Impact factor: 1.279

6.  Analysis of synonymous codon usage in H5N1 virus and other influenza A viruses.

Authors:  Tong Zhou; Wanjun Gu; Jianmin Ma; Xiao Sun; Zuhong Lu
Journal:  Biosystems       Date:  2005-04-07       Impact factor: 1.973

7.  Porcine reproductive and respiratory syndrome virus comparison: divergent evolution on two continents.

Authors:  C J Nelsen; M P Murtaugh; K S Faaberg
Journal:  J Virol       Date:  1999-01       Impact factor: 5.103

8.  Tissue-specific differences in human transfer RNA expression.

Authors:  Kimberly A Dittmar; Jeffrey M Goodenbour; Tao Pan
Journal:  PLoS Genet       Date:  2006-11-13       Impact factor: 5.917

9.  Lelystad virus, the causative agent of porcine epidemic abortion and respiratory syndrome (PEARS), is related to LDV and EAV.

Authors:  J J Meulenberg; M M Hulst; E J de Meijer; P L Moonen; A den Besten; E P de Kluyver; G Wensvoort; R J Moormann
Journal:  Virology       Date:  1993-01       Impact factor: 3.616

10.  Phylogenetic analyses of the putative M (ORF 6) and N (ORF 7) genes of porcine reproductive and respiratory syndrome virus (PRRSV): implication for the existence of two genotypes of PRRSV in the U.S.A. and Europe.

Authors:  X J Meng; P S Paul; P G Halbur; M A Lum
Journal:  Arch Virol       Date:  1995       Impact factor: 2.574

View more
  12 in total

1.  Genetic analysis of the PB1-F2 gene of equine influenza virus.

Authors:  Gang Lu; Wei Guo; Ting Qi; Jian Ma; Shihua Zhao; Zhige Tian; Jialiang Pan; Chao Zhu; Xiaojun Wang; Wenhua Xiang
Journal:  Virus Genes       Date:  2013-06-19       Impact factor: 2.332

2.  Comparative [corrected] codon usage between the three main viruses in pestivirus genus and their natural susceptible livestock.

Authors:  Jian-hua Zhou; Zong-liang Gao; Jie Zhang; Hao-tai Chen; Zygmunt Pejsak; Li-na Ma; Yao-zhong Ding; Yong-sheng Liu
Journal:  Virus Genes       Date:  2012-03-07       Impact factor: 2.332

3.  Analysis of codon usage in bovine viral diarrhea virus.

Authors:  Meng Wang; Jie Zhang; Jian-hua Zhou; Hao-tai Chen; Li-na Ma; Yao-zhong Ding; Wen-qian Liu; Yong-sheng Liu
Journal:  Arch Virol       Date:  2010-11-11       Impact factor: 2.574

4.  The characteristics of the synonymous codon usage in hepatitis B virus and the effects of host on the virus in codon usage pattern.

Authors:  Ming-ren Ma; Xiao-qin Ha; Hui Ling; Mei-liang Wang; Fang-xin Zhang; Shang-di Zhang; Ge Li; Wei Yan
Journal:  Virol J       Date:  2011-12-15       Impact factor: 4.099

5.  Mapping codon usage of the translation initiation region in porcine reproductive and respiratory syndrome virus genome.

Authors:  Jun-hong Su; Xiao-xia Ma; Ya-li He; Ji-dong Li; Xu-sheng Ma; Yong-xi Dou; Xue-nong Luo; Xue-peng Cai
Journal:  Virol J       Date:  2011-10-21       Impact factor: 4.099

6.  Significant differences in terms of codon usage bias between bacteriophage early and late genes: a comparative genomics analysis.

Authors:  Oriah Mioduser; Eli Goz; Tamir Tuller
Journal:  BMC Genomics       Date:  2017-11-13       Impact factor: 3.969

7.  Quasispecies evolution of the prototypical genotype 1 porcine reproductive and respiratory syndrome virus early during in vivo infection is rapid and tissue specific.

Authors:  Tahar Ait-Ali; Jean-Pierre Frossard; Zen H Lu; Xinglong Wang; Alison D Wilson; Daniel L W Dorey-Robinson; Alan L Archibald
Journal:  Arch Virol       Date:  2017-03-30       Impact factor: 2.574

8.  Evolutionary Patterns of Codon Usage in Major Lineages of Porcine Reproductive and Respiratory Syndrome Virus in China.

Authors:  Weixin Wu; Xinna Ge; Yongning Zhang; Jun Han; Xin Guo; Lei Zhou; Hanchun Yang
Journal:  Viruses       Date:  2021-05-31       Impact factor: 5.048

9.  Patterns of synonymous codon usage on human metapneumovirus and its influencing factors.

Authors:  Qiao Zhong; Weidong Xu; Yuanjian Wu; Hongxing Xu
Journal:  J Biomed Biotechnol       Date:  2012-11-05

10.  The effects of the context-dependent codon usage bias on the structure of the nsp1α of porcine reproductive and respiratory syndrome virus.

Authors:  Yao-zhong Ding; Ya-nan You; Dong-jie Sun; Hao-tai Chen; Yong-lu Wang; Hui-yun Chang; Li Pan; Yu-zhen Fang; Zhong-wang Zhang; Peng Zhou; Jian-liang Lv; Xin-sheng Liu; Jun-jun Shao; Fu-rong Zhao; Tong Lin; Laszlo Stipkovits; Zygmunt Pejsak; Yong-guang Zhang; Jie Zhang
Journal:  Biomed Res Int       Date:  2014-08-03       Impact factor: 3.411

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.