Literature DB >> 15041183

Analysis of synonymous codon usage in SARS Coronavirus and other viruses in the Nidovirales.

Wanjun Gu¹, Tong Zhou, Jianmin Ma, Xiao Sun, Zuhong Lu.

Abstract

In this study, we calculated the codon usage bias in severe acute respiratory syndrome Coronavirus (SARSCoV) and performed a comparative analysis of synonymous codon usage patterns in SARSCoV and 10 other evolutionary related viruses in the Nidovirales. Although there is a significant variation in codon usage bias among different SARSCoV genes, codon usage bias in SARSCoV is a little slight, which is mainly determined by the base compositions on the third codon position. By comparing synonymous codon usage patterns in different viruses, we observed that synonymous codon usage pattern in these virus genes was virus specific and phylogenetically conserved, but it was not host specific. Phylogenetic analysis based on codon usage pattern suggested that SARSCoV was diverged far from all three known groups of Coronavirus. Compositional constraints could explain most of the variation of synonymous codon usage among these virus genes, while gene function is also correlated to synonymous codon usages to a certain extent. However, translational selection and gene length have no effect on the variations of synonymous codon usage in these virus genes.

Entities: Chemical Disease Gene Species

Mesh：

Substances：
Codon

Year: 2004 PMID： 15041183 PMCID： PMC7127446 DOI： 10.1016/j.virusres.2004.01.006

Source DB: PubMed Journal: Virus Res ISSN： 0168-1702 Impact factor: 3.303

Introduction

Synonymous codons are not used equally both within and between genomes (Grantham et al., 1980, Martin et al., 1989, Lloyd and Sharp, 1992). Compositional constraints and natural selection are thought to be the two main factors accounting for codon usage variation among genes in different organisms (Karlin and Mrazek, 1996, Sharp et al., 1986, Lesnik et al., 2000). The diverse patterns of codon usage in mammals may arise from compositional constraints of the genomes (Karlin and Mrazek, 1996, Francino and Ochman, 1999, Majumdar et al., 1999, Ghosh et al., 2000). In contrast, in some unicellular organisms, such as Escherichia coli and Saccharomyces cerevisiae, high expressed genes have a strong selective preference for codons with a high concentration of the corresponding acceptor tRNA molecule, whereas low expressed genes displayed a more uniform pattern of codon usage (Gouy and Gautier, 1982, Grantham et al., 1981, Ikemura, 1981, Ikemura, 1985; Sharp et al., 1986, Lesnik et al., 2000). Moreover, mutational pressure rather than translational selection is the most important determinant of the codon bias in some human RNA viruses (Levin and Whittome, 2000, Jenkins et al., 2001, Jenkins and Holmes, 2003). Furthermore, replicational and transcriptional selection is responsible for the codon usage variation among the genes of Borrelia burgdorferi (McInerney, 1998). In some other researches, codon usage was also found to be related to gene function (Chiapello et al., 1998, Epstein et al., 2000, Ma et al., 2002), protein secondary structure (Chiusano et al., 1999, Chiusano et al., 2000, Oresic and Shalloway, 1998, Xie and Ding, 1998, Gupta et al., 2000), cellular location of gene products (Chiapello et al., 1999) and gene length (Coghlan and Wolfe, 2000, Marais and Duret, 2001, Moriyama and Powell, 1998). Severe acute respiratory syndrome (SARS) is a respiratory disease that was recently reported in Asia, North America and Europe (Chan-Yeung and Yu, 2003, Drazen, 2003). Although genome sequence of severe acute respiratory syndrome Coronavirus (SARSCoV) has been published and many studies have been performed on SARSCoV in recent months (Paul et al., 2003, Qin et al., 2003, Marra et al., 2003, Snijder et al., 2003), little genomic analysis is available on this virus. Codon usage data of SARSCoV might give some clues to the features of SARSCoV genome and some evolutionary information of this virus. Here, we analyzed the codon usage data of this virus and other viruses in the order Nidovirales. The key evolutionary determinants of codon usage bias in these viruses were also investigated.

Materials and methods

Materials

SARSCoV is a large, enveloped, positive-stranded RNA virus, which belongs to order Nidovirales, family Coronaviridae, genus Coronavirus in virus taxonomy (Marra et al., 2003). The complete genome and coding sequences of SARSCoV TOR2 isolation were obtained from GenBank (Version 134.0). To keep the statistical significance of codon usage bias, only sequences with length above 150 bps were analyzed (Table 1 ). To compare the codon usage pattern among different viruses, coding genes of 10 other viruses belonging to order Nidovirales (six viruses in the genus Coronavirus, four viruses in the genus Arterivirus) were also parsed from GenBank (Version 134.0) (Table 2 ).

Table 1

Identified ORFs (length > 150 bps) in the SARSCoV (TOR2 isolation) genomea, b

Gene product	La	ENC	GC_3S (%)	f₁′b

Putative orf1ab polyprotein	21222	48.47	32.20	−0.60
Orf1a polyprotein	13149	48.24	33.10	−0.57
Putative spike glycoprotein	3468	45.73	28.30	−0.85
Putative uncharacterized protein	825	47.66	34.50	−0.37
Putative uncharacterized protein	465	42.80	45.10	1.34
Putative small envelope protein E	231	59.06	38.70	0.34
Putative protein M	666	59.04	42.50	0.51
Putative uncharacterized protein	192	42.19	28.80	−1.08
Putative uncharacterized protein	269	43.05	30.60	−0.55
Putative nucleocapsid protein	1269	54.16	37.60	0.49
Putative uncharacterized protein	297	46.62	58.10	1.87

L represents the length of identified ORF.

f1′ represent the first axis values of each gene in CA.

Table 2

Phylogenetic breakdown, accession number, GC3S and the first two axis values in CA of 11 selected viruses in order Nidoviralesa, b

Organisma	Accession number	GC_3S(%)	f₁′b	f₂′b

Coronavirus
HCoV 229E	NC_002645	30.89	−0.84	−0.16
PEDV	NC_003436	37.32	−0.04	0.42
TGV	NC_002306	27.02	−0.99	−0.08
BCoV	NC_003045	29.43	−0.75	0.48
MHV	NC_001846	38.30	−0.16	0.27
AIBV	NC_001451	26.09	−0.90	−1.30
SARSCoV	NC_004718	37.23	0.05	0.36
Arterivirus
EAV	NC_002532	47.28	0.80	0.47
LDEV	NC_002534	45.18	0.53	0.43
PRRSV	NC_001961	53.76	1.31	0.55
SHFV	NC_003092	48.43	1.09	−0.14

Organism abbreviation: HCoV 229E, human Coronavirus 229E; PEDV, porcine epidemic diarrhea virus; TGV, transmissible gastroenteritis virus; BCoV, bovine Coronavirus; MHV, murine hepatitis virus; AIBV, avian infectious bronchitis virus; SARSCoV, SARS Coronavirus; EAV, equine arteritis virus; LDEV, lactate dehydrogenase elevating virus; PRRSV, porcine reproductive and respiratory syndrome virus; SHFV, simian hemorrhagic fever virus.

f1′ and f2′, respectively, represent the first axis mean value and the second axis mean value in CA of each genome.

Identified ORFs (length > 150 bps) in the SARSCoV (TOR2 isolation) genomea, b L represents the length of identified ORF. f1′ represent the first axis values of each gene in CA. Phylogenetic breakdown, accession number, GC3S and the first two axis values in CA of 11 selected viruses in order Nidoviralesa, b Organism abbreviation: HCoV 229E, human Coronavirus 229E; PEDV, porcine epidemic diarrhea virus; TGV, transmissible gastroenteritis virus; BCoV, bovine Coronavirus; MHV, murine hepatitis virus; AIBV, avian infectious bronchitis virus; SARSCoV, SARS Coronavirus; EAV, equine arteritis virus; LDEV, lactate dehydrogenase elevating virus; PRRSV, porcine reproductive and respiratory syndrome virus; SHFV, simian hemorrhagic fever virus. f1′ and f2′, respectively, represent the first axis mean value and the second axis mean value in CA of each genome.

Methods

Synonymous codon usage measures (RSCU)

Relative synonymous codon usage values of each codon in a gene were used to examine the synonymous codon usage without the confounding influence of amino acid composition (Sharp and Li, 1986). N3S, the frequency of base N at synonymous third codon positions, was also used to calculate the extent of base composition bias. Additionally, the effective number of codons of a gene (ENC) was used to quantify the codon usage bias of a gene (Wright, 1990), which is the best overall estimator of absolute synonymous codon usage bias (Comeron and Aguade, 1998). ENC value ranges from 20 (when only one codon is used per amino acid) to 61 (when all synonymous codons are equally used for each amino acid).

Correspondence analysis (CA)

Correspondence analysis was used to investigate the major trend in codon usage variation among genes. Each gene is represented as a 59 dimensional vector, and each dimension corresponds to the RSCU value of one sense codon (excluding AUG, UGG and three stop codons). CA based on RSCU values relies on two main steps (Mardia et al., 1979). The first step is to measure the similarities in codon usage using the squared Euclidean distance among all genes, and the resulting distance table will be used to compute the coordinates of the genes in a multidimensional space. The second step provides the visualization of these Euclidean distances through positioning genes by successive orthogonal projections of the cloud of points. Essentially, this process consists in finding the linear transformations f 1′,f 2′,…,f 58′ of the original variables f 1′,f 2′,…,f 59′. The f′-variables are calculated and ordered according to the values of relative variance. f 1′ is the maximum value; f 2′ is the next value and is by construction not correlated with f 1′. The same applies to f 3′,f 4′, and so on, until f 58′. So, genes with similar codon usage are neighbors on the components of projection.

Statistical methods

Linear regression analysis was used to find the correlation between codon usage bias and nucleotide composition. One tailed t-test was used to compare the variation of codon usage between different gene groups (Ewens and Grant, 2001). As a null hypothesis, it is assumed that mean values of codon usage indices in different gene groups is statistically the same. Under the null assumption, t-statistic could be calculated. Then, P-value is derived and it is taken as significance when P-value is below 0.05. A C++ program was developed to calculate the codon usage indices for each gene. CA and other statistical analysis were performed with statistical software SPSS 11.0.

Results

Synonymous codon usage in SARSCoV

The details of coding genes in SARSCoV and the overall RSCU values of 61 sense codons in SARSCoV were, respectively, shown in Table 1, Table 3 . All preferentially used codons in SARSCoV are all A-ended or U-ended codons (Table 3). SARSCoV is a GC poor genome with GC content of 37.52%. Due to compositional constraints, it is expected that A-ended and/or U-ended codons should be preferentially used in this genome. To study the codon usage variation among different SARSCoV genes, ENC and GC3S values of different SARSCoV genes were calculated (Table 1). ENC values of different SARSCoV genes vary from 42.19 to 59.06, with a mean value of 48.99 and S.D. of 6.41. Because all ENC values of SARSCoV genes are much higher (ENC>40), codon usage bias in SARSCoV genome is a little slight. However, there is a marked variation in codon usage pattern among different SARSCoV genes (S.D.=6.41). Similarly, GC3S values of each SARSCoV gene also confirm the heterogeneity of synonymous codon usage among different SARSCoV genes, which range from 28.3 to 58.1% with a mean of 37.23 and S.D. of 8.78%.

Table 3

Synonymous codon usage in SARSCoVa, b, c

AAa	Codon	RSCU	Nb	AAa	Codon	RSCU	Nb

Ala	GCU	2.08	531	Ile	AUU	1.72	410
	GCC	0.58	147		AUC	0.67	159
	GCA	1.13	288		AUA	0.62	148
	GCG	0.22	55	Cys	UGU	1.27	280
Gly	GGG	0.17	37		UGC	0.73	160
	GGA	0.85	182	Thr	ACU	1.66	427
	GGC	0.95	202		ACC	0.59	153
	GGU	2.02	431		ACG	0.18	46
Val	GUU	1.71	479		ACA	1.57	406
	GUC	0.67	188	Asn	AAU	1.24	449
	GUA	0.83	232		AAC	0.76	277
	GUG	0.78	219	Gln	CAA	1.16	298
Leu	UUA	1.04	238		CAG	0.84	214
	UUG	1.10	251	Tyr	UAU	1.12	345
	CUU	1.79	409		UAC	0.88	270
	CUC	0.83	191	His	CAU	1.29	187
	CUA	0.64	147		CAC	0.71	103
	CUG	0.60	138	Asp	GAU	1.24	463
Phe	UUC	0.77	260		GAC	0.76	282
	UUU	1.23	414	Glu	GAA	1.04	354
Pro	CCU	1.74	247		GAG	0.96	326
	CCC	0.40	57	Lys	AAA	1.04	421
	CCA	1.70	241		AAG	0.96	388
	CCG	0.16	22	Arg	CGU	1.77	153
Ser	UCU	1.96	310		CGC	0.72	62
	UCC	0.42	67		CGA	0.44	38
	UCA	1.70	270		CGG	0.09	8
	UCG	0.23	36		AGA	2.08	180
	AGU	1.17	186		AGG	0.90	78
	AGC	0.52	82

AA is the abbreviation of amino acid.

N represents the number of occurrence of each sense codon.

The preferentially used codons for each amino acid are displayed in bold.

Synonymous codon usage in SARSCoVa, b, c AA is the abbreviation of amino acid. N represents the number of occurrence of each sense codon. The preferentially used codons for each amino acid are displayed in bold.

Synonymous codon usage in different viruses is virus specific, but not host specific

CA was implemented for all identified ORFs from each of the 11 virus genomes as a single dataset, which consists of 103 coding sequences. CA detected one major trend in the first axis which accounted for 15.40% of the total variation, and none of the other axes individually accounted for more than 7.60% of the total variation. A plot of the first axis and the second axis of each gene was shown in Fig. 1 . Although this graph is a little complex with some overlap among genes from different genomes, it is clear that genes from a particular genome tend to cluster together. The separation of one virus genome from other virus genomes is determined to be significant on both axes (t-test, P-value <10−15 on the first axis and P-value <10−3 on the second axis). So, similar to codon usage in mammals and bacteria, synonymous codon usage in these viruses is also virus specific.

Fig. 1

A plot of the values of the first axis and the second axis of each gene in CA (abbreviations of the viruses: AIBV, avian infectious bronchitis virus; BCoV, bovine Coronavirus; EAV, equine arteritis virus; HCoV 229E, human Coronavirus 229E; LDEV, lactate dehydrogenase elevating virus; MHV, murine hepatitis virus; PEDV, porcine epidemic diarrhea virus; PRRSV, porcine reproductive and respiratory syndrome virus; SARSCoV, SARS Coronavirus; SHFV, simian hemorrhagic fever virus; TGV, transmissible gastroenteritis virus. f1′ and f2′, respectively, represent the values of the first and the second axis of each gene in CA). To show whether there is a correlation between virus codon usage and its host, these 103 virus genes were divided into several groups according to the virus host. For example, because both SARSCoV genes and human Coronavirus 229E infect human, genes in these two viruses were incorporated as a group. Next, t-test was also used to test whether the separation of different viral genes which infect different hosts is significant. The P-value is 0.57 on the first axis and is 0.08 on the second axis, which suggested that codon usage in different virus genes was not host specific.

Phylogenetic analysis of these viruses based on codon usage pattern

In Fig. 1, all virus genes in the genus Coronavirus were plotted in red. At the same time, all viral genes in the genus Arterivirus were plotted in blue. Coronavirus genes are mainly located on the left side of the plot, while a majority of Arterivirus genes are located on the right side. The separation of Coronavirus genes and Arterivirus genes on the first axis is statistically significant (t-test, P-value <10−15). Hence, synonymous codon usage appears to be conserved between phylogenetically related viruses. Also, SARSCoV genes were widely extended in the first axis (Fig. 1). Six of eleven SARSCoV genes were located in the cluster of Coronavirus genes, while the other five SARSCoV genes were located in the cluster of Arterivirus genes. Therefore, SARSCoV might have been diverged far from all three known Coronavirus groups. Comparing with all other viruses in the genus Coronavirus, it might be more evolutionary related to the genus Arterivirus.

Mutational bias is the main factor determines the codon usage variation among different viruses

Linear regression analysis was implemented to find whether there is some correlation between synonymous codon usage bias and nucleotide compositions. The R 2 value and significance level of these regression analyses was listed in Table 4 . The first axis value of each gene in CA is closely correlated with all the base compositions on the third codon position, while the second axis of each gene is correlated with some base compositions on the third codon position to a certain extent. Therefore, compositional constraint mainly determines the variation of synonymous codon usage among these virus genes.

Table 4

Summary of linear regression analysis between the first two axes in CA and the nucleotide contents on the third codon position in all selected virus genesa

Base composition	f₁′b	f₂′b

A_3S	0.791****	0.085*
T_3S	0.239****	0.444****
G_3S	0.484****	0.082*
C_3S	0.720****	0.0001^NS
GC_3S	0.936****	0.018^NS

NS in superscript represent non-significant.

Value in this table is the R2 value of each linear regression analysis.

f1′ and f2′, respectively, represent the values of the first and the second axis of each gene in CA.

P-value <0.01.

P-value <0.00001.

Summary of linear regression analysis between the first two axes in CA and the nucleotide contents on the third codon position in all selected virus genesa NS in superscript represent non-significant. Value in this table is the R2 value of each linear regression analysis. f1′ and f2′, respectively, represent the values of the first and the second axis of each gene in CA. P-value <0.01. P-value <0.00001. Furthermore, we plotted the first axis values in CA and GC3S values of each gene (Fig. 2 ). The GC3S mean value of genes in coronaviruses ranges from 26.09 to 37.32, and it ranges from 45.18 to 53.76 in arteriviruses (Table 2). Although codon usage bias appears to be conserved between evolutionary related viruses (Section 3.3), the patterns of codon usage in different virus genes also appear to be a direct function of the GC content on the third codon position of these genes.

Fig. 2

A dot plot of the first axis value in correspondence analysis and GC3S of each gene (f1′ denotes the first axis value in correspondence analysis of each gene, and GC3S denotes the G+C content on the third synonymous codon position of each gene).

Gene function also drives the codon usage variation among different viruses

The plot of ENC and GC3S is another effective way to explore codon usage variation among genes (Wright, 1990). ENC values of each virus gene were plotted against its corresponding GC3S (Fig. 3 ). The solid line represents the curve if codon usage is only determined by GC content on the third codon position. A large proportion of points lie near to the solid line on the left region of this distribution. It also suggests that mutational bias is the main factor determines the codon usage variation among these genes. However, there are also some points lying below the expected curve. Hence, other than mutational bias, there might be some additional factors drive the codon usage variation among these genes.

Fig. 3

ENC vs. GC3S plot of all virus genes (ENC denotes the effective number of codon of each gene, and GC3S denotes the G+C content on the third synonymous codon position of each gene. The solid line represents the relationship between GG3S and ENC under random codon usage assumption). To show whether translational selection or gene function were correlated with the observed variation in codon bias, all virus genes were grouped into several classes according to gene function. Because most of these viruses contain genes coding for RNA polymerase, envelop protein and structural glycoprotein, these three gene groups were selected to find whether there is some correlation between codon usage and gene function. One tailed t-test was then performed on ENC values of these genes with the hypothesis that there is no correlation between codon usage bias and gene function. Some associations have been found. Average codon usage bias is higher in RNA polymerase gene group than in envelop gene group (t-test, P-value=0.031), and it is higher in polymerase gene group than in structural glycoprotein gene group (t-test, P-value=0.002). But, there is no association between codon usage in structural glycoprotein gene group and envelop protein gene group (t-test, P-value=0.74). Because the structural glycoprotein and envelop protein are all structural proteins in these viruses and RNA polymerase is a nonstructural protein, it is clear that codon usage in structural genes is significantly diverged from that in nonstructural genes. On the other hand, structural genes are generally highly expressed than nonstructural genes. So, if translational selection was also contributed to codon usage bias in these genes, codon usage bias in structural genes should be higher than in RNA polymerase genes. However, RNA polymerase genes (ENC=49.25) were found to have greater codon usage bias than structural genes (ENC=54.60 for envelop gene and ENC=55.33 for structural glycoprotein). Hence, codon usage bias in these virus genes is not related to gene expression level. Furthermore, we also performed a linear regression analysis on ENC value and gene length of each gene. But, there was no significant correlation between codon usage and gene length in these virus genes (P-value>0.05). So, gene function, rather than translational selection and gene length, is another factor accounting for codon usage variation among these virus genes.

Discussion

Our analysis revealed that synonymous codon usage bias in SARSCoV was less biased, which was mainly determined by the base compositions on the third codon position. Comparative analysis of codon usage bias in the order Nidovirales also suggested that codon usage in these viruses was virus specific and mutational bias was the main factor drives the codon usage variation among these viruses. Gene function was also related to codon usage bias in these viruses to some extent. But, translational selection and gene length might have no effect on the codon usage pattern in these viruses. Some published results has shown that the overall extent of codon usage bias in RNA viruses is low and there is little variation in bias between genes (Levin and Whittome, 2000, Jenkins et al., 2001, Jenkins and Holmes, 2003). Although SARSCoV is a newly detected RNA virus infecting human, the synonymous codon usage pattern in SARSCoV we described here is also in accordance with these published codon usage pattern of human RNA viruses (Jenkins and Holmes, 2003). Because mutation rates in RNA viruses are much higher than those in DNA viruses (Drake and Holland, 1999), it is understandable that mutation pressure is the main determinant of codon usage bias in SARSCoV. Our analysis also revealed that there was no host specific codon usage pattern in these viruses. So, host genome might have no obvious effect on the evolution of these viruses. Some phylogenetic analysis of SARSCoV (Qin et al., 2003, Marra et al., 2003) has shown that SARSCoV does not closely resemble any of the three previously known groups in genus Coronavirus. But Snijder et al. (2003) has proposed that SARSCoV is most closely related to group 2 Coronaviruses. Based on different codon usage patterns in different coronaviruses, we revealed that codon usage patterns of each virus was phylogenetically distinct and SARSCoV might have been diverged far from all three known Coronavirus groups, which is in accordance with the results Qin et al. (2003) and Marra et al. (2003) proposed. Codon usage patterns and the phylogenetic results we proposed here are useful to understand the processes governing the evolution of SARSCoV, especially the roles played by mutation pressure and natural selection. Further, such information might be helpful to understand the pathogenesis and the origin of SARSCoV.

39 in total

1. Codon usage as a tool to predict the cellular location of eukaryotic ribosomal proteins and aminoacyl-tRNA synthetases.

Authors: H Chiapello; E Ollivier; C Landès-Devauchelle; P Nitschké; J L Risler
Journal: Nucleic Acids Res Date: 1999-07-15 Impact factor: 16.971

2. A functional significance for codon third bases.

Authors: R J Epstein; K Lin; T W Tan
Journal: Gene Date: 2000-03-21 Impact factor: 3.688

3. Relationship of codon bias to mRNA concentration and protein length in Saccharomyces cerevisiae.

Authors: A Coghlan; K H Wolfe
Journal: Yeast Date: 2000-09-15 Impact factor: 3.239

4. Codon usage in nucleopolyhedroviruses.

Authors: David B Levin; Beatrixe Whittome
Journal: J Gen Virol Date: 2000-09 Impact factor: 3.891

5. Characterization of a novel coronavirus associated with severe acute respiratory syndrome.

Authors: Paul A Rota; M Steven Oberste; Stephan S Monroe; W Allan Nix; Ray Campagnoli; Joseph P Icenogle; Silvia Peñaranda; Bettina Bankamp; Kaija Maher; Min-Hsin Chen; Suxiong Tong; Azaibi Tamin; Luis Lowe; Michael Frace; Joseph L DeRisi; Qi Chen; David Wang; Dean D Erdman; Teresa C T Peret; Cara Burns; Thomas G Ksiazek; Pierre E Rollin; Anthony Sanchez; Stephanie Liffick; Brian Holloway; Josef Limor; Karen McCaustland; Melissa Olsen-Rasmussen; Ron Fouchier; Stephan Günther; Albert D M E Osterhaus; Christian Drosten; Mark A Pallansch; Larry J Anderson; William J Bellini
Journal: Science Date: 2003-05-01 Impact factor: 47.728

6. What drives codon choices in human genes?

Authors: S Karlin; J Mrázek
Journal: J Mol Biol Date: 1996-10-04 Impact factor: 5.469

7. Evolution of base composition and codon usage bias in the genus Flavivirus.

Authors: G M Jenkins; M Pagel; E A Gould; P M de A Zanotto; E C Holmes
Journal: J Mol Evol Date: 2001-04 Impact factor: 2.395

8. The Genome sequence of the SARS-associated coronavirus.

Authors: Marco A Marra; Steven J M Jones; Caroline R Astell; Robert A Holt; Angela Brooks-Wilson; Yaron S N Butterfield; Jaswinder Khattra; Jennifer K Asano; Sarah A Barber; Susanna Y Chan; Alison Cloutier; Shaun M Coughlin; Doug Freeman; Noreen Girn; Obi L Griffith; Stephen R Leach; Michael Mayo; Helen McDonald; Stephen B Montgomery; Pawan K Pandoh; Anca S Petrescu; A Gordon Robertson; Jacqueline E Schein; Asim Siddiqui; Duane E Smailus; Jeff M Stott; George S Yang; Francis Plummer; Anton Andonov; Harvey Artsob; Nathalie Bastien; Kathy Bernard; Timothy F Booth; Donnie Bowness; Martin Czub; Michael Drebot; Lisa Fernando; Ramon Flick; Michael Garbutt; Michael Gray; Allen Grolla; Steven Jones; Heinz Feldmann; Adrienne Meyers; Amin Kabani; Yan Li; Susan Normand; Ute Stroher; Graham A Tipples; Shaun Tyler; Robert Vogrig; Diane Ward; Brynn Watson; Robert C Brunham; Mel Krajden; Martin Petric; Danuta M Skowronski; Chris Upton; Rachel L Roper
Journal: Science Date: 2003-05-01 Impact factor: 47.728

9. Codon usage in regulatory genes in Escherichia coli does not reflect selection for 'rare' codons.

Authors: P M Sharp; W H Li
Journal: Nucleic Acids Res Date: 1986-10-10 Impact factor: 16.971

10. Correlation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genes: a proposal for a synonymous codon choice that is optimal for the E. coli translational system.

Authors: T Ikemura
Journal: J Mol Biol Date: 1981-09-25 Impact factor: 5.469

83 in total

1. Analysis of synonymous codon usage bias and phylogeny of coat protein gene in banana bract mosaic virus isolates.

Authors: Atul B Patil; Vijayendra S Dalvi; Akhilesh A Mishra; Bal Krishna; Abdul Azeez
Journal: Virusdisease Date: 2017-05-18

2. Dinucleotide Composition in Animal RNA Viruses Is Shaped More by Virus Family than by Host Species.

Authors: Francesca Di Giallonardo; Timothy E Schlub; Mang Shi; Edward C Holmes
Journal: J Virol Date: 2017-03-29 Impact factor: 5.103

3. Reply to "codon usage frequency of RNA virus genomes from high-temperature acidic-environment metagenomes".

Authors: Mark Young; Benjamin Bolduc; Daniel P Shaughnessy; Francisco F Roberto; Yuri I Wolf; Eugene V Koonin
Journal: J Virol Date: 2013-02 Impact factor: 5.103

4. Compare the differences of synonymous codon usage between the two species within cardiovirus.

Authors: Wen-qian Liu; Jie Zhang; Yi-qiang Zhang; Jian-hua Zhou; Hao-tai Chen; Li-na Ma; Yao-zhong Ding; Yongsheng Liu
Journal: Virol J Date: 2011-06-27 Impact factor: 4.099

5. Analysis of synonymous codon usage in foot-and-mouth disease virus.

Authors: Jian-Hua Zhou; Jie Zhang; Hao-Tai Chen; Li-Na Ma; Yong-Sheng Liu
Journal: Vet Res Commun Date: 2010-04-28 Impact factor: 2.459

6. Analysis of synonymous codon usage in the UL24 gene of duck enteritis virus.

Authors: Renyong Jia; Anchun Cheng; Mingshu Wang; Hongyi Xin; Yufei Guo; Dekang Zhu; Xuefeng Qi; Lichan Zhao; Han Ge; Xiaoyue Chen
Journal: Virus Genes Date: 2008-10-29 Impact factor: 2.332

7. Viral adaptation to host: a proteome-based analysis of codon usage and amino acid preferences.

Authors: Iris Bahir; Menachem Fromer; Yosef Prat; Michal Linial
Journal: Mol Syst Biol Date: 2009-10-13 Impact factor: 11.429

8. Comparative study of synonymous codon usage variations between the nucleocapsid and spike genes of coronavirus, and C-type lectin domain genes of human and mouse.

Authors: Insung Ahn; Byeong-Jin Jeong; Hyeon Seok Son
Journal: Exp Mol Med Date: 2009-10-31 Impact factor: 8.718

9. Synonymous codon usage analysis of thirty two mycobacteriophage genomes.

Authors: Sameer Hassan; Vasantha Mahalingam; Vanaja Kumar
Journal: Adv Bioinformatics Date: 2010-02-01

10. Analysis of synonymous codon usage in Newcastle disease virus hemagglutinin-neuraminidase (HN) gene and fusion protein (F) gene.

Authors: Hong-Wei Cao; De-Shan Li; Hua Zhang
Journal: Virusdisease Date: 2013-11-12