Literature DB >> 32395662

Codon usage similarity between viral and some host genes suggests a codon-specific translational regulation.

Kunlakanya Jitobaom1, Supinya Phakaratsakul1, Thanyaporn Sirihongthong1, Sasithorn Chotewutmontri2, Prapat Suriyaphol3,4, Ornpreya Suptawiwat2, Prasert Auewarakul1.   

Abstract

The codon usage pattern is a specific characteristic of each species; however, the codon usage of all of the genes in a genome is not uniform. Intriguingly, most viruses have codon usage patterns that are vastly different from the optimal codon usage of their hosts. How viral genes with different codon usage patterns are efficiently expressed during a viral infection is unclear. An analysis of the similarity between viral codon usage and the codon usage of the individual genes of a host genome has never been performed. In this study, we demonstrated that the codon usage of human RNA viruses is similar to that of some human genes, especially those involved in the cell cycle. This finding was substantiated by its concordance with previous reports of an upregulation at the protein level of some of these biological processes. It therefore suggests that some suboptimal viral codon usage patterns may actually be compatible with cellular translational machineries in infected conditions.
© 2020 The Authors.

Entities:  

Keywords:  Cell cycle; Codon usage pattern; Gene expression; Genetics; Microbiology; RNA viruses; RSCU; Translation; Viral genetics; Virology; Viruses

Year:  2020        PMID: 32395662      PMCID: PMC7205639          DOI: 10.1016/j.heliyon.2020.e03915

Source DB:  PubMed          Journal:  Heliyon        ISSN: 2405-8440


Introduction

The genetic code is degenerate. There are 61 triplet codons coding for 20 amino acids and 3 stop codons [1]. Therefore, each amino acid is encoded by several codons, with the exception of two amino acids (methionine and tryptophan). This codon redundancy results in synonymous codon usage, whereby one amino acid is encoded by 2, 4, or 6 codons [2]. Several previous studies have revealed that synonymous codons are utilized with different frequencies and are not randomly used by different genomes or genes. This non-randomness is referred to as codon usage bias [3, 4]. Each species preferentially uses different synonymous codons [5]. This results in a species-specific codon usage bias. Similarly, there are several tRNA species that carry the same amino acid. These tRNA species are called isoacceptors [6]. Codons and anti-codons in tRNA do not interact in a one-to-one fashion [7]. Base pairing at the third codon position is wobble; for example, G can pair with both cytosine (C) and uracil (U) [8]. It has been demonstrated that tRNA modification directly affects tRNA and mRNA wobble base pairing [9, 10]. Both the available tRNA isoacceptors and tRNA modification change with the cell cycle, and they can be altered by cellular stresses [11, 12]. For efficient protein translation, the codon usage pattern should correlate with the population of available tRNA isoacceptors [13]. In the cellular stress-response, the alteration could enhance the expression of stress-response genes, with the codon usage patterns compatible with the changed tRNA modifications. Those genes shown to be regulated by this codon-specific manner are called Modification Tunable Transcripts (MoTTs) [12, 14]. Two major models have been proposed to explain the causes of codon usage bias: mutation pressure, and translational selection [15]. As to mutation pressure, it is believed that GC content is the major factor driving codon usage bias [16, 17]. The high mutation rates of some nucleotides or codons result in nucleotide substitution that might contribute to lower frequencies of some nucleotides and codons [15]. Mutation pressure has been suggested to be the most important factor determining the codon usage bias in human RNA viruses [18, 19, 20]. However, there are correlations between the codon usage bias and other factors related to translation efficiency (such as available tRNAs, mRNA secondary structure, translation elongation rate, and the intragenic and intergenic codon bias) that cannot be explained by mutation pressure. This suggests that translational selection also influences codon usage bias [15]. The translational selection acts on codon usage bias to achieve efficient and accurate translation. The use of codons correlates with abundant tRNAs, resulting in a higher translation rate [21, 22, 23]. A correlation between codon usage bias and abundant tRNAs has been found in prokaryotes (such as E. coli [24]) and in some eukaryotes (such as S. cerevisiae [25], C. elegans [22], Drosophila [23], and human [26]). However, rare codons are preferred to encode some specific sets of genes or regions of genes, for instance, to enable protein oscillation in different phases of the cell cycle, slow down protein translation across the membrane, and reduce ribosome jamming and mRNA secondary structure at the 5′end of coding sequences [11, 27, 28, 29]. Therefore, the translational selection acting on the optimization of frequent and rare codon utilization is important in appropriate gene translation. Viral replication is dependent on the cellular machineries of the host cells. Thus, one would intuitively think that the codon usage of a viral genome should match that of its host in order to be efficiently expressed. Surprisingly, however, most viruses have codon usage patterns that are different from the codon usage preference of their hosts [30, 31, 32, 33]. A previous study indicated an alteration in the cellular tRNA level after the infection of human immunodeficiency virus type 1 (HIV-1) [34]. In contrast, the cellular tRNA level was found to be unchanged following vaccinia and influenza A virus (IAV) infection, whereas an alteration in the polysome-associated tRNA population was observed, particularly the population of polysome tRNA isoacceptors correlated with viral codon usage [35]. These findings suggest that the codon usage pattern and the regulation of translational machineries may influence gene expression in some viruses. In this study, we investigated the relationship between the codon usage bias of human genes and human RNA viruses. It is generally believed that the codon usage bias of viruses differs from that of human genes; however, various human genes possess various codon usage patterns [19, 26]. In addition, intragenic codon biases had previously been reported in humans and mice [36]. A comparison of the codon usage at the genome level is therefore too generalized; a more precise comparison at the single-gene level may provide a better insight into the viral codon usage bias.

Results

Principal component analysis of the relative synonymous codon usage (PCA of RSCU)

A total of 20,190 major transcript variants of human protein coding sequences were recruited from the GENCODE database (version 26). The protein coding sequences of 77 human RNA viruses were downloaded from the NCBI database. The open reading frames (ORFs) of the protein coding sequences were rechecked by ORFfinder (NCBI) before performing the RSCU calculation. The RSCU is a simple parameter that represents the codon usage bias of synonymous codons in a coding sequence. In our analysis, the RSCU was calculated from the protein coding sequences of the human and RNA viruses. The RSCU of each gene consists of 59 values corresponding to 59 synonymous codons; thus, the PCA was performed to simplify the data to a smaller number of principal factors as a summary feature of the codon usage pattern of each gene. The PCA successfully reduced the 59 values of each RSCU into two significant components. The RSCUs of the human genes and RNA viruses were represented by the coordinates of principal component 1 (PC1, x-axis) and principal component 2 (PC2, y-axis) plotted on the PCA of an RSCU graph (Figure 1). The RSCU and PCA of the RSCUs of the human genes and human RNA viruses are shown in Supplementary File 1.
Figure 1

The PCA of the RSCU. The RSCU of human genes and RNA viruses were subjected to PCA. Then, the simplified RSCU values of human genes and RNA viruses were plotted on the graph as the coordinates of component 1 (x) and 2 (y). The color keys indicate the groups of RNA viruses. Human genes are represented using transparent black dots.

The PCA of the RSCU. The RSCU of human genes and RNA viruses were subjected to PCA. Then, the simplified RSCU values of human genes and RNA viruses were plotted on the graph as the coordinates of component 1 (x) and 2 (y). The color keys indicate the groups of RNA viruses. Human genes are represented using transparent black dots. In Figure 1, the PCAs of the RSCUs of the human genes were represented with a transparent black dot; genes with a similar RSCU were located in the same area of the graph. The number of human gene located in each quadrant was counted: upper left, 6,020 genes; lower left, 4,227 genes; upper right, 4,664 genes; and lower right, 5,281 genes. Interestingly, many human genes were located densely in the right quadrants, specifically, between (x = 0.95 to 1.7) and (y = -0.7 to 0.6). Most RNA viruses were also located in the right quadrants. Additionally, negative sense-single strand RNA viruses (-ssRNA), ambisense RNA viruses (Ambi), and HIV-1 viruses were located in the area of the right quadrants where the human genes were densely located. However, a great variation was observed in some groups of RNA viruses, especially positive-sense single strand RNA viruses (+ssRNA), doubled strand RNA viruses (dsRNA), and retroviruses (Retro). The relationship between the PCA of the RSCU analysis and the codon adaptive index (CAI) was investigated. CAI is a common parameter used to assess the codon usage bias of a gene. It is calculated from the frequency of the overall codons in a given protein coding sequence with respect to a reference set of genes [37]. In our analysis, the human codon usage table, which is the average codon frequency of a human genome, was used as the reference set. From the results (Figure 1), the PCA of the RSCU of the human codon usage table was plotted near the x and y intercepts, showing the average codon usage pattern of all human genes. A number of human genes in the PCA of the RSCU graph were selected and subjected to the CAI calculation. The graphs of PC1 and CAI, of PC2 and CAI, were plotted; the linear regression and Pearson correlation coefficient (PCC) were subsequently analyzed. From Figure 2A, it was found that the CAI of genes gradually decreased with an increase in PC1 (R2 = 0.7958, PCC = -0.8921, p-value <0.0001), while a positive correlation was observed between CAI and PC2 (R2 = 0.5824, PCC = 0.7631, p-value < 0.0001). The percentages of the GC content at the third position of the codon (%G+C(3)) of the human genes were also determined. In a similar way to CAI (Figure 2B), %G+C(3) gradually decreased with an increase in PC1 (R2 = 0.9459, PCC = -0.9726, p-value < 0.0001). A weak correlation between PC2 and %G+C(3) was observed (R2 = 0.4373, PCC = 0.6613, p-value < 0.0001). Thus, the PCA of the RSCU analysis could be used to characterize the heterogeneity of the codon usage bias in the human genome, in which genes in the left-upper quadrant contain more optimal codon usage for high expression, whereas those in the right quadrants near or below the x-axis have less optimal codon usage.
Figure 2

The correlation between principal components (PC1, PC2), codon adaptive index (CAI), and percentage of GC content at the third position of codon (%G+C(3)) were determined using simple linear regression and Pearson correlation coefficient. (A) CAI was plotted with either PC1 (R2 = 0.7958, PCC = -0.8921, p < 0.001) or PC2 (R2 0.5824, PCC = 0.7631, p < 0.001). (B) %G+C(3) was plotted with either PC1 (R2 = 0.9459, PCC = -0.9726, p < 0.001) or PC2 (R2 = 0.4373, PCC = 0.6613, p < 0.001).

The correlation between principal components (PC1, PC2), codon adaptive index (CAI), and percentage of GC content at the third position of codon (%G+C(3)) were determined using simple linear regression and Pearson correlation coefficient. (A) CAI was plotted with either PC1 (R2 = 0.7958, PCC = -0.8921, p < 0.001) or PC2 (R2 0.5824, PCC = 0.7631, p < 0.001). (B) %G+C(3) was plotted with either PC1 (R2 = 0.9459, PCC = -0.9726, p < 0.001) or PC2 (R2 = 0.4373, PCC = 0.6613, p < 0.001). The PCAs of RSCUs of the human genes that coded for highly expressed proteins were plotted on a graph. The highly expressed proteins of humans had been previously identified using the proteomic approach (Figure 1; see the gene list in Supplementary Table 1). [38]. From Figure 1, most of the highly expressed proteins were located in the left quadrants. This is in agreement with our analysis showing the relationship between PCA and CAI, and it supports the validity of using PCA to predict CAI.

Human genes with codon usage bias similar to RNA viruses

From the PCA of the RSCU graph (Figure 1), it was demonstrated that most of the human RNA viruses were located in the right quadrants. The degree of difference in the codon usage pattern varied among the groups of viruses. This feature was also observed intragroup. The highest PC1 (x) belonged to rotavirus, indicating a high degree of difference in codon usage pattern compared to human genes and other RNA viruses. In particular, there were a number of human genes with RSCUs similar to RNA viruses (Figure 1), especially -ssRNA, Ambi, and HIV viruses, which were located in the right quadrants, where human genes were also located densely. To investigate the kinds of human genes with codon usage patterns similar to RNA viruses and the contributions of those genes in important biological processes, the human genes plotted in the same area with each subgroup of RNA viruses were retrieved. The criteria for selection of the human genes with RSCUs similar to RNA viruses are described in the methods section. Figure 3 represents the selected human genes with RSCUs similar to RNA viruses. These human genes were subjected to gene ontology (GO) enrichment analysis using GO-TermFinder to identify the overrepresented GO terms in biological processes [39]. REVIGO was then used to categorize the redundant GO terms [40]. The number of human genes with RSCUs similar to RNA viruses are listed in Table 1. The results (Figure 4) show the significant GO terms in the biological processes of human genes with RSCUs similar to RNA viruses. Only human genes retrieved from six subgroups of RNA viruses resulted in significant enrichment, namely, +ssRNA (subgroups 6, 7), -ssRNA (subgroups 3, 4), Retro (HIV-1), and Ambi (subgroup 4), where the human genes recruited from +ssRNA (subgroups 6, 7), –ssRNA (subgroup 4), Retro (HIV-1), and Ambi (subgroup 4) were in the same or adjacent area. The human genes with RSCUs similar to +ssRNA (subgroup 6), –ssRNA (subgroup 4), Retro (HIV-1), and Ambi (subgroup 4) shared similar GO terms in biological processes, including the cell cycle, the regulation of the cell cycle process, cell division, microtubule cytoskeletal organization, chromosome segregation, DNA repair, macromolecule catabolism, and cellular localization (Fig. 4A, B, D, E, and F), while human genes with RSCUs similar to +ssRNA (subgroup 3) were related to RNA processing (Figure 4C). The list of enriched GO terms in biological processes is shown in Supplementary File 2.
Figure 3

The human genes with RSCUs similar to RNA viruses in each subgroup are represented using transparent colored dots.

Table 1

Categories of human RNA viruses. RNA viruses were categorized based on the nucleic acid types of their genomes. The viruses in each group were divided into subgroups based on their PCA of RSCU. In subgroups containing more than one virus, the mean virus RSCU (mean vRSCU [a, b]) was calculated. The number of human genes with RSCUs similar to RNA viruses were also represented.

RNA virusesSubgroupsMembersPCA of RSCU component
No. of human genes with similar RSCU
1 (x)2 (y)
+ssRNA1Rubella virus-1.11152-2.3869367
2Hepatitis G virus-0.20805-1.33632
Hepatitis C virus-0.38012-0.86880
Mean vRSCU-0.29409-1.10256192
3Hepatitis E virus0.05314-1.99911
Ross river virus0.08755-1.48581
Chikungunya virus0.21340-1.65931
Sindbis virus0.25123-1.94371
Eastern equine encephalitis virus0.48833-1.30690
Western equine encephalitis virus0.47744-1.79137
Venezuelan equine encephalitis virus0.42568-1.26628
O'nyong nyong virus0.56417-1.63171
Mean vRSCU0.32012-1.63553101
4Enterovirus 710.64889-0.88714
Human coxsackievirus A90.54435-0.52889
Human coxsackievirus B40.65238-0.59692
Echoviruses0.57004-0.57533
Polioviruses0.76484-0.35123
Norwalk virus0.77184-0.52468
Sapporo virus0.62754-0.94239
Dengue virus type 10.91914-0.19005
Dengue virus type 20.856380.19119
Dengue virus type 30.88466-0.24134
Dengue virus type 40.80321-0.02901
Japanese encephalitis virus0.34886-0.42656
Murray Valley encephalitis virus0.76079-0.71468
St. Louis encephalitis virus0.61121-0.16698
West Nile virus0.42713-0.40840
Yellow fever virus0.55894-0.03048
Zika virus0.463680.08730
Kyasanur Forest disease virus0.26948-0.22710
Omsk hemorrhagic fever virus0.25430-0.25169
Tick-borne encephalitis virus0.15574-0.30465
Mean vRSCU0.59467-0.35595659
+ssRNA5SARS coronavirus1.38814-1.47854
MERS coronavirus1.40490-1.39321
Human astrovirus1.21324-1.04323
Mean vRSCU1.33543-1.30499169
6Rhinovirus A,B and C1.46621-0.28559
Human parechovirus1.48761-0.44531
Enterovirus 681.50423-0.49488
Mean vRSCU1.48602-0.40859946
7Hepatitis A virus1.96492-0.10146188
8Human coronaviruses1.83078-1.03063129
-ssRNA1Borna virus0.27188-1.00708233
2Rabies virus0.76128-0.25367
Mokola virus0.79901-0.11880
Measles virus0.58253-0.23633
Mean vRSCU0.71427-0.20293837
3Vesicular stomatitis virus1.10766-0.23715
Influenza A virus H3N20.98353-0.58886
Marburg virus1.29734-0.55782
Ebola viruses1.09920-0.67494
Influenza B virus1.28533-0.26627
Human parainfluenza virus type 11.09898-0.58691
Human parainfluenza virus type 31.48386-0.16452
Human parainfluenza virus type 21.25962-0.71004
Mumps virus0.97594-1.15319
Nipah virus1.21851-0.10109
Hendra virus1.06209-0.09805
Respiratory syncytial virus1.468900.17766
Metapneumovirus1.35819-0.42703
Mean vRSCU1.20763-0.41448898
4Influenza C virus1.72634-0.54316586
dsRNA1Colorado tick fever virus1.29334-1.66493
Mammalian orthoreovirus1.05591-1.47831
Human picobirnavirus0.95531-1.18070
Mean vRSCU1.10152-1.44131161
2Rotaviruses2.23210-1.295567
Retro1HTLV-10.25781-0.96125261
2HTLV-20.116130.15063790
3HIV-11.41668-0.265961037
4HIV-20.94712-0.34939812
Ambi1Sin nombre virus0.25123-1.9437170
2Lymphocytic choriomeningitis virus0.774880.08308
Lassa virus0.787190.24056
Machupo virus0.909320.37322
Rift valley fever virus0.81849-0.15444
Junin virus0.96106-0.12456
Mean vRSCU0.850190.08357950
3Guanarito virus1.09370-0.71405
La Crosse virus1.19379-0.84942
Crimean-Congo virus1.05851-0.55100
Mean vRSCU1.11533-0.70482529
4Bunyamwera virus1.39643-0.85192
Hantaan virus1.47251-0.11594
Seoul virus1.41416-0.37791
Mean vRSCU1.42770-0.44859522
Figure 4

GO terms enrichment in the biological processes from human genes with RSCUs similar to RNA viruses. GO terms with p ≤ 0.01 were taken as a significant enrichment and represented with -Log10 (p-value). The bold alphabets indicate GO terms that were found to be identical to the enriched GO terms of upregulated protein profiles in viral infections, determined by high-throughput quantitative proteomics; H (HIV-1), Z (Zika virus), F (influenza A virus), and D (dengue virus serotype 2).

The human genes with RSCUs similar to RNA viruses in each subgroup are represented using transparent colored dots. Categories of human RNA viruses. RNA viruses were categorized based on the nucleic acid types of their genomes. The viruses in each group were divided into subgroups based on their PCA of RSCU. In subgroups containing more than one virus, the mean virus RSCU (mean vRSCU [a, b]) was calculated. The number of human genes with RSCUs similar to RNA viruses were also represented. GO terms enrichment in the biological processes from human genes with RSCUs similar to RNA viruses. GO terms with p ≤ 0.01 were taken as a significant enrichment and represented with -Log10 (p-value). The bold alphabets indicate GO terms that were found to be identical to the enriched GO terms of upregulated protein profiles in viral infections, determined by high-throughput quantitative proteomics; H (HIV-1), Z (Zika virus), F (influenza A virus), and D (dengue virus serotype 2).

Codon usage bias of cell cycle regulated genes is similar to that of RNA viruses

From the previous section, we demonstrated that human genes in the GO terms of the cell cycle and the regulation of the cell cycle process adopt codon usage patterns similar to those of +ssRNA (subgroups 6, 7), –ssRNA (subgroup 4), retrovirus (HIV-1), and ambisense (subgroup 4) viruses. To confirm that the codon usage patterns of these RNA viruses are similar to human genes in the cell cycle and the regulation of the cell cycle process, the cell cycle codon score (CCCS) was used to evaluate the similarity of the codon usage pattern between that of viral genes and a set of cell cycle-regulated human genes (top-600 set) [11]. The CCCS of human RNA viruses had been calculated and is detailed in Table 2. A positive CCCS indicates that a gene has a codon usage pattern similar to the top-600 set. The results revealed that most of the +ssRNA viruses had a negative CCCS score. Only some +ssRNA viruses had a positive CCCS score (such as dengue viruses [DENVs], MERS-coronavirus, SARS-coronavirus, human coronaviruses, human enterovirus 68, and the hepatitis A virus), whereas most of the –ssRNA, dsRNA, HIV-1, HIV-2, and ambisense viruses had a positive CCCS score. However, HTLV-1 and HTLV-2 in the retrovirus group gave a negative CCCS.
Table 2

Cell-Cycle Codon Score (CCCS) of human RNA viruses.

GroupsGene isoform or virusesPCA components
CCCS
PC1 (x)PC2 (y)
+ssRNARubella virus-1.11152-2.38693-0.041109
Hepatitis G virus-0.20805-1.33632-0.012674
Hepatitis C virus-0.38012-0.8688-0.015136
Hepatitis E virus0.05314-1.99911-0.006142
Ross river virus0.08755-1.48581-0.011565
Chikungunya virus0.2134-1.65931-0.008180
Sindbis virus0.25123-1.94371-0.009176
Eastern equine encephalitis virus0.48833-1.3069-0.002974
Western equine encephalitis virus0.47744-1.79137-0.003173
Venezuelan equine encephalitis virus0.42568-1.26628-0.003554
O'nyong nyong virus0.56417-1.63171-0.001119
Enterovirus 710.64889-0.88714-0.000267
Human coxsackievirus A90.54435-0.52889-0.002991
Human coxsackievirus B40.65238-0.59692-0.000998
Echoviruses0.57004-0.57533-0.002082
Polioviruses0.76484-0.351230.001193
Norwalk virus0.77184-0.524680.001137
Sapporo virus0.62754-0.942390.004792
Dengue virus type 10.91914-0.190050.005112
Dengue virus type 20.856380.191190.005403
Dengue virus type 30.88466-0.241340.004578
Dengue virus type 40.80321-0.029010.003912
Japanese encephalitis virus0.34886-0.42656-0.003955
Murray Valley encephalitis virus0.76079-0.714680.002322
St.Louis encephalitis virus0.61121-0.166980.000160
West Nile virus0.42713-0.4084-0.003391
Yellow fever virus0.55894-0.03048-0.000690
Zika virus0.463680.0873-0.002752
Kyasanur Forest disease virus0.26948-0.2271-0.006086
Omsk hemorrhagic fever virus0.2543-0.25169-0.005482
Tick-borne encephalitis virus0.15574-0.30465-0.007427
SARS coronavirus1.38814-1.478540.027135
MERS coronavirus1.4049-1.393210.031220
Human astrovirus1.21324-1.043230.014968
Rhinovirus A,B and C1.46621-0.285590.013209
Human parechovirus1.48761-0.445310.013904
Enterovirus 681.50423-0.494880.013644
Hepatitis A virus1.96492-0.101460.022827
Human coronaviruses1.83078-1.030630.024501
-ssRNABorna virus0.27188-1.00708-0.002858
Rabies virus0.76128-0.253670.000205
Mokola virus0.79901-0.11880.001436
Measles virus0.58253-0.236330.000810
Vesicular stomatitis virus1.10766-0.237150.006431
Influenza A virus H3N20.98353-0.588860.008199
Marburg virus1.29734-0.557820.000675
Ebola viruses1.0992-0.674940.008390
Influenza B virus1.28533-0.266270.014146
Human parainfluenza virus type 11.09898-0.586910.015018
Human parainfluenza virus type 31.48386-0.164520.020038
Human parainfluenza virus type 21.25962-0.710040.016267
Mumps virus0.97594-1.153190.009273
Nipah virus1.21851-0.101090.011326
Hendra virus1.06209-0.098050.009457
Respiratory syncytial virus1.46890.177660.020368
Metapneumovirus1.35819-0.427030.016291
Influenza C virus1.72634-0.543160.021730
dsRNAColorado tick fever virus1.29334-1.66493-0.005517
Mammalian orthoreovirus1.05591-1.478310.006214
Human picobirnavirus0.95531-1.18070.003846
Rotaviruses2.2321-1.295560.028856
RetroHTLV-10.25781-0.96125-0.006770
HTLV-20.116130.15063-0.007187
HIV-11.41668-0.265960.008663
HIV-20.94712-0.349390.007142
AmbiSin nombre virus0.25123-1.943710.017737
Lymphocytic choriomeningitis virus0.774880.083080.006150
Lassa virus0.787190.240560.007772
Machupo virus0.909320.373220.008089
Rift valley fever virus0.81849-0.154440.003661
Junin virus0.96106-0.124560.009001
Guanarito virus1.0937-0.714050.009082
La Crosse virus1.19379-0.849420.007769
Crimean-Congo virus1.05851-0.5510.006304
Bunyamwera virus1.39643-0.851920.018548
Hantaan virus1.47251-0.115940.015166
Seoul virus1.41416-0.377910.015762
Cell-Cycle Codon Score (CCCS) of human RNA viruses. A previous study by Frenkel-Morgenstern et al. also demonstrated that the codon usage pattern of the cell cycle regulated genes (CCRs) influence the cell cycle-dependent protein expression [11]. Thus, the similarity of the codon usage pattern of the CCR genes and RNA viruses was investigated. CCR genes cycling at the protein level and non-cycle regulated genes (NCCRs) that were found not to cycle at the protein level were selected from previous studies (Table 3). The RSCUs of the CCR and NCCR genes were plotted on a graph and compared with the RSCUs of RNA viruses (Figure 5). The results showed that the CCR genes were located with -ssRNA, dsRNA, and some +ssRNA viruses, indicating similar codon usage patterns, while the RSCUs of the NCCR genes were distributed all over the graph with no resemblance to RNA viruses. This result corresponds with the CCCS of RNA viruses (Table 2).
Table 3

List of CCR and NCCR genes.

Gene symbolsDescriptionsReferences
CCRTRA2Btransformer 2 beta homolog[70, 71, 72, 73]
TOP1topoisomerase
E2F5E2F transcription factor 5
H2AFVH2A histone family member V
ANP32Eacidic nuclear phosphoprotein 32 family member E
STAG1stromal antigen 1
USP7ubiquitin specific peptidase 7
EZH2enhancer of zeste 2 polycomb repressive complex 2 subunit
RBBP7RB binding protein 7, chromatin remodeling factor
DDX5DEAD-box helicase 5
GTF2F2general transcription factor IIF subunit 2
GARSglycyl-tRNA synthetase[11]
TARSthreonyl-tRNA synthetase
EPRSglutamyl-prolyl-tRNA synthetase
NCCREFHD2EF-hand domain family member D2[74]
ZNF433zinc finger protein 433
STAG3stromal antigen 3
LMNAlamin A/C
E2F4E2F transcription factor 4
HMGA1high mobility group AT-hook 1
YPEL1yippee like 1
SETSET nuclear proto-oncogene
DDX46DEAD-box helicase 46
EZH1enhancer of zeste 1 polycomb repressive complex 2 subunit
HMGA2high mobility group AT-hook 2
SAE1SUMO1 activating enzyme subunit 1
FHAD1forkhead associated phosphopeptide binding domain 1
GAPDHglyceraldehyde-3-phosphate dehydrogenase[11]
WARStryptophanyl-tRNA synthetase
Figure 5

The graph shows the RSCUs of human cell cycle regulated (CCR) genes and non-cell cycle regulated (NCCR) genes, compared with the RSCUs of RNA viruses.

List of CCR and NCCR genes. The graph shows the RSCUs of human cell cycle regulated (CCR) genes and non-cell cycle regulated (NCCR) genes, compared with the RSCUs of RNA viruses.

Human genes with similar codon usage patterns to RNA virus were upregulated in viral infections

From the previous section, we demonstrated that human genes with codon usage patterns similar to RNA viruses contributed to some important biological processes, such as the cell cycle, the regulation of the cell cycle process, cell division, microtubule cytoskeletal organization, chromosome segregation, DNA repair, macromolecule catabolism, cellular localization, and RNA processing (Figure 4). When coupled with the fact that some viral infections can manipulate host cellular pathways (especially the translation machineries), this finding suggests that human genes with codon usage patterns similar to RNA viruses may be upregulated during viral infection [41]. To substantiate this hypothesis, sets of proteomics data of RNA virus infection were reanalyzed. The lists of upregulated protein profiles upon HIV-1 [42], IAV [43], Zika virus (ZIKV) [44], and dengue virus serotype 2 (DENV-2) [45] infections were obtained from previous studies. The lists of upregulated proteins upon viral infection were submitted to GO-TermFinder (Supplementary File 3), and the enriched GO terms were compared to the enriched GO terms of human genes with codon usage patterns similar to RNA viruses from every subgroup. Several enriched GO terms of upregulated protein profiles during viral infections were found to be identical to the GO terms of human genes with codon usage patterns similar to RNA viruses (Figure 4). In the case of HIV-1 and ZIKV, the identical GO terms included the cell cycle, the regulation of the cell cycle process, the mitotic cell cycle, organelle organization, cell division, microtubule-based process, and cellular localization. The identical GO terms of IAV and DENV-2 included macromolecule metabolic processes, nucleic acid metabolic process, chromosome organization, cellular stress response and RNA processing.

Discussions

Synonymous codons are distributed unequally and in a non-random fashion, which is referred to as codon usage bias [46]. Moreover, there are significant variations of codon usage bias among different species, and even among genes in the same organism [5, 21]. Theoretically, two major factors shape the codon usage bias: mutation pressure and translational selection [18]. Mutation pressure can result in uneven frequencies of nucleotide content, which can in turn influence codon usage bias [15, 47]. As to translational optimization, the frequent codons are usually found correlated with the population of tRNA isoacceptors [13, 15]. Thus, the frequent or optimal codons would result in more rapid protein translation due to the greater availability of tRNAs corresponding to the frequently used codons [48]. Although the replication of viruses relies on the host cell machinery, several viruses possess different a codon usage pattern to the codon usage preferences of their host [31]. For instance, the HIV-1 genome has been found to be A-rich [49]. The G-to-A hypermutation in the HIV-1 genome has been attributed to viral reverse transcriptase (RT), which lacks 3′ to 5′ exonuclease proofreading activity, leading to the misincorporation of nucleotides [50, 51]. In addition, the function of host enzymes of the APOBEC3 (A3) family has been found to partially contribute to a G-to-A mutation [52, 53]. Furthermore, a difference in codon usage has been observed among individual genes of HIV-1 [51]. The HIV-1 gag gene, encoded for structural protein, adopts a great difference in codon usage pattern compared to human host cells. In contrast, the HIV-1 genes involved in the regulation of the replication cycle, tat and rev genes, have been demonstrated to be more similar to human codon usage bias [54]. In this analysis, the PCA of RSCU represented codon usage patterns of human genes and RNA viruses as a coordinate of PC1 (x) and PC2 (y) on a graph. We demonstrated that the PCA of RSCU analysis is compatible with a well-established index, CAI. This suggested that the PCA of RSCU could be used in assessing codon usage bias and comparing the difference in codon usage pattern. In particular, the PCA of RSCU allowed a comparison to be made of individual genes in the whole genome scale. Human genes possess various, different codon usage patterns, as observed in each quadrant of a graph. As mentioned earlier, there are a number of human genes located densely in the right quadrants; these genes adopt a non-optimal codon usage pattern similar to CCR genes. Although most of the RNA viruses have a non-optimal codon usage pattern, a great variation in codon usage patterns was observed among the groups of +ssRNA, dsRNA, and retroviruses. The greatest difference in the codon usage patterns belonged to rotavirus, as seen in the graph. By comparison, the rubella virus exhibited a more similar codon usage pattern to humans. These results correspond with those of another study which demonstrated that the codon usage patterns of +ssRNA viruses are closer to human than other RNA viruses, and that the lowest CAI belongs to dsRNA viruses. In more detail, rubella virus (+ssRNA) had the highest CAI at 0.773, and rotavirus had the lowest CAI at 0.683 [55]. A number of human genes with codon usage patterns similar to that of RNA viruses were found by the present study (Figure 3). The human genes with RSCUs similar to RNA viruses were retrieved and subjected to gene ontology enrichment analysis. Interestingly, it was found that only human genes similar to groups of viruses in the right quadrant resulted in significant enrichment, namely, the cell cycle, the cell cycle regulation process, cell division, microtubule cytoskeletal organization, chromosome segregation, DNA repair, macromolecule catabolism, and cellular localization. The number of human genes with RSCUs similar to RNA viruses retrieved from each subgroup varied from only seven to a thousand genes (Table 1). However, the number of retrieved genes did not affect the significance level or the number of enriched GO terms in Figure 4; for instance, 898 genes for the –ssRNA subgroup 3 resulted in only 1 enriched GO term, whereas 188 genes for the +ssRNA subgroup 7 resulted in 5 GO terms. This suggests that the enriched GO terms did not result from a bias from the different numbers of retrieved genes among the virus groups. It is also possible that some virus groups with a limited number of retrieved genes might not have had sufficient statistical power to enable the detection of enriched GO terms. Among the retrieved human genes, only the genes that adopted a non-optimal codon usage pattern retrieved from the area where they were located densely in the right quadrants gave significant overrepresented GO terms. This suggested that the difference in codon usage bias in human genes might have specific functions. The contribution of non-optimal codon usage bias in human genes on the regulation of protein expression has been investigated in previous research [11, 56, 57]. One study on CCR genes revealed that the non-optimal codon usage pattern generates the oscillation in protein expression during cell cycle progression [11]. We demonstrated that the RSCUs of RNA viruses were similar to the RSCUs of CCR genes, using both the CCCS calculation and the PCA of RSCU. The results showed that –ssRNA, dsRNA, HIV-1, HIV-2, ambisense viruses, and a few viruses in the +ssRNA group exhibited non-optimal codon usage similar to that of the CCR genes. This suggests that despite having a non-optimal codon usage bias, viral genes might be efficiently expressed during the specific phase of the cell cycle correlated with the available tRNA population in that period. The tRNA population is tissue specific and varies with cellular conditions [58]. Alteration of the tRNA population depends on the level of aminoacyl tRNA synthetase and cellular ATP concentration [59]. In yeast cells, oscillation of the aminoacyl tRNA synthetase and ATP during the cell cycle has been found to result in an increase in tRNA levels in the G2/M phase, but a low tRNA level was observed toward the end of the G1 phase [11]. Therefore, with a low-charged tRNA concentration, the genes expressed during G1 prefer optimal codon usage bias, whereas the genes with non-optimal codon usage bias are highly expressed in the other phases of the cell cycle with a high-charged tRNA concentration [11]. The availability of charged tRNAs during the cell cycle may regulate protein translation in a codon usage-specific manner. Several studies have revealed viral subversion of the cell cycle by arresting via various mechanisms to generate the resources and favorable environment for viral replication and viral protein production [60]. Cell cycle arrest has been observed in both DNA and RNA virus infections. In some RNA viruses, cell cycle arrest at a specific phase may lead to an increase in viral protein translation [61]. During the G2/M phase, the expression of many proteins has been found to fluctuate by arresting at G2/M, viruses may use this mechanism to regulate protein expression [62]. Another study found that HIV-1 was more transcriptionally active during the G2 phase, and that arresting of the cell cycle may help limit the host immune response [63]. Moreover, an HIV-1 infection also causes an alteration at the cellular tRNA level [34]. As to avian coronavirus infections, G2/M arrest has been found with an increased viral protein expression [64]. Furthermore, rotavirus infection arrests the cell cycle in the S/G2 phase, favoring viral protein expression [61], while influenza A virus infection arrests the cell cycle in the G0/G1 phase, resulting in increased viral protein expression [65, 66]. These findings suggest that viruses may manipulate the cell cycle and cellular translation machinery to create available tRNA population favoring the viral codon usage pattern. Several GO terms of human genes with codon usage patterns similar to RNA viruses have been found by previous studies to be identical to the GO terms of upregulated protein profiles in viral infections. Global proteomic and phosphoproteomic changes in HIV-1 infected CD4+ T cells revealed that HIV-1 affected transcriptional and translational regulation, and targeted RNA or protein degradation in order to modulate biological processes (including signal transduction, cell cycle, metabolic processes, and the immune system) [42]. In addition, a study of ZIKV infected human neurospheres also found an upregulation profile of proteins involving cell cycle arrest. This resulted in an alteration of the cell cycle in order to regulate the transcription and translation of the host cells [44]. IAV infection targeted several cellular pathways to favor its replication, including aminoacyl-tRNA biosynthesis, glycolysis, fatty acid biosynthesis, and spliceosome [43]. Moreover, regulated proteins and phosphoproteins in DENV-infected cells were related to cellular macromolecule biosynthesis, RNA splicing, chromatin modification, and cell stress response, and these regulations help facilitate viral protein expression [45]. This suggests that human genes with codon usage patterns similar to that of viruses may be upregulated at the translational level in a viral infection. It is unclear whether viruses evolved to regulate the translational machinery in order to accommodate the codon usage pattern that had already been shaped by mutational pressure, or whether they adapted their codon usage pattern to match the cellular translational machinery condition in infected cells. These two possibilities are not mutually exclusive. Further studies are required to gain more insight into this new aspect of the virus-host interaction.

Materials and methods

Protein coding sequences of human genomes

The protein coding sequences of human genomes were downloaded from the GENCODE database (version 26) in FASTA format [67]. The data set provided the nucleotide sequences of the coding transcripts on the reference chromosomes, including multiple transcript variants for each gene. Thus, only the major transcript variants were selected for analysis. The major transcript variant is the longest transcript variant of the gene with a complete ORF. For each gene, one representative as a major transcript variant with the longest sequence length was selected using a custom python script (Supplementary File 4). The ORFs of the protein coding sequences were rechecked by the ORFfinder tool at the website https://www.ncbi.nlm.nih.gov/orffinder (NCBI) before performing further analysis. The transcript variants with an incomplete ORF were excluded and substituted by alternative variants. A list of the selected major transcripts variants is given in Supplementary File 1.

Sequences of human RNA virus genomes

The data set for the sequences of the human RNA virus genome utilized by a previous study was used [55]. A total of 77 RNA viruses that can cause diseases in humans were selected. The protein coding sequences of those RNA viruses were downloaded in FASTA format from the Nucleotide database (RefSeq, NCBI). For each virus species, the protein coding sequences from different isolations of the same virus species were downloaded. The sequences were selected based on their availability in the database (some viral protein coding sequences were only available in a few numbers). Any coding sequence with unidentified nucleotides that could not be translated or with an incomplete ORF was excluded. The selected human RNA viruses were categorized by their family, genus, and genome polarity. They comprised 39 positive-sense single strand RNA (+ssRNA) viruses; 18 negative-sense single strand RNA (–ssRNA) viruses; 4 double strand RNA (dsRNA) viruses; 4 retroviruses (retro); and 12 ambisense (+/-, ambi) viruses. Supplementary File 5 presents a list of the human RNA viruses used in this study, the accession numbers of the sequences, and the number of coding sequences of each virus.

The RSCU analysis

The RSCU is the ratio of the observed frequency of the codon in a gene to the expected frequency of the codon under the condition that all the synonymous codons are equally used. The three stop codons (TAA, TAG, TGA), Met (ATG), and Trp (TGG) were excluded from the analysis. The observed frequency of the codons in the genes was counted. The FASTA sequences were parsed, and the codons of the coding sequences for each transcript variant were counted by a python script and the Biopython library (Python version 3.5.2, with Biopython version 1.66; for the scripts, see Supplementary File 6). Then, the RSCU was calculated as follows:where n is the number of synonymous codons (1 ≤ n ≤ 6) for the amino acid, and X is the number of occurrences of codon i. The synonymous codons with RSCU values greater than 1.0 had a positive codon usage bias and were defined as abundant codons, while those with RSCU values less than 1.0 had a negative codon usage bias and were defined as less-abundant codons. In the case of RSCU values that were exactly 1.0, it meant that there was no codon usage bias, and the codons were chosen equally [68]. The RSCU of human RNA viruses were calculated using the CAIcal server, which is available at http://genomes.urv.es/CAIcal [37]. The multiple protein coding sequences from different isolations of the virus in the same species were submitted to the CAIcal server for the RSCU calculation. After that, the average RSCU was calculated to represent the RSCU of each RNA virus species. The RSCUs of the human genes and RNA viruses are provided in Supplementary File 1.

The calculation of CAI

The CAI of a specific gene was calculated using the CAI calculator on the CAIcal server [37]. The reference human codon usage table was obtained from the Codon Usage Database (http://www.kazusa.or.jp/codon/) [69].

PCA of RSCUs

The RSCUs of the protein coding sequences of 20,190 human genes and 77 human RNA viruses were input to the PCA. The PCA was performed using PASW Statistics for Windows, version 18 (SPSS Inc., Chicago, Ill., USA). The Kaiser–Meyer–Olkin Measure of Sampling Adequacy test (KMO–MSA) was also analyzed. The overall KMO–MSA was 0.799, which was greater than a cut off of 0.5, indicating that the sample size was adequate. The principle components were successfully extracted using covariance matrix and Quartimax rotation, which reduced the high dimensions of the dataset to a smaller number of dimensions. The selection of the significant components was based on a scree plot and the proportions of variance. The scree plot was a plot of the component numbers and eigenvalues; only the first two components had eigenvalues greater than 1, and they accounted for 45.5% of the total variance. The RSCU of the gene was represented by the coordinate of PC1 and PC2 (x, y) on the graph. The PCAs of the RSCU graphs were plotted and analyzed using GraphPad Prism 7 (GraphPad Software, Inc., CA).

The CCCS calculation

The CCCS of a gene evaluates the similarity of the codon usage between that of a specific gene and a set of cell cycle regulated human genes. The calculation has been previously described [11]. Briefly, the CCCS is the sum of the codon preference (CP) values of the cell cycle regulated human genes (top-600 set) over all codons in the coding sequence of a gene, normalized by the length of the cDNA. The CCCS of a specific gene was calculated as follows:where g is every codon of a gene, and CP top−600(codon) is the CP in the top-600 gene set (see cited reference, Table 1).

GO enrichment

On the PCA of the RSCU graph, the human genes located in the same area of RNA viruses were taken as human genes with RSCUs similar to RNA viruses. To select these human genes, the average RSCU of each group of RNA viruses was initially calculated. However, a great variation of RSCU was observed in each group of RNA virus categorized by the nucleic acid types of their genomes. Hence, the viruses in each group were divided into subgroups by their RSCU before calculation of the average PC1 (x) and PC2 (y); these RNA-virus subgroups are listed in Table 1. Subsequently, the average RSCU of the RNA viruses in each subgroup (mean vRSCU, coordinate [a, b]) were set as the circle center of a circle with a radius of 0.3 units (Figure 3). The radius was calculated to be minimal, to not exceed the standard deviation of the distance between the circle center and the human genes, and to cover most of the viruses in each subgroup. The human genes located within the circle were taken as the human genes with RSCUs similar to RNA viruses. The distance (r) from the circle center (a, b) was measured as follows:(a-x) where a and b are the coordinates of the mean vRSCU of each subgroup of RNA viruses, while x and y are the coordinates of the RSCUs of the human genes. The human genes with an RSCU similar to each group of RNA viruses were analyzed for GO enrichment using Go-TermFinder, and REVIGO was used to categorize the redundant GO terms [39, 40]. The whole genome of Homo sapiens was used as a background list. The overrepresented GO terms in the biological process were investigated in both analyses, with p ≤ 0.01 taken as a significant enrichment.

Statistical analysis

The simple linear regression analysis and Pearson correlation coefficient (PCC) were determined using GraphPad Prism 7 with p < 0.05 was taken as significant.

Declarations

Author contribution statement

Kunlakanya Jitobaom: Conceived and designed the experiments; Performed the experiments; Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data; Wrote the paper. Supinya Phakaratsakul: Conceived and designed the experiments; Performed the experiments; Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data. Thanyaporn Sirihongthong: Conceived and designed the experiments; Performed the experiments; Analyzed and interpreted the data. Sasithorn Chotewutmontri, Prapat Suriyaphol: Performed the experiments; Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data. Ornpreya Suptawiwat: Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data. Prasert Auewarakul: Conceived and designed the experiments; Analyzed and interpreted the data.

Funding statement

This work was supported by Postdoctoral Fellowship, , Thailand (Grant no. R01612001, 2017), The Royal Golden Jubilee Ph.D. Program, Thailand (Grant No. PHD/0030/2556), Siriraj Graduate Scholarship, , and Thailand Research Fund (Grant No. IRN60W0002).

Competing interest statement

The authors declare no conflict of interest.

Additional information

No additional information is available for this paper.
  72 in total

1.  Accurate translation of the genetic code depends on tRNA modified nucleosides.

Authors:  Connie Yarian; Hannah Townsend; Wojciech Czestkowski; Elzbieta Sochacka; Andrzej J Malkiewicz; Richard Guenther; Agnieszka Miskiewicz; Paul F Agris
Journal:  J Biol Chem       Date:  2002-02-22       Impact factor: 5.157

2.  Synonymous codon usage, accuracy of translation, and gene length in Caenorhabditis elegans.

Authors:  G Marais; L Duret
Journal:  J Mol Evol       Date:  2001-03       Impact factor: 2.395

3.  Dynamic proteomics in individual human cells uncovers widespread cell-cycle dependence of nuclear proteins.

Authors:  Alex Sigal; Ron Milo; Ariel Cohen; Naama Geva-Zatorsky; Yael Klein; Inbal Alaluf; Naamah Swerdlin; Natalie Perzov; Tamar Danon; Yuvalal Liron; Tal Raveh; Anne E Carpenter; Galit Lahav; Uri Alon
Journal:  Nat Methods       Date:  2006-07       Impact factor: 28.547

Review 4.  Codon bias and gene expression.

Authors:  C G Kurland
Journal:  FEBS Lett       Date:  1991-07-22       Impact factor: 4.124

5.  Genome polarity of RNA viruses reflects the different evolutionary pressures shaping codon usage.

Authors:  Supinya Phakaratsakul; Thanyaporn Sirihongthong; Chompunuch Boonarkart; Ornpreya Suptawiwat; Prasert Auewarakul
Journal:  Arch Virol       Date:  2018-07-09       Impact factor: 2.574

6.  Fidelity of HIV-1 reverse transcriptase.

Authors:  B D Preston; B J Poiesz; L A Loeb
Journal:  Science       Date:  1988-11-25       Impact factor: 47.728

7.  Targeting Metabolic Reprogramming by Influenza Infection for Therapeutic Intervention.

Authors:  Heather S Smallwood; Susu Duan; Marie Morfouace; Svetlana Rezinciuc; Barry L Shulkin; Anang Shelat; Erika E Zink; Sandra Milasta; Resha Bajracharya; Ajayi J Oluwaseum; Martine F Roussel; Douglas R Green; Ljiljana Pasa-Tolic; Paul G Thomas
Journal:  Cell Rep       Date:  2017-05-23       Impact factor: 9.423

Review 8.  Codon usage and tRNA content in unicellular and multicellular organisms.

Authors:  T Ikemura
Journal:  Mol Biol Evol       Date:  1985-01       Impact factor: 16.240

9.  Non-optimal codon usage affects expression, structure and function of clock protein FRQ.

Authors:  Mian Zhou; Jinhu Guo; Joonseok Cha; Michael Chae; She Chen; Jose M Barral; Matthew S Sachs; Yi Liu
Journal:  Nature       Date:  2013-02-17       Impact factor: 49.962

10.  A new and updated resource for codon usage tables.

Authors:  John Athey; Aikaterini Alexaki; Ekaterina Osipova; Alexandre Rostovtsev; Luis V Santana-Quintero; Upendra Katneni; Vahan Simonyan; Chava Kimchi-Sarfaty
Journal:  BMC Bioinformatics       Date:  2017-09-02       Impact factor: 3.169

View more
  8 in total

1.  A scheme for inferring viral-host associations based on codon usage patterns identifies the most affected signaling pathways during COVID-19.

Authors:  Jayanta Kumar Das; Subhadip Chakraborty; Swarup Roy
Journal:  J Biomed Inform       Date:  2021-05-07       Impact factor: 8.000

2.  Rotavirus A Genome Segments Show Distinct Segregation and Codon Usage Patterns.

Authors:  Irene Hoxie; John J Dennehy
Journal:  Viruses       Date:  2021-07-27       Impact factor: 5.048

3.  Human genes with codon usage bias similar to that of the nonstructural protein 1 gene of influenza A viruses are conjointly involved in the infectious pathogenesis of influenza A viruses.

Authors:  Komi Nambou; Manawa Anakpa; Yin Selina Tong
Journal:  Genetica       Date:  2022-04-08       Impact factor: 1.633

4.  Bias at the third nucleotide of codon pairs in virus and host genomes.

Authors:  Ewan P Plant; Zhiping Ye
Journal:  Sci Rep       Date:  2022-03-16       Impact factor: 4.996

5.  Unheeded SARS-CoV-2 proteins? A deep look into negative-sense RNA.

Authors:  Martin Bartas; Adriana Volná; Christopher A Beaudoin; Ebbe Toftgaard Poulsen; Jiří Červeň; Václav Brázda; Vladimír Špunda; Tom L Blundell; Petr Pečinka
Journal:  Brief Bioinform       Date:  2022-05-13       Impact factor: 13.994

6.  Codon Usage Provides Insights into the Adaptive Evolution of Mycoviruses in Their Associated Fungi Host.

Authors:  Qianqian Wang; Xueliang Lyu; Jiasen Cheng; Yanping Fu; Yang Lin; Assane Hamidou Abdoulaye; Daohong Jiang; Jiatao Xie
Journal:  Int J Mol Sci       Date:  2022-07-04       Impact factor: 6.208

7.  Human genes with relative synonymous codon usage analogous to that of polyomaviruses are involved in the mechanism of polyomavirus nephropathy.

Authors:  Yu Fan; Duan Guo; Shangping Zhao; Qiang Wei; Yi Li; Tao Lin
Journal:  Front Cell Infect Microbiol       Date:  2022-09-08       Impact factor: 6.073

8.  Analysis of the Contribution of Intrinsic Disorder in Shaping Potyvirus Genetic Diversity.

Authors:  Guillaume Lafforgue; Thierry Michon; Justine Charon
Journal:  Viruses       Date:  2022-09-03       Impact factor: 5.818

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.