Literature DB >> 34977494

Distinct codon usage bias evolutionary patterns between weakly and strongly virulent respiratory viruses.

Feng Chen1, Jian-Rong Yang1.   

Abstract

Human respiratory viruses are of vastly different virulence, giving rise to symptoms ranging from common cold to severe pneumonia or even death. Although this most likely impacts molecular evolution of the corresponding viruses, the specific differences in their evolutionary patterns remain largely unknown. By comparing structural and nonstructural genes within respiratory viruses, greater similarities in codon usage bias (CUB) between nonstructural genes and humans were observed in weakly virulent viruses, whereas in strongly virulent viruses, it was structural genes whose CUBs were more similar to that of humans. Further comparisons between genomes of weakly and strongly virulent coronaviruses revealed greater similarities in CUBs between strongly virulent viruses and humans. Finally, using phylogenetic independent contrasts, dissimilation of viral CUB from that of humans was observed in SARS-CoV-2. Our work revealed distinct CUB evolutionary patterns between weakly and strongly virulent viruses, a previously unrecognized interaction between CUB and virulence in respiratory viruses.
© 2021 The Author(s).

Entities:  

Keywords:  Evolutionary biology; Microbial genomics; Phylogenetics; Virology

Year:  2021        PMID: 34977494      PMCID: PMC8704784          DOI: 10.1016/j.isci.2021.103682

Source DB:  PubMed          Journal:  iScience        ISSN: 2589-0042


Introduction

Human mastadenoviruses (HAdVs), human rhinoviruses (HRVs), human respiratory syncytial viruses (HRSVs), common human coronaviruses (CH-CoVs), and new human coronaviruses (NH-CoVs) are the most frequent causative agents of disease in humans and have significant impacts on morbidity and mortality worldwide (Boncristiani et al., 2009; Greenberg, 2016; Pillaiyar et al., 2020; Zhang et al., 2020). HAdVs and HRVs are usually associated with lower mortality common colds, involving rhinitis, pharyngitis, sneezing, hoarseness, and/or cough (Greenberg, 2011; Jacobs et al., 2013; Zhang et al., 2020). HRSVs and CH-CoVs may frequently cause serious lower respiratory tract illness or even death in children and elderly people (Alansari and Potgieter, 1994; Tognarelli et al., 2019). NH-CoVs (SARS-CoV, MERS-CoV, and SARS-CoV-2) are highly pathogenic emerging and reemerging viruses, and SARS-CoV-2 has posed a great threat to global public health since the end of 2019 (Choi et al., 2003; Esposito et al., 2020; Jiang et al., 2020; Pillaiyar et al., 2020; Wang et al., 2020). To predict the next occurrence of new large-scale public health emergencies caused by another NH-CoV, similar to SARS-CoV-2, it is important to systematically analyze patterns of coevolution between respiratory viruses and humans to better understand the specific evolutionary pattern of this particular virus. As the majority of mammalian viruses do not encode any tRNA, the translational efficiency of viral proteins is mainly determined by the similarity between the synonymous codon usage of viral genes and the tRNA supply of the host (Albers and Czech, 2016; Bahir et al., 2009; Chen et al., 2020; Tian et al., 2018). Many previous works have demonstrated that the dependence of viral gene expression on tRNA resources, in competition with host genes, will retain the virus whose CUB is more similar to that of host (Bahir et al., 2009; Lucks et al., 2008). In contrast, our preceding work suggested that overly increased viral expression would prevent excessive assimilation between viral and host CUBs, so as not to increase the translation load on hosts (Chen et al., 2020). The competition for host tRNA resources because of translational selection had been revealed in some respiratory tract viruses (Alonso and Diambra, 2020; Hernandez-Alias et al., 2021; Rice et al., 2021). However, the specific differences in the evolutionary pattern among different human respiratory tract viruses remain largely unknown. In this work, we systematically analyzed CUB evolutionary patterns for four types of respiratory tract viruses and their relationship with human CUB. In weakly virulent human respiratory viruses (HAdVs and HRVs), it was usually nonstructural genes whose CUBs were more similar to that of humans, whereas in strongly virulent human respiratory viruses (HRSVs, CH-CoVs, and NH-CoVs), structural genes tended to have more human-like CUBs. By comparing the CUB of weakly virulent CH-CoVs and that of strongly virulent NH-CoVs to the tRNA supply of humans, we found greater similarities in CUBs between NH-CoVs and humans. More importantly, dissimilation of CUB (gradual change of the viral CUB to become less similar to the host CUB) has been observed in SARS-CoV-2 since its emergence in humans.

Results

The CUB of early genes is more similar to that of humans in most serotypes of HAdVs

In this work, our goal is to compare the CUB evolutionary patterns of respiratory viruses with weak or strong virulence. To achieve this goal, we first defined the virulence of each respiratory virus according to the symptoms of humans during viral infection (See STAR methods). Then, we focused on one of the weakly virulent respiratory viruses (Human mastadenoviruses, HAdVs), whose genes are expressed in three different stages of viral infection (gene categories are listed on Table S1). During the first stage of HAdVs infection, early genes optimize the cellular milieu for viral replication to benefit the expression of intermediate and late genes in the second and final stage of viral infection, respectively (Nevins, 1987). Thus, the mRNA abundance of early genes at the initial stage of infection will be much lower than that of intermediate and late genes in the other two stages of infections. On that basis, we expected that the translational efficiencies of early genes need to be increased, for example, by increasing the similarities of their CUBs to the tRNA supply of the host, so as to increase the viability of the virus. In contrast, intermediate and late genes might be under weaker translational selection pressure, because the optimized cellular environment would enable sufficiently high translational efficiency. To test this hypothesis, we calculated the deviation from proportionality (Chen et al., 2020) (DP, see STAR methods) of synonymous codon usage of viral genes to the tRNA supply of the host for each gene of HAdV species downloaded from NCBI Virus (Hatcher et al., 2017). In HAdV-A, HAdV-C, HAdC-D, HAdV-E, and HAdV-F, the DP values of early genes were smaller than those of late genes (HAdV-A: Wilcoxon rank-sum test P = 0.0039; HAdV-C: Wilcoxon rank-sum test P = 1.1 × 10−107; HAdV-D: Wilcoxon rank-sum test P = 1.5 × 10−188; HAdV-E: Wilcoxon rank-sum test P = 1.8 × 10−109; HAdV-F: Wilcoxon rank-sum test P = 7.3 × 10−8; Figure 1); the opposite was true in HAdV-B and HAdV-G (HAdV-B: Wilcoxon rank-sum test P = 0; HAdV-G: Wilcoxon rank-sum test P = 2.6 × 10−5; Figure 1). The same patterns were observed between early and intermediate genes (HAdV-A: Wilcoxon rank-sum test P = 3.5 × 10−10; HAdV-B: Wilcoxon rank-sum test P = 8.1 × 10−11; HAdV-C: Wilcoxon rank-sum test P = 2.0 × 10−116; HAdV-D: Wilcoxon rank-sum test P = 6.9 × 10−133; HAdV-E: Wilcoxon rank-sum test P = 1.4 × 10−34; HAdV-F: Wilcoxon rank-sum test P = 0.26; HAdV-G: Wilcoxon rank-sum test P = 0.17; Figure 1). Five out of seven serotypes of HAdVs showed smaller DP values of early genes (binomial P = 0.063), indicating that the translational selection in early genes were marginally stronger than that in intermediate and late genes.
Figure 1

CUB evolutionary patterns among early, intermediate, and late genes of HAdVs

Early genes and human CUBs were more similar than CUBs of intermediate or late genes to that of humans. The difference between viral gene CUBs and human tRNA supply was measured by DP values. The p values represent comparisons among the DP values of early, intermediate, and late genes by the Wilcoxon rank-sum test. E (red), I (green), and L (blue) represent early, intermediate, and late genes, respectively. The dots represent the mean of DP values in each group. The numbers of genes in each group were listed in Table S1.

CUB evolutionary patterns among early, intermediate, and late genes of HAdVs Early genes and human CUBs were more similar than CUBs of intermediate or late genes to that of humans. The difference between viral gene CUBs and human tRNA supply was measured by DP values. The p values represent comparisons among the DP values of early, intermediate, and late genes by the Wilcoxon rank-sum test. E (red), I (green), and L (blue) represent early, intermediate, and late genes, respectively. The dots represent the mean of DP values in each group. The numbers of genes in each group were listed in Table S1. Intermediate genes upregulate the transcriptional activity of major late promoters and drive the expression of late genes and the synthesis of the corresponding structural proteins (Lutz and Kedinger, 1996; Lutz et al., 1997). Accordingly, the mRNA abundance of intermediate genes in the second stage of infection may be lower than that of late genes in the final stage. To further increase the expression of late genes, the translational efficiency of intermediate genes seems to be increased through increasing the similarities of CUBs to the tRNA supply of humans. However, it is not clear whether it is necessary for the similarities in CUBs between intermediate genes and humans to be further increased when the cellular surroundings for viral replication have been improved by early genes. To examine this problem, we compared the DP values of intermediate and late genes. We observed a skewed CUB evolution pattern between intermediate genes and late genes in different serotypes of HAdVs (Figure 1). In HAdV-D, HAdV-E, and HAdV-F, intermediate genes displayed more human-like CUBs comparing to late genes (HAdV-D: Wilcoxon rank-sum test P = 0.77; HAdV-E: Wilcoxon rank-sum test P = 0.31; HAdV-F: Wilcoxon rank-sum test P = 0.74; Figure 1); the opposite was true in HAdV-A, HAdV-B, HAdV-C, and HAdV-G (HAdV-A: Wilcoxon rank-sum test P = 7.7 × 10−5; HAdV-B: Wilcoxon rank-sum test P = 1.1 × 10−49; HAdV-C: Wilcoxon rank-sum test P = 2.6 × 10−21; HAdV-G: Wilcoxon rank-sum test P = 0.068; Figure 1). Compared to late genes, smaller DP values of intermediate genes were only observed in three out of seven serotypes of HAdVs (binomial P = 0.50), indicating no difference in the translational selection between intermediate and late genes.

The CUBs of nonstructural genes are more similar to that of humans in weakly virulent respiratory viruses

Among early, intermediate and late genes, the CUBs of early genes were more similar to that of humans in most serotypes of HAdVs. Early genes coded for nonstructural genes, whereas intermediate and late genes usually coded for structural genes (Fabry et al., 2005; Vellinga et al., 2005). Thus, we expected that nonstructural genes would have more human-like CUBs. To test it, we compared the DP values between nonstructural and structural genes of HAdVs (gene categories were listed in Table S1). As expected, smaller DP values were observed in nonstructural genes of HAdV-A, HAdV-C, HAdV-D, HAdV-E, and HAdV-F (HAdV-A: Wilcoxon rank-sum test P = 9.7 × 10−5; HAdV-C: Wilcoxon rank-sum test P = 1.9 × 10−125; HAdV-D: Wilcoxon rank-sum test P = 5.1 × 10−192; HAdV-E: Wilcoxon rank-sum test P = 1.4 × 10−97; HAdV-F: Wilcoxon rank-sum test P = 9.2 × 10−7; Figure 2A); the opposite was true in HAdV-B and HAdV-G (HAdV-B: Wilcoxon rank-sum test P = 0; HAdV-G: Wilcoxon rank-sum test P = 0.0018; Figure 2A).
Figure 2

CUB evolutionary patterns between nonstructural and structural genes in weakly virulent respiratory virus

(A and B) Greater similarities in CUB between nonstructural genes and humans were observed in weakly virulent HAdVs (A) and HRVs (B). The difference between viral gene CUBs and human tRNA supply was measured by DP. The p values represent comparisons between the DP values of structural and nonstructural genes by the Wilcoxon rank-sum test. Nonstructural and structural genes are colored with red and cyan, respectively. The dots represent the mean of DP values in each group. The numbers of genes in each group were listed in Table S1.

CUB evolutionary patterns between nonstructural and structural genes in weakly virulent respiratory virus (A and B) Greater similarities in CUB between nonstructural genes and humans were observed in weakly virulent HAdVs (A) and HRVs (B). The difference between viral gene CUBs and human tRNA supply was measured by DP. The p values represent comparisons between the DP values of structural and nonstructural genes by the Wilcoxon rank-sum test. Nonstructural and structural genes are colored with red and cyan, respectively. The dots represent the mean of DP values in each group. The numbers of genes in each group were listed in Table S1. As CUB evolutionary patterns may be vastly different among viruses, we wondered whether the same or opposite evolutionary patterns could be observed in another weakly virulent respiratory virus (human rhinoviruses, HRVs). This problem was checked by comparing the DP values between nonstructural and structural genes of HRVs (gene categories were listed in Table S1), and smaller DP values in nonstructural genes of all HRVs were observed (HRV-A: Wilcoxon rank-sum test P = 1.9 × 10−10; HRV-B: Wilcoxon rank-sum test P = 1.5 × 10−7; HRV-C: Wilcoxon rank-sum test P = 1.7 × 10−48; Figure 2B). Among ten serotypes of HAdVs and HRVs, eight showed smaller DP values in nonstructural genes (binomial P = 0.011), indicating greater similarities in CUBs between nonstructural genes and humans in weakly virulent respiratory viruses.

The CUBs of structural genes are more similar to that of humans in strongly virulent respiratory viruses

The results above showed greater similarities in CUBs between nonstructural genes and humans in HAdVs and HRVs. As these two types of viruses were defined as weakly virulent respiratory viruses (See STAR methods), we wondered whether the same or opposite patterns would be observed in human respiratory syncytial viruses (HRSVs: HRSV-A and HRSV-B) and human coronaviruses (H-CoVs: 229E, NL63, OC43, HKU1, SARS-CoV, MERS-CoV, and SARS-CoV-2), which are defined as strongly virulent respiratory viruses (See STAR methods). We examined this problem through comparing the DP values between nonstructural and structural genes of HRVs and H-COVs (gene categories were listed in Table S1), and observed smaller DP values in structural genes of all these two kinds of viruses (HRSV-A: Wilcoxon rank-sum test P = 7.3 × 10−86; HRSV-B: Wilcoxon rank-sum test P = 1.6 × 10−19; HKU1: Wilcoxon rank-sum test P = 3.1 × 10−31; NL63: Wilcoxon rank-sum test P = 1.9 × 10−126; 229E: Wilcoxon rank-sum test P = 4.5 × 10−11; OC43: Wilcoxon rank-sum test P = 0.12; SARS2: Wilcoxon rank-sum test P = 0; SARS: Wilcoxon rank-sum test P = 3.9 × 10−6; MERS: Wilcoxon rank-sum test P = 1.7 × 10−58; Figure 3). These results revealed that structural genes and humans CUBs seemed to be more similar in strongly virulent respiratory viruses (binomial P = 0), which is opposite to the evolutionary patterns in respiratory viruses with weak virulence.
Figure 3

CUB evolutionary patterns between nonstructural and structural genes in strongly virulent respiratory viruses

(A and B) Greater similarities in CUBs between structural genes and humans were observed in strongly virulent HRSVs (A) and H-CoVs (B). The difference between viral gene CUBs and human tRNA supply was measured by DP. The p values represent comparisons between the DP values of structural and nonstructural genes by the Wilcoxon rank-sum test. Nonstructural and structural genes are colored with red and cyan, respectively. The dots represent the mean of DP values in each group. The numbers of genes in each group were listed in Table S1.

CUB evolutionary patterns between nonstructural and structural genes in strongly virulent respiratory viruses (A and B) Greater similarities in CUBs between structural genes and humans were observed in strongly virulent HRSVs (A) and H-CoVs (B). The difference between viral gene CUBs and human tRNA supply was measured by DP. The p values represent comparisons between the DP values of structural and nonstructural genes by the Wilcoxon rank-sum test. Nonstructural and structural genes are colored with red and cyan, respectively. The dots represent the mean of DP values in each group. The numbers of genes in each group were listed in Table S1.

Dissimilated CUB evolutionary patterns in the coevolution of SARS-CoV-2 and humans

The opposite CUB evolutionary patterns between weakly and strongly virulent respiratory viruses indicated an association between viral virulence and CUBs. To further investigate this relationship, we focused on human coronaviruses, which were derived from numerous animal coronavirus species and entered humans through accidental human–animal contact. Although they may have similar origins, the symptoms of the host after their infections may be quite different. According to the symptoms and mortality of viral infection, the common human coronavirus (CH-CoVs: 229E, NL63, OC43, and HKU1) were defined as weakly virulent coronaviruses, whereas new human coronavirus (NH-CoVs: SARS-CoV, MERS-CoV, and SARS-CoV-2) were defined as strongly virulent coronaviruses (see STAR methods). It was previously shown that for some viruses, greater similarities between viral CUBs and the tRNA supply of host may lead to higher viral expression and translational load on host, therefore more severe symptoms of host (Chen et al., 2020). Based on that, we expected that the similarities in CUBs between NH-CoVs and humans may be greater than that in CUBs between CH-CoVs and humans. To test this hypothesis, the DP values of each human coronavirus downloaded from NCBI Virus were calculated. As expected, smaller DP values were observed in NH-CoVs comparing to CH-CoVs (Figure 4A), indicating that the symptoms of human coronavirus infection may be associated with the similarities in CUBs between the virus and the host because of different competitive abilities on host tRNA resources (Alonso and Diambra, 2020; Chen et al., 2020; Hernandez-Alias et al., 2021). Among twelve pairs of comparisons between NH-CoVs and CH-CoVs, eleven pairs showed smaller DP values in NH-CoVs (Figure 4B; binomial P = 0.00024), further supporting our hypothesis. The exception in the comparisons between SARS2 and 229E may indicate that the CUBs of coronaviruses are also associated with other selective pressures.
Figure 4

Dissimilation of codon usage bias in the co-evolution of SARS-CoV-2 and humans

(A) Greater similarities in CUB between new human coronaviruses and humans were observed. The p values represent comparisons between the DP values of common and new human coronaviruses by Wilcoxon rank-sum test.

(B) The distance of seven kinds of human coronaviruses CUBs to human tRNA supply. The p values represent comparisons between the DP values of each pair of human coronaviruses by Wilcoxon rank-sum test. ∗∗∗: P < 0.001.

(C) The CUBs of SARS-CoV-2 and humans have significantly dissimilated since December 23, 2019 inferred by original values of DP and collection date. Pearson's rank correlation coefficients and p values between the original values of DP and collection dates of each viral genome are shown. Each dot (N = 715,835) represents a genome of SARS-CoV-2.

(D) The CUBs of SARS-CoV-2 and humans have significantly dissimilated since the COVID-19 pandemic inferred by phylogenetic independent contrasts (PIC). Pearson's rank correlation coefficients and p values between the PIC of DP and collection dates are shown.

Dissimilation of codon usage bias in the co-evolution of SARS-CoV-2 and humans (A) Greater similarities in CUB between new human coronaviruses and humans were observed. The p values represent comparisons between the DP values of common and new human coronaviruses by Wilcoxon rank-sum test. (B) The distance of seven kinds of human coronaviruses CUBs to human tRNA supply. The p values represent comparisons between the DP values of each pair of human coronaviruses by Wilcoxon rank-sum test. ∗∗∗: P < 0.001. (C) The CUBs of SARS-CoV-2 and humans have significantly dissimilated since December 23, 2019 inferred by original values of DP and collection date. Pearson's rank correlation coefficients and p values between the original values of DP and collection dates of each viral genome are shown. Each dot (N = 715,835) represents a genome of SARS-CoV-2. (D) The CUBs of SARS-CoV-2 and humans have significantly dissimilated since the COVID-19 pandemic inferred by phylogenetic independent contrasts (PIC). Pearson's rank correlation coefficients and p values between the PIC of DP and collection dates are shown. To enable long-term coexistence of the virus and its host, the CUB of a virus must be similar to host tRNA supply, but not too similar such that the host becomes sick (Chen et al., 2020). On that basis, we expected that NH-CoVs and human CUBs should be gradually dissimilated as they mostly cause severe symptoms. We first tested this hypothesis by calculating the Pearson's correlation coefficients using the original values of DP and collection date. As expected, the difference between SARS-CoV-2 and human CUBs has significantly increased since its emergence in humans (Pearson's R value = 0.0024, P = 0.031, slope = 2.2 × 10−8; Figure 4C). As the DP values of all genomes are not statistically independent observations because of shared ancestry, we recalculated the correlation of DP values and collection dates using phylogenetic independent contrasts (PIC). The result still supported the dissimilation of SARS-CoV-2 and human CUBs (Pearson's R value = 0.018, P = 1.6 × 10−50, slope = 7.2 × 10−7; Figure 4D). No significantly increasing or decreasing tendencies were observed in other human coronaviruses because of too few genomes or lack of collection date (Table S2). These results revealed a gradual trend of dissimilation of CUBs between SARS-CoV-2 and humans. Nevertheless, the dissimilation of SARS-CoV-2 and human CUBs observed here could be a result of recent evolution but not necessarily representative in the long term.

Discussion

In the current study, we carried out a systematic analysis of CUB evolutionary patterns in different virulent human respiratory viruses. We found greater similarities in CUBs between nonstructural genes and humans in weakly virulent HAdVs and HRVs, but between structural genes and humans in strongly virulent HRSVs, CH-CoVs, and NH-CoVs. Further analysis revealed greater similarity in CUBs between NH-CoVs and humans compared to that in CUBs between CH-CoVs and humans. More importantly, SARS-CoV-2 and human CUBs have tended to dissimilate since the COVID-19 pandemic. There are two caveats worth discussion in regard to the relationship between virulence and CUB. First, although these viruses are quite different in genome structure, gene number, gene length, and so on (Table S1 and Figure S1), the symptoms of the host after infections are all related to the respiratory system. After the definition of viral virulence according to the symptoms of host, these viruses were comparable to some level. Second, the mechanism behind this relationship may be that greater similarities in CUBs between virus and host will increase the expression level of viruses, thus exacerbating the symptoms of the host. In this work, our results strongly support the relationship between virulence and CUB without rejecting the link between other selective pressures and CUB, such as the ZAP antiviral protein of host that may selectively reduce CpG dinucleotides in the SARS-CoV-2 genome (Kmiec et al., 2020; MacLean et al., 2021). Based on the results of this work, we modified our previous virus-host coevolution model (Chen et al., 2020) to better characterize the possible evolution patterns of strongly virulent NH-CoVs compared to weakly virulent CH-CoVs. Animal coronaviruses may have evolved to be optimal for long-term coexistence with animal species. By chance, infection with animal coronaviruses whose CUB is overly similar to (red area, left; Figure 5) or overly different from (red area, right; Figure 5) human tRNA supply may cause death of the host or the virus, respectively. Infection with an animal coronavirus whose CUB is not overly similar (green area, left; Figure 5) to the human tRNA supply may cause severe symptoms and form new member of NH-CoVs, such as SARS-CoV, MERS-CoV, or SARS-CoV-2. Infection with an animal coronavirus whose CUB is not overly different from (green area, right; Figure 5) the human tRNA supply may cause mild symptoms and form a member of CH-CoVs, such as 229E, NL63, OC43, or HKU1. To evolve for long-term coexistence, the CUBs of weakly virulent human coronaviruses will be converged with that of humans to produce more copies of the virus (white arrow from right; Figure 5), whereas the CUBs of strongly virulent human coronaviruses (especially in SARS-CoV-2 and human coevolution; Figures 4C and 4D) will be diverged from that of humans, causing a reduced translation load and toxicity to humans (white arrow from left; Figure 5).
Figure 5

Schematic diagram of the different CUB evolutionary patterns in common and new human coronaviruses

Infections of an animal coronavirus whose CUB is not overly similar (green, left part) to the human tRNA supply may cause severe symptoms and form a new human coronavirus, such as SARS-CoV, MERS-CoV, or SARS-CoV-2, whereas infection of an animal coronavirus whose CUB is not overly different (green, right part) from the human tRNA supply may cause mild to moderate symptoms and form a common human coronavirus, such as 229E, NL63, OC43, or HKU1. For long-term coexistence, common human coronaviruses with CUBs similar to that of the host were retained for highly viral expression levels (white, right arrow). In contrast, to evolve for low host toxicity, the expression levels of new human coronaviruses were decreased through dissimilating their CUBs from the tRNA supply of the host (white, left arrow).

Schematic diagram of the different CUB evolutionary patterns in common and new human coronaviruses Infections of an animal coronavirus whose CUB is not overly similar (green, left part) to the human tRNA supply may cause severe symptoms and form a new human coronavirus, such as SARS-CoV, MERS-CoV, or SARS-CoV-2, whereas infection of an animal coronavirus whose CUB is not overly different (green, right part) from the human tRNA supply may cause mild to moderate symptoms and form a common human coronavirus, such as 229E, NL63, OC43, or HKU1. For long-term coexistence, common human coronaviruses with CUBs similar to that of the host were retained for highly viral expression levels (white, right arrow). In contrast, to evolve for low host toxicity, the expression levels of new human coronaviruses were decreased through dissimilating their CUBs from the tRNA supply of the host (white, left arrow). On that basis, our work has potential practical implications. Considering the obvious association between CUBs and virulence, our approach can be used to screen all animal coronavirus mutants to predict the virulence through calculating viral and host CUBs. This may provide new methods for predicting the most dangerous animal coronaviruses or wild animal hosts and even for predicting the next occurrence of new large-scale public health emergencies. Besides, the dissimilation of SARS-CoV-2 CUBs to human tRNA supply may indicate lower translational efficiencies per mRNA, and therefore delayed onset or weakened symptoms of host than before. Thus, before the CUB difference of SARS-CoV-2 and humans become optimal for long-term coexistence, the lengthening incubation period of SARS-CoV-2 infection may further challenge our fight against this large-scale public health emergency.

Limitations of the study

There are limitations in the modeling of CUB evolutionary patterns in viruses with different virulence. On the one hand, there is no experiment data that can clearly quantify the viral virulence of each species. Indeed, we can only estimate the viral virulence using symptoms and mortality reported by the World Health Organization and Centers for Disease Control. Although this classification of viral virulence was rough, most patterns in each section were consistent by binomial test. On the other hand, the evolution of a virus was affected by both virulence and transmission capacity. The frequency of each species can be largely affected by the transmission capacity that may be less affected by CUBs. Although the transmission capacities of different species were not equal, dissimilation of CUB evolutionary patterns can still be observed in SARS-CoV-2 since its emergency in humans, indicating an important role of the interaction between CUB and virulence in the evolution of SARS-CoV-2.

STAR★Methods

Key resource table

Resource availability

Lead contact

Further requests for resources should be directed to and will be fulfilled by the lead contact, Feng Chen (chenfeng5@mail.sysu.edu.cn).

Materials availability

This study did not generate new materials.

Method details

The definition of viral virulence among human respiratory viruses

To define viral virulence, we compared the symptoms and mortality caused by the infections of human respiratory viruses. In details, most of HAdVs and HRVs are usually associated with upper respiratory tract disease, such as lower mortality common colds, involving rhinitis, pharyngitis, sneezing, hoarseness, and/or cough (Fendrick et al., 2003; Greenberg, 2011; Jacobs et al., 2013; Zhang et al., 2020). HRSVs may frequently cause serious lower respiratory tract illness and even death in children below the age of five (Azar and Landry, 2018; Tognarelli et al., 2019; Zhang et al., 2020). CH-CoVs (229E, NL63, OC43, and HKU1) are associated with both upper and lower respiratory tract disease, and even death in children and elderly people (Kahn and McIntosh, 2005). NH-CoVs (SARS-CoV, MERS-CoV, and SARS-CoV-2) infections usually result severe lower respiratory tract disease, and all these three viruses have posed great threats to global public health (Choi et al., 2003; Esposito et al., 2020; Jiang et al., 2020; Pillaiyar et al., 2020; Wang et al., 2020). Upper respiratory tract infections are usually associated with lower mortality and mild symptoms, including common cold, tonsillitis, sinusitis, laryngitis. On the other hand, lower respiratory tract infections usually result higher mortality and server symptoms, including pneumonia, bronchitis, and bronchiolitis. In this work, we defined viral virulence in the category of human respiratory viruses or human coronaviruses. In the first category, HAdVs and HRVs, which usually result upper respiratory tract, were defined as weakly virulent respiratory viruses. On contrast, HRSVs, CH-CoVs, and NH-CoVs were defined as strongly virulent respiratory viruses. In the category of human coronaviruses, NH-CoVs infections usually lead to more severe symptoms comparing to CH-CoVs. Based on that, NH-CoVs were defined as strongly virulent coronaviruses, while CH-CoVs were defined as weakly virulent coronaviruses.

Data sets used in this study

All reference and mutant sequences were downloaded from NCBI Virus (Hatcher et al., 2017). These sequences are derived from HAdVs (HAdV-A, HAdV-B, HAdV-C, HAdV-D, HAdV-E, HAdV-F, and HAdV-G), HRVs (HRV-A, HRV-B, and HRV-C), HRSVs (HRSV-A, HRSV-B), and human coronaviruses (229E, NL63, OC43, HKU1, SARS-CoV, MERS-CoV, and SARS-CoV-2). The genomes of HAdV-A (GenBank: NC_001460.1), HAdV-B (GenBank: NC_011203.1), HAdV-C (GenBank: NC_001405.1), HAdV-D (GenBank: NC_010956.1), HAdV-E (GenBank: NC_003266.2), HAdV-F (GenBank: NC_001454.1), HAdV-G (GenBank: NC_006879.1), HRV-A (GenBank: NC_038311.1), HRV-B (GenBank: NC_038312.1), and HRV-C (GenBank: EF186077.2), HRSV-A (GenBank: NC_038235.1), HRSV-B (GenBank: NC_001781.1), 229E (GenBank: NC_002645.1), NL63 (GenBank: NC_005831.2), OC43 (GenBank: NC_006213.1), HKU1 (GenBank: NC_006577.2), SARS-CoV (GenBank: NC_004718.3), MERS-CoV (GenBank: NC_019843.3), and SARS-CoV-2 (GenBank: NC_045512.2) were used as references for identifying the coding region of each species, respectively.

Calculation of viral codon frequency

All sequences were aligned to the reference sequence using BLASTN with default parameters (Camacho et al., 2009). Six types of open reading frames were chosen for each gene to maintain the highest amino acid identity with the reference sequence and to avoid including a stop codon until the end. Otherwise, these sequences were manually confirmed and removed. Then, the codon frequencies of each gene were scaled so that all groups of synonymous codons added up to 1 (Chen et al., 2020).

Viral and host CUB calculation

In this work, we used the metric DP, which means the deviation from proportionality of synonymous CUBs between viral genes and the tRNA supply of the host (Chen et al., 2020), to estimate the difference between viral and human CUBs. Specifically, we first calculated the Euclidean distance in synonymous codon usage between the viral genes and the tRNA supply of the host for each of the 18 amino acids with at least two synonymous codons. Here, n is the number of synonymous codons for amino acid i, Y is the fraction of codon j among the synonymous codons for amino acid i in the viral gene, and X is the tRNA supply represented by the fraction of codon j among the synonymous codons in the host transcriptome. The DP value of each viral gene is defined as the weighted geometric mean of all 18 D values.

Phylogenetic independent contrasts

The collection date values of each virus were transformed to numeric using “timeDate” Package in R. The phylogenetic trees of all viruses were directly generated and downloaded from NCBI virus (Hatcher et al., 2017). Phylogenetic independent contrasts (PIC) were calculated in R using normalized DP and collection date inferred by the phylogenetic tree of all genomes.

Quantification and statistical analysis

p values of less than 0.001 that calculated by Wilcoxon rank-sum tests were indicated by ∗∗∗.
REAGENT or RESOURCESOURCEIDENTIFIER
Bacterial and virus strains

Human mastadenoviruses ANCBI VirusHAdV-A
Human mastadenoviruses BNCBI VirusHAdV-B
Human mastadenoviruses CNCBI VirusHAdV-C
Human mastadenoviruses DNCBI VirusHAdV-D
Human mastadenoviruses ENCBI VirusHAdV-E
Human mastadenoviruses FNCBI VirusHAdV-F
Human mastadenoviruses GNCBI VirusHAdV-G
Human rhinoviruses ANCBI VirusHRV-A
Human rhinoviruses BNCBI VirusHRV-B
Human rhinoviruses CNCBI VirusHRV-C
Human respiratory syncytial viruses ANCBI VirusHRSV-A
Human respiratory syncytial viruses BNCBI VirusHRSV-B
Human coronavirus 229ENCBI Virus229E
Human coronavirus HKU1NCBI VirusHKU1
Human coronavirus NL63NCBI VirusNL63
Human coronavirus OC43NCBI VirusOC43
Severe acute respiratory syndrome coronavirus 2NCBI VirusSARS2
Severe acute respiratory syndrome coronavirusNCBI VirusSARS
Middle East respiratory syndrome-related coronavirusNCBI VirusMERS

Deposited data

Key codes and datasetsThis paperhttps://github.com/chenfengokha/CUBofSARS2
Features of human respiratory virusThis paperTable S1
CUB evolutionary patterns between coronaviruses and humans inferred by PICThis paperTable S2

Software and algorithms

R Studio 1.2.5RStudio, Inc.https://www.rstudio.com
R 3.6.1R Foundation for Statistical Computinghttps://www.R-project.org
  31 in total

1.  History and recent advances in coronavirus discovery.

Authors:  Jeffrey S Kahn; Kenneth McIntosh
Journal:  Pediatr Infect Dis J       Date:  2005-11       Impact factor: 2.129

2.  Properties of the adenovirus IVa2 gene product, an effector of late-phase-dependent activation of the major late promoter.

Authors:  P Lutz; C Kedinger
Journal:  J Virol       Date:  1996-03       Impact factor: 5.103

3.  The product of the adenovirus intermediate gene IX is a transcriptional activator.

Authors:  P Lutz; M Rosa-Calatrava; C Kedinger
Journal:  J Virol       Date:  1997-07       Impact factor: 5.103

4.  The economic burden of non-influenza-related viral respiratory tract infection in the United States.

Authors:  A Mark Fendrick; Arnold S Monto; Brian Nightengale; Matthew Sarnes
Journal:  Arch Intern Med       Date:  2003-02-24

5.  Outcomes and prognostic factors in 267 patients with severe acute respiratory syndrome in Hong Kong.

Authors:  Kin Wing Choi; Tai Nin Chau; Owen Tsang; Eugene Tso; Ming Chee Chiu; Wing Lok Tong; Po Oi Lee; Tak Keung Ng; Wai Fu Ng; Kam Cheong Lee; William Lam; Wai Cho Yu; Jak Yiu Lai; Sik To Lai
Journal:  Ann Intern Med       Date:  2003-11-04       Impact factor: 25.391

Review 6.  Update on Human Rhinovirus and Coronavirus Infections.

Authors:  Stephen B Greenberg
Journal:  Semin Respir Crit Care Med       Date:  2016-08-03       Impact factor: 3.119

7.  CpG Frequency in the 5' Third of the env Gene Determines Sensitivity of Primary HIV-1 Strains to the Zinc-Finger Antiviral Protein.

Authors:  Dorota Kmiec; Rayhane Nchioua; Scott Sherrill-Mix; Christina M Stürzel; Elena Heusinger; Elisabeth Braun; Marcos V P Gondim; Dominik Hotter; Konstantin M J Sparrer; Beatrice H Hahn; Daniel Sauter; Frank Kirchhoff
Journal:  mBio       Date:  2020-01-14       Impact factor: 7.867

8.  Translational adaptation of human viruses to the tissues they infect.

Authors:  Xavier Hernandez-Alias; Hannah Benisty; Martin H Schaefer; Luis Serrano
Journal:  Cell Rep       Date:  2021-03-16       Impact factor: 9.423

9.  Neutralizing Antibodies against SARS-CoV-2 and Other Human Coronaviruses.

Authors:  Shibo Jiang; Christopher Hillyer; Lanying Du
Journal:  Trends Immunol       Date:  2020-04-02       Impact factor: 16.687

View more
  1 in total

1.  Analysis of SARS-CoV-2 synonymous codon usage evolution throughout the COVID-19 pandemic.

Authors:  Ezequiel G Mogro; Daniela Bottero; Mauricio J Lozano
Journal:  Virology       Date:  2022-02-02       Impact factor: 3.616

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.