| Literature DB >> 36166106 |
Yanan Fu1,2, Yanping Huang1,2, Jingjing Rao1,2, Feng Zeng1,2, Ruiping Yang1,2, Huabing Tan1, Zhixin Liu3,4,5, Weixing Du6, Long Liu7,8,9.
Abstract
SARS-CoV-2 infection, which is the cause of the COVID-19 pandemic, has expanded across various animal hosts, and the virus can be transmitted particularly efficiently in minks. It is still not clear how SARS-CoV-2 is selected and evolves in its hosts, or how mutations affect viral fitness. In this report, sequences of SARS-CoV-2 isolated from human and animal hosts were analyzed, and the binding energy and capacity of the spike protein to bind human ACE2 and the mink receptor were compared. Codon adaptation index (CAI) analysis indicated the optimization of viral codons in some animals such as bats and minks, and a neutrality plot demonstrated that natural selection had a greater influence on some SARS-CoV-2 sequences than mutational pressure. Molecular dynamics simulation results showed that the mutations Y453F and N501T in mink SARS-CoV-2 could enhance the binding of the viral spike to the mink receptor, indicating the involvement of these mutations in natural selection and viral fitness. Receptor binding analysis revealed that the mink SARS-CoV-2 spike interacted more strongly with the mink receptor than the human receptor. Tracking the variations and codon bias of SARS-CoV-2 is helpful for understanding the fitness of the virus in virus transmission, pathogenesis, and immune evasion.Entities:
Year: 2022 PMID: 36166106 PMCID: PMC9514192 DOI: 10.1007/s00705-022-05612-6
Source DB: PubMed Journal: Arch Virol ISSN: 0304-8608 Impact factor: 2.685
Fig. 2The mutation spectrum of the spike protein and selection pressure analysis. (A) Substitutions in the animal-derived SARS-CoV-2 S gene. (B) dN-dS value for S gene sequences. dN = nonsynonymous changes/nonsynonymous site. dS = synonymous changes/synonymous site. (C) CAI values for SARS-CoV-2 sequences from humans and animals. "Bat-CoV" refers to RaTG13 from bat, "Pangolin-COV" refers to pangolin coronavirus (GenBank no. QLR06867.1), "other animal-SARS2" means SARS-CoV-2 isolated from the indicted animals, "the first SARS-CoV-2" refers to the virus isolated from a human host. (D) ENC plot analysis of the coronaviruses from animals and humans. (E) Neutrality plot analysis of the coronaviruses from animals and humans. (F) CAI values of the S sequences of SARS-CoV-2 from humans and animals.
Fig. 3Analysis of binding of the SARS-CoV-2 spike protein with human and mink receptors. (A) Comparison of the spike structure of mink SARS-CoV-2 with that of reference strain WIV04. The changed residues within mink SARS-CoV-2 are highlighted as yellow balls. (B) The free energy of binding of the wild-type RBD or mutants from mink SARS-CoV-2 to the human and mink receptor. (C) Amino acid changes involved in the stability of the RBD-ACE2 complex. Detailed structures for Y453, F486, and N501 are arranged from top to bottom. The green lines represent hydrophobic interactions, the orange lines indicate polar H-bonds, the red lines represent hydrogen bonds, and the pink-purple lines represent clashes. (D) Measurement of the binding of RBD mutants to human ACE2 (hACE2, upper panel) and mink ACE2 (mACE2, lower panel) by FACS. His-tagged wild-type RBD, RBD mutants, and NTD were incubated with cells expressing eGFP-fused ACE2. NTD was used as a negative control.
Fig. 1Composition and substitution analysis of SARS-CoV-2 isolated from animals. (A) The reported animals infected with SARS-CoV-2 with the defined transmission route from humans to animals. (B) Phylogenetic tree constructed by the maximum-likelihood method with the Tamura-Nei model in MEGA X with 500 bootstrap replicates. Red dots represent human sequences from infected animals, and blue dots represent sequences from infected white-tailed deer. (C) The proportions of uracil, guanine, thymine, and cytidine substitutions (nonsynonymous) in SARS-CoV-2 isolated from human or animals. (D) Base pair changes observed in the SARS-CoV-2 genomes. All of the transitions and transversions are listed in Supplementary Table S2. (E) The synonymous and nonsynonymous substitutions in mink SARS-Cov-2. (F) The relative proportion of each nucleotide substitution in the mink SARS-CoV-2 genome.
Fig. 4Codon usage and fitness analysis of the SARS-CoV-2-encoded proteins. (A) Probability of mutations in the receptor-binding motif (RBM). The data were analyzed using MEGA X software. The frequency was calculated using the Datamonkey server, and the figure was produced using WebLogo (https://weblogo.berkeley.edu/logo.cgi). (B) The synonymous codon usage of SARS-CoV-2. The figure was produced using WebLogo. The mink SARS-CoV-2 sequence (GenBank no. MT396266) was compared with that of SARS-CoV strain Tor2 (GenBank no. NC_004718.3). (C) Selective coefficient index for SARS-CoV-2 codons. The codons for the N501 mutants are shown in blue and the codons with the highest fitness are highlighted in red. (D) Analysis of relative synonymous codon usage in SARS-CoV-2-encoded proteins.