Literature DB >> 32379350

Codon bias analysis may be insufficient for identifying host(s) of a novel virus.

Yuzhou Gong1,2, Guannan Wen2, Jianping Jiang2, Feng Xie2.   

Abstract

Entities:  

Mesh:

Year:  2020        PMID: 32379350      PMCID: PMC7267327          DOI: 10.1002/jmv.25977

Source DB:  PubMed          Journal:  J Med Virol        ISSN: 0146-6615            Impact factor:   20.693


× No keyword cloud information.
To the Editor, A novel kind of coronavirus has infected more than 1600 000 people, claimed over 100 000 lives, and spread to 212 countries and territories since December 2019. The virus has been named severe acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2) (provisionally termed novel coronavirus before), and the pneumonia caused by it is coronavirus disease 2019. Though many scientists and researchers are trying to identify the animals harboring SARS‐CoV‐2, the intermediate host remains under the veil. To many people's surprise, the most pioneering host exploration was from Ji et al that designated snake as the most likely intermediate host. The paper by Ji et al achieved indispensable importance in discovering SARS‐CoV‐2 is derived from a bat coronavirus and likely a recombination of two viruses. Yet its result implied snakes as the most possible intermediate host of the virus seems unlikely. Considering viruses that transmit from poikilotherm to endotherm are rare, and the snake‐host speculation has arose widespread public attention, as well as referred in other studies, we decided to take a close look at the host‐inferring method. Snake was suspected to be the most possible source of the coronavirus by Ji et al because of its closest distance of relative synonymous codon usage (RSCU) bias to SARS‐CoV‐2. This inferring methodology premises virus would evolve similar codon usage pattern to its hosts, which allows it to replicate more efficiently. Nevertheless, even though there were studies showing that RSCU bias between viruses and hosts are comparable, none has suggested such codon usage similarity is necessary or sufficient for successful viral infection. The reason under viral codon usage bias is far more complex than parasitic adaptation, which also involving mutation pressure, particular DNA/RNA or protein structure and genome size. In fact, many studies have shown the RSCU of virus is not similar to that of its host. , Whether the host‐unlike virus simply has limited time for evolutionary host optimization, or its high mutation rates (especially RNA virus) outpaced the process of selection that drive such resemblance is in debate. Therefore, attributing viral codon usage bias only to host adaptation and conducting host inference relies on it seems untenable. Even if the codon usage bias is most similar between a particular virus and its host, the average RSCU of any virus should be selected to match the average RSCU of a vertebrate is in doubt. The codon usage frequency varies significantly among genes within the same organism, so as in viruses. In a multicellular host, viruses are normally restricted to specific organ, tissue, or cell type. Thus, the RSCU of virus would be expected to resemble a particular RSCU calculated from expressed genes within the organ, tissue, or cell type of the host. More complicated, however, viral genes encoding structural proteins have more similar codon usage pattern to the host than other genes. , RSCU retrieved from Database such as Kazusa (http://www.kazusa.or.jp/codon/) presents only an average pattern, which does not reflect enormous intraspecific heterogeneity of codon usage, and undersampling of genes in some species may also introduce biases. As for the reason why snakes share the most similar codon usage pattern with SARS‐CoV‐2, we believed that is because of the inherent AT(U)‐rich base compositions in both the genomes of the coronaviruses and snakes. Base composition, like GC3s (guanine‐cytosine [GC] content on the third codon position) value has been illustrated as a strong determinant in shaping codon usage both in viruses , and higher multicellular eukaryotes. In a bid to test our speculation, we added viruses with known hosts and different AU3s values, as well as several host animals into Ji et al sampling pool. Coding sequences of their genomes were obtained from GenBank and the RSCU similarity was calculated using the same method by Ji et al (see detailed information in Supplementary Information Appendix 1). It turned out that viruses with high AU3s values are closer to animals have comparatively high AT3s values (herein N. atra, B. multicinctus, and M. condylurus) rather than AT3s low‐value species in RSCU distance (Figure 1). The result designated most investigated viruses (10/12) to infect snakes, regardless of their actual hosts.
Figure 1

Heatmap and value of Euclidean distance between virus and potential hosts based on RSCU values. The distance for each virus between different animals was indicated by gradient from red to canary. The species were sorted by AT(U)3s value and indicated by gradient from white to dark blue. BOMV, bamboo mosaic virus; CSFV, classical swine fever virus; EBOV, ebola virus; HAV, hepatitis A virus; HBV, hepatitis B virus; HIV, human immunodeficiency virus; HSV, herpes simplex viruses; MERS‐CoV, Middle East respiratory syndrome coronavirus; RSCU, relative synonymous codon usage; SARS‐CoV, severe acute respiratory syndrome coronavirus; ZIKV, zika virus

Heatmap and value of Euclidean distance between virus and potential hosts based on RSCU values. The distance for each virus between different animals was indicated by gradient from red to canary. The species were sorted by AT(U)3s value and indicated by gradient from white to dark blue. BOMV, bamboo mosaic virus; CSFV, classical swine fever virus; EBOV, ebola virus; HAV, hepatitis A virus; HBV, hepatitis B virus; HIV, human immunodeficiency virus; HSV, herpes simplex viruses; MERS‐CoV, Middle East respiratory syndrome coronavirus; RSCU, relative synonymous codon usage; SARS‐CoV, severe acute respiratory syndrome coronavirus; ZIKV, zika virus In conclusion, the RSCU similarity analysis only links virus to animal(s) that possesses comparable GC‐content. We did not rule out the speculation that snakes would be the intermediate host of SARS‐CoV‐2, but we suspect if the possibility of these species was higher than any other creature in transmitting the virus. Scientific research that gives a quick response to public emergencies is imperative and appreciative, yet the methodology applied within has to be carefully examined. Supplementary information Click here for additional data file. Supplementary information Click here for additional data file.
  8 in total

1.  Clinical Sequencing Uncovers Origins and Evolution of Lassa Virus.

Authors:  Kristian G Andersen; B Jesse Shapiro; Christian B Matranga; Rachel Sealfon; Aaron E Lin; Lina M Moses; Onikepe A Folarin; Augustine Goba; Ikponmwonsa Odia; Philomena E Ehiane; Mambu Momoh; Eleina M England; Sarah Winnicki; Luis M Branco; Stephen K Gire; Eric Phelan; Ridhi Tariyal; Ryan Tewhey; Omowunmi Omoniwa; Mohammed Fullah; Richard Fonnie; Mbalu Fonnie; Lansana Kanneh; Simbirie Jalloh; Michael Gbakie; Sidiki Saffa; Kandeh Karbo; Adrianne D Gladden; James Qu; Matthew Stremlau; Mahan Nekoui; Hilary K Finucane; Shervin Tabrizi; Joseph J Vitti; Bruce Birren; Michael Fitzgerald; Caryn McCowan; Andrea Ireland; Aaron M Berlin; James Bochicchio; Barbara Tazon-Vega; Niall J Lennon; Elizabeth M Ryan; Zach Bjornson; Danny A Milner; Amanda K Lukens; Nisha Broodie; Megan Rowland; Megan Heinrich; Marjan Akdag; John S Schieffelin; Danielle Levy; Henry Akpan; Daniel G Bausch; Kathleen Rubins; Joseph B McCormick; Eric S Lander; Stephan Günther; Lisa Hensley; Sylvanus Okogbenin; Stephen F Schaffner; Peter O Okokhere; S Humarr Khan; Donald S Grant; George O Akpede; Danny A Asogun; Andreas Gnirke; Joshua Z Levin; Christian T Happi; Robert F Garry; Pardis C Sabeti
Journal:  Cell       Date:  2015-08-13       Impact factor: 41.582

2.  Codon usage bias and A+T content variation in human papillomavirus genomes.

Authors:  Kong-Nan Zhao; Wen Jun Liu; Ian H Frazer
Journal:  Virus Res       Date:  2003-12       Impact factor: 3.303

3.  Viral adaptation to host: a proteome-based analysis of codon usage and amino acid preferences.

Authors:  Iris Bahir; Menachem Fromer; Yosef Prat; Michal Linial
Journal:  Mol Syst Biol       Date:  2009-10-13       Impact factor: 11.429

4.  Intra-genomic GC heterogeneity in sauropsids: evolutionary insights from cDNA mapping and GC(3) profiling in snake.

Authors:  Kazumi Matsubara; Shigehiro Kuraku; Hiroshi Tarui; Osamu Nishimura; Chizuko Nishida; Kiyokazu Agata; Yoshinori Kumazawa; Yoichi Matsuda
Journal:  BMC Genomics       Date:  2012-11-09       Impact factor: 3.969

5.  The species Severe acute respiratory syndrome-related coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2.

Authors: 
Journal:  Nat Microbiol       Date:  2020-03-02       Impact factor: 17.745

6.  Cross-species transmission of the newly identified coronavirus 2019-nCoV.

Authors:  Wei Ji; Wei Wang; Xiaofang Zhao; Junjie Zai; Xingguang Li
Journal:  J Med Virol       Date:  2020-04       Impact factor: 2.327

7.  Composition and divergence of coronavirus spike proteins and host ACE2 receptors predict potential intermediate hosts of SARS-CoV-2.

Authors:  Zhixin Liu; Xiao Xiao; Xiuli Wei; Jian Li; Jing Yang; Huabing Tan; Jianyong Zhu; Qiwei Zhang; Jianguo Wu; Long Liu
Journal:  J Med Virol       Date:  2020-03-11       Impact factor: 20.693

8.  Analysis of synonymous codon usage in SARS Coronavirus and other viruses in the Nidovirales.

Authors:  Wanjun Gu; Tong Zhou; Jianmin Ma; Xiao Sun; Zuhong Lu
Journal:  Virus Res       Date:  2004-05       Impact factor: 3.303

  8 in total
  1 in total

1.  Molecular features similarities between SARS-CoV-2, SARS, MERS and key human genes could favour the viral infections and trigger collateral effects.

Authors:  Lucas L Maldonado; Andrea Mendoza Bertelli; Laura Kamenetzky
Journal:  Sci Rep       Date:  2021-02-18       Impact factor: 4.379

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.