| Literature DB >> 32928234 |
Wei Hou1.
Abstract
The outbreak of coronavirus disease 2019 (Entities:
Keywords: COVID-19; Codon usage pattern; Coronaviruses; SARS-CoV-2
Mesh:
Substances:
Year: 2020 PMID: 32928234 PMCID: PMC7487440 DOI: 10.1186/s12985-020-01395-x
Source DB: PubMed Journal: Virol J ISSN: 1743-422X Impact factor: 4.099
Fig. 1Bioinformatic analyses of SARS-CoV-2 and other human coronaviruses. a Maximum likelihood phylogenetic tree of the whole genome sequences of SARS-CoV-2 Wuhan-Hu-1 (MN908947.3) and related coronaviruses including human coronavirus HCoV-229E (AF304460.1), HCoV-NL63 (AY567487.2), HCoV-OC43(AY585228.1), HCoV-HKU1(MH940245.1), and SARS-CoV (strain: Urbani, AY278741.1; strain: Tor2, AY274119.3), MERS-CoV (strain: HCoV-EMC, JX869059.2). b Heat map of RSCU values for the complete coding sequences of SARS-CoV-2 and other human coronaviruses. The heatmap analysis was performed using CIMminer. Each column represents a codon. Codons with higher RSCU values are highlighted with a red background. c The profiles of the relative synonymous codon usage for different genes of SARS-CoV-2 and other human coronaviruses. RSCU values were shown as the vertical bar graph. S:spike; E:envelop; M: membrane; N: nucleocapsid
Fig. 2Over-represented codons (RSCU value > 1.6) and under-represented codons (RSCU value < 0.6) were compared among SARS-CoV-2 and other human coronaviruses. a Complete gene; b S gene; c E gene; d M gene; e N gene
Fig. 3Graphical representation of synonymous codon usage pattern of each amino acid among SARS-CoV-2 and other human coronaviruses using Weblogo (https://weblogo.berkeley.edu/logo.cgi)
The parameters of codon usage bias among the coronaviruses analyzed in this study
| Coronaviruses | GenBank Accession | Genome Length (nt) | ICDI | CBI | ENC |
|---|---|---|---|---|---|
| SARS-CoV-2 Wuhan-Hu-1 | MN908947.3 | 29,903 | 0.144 | 0.306 | 45.38 |
| SARS-CoV Tor2 | AY274119.3 | 29,751 | 0.075 | 0.223 | 49.746 |
| SARS-CoV Urbani | AY278741.1 | 29,727 | 0.08 | 0.228 | 48.965 |
| MERS-CoV HCoV-EMC | JX869059.2 | 30,119 | 0.082 | 0.248 | 50.033 |
| HCoV-OC43 | AY585228.1 | 30,741 | 0.213 | 0.367 | 43.794 |
| HCoV-HKU1 | MH940245.1 | 29,811 | 0.372 | 0.532 | 35.617 |
| HCoV-229E | AF304460.1 | 27,317 | 0.172 | 0.358 | 43.45 |
| HCoV-NL63 | AY567487.2 | 27,553 | 0.307 | 0.476 | 37.275 |
Fig. 4Comparative analysis of SARS-CoV-2 and non-human coronaviruses. a Maximum likelihood phylogenetic tree of the whole genome sequences of SARS-CoV-2 Wuhan-Hu-1 (MN908947.3) and non-human coronaviruses including Scotophilus bat coronavirus 512(NC_009657.1), Swine enteric alphacoronavirus strain SeACoV-p10(MK977618.1), Bat coronavirus HKU4–1(NC_009019.1), Bat coronavirus HKU5–1(NC_009020.1), Bat coronavirus RaTG13(MN996532.1), Bat SARS-like coronavirus isolate bat-SL-CoVZC45(MG772933.1), Bat SARS-like coronavirus isolate bat-SL-CoVZXC21(MG772934.1), Pangolin coronavirus isolate PCoV_GX-P1E(MT040334.1), Pangolin coronavirus isolate PCoV_GX-P4L(MT040333.1), Avian coronavirus strain H120(MK071267.1), Avian coronavirus strain Ma5(KY626045.1), Bulbul coronavirus HKU11–796(FJ376620.1), Thrush coronavirus HKU12–600(NC_011549.1), Munia coronavirus HKU13–3514(NC_011550.1). b Heat map of RSCU values for the complete coding sequences of SARS-CoV-2 and non-human coronaviruses. The heatmap analysis was performed using CIMminer. Each column represents a codon. Codons with higher RSCU values are highlighted with a red background. c The profiles of the relative synonymous codon usage for complete gene of SARS-CoV-2 and non-human coronaviruses. Over-represented codons (RSCU value > 1.6) and under-represented codons (RSCU value < 0.6) were shown as line graph
Fig. 5Over-represented codons (RSCU value > 1.6) and under-represented codons (RSCU value < 0.6) were compared among SARS-CoV-2 and its phylogenetic relatives including RaTG13, Bat-SL-CoVZC45, Bat-SL-CoVZXC21, PCoV_GX-P1E, PCoV_GX-P4L. a Complete gene; b S gene; c E gene; d M gene; e N gene