Literature DB >> 32970329

Genomic and proteomic mutation landscapes of SARS-CoV-2.

Christian Luke D C Badua1, Karol Ann T Baldo1, Paul Mark B Medina1.   

Abstract

The ongoing pandemic caused by a novel coronavirus, Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), affects thousands of people every day worldwide. Hence, drugs and vaccines effective against all variants of SARS-CoV-2 are crucial today. Viral genome mutations exist commonly which may impact the encoded proteins, possibly resulting to varied effectivity of detection tools and disease treatment. Thus, this study surveyed the SARS-CoV-2 genome and proteome and evaluated its mutation characteristics. Phylogenetic analyses of SARS-CoV-2 genes and proteins show three major clades and one minor clade (P6810S; ORF1ab). The overall frequency and densities of mutations in the genes and proteins of SARS-CoV-2 were observed. Nucleocapsid exhibited the highest mutation density among the structural proteins while the spike D614G was the most common, occurring mostly in genomes outside China and United States. ORF8 protein had the highest mutation density across all geographical areas. Moreover, mutation hotspots neighboring and at the catalytic site of RNA-dependent RNA polymerase were found that might challenge the binding and effectivity of remdesivir. Mutation coldspots may present as conserved diagnostic and therapeutic targets were found in ORF7b, ORF9b, and ORF14. These findings suggest that the virion's genotype and phenotype in a specific population should be considered in developing diagnostic tools and treatment options.
© 2020 Wiley Periodicals LLC.

Entities:  

Keywords:  SARS-CoV-2; coldspot; coronavirus; genetic variability; mutation; mutation hotspot; virus bioinformatics

Mesh:

Substances:

Year:  2020        PMID: 32970329      PMCID: PMC7537117          DOI: 10.1002/jmv.26548

Source DB:  PubMed          Journal:  J Med Virol        ISSN: 0146-6615            Impact factor:   20.693


INTRODUCTION

Coronavirus disease 2019 (COVID‐19) presented with pneumonia‐like symptoms surfaced from a seafood market at Wuhan, Hubei Province in China in December 2019, and has since spread across the globe. According to the WHO, it has affected 213 countries and territories with 23,057,288 people infected and 800,906 deaths worldwide. Mitigation of this public health crisis can be accomplished through effective public health safety protocols, vaccines, and targeted viral treatment. The scientific community has then been in haste to develop vaccines and therapeutic drugs to combat the COVID‐19. COVID‐19 is caused by a novel coronavirus, the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS‐CoV‐2). , It is a positive‐sense RNA virus, like SARS‐CoV and Middle East respiratory syndrome coronavirus, with a genome size of 29,903 nucleotides. Figure 1 shows the comparison of the genes and proteins between SARS‐CoV‐2 and SARS‐CoV (2003). Most of its genome codes for ORF1ab (~72%) which is involved in viral replication and pathogenesis, while other ORFs code for structural proteins (spike [S], envelope [E], membrane glycoprotein [M], and nucleocapsid [N]). Genes for accessory proteins are also present in SARS‐CoV‐2 as with SARS‐CoV, however, some of the proteins coding for these accessory proteins (ORF3a, ORF7b, ORF8, and ORF10) are yet to be identified for their function. , Mutations in coronaviruses are expected to have mutation rates ranging between 10−5 and 10−3 substitutions per nucleotide site per cell infection (s/n/c). , Accordingly, information can be obtained from SARS‐CoV‐2 genomes coming from initial cases in Wuhan, China up to the recent submissions.
Figure 1

SARS‐CoV‐2 structure, and comparison of the genomes and proteomes of SARS‐CoV‐2 (2019) and SARS‐CoV (2003). (A) Structure of the SARS‐CoV‐2, the etiologic agent of COVID‐19. Information on these proteins is publicly available from the COVID‐19 UniProt Resource (https://covid-19.uniprot.org/). (B) Comparison of the genomes and proteomes of SARS‐CoV‐2 and SARS‐CoV (2003). COVID‐19, coronavirus disease 2019; SARS‐CoV, severe acute respiratory syndrome coronavirus

SARS‐CoV‐2 structure, and comparison of the genomes and proteomes of SARS‐CoV‐2 (2019) and SARS‐CoV (2003). (A) Structure of the SARS‐CoV‐2, the etiologic agent of COVID‐19. Information on these proteins is publicly available from the COVID‐19 UniProt Resource (https://covid-19.uniprot.org/). (B) Comparison of the genomes and proteomes of SARS‐CoV‐2 and SARS‐CoV (2003). COVID‐19, coronavirus disease 2019; SARS‐CoV, severe acute respiratory syndrome coronavirus Determining mutation hotspots and coldspots in SARS‐CoV‐2 may provide insights on their effects on the properties (i.e., virulence, infectivity, and severity) and characteristics of SARS‐CoV‐2. Hence, drugs, vaccines, and diagnostics effective against SARS‐CoV‐2 variants are crucial today in containing the COVID‐19 pandemic. This study provides an overview of mutation characteristics at the coding and noncoding regions of the SARS‐CoV‐2 genome, as well as the mutations in the translated proteins.

MATERIALS AND METHODS

Collection of SARS‐CoV‐2 genomes

Publicly available genomes from 31 countries submitted to the National Center for Biotechnology Information (NCBI) nucleotide database and the GISAID EpiCoV™ database by January 19, 2020 to May 15, 2020, were collected for the study (Table S1). Gathering of 151 publicly available “complete” and/or “partial” (genome length > 29,700 nucleotides = complete; genome length < 29,700 nucleotides = partial) genomes of SARS‐CoV‐2 (reference sequence NC_045512, GenBank) was conducted from March 12, 2020 to May 15, 2020. There were two data collection points in this study: genomes from both databases that were submitted from December 2019 to March 2020 (86 genomes) and December 2019 to May 2020 (65 additional genomes). Manual grouping of these sequences according to three geographic areas was made for ease of analysis. China‐derived samples were classified under the “China,” United States‐derived samples were classified under the “USA,” while the genome sequences from other countries besides United States or China were classified under the “Others.” The overall data set containing all the samples from China, United States, and Others is referred as the “Total.”

Nucleotide and amino acid variant detection

Each genome sequence was aligned to NC_045512 using the MAFFT. , The default parameters as presented in the web tool were used for the multiple sequence alignment. The nucleotide variants from the reference sequence (NC_045512) were manually annotated and were re‐evaluated using the “Low Frequency Variant Detection” tool of the CLC Genomics Workbench 20.0.3. (QIAGEN Bioinformatics, Aarhus, Denmark). Mutations from both the coding and noncoding regions were recorded. Using the nucleotide mutations, the resulting amino acid mutations throughout the proteome of SARS‐CoV‐2 were determined. The amino acid changes were automatically annotated using the “Map Reads to Reference” tool and a subsequent run in the “Low Frequency Variant Detection” tool in the CLC Genomics Workbench 20.0.3. The resulting proteome from each SARS‐CoV‐2 genome was created and edited using CLC Genomics Workbench 20.0.3. The whole proteome was then aligned for phylogenetic analysis, and for identification of the resulting amino acid mutations.

Construction of phylogenetic trees

A phylogenetic tree based on the translated protein‐coding genes of SARS‐CoV‐2 was constructed using the same command‐line in IQ‐TREE version 1.6.12 and was also edited, and visualized using MEGA X. , , , The phylogenetic tree was constructed using an ultrabootstrap method considering 1000 and considered 151 genomes for the construction of the said tree. , , The resulting tree was edited and visualized using MEGA X.

Data analysis

Mutation hotspots were identified as genome sites with two or more occurring mutations; on the other hand, mutation coldspots are those with no occurring mutations. The characterization of nucleotide mutations was done in terms of the nature of the nucleotide substitution (transition or transversion) and insertion and deletions (indel). The mutation densities (Equation 1) in the genome and proteome of SARS‐CoV‐2 were determined. Amino acid substitutions were characterized according to the nature of change that occurred (e.g., leucine to isoleucine would be classified under “Similar Change,” serine to phenylalanine would be classified under “Polar <> Neutral,” aspartic acid to serine would be classified under “Charged <> Polar,” while glutamic acid to glycine would be “Charged <> Neutral”). Furthermore, amino acid substitutions leading to residues with similar nature were classified as “Similar Change,” while those substitutions that did not produce amino acids with similar nature were classified under “Dissimilar Change.”

RESULTS

The mutations in the genome and proteome of SARS‐CoV‐2 are described per geographic area (China, United States, and Others) in two time points (December 2019–March 2020; December 2019–May 2020). This section starts with a presentation of the phylogenetic data according to the nucleotide sequences and amino acids of SARS‐CoV‐2. Then, the nucleotide substitution types (transversions, transitions, and InDels) were identified per geographic area in the two time points. This was followed by a presentation of the amino acid substitutions due to nucleotide mutations. Finally, remarkable mutations and mutation patterns in the proteins of SARS‐CoV‐2 (S glycoprotein, ORF8, and N) were reported.

Nucleotide and amino acid‐based phylogenetic analyses of SARS‐CoV‐2 show three major clades of SARS‐CoV‐2 and a minor clade (P6810S ORF1ab)

The phylogenetic analysis of mutations in different regions was analyzed using MAFFT software and three major clades were identified. As shown in Figure 2, the L3606F (ORF1ab) is characterized by the color pink, P4715L (ORF1ab)/D614G (S) is highlighted by the color green, and L84S (ORF8)/S2839S (ORF1ab) is denoted by the color blue. The largest among these clades were the L84S (ORF8), having 43 samples. This was caused by a transition substitution in the ORF8 gene (T28144C) leading to an L84S substitution in the ORF8 protein. L84S (ORF8) had four subclades; two of these subclades had subclades as well (Figure 2B). The second‐largest major clade was the P4715L (ORF1ab)/D614G (S) having two subclades. These subclades were identified as R203K and G204R (N), and the G57H (ORF3a)/T265I (ORF1ab)/S3384L (ORF1ab). Lastly, the L3606F (ORF1ab) major clade contained 19 samples; 52.63% of these samples were from the “Others” geographic area, 36.84% were from the United States, while 10.53% were from China. This clade also has a subclade represented by the V378I mutation (ORF1ab; Figure 2B).
Figure 2

Phylogenetic tree of 151 SARS‐CoV‐2 from genomes collected from March 12, 2020 to April 2020 from NCBI GenBank™ and GISAID EpiCoV®. (A) Phylogenetic tree based on the genomes of SARS‐CoV‐2. (B) Phylogenetic tree based on the proteins of SARS‐CoV‐2. Individual viral samples are represented as dots. Samples under the geographic cluster “China” are colored red, blue for sequences under the geographic cluster “USA,” while for the “Others” geographic cluster, these are colored black. SARS‐CoV‐2, severe acute respiratory syndrome coronavirus 2; NCBI, National Center for Biotechnology Information

Phylogenetic tree of 151 SARS‐CoV‐2 from genomes collected from March 12, 2020 to April 2020 from NCBI GenBank™ and GISAID EpiCoV®. (A) Phylogenetic tree based on the genomes of SARS‐CoV‐2. (B) Phylogenetic tree based on the proteins of SARS‐CoV‐2. Individual viral samples are represented as dots. Samples under the geographic cluster “China” are colored red, blue for sequences under the geographic cluster “USA,” while for the “Others” geographic cluster, these are colored black. SARS‐CoV‐2, severe acute respiratory syndrome coronavirus 2; NCBI, National Center for Biotechnology Information A transversion substitution (29868G>C) in the 3ʹ‐untranslated region (UTR) of the SARS‐CoV‐2 genome was identified which defined the occurrence of a nucleotide‐based clade. This clade also contained a subclade bearing a missense mutation in the 2ʹ‐O‐ribose methyltransferase (nsp16) of ORF1ab (20692C>T; P6810S). Overall, the mutations classifying this clade were identified in five China‐derived samples, while P6810S has not been identified in current literature.

The proportion of transitions, transversion, and indels in SARS‐CoV‐2 genome is similar among the geographical areas

The genomic mutation profile of SARS‐CoV‐2 was evaluated, and the distribution of the mutations across the viral genomes from different geographical areas is summarized in Figure 3A. Overall, in total, 674 nucleotide mutations were identified using genome samples collected from December 2019–May 2020 (Table 1).
Figure 3

Characterization of nucleotide mutations in SARS‐CoV‐2. SARS‐CoV‐2 genomes were identified independently, and mutations were considered to occur spontaneously. Mutations were identified by identifying substitutions in the SARS‐CoV‐2 reference genome NCBI GenBank™ accession ID: NC_045512. (A) Nucleotide mutation frequency plot in total (overall), and in geographical clusters: China, United States, and Others. (B) Proportion of the nucleotide mutation types in SARS‐CoV‐2 genomes submitted on December 23, 2019–March 11, 2020, and (C) December 23, 2019–April 19, 2020. These were grouped as total, China, United States, and other. (D and E) Mutation density profiles of total SARS‐CoV‐2 genomes and clustered geographically: China, United States, and Others between the two time points. Mutation markers are colored according to the type of nucleotide change, that is, transition (blue), transversion (green), indel (violet). The maximum genome coverage of read‐mapped genomes for variant detection is indicated (e.g., N = 150 in total for overall data set). SARS‐CoV‐2, severe acute respiratory syndrome coronavirus 2; NCBI, National Center for Biotechnology Information

Table 1

Summary of detected nucleotide and amino acid mutations in SARS‐CoV‐2

Genome regionProtein/peptide chainDomainNucleotide positionReferenceAlleleFrequencyRelative frequencyAA positionAmino acid mutationType of AA change/mutation
5ʹ‐UTRN/AN/A4AT10.67N/AN/AN/A
26A21.33
28CT21.33
31AT10.67
34AT10.67
35AT21.33
36CT21.33
75CA10.67
104TA10.67
111TC10.67
112TG10.67
119CG10.67
120TC10.67
124GA10.67
186CT21.33
241CT2416.00
254CT21.33
ORF1abLeader protein/nsp1Leader protein/nsp2270AG10.012E>GCharged<>Neutral
313CT32.0016SilentSilent
490TA40.0375D>ESimilar Charge
508TGGTCATGTTATGGT20.0182G82_V86delDeletion
514TC10.6783SilentSilent
565TC10.67100SilentSilent
614GA10.01117A>TPolar<>Neutral
618AG10.01118Y>CPolar<>Neutral
654GA10.01130G>ECharged<>Neutral
686AAGTCATTT10.01141DeletionDeletion
721TC10.67152SilentSilent
nsp2nsp3884CT10.01207R>CCharged<>Polar
1059CT60.04265T>IPolar<>Neutral
1076CT10.01271P>SPolar<>Neutral
1102CT10.67279SilentSilent
1385CT10.01374H>YCharged<>Neutral
1397GA30.02378V>ISimilar Charge
1431ATG10.01389DeletionDeletion
1548GA10.01428S>NSimilar Charge
1623TC10.01453I>TPolar<>Neutral
1691AG10.01476I>VSimilar Charge
1895GT10.01544V>LSimilar Charge
2091CT10.01609T>IPolar<>Neutral
2269AT10.67668SilentSilent
2277TC10.01671I>TPolar<>Neutral
2388CT10.01708T>IPolar<>Neutral
2416CT21.33717SilentSilent
2446TC10.67727SilentSilent
2472CT74.67736SilentSilent
2717GA10.01818G>SPolar<>Neutral
nsp3nsp3 chain2875GA10.67870SilentSilent
2971GT10.01902M>ISimilar Charge
Nsp3 N‐terminal/DUF3655 (domain with unknown function)3037CT2315.33924SilentSilent
3099CT20.01945T>IPolar<>Neutral
3177CT40.03971P>LSimilar Charge
nsp3 chain3259GT10.01998Q>HCharged<>Polar
Macro domain3299TC10.671012SilentSilent
3333TTG10.011023Deletion
3411CT10.011049A>VSimilar Charge
3518GT10.011085V>FSimilar Charge
3738CT10.011158P>LSimilar Charge
nsp3 chain3778AG10.671171SilentSilent
Nsp3 ss‐polyA binding domain4234CT10.671323SilentSilent
4402TC64.001379SilentSilent
PL2Pro domain4780CT10.671505SilentSilent
Papain‐like viral protease4946TC10.011561S>PPolar<>Neutral
5062GT60.041599L>FSimilar Charge
5084AG10.011607I>VSimilar Charge
Papain‐like viral protease/peptidase C165572GT10.011769M>ISimilar Charge
5608AG10.671781SilentSilent
5784CT10.011840T>IPolar<>Neutral
5845AT10.011860K>NCharged<>Polar
Nucleic acid‐binding domain6026CT10.011921P>SPolar<>Neutral
6031CT10.671922SilentSilent
6035AG10.011924S>GPolar<>Neutral
6040CT21.331925SilentSilent
6312CA20.012016T>KCharged<>Polar
nsp3 chain6501CT10.012079P>LSimilar Charge
6636CT10.012124T>IPolar<>Neutral
6695CT10.012144P>SPolar<>Neutral
6819GT20.012185S>IPolar<>Neutral
6968CA10.012235L>ISimilar Charge
6996TC20.012244I>TPolar<>Neutral
7016GA10.012251G>SPolar<>Neutral
7105CT10.672280SilentSilent
7488CT10.012408T>IPolar<>Neutral
7866GT10.012534G>VSimilar Charge
8001AC10.012579D>ACharged<>Neutral
8388AG10.012708N>SSimilar Charge
nsp4nsp48653GT10.012796M>ISimilar Charge
8728AG10.672821SilentSilent
8782CT4328.672839SilentSilent
8945AG10.012894N>DCharged<>Polar
8987TA10.012908F>ISimilar Charge
9034AG10.672923SilentSilent
9157TC10.672964SilentSilent
9274AG10.673003SilentSilent
9430CT10.673055SilentSilent
9474CT10.013070A>VSimilar Charge
9477TA30.023071F>YSimilar Charge
9491CT10.013076H>YCharged<>Neutral
9534CT10.013090T>IPolar<>Neutral
9561CT10.013099S>LPolar<>Neutral
9924CT10.013220A>VSimilar Charge
10015CT10.673250SilentSilent
3C‐like proteinasePeptidase C30 domain10036CT10.673257SilentSilent
10232CT20.013323R>CCharged<>Polar
nsp6nsp610507CT10.673414SilentSilent
11075TTT10.013605_3606F_LinsFInsertion
11083GC10.013606L>FSimilar Charge
11083GT160.113606L>FSimilar Charge
11101AG10.673612SilentSilent
11410GA21.333715SilentSilent
11750CT10.013829L>FSimilar Charge
11752CT10.673829SilentSilent
nsp7nsp711764TA10.013833N>KCharged<>Polar
11916CT30.023884S>LPolar<>Neutral
11937GA10.013891C>YPolar<>Neutral
nsp8nsp811956CT10.673897SilentSilent
12102CT10.013946S>LPolar<>Neutral
12115CT10.673950SilentSilent
12473CT10.674070SilentSilent
12478GA20.014071M>ISimilar Charge
nsp9nsp912534CT10.014090T>IPolar<>Neutral
nsp10nsp1013072CT10.674269SilentSilent
13225CG10.014321F>LSimilar Charge
13226TC10.674321SilentSilent
RNA‐dependent RNA polymeraseRNA‐dependent RNA polymerase N‐terminal13620CT10.674452SilentSilent
13730CT20.014489A>VSimilar Charge
14408CT260.174715P>LSimilar Charge
RdRp chain14657CT10.014798A>VSimilar Charge
14786CT10.014841A>VSimilar Charge
14805CT64.004847SilentSilent
14849TG10.014862L>RCharged<>Neutral
14856AT10.674864SilentSilent
14858TA10.014865V>DCharged<>Neutral
RdRp catalytic15324CT42.675020SilentSilent
15418GT10.015052A>SPolar<>Neutral
15597TC10.675111SilentSilent
15607TC10.675115SilentSilent
RdRp Chain15910GT10.015216D>YCharged<>Neutral
15960CT10.675232SilentSilent
HelicaseCoVi ZnBD16272TG10.675336SilentSilent
16293CT10.675343SilentSilent
16325GC10.015354C>SSimilar Charge
16467AG10.675401SilentSilent
Helicase chain16877CT10.015538T>IPolar<>Neutral
17000CT10.015579T>IPolar<>Neutral
(+) RNA virus Helicase ATP‐binding domain17141CA10.015626A>DCharged<>Neutral
17247TC21.335661SilentSilent
17249CT10.015662A>VSimilar Charge
17280GT10.675672SilentSilent
17373CT53.335703SilentSilent
17376AG10.675704SilentSilent
17410CT10.015716R>CSimilar Charge
17423AG10.015720Y>CPolar<>Neutral
17470CT10.675736SilentSilent
(+) RNA virus Helicase C‐terminal domain17747CT90.065828P>LSimilar Charge
17825CT10.015854T>IPolar<>Neutral
17858AG100.075865Y>CPolar<>Neutral
17894CT10.015877A>VSimilar Charge
Guanine‐N7 methyltransferaseGuanine‐N7 methyltransferase18060CT117.335932SilentSilent
18115CT10.015951H>YCharged<>Neutral
18126TC10.675954SilentSilent
18603TC10.676113SilentSilent
18736TC30.026158F>LSimilar Charge
18744CT10.676160SilentSilent
18788CT10.016175T>IPolar<>Neutral
18814CT10.676184SilentSilent
18975TA10.676273SilentSilent
18996TC10.676244SilentSilent
18998CT20.016245A>VSimilar Charge
19065TC10.676267SilentSilent
19175AC10.016304D>ACharged<>Neutral
19610CT10.016449T>IPolar<>Neutral
N‐Endo, uridylate‐specific endoribonucleaseN‐Endo, uridylate‐specific endoribonuclease19684GT10.016474V>LSimilar Charge
20268AG10.676668SilentSilent
20281TC10.016673F>LSimilar Charge
20298ATT10.016679DeletionDeletion
20437AT10.016725S>CSimilar Charge
20449AT10.016729N>YPolar<>Neutral
2ʹ‐O‐Ribose methyltransferase2′‐O‐Ribose methyltransferase20692CT30.026810P>SPolar<>Neutral
20936CT10.016891T>MPolar<>Neutral
20995GA10.016911G>SPolar<>Neutral
21137AG10.016958K>RSimilar Charge
21147TC10.676961SilentSilent
21316GA10.017018D>NCharged<>Polar
21384T10.017041InsertionInsertion
21386CT10.017041S>FPolar<>Neutral
21387TT10.017042InsertionInsertion
21426TG10.677054SilentSilent
SSpike protein S1/surface glycoprotein S1Spike protein S1/surface glycoprotein S1 chain21595CT10.6711SilentSilent
Spike protein S1 N‐terminus21644TA10.0128Y>NPolar<>Neutral
21691CT10.6743SilentSilent
21707CT20.0149H>YCharged<>Neutral
21711CT10.0150S>LPolar<>Neutral
21784TA10.0174N>KCharged<>Polar
21830GT10.0190V>FSimilar Charge
21906AG10.01115Q>RCharged<>Polar
21991TTA10.01145DeletionDeletion
22033CA10.01157F>LSimilar Charge
22104GT10.01181G>VSimilar Charge
22151AG10.01197I>VSimilar Charge
22224CG10.01221S>WPolar<>Neutral
22303TG10.01247S>RCharged<>Polar
22432CT10.67290SilentSilent
22468GT10.67302SilentSilent
Spike receptor binding domain/spike protein S1 C‐terminal domain22785GT10.01408R>ICharged<>Neutral
Spike receptor binding domain23185CT10.67541SilentSilent
23271CT10.01570A>VSimilar Charge
Spike protein S1/surface glycoprotein S1 chain23403AG260.17614D>GCharged<>Neutral
23520CT10.01653A>VSimilar Charge
23613CT10.01684A>VSimilar Charge
Spike protein S2/surface glycoprotein S2Spike protein S2 chain23876GA10.01772V>ISimilar Charge
23929CT21.33789SilentSilent
23952TG10.01797F>CPolar<>Neutral
Spike protein S2′ chain/fusion peptide 124022TC10.67820SilentSilent
24023CT10.67821SilentSilent
Spike protein S2′ chain/heptad repeat 124325AG32.00921SilentSilent
24351CT10.01930A>VSimilar Charge
24370CT10.67936SilentSilent
Spike protein S2′ chain24694AT21.331044SilentSilent
24789CT10.011076T>IPolar<>Neutral
24862AG10.671100SilentSilent
Spike protein S2′ chain/heptad repeat 225156CT10.671198SilentSilent
ORF3aORF3a proteinORF3a protein25433CT10.0114T>IPolar<>Neutral
25533TA10.6747SilentSilent
25534GT10.0148V>FSimilar Charge
25563GT50.0357Q>HCharged<>Polar
25672CA10.0194L>ISimilar Charge
25687GT10.0199A>SPolar<>Neutral
25771CA10.01127L>ISimilar Charge
25775GT10.01128W>LSimilar Charge
25806AT10.67138SilentSilent
25810CG10.01140L>VSimilar Charge
25979GT30.02196G>VSimilar Charge
26048TG10.01219L>WSimilar Charge
26088CT10.67232SilentSilent
26144GT100.07251G>VSimilar Charge
EEnvelope proteinEnvelope protein26326CT21.3328SilentSilent
26354TA10.0137L>HCharged<>Neutral
MMembrane glycoproteinMembrane glycoprotein26526GT10.012A>SPolar<>Neutral
26530AG10.013D>GCharged<>Neutral
26729TC53.3369SilentSilent
26849GT10.01109M>ISimilar Charge
27046CT10.01175T>MPolar<>Neutral
ORF6ORF6 proteinORF6 protein27225GT10.018Q>HCharged<>Polar
27299TC10.0133I>TPolar<>Neutral
27384TC10.6761SilentSilent
ORF7aORF7a proteinSARS coronavirus X4 like27635CT10.0181S>LPolar<>Neutral
ORF8ORF 8 proteinORF 8 protein27925CT10.0111T>IPolar<>Neutral
27964CT10.0124S>LPolar<>Neutral
28077GC50.0362V>LSimilar Charge
28144TC430.2984L>SPolar<>Neutral
NNucleocapsid phosphoproteinNucleocapsid phosphoprotein28311CT10.0113P>LSimilar Charge
28378GT21.3335SilentSilent
28409CT10.0146P>SPolar<>Neutral
28657CT32.00128SilentSilent
28688TC32.00139SilentSilent
28792AT10.67173SilentSilent
28854CT20.01194S>LPolar<>Neutral
28857GT10.01195R>ICharged<>Neutral
28863CT30.02197S>LPolar<>Neutral
28878GA10.01202S>NSimilar Charge
28881GA100.07203R>KSimilar Charge
28882GA100.07204G>RCharged<>Neutral
28883GC100.07204G>RCharged<>Neutral
28887CT10.01205T>IPolar<>Neutral
28896CG20.01208A>GSimilar Charge
28916GA10.01215G>SPolar<>Neutral
28985GT10.01238G>CPolar<>Neutral
29029TC10.67252SilentSilent
29095CT53.33274SilentSilent
29140GT10.01289Q>HCharged<>Polar
29148TC10.01292I>TPolar<>Neutral
29230CT10.67319SilentSilent
29301AT10.01343A>VSimilar Charge
29303CT20.01344P>SPolar<>Neutral
29527GA21.33418SilentSilent
Between N and ORF10 non coding regionN/AN/A29540GA21.33
ORF10ORF10 proteinORF10 protein29563CT10.672SilentSilent
29573GA10.016V>ISimilar Charge
ORF10, stem loopN/AN/A29635CT10.67N/AN/AN/A
3′ UTR: 3′ UTRN/AN/A29695AG10.67N/AN/AN/A
29700AG32.00
29736GT32.00
29742GA10.67
29742GT21.33
29750CGATCGAGTG10.67
29751GC21.33
29786GC10.67
29844AG10.67
29845TG21.33
29846TA10.67
29847TG10.67
29848TG10.67
29861GA21.33
29864GA10.67
29867TA21.33
29868GA21.33
29868GC53.33
29870CA53.33
29873AT10.67

Abbreviations: E, envelope; M, membrane glycoprotein; N, nucleocapsid; S, spike; SARS‐CoV‐2, severe acute respiratory syndrome coronavirus 2; UTR, untranslated region.

Characterization of nucleotide mutations in SARS‐CoV‐2. SARS‐CoV‐2 genomes were identified independently, and mutations were considered to occur spontaneously. Mutations were identified by identifying substitutions in the SARS‐CoV‐2 reference genome NCBI GenBank™ accession ID: NC_045512. (A) Nucleotide mutation frequency plot in total (overall), and in geographical clusters: China, United States, and Others. (B) Proportion of the nucleotide mutation types in SARS‐CoV‐2 genomes submitted on December 23, 2019–March 11, 2020, and (C) December 23, 2019–April 19, 2020. These were grouped as total, China, United States, and other. (D and E) Mutation density profiles of total SARS‐CoV‐2 genomes and clustered geographically: China, United States, and Others between the two time points. Mutation markers are colored according to the type of nucleotide change, that is, transition (blue), transversion (green), indel (violet). The maximum genome coverage of read‐mapped genomes for variant detection is indicated (e.g., N = 150 in total for overall data set). SARS‐CoV‐2, severe acute respiratory syndrome coronavirus 2; NCBI, National Center for Biotechnology Information Summary of detected nucleotide and amino acid mutations in SARS‐CoV‐2 Abbreviations: E, envelope; M, membrane glycoprotein; N, nucleocapsid; S, spike; SARS‐CoV‐2, severe acute respiratory syndrome coronavirus 2; UTR, untranslated region. Generally, mutation frequencies among the geographical areas followed 3:1 transition to transversion ratio (Figure 3B,C), in which the C>T substitution was most common (44.7%), followed by T>C (13.95%). Interestingly, ORF3a and 3ʹ UTR genes had higher transversion density than transition similar between the two timepoints; while G>T transversion (10.83%), was the third most frequently occurring nucleotide change. Altogether, approximately similar proportions of nucleotide change types were observed between genomes among the geographical areas collected from December to March 2020 versus December–May 2020 (Figures 3B,C). These findings may suggest that the genomic mutation characteristics of SARS‐CoV‐2 from the earlier timepoint may not be significantly varied from the later period (e.g., between March and May 2020). Among the SARS‐CoV‐2 genomic regions, the UTRs yielded the highest mutation density, with 7.5 × 10−3 mutation density at the 5ʹ‐UTR and 2.5 × 10−2 mutation density at the 3ʹ‐UTR among all geographical areas, for both timepoints (Figure 3D,E). Notably, indels were found mostly at the UTRs. As shown in Figure 4B, no UTR mutations were common among all areas, while mutations common between United States and Others are at 5ʹ‐UTR (241C>T) and 3ʹ‐UTR (29742G>T and 29870C>A); and between China and Others, 26delA and 28C>T at the 5ʹ‐UTR were common. Overall, the UTRs are consistently densely mutated suggesting that these genome regions are mutation prone regions of the SARS‐CoV‐2 genome.
Figure 4

Commonly occurring hotspots were identified per geographical clusters (China, United States, and Others) and results were summarized as the number of SARS‐CoV‐2 genome positions with (A) missense and indel mutations at the protein‐coding regions and (B) detected mutations at the untranslated regions (UTRs), mainly, 5ʹ and 3ʹ‐UTR

Commonly occurring hotspots were identified per geographical clusters (China, United States, and Others) and results were summarized as the number of SARS‐CoV‐2 genome positions with (A) missense and indel mutations at the protein‐coding regions and (B) detected mutations at the untranslated regions (UTRs), mainly, 5ʹ and 3ʹ‐UTR

Most amino acid substitutions in SARS‐CoV‐2 genomes from the United States and Others geographic areas resulted to residues with a similar nature (“Similar Change”) for both time points

The impact of overall genomic mutation characteristics in the viral proteins were then investigated from the genomic data and the description of these will be according to geographic area and will be magnified towards the differences between the two time points. Most of the nucleotide mutations in the SARS‐CoV‐2 genome (62.01%) lead to missense mutation in their proteins. Genome reference positions or nucleotide mutation hotspots 11083 (ORF1ab; nsp6), 26144 (ORF3a), and 28144 (ORF8) were common among all geographical areas (Figure 4A). Most of the amino acid substitutions in China were “Polar ←→ Neutral” changes (66.67%) for the first time point, while this proportion decreased at the second time point (57.14%), with an addition of deletion mutations (1.43%). There was also an increase in substitutions where residues had a “Similar Change” in nature (e.g., valine ←→ isoleucine; 18.52% ‐ 12/2019‐03/2020; 31.43% ‐ 12/2019‐04/2020). These data could be seen in Figure 5B. Furthermore, the mutation hotspots based on mutation densities also changed in China, where mutations in the Spike glycoprotein, Protein 3a, Membrane protein, ORF6 protein, and ORF10 protein appeared in the second time point (Figure 5C).
Figure 5

Characterization of amino acid mutations in SARS‐CoV‐2. Collection dates refer to the collection dates according to the annotated date of collection from GISAID or NCBI GenBank. (A) Comparison of the amino acid mutations according to the nature of the change in charge between earlier and overall data, and across different geographic clusters (China, United States, and Others). (B) Proportion of the nature of the change in amino acid charge and Indels that occurred in the total sample population, and for the geographic clusters between earlier and overall data. (C) Comparison of the mutation density profiles between earlier and overall data. Red indicates mutation density values resulting in amino acids having dissimilar nature to the reference, while blue indicates the mutation density values that resulted in amino acids having similar nature to the reference. The maximum genome coverage of read‐mapped genomes for variant detection are indicated (e.g., N = 150 in overall total for overall data set). SARS‐CoV‐2, severe acute respiratory syndrome coronavirus 2; NCBI, National Center for Biotechnology Information

Characterization of amino acid mutations in SARS‐CoV‐2. Collection dates refer to the collection dates according to the annotated date of collection from GISAID or NCBI GenBank. (A) Comparison of the amino acid mutations according to the nature of the change in charge between earlier and overall data, and across different geographic clusters (China, United States, and Others). (B) Proportion of the nature of the change in amino acid charge and Indels that occurred in the total sample population, and for the geographic clusters between earlier and overall data. (C) Comparison of the mutation density profiles between earlier and overall data. Red indicates mutation density values resulting in amino acids having dissimilar nature to the reference, while blue indicates the mutation density values that resulted in amino acids having similar nature to the reference. The maximum genome coverage of read‐mapped genomes for variant detection are indicated (e.g., N = 150 in overall total for overall data set). SARS‐CoV‐2, severe acute respiratory syndrome coronavirus 2; NCBI, National Center for Biotechnology Information In the United States, the proportions of the type of amino acid substitutions did not change drastically (Figure 5). “Polar ←→ Neutral” mutations were almost similar between the two time points (36.36%12/2019‐03/2020; 36.57% ‐ 12/2019‐04/2020), while “Similar change” mutations changed minimally (46.75%12/2019‐03/2020; 47.01% ‐ 12/2019‐04/2020).). “Similar change” mutations had the highest frequency among the mutation types in United States samples. Mutation density presented in bar graphs show that there was an appearance of amino acid substitutions in the M and ORF7a proteins (Figure 5C). For the Others geographic area, there is a great change in the proportions of mutations that are “Polar ←→ Neutral,”“Charged ←→ Polar,” and “Charged ←→ Neutral” (Figure 5B). The proportion of “Polar ←→ Neutral” mutations in the earlier time point was higher than that of the second time point (31.71% → 22.49%) as shown in Figure 5B. The proportion of “Charged ←→ Polar” and “Charged ←→ Neutral” mutations increased between the two time points (4.88% → 7.10% “Charged ←→ Polar”; 4.88% → 18.34% “Charged ←→ Neutral”). Appearance of mutations in the M protein and ORF6 protein occurred in the Others geographic area according to the mutation density graphs (Figure 5C), with the appearance of “Similar Change” substitutions in the second time point for the ORF8 protein and N protein as compared to the initial time point (Figure 5C). The total count of amino acid substitutions in the proteins of SARS‐CoV‐2 was 381. From the data that was collected at the earlier timepoint, most of the mutations in the proteins were classified under “Similar Change” (44.83%), while insertions were the least frequent (1.15%). In addition of data from the later study timepoint, “Similar Change” mutations were most frequent however with decreased proportion (43.45%); and insertions was also the least frequent then at 0.84% proportion. The breakdown of the mutations in the SARS‐CoV‐2 proteins based on the collected genomes are shown in Figure 5A. Overall, there was an observed shift in the proportions of the different classes of amino acid mutations between the two collection periods and geographic areas. There was an increase in proportion of “Similar Change” mutations in China between the two collection periods, while deletion mutations emerged at later time (Figure 5A). In comparison with United States, the proportion of the classes of amino acid mutations were generally unchanged. Prominent mutations have been found and further evaluated in this study in a spacio‐temporal perspective, which involve both structural and nonstructural proteins of SARS‐CoV‐2.

The D614G substitution in the spike glycoprotein is the most frequently occurring mutation among the structural proteins and occurred mostly in the Others geographic area

In samples from China, the D614G substitution did not occur, in both time points (Figure 5A), however, in the United States samples, there was an increase in the frequency of the D614G mutation (n D614G = 1 → n D614G = 8; Figure 5A). The same pattern was seen in the Others geographic area (n D614G = 4 → n D614G = 18). The mutation density of the spike glycoprotein increased in all of the geographic areas (China, United States, and Others areas, based on Figure 5C). The D614G substitution in the Spike glycoprotein (S) occurred five times in the sample population from the data collected at earlier time and occurred 26 times from the overall total data. This mutation occurred with the P4715L (ORF1ab) mutation (Figures 2B and 5A). The D614G is a result of a transition mutation in the S gene of SARS‐CoV‐2 (23403A>G) and classified as “Charged←→Neutral” aa mutation. The mutation density S based on earlier data was 0.01414 mutation events/aa length of S glycoprotein, while this value approximately doubled based on the overall data. In addition, four other hotspots in the spike protein were detected in this study (Table 1). These data may suggest that the S variant occurred outside of China and is more observed in separate countries and in the United States.

ORF7b protein coldspots and ORF8 protein hotspots are conserved among all geographical areas

Among the geographical areas, no mutations were found in ORF6, ORF7a/7b, ORF9b, ORF10, and ORF14 proteins by the earlier study timepoint, hence considered as coldspots at that period (Figures 3D and 5B). On the other hand, at the later time point, only ORF7b, ORF9b, and ORF14 proteins were identified as mutation coldspots (Figures 3E and 5B). Note that it may be due to limitations in annotation of various viral genome regions that no mutations were detected in ORF9b and ORF14 proteins, as the study based the identification of genes and proteins from publicly available annotation to reference sequence (NCBI GenBank™ Accession ID: NC_045512). All in all, the ORF7b gene/protein was observed to have no mutations in all geographical region and between the study timepoints, therefore this gene may be potentially conserved in SARS‐CoV‐2. Prominently, ORF8 protein presented the highest mutation density among nonstructural proteins (0.223 mutations/aa site in overall total), similar in all geographical areas similar in two timepoints (Figures 3D,E and 5B). Collectively at the later timepoint, its mutation density almost doubled. Along with the increased in mutation densities in other notable sites: doubled in nsp3 (0.072 mutations/aa site by March‐0.136 mutations/aa site by May), and quadrupled in the RNA‐dependent‐RNA polymerase (RDRP) (0.0139 mutations/aa site by March‐0.0515 mutations/aa site by May). The recurrence of ORF8 mutations were attributed to L84S which consistently was the most frequently occurring in China and United States (Figures 3A and 5A). In Others, however, the most recurrent mutation varied that was G251V in protein 3a in earlier timepoint, while P4715L in RDRP by the later timepoint (Figures 3A and 5A). This may suggest that the distinctive abundance of ORF8 mutations is generally similar among different areas, as its collective frequency increases over time.

The Nucleocapsid Phosphoprotein (N) exhibited the highest mutation density among the structural proteins of SARS‐CoV‐2

For both time points, N had the highest mutation density (0.02148 for earlier data; 0.1122 for overall data). Twelve nucleotide sites considered as hotspots in N, comprising 48% of the mutations in N (Table 1). Mutation densities of the other structural proteins are shown in Figure 5B. Interestingly, 10 SARS‐CoV‐2 samples had a substitution mutation in nucleotide positions 28881–28883 (GGG>AAC). This nucleotide mutation led to two amino acid substitutions (R203K and G204R). The earliest recorded SARS‐CoV‐2 genome having this mutation was from Florida, USA (February 28, 2020; accession ID: MT276330) while the other nine genomes that have this mutation come from the Others geographic area (Israel, Peru, Brazil, Greece, Czech Republic, and Argentina). However, the order of mutation densities of structural proteins among geographic areas varied, with the Others geographic area having N as the third highest mutation density for the overall data (Figure 5C). These suggest that the mutation in the N protein did not occur initially in China but occurred first from the United States.

DISCUSSION

Presence of a novel mutation and a high frequency mutation in SARS‐CoV‐2

Nsp16 is responsible for the messenger RNA capping of the coronavirus genome, primarily to protect from host recognition. According to the crystal structure of nsp16, the domain of P6810 in nsp16 is unknown, however, it is characterized as part of a bend in nsp16. Proline exhibits conformational rigidity projected to result to a kink; its substitution may cause a change in the steric conformation of the aforementioned bend. In addition, one of the immediate surrounding amino acids of nsp16–nsp10 complex that is proximal to P6810 is a tryptophan at aa position 7029 of ORF1ab. Substitution of serine (P6810S) might exhibit an enhanced interaction for hydrogen bonding with tryptophan. , There is a need to further investigate this mutation to determine its significance in host evasion. It is important also to further evaluate its prevalence in the Chinese population, and in the global population to fully understand its implications in the function of nsp16. The increased recurrence of L84S mutation may suggest that this variant might be favorable for virus' survival across geographical regions. , The subclades of L84S have mutations that may affect viral replication, immune evasion, viral release, and virion assembly. , , , , Further research may ascertain the changes in the function of ORF8 due to this mutation, in virus replication, as well as potential changes in immune evasion and viral release.

Comparison of mutations in different SARS‐CoV‐2 studies reveal similarities and differences in mutation patterns

Observations in this study are consistent with the general pattern where transitions are more prevalent over transversions, perhaps due to steric considerations. , Interestingly, mutations in ORF3a (modulating host immune response), and 3′‐UTR (RNA stability and translation) consists largely of transversions, suggesting that these regions may be more erroneous than other regions and more prone to random substitution of transversions. This might suggest that there are changes in virulence and replication stability across global regions. Differences in findings may be observed based on previously published literature, using the mutation landscape of SARS‐CoV‐2. A study by Pachetti et al. described that a mutation in RDRP (nt14408) increased in count, 7 (February) to 10 (cumulative by March). This was consistent with this study's findings with greater recurrence; 4 occurrences (March) to 26 (cumulative by May). In addition, another research by Kim et al. also described the low frequency of mutations in E, M, and ORF7a, similar by this study's result. Other studies such as this described high frequency of mutations in ORF1ab and may be attributed to the relatively high genome length of the region. To address this, this study normalized the factor of gene length and presented the data through mutation densities of each gene in SARS‐CoV‐2. Discrepancies in mutation frequencies between this study and that of Tiwari and Mishra may be attributed to the following reasons: (1) In this study, a single frequency of a mutation is already considered a valid mutation. In contrast to Tiwari and Mishra's study, mutations should occur at least three times before these were considered as legitimate mutations. (2) Since the samples considered in this study were collected at a later time during the pandemic, thus providing more time and opportunity for the virus to accumulate mutations. In contrast to Tiwari and Mishra's study where samples were collected earlier into the pandemic, less time for the virus to accumulate mutations.

Implications of identified mutations in SARS‐CoV‐2 to treatment options and diagnostics

Remdesivir is currently at Phase 3 of COVID‐19 clinical trials, which is known to inhibit RDRP. The active component of remdesivir (GS‐441524; adenosine nucleotide analog) binds to RDRP catalytic site and halts nucleic acid elongation. The missense mutation (D722Y) occurred at the catalytic site along with neighboring variants (V472D and L469S), a change from an acidic to a nonpolar residue, may potentially result to increase in hydrophobicity at the region, leading to a more elusive conformation. This potential impact may significantly influence the RDRP conformation which might challenge the effectivity of remdesivir. Hence, SARS‐CoV‐2 RDRP mutations, especially considering regional variability, should be further investigated on their potential effect on RDRP structure and function to support the use of remdesivir. The absence of D614G mutation in China while it was abundant in the Others geographic area suggest potentially variable effectiveness of vaccines and neutralization factors that target the RBD among different geographic areas. Alternatively, relatively conserved regions in Spike heptad 1‐heptad two repeats, may present as potential drug or vaccine targets, inhibiting viral entry. As shown in this study, mutations in the Spike glycoprotein could confer alterations in its domains which may be involved in epitope recognition (i.e., RBD, S1‐N terminal domain) of neutralizing antibodies (nAbs). , Hence, binding of the potential nAb with putative SARS‐CoV‐2 epitopes may be hindered. Further studies should be done to evaluate putative effectiveness of neutralizing monoclonal antibodies against SARS‐CoV‐2. The changes in the mutation frequencies and densities in N imply that the evolution of the genes and proteins of N over time in different landmasses is beneficial for the adaptation of SARS‐CoV‐2 as it spreads globally. Currently, the WHO, and the Centers for Disease Control and Prevention recommend the use of N1 and N2 genes in COVID‐19 surveillance. Recent publications have criticized the use of these genes in COVID‐19 diagnosis using reverse transcriptase‐polymerase chain reaction (RT‐PCR) because of its relatively high mutation index. , There are variants that fall in the forward primer for N3, and in the reverse primer of N1, (nt 28688). This was a hotspot mutation in the genome and proteome of SARS‐C‐oV‐2, as observed in this study. These support that the variations in N may pose difficulties in diagnosis using N‐targeted primers for quantitative RT‐PCR. The SARS‐CoV‐2 genomes used in this study are assumed to have come from individuals undergoing COVID‐19 testing and before any of them received antiviral treatment. Since SARS‐CoV‐2 genomes from individuals who have received antiviral treatment are not currently available, comparisons on the mutation patterns between these two groups cannot be determined yet, but speculations can be made. Mutations in the virus can exist and persist in the absence of selective pressure, therefore the diversity of mutations is high and no variants exist with unusually high frequencies. This is likely the phenomenon we have observed, with a few exceptions like L84S (ORF8), D614G (S), and L3606F (ORF1ab). However, antiviral drugs can serve as selective pressure against certain types of mutations in the viruses, possibly reducing the overall diversity of the virus, but at the same time, increasing the frequencies of a select few virus variants that are resistant to the antiviral drug. These variants may be more dominant in the population and this may affect the overall patterns and frequencies of mutations in SARS‐CoV‐2. In conclusion, this study highlights the importance of the characterization of both nucleotide and amino acid mutation landscape in SAR‐CoV‐2 to identify hotspots and coldspots that may be significant in the effectivity of diagnostic tools and treatment options for COVID‐19, over the different areas worldwide as the pandemic continues.

CONFLICT OF INTERESTS

The authors declare that there are no conflict of interests regarding this study.

AUTHOR CONTRIBUTIONS

Christian Luke D. C. Badua, Karol Ann T. Baldo, and Paul Mark B. Medina designed this study. Christian Luke D. C. Badua and Karol Ann T. Baldo equally contributed to data collection, data analysis, technical graphics and processing, and writing the paper. Paul Mark B. Medina contributed to critical evaluation of the figures and results, and the critical review of the manuscript. All authors contributed to revising the manuscript and approving of the final version submitted. Supporting information. Click here for additional data file.
  28 in total

1.  Neighboring-nucleotide effects on single nucleotide polymorphisms: a study of 2.6 million polymorphisms across the human genome.

Authors:  Zhongming Zhao; Eric Boerwinkle
Journal:  Genome Res       Date:  2002-11       Impact factor: 9.043

2.  SARS-CoV envelope protein palmitoylation or nucleocapid association is not required for promoting virus-like particle production.

Authors:  Ying-Tzu Tseng; Shiu-Mei Wang; Kuo-Jung Huang; Chin-Tien Wang
Journal:  J Biomed Sci       Date:  2014-04-27       Impact factor: 8.410

3.  Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding.

Authors:  Roujian Lu; Xiang Zhao; Juan Li; Peihua Niu; Bo Yang; Honglong Wu; Wenling Wang; Hao Song; Baoying Huang; Na Zhu; Yuhai Bi; Xuejun Ma; Faxian Zhan; Liang Wang; Tao Hu; Hong Zhou; Zhenhong Hu; Weimin Zhou; Li Zhao; Jing Chen; Yao Meng; Ji Wang; Yang Lin; Jianying Yuan; Zhihao Xie; Jinmin Ma; William J Liu; Dayan Wang; Wenbo Xu; Edward C Holmes; George F Gao; Guizhen Wu; Weijun Chen; Weifeng Shi; Wenjie Tan
Journal:  Lancet       Date:  2020-01-30       Impact factor: 79.321

4.  Genome-Wide Identification and Characterization of Point Mutations in the SARS-CoV-2 Genome.

Authors:  Jun-Sub Kim; Jun-Hyeong Jang; Jeong-Min Kim; Yoon-Seok Chung; Cheon-Kwon Yoo; Myung-Guk Han
Journal:  Osong Public Health Res Perspect       Date:  2020-06

5.  aLeaves facilitates on-demand exploration of metazoan gene family trees on MAFFT sequence alignment server with enhanced interactivity.

Authors:  Shigehiro Kuraku; Christian M Zmasek; Osamu Nishimura; Kazutaka Katoh
Journal:  Nucleic Acids Res       Date:  2013-05-15       Impact factor: 16.971

6.  Spread, circulation, and evolution of the Middle East respiratory syndrome coronavirus.

Authors:  Matthew Cotten; Simon J Watson; Alimuddin I Zumla; Hatem Q Makhdoom; Anne L Palser; Swee Hoe Ong; Abdullah A Al Rabeeah; Rafat F Alhakeem; Abdullah Assiri; Jaffar A Al-Tawfiq; Ali Albarrak; Mazin Barry; Atef Shibl; Fahad A Alrabiah; Sami Hajjar; Hanan H Balkhy; Hesham Flemban; Andrew Rambaut; Paul Kellam; Ziad A Memish
Journal:  mBio       Date:  2014-02-18       Impact factor: 7.867

7.  Neutralizing Antibodies against SARS-CoV-2 and Other Human Coronaviruses.

Authors:  Shibo Jiang; Christopher Hillyer; Lanying Du
Journal:  Trends Immunol       Date:  2020-04-02       Impact factor: 16.687

8.  Genomic characterization of a novel SARS-CoV-2.

Authors:  Rozhgar A Khailany; Muhamad Safdar; Mehmet Ozaslan
Journal:  Gene Rep       Date:  2020-04-16

9.  Genomic and proteomic mutation landscapes of SARS-CoV-2.

Authors:  Christian Luke D C Badua; Karol Ann T Baldo; Paul Mark B Medina
Journal:  J Med Virol       Date:  2020-10-08       Impact factor: 20.693

View more
  25 in total

1.  VIR-CRISPR: Visual in-one-tube ultrafast RT-PCR and CRISPR method for instant SARS-CoV-2 detection.

Authors:  Rui Wang; Yongfang Li; Yanan Pang; Fang Zhang; Fuyou Li; Shihua Luo; Chunyan Qian
Journal:  Anal Chim Acta       Date:  2022-05-13       Impact factor: 6.911

2.  Small-Molecule Thioesters as SARS-CoV-2 Main Protease Inhibitors: Enzyme Inhibition, Structure-Activity Relationships, Antiviral Activity, and X-ray Structure Determination.

Authors:  Thanigaimalai Pillaiyar; Philipp Flury; Nadine Krüger; Haixia Su; Laura Schäkel; Elany Barbosa Da Silva; Olga Eppler; Thales Kronenberger; Tianqing Nie; Stephanie Luedtke; Cheila Rocha; Katharina Sylvester; Marvin R I Petry; James H McKerrow; Antti Poso; Stefan Pöhlmann; Michael Gütschow; Anthony J O'Donoghue; Yechun Xu; Christa E Müller; Stefan A Laufer
Journal:  J Med Chem       Date:  2022-06-16       Impact factor: 8.039

3.  Exploring COVID-19 pathogenesis on command-line: A bioinformatics pipeline for handling and integrating omics data.

Authors:  Janaina Macedo-da-Silva; João Victor Paccini Coutinho; Livia Rosa-Fernandes; Suely Kazue Nagahashi Marie; Giuseppe Palmisano
Journal:  Adv Protein Chem Struct Biol       Date:  2022-05-12       Impact factor: 5.447

Review 4.  Biochemical features and mutations of key proteins in SARS-CoV-2 and their impacts on RNA therapeutics.

Authors:  Li Zeng; Dongying Li; Weida Tong; Tieliu Shi; Baitang Ning
Journal:  Biochem Pharmacol       Date:  2021-01-19       Impact factor: 5.858

5.  A deletion in SARS-CoV-2 ORF7 identified in COVID-19 outbreak in Uruguay.

Authors:  Yanina Panzera; Natalia Ramos; Sandra Frabasile; Lucía Calleros; Ana Marandino; Gonzalo Tomás; Claudia Techera; Sofía Grecco; Eddie Fuques; Natalia Goñi; Viviana Ramas; Leticia Coppola; Héctor Chiparelli; Cecilia Sorhouet; Cristina Mogdasy; Juan Arbiza; Adriana Delfraro; Ruben Pérez
Journal:  Transbound Emerg Dis       Date:  2021-03-05       Impact factor: 4.521

6.  A global analysis of conservative and non-conservative mutations in SARS-CoV-2 detected in the first year of the COVID-19 world-wide diffusion.

Authors:  Nicole Balasco; Gianluca Damaggio; Luciana Esposito; Flavia Villani; Rita Berisio; Vincenza Colonna; Luigi Vitagliano
Journal:  Sci Rep       Date:  2021-12-30       Impact factor: 4.379

7.  Distinct Symptoms and Underlying Comorbidities with Latitude and Longitude in COVID-19: A Systematic Review and Meta-Analysis.

Authors:  Yong Tian; Qian Wu; Hongwei Li; Qi Wu; Yi Xie; Li Li; Huaiyong Chen
Journal:  Can Respir J       Date:  2022-01-27       Impact factor: 2.130

8.  Ex vivo and in vivo suppression of SARS-CoV-2 with combinatorial AAV/RNAi expression vectors.

Authors:  Jonas Becker; Megan Lynn Stanifer; Sarah Rebecca Leist; Bettina Stolp; Olena Maiakovska; Ande West; Ellen Wiedtke; Kathleen Börner; Ali Ghanem; Ina Ambiel; Longping Victor Tse; Oliver Till Fackler; Ralph Steven Baric; Steeve Boulant; Dirk Grimm
Journal:  Mol Ther       Date:  2022-01-14       Impact factor: 12.910

9.  Rapid, inexpensive methods for exploring SARS CoV-2 D614G mutation.

Authors:  Sirwan M A Al-Jaf; Sherko S Niranji; Zana H Mahmood
Journal:  Meta Gene       Date:  2021-07-18

10.  Genomic and proteomic mutation landscapes of SARS-CoV-2.

Authors:  Christian Luke D C Badua; Karol Ann T Baldo; Paul Mark B Medina
Journal:  J Med Virol       Date:  2020-10-08       Impact factor: 20.693

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.