
The importance of accessory protein variants in the pathogenicity of SARS-CoV-2.

Sk Sarif Hassan1, Pabitra Pal Choudhury2, Guy W Dayhoff3, Alaa A A Aljabali4, Bruce D Uhal5, Kenneth Lundstrom6, Nima Rezaei7, Damiano Pizzol8, Parise Adadi9, Amos Lal10, Antonio Soares11, Tarek Mohamed Abd El-Aziz12, Adam M Brufsky13, Gajendra Kumar Azad14, Samendra P Sherchan15, Wagner Baetas-da-Cruz16, Kazuo Takayama17, Ángel Serrano-Aroca18, Gaurav Chauhan19, Giorgio Palu20, Yogendra Kumar Mishra21, Debmalya Barh22, Raner José Santana Silva23, Bruno Silva Andrade24, Vasco Azevedo25, Aristóteles Góes-Neto26, Nicolas G Bazan27, Elrashdy M Redwan28, Murtaza Tambuwala29, Vladimir N Uversky30.

Abstract

The coronavirus disease 2019 (COVID-19) is caused by the Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2), with an estimated fatality rate of less than 1%. The SARS-CoV-2 accessory proteins ORF3a, ORF6, ORF7a, ORF7b, ORF8, and ORF10 possess putative functions to manipulate host immune mechanisms. These involve interferons, which appear as a consensus function, the immune signaling receptor NLRP3 (NLR family pyrin domain-containing 3) inflammasome, and inflammatory cytokines such as interleukin 1β (IL-1β), and are critical in COVID-19 pathology. Widespread variations of each of the six accessory proteins were observed across six continents in all complete SARS-CoV-2 proteomes, based on the data reported before November 2020. A decreasing order of percentage of unique variations in the accessory proteins was determined as ORF3a > ORF8 > ORF7a > ORF6 > ORF10 > ORF7b across all continents. The highest and lowest unique variations of ORF3a were observed in South America and Oceania, respectively. These findings suggest that the wide variations in accessory proteins may affect the pathogenicity of SARS-CoV-2.
Copyright © 2022 Elsevier Inc. All rights reserved.

Keywords:  ORF10; ORF3a; ORF6; ORF7a; ORF7b; ORF8; Pathogenicity; SARS-CoV-2

Year:  2022        PMID: 35085577      PMCID: PMC8785432          DOI: 10.1016/j.abb.2022.109124

Source DB:  PubMed          Journal:  Arch Biochem Biophys        ISSN: 0003-9861            Impact factor:   4.114


Executive summary

SARS-CoV-2 accessory proteins ORF3a, ORF6, ORF7a, ORF7b, ORF8, and ORF10 have putative functions to manipulate the host immune system. Inflammatory cytokines, such as interleukin 1β (IL-1β), IL-6, and TNF, are critical in COVID-19 pathology. Extensive heterogeneity was found across six continents for each of the six accessory proteins in all the sequenced SARS-CoV-2 proteomes.

Introduction

SARS-CoV-2 (Severe Acute Respiratory Syndrome Coronavirus-2) is the causative agent of the coronavirus disease 2019 (COVID-19) pandemic, with an estimated fatality rate of less than 1% [1]. However, Dr Michael Ryan, Executive Director of the Health Emergencies Program at the World Health Organization (WHO), indicated in October 2020 that 760 million people might have been infected by SARS-CoV-2, which gives a hypothetical fatality rate of 0.14%, with approximately one million lives lost. SARS-CoV-2 is a member of the Betacoronavirus (lineage B) genus. The Sarbecovirus subgenus was suggested to have diverged from the lineage of Bat Coronavirus (BatCoV) RaTG13 in 1969, with a 95% highest posterior density interval of the years 1930–2000 [2]. Among previously identified human coronaviruses (HCoVs), Severe Acute Respiratory Syndrome-Coronavirus (SARS-CoV), which caused the SARS epidemic in 2002–2004, is the closest member to SARS-CoV-2 [2,3]. SARS-CoV possesses eight open reading frames (ORFs), ORF3a, ORF3b, ORF6, ORF7a, ORF7b, ORF8a, ORF8b, and ORF9b, which were suggested to have intrinsic and secondary roles rather than the primary roles described for cellular entry in the viral life cycle [4,5]. For instance, the ORFs are transcribed throughout the second phase of replication as positive-strand subgenomic mRNAs using a negative-sense viral RNA template [6]. Thus, due to their intrinsic nature, accessory proteins are not targets for positive selection in the way that the extrinsic, primary functional Spike (S) protein, which contains the receptor-binding domain (RBD) and protease cleavage sites, is [7]. High-frequency non-synonymous mutations, such as D614G in the S protein, detected in clinical SARS-CoV-2 isolates have increased host cell entry via the angiotensin converting enzyme 2 (ACE2) receptor and transmembrane protease serine 2 (TMPRSS2) [8].
Therefore, due to the intrinsic nature and secondary order in viral transcription, a lower selective pressure to induce mutations in accessory proteins is expected. Thus, despite the 19–89 years of estimated genomic divergence between RaTG13 and SARS-CoV-2, the sequence identity between their accessory proteins is very high: 98.5% for ORF3a, 100% for ORF6, 97.5% for ORF7a, 97.6% for ORF7b, 95% for ORF8, and 97.3% for ORF10 (Figs. 1–6). This indicates that the direct ancestor of SARS-CoV-2 had somehow been exposed to almost no selection pressure to manipulate its intermediate host immunity for many years until the primary human infection occurred in Wuhan in 2019 [2]. SARS-CoV-2 and SARS-CoV accessory proteins differ in several respects, such as the putative ORF10 protein missing from SARS-CoV and the absence of the ORF3b and ORF9b proteins in SARS-CoV-2 [9,10]. Very little is known about the functions of the accessory proteins of SARS-CoV-2, although crystal or cryo-EM structures have been solved for some of them. Examples are the cryo-EM structure of the SARS-CoV-2 ORF3a ion channel in lipid nanodiscs (PDB ID: 7KJR) (Kern, 2021), the X-ray crystal structure of the SARS-CoV-2 ORF7a ectodomain (PDB ID: 7CI3) (Zhou, 2021), and the crystal structure of the dimeric form of the SARS-CoV-2 ORF8 accessory protein (PDB ID: 7JTL) (Flower, 2021).
Fig. 1

ClustalW alignment of SARS-CoV-2 and RaTG13 ORF3 proteins shows 98.5% sequence identity.

Fig. 2

ClustalW alignment of SARS-CoV-2 (NCBI GenBank ID BCA87365.1) and RaTG13 (NCBI GenBank ID MN996532.2, translated 5′3′ frame 1) ORF6 proteins shows 100% sequence identity, despite up to 89 years of genetic divergence.

Fig. 3

ClustalW alignment of SARS-CoV-2 (NCBI GenBank ID BCA87366.1) and RaTG13 (NCBI GenBank ID MN996532.2, translated 5′3′ frame 2) ORF7a proteins shows 97.5% sequence identity, despite up to 89 years of genetic divergence.

Fig. 4

ClustalW alignment of SARS-CoV-2 (NCBI GenBank ID BCB15096.1) and RaTG13 (NCBI GenBank ID MN996532.2, translated 5′3′ frame 2) ORF7b proteins shows 97.6% sequence identity, despite up to 89 years of genetic divergence.

Fig. 5

ClustalW alignment of SARS-CoV-2 (NCBI GenBank ID BCA87366.1) and RaTG13 (NCBI GenBank ID MN996532.2, translated 5′3′ frame 2) ORF8 proteins shows 95% sequence identity, despite up to 89 years of genetic divergence.

Fig. 6

ClustalW alignment of SARS-CoV-2 (NCBI GenBank ID BCA87369.1) and RaTG13 (NCBI GenBank ID MN996532.2, translated 5′3′ frame 2) ORF10 proteins shows 97.3% sequence identity, despite up to 89 years of genetic divergence.

The objectives of the present study were to depict the unique variability of all accessory proteins and their possible contributions to virus pathogenicity.
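The identity percentages quoted for the alignments in Figs. 1–6 can be illustrated with a minimal sketch of one common definition of pairwise identity: matching columns divided by all columns in which at least one sequence has a residue. This is an assumption about the definition; ClustalW may report identity differently, and the two 38-residue sequences below are toy strings, not the real ORF10 proteins. With a single mismatch over 38 positions the function returns about 97.4%.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity between two rows of a pairwise alignment.

    Assumed definition: identical columns / columns where at least one
    row has a residue. A residue facing a gap counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    matches = 0
    compared = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap and b == gap:
            continue  # column is empty in both rows, so it is ignored
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


# Toy 38-residue pair differing at one position (hypothetical sequences).
toy_a = "A" * 38
toy_b = "A" * 37 + "G"
print(round(percent_identity(toy_a, toy_b), 2))
```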

Materials and methods

Data acquisition

Sequences for accessory proteins ORF3a, ORF6, ORF7a, ORF7b, ORF8, and ORF10 were downloaded from the complete SARS-CoV-2 proteomes on the National Center for Biotechnology Information (NCBI) database (http://www.ncbi.nlm.nih.gov/) (Table 1).
Table 1

Total number of six accessory proteins of complete SARS-CoV-2 proteomes.

Proteins  Africa  Asia  Europe  North America  Oceania  South America
ORF3a     280     1175  442     12734          4106     122
ORF6      280     1181  441     12732          4106     122
ORF10     280     1174  442     12733          4106     122
ORF7a     280     1179  440     12723          4106     122
ORF7b     280     1138  436     12568          4106     121
ORF8      280     1172  442     12726          4106     122

Note that all partial accessory proteins and sequences with ambiguous amino acids were excluded from the present study.

Furthermore, the unique accessory protein sequences were extracted for each continent. The unique protein accessions were renamed for each accessory protein as S1, S2, and so on, as shown in the Supplementary Tables (S1–S6). There were 510, 72, 158, 37, 190, and 44 unique accessory proteins available for ORF3a, ORF6, ORF7a, ORF7b, ORF8, and ORF10, respectively. For each continent, ranges and names of sequences are presented in Table 2.
Table 2

Ranges and naming of unique sequences (continent-wise) for each accessory protein of SARS-CoV-2.

Continent      ORF3a         ORF6        ORF7a         ORF7b       ORF8          ORF10
Africa         S1 to S7      S1 to S3    S1 to S6      S1 to S2    S1 to S5      S1
Asia           S8 to S85     S4 to S13   S7 to S25     S3 to S9    S6 to S31     S2 to S8
Europe         S86 to S115   S14 to S19  S26           S10 to S11  S32 to S41    S9 to S12
North America  S116 to S442  S20 to S58  S27 to S126   S12 to S30  S42 to S165   S13 to S36
Oceania        S443 to S495  S59 to S69  S127 to S153  S31 to S36  S166 to S186  S37 to S42
South America  S496 to S510  S70 to S72  S154 to S158  S37         S187 to S190  S43 to S44
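The continent-wise extraction and naming summarized in Table 2 can be sketched as follows. This is an illustrative reconstruction, not the authors' script: the function name, the record layout `(continent, accession, sequence)`, and the toy records are all hypothetical. It assumes that uniqueness is decided within each continent and that the S-numbering runs continuously in the continent order of Table 2, which is why the same sequence can receive different names on different continents.

```python
CONTINENTS = ["Africa", "Asia", "Europe", "North America",
              "Oceania", "South America"]


def name_unique_sequences(records):
    """Return [(name, continent, accession, sequence)] for unique sequences.

    records: iterable of (continent, accession, sequence) tuples.
    Sequences with the ambiguous residue 'X' are skipped; within a
    continent only the first accession of each sequence is kept.
    """
    per_continent = {c: {} for c in CONTINENTS}
    for continent, accession, seq in records:
        seq = seq.upper()
        if "X" in seq:
            continue  # ambiguous amino acid: excluded
        # setdefault keeps the first accession seen for this sequence
        per_continent[continent].setdefault(seq, accession)

    named = []
    counter = 0
    for continent in CONTINENTS:  # numbering follows the Table 2 order
        for seq, accession in per_continent[continent].items():
            counter += 1
            named.append((f"S{counter}", continent, accession, seq))
    return named


# Hypothetical toy input (not real accessions or sequences).
toy_records = [
    ("Asia", "acc3", "MKT"),
    ("Africa", "acc1", "MKT"),
    ("Africa", "acc2", "MKT"),   # duplicate within Africa
    ("Africa", "acc4", "MXT"),   # ambiguous residue
    ("Asia", "acc5", "MRT"),
]
named = name_unique_sequences(toy_records)
for row in named:
    print(row)
```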

Evaluation of the per-residue predisposition of SARS-CoV-2 accessory proteins and their natural variants for intrinsic disorder

Per-residue disorder distribution within the amino acid sequences of SARS-CoV-2 accessory proteins ORF3a, ORF6, ORF7a, ORF7b, ORF8, and ORF10 and their natural variants was evaluated by PONDR® VSL2, which is one of the more accurate standalone disorder predictors [11–14]. The per-residue disorder predisposition scores are on a scale from 0.0 to 1.0, where 0.0 indicates fully ordered residues, and 1.0 indicates fully disordered residues. Values above the threshold of 0.5 are considered disordered residues, whereas residues with disorder scores between 0.25 and 0.5 are considered highly flexible, and residues with disorder scores between 0.1 and 0.25 are listed as moderately flexible.
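The score bands described above can be expressed as a small classifier. The sketch below only post-processes a list of per-residue scores; it does not run PONDR® VSL2. The handling of the exact boundary values (0.25 and 0.1 assigned to the upper band, 0.5 assigned to "highly flexible") and the label "ordered" for scores below 0.1 are assumptions, since the text does not specify them.

```python
from collections import Counter

LABELS = ["disordered", "highly flexible", "moderately flexible", "ordered"]


def classify_residue(score: float) -> str:
    """Map one disorder score (0.0 to 1.0) to the bands used in the text."""
    if not 0.0 <= score <= 1.0:
        raise ValueError("disorder scores must lie in [0.0, 1.0]")
    if score > 0.5:
        return "disordered"
    if score >= 0.25:          # boundary handling is an assumption
        return "highly flexible"
    if score >= 0.1:           # boundary handling is an assumption
        return "moderately flexible"
    return "ordered"           # label for scores below 0.1 is an assumption


def disorder_fractions(scores):
    """Fraction of residues falling in each band."""
    if not scores:
        raise ValueError("no scores given")
    counts = Counter(classify_residue(s) for s in scores)
    return {label: counts.get(label, 0) / len(scores) for label in LABELS}


# Hypothetical scores for a four-residue toy peptide.
print(disorder_fractions([0.9, 0.6, 0.3, 0.05]))
```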

Phylogenetic analysis

In a first step, the SARS-CoV-2 amino acid sequences of each ORF were filtered to remove redundant sequences and low-quality sequences (containing unknown amino acids, “X”) using the SeqKit program [15] with the tools fx2tab and rmdup. At this stage, sequences containing one or more “X” characters were removed, as were redundant (100% identical) sequences. Thereafter, the amino acid sequences of each ORF group were aligned in the MegaX program [16], applying the MUSCLE algorithm [17]. For all phylogeny estimations, the Neighbor-joining method was used, and each input alignment was submitted to the phyloXML [18] program with the multiple alignment inference option, a maximum allowed gaps ratio of 0.5, a minimum allowed non-gap sequence length of 50, and the Kimura correction as distance calculator. In a last step, the phylogenetic trees were analyzed and edited using the phyloXML tool [18].
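As an illustration of the distance step, the sketch below computes the proportion of differing sites (p-distance) between two aligned protein sequences and applies Kimura's empirical correction for proteins, d = -ln(1 - p - 0.2 p^2). Whether the tool used in the study applies exactly this form, and how it treats gapped columns, is not stated in the text; skipping columns with a gap in either sequence is an assumption, and the sequences are toy strings.

```python
import math


def p_distance(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Proportion of differing sites, ignoring columns with a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    diffs = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # assumption: gapped columns are skipped
        compared += 1
        if a != b:
            diffs += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return diffs / compared


def kimura_protein_distance(p: float) -> float:
    """Kimura's corrected distance for proteins: -ln(1 - p - 0.2 p^2)."""
    inner = 1.0 - p - 0.2 * p * p
    if inner <= 0.0:
        raise ValueError("p is too large for the Kimura correction")
    return -math.log(inner)


# Toy aligned pair with one difference in ten sites (hypothetical).
p = p_distance("ACDEFGHIKL", "ACDEFGHIKV")
d = kimura_protein_distance(p)
print(p, d)
```

The corrected distance is slightly larger than p because it allows for multiple substitutions at the same site; a matrix of such distances is the usual input to a neighbor-joining tree builder.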

Results and discussion

The essential known features of the six accessory proteins from SARS-CoV-2 are summarized below.
ORF3a protein: ORF3a is the largest SARS-CoV-2 accessory protein (275 amino acids long). It has 72.4% sequence identity with the SARS-CoV ORF3a protein and 98.5% sequence identity with the BatCoV RaTG13 ORF3a protein [19,20] (Fig. 1). ORF3a is involved in virulence, infectivity, ion channel activity, morphogenesis, and virus release [21]. In SARS-CoV, ORF3a is a multifunctional protein co-localized with the E, M, and S proteins, forming a homo-tetrameric complex that acts as a potassium-ion channel on the host cell membrane during viral assembly [5]. In SARS-CoV-2, the function of the ion-channel proteins (viroporins) ORF3a, ORF8a, and E is critical in tissue inflammation caused by CoVs [6]. Viroporin-mediated lysosomal disruption and ion redistribution activate the innate immune signaling receptor NLRP3 (NLR family pyrin domain-containing 3) inflammasome, which leads to the expression of inflammatory cytokines such as interleukin 1β (IL-1β), IL-6, and tumor necrosis factor (TNF), causing tissue inflammation during respiratory illness [6]. In another pathway, ORF3a interacts through its protein-binding domains with the TNF receptor-associated factor 3 (TRAF3) protein, which leads to ASC ubiquitination, caspase 1 activation, and IL-1β maturation [22]. Additionally, ORF3a and ORF7a, combined with the E, S, and NSP1 proteins and MAPK pathway proteins (MAPK8, MAPK14, and MAP3K7), trigger proinflammatory cytokine signaling transcription factors such as STAT1, STAT2, IRF9, and NFKB1 [6]. Moreover, the SARS-CoV-2 ORF3a protein interacts with heme oxygenase-1 (HMOX1), which has a role in heme catabolism and the anti-inflammatory system [6]. ORF3a inhibits cGAS-STING in chickens, mice, and humans in a unique fashion and blocks the nuclear accumulation of p65 to inhibit nuclear factor-κB signaling. Through this more effective innate immune suppression, it may allow more efficient SARS-CoV-2 replication in vivo.
However, ORF3a was ineffective against the pathways associated with the RIG-I-like receptors (RLRs), a family of cytosolic pattern recognition receptors that are essential for detecting viral RNA and initiating the innate immune response, in contrast to the SARS-CoV-2 N protein, which showed strong inhibition of the RLR pathway [23]. The ion channel activity of the SARS-CoV-2 ORF3a, E, and M proteins interferes with apoptotic pathways [19]. In a similar scenario, ORF3a of SARS-CoV increases the mRNA expression levels of all three subunits of fibrinogen, thus promoting fibrosis, one of the serious pathogenic aspects of SARS [24]. The expression of NFκB, IL8, and JNK, all of which are involved in inflammatory responses, is also enhanced. Both SARS-CoV-2 ORF3a and ORF3b have shown the ability to antagonize type-I interferon activation [25]. Interestingly, potent and durable antibody responses against the IFN antagonist SARS-CoV-2 ORF3a, ORF3b, ORF7a, and ORF8 proteins have been detected in children [26], which may explain why children are more resistant to SARS-CoV-2 infections [27]. However, this also raises the question of whether the mutations and truncations associated with those accessory proteins will influence the resistance seen in children. Similar to ORF8, ORF3b is an immunodominant protein that has been shown to induce high levels of antibody production during SARS-CoV-2 infections [28]. Sequence analysis of ORF3b identified a natural variant with a longer ORF3b reading frame in two patients with severe COVID-19, which enhanced interferon suppression and was potentially linked to viral pathogenesis and severity of COVID-19 [29].
ORF6 protein: SARS-CoV-2 ORF6 is a 61 amino acid long membrane-associated interferon (IFN) antagonist protein. ORF6 interacts with the karyopherin import complex, which limits the transcription factor STAT1 and thereby down-regulates the IFN pathway [5]. ORF6 is internalized from the plasma membrane into endosomal vesicles.
The SARS-CoV-2 ORF6 has a 68.9% sequence identity with the SARS-CoV ORF6 protein and a 100% sequence identity with the BatCoV RaTG13 ORF6 protein [5] (Fig. 2). SARS-CoV ORF6 and ORF3a, in association with other proteins such as M, NSP1, and NSP3, inhibit IRF3 signaling, repress interferon expression, and stimulate the degradation of IFNAR1 and STAT1 [6]. The SARS-CoV-2 ORF6 interacts with the NSP8 protein, and it can increase early infection at a low multiplicity with an increase in RNA polymerase activity [30]. It has been reported that ORF6 and ORF8 can inhibit the type-I IFN signaling pathway [30]. The ORF6 protein, with its lysosomal targeting motif (YSEL) and diacidic motif (DDEE), induces intracellular membrane rearrangements, resulting in a vesicular population and endosomal internalization of viral protein into infected cells, which increases replication [31].
ORF7a and ORF7b proteins: ORF7a, a 121 aa type I transmembrane protein, interacts with the SARS-CoV-2 structural proteins M, E, and S, which are essential for viral assembly. Hence, ORF7a is involved in viral replication, and virion-associated ORF7a protein may function during early infection [32–34]. It has an 85.2% sequence identity with the SARS-CoV ORF7a protein and a 97.5% sequence identity with the BatCoV RaTG13 ORF7a protein [5] (Fig. 3).
ORF7a induces pro-inflammatory cytokines and chemokines, such as IL-8 and RANTES [5]. SARS-CoV-2 ORF7a in combination with the E protein activates apoptosis by suppressing anti-apoptotic proteins [6]. ORF7b is a 43 aa protein found in association with intracellular viral particles, and it is also present in purified virions in the Golgi compartment. The SARS-CoV-2 ORF7b has an 85.4% sequence identity with the SARS-CoV ORF7b protein and a 97.6% sequence identity with the BatCoV RaTG13 ORF7b protein [5] (Fig. 4). To date, there is extraordinarily little experimental evidence to support a role for ORF7a or ORF7b in SARS-CoV-2 replication [32].
ORF8 protein: ORF8 is a unique 121 aa long accessory protein in SARS-CoV-2. It stands out by being poorly conserved among other CoVs, showing structural changes that have been suggested to be related to the ability of the virus to spread [35]. The ORF8 sequences of SARS-CoV-2 and RaTG13 share a 95% amino acid identity (Fig. 5). SARS-CoV-2 ORF8 interacts with major histocompatibility complex (MHC) class-I molecules and down-regulates their surface expression on various cell types [36].
It has been reported earlier that inhibition of ORF8 could be a strategy to improve the special immune surveillance and to accelerate the eradication of SARS-CoV-2 in vivo [37].
ORF10 protein: The 38 aa long ORF10 accessory protein has been reported to be unique to SARS-CoV-2 and to contain eleven cytotoxic T lymphocyte (CTL) epitopes, each nine amino acids in length, across various human leukocyte antigen (HLA) subtypes [38,39]. ORF10 negatively affects the antiviral protein degradation process through its interaction with the Cul2 ubiquitin ligase complex [6]. The ORF10 protein is missing in SARS-CoV, but SARS-CoV-2 ORF10 and RaTG13 ORF10 have a 97.3% sequence identity [40] (Fig. 6).
For every continent, the total number of accessory proteins and the total number of unique sequences with their respective percentages are presented in Fig. 7. In summary, for all six continents, the total numbers of unique ORF3a, ORF6, ORF7a, ORF7b, ORF8, and ORF10 accessory protein sequences are 419, 55, 122, 26, 147, and 32, respectively (Supplementary Figure S1). Furthermore, the percentages of unique sequences on each continent among all available accessory proteins are also enumerated (Fig. 7).
Fig. 7

Number of unique accessory proteins across six continents.

The percentages of each accessory protein across the six continents are presented as bar diagrams in Fig. 8. The following observations were drawn from Fig. 8. Across all continents, the decreasing order of percentage of unique variations in the accessory proteins was observed as follows: ORF3a > ORF8 > ORF7a > ORF6 > ORF10 > ORF7b. The highest and lowest unique variations of ORF3a were observed in South America and Oceania, respectively. In addition, the highest percentage (statistically significant) of unique variations in each accessory protein was observed in South America. The lowest percentage of unique variations among ORF3a, ORF6, ORF7b, and ORF8 was observed in Oceania. It is worth noting that the smallest number of unique variations of ORF7b and ORF7a was seen in North America and Europe, respectively. It is further noted that in Europe, the lowest variations among all accessory proteins were found in ORF7a. The smallest percentage of unique ORF10 variations was found in Oceania. With regard to the total unique variations across all accessory proteins of SARS-CoV-2, the decreasing order is South America > Asia > Europe > Africa > North America > Oceania.
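The continental comparison for ORF3a can be retraced from the tabulated data. The sketch below takes the ORF3a totals from Table 1 and the ORF3a naming ranges from Table 2, counts the unique sequences per continent from each range, and divides by the total. That the percentages plotted in Fig. 8 were computed exactly this way is an assumption; under it, South America comes out highest and Oceania lowest for ORF3a, in line with the observation above.

```python
# ORF3a totals per continent (Table 1).
ORF3A_TOTALS = {
    "Africa": 280, "Asia": 1175, "Europe": 442,
    "North America": 12734, "Oceania": 4106, "South America": 122,
}
# ORF3a unique-sequence naming ranges, first and last S-number (Table 2).
ORF3A_RANGES = {
    "Africa": (1, 7), "Asia": (8, 85), "Europe": (86, 115),
    "North America": (116, 442), "Oceania": (443, 495),
    "South America": (496, 510),
}


def percent_unique(totals, ranges):
    """Percent of unique sequences per continent (assumed definition)."""
    result = {}
    for continent, (first, last) in ranges.items():
        n_unique = last - first + 1  # inclusive range S<first>..S<last>
        result[continent] = 100.0 * n_unique / totals[continent]
    return result


pct = percent_unique(ORF3A_TOTALS, ORF3A_RANGES)
ranked = sorted(pct, key=pct.get, reverse=True)
for continent in ranked:
    print(f"{continent}: {pct[continent]:.2f}%")
```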
Fig. 8

Bar representations of percentages of continental variations (A), and the percentage of unique accessory proteins (B).

ORF3a possessed the highest (significant) number of unique variations across all six continents, while ORF10 showed the lowest variations in Africa, Asia, and Oceania. The lowest unique variations of ORF7b were observed in North America and South America. The percentage of unique accessory proteins among all unique sequences obtained across the six continents is represented as bar diagrams in Fig. 9.
Fig. 9

Quantitative information of the accessory proteins.

Among all available unique variations of the six accessory proteins of SARS-CoV-2, North America and South America exhibited the highest and lowest percentage of each accessory protein variation, respectively. The smallest number of unique variations of ORF3a, ORF6, and ORF10 was noticed in Africa. On the other hand, South America showed the lowest number of unique ORF6, ORF7a, ORF7b, and ORF8 sequences. Regarding ORF7b, the highest number of unique variations compared to the rest of the accessory proteins was observed in Africa, Asia, and Oceania. Furthermore, the highest percentage (84.35%) and lowest percentage (0.82%) of unique variations of ORF8 and ORF7a (among all accessory proteins) were found in North America and Europe, respectively. Fig. 10 represents the continent-wise lists of identical sequences for each accessory protein. The following observations were made for each accessory protein based on the data shown (Fig. 10).
Fig. 10

Identical pairs of accessory protein sequences across all continents.

ORF3a: Note that the mutations described below were determined relative to the Wuhan ORF3a sequence (YP_009724391). There were only two ORF3a sequences (marked in red), S2 (Africa, QOI60359) and S5 (Africa, QOI60335), which were present on all six continents. Note that S2 (Africa-ORF3a) was identical to ORF3a (YP_009724391) from Wuhan, China. The other sequence, S5, differs from ORF3a (YP_009724391) by one missense mutation, Q57H, a strain-determining mutation [41]. The ORF3a sequence S54 (Asia: QKK14624) possesses the single T175I mutation and is present on all continents except Africa. The ORF3a sequences S62 (Asia: QMJ01306) and S63 (Asia: QJQ04482) possessed a single mutation each, G251V and G196V, respectively, compared to the Wuhan ORF3a (YP_009724391). These two sequences were present in Asia, Europe, North America, Oceania, and South America. The ORF3a sequence S4 (Africa: QLQ87565) has the single S171L mutation and was found on four continents, excluding Europe and Oceania. Two mutations, Q57H and D155Y, in sequence S34 (Asia) were present on only three continents: Asia, Europe, and North America. Sequence S53 (Asia) with the G172C mutation was found in Asia, Europe, and North America. A deletion of V255 occurred in S59 (Asia), which was found in Asia, Oceania, and South America. S68 (Asia) and S69 (Asia) possessed the single mutations H93Y and K67N, respectively. These two ORF3a variants have been detected on only three continents: Asia, North America, and Oceania. The ORF3a sequence S103, containing the single T229I mutation, is present on only three continents: Europe, North America, and Oceania. Another sequence, S104, with the P240L mutation has been detected only in Europe, North America, and South America. The V13L mutation was found in sequence S122 (ORF3a, North America) and is present on three continents: Oceania, North America, and South America.
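The mutation labels used here (for example Q57H: reference residue, 1-based position, variant residue) can be made concrete with a short sketch. The helper names and the ten-residue reference string below are hypothetical and serve only to show the notation; they are not the Wuhan ORF3a sequence. Only substitutions between equal-length sequences are handled, so deletions such as the V255 deletion would require an alignment step that is not shown.

```python
import re

_LABEL = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_substitution(label: str):
    """Split a label such as 'Q57H' (or 'K67 N') into (ref, pos, alt)."""
    match = _LABEL.match(label.replace(" ", "").upper())
    if match is None:
        raise ValueError(f"not a simple substitution label: {label!r}")
    ref, pos, alt = match.groups()
    return ref, int(pos), alt


def apply_substitution(reference: str, label: str) -> str:
    """Return the reference with one substitution applied (1-based position)."""
    ref, pos, alt = parse_substitution(label)
    if not 1 <= pos <= len(reference):
        raise ValueError("position outside the reference sequence")
    if reference[pos - 1] != ref:
        raise ValueError(f"reference has {reference[pos - 1]} at {pos}, not {ref}")
    return reference[:pos - 1] + alt + reference[pos:]


def call_substitutions(reference: str, variant: str):
    """List substitution labels between two equal-length sequences."""
    if len(reference) != len(variant):
        raise ValueError("sequences differ in length; align them first")
    return [f"{r}{i}{v}"
            for i, (r, v) in enumerate(zip(reference, variant), start=1)
            if r != v]


toy_reference = "MKTAYIAKQR"            # hypothetical 10-residue reference
toy_variant = apply_substitution(toy_reference, "Q9H")
print(toy_variant, call_substitutions(toy_reference, toy_variant))
```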
Further, there were 57 unique ORF3a variants detected on only two continents, as listed in Table 3.
Table 3

List of ORF3a sequences and their distribution over only two continents.

Sequence  Mutation(s)       Present in the continent(s)
S7        D2G               Asia and North America
S8        L15F, Q57H        Asia and North America
S9        T32I              Asia and Oceania
S12       S40L, Q57H        Asia and North America
S13       L41F              Asia and North America
S17       V48F              Asia and Europe
S23       Q57H, W131C       Asia and North America
S25       Q57H, S166L       Asia and North America
S26       Q57H, S171L       Asia and North America
S27       Q57H, T175I       Asia and North America
S28       Q57H, S216P       Asia and Europe
S37       Q57H, A103S       Asia and North America
S46       L108F             Asia and North America
S48       W131C             Asia and North America
S49       L140F             Asia and North America
S50       W149L             Asia and North America
S51       T151I             Asia and North America
S58       DEL(V255), N257D  Asia and North America
S65       G172V             Asia and North America
S66       D155Y             Asia and North America
S67       A99V              Asia and North America
S70       K66N              Asia and North America
S71       A54S, Q57H        Asia and North America
S72       A54S              Asia and North America
S74       G49V              Asia and North America
S77       I35T, Q57H        Asia and North America
S79       D22Y              Asia and North America
S82       G18V, Q57H        Asia and North America
S83       G18V              Asia and North America
S84       K16N, Q57H        Asia and North America
S89       V55F              Europe and North America
S92       Q57H, V237F       Europe and North America
S94       Q57H, D155Y       Europe and North America
S95       Q57H, A99V        Europe and North America
S100      G172C             Europe and North America
S113      A39S              Europe and North America
S115      A33S, Q57H        Europe and North America
S137      S26L              North America and Oceania
S155      L46F              North America and Oceania
S163      L53F              North America and Oceania
S167      V55G              North America and Oceania
S186      Q57H, L101F       North America and Oceania
S199      Q57H, L140F       North America and Oceania
S289      G100C             North America and Oceania
S295      V112F             North America and Oceania
S312      L147F             North America and South America
S319      S166L             North America and Oceania
S321      S171L             North America and South America
S325      S177I             North America and Oceania
S334      T223I             North America and Oceania
S338      T229I             North America and Oceania
S341      P240L             North America and South America
S378      A110S             North America and South America
S385      H93Y              North America and Oceania
S388      H78Y              North America and Oceania
S390      K67N              North America and Oceania
S444      V13L              Oceania and South America
Fig. 11 represents a phylogenetic tree for SARS-CoV-2 ORF3a proteins. This ORF3a tree was built from the alignment of 419 sequences, and the resulting phylogeny shows no well-defined patterns for the grouping of sequences; it possibly reflects random mutation events rather than evolutionary relationships. These results suggest that ORF3a does not seem to represent a target for selection pressure and, therefore, phylogenetic analysis of this protein does not provide noticeable grounds for inferring evolutionary and/or lineage relationships between the strains.
Fig. 11

SARS-CoV-2 ORF3a amino acid phylogeny after group clustering.

SARS-CoV-2 ORF3a amino acid phylogeny after group clustering. ORF6: Note that the mutations described below were determined based on the Wuhan ORF6 sequence (YP 009724394). The sequence S2 (ORF6, Africa) was identical with YP 009724394 (China, Wuhan) ORF6, and this sequence was present on all six continents, whereas the ORF6 sequence, S10 (ORF6, Asia) with only the D53Y mutation, was found only in Asia, North America, and Oceania. The ORF6 sequences S38 (ORF6, North America) and S50 (ORF6, North America) possess a single mutation each, D2L and I33T, respectively, found on three continents, North America, Oceania, and South America. The ORF6 unique variant S7 (ORF6, Asia) possesses the E13D mutation found only in Asia and North America. The ORF6 sequence S12 (ORF6, Asia) possess a set of deletions,” FKVSIWNLD” (22–30 aa), and it appeared in Asia and North America only. The sequence S17 (ORF6, Europe) had the D61Y mutation, and it was found in Europe and North America. In addition, a single mutation H3Y occurred in S19 (ORF6, Europe), which was present in Europe and North America. The ORF6 sequence S27 (ORF6, North America) containing the W27L mutation was found in North America and Oceania only. Furthermore, the sequence S36 (ORF6, North America) with the D61H mutation was present in North America and Oceania only. Fig. 12 represents a phylogenetic tree for the ORF6 protein. This tree was constructed by the alignment of 55 sequences, and it was possible to identify four very distinct groups. On the other hand, most sequences did not present a clear grouping.
Fig. 12

SARS-CoV-2 ORF6 amino acid phylogeny after group clustering. Phylogenetic analysis identified four well-defined groups.

ORF7a: Mutations are reported relative to the Wuhan ORF7a sequence (YP_009724395). The Wuhan ORF7a sequence YP_009724395 was found on all continents. V104F was found in S2 (ORF7a, Africa) in Africa, Asia, North America, and Oceania. The sequence S1 (ORF7a, Africa) had the P39L mutation, which was found in Africa, North America, and South America. S37F was found in the sequence S7 (ORF7a, Asia) in Asia, North America, and Oceania. The sequence S18 (ORF7a, Asia) has the A105V mutation, found across Asia, North America, and Oceania. G38V was found in S24 (ORF7a, Asia) in Asia, North America, and Oceania. Also, there were 21 unique ORF7a variants present on only two continents. All mutations are listed in Table 4.
Table 4

List of ORF7a sequences and their distribution over only two continents.

Sequence  Mutation(s)  Present in the continent(s)
S10       V71I         Asia and North America
S12       Q94H         Asia and Oceania
S14       L116F        Asia and North America
S15       T120I        Asia and North America
S21       C67Y         Asia and North America
S25       A13T         Asia and North America
S34       T28I         North America and Oceania
S35       V29L         North America and South America
S41       T39I         North America and Oceania
S47       Q76H         North America and Oceania
S48       R79C         North America and Oceania
S49       S81L         North America and Oceania
S52       S83L         North America and Oceania
S54       V93F         North America and Oceania
S57       L96F         North America and Oceania
S61       P99L         North America and South America
S81       E95Q         North America and Oceania
S90       H73Y         North America and Oceania
S107      H47Y         North America and Oceania
S113      P34S         North America and Oceania
S124      A8V          North America and South America
The phylogenetic analysis of the 122 amino acid sequences of ORF7a revealed the presence of two clear groups, with the first group containing most of the sequences. On the other hand, four non-grouped sequences were found as well (Fig. 13).
Fig. 13

SARS-CoV-2 ORF7a amino acid phylogeny after group clustering. Two well-defined groups can be identified.

ORF7b: Here, all mutations are reported relative to the Wuhan ORF7b sequence (YP_009725318). The sequence S2 (ORF7b, Africa), which is identical to the Wuhan ORF7b sequence, was found on all six continents. The sequence S8 (ORF7b, Asia) carried only the C41F mutation and appeared in Asia, North America, and Oceania. The sequence S1 (ORF7b, Africa) had the single mutation S5L and was present in Africa and Asia. The sequence S5 (ORF7b, Asia) had the mutation S31L and was found on only two continents, Asia and North America. L32F occurred in the sequence S10 (ORF7b, Europe), present in Europe and North America. Furthermore, the sequence S13 had the mutation L4F and was found in North America and Oceania. For the ORF7b proteins, phylogenetic analysis was performed using 26 amino acid sequences. The corresponding phylogenetic tree has three well-defined groups, and an evolutionary proximity relationship between the sequences can be verified (Fig. 14).
Fig. 14

SARS-CoV-2 ORF7b amino acid phylogeny after group clustering. Analysis identified three well-defined groups.

ORF8: The mutations described below are reported relative to the Wuhan ORF8 sequence (YP_009724396), which was found on every continent. Another sequence present on every continent carried the single mutation L84S. The single mutation V62L was observed in the sequence S2 (ORF8, Africa), which was found on all continents except South America, whereas the ORF8 sequence S38 (Europe) possessed the single mutation A65S and was found in North America, Oceania, and South America. Further, the V62L and L84S mutations were observed together in S12 (ORF8, Asia), present in Asia, North America, and Oceania. The sequence S15 (ORF8, Asia) contained the mutation S67F and was found in Asia, North America, and Oceania. The ORF8 sequence S24 (Asia) possessed the single mutation A65V and was found in Asia, North America, and Oceania. In the ORF8 phylogenetic analysis, we used 147 amino acid sequences. Fig. 15 shows the presence of three well-defined groups. On the other hand, many sequences were not grouped and did not present well-defined branches.
Fig. 15

Phylogenetic analysis of SARS-CoV-2 ORF8 protein identified three well-defined groups.

ORF10: Mutations are reported relative to the Wuhan ORF10 sequence (YP_009725255). The Wuhan ORF10 sequence was identical to S1 (ORF10, Africa) and was found on every continent. The ORF10 sequence S6 (ORF10, Asia) had the mutation L37F and was present in North America and Oceania only. The V30L mutation was found only in the ORF10 sequence S10 (Europe), which appeared in Europe, North America, and Oceania. The sequence S9 (ORF10, Europe) had the mutation S23F and was found in Europe and North America. The mutation D31Y appeared in the sequence S12 (ORF10, Europe), which was found in Europe and North America only. The ORF10 phylogenetic analysis included 32 sequences and showed four groups, the first with eight sequences, the second with 16, and the last two with four sequences each (Fig. 16).
Fig. 16

SARS-CoV-2 ORF10 amino acid phylogenetic analysis identified four well-defined groups.

Concluding this section, one needs to keep in mind that the phylogeny results are only suggestive. They can be used to find new possibilities in the search for other genes associated with vaccine and/or drug development, which typically works best with well-defined strain clades (see Table 5).
Table 5

List of ORF8 sequences and their distribution over only two continents.

| Sequence | Mutation(s) | Present in the continent(s) |
|---|---|---|
| S1 | V33F | Africa and North America |
| S7 | T11I | Asia and North America |
| S8 | T12N | Asia and North America |
| S9 | V32L | Asia and North America |
| S14 | G66C | Asia and North America |
| S16 | P93L | Asia and North America |
| S17 | L95F | Asia and North America |
| S25 | D63N | Asia and North America |
| S26 | A51V | Asia and North America |
| S29 | D34G | Asia and North America |
| S39 | A55V | Europe and North America |
| S40 | P38S | Europe and North America |
| S50 | T11K | North America and Oceania |
| S54 | S21N | North America and Oceania |
| S59 | S24L, DEL(DS)66–67, K68E | North America and Oceania |
| S62 | S24L | North America and Oceania |
| S68 | Q27K | North America and Oceania |
| S108 | V114 | North America and Oceania |
| S130 | A65V | North America and Oceania |
| S147 | P36S | North America and Oceania |
| S156 | G8R | North America and Oceania |
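
Substitution labels such as L84S or A55V encode the reference residue, its 1-based position, and the replacement residue. The sketch below shows one way to parse such a label and apply it to a reference sequence while checking that the reference residue matches. It is an illustration only: the function names and the ten-residue `toy_reference` are hypothetical (not a real ORF sequence), and deletions such as DEL(DS)66–67 or incomplete labels such as V114 are deliberately rejected rather than guessed.

```python
import re

# One-letter reference residue, 1-based position, one-letter replacement.
SUBSTITUTION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_substitution(label):
    """Split a label like 'L84S' into (reference, position, replacement)."""
    match = SUBSTITUTION.match(label.replace(" ", ""))  # tolerate stray spaces
    if match is None:
        raise ValueError(f"not a simple substitution: {label!r}")
    return match.group(1), int(match.group(2)), match.group(3)


def apply_substitution(reference, label):
    """Return the variant sequence, verifying the reference residue first."""
    ref, pos, alt = parse_substitution(label)
    if not 1 <= pos <= len(reference):
        raise ValueError(f"position {pos} is outside the sequence")
    if reference[pos - 1] != ref:
        raise ValueError(f"expected {ref} at position {pos}, found {reference[pos - 1]}")
    return reference[:pos - 1] + alt + reference[pos:]


toy_reference = "MKTAYIAKQR"  # hypothetical sequence, for illustration only
print(parse_substitution("L84S"))
print(apply_substitution(toy_reference, "A4S"))
```

Checking the reference residue before substituting guards against labels that were defined relative to a different reference sequence.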

Featuring uniqueness of the accessory proteins

Here, certain basic descriptive statistics (mean, variance, lower bound, upper bound, and range) were employed to describe the variability of the percentage of the predicted intrinsically disordered residues (PPIDRs), molecular weight (MW), and isoelectric point (pI) of all the unique variants of all accessory proteins (Table 6 ). The zigzag behavior of the plots of PPIDRs, MW, and pI depicts wide variability of variants for each accessory protein (Supplementary Figures S2–S41).
Table 6

Descriptive statistics of PPIDR, MW, and pI of unique accessory proteins of SARS-CoV-2.

PPIDR of unique accessory proteins of SARS-CoV-2, based on PONDR® VSL2

| Accessory proteins | Mean | Variance | Lower bound | Upper bound | Range |
|---|---|---|---|---|---|
| ORF3a | 4.756 | 0.2328 | 2.91 | 7.64 | 4.73 |
| ORF6 | 25.74 | 74.69 | 21.31 | 87.5 | 66.19 |
| ORF7a | 3.51 | 0.5716 | 2.48 | 7.29 | 4.81 |
| ORF7b | 44.663 | 10.527 | 37.21 | 51.16 | 13.95 |
| ORF8 | 9.125 | 1.285 | 5.6 | 13.45 | 7.85 |
| ORF10 | 18.67 | 5.0691 | 13.16 | 23.68 | 10.52 |

MW of unique accessory proteins of SARS-CoV-2

| Accessory proteins | Mean | Variance | Lower bound | Upper bound | Range |
|---|---|---|---|---|---|
| ORF3a | 31123 | 17917.58 | 29187 | 31270 | 2083 |
| ORF6 | 7171.03 | 371714.6 | 2881.205 | 7542.84 | 4661.635 |
| ORF7a | 13673.41 | 50719.4 | 10874.515 | 14328.65 | 3454.135 |
| ORF7b | 5173.02 | 2651.26 | 5033.005 | 5224.22 | 191.215 |
| ORF8 | 13841.4 | 21411.43 | 12608.465 | 14431.55 | 1823.085 |
| ORF10 | 4446.53 | 1173.801 | 4389.085 | 4509.285 | 120.2 |

pI of unique accessory proteins of SARS-CoV-2

| Accessory proteins | Mean | Variance | Lower bound | Upper bound | Range |
|---|---|---|---|---|---|
| ORF3a | 5.9127 | 0.0278 | 5.2349 | 6.5881 | 1.3532 |
| ORF6 | 4.4013 | 0.057 | 3.8436 | 5.7589 | 1.9153 |
| ORF7a | 8.0932 | 0.0434 | 6.7486 | 8.5946 | 1.846 |
| ORF7b | 3.9519 | 0.0063 | 3.6379 | 4.1442 | 0.5063 |
| ORF8 | 5.6368 | 0.1223 | 4.7442 | 6.8829 | 2.1387 |
| ORF10 | 8.2415 | 0.6857 | 6.0601 | 9.2043 | 3.1442 |
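
The five statistics reported in Table 6 (mean, variance, lower bound, upper bound, and range) can be computed for any list of per-variant values as sketched below. This is an illustration under stated assumptions: the text does not say whether sample or population variance was used, so the sample variance is an arbitrary choice here, and the input values are made up rather than taken from the study.

```python
from statistics import mean, variance


def describe(values):
    """Return the five descriptive statistics used in Table 6.

    Assumption: 'variance' is the sample variance (n - 1 denominator);
    the source does not state which estimator was used.
    """
    lower, upper = min(values), max(values)
    return {
        "mean": mean(values),
        "variance": variance(values),
        "lower": lower,
        "upper": upper,
        "range": upper - lower,
    }


# Made-up PPIDR-like values for three hypothetical variants.
stats = describe([2.0, 4.0, 6.0])
print(stats)
```

Because the range is simply the upper bound minus the lower bound, a single truncated or otherwise atypical variant can dominate it, which is worth keeping in mind when comparing ranges across proteins.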
The following observations were made based on the data shown in Table 6. The total dispersion (based on range) of PPIDR and MW was highest for the ORF6 variants, whereas the highest total dispersion of pI was observed for ORF10. The smallest total dispersions of PPIDR, MW, and pI were found for ORF3a, ORF10, and ORF7b, respectively. The broad range and variance of the MW values of the unique ORF3a, ORF7a, ORF8, and ORF10 variants imply wide variability within each of these sets, although the range and variance of their PPIDR and pI were not widely spread. For the unique variants of ORF6, the range and variance of both MW and PPIDR were large, which implies wide quantitative differences among these variants. Furthermore, the moderately broad range and variance of PPIDR and MW of the ORF7a variants imply their moderate variability. In line with the previously reported data, Fig. 17, Fig. 18, and Table 6 show that all SARS-CoV-2 accessory proteins contain different levels of intrinsic disorder. In fact, based on their overall disorder predispositions, these proteins can be arranged as follows: ORF8 < ORF3a < ORF7a < ORF10 < ORF6 < ORF7b, where the difference in the overall intrinsic disorder predisposition between these proteins can be as high as 6- to 7-fold (compare data for ORF8 and ORF7b in Fig. 17). Furthermore, the disorder predispositions of these proteins are sensitive to the mutations found in their natural variants. For example, Fig. 17 represents the effect of mutations in the natural variants on the overall disorder predisposition of accessory proteins and shows that the whole-protein disorder-related parameters, PPIDR and mean disorder score (MDS), can be dramatically changed by mutations. 
The largest variability of mutation-induced change in intrinsic disorder propensity is observed for ORF10 and ORF6 (see Fig. 17).
Fig. 17

Effect of mutations observed in unique natural variants of the SARS-CoV-2 accessory proteins on their overall intrinsic disorder predisposition evaluated in terms of percent of predicted intrinsically disordered residues (PPIDR) and mean disorder score (MDS). These data were generated using the PONDR® FIT [42] algorithm, which is a meta-predictor that combines the outputs of six predictors of intrinsic disorder: PONDR® VLXT [43], PONDR® VSL2 [44,45], PONDR® VL3 [46], FoldIndex [47], IUPred [48], and TopIDP [49]. PONDR® FIT is moderately more accurate than each of its component predictors [42]. For each mutant, PPIDR and MDS were calculated based on the outputs of this per-residue disorder predictor. Here, PPIDR in a query protein represents the percentage of residues with disorder scores exceeding 0.5. In this study, protein residues and regions were classified as disordered if their predicted disorder scores were above 0.5, and as flexible if their scores ranged between 0.15 and 0.5.
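
The two whole-protein parameters defined in this caption follow directly from a per-residue score profile. The sketch below is a minimal illustration, not the PONDR® software: it takes a list of scores in [0, 1] and returns PPIDR (percentage of residues scoring above 0.5), MDS (mean of all scores), and the percentage of flexible residues (0.15 to 0.5). The function name, the inclusive handling of the flexible interval's endpoints, and the example scores are assumptions.

```python
def summarize_disorder(scores, disorder_cutoff=0.5, flexible_cutoff=0.15):
    """Summarize a per-residue disorder profile.

    ppidr        : percentage of residues with score > disorder_cutoff
    mds          : mean disorder score over all residues
    flexible_pct : percentage of residues with
                   flexible_cutoff <= score <= disorder_cutoff
                   (inclusive endpoints are an assumption)
    """
    if not scores:
        raise ValueError("empty score profile")
    n = len(scores)
    disordered = sum(1 for s in scores if s > disorder_cutoff)
    flexible = sum(1 for s in scores if flexible_cutoff <= s <= disorder_cutoff)
    return {
        "ppidr": 100.0 * disordered / n,
        "mds": sum(scores) / n,
        "flexible_pct": 100.0 * flexible / n,
    }


# Made-up four-residue profile, for illustration only.
summary = summarize_disorder([0.1, 0.2, 0.6, 0.9])
print(summary)
```

Because PPIDR counts residues that cross a threshold while MDS averages all scores, a mutation that nudges several residues across 0.5 can shift PPIDR substantially while MDS changes only slightly.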

Fig. 18

Per-residue intrinsic disorder profiles generated for the SARS-CoV-2 accessory proteins and their natural variants by PONDR® VSL2, which systematically shows good performance in various comparative analyses, including the recently conducted Critical Assessment of protein Intrinsic Disorder prediction (CAID) experiment, where this tool was recognized as the #3 predictor of 43 evaluated methods [31].

Next, we looked at the effect of natural variants on local intrinsic disorder predisposition. The results of this analysis are shown in Fig. 18, which represents the per-residue disorder profiles generated by PONDR® VSL2 for all the proteins analyzed in this study. Fig. 18 generally supports the observation that intrinsic disorder predispositions could vary significantly between the natural variants of each individual accessory protein. 
Importantly, the largest mutation-induced variability is observed within the disordered or flexible regions of these proteins (i.e., regions with predicted disorder scores exceeding the 0.5 threshold and regions with disorder scores between 0.15 and 0.5). This is an important observation suggesting that the natural variability of SARS-CoV-2 accessory proteins is shaping their structural flexibility.

SARS-CoV-2 is the first HCoV with pandemic capacity, owing to its highly contagious nature, which derives from the structural differences in its S protein, such as a flat sialic acid-binding domain, tight binding to its entry receptor ACE2, and the capacity to be cleaved by furin protease [50]. Based on more than 355 million confirmed cases of COVID-19, plus a large number of asymptomatic cases, SARS-CoV-2 is a highly contagious but relatively weak pathogen, considering the ratio of patients with severe infections associated with multiple organ dysfunction to the total number of infected [6] and the relatively low mortality rate (∼2.2%). The host immunity modulated by the SARS-CoV-2 accessory proteins could be responsible for at least some of these pathological features. Based on the various mutations of accessory proteins, SARS-CoV-2 has had very little selective pressure to tackle host immunity in nature after diverging from BatCoV RaTG13 19–89 years ago [2]. The genomic stability of the relatively large RNA genome (around 30,000 nucleotides) of SARS-CoV-2, as in other CoVs, is protected by proofreading proteins, such as the 3′-5′ exonuclease non-structural protein 14 (NSP14), which assists RNA synthesis with a unique RNA proofreading function [51]. Muller's ratchet explains the extinction risk posed by high mutation rates in asexual organisms such as viruses, which can lead to the irreversible accumulation of deleterious mutations [52]. 
Therefore, SARS-CoV-2 repairs its mutations to preserve its genomic stability, as mutations can lead to pathological fitness losses or viral extinction [52]. However, there is a balance between genomic repair mechanisms such as NSP14 and the need of viruses for a certain degree of mutation to gain novel traits, such as transmission into zoonotic hosts [52]. For instance, a 29-nucleotide deletion in the SARS-CoV ORF8 gene was associated with a less pathogenic strain [52]. Similarly, COVID-19 patients infected with SARS-CoV-2 variants carrying a 382-nucleotide deletion in ORF8 showed only mild symptoms and did not require supplemental oxygen [52].

For each of the accessory proteins ORF6, ORF7a, ORF7b, and ORF10, only one variant, identical to the Wuhan sequence (NC_045512), was present on all continents. Most of the ORF3a variants with the prevalent non-synonymous amino acid substitutions (V13L, T14I, L46F, A54S, Q57H, S58N, K75N, A99V, L108F, R126S, G172V, G196V, F207L, T223I, G251I, G252V, N257S, and Y264C) possess a single point mutation [53,54]. Ten of these mutation sites occur within the transmembrane (TM) domain of ORF3a. Four of these variants contain the mutation Q57H paired with another amino acid change (A99V, S58N, Y264C, or G172V). Only two variants of ORF3a, differing by the clade/strain-determining single mutation Q57H, were found on all six continents [41], and V13L, Q57H+A99V, G196V, and G252V were the most frequent mutations [54]. When Q57H and G251V (ORF3a) are combined with S19L and R203K/G204R in the nucleocapsid, these four mutations cause a dramatic change in viral protein structures [55]. In addition to being predominant in North America [53,56], some ORF3a variants were found on all six continents. This can be associated with virus evasion of the immune system, leading to the induction of cytokine, chemokine, and interferon-stimulated gene expression in primary human respiratory cells [25,57]. 
These dominating mutational effects are not limited to the modulation of the efficiency of viral pathogenesis, disease severity, and patient outcomes due to aggravation of the host immunity [21,53,58]. They may also play a role in viral ion-channel formation, viral particle loads, and virus release [21]. The precise roles of natural and/or variants of various SARS-CoV-2 ORFs in the outcome of COVID-19 patients are rather controversial [59] and need a more in-depth analysis.

In ORF8, too, only two unique variants, differing by the strain-determining single mutation L84S, appeared on all continents. Thus, the family of variants common to all continents (the maximal intersection) turned out to be very small for every accessory protein. These findings confirmed that all other variants of accessory proteins were due to demographic and environmental constraints. Most of the unique variants of accessory proteins differed from the corresponding Wuhan accessory proteins by a single mutation, although basic descriptive statistics revealed their wide variability. New variants of each accessory protein have been found recently and will continue to be discovered in the future. The large number of unique variants of each accessory protein, with their wide variability, might contribute significantly to the pathogenicity of SARS-CoV-2. Therefore, it is our firm conviction that a naturally weakened SARS-CoV-2 (if achievable at all) remains a distant goal, so the dangers of the present pandemic scenario still need to be addressed. Also, unique accessory protein variants from individual continents are expected to mix once international travel resumes without strict protective measures and restrictions. 
In this regard, we, the Self-Assembled COVID-19 Research & Education Directive (SACRED), consisting of international experts in mathematics, physics, computer science, bioinformatics, nanotechnology, structural biology, molecular biology, immunology, and virology, strongly recommend that governmental and non-governmental organizations take the necessary measures to mitigate the spread of COVID-19.

Future perspective

In comparison to either SARS or MERS alone or combined, COVID-19 has caused more illness and death. CoVs can similarly trigger spreads and outbreaks in the coming years with different waves of variants as part of increased globalization. Broad spectrum genomics experiments should be used for the identification of possible genetic factors involved in COVID-19 development. Although costly and complicated, more genomics studies are required to assess the effect of host genomics and genetics on immune responses to CoV. Furthermore, understanding the progression and geographical location of SARS-CoV-2 viral genomics and genetics in the context of frequency and quantity of emerging viral variants and their association with viral infectivity, transmissibility, and clinical manifestation are issues to be addressed in future research and development programs.

Declaration of competing interest

The authors do not have any conflicts of interest to declare.
  58 in total

1.  TOP-IDP-scale: a new amino acid scale measuring propensity for intrinsic disorder.

Authors:  Andrew Campen; Ryan M Williams; Celeste J Brown; Jingwei Meng; Vladimir N Uversky; A Keith Dunker
Journal:  Protein Pept Lett       Date:  2008       Impact factor: 1.890

2.  Length-dependent prediction of protein intrinsic disorder.

Authors:  Kang Peng; Predrag Radivojac; Slobodan Vucetic; A Keith Dunker; Zoran Obradovic
Journal:  BMC Bioinformatics       Date:  2006-04-17       Impact factor: 3.169

3.  Genome-Wide Identification and Characterization of Point Mutations in the SARS-CoV-2 Genome.

Authors:  Jun-Sub Kim; Jun-Hyeong Jang; Jeong-Min Kim; Yoon-Seok Chung; Cheon-Kwon Yoo; Myung-Guk Han
Journal:  Osong Public Health Res Perspect       Date:  2020-06

4.  Questions concerning the proximal origin of SARS-CoV-2.

Authors:  Murat Seyran; Damiano Pizzol; Parise Adadi; Tarek M A El-Aziz; Sk Sarif Hassan; Antonio Soares; Ramesh Kandimalla; Kenneth Lundstrom; Murtaza Tambuwala; Alaa A A Aljabali; Amos Lal; Gajendra K Azad; Pabitra P Choudhury; Vladimir N Uversky; Samendra P Sherchan; Bruce D Uhal; Nima Rezaei; Adam M Brufsky
Journal:  J Med Virol       Date:  2020-09-03       Impact factor: 2.327

5.  Unique and complementary suppression of cGAS-STING and RNA sensing- triggered innate immune responses by SARS-CoV-2 proteins.

Authors:  Yajuan Rui; Jiaming Su; Si Shen; Ying Hu; Dingbo Huang; Wenwen Zheng; Meng Lou; Yifei Shi; Meng Wang; Shiqi Chen; Na Zhao; Qi Dong; Yong Cai; Rongzhen Xu; Shu Zheng; Xiao-Fang Yu
Journal:  Signal Transduct Target Ther       Date:  2021-03-15

6.  A unique view of SARS-CoV-2 through the lens of ORF8 protein.

Authors:  Sk Sarif Hassan; Alaa A A Aljabali; Pritam Kumar Panda; Shinjini Ghosh; Diksha Attrish; Pabitra Pal Choudhury; Murat Seyran; Damiano Pizzol; Parise Adadi; Tarek Mohamed Abd El-Aziz; Antonio Soares; Ramesh Kandimalla; Kenneth Lundstrom; Amos Lal; Gajendra Kumar Azad; Vladimir N Uversky; Samendra P Sherchan; Wagner Baetas-da-Cruz; Bruce D Uhal; Nima Rezaei; Gaurav Chauhan; Debmalya Barh; Elrashdy M Redwan; Guy W Dayhoff; Nicolas G Bazan; Ángel Serrano-Aroca; Amr El-Demerdash; Yogendra K Mishra; Giorgio Palu; Kazuo Takayama; Adam M Brufsky; Murtaza M Tambuwala
Journal:  Comput Biol Med       Date:  2021-04-15       Impact factor: 6.698

7.  SARS-Cov-2 ORF3a: Mutability and function.

Authors:  Martina Bianchi; Alessandra Borsetti; Massimo Ciccozzi; Stefano Pascarella
Journal:  Int J Biol Macromol       Date:  2021-01-08       Impact factor: 6.953

8.  Molecular conservation and differential mutation on ORF3a gene in Indian SARS-CoV2 genomes.

Authors:  Sk Sarif Hassan; Pabitra Pal Choudhury; Pallab Basu; Siddhartha Sankar Jana
Journal:  Genomics       Date:  2020-06-12       Impact factor: 5.736

9.  SARS-CoV-2 and ORF3a: Nonsynonymous Mutations, Functional Domains, and Viral Pathogenesis.

Authors:  Elio Issa; Georgi Merhi; Balig Panossian; Tamara Salloum; Sima Tokajian
Journal:  mSystems       Date:  2020-05-05       Impact factor: 6.496

10.  Loss of orf3b in the circulating SARS-CoV-2 strains.

Authors:  Joy-Yan Lam; Chun-Kit Yuen; Jonathan Daniel Ip; Wan-Man Wong; Kelvin Kai-Wang To; Kwok-Yung Yuen; Kin-Hang Kok
Journal:  Emerg Microbes Infect       Date:  2020-12       Impact factor: 7.163

