
Notable sequence homology of the ORF10 protein introspects the architecture of SARS-CoV-2.

Sk Sarif Hassan, Diksha Attrish, Shinjini Ghosh, Pabitra Pal Choudhury, Vladimir N Uversky, Alaa A A Aljabali, Kenneth Lundstrom, Bruce D Uhal, Nima Rezaei, Murat Seyran, Damiano Pizzol, Parise Adadi, Antonio Soares, Tarek Mohamed Abd El-Aziz, Ramesh Kandimalla, Murtaza M Tambuwala, Gajendra Kumar Azad, Samendra P Sherchan, Wagner Baetas-da-Cruz, Amos Lal, Giorgio Palù, Kazuo Takayama, Ángel Serrano-Aroca, Debmalya Barh, Adam M Brufsky.

Abstract

The current Coronavirus Disease 2019 (COVID-19) pandemic, caused by Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2), shows similar pathology to MERS and SARS-CoV, with a current estimated fatality rate of 1.4%. Open reading frame 10 (ORF10) is a unique SARS-CoV-2 accessory protein, which contains eleven cytotoxic T lymphocyte (CTL) epitopes, each nine amino acids in length. Twenty-two unique SARS-CoV-2 ORF10 variants have been identified based on missense mutations found in sequence databases. Some of these mutations are predicted to decrease the stability of ORF10. In silico physicochemical and structural comparative analyses were carried out on SARS-CoV-2 and Pangolin-CoV ORF10 proteins, which share 97.37% amino acid (aa) homology. Despite the high degree of similarity between the ORF10 proteins of SARS-CoV-2 and Pangolin-CoV, the two proteins differ in their sub-structure (loop/coil region), solubility, and antigenicity, and show a shift from strand to coil at aa position 26 (tyrosine). SARS-CoV-2 ORF10, which is apparently expressed in vivo since reactive T cell clones are found in convalescent patients, should be monitored for changes that could correlate with the pathogenesis of COVID-19.
Copyright © 2021 Elsevier B.V. All rights reserved.

Keywords:  COVID-19; Intrinsic disorder; Mutations; ORF10; Pangolin-CoV-2020; SARS-CoV-2

Year:  2021        PMID: 33862077      PMCID: PMC8051021          DOI: 10.1016/j.ijbiomac.2021.03.199

Source DB:  PubMed          Journal:  Int J Biol Macromol        ISSN: 0141-8130            Impact factor:   6.953


Introduction

The Coronavirus Disease 2019 (COVID-19) pandemic has affected the whole world, with more than 131 million people infected and 2.85 million fatalities worldwide as of April 5, 2021 [[1], [2], [3], [4], [5]]. The high fatality rates of SARS-CoV (9.7%) and Middle East Respiratory Syndrome Coronavirus (MERS-CoV; 37%), compared with 1.4% for SARS-CoV-2, make it vital to monitor mutations within proteins such as ORF10 that could influence viral pathogenicity [[6], [7], [8]]. SARS-CoV-2 is a positive-sense, single-stranded RNA virus with four structural, sixteen non-structural, and six accessory proteins [9]. ORF10 is the smallest accessory protein (38 aa) in SARS-CoV-2, and its detection can reportedly identify the infection faster than PCR techniques [10]. ORF10, located at the 3′ end of the genome, is hypothesized to be a transposon, although it is distinct from larger transposons [10,11]. ORF10 contains a molecular recognition feature (MoRF) region from amino acid residue 3 to 7, a protein interaction site that enables the intrinsically disordered protein to adopt a set of conformations when binding different proteins [12,13]. High-throughput analysis revealed that ORF10, despite its small size, could interact with many host proteins, such as multiple members of the Cullin-ubiquitin-ligase complex, which is essential for viral pathogenesis [12,[14], [15], [16], [17]]. Humans may not utilize any memory B and T cells elicited against other microorganisms to target ORF10 and fight SARS-CoV-2 [18]. No sequence homology was found with any protein in the NCBI protein database. SARS-CoV-2 ORF10 was reported to have a 99.15% nucleotide similarity to Pangolin-CoV-2020 [19,20]. Here we explore mutations described for SARS-CoV-2 ORF10 variants which, through their effects on physicochemical and immunological properties, may have an impact on pathogenesis. The SARS-CoV-2 and Pangolin-CoV ORF10 proteins are also compared.

Data and methods

Data acquisition

A total of 11,288 complete genomes of SARS-CoV-2 were retrieved from the National Center for Biotechnology Information (NCBI) database. There were 34 unique ORF10 accessory protein sequences. Of these, 22 sequences each carry a single missense mutation, while the remaining sequences contain ambiguous mutations. A nonsense mutation at position 29 resulted in a truncated ORF10 sequence (QJR96431.1) (Table 1).
Table 1

Twenty-four distinct ORF10 protein IDs correlated with geolocation.

Accession | Geo_location | Collection_date
YP_009725255 | China | 2019-12
QLJ57416 | USA: WA | 2020
QIS29991 | China: Hubei, Wuhan | 2020-01-10
QJR96431 | USA: CA | 2020-03-13
QKU54102 | USA: Washington, King County | 2020-03-15
QLA48060 | USA: NY | 2020-03-24
QNG41574 | USA: Minnesota | 2020-03-25
QKV08176 | USA: Washington, King County | 2020-03-26
QKV37245 | Australia: Northern Territory | 2020-03-27
QNI23218 | USA: Virginia | 2020-04
QLG99793 | USA: CA | 2020-04-16
QLY88596 | USA: GA | 2020-04-27
QNC04532 | USA | 2020-04-29
QNI25281 | USA: Virginia | 2020-05
QLI33453 | USA | 2020-05-12
QNC49349 | Pakistan | 2020-05-15
QMT94417 | USA: Washington, Yakima County | 2020-05-27
QMT54534 | USA: Washington, Yakima County | 2020-06-17
QLG76514 | Australia: Victoria | 2020-06-20
QNG42985 | USA: FL | 2020-06-23
QMT97141 | USA: FL | 2020-06-30
QNB17780 | Bangladesh | 2020-07-07
QMU93213 | USA: Wisconsin, Dane county | 2020-07-13
QNA70543 | Bangladesh | 2020-07-19
The SARS-CoV-2 genome (NC_045512) reference ORF10 (YP_009725255.1) was used to identify mutations [21]. The ORF10 variants of SARS-CoV-2 were compared by sequence-based homology (Fig. 1A) and phylogeny (Fig. 1B).
Fig. 1

(A): Multiple sequence alignment (MSA) of 24 SARS-CoV-2 ORF10 proteins; (B): phylogeny of 24 SARS-CoV-2 ORF10 sequences.

Each SARS-CoV-2 ORF10 variant carries a single amino acid change. The mutated positions (18 in total) range widely, from position 2 to 38, across the 22 SARS-CoV-2 ORF10 variants.
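The count of 18 mutated positions among 22 variants can be checked directly from the mutation labels listed in Table 2. The following is a minimal sketch, not the authors' pipeline; the helper name `parse_mutation` is hypothetical and introduced only for illustration.

```python
import re

# Mutation labels transcribed from Table 2 (one per ORF10 variant).
MUTATIONS = [
    "G2D", "V6I", "Y14C", "R20I", "R20K", "S23F", "R24C", "R24L",
    "Y26H", "V30A", "V30L", "L37F", "T38I", "L37P", "F35S", "D31Y",
    "A28V", "N22T", "I13M", "P10S", "A8V", "I4L",
]


def parse_mutation(label):
    """Split a label such as 'R20K' into (reference aa, position, variant aa)."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", label)
    if match is None:
        raise ValueError(f"unrecognized mutation label: {label}")
    ref, pos, alt = match.groups()
    return ref, int(pos), alt


# Collapse the labels to their distinct sequence positions.
positions = sorted({parse_mutation(m)[1] for m in MUTATIONS})

print(len(MUTATIONS), "variants at", len(positions), "distinct positions")
print("range:", positions[0], "to", positions[-1])
```

Positions 20, 24, 30, and 37 each appear twice in the list, which is why 22 variants collapse to 18 positions.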

Methods

Webserver based predictions

Various properties of the ORF10 proteins were predicted using several webservers. The PROVEAN webserver was used to estimate the functional effect of known mutations, and the I-MUTANT webserver was used to predict the effect of these mutations on structural stability [[22], [23], [24]]. The QUARK webserver was used to predict the secondary structure of ORF10 proteins [[25], [26], [27]]. The ABTMpro webserver predicts whether a protein is a transmembrane protein from its sequence, and further predicts alpha-helices and beta sheets. In addition, the INNOVAGEN webserver was used for peptide property predictions [28]. The DIpro webserver predicts whether a protein sequence contains disulfide bonds, based on 2D recurrent neural network, support vector machine, graph matching and regression algorithms [29]. Protein antigenicity was predicted using the webserver ANTIGENpro, whose two-stage architecture produces a prediction likelihood based on several primary sequence representations and five machine learning algorithms [30]. The DisEMBL server predicts intrinsic disorder from a single protein sequence [31]. Epitopes of a given aa sequence were identified and analyzed for binding affinity across 12 HLA (human leukocyte antigen) subtypes (HLA-A*01:01, HLA-A*02:01, HLA-A*03:01, HLA-A*24:02, HLA-A*26:01, HLA-B*07:02, HLA-B*08:01, HLA-B*27:05, HLA-B*39:01, HLA-B*40:01, HLA-B*58:01 and HLA-B*15:01). The Immune Epitope Database (IEDB) score was predicted using the IEDB immunogenicity tool [32,33].

Evaluating the per-residue predisposition of various ORF10 proteins for intrinsic disorder

Per-residue disorder distribution within ORF10 protein sequences was evaluated by PONDR-VSL2 [34], which is an accurate standalone disorder predictor [[35], [36], [37]]. Per-residue disorder predisposition scores range from 0 to 1, where 0 indicates fully ordered residues and 1 indicates fully disordered residues. Residues with disorder scores between 0.25 and 0.5 were considered highly flexible, and residues with disorder scores between 0.1 and 0.25 were considered moderately flexible. Residues with values higher than 0.5 were considered disordered.
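The score bands described above can be expressed as a small classifier. This is an illustrative sketch only: the function name `classify_disorder` is hypothetical, the handling of scores exactly on a boundary is an assumption, and the label "ordered" for scores below 0.1 is also an assumption, since the text does not name that band. The example scores are made up and are not PONDR-VSL2 output.

```python
def classify_disorder(score):
    """Map a per-residue disorder score in [0, 1] to the bands used in the text."""
    if not 0.0 <= score <= 1.0:
        raise ValueError(f"disorder score out of range: {score}")
    if score > 0.5:
        return "disordered"
    if score >= 0.25:
        return "highly flexible"
    if score >= 0.1:
        return "moderately flexible"
    # Assumption: the text does not label scores below 0.1.
    return "ordered"


# Hypothetical scores for a few residues, for illustration only.
example_scores = [0.05, 0.18, 0.31, 0.62]
for residue_index, score in enumerate(example_scores, start=1):
    print(residue_index, score, classify_disorder(score))
```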

Results

SARS-CoV-2 ORF10 mutations

Each ORF10 sequence was aligned using the protein BLAST (blastp) and Omega alignment suites of the NCBI, and missense mutations were identified (Fig. 2A) [38,39]. Conserved and non-conserved residues in ORF10 proteins are identified and marked in different colors (Fig. 2B). The MoRF (YINVF) was also predicted for the ORF10 Wuhan sequence using the MoRFchibi server [40].
Fig. 2

(A): Mutations and their aa positions in ORF10 proteins; (B): conserved, mutated residues and molecular recognition features of ORF10 (YP_009725255) of SARS-CoV-2.

There are 22 unique mutations in 22 SARS-CoV-2 ORF10 variants. These missense mutations are found throughout the ORF10 sequence, from aa position 2 to 38. Arginine (R), valine (V), and leucine (L) are each substituted by more than one aa at fixed positions (marked magenta in Fig. 2B). The largest conserved region across all 24 ORF10 variants is ‘SLLLC’ at positions 15–19. Each unique variant (Table 1) of SARS-CoV-2 ORF10 possesses a single missense mutation (Table 2).
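The reported positions of the MoRF (YINVF, residues 3–7) and of the conserved 'SLLLC' stretch (residues 15–19) can be located with a plain substring search. The sketch below assumes the 38-aa reference ORF10 sequence shown in `ORF10_REF`; this string is not given in the text and is an assumption that should be verified against accession YP_009725255.1, although it agrees with every reference residue listed in Table 2. The helper `motif_span` is hypothetical.

```python
# Assumed reference ORF10 sequence (38 aa); verify against YP_009725255.1.
ORF10_REF = "MGYINVFAFPFTIYSLLLCRMNSRNYIAQVDVVNFNLT"


def motif_span(sequence, motif):
    """Return the 1-based (start, end) positions of the first motif occurrence."""
    start = sequence.find(motif)
    if start == -1:
        raise ValueError(f"motif {motif!r} not found")
    return start + 1, start + len(motif)


print("length:", len(ORF10_REF))
print("MoRF YINVF:", motif_span(ORF10_REF, "YINVF"))
print("conserved SLLLC:", motif_span(ORF10_REF, "SLLLC"))
```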
Table 2

Twenty-two ORF10 proteins, their corresponding mutations, and predicted effects with chemical property changes.

Accession ID | Mutation | Type of mutation | PROVEAN score (a) | Effect of mutation on structure | RI (b) | Polarity change | Charge change
QNI23218.1 | G2D | Deleterious | −7 | Decrease | 7 | NP to P | Neutral to acidic
QIS29991.1 | V6I | Neutral | −1 | Decrease | 7 | NP to NP | Neutral to neutral
QLI33453.1 | Y14C | Deleterious | −9 | Decrease | 2 | P to P | Neutral to neutral
QNC04532.1 | R20I | Deleterious | −8 | Decrease | 3 | NP to NP | Basic (strongly) to neutral
QLA48060.1 | R20K | Deleterious | −3 | Decrease | 8 | NP to P | Basic (strongly) to basic
QMT97141.1 | S23F | Deleterious | −6 | Increase | 2 | P to NP | Neutral to neutral
QMU93213.1 | R24C | Deleterious | −8 | Decrease | 7 | P to P | Basic (strongly) to neutral
QMT54534.1 | R24L | Deleterious | −7 | Decrease | 9 | P to NP | Basic (strongly) to neutral
QKU54102.1 | Y26H | Deleterious | −5 | Decrease | 8 | P to P | Neutral to basic (weakly)
QNI25281.1 | V30A | Deleterious | −4 | Decrease | 9 | NP to NP | Neutral to neutral
QNC49349.1 | V30L | Deleterious | −3 | Decrease | 4 | NP to NP | Neutral to neutral
QNA70543.1 | L37F | Deleterious | −4 | Decrease | 7 | NP to NP | Neutral to neutral
QKV37245.1 | T38I | Deleterious | −6 | Decrease | 5 | P to NP | Neutral to neutral
QKV08176.1 | L37P | Deleterious | −7 | Decrease | 8 | NP to NP | Neutral to neutral
QNB17780.1 | F35S | Deleterious | −8 | Decrease | 9 | NP to P | Neutral to neutral
QMT94417.1 | D31Y | Deleterious | −9 | Decrease | 6 | P to P | Acidic to neutral
QLY88596.1 | A28V | Deleterious | −4 | Decrease | 5 | NP to NP | Neutral to neutral
QLG76514.1 | N22T | Deleterious | −6 | Decrease | 1 | P to P | Neutral to neutral
QLG99793.1 | I13M | Deleterious | −3 | Decrease | 8 | NP to NP | Neutral to neutral
QNG42985.1 | P10S | Deleterious | −8 | Decrease | 8 | NP to P | Neutral to neutral
QLJ57416.1 | A8V | Deleterious | −4 | Increase | 3 | NP to NP | Neutral to neutral
QNG41574.1 | I4L | Neutral | −2 | Increase | 1 | NP to NP | Neutral to neutral

(a) PROVEAN score: If the PROVEAN score is equal to or below a predefined threshold (e.g., −2.5), the protein variant is predicted to have a “deleterious” effect. If the PROVEAN score is above the threshold, the variant is predicted to have a “neutral” effect.

(b) RI: Reliability Index ranges from 0 to 9.
Most of the mutations were predicted to be deleterious and to decrease protein stability, which may indicate amplified virulence of SARS-CoV-2 (Table 2).
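The deleterious/neutral calls in Table 2 follow from the PROVEAN threshold rule stated in the table footnote. The sketch below applies that rule to the scores transcribed from Table 2; it does not run PROVEAN itself, and the function name `provean_call` is hypothetical.

```python
from collections import Counter

PROVEAN_CUTOFF = -2.5  # example threshold quoted in the Table 2 footnote

# PROVEAN scores transcribed from Table 2, keyed by mutation.
PROVEAN_SCORES = {
    "G2D": -7, "V6I": -1, "Y14C": -9, "R20I": -8, "R20K": -3, "S23F": -6,
    "R24C": -8, "R24L": -7, "Y26H": -5, "V30A": -4, "V30L": -3, "L37F": -4,
    "T38I": -6, "L37P": -7, "F35S": -8, "D31Y": -9, "A28V": -4, "N22T": -6,
    "I13M": -3, "P10S": -8, "A8V": -4, "I4L": -2,
}


def provean_call(score, cutoff=PROVEAN_CUTOFF):
    """Scores at or below the cutoff are 'deleterious'; others are 'neutral'."""
    return "deleterious" if score <= cutoff else "neutral"


calls = {mutation: provean_call(s) for mutation, s in PROVEAN_SCORES.items()}
summary = Counter(calls.values())
neutral = sorted(m for m, call in calls.items() if call == "neutral")

print(dict(summary))
print("neutral:", neutral)
```

With the −2.5 cutoff, only V6I (−1) and I4L (−2) fall above the threshold, matching the two "Neutral" rows of Table 2.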

Sequence homology and mutations of SARS-CoV-2 ORF10

SARS-CoV-2 ORF10 does not show homology with other proteins in the NCBI database, including the Bat CoV ORF10 [10]. Surprisingly, SARS-CoV-2 ORF10 showed 97.37% homology to Pangolin-CoV ORF10 (QIG55954.1; release date: 2020-05-18; collection date: 2019-03-29; geo-location: China; host: Sunda pangolin (Manis javanica)) (Fig. 3) [20].
Fig. 3

Alignment of the ORF10 sequences of SARS-CoV-2 and Pangolin-CoV (37 out of 38 identical residues).

The only difference between the two ORF10 sequences is at position 25, which is serine (S) in Pangolin-CoV and asparagine (N) in SARS-CoV-2; according to the PROVEAN score (−3), this substitution is deleterious. Consequently, the protein structural stability is predicted to decrease. Analysis of the per-residue intrinsic disorder predispositions of the ORF10 proteins of SARS-CoV-2 and Pangolin-CoV provides evidence of their differences. The findings indicate that while the SARS-CoV-2 and Pangolin-CoV ORF10 proteins have very close disorder profiles, the per-residue disorder tendency of SARS-CoV ORF10 differs significantly, especially within its C-terminal half (Fig. 4A).
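The 97.37% figure corresponds to 37 identical residues out of 38. A minimal sketch of that calculation follows. The SARS-CoV-2 sequence in `SARS2_ORF10` is an assumption (not given in the text; verify against YP_009725255.1), and the Pangolin-CoV sequence is derived from it using only the statement that the two differ by S versus N at position 25. The function names are hypothetical.

```python
# Assumed SARS-CoV-2 reference ORF10 (38 aa); verify against YP_009725255.1.
SARS2_ORF10 = "MGYINVFAFPFTIYSLLLCRMNSRNYIAQVDVVNFNLT"

# Derived from the text: Pangolin-CoV ORF10 carries S instead of N at position 25.
PANGOLIN_ORF10 = SARS2_ORF10[:24] + "S" + SARS2_ORF10[25:]


def differences(seq_a, seq_b):
    """List (1-based position, residue in a, residue in b) for mismatches."""
    if len(seq_a) != len(seq_b):
        raise ValueError("ungapped comparison requires equal lengths")
    return [(i, a, b) for i, (a, b) in enumerate(zip(seq_a, seq_b), start=1) if a != b]


def percent_identity(seq_a, seq_b):
    """Percentage of identical positions in an ungapped pairwise comparison."""
    mismatches = len(differences(seq_a, seq_b))
    return 100.0 * (len(seq_a) - mismatches) / len(seq_a)


print(differences(SARS2_ORF10, PANGOLIN_ORF10))
print(f"{percent_identity(SARS2_ORF10, PANGOLIN_ORF10):.2f}%")
```

Because both sequences have the same length and no gaps, a position-by-position comparison is sufficient here; a real alignment tool would be needed for sequences of different lengths.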
Fig. 4

(A) Comparison of the intrinsic disorder profile of the reference ORF10 protein from SARS-CoV-2 (YP_009725255) from the NC_045512 SARS-CoV-2 genome (China, Wuhan) (black curve) with ORF10 proteins from the Pangolin-CoV (QIG55954.1) and SARS-CoV TW-HP1 (UniProt ID: Q6SRY8).

(B) Intrinsic disorder predisposition of SARS-CoV-2 ORF10 single variants relative to the SARS-CoV-2 (YP_009725255) ORF10 protein of the NC_045512 SARS-CoV-2 genome (China, Wuhan) (black curve). The analysis is conducted using the PONDR-VSL2 algorithm [34], one of the more accurate standalone disorder predictors [[35], [36], [37]]. A thin line (score = 0.5) is the threshold separating order from disorder. Residues with predicted disorder scores ≥0.5 are considered disordered, residues with disorder scores between 0.25 and 0.5 are flexible, whereas disorder scores below 0.25 correspond to ordered residues.

Fig. 4B compares intrinsic disorder predispositions of the 24 unique variants of ORF10 protein from different SARS-CoV-2 isolates. Intrinsic disorder predispositions can vary significantly, especially within the C-terminal half of the protein. In fact, the majority of substitutions found within the N-terminal region (residues 1–15; i.e., mutations G2D, I4L, V6I, A8V, P10S, I13M, and Y14C) have very little effect on the local intrinsic disorder predisposition of ORF10. On the other hand, ORF10 variants with mutations within the C-terminal region (residues 20–38; i.e., mutations R20I/K, N22T, S23F, R24C/L, Y26H, A28V, V30A/L, D31Y, F35S, L37P/F, and T38I, as well as the shortened QJR96431.1 variant, which is truncated due to a nonsense mutation at position 29) typically show rather substantial variability in their local disorder predispositions.
The most significant changes are observed within the “disorder hump” region (residues 20–30), the intensity of which is increased in the QKU54102.1 (Y26H), QNI25281.1 (V30A), and QNB17780.1 (F35S) ORF10 variants, whereas in the variants QMT54534.1 (R24L), QNC04532.1 (R20I), QMU93213.1 (R24C), and QMT97141.1 (S23F), this hump is either eliminated or noticeably flattened. Interestingly, comparison of Fig. 4A and B shows that the variability in the disorder predisposition between many variants of the ORF10 protein from various SARS-CoV-2 isolates is noticeably greater than that between the reference ORF10 from SARS-CoV-2 and ORF10 from Pangolin-CoV. On the other hand, none of the SARS-CoV-2 ORF10 variants (with the exception of the truncated QJR96431.1 variant) has a C-terminal half as disordered as that of the ORF10 protein from SARS-CoV.

Comparison of SARS-CoV-2 ORF10 and Pangolin-CoV ORF10

Given that SARS-CoV-2 and Pangolin-CoV ORF10 have the highest sequence homology, we aimed to detect similarities and differences between the two. Therefore, we performed a multi-dimensional analysis of both ORF10 proteins from structural, physicochemical, biophysical, and immunological aspects to understand the origin of SARS-CoV-2 from the ORF10 perspective. Neither the SARS-CoV-2 nor the Pangolin-CoV ORF10 sequence was predicted to contain disulfide bonds (Fig. 5A). However, there are several differences. According to the ABTMpro server, and consistent with its largely hydrophobic amino acid composition, SARS-CoV-2 ORF10 was predicted to be an alpha-helical transmembrane protein (probability 0.489), while Pangolin-CoV ORF10 was predicted to be a non-transmembrane protein (probability 0.513). The predicted probability of antigenicity of SARS-CoV-2 ORF10 was slightly higher than that of Pangolin-CoV ORF10. Both proteins were predicted to be located in the capsid region of the virus, as both show a positive distance, which is greater for Pangolin-CoV (0.1502) than for SARS-CoV-2 (0.1141).
Fig. 5

(A): Basic properties of ORF10 proteins of SARS-CoV-2 and Pangolin-CoV; (B): peptide and solvent accessibility properties of ORF10 proteins of SARS-CoV-2 and Pangolin-CoV.

To gain detailed insights into the ORF10 proteins of SARS-CoV-2 and Pangolin-CoV, we characterized their secondary structures (Fig. 5B) and found them to be almost the same, except for a notable difference at tyrosine (Y) 26 of SARS-CoV-2 ORF10. In SARS-CoV-2 ORF10, 23 of the residues are buried, and the solubility is significantly greater at 24 residues in SARS-CoV-2 ORF10 compared to Pangolin-CoV. A subsequent in-depth study of the physicochemical properties of the two ORF10 proteins revealed high similarity in extinction coefficient, isoelectric point, and net charge, based on the structural and basic property analyses (Fig. 6A). However, the molecular weight of SARS-CoV-2 ORF10 was higher than that of Pangolin-CoV ORF10 (4422 g/mol), because serine (S; lower molecular weight) in Pangolin-CoV is replaced by asparagine (N; higher molecular weight) in SARS-CoV-2. The enzyme cleavage sites of the SARS-CoV-2 and Pangolin-CoV ORF10 proteins were also indistinguishable for all proteases (Fig. 6B).
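The size of the molecular weight difference caused by a single S-to-N substitution can be estimated from standard average residue masses. This is a back-of-the-envelope sketch: the two residue masses are standard textbook values and are not taken from the paper, and the resulting SARS-CoV-2 figure is an estimate derived from the 4422 g/mol value quoted above, not a reported measurement.

```python
# Standard average residue masses in Da (amino acid minus water); not from the paper.
RESIDUE_MASS = {
    "S": 87.0782,   # serine
    "N": 114.1038,  # asparagine
}

PANGOLIN_ORF10_MW = 4422.0  # g/mol, as quoted in the text

# A single S -> N substitution changes the mass by the residue mass difference.
delta = RESIDUE_MASS["N"] - RESIDUE_MASS["S"]
estimated_sars2_mw = PANGOLIN_ORF10_MW + delta

print(f"mass difference: {delta:.2f} Da")
print(f"estimated SARS-CoV-2 ORF10 MW: {estimated_sars2_mw:.0f} g/mol")
```

The difference of about 27 Da is under 1% of the total mass, which is consistent with the otherwise near-identical physicochemical profiles of the two proteins.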
Fig. 6

(A): Physicochemical properties and hydropathy of ORF10 of SARS-CoV-2 and Pangolin-CoV; (B): enzymes and numbers of associated cleavages and their positions.

Protein intrinsic disorder analysis disclosed the presence of hotloops in both sequences within the same span of amino acids (26–38). However, the presence of loops/coils (22–29) was a distinct characteristic of SARS-CoV-2 ORF10 and no such structures were observed for Pangolin-CoV ORF10 (Fig. 7).
Fig. 7

Disordered loops and hotloops of ORF10 of SARS-CoV-2 and Pangolin-CoV.

To demonstrate the immunogenic properties of ORF10 and of the associated epitope mutations, we identified 11 cytotoxic T-lymphocyte (CTL) epitopes, each nine amino acids long, in the SARS-CoV-2 ORF10 sequence across all 12 HLA subtypes (Fig. 8). The scores of the mutated epitopes were compared with those of the original epitopes to predict whether the binding affinity for class I MHC molecules increases or declines due to the mutations. All eleven epitopes and their mutated counterparts were also analyzed using the IEDB tool to assess their immunogenicity.
Fig. 8

In 12 HLA subtypes, 11 distinct epitopes were identified in the SARS-CoV-2 ORF10 and analyzed for binding affinity using PickPocket. The IEDB value was estimated using the IEDB tool. The eleven epitopes of the Wuhan SARS-CoV-2 ORF10 sequence are marked in orange. Red/blue scores indicate an increase/decline of the score for nine epitopes. Green values indicate that the immunogenicity attribute remains unchanged.
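Since each epitope is nine residues long, a single substitution can alter every 9-mer window that covers the mutated position. The sketch below enumerates those windows; it does not predict binding or reproduce the 11 epitopes reported here, which were selected by the prediction tools. The sequence in `ORF10_REF` is an assumption (verify against YP_009725255.1), and the function names are hypothetical.

```python
# Assumed reference ORF10 sequence (38 aa); verify against YP_009725255.1.
ORF10_REF = "MGYINVFAFPFTIYSLLLCRMNSRNYIAQVDVVNFNLT"


def all_kmers(sequence, k=9):
    """Return (1-based start, peptide) for every overlapping k-mer."""
    return [(i + 1, sequence[i:i + k]) for i in range(len(sequence) - k + 1)]


def windows_covering(seq_len, position, k=9):
    """1-based start positions of the k-mers that contain a given residue."""
    if not 1 <= position <= seq_len:
        raise ValueError(f"position {position} outside 1..{seq_len}")
    first = max(1, position - k + 1)
    last = min(position, seq_len - k + 1)
    return list(range(first, last + 1))


kmers = all_kmers(ORF10_REF)
print("candidate 9-mers:", len(kmers))
for pos in (2, 26, 38):
    starts = windows_covering(len(ORF10_REF), pos)
    print(f"position {pos}: {len(starts)} window(s), starts {starts[0]}..{starts[-1]}")
```

Mutations near the termini (for example G2D or T38I) fall in only one or two windows, whereas a central mutation such as Y26H is covered by nine.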

Discussions

A detailed study of the ORF10 protein was carried out to evaluate its potential to give rise to variants that could alter viral pathogenicity. It was observed that each SARS-CoV-2 ORF10 variant possesses one distinct mutation, and that the twenty-two mutations are spread across 18 positions. None of these mutations in the SARS-CoV-2 ORF10, however, contributes to the determination of clades of SARS-CoV-2. Of all variants, 13 were identified as possessing mutations at amino acid positions 22–38, a region predicted to contain overlapping loop/coil and hot-loop regions of the ORF10 protein. All of these mutations were predicted to be deleterious and to decrease protein structural stability, except S23F, which was predicted to increase stability, suggesting that these mutations play an active role in enhancing intrinsic propensity disorder (IPD) and allowing the protein to undergo more favorable interactions with other proteins. Two other mutations, I4L and V6I, were found in the MoRF region of ORF10 and may also contribute to the IPD. The mutations at positions 20 and 24 were also significant due to their sensitivity to trypsin activity. Four ORF10 variants (QNC04532.1, QMT54534.1, QMU93213.1 and QLA48060.1) possess four mutations at these two positions. Among them, the three variants harboring the mutations R20I, R24L and R24C gain trypsin resistance at the mutated site, while the fourth variant (QLA48060.1), with the R20K mutation, remains susceptible to protease degradation. An amino acid homology of 97.37% was observed between SARS-CoV-2 ORF10 and Pangolin-CoV ORF10. Although most physicochemical and peptide properties are similar, the probability of antigenicity is greater for SARS-CoV-2 ORF10 than for Pangolin-CoV ORF10, and consequently a stronger immune response is predicted for SARS-CoV-2 ORF10.
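The trypsin argument above can be illustrated with the commonly used simplified cleavage rule: trypsin cuts on the C-terminal side of lysine (K) or arginine (R), except when the next residue is proline. The sketch below applies that rule to the reference sequence and to the four variants; it is an assumption-laden illustration rather than the cleavage-site tool used in the study, the sequence in `ORF10_REF` is assumed (verify against YP_009725255.1), and the function names are hypothetical.

```python
# Assumed reference ORF10 sequence (38 aa); verify against YP_009725255.1.
ORF10_REF = "MGYINVFAFPFTIYSLLLCRMNSRNYIAQVDVVNFNLT"


def apply_mutation(sequence, label):
    """Apply a point mutation such as 'R20K' after checking the reference residue."""
    ref, pos, alt = label[0], int(label[1:-1]), label[-1]
    if sequence[pos - 1] != ref:
        raise ValueError(f"{label}: expected {ref} at position {pos}")
    return sequence[:pos - 1] + alt + sequence[pos:]


def trypsin_sites(sequence):
    """1-based positions of K/R residues cleaved under the simplified rule."""
    return [
        i + 1
        for i, aa in enumerate(sequence[:-1])  # no cleavage after the last residue
        if aa in "KR" and sequence[i + 1] != "P"
    ]


print("reference:", trypsin_sites(ORF10_REF))
for label in ("R20I", "R24L", "R24C", "R20K"):
    print(label, trypsin_sites(apply_mutation(ORF10_REF, label)))
```

Under this rule, R20I, R24L, and R24C each remove one of the two predicted sites, while R20K keeps both because lysine is also a trypsin target.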
A change from strand (Pangolin-CoV ORF10) to coil (SARS-CoV-2 ORF10) at position 26 (tyrosine (Y)) is predicted, indicating a more disordered state of the protein. A sequence with the Y26H mutation was also detected in SARS-CoV-2 ORF10, in which a hydrophobic amino acid is replaced by a hydrophilic amino acid, thus increasing the probability of ionic interactions. The analysis identified ORF10 mutations predicted to alter binding affinity to the respective HLA alleles and possibly to change the immunogenicity of SARS-CoV-2 ORF10 correspondingly. Eight ORF10 variants (each containing one of the mutations G2D, I4L, I13M, Y14C, Y26H, F35S, L37F and L37P (Table 2)) accounted for 40% of total mutations and demonstrated decreased affinity for MHC class I, 25% of the variants (carrying mutations R20K, R20I, R24C, R24L and D31Y) are predicted to have increased affinity, and 35% of the variants (carrying mutations V6I, A8V, P10S, S23F, A28V and V30A) contain both high and low binding affinity epitopes. This may indicate that mutations in ORF10 predominantly decrease the affinity of epitopes to escape the host immune system, while in the mixed cases the effect of increased affinity is nullified by the presence of mutations contributing to decreased affinity. For mutations showing only increased binding affinity epitopes, it is hypothesized that acquiring more than one mutation in a single sequence in the future will nullify them as well. In addition, the immunogenicity score prediction revealed that a large number of mutations decreased or had no effect on the score and very few of them increased it, which may be a strategy adopted by SARS-CoV-2 to evade the host immune response.
Six mutation-bearing sequences (QLJ57416.1, QMT97141.1, QLY88596.1, QNC49349.1, QMT54534.1, and QLG76514.1) were found to contain epitopes showing both high-affinity binding to MHC class I and high immunogenicity, indicating that these epitopes can mount a significant immune response and might serve as potential targets for vaccine candidates. More critical studies of SARS-CoV-2 ORF10 are necessary to monitor high-frequency mutations that could change viral pathogenesis. The ORF10 proteins of SARS-CoV-2 and Pangolin-CoV are similar. However, notable differences are predicted between these two ORF10 proteins in terms of loop/coil structure, antigenicity, solubility, and the mutational diversification of SARS-CoV-2. These significant differences in physicochemical, structural, and immunological properties, despite an amino acid homology of 97.37% between the ORF10 proteins of SARS-CoV-2 and Pangolin-CoV, are quite surprising and deserve further study. A question exists as to the expression of ORF10 both in vivo and in virally infected cell lines. In a small case series of two subjects, a SARS-CoV-2 strain with a truncation mutation of ORF10 was associated with mild disease, and in vitro this strain replicated with the same efficiency as strains with non-truncated ORF10 [41]. It should be noted that both individuals infected with this strain had mild disease, and the VeroE6 cells used for viral culture lack native interferon (IFN) production. Evasion of IFN production as well as IFN signaling appears to be important in the pathogenesis of SARS-CoV-2, and interference with IFN induction or IFN sensitivity by ORF10 cannot be ruled out by these experiments [42]. In this vein, ORF10 expression is found in immune cells of subjects infected with SARS-CoV-2, and expression levels of ORF10 are associated with disease severity [43]. Finally, T cells of acute and convalescent subjects with SARS-CoV-2 infection react to ORF10 in vitro [44].
Taken together, these data suggest that ORF10 is indeed expressed during infection and may be involved in disease severity. The mutations described in our study should be analyzed in various in vivo and in vitro models of COVID-19 infection and disease severity, as such analysis appears crucial to our understanding of disease pathogenesis.

CRediT authorship contribution statement

SSH conceived the problem and experiment(s). DA, SG, SSH, and VNU examined the mutations. SSH, PPC, DA, SG and VNU analyzed the results. SSH wrote the primary draft of the article. AAAA and KL made major edits to reach the final form. All authors critically reviewed, edited, and approved the final manuscript.

Declaration of competing interest

The authors do not have any conflicts of interest to declare.