Literature DB >> 33619452

Rare mutations in the accessory proteins ORF6, ORF7b, and ORF10 of the SARS-CoV-2 genomes.

Sk Sarif Hassan1, Pabitra Pal Choudhury2, Bidyut Roy3.   

Abstract

A total number of 3080 SARS-CoV-2 genomes from all continents are considered from the NCBI database. Every accessory protein ORF6, ORF7b, and ORF10 of SARS-CoV-2 possess a single missense mutation in less than 1.5% of the 3080 genomes. It has now been observed that different non-synonymous mutations occurred in these three accessory proteins. Most of these rare mutations are changing the amino acids such as hydrophilic to hydrophobic, acidic or basic to hydrophobic, and vice versa etc. So these highly conserved proteins might play an essential role in virus pathogenicity. This study opens a question whether it carries some messages about the virus rapid replications, and virulence.
© 2021 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  ORF10; ORF6; ORF7b; Pathogenicity; Rare mutations

Year:  2021        PMID: 33619452      PMCID: PMC7890336          DOI: 10.1016/j.mgene.2021.100873

Source DB:  PubMed          Journal:  Meta Gene        ISSN: 2214-5400


Introduction

The accessory proteins of SARS-CoV-2 help the virus to infect its human host, replicate, and eventually spread from person to person. There are six accessory proteins ORF3a, ORF6, ORF7a, ORF7b, ORF8, and ORF10 in the SARS-CoV-2 genomes, which play important roles in the viral life cycle, and may contribute to the pathogenesis and virulence (Xu et al., 2009). These accessory proteins are unique to SARS-CoV-2, as they have least homology in amino acid sequence with accessory proteins of other coronaviruses. These proteins contribute to virus virulence though it does not directly affect the virus release, stability and pathogenesis (Xu et al., 2009). Therefore, close observations and understanding of these proteins may explain the differences in pathogenicity between SARS-CoV-2, and other known coronaviruses. Within a very quick time span, SARS-CoV-2 virus has been evolving for accounting various missense mutations in various proteins. It has been already observed by us that except one in ORF7b, there are no missense mutations over the proteins ORF6, and ORF10 across the SARS-CoV-2 genomes detected in 128 Indian patients (; Hassan et al., 2020). Missense mutations in the accessory proteins of SARS-CoV-2 might impact in heterogeneity of SARS-CoV-2 and may contribute to comprehending the pathogenic dynamics as well as virulence of the virus over time. The amino acid mutations among SARS-CoV-2 isolates from diverse locations could be linked with their geographical distributions. This rapidly evolving virus is capable of adapting swiftly to the diverge environments (Islam et al., 2020). So it is quite important to discover whether any missense mutations are found in those proteins across the SARS-CoV-2 genomes worldwide, other than India.

Functions of the accessory proteins ORF6, ORF7b, and ORF10

Here we present the some known reported functions of three accessory proteins ORF6, ORF7b, and ORF10 as follows: •  ORF6: This protein contains a hydrophobic N-terminal, and it has been suggested to have a N-endo-C-endo conformation (Gunalan et al., 2011). The ORF6 protein makes intracellular membrane rearrangements resulting in a vesicular population in the infected cell which could possibly serve some role in increasing replication of the virus (Frieman et al., 2007). It has been previously reported that ORF6 down-regulated the mRNA level of the co-transfected myc-nsp8 gene (Zhou et al., 2010). It is also reported that amino acids 53–56 in ORF6 of SARS-CoV constitute a putative diacidic motif (DDEE) which influences the suppression of the expression of co-transfected myc-nsp8 (Gunalan et al., 2011). •  ORF7b: ORF7b is very much unique since it does not have any sequence homology with other viral proteins (Schaecher et al., 2007). The accessory protein ORF7b has been reported to have a structural component in the SARS-CoV virions (Pekosz et al., 2006). The transmembrane domain of ORF7b is essential to retain the protein in the Golgi compartment (Schaecher et al., 2008). It has been found that ORF7b of SARS-CoV-2 is 81% identical to that of SARS-CoV. It is reported that absence of ORF7b does not affect virus replication (Pekosz et al., 2006). Alanine scanning experiments showed that the amino acids in 13–15, and 19–22 are critical for the retention of ORF7b in the Golgi complex (Schaecher et al., 2008). •  ORF10: ORF10 protein is the unique accessory protein in SARS-CoV-2 as it does not present in the SARS-CoV (Taiaroa et al., 2020), and hence it is sometimes called” mysterious protein”. The signature appearance of ORF10 in SARS-CoV-2 makes the novel coronavirus easily detectable compared to other PCR based methods (Koyama et al., 2020). Function of this protein is still undiscovered. Therefore any missense mutation in ORF10 is significantly worth determining, and requires further investigations in order to understand the functions of ORF10.

Methods

All the protein sequences of 3080 SARS-CoV-2 genomes are fetched from the NCBI virus database. Then for each of the three accessory proteins, amino acid sequences are exported in fasta format using file operations through Matlab. For each accessory protein, these sequences (fasta formatted) are blasted using Clustal-Omega, and found the mutations and accordingly their associated positions are accounted (Madeira et al., 2019).

Results

It is to be noted that among these virus genomes from 3080 patients, there are 2126 genomes from USA, 306 from Asia, 281 genomes from Europe, 365 genomes from Oceania and one genome from Africa. Here (Table 1 ), we present the missense mutations of the three accessory proteins ORF6, ORF7b, and ORF10 over the available 3080 SARS-CoV-2 genomes. Note that the mutation of an amino acid A 1 to an amino acid A 2 is denoted by A 1(l)A 2 where l denotes location in the reference amino acid sequence. In Table 1, also the changes of the R-group of each amino acid according to the mutations are presented. The Table 1 depicts the following:
Table 1

Missense mutations in ORF6, ORF7b, and ORF10 across 3080 SARS-CoV-2 genome sequences.

ProteinsGeo-locationMutationsR-group propertyProteinsGeo-locationMutationsR-group property
QJC19423-ORF6GermanyV(24)AHydrophobic to HydrophobicQJY40403-ORF7bIndia-BotadS(31)LHydrophilic to Hydrophobic
QJT72174-ORF6FranceD(61)YAcidic to HydrophilicQJX70209-ORF7bUSA-MIT(40)IHydrophilic to Hydrophobic
QJT72858-ORF6FranceH(3)YBasic to HydrophilicQJX74553-ORF7bUSAL(20)STOPHydrophobic to STOP
QJS54110-ORF6Greece: AthensT(21)IHydrophilic to HydrophobicQJS54820-ORF7bUSA:CAF(30)LHydrophobic to Hydrophobic
QJS56575-ORF6USA:WAI(33)THydrophobic to HydrophilicQJQ84777-ORF7bTHILANDC(41)FHydrophilic to Hydrophobic
QJR84997-ORF6USA:CAD(53)GAcidic to HydrophobicQJQ84801-ORF7bTHILANDC(41)FHydrophilic to Hydrophobic
QJR87301-ORF6Australia: VictoriaD(6)YAcidic to HydrophilicQJQ84813-ORF7bTHILANDC(41)FHydrophilic to Hydrophobic
QJR87841-ORF6Australia: VictoriaW(27)LHydrophobic to HydrophobicQJQ84825-ORF7bTHILANDC(41)FHydrophilic to Hydrophobic
QJR89461-ORF6Australia: VictoriaD(53)YAcidic to HydrophilicQJQ84837-ORF7bTHILANDC(41)FHydrophilic to Hydrophobic
QJR91693-ORF6Australia: VictoriaW(27)LHydrophobic to HydrophobicQJD47604-ORF7bUSA:CTF(19)LHydrophobic to Hydrophobic
QJQ39516-ORF6USA:MII(33)THydrophobic to HydrophilicQJC19833-ORF7bUSA:WAF(28)YHydrophobic to Hydrophilic
QJD48547-ORF6USA:WAK(42)NBasic to HydrophilicQKE45662-ORF7bUSA:CAT(40)IHydrophilic to Hydrophobic
QJD49135-ORF6USA:WAK(42)NBasic to HydrophilicQKE45866-ORF7bUSA:CAF(30)LHydrophobic to Hydrophobic
QJD49147-ORF6USA:WAK(42)NBasic to HydrophilicQJY51909-ORF10USA:MIR(24)CBasic to Hydrophilic
QJC21021-ORF6SpainN(34)SHydrophilic to HydrophilicQJX70372-ORF10USA:ILI(4)LHydrophobic to Hydrophobic
QJA16692-ORF6USA:WAK(42)NBasic to HydrophilicQJS56880-ORF10USA:IDS(23)FHydrophilic to Hydrophobic
QJA16812-ORF6USA:WAD(61)YAcidic to HydrophilicQJR96431-ORF10USA:CAQ(29)STOPAcidic to STOP
QIU81889-ORF6China: BeijingQ(8)HHydrophilic to BasicQJQ38873-ORF10USA:CAS(23)FHydrophilic to Hydrophobic
QIS60718-ORF6USA:WAK(42)NBasic to HydrophilicQJD48984-ORF10USA:WAR(24)LBasic to Hydrophobic
QIQ50036-ORF6USA:WAV(9)FHydrophobic to HydrophobicQIS60555-ORF10USA:WAI(4)LHydrophobic to Hydrophobic
QIQ50136-ORF6USA:WAK(42)NBasic to HydrophilicQIS29991-ORF10China: Hubei, WuhanV(6)IHydrophobic to Hydrophobic

QJY40403-ORF7b from India-Botad. Note that mutations marked as ‘bold’ are reported here for the first time.

Missense mutations in ORF6, ORF7b, and ORF10 across 3080 SARS-CoV-2 genome sequences. QJY40403-ORF7b from India-Botad. Note that mutations marked as ‘bold’ are reported here for the first time. • In less than 1.5% of the SARS-CoV-2 genomes, the accessory proteins ORF6 (1.49%), ORF7b (0.422%), and ORF10 (0.259%) possess the missense mutations as mentioned in Table 1. • The ORF7b protein (QJX74553) got truncated due to mutation L(20)STOP to a stop codon. Also the protein ORF10 (QJR96431) is truncated due to the mutation Q(29)STOP. • There are several changes of the R-group property of amino acids viz. hydrophobic to hydrophobic, acidic to hydrophilic, basic to hydrophilic, hydrophilic to hydrophobic, acidic to hydrophobic, hydrophobic to hydrophilic, hydrophilic to basic and basic to hydrophobic. • The putative diacidic motif (DDEE) in SARS-CoV of the ORF6 protein in the 53–56 position has been mutated to DEEQ in ORF6 in some of the SARS-CoV-2 genomes. This motif DEEQ of ORF6 has been mutated (D(53)Y) to YEEQ in QJR89461-ORF6 (Australia: Victoria). Strikingly, the motif DEEQ mutated (D(53)G) to GEEQ in the ORF6 protein across the 25 SARS-CoV-2 genomes from USA:CA as shown in Table 2 . These missense mutations may affect the suppression of the expression of co-transfected myc-nsp8.
Table 2

25 SARS-CoV-2 genomes from USA:CA having D(53)G mutation in ORF6.

USA: CA
Mutation: D(53)GAcidic to Hydrophobic
QKE49188-ORF6QKE49296-ORF6QKE49404-ORF6
QKE49200-ORF6QKE49320-ORF6QKE49416-ORF6
QKE49212-ORF6QKE49332-ORF6QKE49428-ORF6
QKE49224-ORF6QKE49344-ORF6QKE49440-ORF6
QKE49236-ORF6QKE49356-ORF6QKE49452-ORF6
QKE49248-ORF6QKE49368-ORF6QKE49464-ORF6
QKE49260-ORF6QKE49380-ORF6QKE49476-ORF6
QKE49272-ORF6QKE49392-ORF6QKE49488-ORF6
QKE49500-ORF6
25 SARS-CoV-2 genomes from USA:CA having D(53)G mutation in ORF6. • The transmembrane domain of ORF7b in SARS-CoV-2 is fully conserved with respect to that of SARS-CoV. In the virus sequence QJD47604-ORF7b (USA:CT) possesses a mutation F(19)L which might affect the retention of ORF7b in the Golgi complex. There is nonsense mutation L(20)STOP in the QJX74553-ORF7b (USA), which makes the protein non-functional. • The nonsense mutation in QJR96431-ORF10 (USA:CA) makes the protein non-functional. In USA, three distinct missense mutations R(24)C, I(4)L and S(23)F are found in each pair of genomes viz. {QJY51909 − ORF10,  QJD48984 − ORF10}, {QJX70372 − ORF10,  QIS60555 − ORF10}, and {QJS56880 − ORF10,  QJQ38873 − ORF10} respectively. Also there is another mutation V(6)I (hydrophobic to hydrophobic) in the virus QIS29991-ORF10 (China: Hubei, Wuhan) which might not affect the functions of ORF10 because of the synonymous change of R-group property.

Concluding remarks

The novel RNA virus SARS-CoV-2 has been evolving through several mutations over the associated proteins within a very short time from its discovery since December 2019. It has now been observed that few mutations occurred in these three proteins. As we see here, the total amount of mutations possessed by three accessory proteins ORF6, ORF7b, and ORF10 is around 1.5%. Since the mutations are few, we firmly believe that these mutated protein variants are beneficial for virus. This study opens a question whether it carries some messages about the virus rapid replications and virulence and their role in pathogenicity. Whether these mutations raise different symptoms in the affected patients, is certainly of worth investigation from the pathogenetic perspective. These conserved proteins may suggest for development of antiviral drugs or formulate new vaccines from a new starting point. Since the mutations are rare in numbers, we firmly believe that these mutated protein variants might elongate sustainability and pathogenicity of the virus. It will also be interesting to explore whether these rare mutations were evolved from China or other affected continents/countries. Recently, two variants VUI − 202012/01 and 501. V2 of SARS-CoV-2 (first detected in Europe and South Africa, respectively) became a concern to the world population due to the spread in different countries (Loconsole et al., 2021), GISAID. Of these two, VUI-202012/01 has more transmission ability. Sequence analysis has shown that these variants have mutations in S, N, NSP etc. genes and none of them has mutation in ORF6, ORF7b, and ORF10. So, it corroborates to our observation that these ORF proteins are highly conserved in the SARS-CoV2 genome. Although the clinical importance of these rare mutations are yet to be discovered, this study certainly would serve as a groundwork.

Author contributions

SH conceived the problem. SH determined the mutations. SH, PPC, and BR analysed the data and result. SH wrote the initial draft which was checked and edited by all other authors to generate the final version.

Declaration of Competing Interest

The authors do not have any conflicts of interest to declare.
  13 in total

1.  The transmembrane domain of the severe acute respiratory syndrome coronavirus ORF7b protein is necessary and sufficient for its retention in the Golgi complex.

Authors:  Scott R Schaecher; Michael S Diamond; Andrew Pekosz
Journal:  J Virol       Date:  2008-07-16       Impact factor: 5.103

2.  Structure, expression, and intracellular localization of the SARS-CoV accessory proteins 7a and 7b.

Authors:  Andrew Pekosz; Scott R Schaecher; Michael S Diamond; Daved H Fremont; Amy C Sims; Ralph S Baric
Journal:  Adv Exp Med Biol       Date:  2006       Impact factor: 2.622

3.  The ORF7b protein of severe acute respiratory syndrome coronavirus (SARS-CoV) is expressed in virus-infected cells and incorporated into SARS-CoV particles.

Authors:  Scott R Schaecher; Jason M Mackenzie; Andrew Pekosz
Journal:  J Virol       Date:  2006-11-01       Impact factor: 5.103

4.  A putative diacidic motif in the SARS-CoV ORF6 protein influences its subcellular localization and suppression of expression of co-transfected expression constructs.

Authors:  Vithiagaran Gunalan; Ali Mirazimi; Yee-Joo Tan
Journal:  BMC Res Notes       Date:  2011-10-25

5.  Missense mutations in SARS-CoV2 genomes from Indian patients.

Authors:  Sk Sarif Hassan; Pabitra Pal Choudhury; Bidyut Roy; Siddhartha Sankar Jana
Journal:  Genomics       Date:  2020-08-18       Impact factor: 5.736

6.  Genome Sequence of a SARS-CoV-2 VUI 202012/01 Strain Identified from a Patient Returning from London, England, to the Apulia Region of Italy.

Authors:  Daniela Loconsole; Anna Sallustio; Marisa Accogli; Francesca Centrone; Loredana Capozzi; Laura Del Sambro; Antonio Parisi; Maria Chironna
Journal:  Microbiol Resour Announc       Date:  2021-01-28

7.  Variant analysis of SARS-CoV-2 genomes.

Authors:  Takahiko Koyama; Daniel Platt; Laxmi Parida
Journal:  Bull World Health Organ       Date:  2020-06-02       Impact factor: 9.408

8.  Molecular conservation and differential mutation on ORF3a gene in Indian SARS-CoV2 genomes.

Authors:  Sk Sarif Hassan; Pabitra Pal Choudhury; Pallab Basu; Siddhartha Sankar Jana
Journal:  Genomics       Date:  2020-06-12       Impact factor: 5.736

9.  Genome-wide analysis of SARS-CoV-2 virus strains circulating worldwide implicates heterogeneity.

Authors:  M Rafiul Islam; M Nazmul Hoque; M Shaminur Rahman; A S M Rubayet Ul Alam; Masuda Akther; J Akter Puspo; Salma Akter; Munawar Sultana; Keith A Crandall; M Anwar Hossain
Journal:  Sci Rep       Date:  2020-08-19       Impact factor: 4.379

10.  Severe acute respiratory syndrome coronavirus accessory protein 9b is a virion-associated protein.

Authors:  Ke Xu; Bo-Jian Zheng; Rong Zeng; Wei Lu; Yong-Ping Lin; Liang Xue; Li Li; Lei-Lei Yang; Chen Xu; Jie Dai; Fei Wang; Qing Li; Qing-Xi Dong; Rui-Fu Yang; Jia-Rui Wu; Bing Sun
Journal:  Virology       Date:  2009-04-25       Impact factor: 3.616

View more
  2 in total

1.  Clinical and genomic data of sars-cov-2 detected in maternal-fetal interface during the first wave of infection in Brazil.

Authors:  Maria de Fátima Carvalho Ferreira; Janeth Aracely Ramirez Pavon; Amanda Colichio Bini Napoleão; Gláucia Maria Duarte Preza Figueiredo; Patricia Cristina Borges Florêncio; Rayssa Basílio Dos Santos Arantes; Paula Sossai Rizzo; Maria Aparecida Mazzutti Verlangieri Carmo; Luciano Nakazato; Valéria Dutra; Rosane Christine Hahn; Renata Dezengrini Slhessarenko
Journal:  Microbes Infect       Date:  2022-02-02       Impact factor: 9.570

2.  An issue of concern: unique truncated ORF8 protein variants of SARS-CoV-2.

Authors:  Sk Sarif Hassan; Vaishnavi Kodakandla; Elrashdy M Redwan; Kenneth Lundstrom; Pabitra Pal Choudhury; Tarek Mohamed Abd El-Aziz; Kazuo Takayama; Ramesh Kandimalla; Amos Lal; Ángel Serrano-Aroca; Gajendra Kumar Azad; Alaa A A Aljabali; Giorgio Palù; Gaurav Chauhan; Parise Adadi; Murtaza Tambuwala; Adam M Brufsky; Wagner Baetas-da-Cruz; Debmalya Barh; Vasco Azevedo; Nikolas G Bazan; Bruno Silva Andrade; Raner José Santana Silva; Vladimir N Uversky
Journal:  PeerJ       Date:  2022-03-21       Impact factor: 2.984

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.