Literature DB >> 33766711

SARS-CoV-2 variants lacking ORF8 occurred in farmed mink and pangolin.

Filipe Pereira1.   

Abstract

The SARS-CoV-2 Variant of Concern 202012/01 (VOC-202012/01) is rapidly spreading worldwide owing to its substantial transmission advantage. The variant has changes in critical sites of the spike protein with potential biological significance. Moreover, VOC-202012/01 has a mutation that inactivates the ORF8 protein, whose absence can change the clinical features of the infection. Why VOC-202012/01 is more transmissible remains unclear, but spike mutations and ORF8 inactivation stand out by their known phenotypic effects. Here I show that variants combining relevant spike mutations and the absence of ORF8 occurred in SARS-CoV-2 and related viruses circulating in other host species. A truncated ORF8 (Q23stop) occurred in a SARS-CoV-2-related virus from a pangolin seized in China in 2017, also with several mutations in critical spike sites. Strikingly, I found that variants without ORF8 (E19stop) and with the N501T spike mutation circulated in farmed mink and humans from Denmark. Although with differences to VOC-202012/01, the identification of these variants highlights the danger of having reservoirs of SARS-CoV-2 and related viruses where more transmissible variants may occur and spill over to humans.
Copyright © 2021 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  COVID-19; Coronaviruses; Nonsense mutations; Transmissibility

Year:  2021        PMID: 33766711      PMCID: PMC7985683          DOI: 10.1016/j.gene.2021.145596

Source DB:  PubMed          Journal:  Gene        ISSN: 0378-1119            Impact factor:   3.688


Introduction

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the cause of coronavirus disease 2019 (COVID-19), has spread rapidly throughout the world following its emergence in Wuhan, China (Zhou et al., 2020). New sequence variants constantly arise during virus replication, some of which increase in frequency locally or worldwide. The spread of new variants over the course of an outbreak most often results from chance events, like when a limited number of individual viruses establish a new population during transmission (Grubaugh et al., 2020). However, the increase in frequency can result from a competitive advantage due to differences in viral infectivity, transmissibility or reactivity with neutralizing antibodies (Duffy et al., 2008). The SARS-CoV-2 Variant of Concern 202012/01 (VOC-202012/01), lineage B.1.1.7, is the first for which there is compelling evidence that it has a substantial transmission advantage (Chand et al., 2020, Davies et al., 2020, Rambaut et al., 2020b). Epidemiological evidence points to an estimated difference in reproduction numbers between VOC and non-VOC ranging between 0.4 and 0.7 (Volz et al., 2021). VOC-202012/01 originated in the UK in late Summer to early Autumn 2020 and rapidly spread worldwide. The variant has several mutations changing important sites in the interaction between the virus spike protein and the host angiotensin I converting enzyme 2 (ACE2). Moreover, VOC-202012/01 has a mutation (C27972T; Q27stop) that truncates the ORF8 protein or renders it to be inactive (Chand et al., 2020). It has been previously shown that SARS-CoV-2 can persist without a functional ORF8 protein (Gong et al., 2020, Pereira, 2020, Pereira, 2021, Su et al., 2020, Young et al., 2020). Patients infected with a SARS-CoV-2 variant lacking ORF8 had better clinical outcomes than those infected with wild-type variants, showing a longer duration of symptoms and a possible later onset (Young et al., 2020). SARS-CoV-2 is thought to have been transmitted to humans from bats, although it is possible that other mammalian species may have acted as “intermediate” hosts in the transmission (Andersen et al., 2020, Zhang and Holmes, 2020). The Malayan pangolins (Manis javanica) are likely candidates for intermediates by the discovery of SARS-CoV-2-related viruses in several animals illegally imported into the Chinese provinces of Guangdong and Guangxi (Han, 2020, Lam et al., 2020). Several species can potentially be infected by SARS-CoV-2 via their ACE2 proteins (Damas et al., 2020). During the COVID-19 pandemic, SARS-CoV-2 has been transmitted from humans to pets and other wild and domestic animals (Oreshkova et al., 2020, Sailleau et al., 2020, Sit et al., 2020). At least in the case of farmed mink, viruses had spilled back to humans (Munnink et al., 2021). The circulation of SARS-CoV-2 in other species increases the emergence of new variants, some of which may have significant differences to the ancestral state resulting from the adaptation to the new host. Therefore, the transmission back to humans may introduce variants with higher transmissibility or with the potential to escape the protection resulting from existing vaccines. In this regard, several mutations have been detected in the spike protein during the passage of SARS-CoV-2 through mink populations (Koopmans, 2021, Munnink et al., 2021). It is therefore important to identify variants with mutations of biological relevance, such as in the spike or accessory genes, which may have circulated in domestic or wild animals. Those variants may pose a risk for the ongoing COVID-19 pandemic. Here I show that SARS-CoV-2 variants without the ORF8 occurred in both pangolin and farmed mink, always in the background of mutations in spike sites of biological relevance. These results suggest that mutations as those observed in VOC-202012/01 may occur in other host species, introducing an additional risk to the ongoing pandemic.

Materials and methods

Genomic sequences from SARS-CoV-2 and related viruses

Genomic sequences from SARS-CoV-2 and related viruses were obtained from the GISAID Initiative (https://www.gisaid.org/) on January 11, 2021. I searched for ORF8 nonsense mutation is all available sequences from Canis lupus familiaris, Chlorocebus sabaeus, Felis catus, Manis javanica, Manis pentadactyla, Mus musculus, Mustela lutreola, Neovison vison, Panthera leo, Panthera tigris jacksoni, Rhinolophus affinis, Rhinolophus malayanus and Rhinolophus shameli. The sequences from pangolin (M. javanica) and mink (N. vison) with ORF8 nonsense mutations were classified into clades using the Nextclade tool (https://clades.nextstrain.org/) and the Phylogenetic Assignment of Named Global Outbreak LINeages (PANGOLIN) tool running in GISAID (Rambaut et al., 2020a).

SARS-CoV-2 related virus from pangolin (M. javanica) sample P1E

The raw sequence reads from the pangolin sample P1E were extracted from the SRA database, BioProject accession number PRJNA606875, experiment SRX7732093 (Lam et al., 2020). A blast analysis (Altschul et al., 1990) was performed to identify reads including the ORF8 position 27,960, where a mutation causes the premature stop codon Q23stop. The reads were assembled to the SARS-CoV-2 reference using the Geneious mapper under the ‘Highest sensitivity’ setting, running on Geneious Prime 2019.0.4 (https://www.geneious.com). Reads matching position 27,960 close to their ends were excluded to avoid sequencing artifacts. The final dataset included 152 reads (mean length=300.1; SD=0.5) holding the position 27,960.

SARS-CoV-2 in farmed mink (N. vison) from Denmark

All SARS-CoV-2 complete genomic sequences from Denmark with mutation ORF8 E19stop were downloaded from GISAID. The sequences were aligned using the MAFFT version 7 online service (Katoh et al., 2019), with the light-weight option for MSA of full-length SARS-CoV-2 genomes. The SARS-CoV-2 genome with accession number NC_045512.2 was used as reference. The maximum-likelihood phylogenetic tree was constructed using the PhyML 3.0 (Guindon et al., 2010) running in the ATGC bioinformatics platform (http://www.atgc-montpellier.fr). The GTR + I substitution model was selected using the SMS: Smart Model Selection in PhyML (Lefort et al., 2017), with option AIC (Akaike Information Criterion). The branch support was estimated using a bootstrap of 100 trees. The 3D structure of the spike glycoprotein (PDB: 6acj, EM 4.2 Angstrom) in complex with host cell receptor ACE2 were obtained using the CoVsurver enabled by GISAID.

Results

A SARS-CoV-2 variant with a truncated ORF8 (Q23stop) occurred in a pangolin (M. javanica) from Guangx, China

SARS-CoV-2-related viruses have been identified in Malayan pangolins (M. javanica), but is still unknown if they were at the origin of the COVID-19 pandemic (Lam et al., 2020, Dimonaco et al., 2021). I found that a SARS-CoV-2-related virus from a pangolin sample (sample P1E; EPI_ISL_410539) had a truncated ORF8 resulting from a Q23stop mutation (Table 1 ). This pangolin was seized by the Guangxi Customs, China, during their routine anti-smuggling operations in 2017 (Lam et al., 2020).
Table 1

SARS-CoV-2 and related viruses with variants lacking ORF8 identified in wild or domestic species.

SequenceGISAID Accession IDCollection dateLocationClade (Nextclade)Lineage (Pangolin)ORF8 nonsense mutationSpike amino acid substitutions
Pangolin (Manis javanica)Guangxi/P1EEPI_ISL_4105392017China/Guangxi19BB.15Q23stop92 including V483Q, F490Y, N501T, P681M
Mink (Neovison vison)mDK-28EPI_ISL_64142116/10/2020Denmark20AB.1E19stopT95I, D614G
mDK-29EPI_ISL_64142216/10/2020D614G
mDK-30EPI_ISL_64142316/10/2020D614G
mDK-154EPI_ISL_68300505/11/2020D614G
mDK-158EPI_ISL_68300905/11/2020D614G
mDK-159EPI_ISL_68301005/11/2020D614G
mDK-172EPI_ISL_68302304/11/2020N501T, D614G
mDK-173EPI_ISL_68302404/11/2020N501T, D614G
mDK-174EPI_ISL_68302504/11/2020N501T, D614G
SARS-CoV-2 and related viruses with variants lacking ORF8 identified in wild or domestic species. The pangolin sample P1E and human SARS-CoV-2 reference genomes had 80.3% of identical sites in the ORF8 gene, in line with previous studies (Mohammad et al., 2020, Pereira, 2020). Both lineages diverge by 16 out of 121 amino acids in the ORF8 protein (86.9% of identity). I aligned the ORF8 gene sequences from all pangolin samples reported in Lam et al. with the SARS-CoV-2 reference sequence (Fig. 1 A). After excluding a sequence with a large sequencing gap in ORF8, I found that all pangolin sample had an equal ORF8 gene, with exception of the Q23stop mutation at sample P1E. The similarity among samples from different pangolins suggests that the Q23stop mutation was recent, perhaps even within the sampled individual. If the lineage of sample P1E had acquired the ORF8 Q23stop mutation before, one should expect the accumulation of other mutations in a gene encoding a non-functional protein that was not under selection. The possibility of Q23stop resulting from a sequencing error should be considered in old samples stored in unknown conditions for a few years. However, I found that 151 out of 152 sequencing reads for ORF8 had a T that causes the premature stop codon, with a single read showing the reference C nucleotide. It is therefore unlikely that errors would generate such a clear pattern in obtained sequences.
Fig. 1

SARS-CoV-2-related viruses in Malayan pangolins (Manis javanica). A) Phylogeny of SARS-CoV-2 and related viruses sampled in bats and pangolins. SARS-CoV-2 from the COVID-19 epidemic are coloured in red, while related SARS-like coronaviruses are coloured in blue. The sample P1E (EPI_ISL_410539) is highlighted by a green box. Tree obtained from the Nextstrain website. B) and C) Identity plot for the alignment of ORF8 (B) and spike (C) genes from SARS-CoV-2-related viruses sequenced from five pangolin samples collected in Guangxi, China. Highlighted in B) is the ORF8 gene region with the mutation C27960T causing the premature stop codon Q23stop observed in sample P1E. The alignments include the reference SARS-CoV-2 genome (NC_045512.2). The most conserved positions are indicated by green bars.

SARS-CoV-2-related viruses in Malayan pangolins (Manis javanica). A) Phylogeny of SARS-CoV-2 and related viruses sampled in bats and pangolins. SARS-CoV-2 from the COVID-19 epidemic are coloured in red, while related SARS-like coronaviruses are coloured in blue. The sample P1E (EPI_ISL_410539) is highlighted by a green box. Tree obtained from the Nextstrain website. B) and C) Identity plot for the alignment of ORF8 (B) and spike (C) genes from SARS-CoV-2-related viruses sequenced from five pangolin samples collected in Guangxi, China. Highlighted in B) is the ORF8 gene region with the mutation C27960T causing the premature stop codon Q23stop observed in sample P1E. The alignments include the reference SARS-CoV-2 genome (NC_045512.2). The most conserved positions are indicated by green bars. In terms of the spike gene, sample P1E and human SARS-CoV-2 had 83.1% of identical sites, resulting in 99 out of 1274 amino acids differences (Fig. 1B). Among those differences, the pangolin sample P1E had amino acid changes in spike sites V483Q, F490Y, N501T and P681M. On the contrary to ORF8, all pangolin samples diverge in the spike gene, but all share the spike mutations of sample P1E analysed here.

SARS-CoV-2 variants combining ORF8 E19stop and spike N501T mutations circulated in farmed mink (N. vison)

The infection of farmed mink (N. vison) with SARS-CoV-2 that subsequently spilled back into humans prompted the Ministry of Environment and Food of Denmark the culling of all mink in the country on November 5, 2020 (Koopmans, 2021). Full-length virus genome sequencing revealed novel mutations in the spike protein during the passage through the mink population (Hammer et al., 2021, van Dorp et al., 2020). I found nine SARS-CoV-2 sequences with ORF8 E19stop mutation in mink from Denmark (Table 1). The sequences belong to clade 20A (Nextclade) or lineage B.1 that dominated the European outbreak in March 2020 (Rambaut et al., 2020a). The three sequences with the oldest collection date (2020–10-16) were all from mink (EPI_ISL_641421, EPI_ISL_641422 and EPI_ISL_641423). The phylogeny built with all SARS-CoV-2 sequences with the ORF8 E19stop from Denmark showed that one of the 2020–10-16 sequences splits before all the others (Fig. 2 A). It is therefore possible that the ORF8 E19stop mutation occurred in the mink population, and then spread to humans.
Fig. 2

SARS-CoV-2 variants with truncated ORF8 observed in farmed mink (Neovison vison). A) Maximum-likelihood phylogenetic tree with all SARS-CoV-2 complete genome sequences with the ORF8 E19stop mutation detected in Denmark. Sequences obtained from mink are shown in red. The cluster of sequences with the spike N501T mutation is highlighted by an orange box. Bootstrap support values are shown in main branches. B) Predicted 3D structure of the spike glycoprotein (grey ribbon) in complex with host cell receptor ACE2 (green ribbon), with amino acids 501 (yellow circles) and 614 (blue circles) highlighted. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

SARS-CoV-2 variants with truncated ORF8 observed in farmed mink (Neovison vison). A) Maximum-likelihood phylogenetic tree with all SARS-CoV-2 complete genome sequences with the ORF8 E19stop mutation detected in Denmark. Sequences obtained from mink are shown in red. The cluster of sequences with the spike N501T mutation is highlighted by an orange box. Bootstrap support values are shown in main branches. B) Predicted 3D structure of the spike glycoprotein (grey ribbon) in complex with host cell receptor ACE2 (green ribbon), with amino acids 501 (yellow circles) and 614 (blue circles) highlighted. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.) Five sequences had a mutation (N501T) in a spike site also mutated in the VOC-202012/01 variant (N501Y), three from mink and two from humans (Fig. 2A). The five sequences formed a cluster in the phylogenetic analysis, suggesting a single origin of the mutation. The three mink sequences were equal and diverge from the sequences found in humans by two and three mutations, all outside the spike and ORF8 genes (Table 2 ). In addition to spike N501T, these five sequences had the D614G mutation. Among sequences with the spike N501T mutation, the mink sequences had a collection date (2020–11-04) which is 12 and 18 days before the collection date of samples from humans (Fig. 2A), suggesting that N501T had an origin in the mink population. The N501Y mutation is located at the receptor binding domain (RBD) of the spike protein (Fig. 2B) and helped the adaptation of SARS-CoV-2 in BALB/c mice, being potentially associated with the increased virulence (Gu et al., 2020). Mutation N501T gives rise to a hydrogen bond to lysine 353 (van Dorp et al., 2020) and was shown to enhance ACE2 affinity (Starr et al., 2020). According to GISAID, spike mutation N501T occurred 47 times (0.02% of all samples with Spike sequence) in 9 countries.
Table 2

Number of differences among the five SARS-CoV-2 complete genome sequences reported in mink and human with both ORF8 E19stop and spike N501T.

Mink EPI_ISL_683023Mink EPI_ISL_683024Mink EPI_ISL_683025Human EPI_ISL_713271
Mink EPI_ISL_6830240
Mink EPI_ISL_68302500
Human EPI_ISL_713271333
Human EPI_ISL_7142622225
Number of differences among the five SARS-CoV-2 complete genome sequences reported in mink and human with both ORF8 E19stop and spike N501T.

Discussion

The ORF8 accessory gene is believed to interfere with immune responses and many putative cellular interactors have been identified, but its precise function remains unknown (Gordon et al., 2020, Flower et al., 2021). Expression of ORF8 is not essential for the virus replication, as suggested by the several variants lacking this gene identified worldwide (Pereira, 2020, Pereira, 2021, Zinzula, 2021, Alkhansa et al., 2021). This fact was first noticed in a lineage with an ORF8 382-nucleotide deletion detected in Singapore and Taiwan (Gong et al., 2020, Su et al., 2020). Curiously, middle and late phases of the 2002/2003 SARS epidemic were characterized by the spread of SARS-CoV with either partial or complete deletions of the ORF8 gene (Guan et al., 2003, C.S.M.E. Consortium, 2004). It remains unclear if the truncated ORF8 may have changed the SARS-CoV replication capacity or virulence in a way that favoured its adaptation to humans and/or its spread during the SARS epidemic (Guan et al., 2003; C.S.M.E. Consortium, 2004; Lau et al., 2005, Chen et al., 2007). Whatever the case, deletions at ORF8 were the most remarkable changes observed during the SARS epidemic. The same may be happening in SARS-CoV-2, as more transmissible lineages become dominant as the pandemic progresses. Patients infected with variants lacking ORF8 had less severe symptoms and a possible prolonged infection period that may increase the opportunities for the virus spread (Young et al., 2020). Interestingly, the VOC-202012/01 lacks ORF8 in addition to several spike mutations. It remains to be determined if the absence of ORF8 contributes to the increases transmissibility. The longer duration of symptoms resulting from the absence of ORF8 may increase the opportunity for transmission, and the milder symptoms may increase undiagnosed cases and the number of transmissions. The identification of a pangolin harbouring a SARS-CoV-2-related virus without a functional ORF8 suggests that loosing this accessory gene may be a common event. In addition to SARS-CoV, deletion of accessory genes have been observed in the Middle East respiratory syndrome related coronavirus (MERS-CoV) (Lamers et al., 2016), feline coronavirus (FCoV) (Kennedy et al., 2001), porcine respiratory coronavirus (PRCV) (Chen et al., 2019) and mouse hepatitis virus (MHV) (de Haan et al., 2002), often leading to an attenuation of virulence. It is clear that accessory genes play a dynamic role in such viruses, often varying along the progression of the pandemics. The pangolin sample P1E has 92 spike amino acid substitutions in relation to the reference SARS-CoV-2, including V483Q, F490Y, N501T and P681M. The spike protein consists of 2 subunits (S1 and S2) that mediate infection of host cells, with S1 contains a RBD responsible for recognizing and binding ACE2 (Zhou et al., 2020). The RBD region includes the positions 483, 490, 501, where pangolin and human viruses diverge. Changes in these RBD critical sites may influence the virus capacity to infect cells, and therefore be subject to intense selection. Although difficult to perform, a study of SARS-CoV-2-related viruses in pangolins lacking ORF8 could help to understand how ORF8 and spike mutation affect the transmissibility of these viruses. The detection of ORF8-deficient lineages in farmed mink is also remarkable considering the low frequency of such lineages in the human population, at least before the spread of VOC-202012/01. It is therefore likely that the loss of ORF8 occurred within the mink population, suggestive of adaptation of the virus to the new host or simple by random events. The identification of sequences from humans and mink clustering together suggests that lineages without ORF8 can spill over from one species to another, although it is not clear which species infected the other in this case. The mink population could also have been a good model to study how variants with and without ORF8 may have different transmissibility rates. Overall, the co-occurrence of ORF8 and spike mutations with phenotypic consequences is particularly problematic, as it may give rise to variants with higher transmissibility, as observed with VOC-202012/01. The data provided here suggests that such variants already occurred in other host species, revealing the risk of having reservoirs of SARS-CoV-2 in species close to humans.

Declaration of Competing Interest

The author declare that he has no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  5 in total

1.  SARS-CoV-2: Unique Challenges of the Virus and Vaccines.

Authors:  Ata Mahmoodpoor; Sarvin Sanaie; Parisa Samadi; Mehdi Yousefi; Nader D Nader
Journal:  Immunol Invest       Date:  2021-06-10       Impact factor: 3.657

2.  Household transmission of SARS-CoV-2 from humans to pets in Washington and Idaho: burden and risk factors.

Authors:  Julianne Meisner; Timothy V Baszler; Kathryn H Kuehl; Vickie Ramirez; Anna Baines; Lauren A Frisbie; Eric T Lofgren; David M DeAvila; Rebecca M Wolking; Dan S Bradway; Hannah Wilson; Beth Lipton; Vance Kawakami; Peter M Rabinowitz
Journal:  bioRxiv       Date:  2022-02-22

3.  The emergence, dynamics and significance of SARS-CoV-2 variants.

Authors:  Philippe Colson; Philippe Parola; Didier Raoult
Journal:  New Microbes New Infect       Date:  2022-02-01

Review 4.  Coronavirus Infection and Cholesterol Metabolism.

Authors:  Jun Dai; Huan Wang; Ying Liao; Lei Tan; Yingjie Sun; Cuiping Song; Weiwei Liu; Xusheng Qiu; Chan Ding
Journal:  Front Immunol       Date:  2022-04-21       Impact factor: 8.786

5.  The emergence, spread and vanishing of a French SARS-CoV-2 variant exemplifies the fate of RNA virus epidemics and obeys the Mistigri rule.

Authors:  Philippe Colson; Philippe Gautret; Jeremy Delerce; Hervé Chaudet; Pierre Pontarotti; Patrick Forterre; Raphael Tola; Marielle Bedotto; Léa Delorme; Wahiba Bader; Anthony Levasseur; Jean-Christophe Lagier; Matthieu Million; Nouara Yahi; Jacques Fantini; Bernard La Scola; Pierre-Edouard Fournier; Didier Raoult
Journal:  J Med Virol       Date:  2022-08-28       Impact factor: 20.693

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.