Literature DB >> 34570776

Large-scale analysis of SARS-CoV-2 spike-glycoprotein mutants demonstrates the need for continuous screening of virus isolates.

Barbara Schrörs1, Pablo Riesgo-Ferreiro1, Patrick Sorn1, Ranganath Gudimella1, Thomas Bukur1, Thomas Rösler1, Martin Löwer1, Ugur Sahin1,2.   

Abstract

Due to the widespread of the COVID-19 pandemic, the SARS-CoV-2 genome is evolving in diverse human populations. Several studies already reported different strains and an increase in the mutation rate. Particularly, mutations in SARS-CoV-2 spike-glycoprotein are of great interest as it mediates infection in human and recently approved mRNA vaccines are designed to induce immune responses against it. We analyzed 1,036,030 SARS-CoV-2 genome assemblies and 30,806 NGS datasets from GISAID and European Nucleotide Archive (ENA) focusing on non-synonymous mutations in the spike protein. Only around 2.5% of the samples contained the wild-type spike protein with no variation from the reference. Among the spike protein mutants, we confirmed a low mutation rate exhibiting less than 10 non-synonymous mutations in 99.6% of the analyzed sequences, but the mean and median number of spike protein mutations per sample increased over time. 5,472 distinct variants were found in total. The majority of the observed variants were recurrent, but only 21 and 14 recurrent variants were found in at least 1% of the mutant genome assemblies and NGS samples, respectively. Further, we found high-confidence subclonal variants in about 2.6% of the NGS data sets with mutant spike protein, which might indicate co-infection with various SARS-CoV-2 strains and/or intra-host evolution. Lastly, some variants might have an effect on antibody binding or T-cell recognition. These findings demonstrate the continuous importance of monitoring SARS-CoV-2 sequences for an early detection of variants that require adaptations in preventive and therapeutic strategies.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 34570776      PMCID: PMC8475993          DOI: 10.1371/journal.pone.0249254

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Since the first report of the severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) outbreak [1, 2], it has transformed into a global pandemic infecting and threatening death for millions of people all over the globe. By July 9, 2021, the World Health Organization (WHO) reported 185,291,530 confirmed cases and 4,010,834 deaths caused by the SARS-CoV-2 outbreak [3]. After the approval of SARS-CoV-2 vaccines which are designed to invoke immune responses against the spike-glycoprotein (spike protein), it becomes necessary to track the mutations in spike protein and study their relevance for current and upcoming vaccines. Also the recently approved neutralizing antibody bamlanivimab targets the spike protein of SARS-CoV-2 [4]. Subunits of the spike protein are valuable targets for vaccine design as the protein is responsible for viral binding and entry to host cells [5, 6]. The spike protein consists of the N-terminal S1 and the C-terminal S2 subunits; the receptor-binding domain (RBD) in the S1 subunit binds to a receptor on the host cell surface and the S2 subunit fuses viral and host membranes [7]. The receptor binding domain (RBD) of the SARS-CoV-2 spike protein recognizes human angiotensin-converting enzyme 2 (ACE2) as its entry receptor, similar to SARS-CoV [8]. Interacting residues of the SARS-CoV-2 RBD with human ACE2 are highly conserved or share similar side chain properties with the SARS-CoV RBD [9]. In addition, the SARS-CoV-2 RBD shows significantly higher binding affinity to ACE2 receptor compared to the SARS-CoV RBD. In order to repress the infection, blocking the RBD binding was effective in ACE2-expressing cells [5]. Among the interacting sites in the SARS-CoV-2 RBD, particularly the amino acid residues L455, F486, Q493, S494, N501, and Y505 provide critical interactions with human ACE2 [10]. These interacting residues vary due to natural selection in SARS-CoV-2 and other related coronaviruses [11]. Similarly, worldwide SARS-CoV-2 genomic data shows ten RBD mutations which were caused due to natural selection by circulating among the human population [12]. RBD mutations particularly at N501 may enhance the binding affinity between SARS-CoV-2 and human ACE2 significantly, improving viral infectivity and pathogenicity [10]. It is reported that continuous evolution of SARS-CoV-2 among the global population results into six major subtypes which involve the recurrent D614G mutation of the spike protein [13]. Further, spread of such recurrent mutations within sub-populations might affect the severity of disease emergence and change the trajectory of the pandemic. Studies also report high intra-host diversity caused by low frequency subclonal mutations within a specific cohort [14]. It is evident that changes in the SARS-CoV-2 genome over time might show new mutations which might influence the development efforts of interventional strategies. The variability of epitopes of the RBD might hamper the development and use of neutralizing antibodies for cross-protective activities against mutant strains [15]. Mutational variants of the spike protein might as well lead to escape variants with respect to pre-existing cross-reactive CD4+ T cell responses [16] or long-term protection from re-infection through T cell memory. Hence, there is a necessity of constant monitoring of the rapidly changing mutation rates in the spike protein in SARS-CoV-2, which could have significant impact on virus infection, transmissibility and pathogenicity in the current pandemic. In this study, we gathered 1,036,030 genomic assemblies and 30,806 NGS datasets to detect non-synonymous spike protein mutations and infer their frequency within a given sample and the effect on potential antibody binding sites and known T cell epitopes.

Methods

SARS-CoV-2 assemblies

SARS-CoV-2 assemblies from human hosts were downloaded on April 13th, 2021 from GISAID (nucleotide sequences; [17]). Two samples were excluded for having seemingly wrong dates, 2012-03-21 and 2017-11-21. 2,014 additional samples were excluded as the assemblies were shorter than 1,000 bp. Unfortunately, all 98 samples from Ivory Coast failed to be loaded due to a technical problem, this issue will be amended in subsequent analyses. Pairwise alignments to the reference genomic sequence (MN908947.3) were performed using the Python package Biopython (version 1.79). Global alignment options were (extend_gap_score = -0.1, open_gap_score = -3,_mismatch_score = 1,_match_score = 2). Variants were called using the global alignment results. Any variant containing an N or any ambiguous IUPAC code (ie: (ie: R, Y, S, W, K, M, B, D, H, V)) was filtered out. With the previous procedure, we obtained an average of 22.425SNVs per sample respectively. We further excluded all samples with a number of variants greater than 75th percentile plus three times the interquartile range for each variant type separately. This resulted in 120 samples excluded due to an extremely high number of SNVs. The variants were subsequently normalized following the procedure described in Tan et al. 2015 [18] using vt (version 0.57721) and BCFtools (version 1.12). Finally, variants were annotated with SnpEff (version 5.0) [19]. This analysis workflow was implemented in the Nextflow framework and open sourced under the MIT license as the CoVigator NGS pipeline (https://github.com/TRON-Bioinformatics/covigator-ngs-pipeline) [20].

NGS data processing

All available NGS data for SARS-CoV-2 was downloaded on June 11th, 2021 from the European Nucleotide Archive (ENA) application programming interface (API) (https://www.ebi.ac.uk/ena/portal/api/; [21] and filtered for whole genome FASTQ data from Illumina instruments with a human sample background. Data were aligned to the reference MN908947.3 [2]. Short-read whole genome sequencing data were aligned with bwa (version 0.7.17) mem [22]. Output files in SAM format were sorted and converted to their binary form (BAM) using SAMtools (version 0.1.16) [23]. Variants were retrieved from the alignment files using BCFtools (version 1.9) mpileup (http://samtools.github.io/bcftools/) with the options to recalculate per-base alignment quality on the fly, disabling the maximum per-file depth, and retention of anomalous read pairs. Variants in gene gp02 (i.e. S gene) were annotated using SNPeff (version 4.3t) “ann” [19].

Filtering subclonal variants

NGS variants were filtered with at least 30 reads coverage and a fraction of supporting reads of at least 0.1 and less than 0.95 to identify high-confidence sub-clonal mutations [24].

Published SARS-CoV-2 T-cell epitopes and HLA binding prediction

SARS-CoV-2 antigens reported by Snyder et al. [25] where downloaded from https://clients.adaptivebiotech.com/pub/covid-2020 on 17NOV2020 (MIRA release 002.1). 343 spike protein epitopes with positive T cell response were extracted from the IEDB database (https://www.iedb.org/, accessed June 21th, 2021). HLA binding prediction was done with netMHCpan, version 4.1 [26].

Results

SARS-CoV-2 spike protein mutational profile from genome assemblies and NGS data

First, we determined the number of non-synonymous mutations in the spike protein per sample (for geographic background of the collected samples, see S1 Fig). Of the 1,036,030 analyzed genome assemblies and 30,806 NGS data sets, only 2.5% (26,746 samples) contained the wild type (WT) spike protein (Fig 1A). Samples of mutant viruses exhibited only few mutations in the spike protein with less than ten mutations for all but 4,193 sequences. However, the mean and median number of mutations increased over time from December 2019 (mean: 0.14, median: 0) to April 2021 (mean: 7.2, median: 7; Fig 1B). Overall, we detected 5,472 distinct non-synonymous mutations in the spike protein (S1 Table).
Fig 1

Most of the analyzed SARS-CoV-2 sequences differ from WT spike protein, but exhibit only few non-synonymous mutations.

(A) The histogram shows the number of non-synonymous spike protein mutations detected in the analyzed samples. (B) The mean (red) and median (blue) number of mutations per spike protein sequence increased over time. The top line gives the number n of samples per month. The boxplots indicate the monthly distributions of mutations per sample.

Most of the analyzed SARS-CoV-2 sequences differ from WT spike protein, but exhibit only few non-synonymous mutations.

(A) The histogram shows the number of non-synonymous spike protein mutations detected in the analyzed samples. (B) The mean (red) and median (blue) number of mutations per spike protein sequence increased over time. The top line gives the number n of samples per month. The boxplots indicate the monthly distributions of mutations per sample.

Recurrent variants in SARS-CoV-2 spike protein

Most of the observed variants in the assembly and NGS data sets were recurrent (Fig 2A) and only 22.4% of the variants were singular events in the combined assembly and the NGS data. The recurrent variants were distributed throughout the whole spike protein (Fig 2B and 2C). Among the recurrent variants, 21 and 14 mutations were found in at least 1% of the mutant assembly and NGS samples, respectively (labeled variants in Fig 2B and 2C). The most common mutation was D614G in both the genome assemblies (1,056,342 samples) and the NGS data (27,667 samples) located outside the RBD (positions 319–529), followed by the RBD variants Y501N in the assemblies (346,194 samples) and in the NGS data (5,987 samples). In total, 852 distinct mutations (646 recurrent) were detected in the RBD in the assemblies out of which only 5 were common to more than 1% of the mutated assembly sequences (Fig 2B). For the NGS samples, 259 mutations in total (105 recurrent) were found in the RBD (Fig 2C) and only two were detected in at least 1% of the mutant NGS samples. Overall, 1,637 mutations were commonly found in the assembly and NGS data (Fig 2D).
Fig 2

Recurrent variants are found throughout the whole spike protein.

(A) Most of the detected variants were recurrent events occurring in at least two samples from the assembly or NGS data sets. (B, C) Each data point represents a distinct protein sequence mutation in the spike protein. The labels indicate the amino acid exchange for variants found in more than 1% of the assemblies (B) or NGS samples (C). The RBD is highlighted in red. (D) 1,637 variants (grey) were detected both in the assemblies and the NGS data. (E) A subset of 35 variants co-occurred in at least 5000 of the mutated spike protein sequences (assemblies and NGS data combined). For better visibility, co-occurrences in less than 5000 samples were set to 0 (white tiles).

Recurrent variants are found throughout the whole spike protein.

(A) Most of the detected variants were recurrent events occurring in at least two samples from the assembly or NGS data sets. (B, C) Each data point represents a distinct protein sequence mutation in the spike protein. The labels indicate the amino acid exchange for variants found in more than 1% of the assemblies (B) or NGS samples (C). The RBD is highlighted in red. (D) 1,637 variants (grey) were detected both in the assemblies and the NGS data. (E) A subset of 35 variants co-occurred in at least 5000 of the mutated spike protein sequences (assemblies and NGS data combined). For better visibility, co-occurrences in less than 5000 samples were set to 0 (white tiles). Furthermore, 35 (0.64%) of the detected variants co-occurred frequently in at least 5000 of the mutated spike protein sequences when we combined assembly and NGS data (Fig 2E). Most prominent here, was the variant D614G which was found in combination with 4,066 other variants. The combination P681H/D614G was detected in 345,808 samples. The most frequent co-occurring mutations not involving D614G were P681H/T716I (324,269 samples).

Subclonal variants

In addition, we were interested in subclonal spike protein mutations (i.e. mutations with an observed variant frequency—as derived from the NGS reads—below 100%) which might either indicate co-infection with various SARS-CoV-2 strains and/or intra-host evolution of the virus. To this end, the fraction of variant supporting reads per sample of the detected mutations was determined. Most of the variants were observed with at least 95% of the reads supporting the respective variant nucleotide (Fig 3A and 3B). However, a portion of the overlapping reads pointing to subclonal events only confirmed among few mutations. Filtering for a depth of at least 30 reads and a fraction of supporting reads between 0.1 and 0.95 [24] resulted in 834 mutations observed in 732 samples (i.e. 2.59% of the NGS data sets with mutant spike protein) that could be classified as high-confident subclonal (Fig 3B). Most of these subclonal events were recurrent variants (Fig 3C). In some of the earlier samples, but also in some later cases, the fractions of supporting reads within the same sample differed notably (grey lines in Fig 3C) indicating the presence of more than two spike protein versions within the same host.
Fig 3

Variant frequencies of spike protein mutants indicate presence of multiple SARS-CoV-2 mutants in some samples.

(A) The boxplot shows the distributions of the fraction of supporting reads of the mutations found in the NGS data. The numbers of underlying samples are indicated above the collection dates. Most of the observed variants have a variant allele frequency of > = 0.95 and can be accounted as clonal. (B) Filtering for high-confidence subclonal variants (green) with sequencing depth > = 30 reads and fractions of supporting reads between 0.1 and 0.95. (C) Sample-wise depiction of high-confidence subclonal events. Some of the observed subclonal variants were recurrent (blue) and only few were individual (red). The samples were ordered by collection date (see also color bar at the bottom of the plot) and point sizes indicate sequencing depth (log10 scale). Subclonal variants of the same sample are linked with grey lines. The fraction of supporting reads of variants found in the same sample differed notably in some cases.

Variant frequencies of spike protein mutants indicate presence of multiple SARS-CoV-2 mutants in some samples.

(A) The boxplot shows the distributions of the fraction of supporting reads of the mutations found in the NGS data. The numbers of underlying samples are indicated above the collection dates. Most of the observed variants have a variant allele frequency of > = 0.95 and can be accounted as clonal. (B) Filtering for high-confidence subclonal variants (green) with sequencing depth > = 30 reads and fractions of supporting reads between 0.1 and 0.95. (C) Sample-wise depiction of high-confidence subclonal events. Some of the observed subclonal variants were recurrent (blue) and only few were individual (red). The samples were ordered by collection date (see also color bar at the bottom of the plot) and point sizes indicate sequencing depth (log10 scale). Subclonal variants of the same sample are linked with grey lines. The fraction of supporting reads of variants found in the same sample differed notably in some cases.

Effect of detected spike protein variants on potential antibody and T cell target sites

Next, we investigated whether the observed spike protein variants were relevant in the context of T cell recognition. 5390 (98.5%) of the 5,472 distinct variants hit at least one CD8+ or CD4+ T-cell epitope (Fig 4A) reported by Snyder et al. [25] no matter if they were recurrent or individual events and 4959 (90.6%) variants overlap with T-cell epitopes from the IEDB. In order to evaluate the qualitative impact of mutations in known T-cell epitopes, we collected 225 MHC class I epitopes with known HLA restriction from various literature sources [27-29], allowing the use of computational tools [26] to predict the difference of binding affinities of the mutant and the wild-type epitope. This computed value has been demonstrated to be predictive for immunogenicity [30]. The results for 15905 epitope-mutation pairs (Fig 4B) show a lower predicted binding affinity by 100-fold or more for 266 mutant epitopes and by 1000-fold or more for 13 mutant epitopes.
Fig 4

Variants affect antibody and T cell target sites.

(A) The number of published T cell epitopes (listed in the IEDB or recognized by CD8+ or CD4+ T cells as reported by Snyder et al. [25] that are affected by recurrent or individual spike protein variants is depicted. Most of the variants hit at least one epitope. (B) Predicted binding affinity rank of wild type epitope (WT) vs. the quotient of the predicted rank of wild type and mutant epitope (MUT) depicted as heat map. Lower ranks indicate better binding. A small quotient indicates worse binding prediction for the mutant epitope.

Variants affect antibody and T cell target sites.

(A) The number of published T cell epitopes (listed in the IEDB or recognized by CD8+ or CD4+ T cells as reported by Snyder et al. [25] that are affected by recurrent or individual spike protein variants is depicted. Most of the variants hit at least one epitope. (B) Predicted binding affinity rank of wild type epitope (WT) vs. the quotient of the predicted rank of wild type and mutant epitope (MUT) depicted as heat map. Lower ranks indicate better binding. A small quotient indicates worse binding prediction for the mutant epitope.

Discussion

Our study sheds light on non-synonymous variants in the spike protein of SARS-CoV-2 in a large cohort of samples from all over the world. While most analyzed sequences vary from the reference sample from Wuhan, China, our analysis of more than one million assembly and NGS samples shows an overall low mutation burden in the SARS-CoV-2 spike protein across different host populations (Fig 1). However, the mean and median number of variants per sample increased over time. Coronaviruses have fewer mutations compared to any other RNA virus due to its inherent 3’ to 5’ exoribonuclease activity [31]. This suggests that the SARS-CoV-2 genome is genetically stable and the vast majority of mutations have no phenotypic effect such as virus transmissibility and virulence [32, 33]. However, mutations of critical residues in the RBD of the spike protein might increase the virus transmission ability by enhancing the interaction [34]. Furthermore, vaccines or treatments targeting the spike protein might become less efficient, if the number of variants in the spike protein increases further, as described by McCallum et al. [35]. We identified a subset of mutations from the assembly and NGS data that are recurrent variants in the spike protein. Van Dorp et al. [36] have already reported such recurrent variants in SARS-CoV-2 evolution, which is a likely phenomenon of positive selection signifying the adaption of SARS-CoV-2 in human hosts. Furthermore, most recurrent variants show no evidence in increase of viral transmission and are likely induced by host immunity through RNA editing mechanisms [37]. However, some variants might significantly influence SARS-CoV-2 transmission and infectivity. Among such variants, the non-synonymous D614G mutation has become most prevalent among several populations. We identified around 99.1% of the samples with a D614G variant, which supports a previous theory of an increasing frequency of the D614G variant in the global pandemic [34]. Studies show evidence that the D614G variant is associated with high levels of viral RNA in COVID-19 patients, suggesting a role of D614G mutations in enhancing the viral infectivity in patients [34, 38–40]. In contrast to these findings, it remains unclear whether the D614G variant makes the infections more severe or may impact vaccine design [41], as the viral load does not correlate with disease severity and the variant is not in the RBD of the spike protein, which interacts with the human ACE2 protein. The RBD of the spike protein is a potential target for neutralizing antibodies and the variants in these regions might influence the infectivity and pathogenicity. We have identified high frequency variants in the RBD region from the assembly data, i.e. S477N, Y501N or R452L (Fig 2B and 2C). S477N occurs frequently and studies show that S477N has potential to affect the RBD stability and strengthen the binding with the human ACE2 protein [42, 43]. In our study, S477N was frequently co-occurring with D614G (Fig 2D). This combination was estimated to spread more rapidly than the D614G mutant alone [44]. Other RBD variants such as N439K and N440K also show enhanced binding affinity to the human ACE2 receptor and result in immune escape from a panel of neutralizing monoclonal antibodies [45-47]. Antibody-resistant RBD variants might affect the therapeutic potential of neutralizing monoclonal antibodies by escaping through disruption of epitopes. However, a significant portion of the detected variants represent individual events based on what could be deduced from the available data. This indicates the necessity to further collect SARS-CoV-2 isolates and monitor newly occurring variants. Here, the combination of assembly data (which appeared to be available in a timelier manner) and NGS samples (which also contain information on the clonality of the observed variants but which might be deposited with some delay) provide a valuable resource. Further, we identified subclonal variants with a fraction of supporting reads between 0.1 and 0.95 at a sequencing depth of more than 30 reads in 2.59% of the NGS samples with mutant spike protein (Fig 3). Subclonal variants are indicative of within-host viral diversity leading to transmission of multiple strains [24]. Low frequency variants could have been part of parallel evolution, where the same mutation rises to detectable frequencies in different lineages and it is observed as part of SARS-CoV-2 virus adaptation [48]. Further, recurrent mutations might point to co-infection with multiple strains. Sample-specific variants in turn might rather indicate that the mutation occurred after infection within the host. This viral diversity within the host might prevent complete clearance after treatment and thus might lead to the development of resistant strains. Also, subclonal variants should be considered for vaccine design as these might represent the next generation of the virus. The analyzed data sets also showed that a notable portion of the individual and recurrent mutations in the spike protein (98.5%) overlap with at least one known T-cell epitope. The influence on CD8+ T cell epitope generation by different HLA alleles was investigated for the three common mutations L5F, D614G and G1124V [49]. These mutations were predicted to result in epitope gains, losses or higher or lower HLA binding affinities. Our analysis suggests additional epitope-mutation pairs, which might result in a loss of the epitope and a chance of immune escape. All these findings demonstrate that SARS-CoV-2 mutants need to be set in the context of immune recognition to evaluate their implications for the global spreading of the pandemic and future preventive or therapeutic approaches in a timely manner.

Conclusion and outlook

Human infections with SARS-CoV-2 are spreading globally since the beginning of 2020, necessitating preventive or therapeutic strategies and first steps towards an end to this pandemic were done with the approval of the first mRNA and vector based vaccines against SARS-CoV-2. Here, we show different types of variants (recurrent vs. individual, clonal vs. subclonal, hitting T-cell epitope vs. not-hitting) that can be incorporated in global efforts to sustainably prevent or treat infections. The underlying computational strategy might serve as a template for a platform to constantly analyze globally available sequencing data. In combination with a web-based platform to administer the results, this could help further guiding global vaccine design efforts to overcome the threats of this pandemic also in the future. In addition, the results might serve as a starting point for further study of viral in-vivo evolution via tracking of subclonal variants and their co-occurrence in individual samples. The importance of our approach is underlined by the emergence of SARS-CoV-2 lineages like the UK lineage B.1.1.7 [50], which is characterized by the accumulation of 17 variants; eight of those are located in the spike protein. This lineage has a higher transmissibility compared to other lineages [51]. The occurrence of this lineage questioned the efficacy of current vaccines, but first results showed that it at least unlikely will escape BNT162b-induced protection [52]. Interestingly, the individual variants can be traced back to samples from February (P681H, T716I, N501Y, A570D, S982A, and D1118H) and April (N501Y, A570D) of 2020. It needs to be mentioned that the available data, although representing a large cohort, might not reflect the real distribution of the circulating variants as mostly samples of specific interest will be sequenced. International sequencing efforts, combined data analysis and prediction of variant impact will be important tools for the future in order to ensure an early detection of such genomic variants of concern.

Number and origin of publicly available SARS-CoV-2 sequence data over time.

The histogram shows the number of SARS-CoV-2 assembly sequences deposited at GISAID and NGS data deposited at SRA. Color coding indicates the sample origin. Countries summarized as “other” include: Afghanistan, Albania, Algeria, Andorra, Angola, Antigua and Barbuda, Argentina, Armenia, Aruba, Australia, Austria, Azerbaijan, Bahamas, Bahrain, Bangladesh, Barbados, Belarus, Belgium, Belize, Benin, Bermuda, Bolivia, Plurinational State of, Bonaire, Sint Eustatius and Saba, Bosnia and Herzegovina, Botswana, Brunei Darussalam, Bulgaria, Burkina Faso, Cambodia, Cameroon, Canada, Cayman Islands, Chile, Colombia, Comoros, Congo, Costa Rica, Croatia, Cuba, Cyprus, Czechia, Dominican Republic, Ecuador, Egypt, El Salvador, Equatorial Guinea, Estonia, Eswatini, Ethiopia, Faroe Islands, Finland, French Guiana, French Polynesia, Gabon, Gambia, Georgia, Ghana, Gibraltar, Guam, Guatemala, Guinea, Hong Kong, Hungary, Iceland, Indonesia, Iran, Islamic Republic of, Iraq, Ireland, Israel, Italy, Jamaica, Japan, Jordan, Kazakhstan, Kenya, Kuwait, Latvia, Lebanon, Lesotho, Liechtenstein, Lithuania, Luxembourg, Madagascar, Malawi, Malaysia, Mali, Malta, Martinique, Mauritius, Mexico, Moldova, Republic of, Monaco, Mongolia, Montenegro, Morocco, Mozambique, Myanmar, Nepal, Netherlands, New Zealand, Nigeria, North Macedonia, Northern Mariana Islands, Norway, Oman, Pakistan, Palestine, State of, Panama, Papua New Guinea, Paraguay, Peru, Philippines, Portugal, Reunion, Romania, Russian Federation, Rwanda, Saint Barthelemy, Saint Kitts and Nevis, Saint Lucia, Saint Martin (French part), Saint Vincent and the Grenadines, Saudi Arabia, Senegal, Serbia, Sierra Leone, Singapore, Slovakia, Slovenia, South Africa, Sri Lanka, Suriname, Sweden, Switzerland, Taiwan, Province of China, Thailand, Togo, Trinidad and Tobago, Tunisia, Uganda, Ukraine, United Arab Emirates, unknown, Uruguay, Uzbekistan, Venezuela, Bolivarian Republic of, Vietnam, Virgin Islands, British, Zambia, Zimbabwe and unknown. (TIF) Click here for additional data file.

Overview of the 5,472 distinct non-synonymous mutations in the spike protein of SARS-CoV-2 detected in genome assemblies and NGS data sets.

(XLSX) Click here for additional data file. 11 May 2021 PONE-D-21-09862 Large-scale analysis of SARS-CoV-2 spike-glycoprotein mutants demonstrates the need for continuous screening of virus isolates PLOS ONE Dear Dr. Löwer, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Your manuscript was reviewed by 2 experts in the field. Both identified several important problems in your submission. Please review the attached comments and provide point-by-point responses. Please submit your revised manuscript by Jun 20 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript: A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols. We look forward to receiving your revised manuscript. Kind regards, Yury E Khudyakov, PhD Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf 2. PLOS requires an ORCID iD for the corresponding author in Editorial Manager on papers submitted after December 6th, 2016. Please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field. This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager. Please see the following video for instructions on linking an ORCID iD to your Editorial Manager account: https://www.youtube.com/watch?v=_xcclfuvtxQ 3. Thank you for stating the following in the Financial Disclosure section: [The author(s) received no specific funding for this work.]. We note that one or more of the authors are employed by a commercial company: TRON gGmbH and BioNTech SE Please provide an amended Funding Statement declaring this commercial affiliation, as well as a statement regarding the Role of Funders in your study. If the funding organization did not play a role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript and only provided financial support in the form of authors' salaries and/or research materials, please review your statements relating to the author contributions, and ensure you have specifically and accurately indicated the role(s) that these authors had in your study. You can update author roles in the Author Contributions section of the online submission form. Please also include the following statement within your amended Funding Statement. “The funder provided support in the form of salaries for authors [insert relevant initials], but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.” If your commercial affiliation did play a role in your study, please state and explain this role within your updated Funding Statement. 2. Please also provide an updated Competing Interests Statement declaring this commercial affiliation along with any other relevant declarations relating to employment, consultancy, patents, products in development, or marketed products, etc. Within your Competing Interests Statement, please confirm that this commercial affiliation does not alter your adherence to all PLOS ONE policies on sharing data and materials by including the following statement: "This does not alter our adherence to  PLOS ONE policies on sharing data and materials.” (as detailed online in our guide for authors http://journals.plos.org/plosone/s/competing-interests) . If this adherence statement is not accurate and  there are restrictions on sharing of data and/or materials, please state these. Please note that we cannot proceed with consideration of your article until this information has been declared. Please include both an updated Funding Statement and Competing Interests Statement in your cover letter. We will change the online submission form on your behalf. Please know it is PLOS ONE policy for corresponding authors to declare, on behalf of all authors, all potential competing interests for the purposes of transparency. PLOS defines a competing interest as anything that interferes with, or could reasonably be perceived as interfering with, the full and objective presentation, peer review, editorial decision-making, or publication of research or non-research articles submitted to one of the journals. Competing interests can be financial or non-financial, professional, or personal. Competing interests can arise in relationship to an organization or another person. Please follow this link to our website for more details on competing interests: http://journals.plos.org/plosone/s/competing-interests 4. In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found. PLOS defines a study's minimal data set as the underlying data used to reach the conclusions drawn in the manuscript and any additional data required to replicate the reported study findings in their entirety. All PLOS journals require that the minimal data set be made fully available. For more information about our data policy, please see http://journals.plos.org/plosone/s/data-availability. Upon re-submitting your revised manuscript, please upload your study’s minimal underlying data set as either Supporting Information files or to a stable, public repository and include the relevant URLs, DOIs, or accession numbers within your revised cover letter. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Any potentially identifying patient information must be fully anonymized. Important: If there are ethical or legal restrictions to sharing your data publicly, please explain these restrictions in detail. Please see our guidelines for more information on what we consider unacceptable restrictions to publicly sharing data: http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. Note that it is not acceptable for the authors to be the sole named individuals responsible for ensuring data access. We will update your Data Availability statement to reflect the information you provide in your cover letter. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Partly Reviewer #2: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: N/A Reviewer #2: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: Schrörs and colleagues's manuscript titled “Large-scale analysis of SARS-CoV-2 spike-glycoprotein mutants demonstrates the need for continuous screening of virus isolates” analyzed 146,917 SARS-CoV-2 genome assemblies and 2,393 NGS datasets from GISAID, NCBI sing bioinformatics tools. The data were outdated and the take-home messages are unclear. 1、 Bioinformatics are very useful tools to predict important variants. While the NISAID analysis are now very thorough, the re-analyze should provide some new clues for biologists to reference, but not just the re-listing of data. 2、 The data analyzed here are mostly up to Sep 2020, while the sequences are increased several times since then. The update of the data is recommended. 3、 The idea to analyze the individual subclone is good, however, the conclusion is deficient. How about the frequency of subclonal variants happened in other viruses? The comparison of SARS-CoV-2 to other virused may provide more use information. 4、 The effect of spike mutations on antibody and T cell recognition is definitely the key point, however, the data here is rough and the mutations mentioned in Fig4A are not including the most important mutations that already been proved. Therefore, the reliability of the method used here need to be verified. Reviewer #2: Thank you for your hard work from sequence analysis and structure for mutation analysis to SASA analysis and T cell epitope analysis. 1. There is a place where English is awkward. The'of' between lines 66 and 67 is used twice. Even on line 67, the sentence that is awkward in the English expression needs to be corrected. 2. In line 97, what the following reference would needed. Please review it and add it if appropriate. Wu, F., Zhao, S., Yu, B. et al. A new coronavirus associated with human respiratory disease in China. Nature 579, 265--269 (2020). https://doi.org/10.1038/s41586-020-2008-3 3. Check if it is correct to capitalize PDB '6VXX' on line 112, and this form is'closed', which is difficult to see as a single form that is good for analyzing the coupling between RDB and Spike. 'Open'(up) type PDB was also analyzed, so please add it. 4. As one of the most common mutant forms, D614G written in result (around line 134) , which seems particularly important for open conformation analysis. The SARS-CoV-2 Spike variant D614G favors an open conformational state BY RACHAEL A. MANSBACH, SRIRUPA CHAKRABORTY, KIEN NGUYEN, DAVID C. MONTEFIORI, BETTE KORBER, S. GNANAKARAN SCIENCE ADVANCES16 APR 2021: EABF3671 Please add a separate process for this or an explanation of the current data processing method. 5. In the analysis of the results, in spite of the many mutations, it is described that there are few mutations that cause significant conformation changes, and in the conclusion that nonetheless, it is argued that continuous screening is necessary. Claims that are not important may appear alternately and confusing, so if you can please write the part a little more refined form. 6. Please enhance the fig. esp. Fig1. and fig2. And please transform the legend in plot more intuitive. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. 28 Jul 2021 Response to Reviewers Reviewer #1: Schrörs and colleagues's manuscript titled “Large-scale analysis of SARS-CoV-2 spike-glycoprotein mutants demonstrates the need for continuous screening of virus isolates” analyzed 146,917 SARS-CoV-2 genome assemblies and 2,393 NGS datasets from GISAID, NCBI sing bioinformatics tools. The data were outdated and the take-home messages are unclear. 1、 Bioinformatics are very useful tools to predict important variants. While the NISAID analysis are now very thorough, the re-analyze should provide some new clues for biologists to reference, but not just the re-listing of data. Answer: We 100% percent agree that relisting existing data does not provide any benefit to researchers. The novelty of our analysis lies in the comprehensive detection of co-occurring variants, sub-clonal mutations and the impact of the mutations on T-cell immunogenicity of epitopes. All findings can be a valuable starting point for further experiments. 2、 The data analyzed here are mostly up to Sep 2020, while the sequences are increased several times since then. The update of the data is recommended. Answer: Good point – we massively increased the scope of the analysis and now include over 10^6 samples and the respective datasets up to a collection date of April 2021. 3、 The idea to analyze the individual subclone is good, however, the conclusion is deficient. How about the frequency of subclonal variants happened in other viruses? The comparison of SARS-CoV-2 to other virused may provide more use information. Answer: We agree that such a comparison might be useful. However, the detected subclones a relatively rare and the detection itself is only reliable because of the massive amount of available NGS data for SARS-CoV-2, which is not true for any other virus. Rather we recommend further experimental study of subclonal events and added this point to the conclusion of the manuscript. 4、 The effect of spike mutations on antibody and T cell recognition is definitely the key point, however, the data here is rough and the mutations mentioned in Fig4A are not including the most important mutations that already been proved. Therefore, the reliability of the method used here need to be verified. Thanks for pointing this out. After careful consideration, we concluded that the structure-based approach of determining potential effects on antibody binding was not well thought out and indeed lacks validation. We replaced this analysis by a more detailed and advanced analysis of the impact of the variants on HLA-epitiope binding, which is a commonly used method in computational cancer immunology. We cite proper references in order to demonstrate the validity. Reviewer #2: Thank you for your hard work from sequence analysis and structure for mutation analysis to SASA analysis and T cell epitope analysis. 1. There is a place where English is awkward. The'of' between lines 66 and 67 is used twice. Even on line 67, the sentence that is awkward in the English expression needs to be corrected. Answer: Thanks – we corrected this error. 2. In line 97, what the following reference would needed. Please review it and add it if appropriate. Wu, F., Zhao, S., Yu, B. et al. A new coronavirus associated with human respiratory disease in China. Nature 579, 265--269 (2020). https://doi.org/10.1038/s41586-020-2008-3 Answer: Thanks, we added this reference. 3. Check if it is correct to capitalize PDB '6VXX' on line 112, and this form is'closed', which is difficult to see as a single form that is good for analyzing the coupling between RDB and Spike. 'Open'(up) type PDB was also analyzed, so please add it. 4. As one of the most common mutant forms, D614G written in result (around line 134) , which seems particularly important for open conformation analysis. The SARS-CoV-2 Spike variant D614G favors an open conformational state BY RACHAEL A. MANSBACH, SRIRUPA CHAKRABORTY, KIEN NGUYEN, DAVID C. MONTEFIORI, BETTE KORBER, S. GNANAKARAN SCIENCE ADVANCES16 APR 2021: EABF3671 Please add a separate process for this or an explanation of the current data processing method. 5. In the analysis of the results, in spite of the many mutations, it is described that there are few mutations that cause significant conformation changes, and in the conclusion that nonetheless, it is argued that continuous screening is necessary. Claims that are not important may appear alternately and confusing, so if you can please write the part a little more refined form. Answer to comments 3, 4 and 5: After careful consideration, we concluded that the structure-based approach of determining potential effects on antibody binding was not well thought out and indeed lacks validation. We replaced this analysis by a more detailed and advanced analysis of the impact of the variants on HLA-epitiope binding, which is a commonly used method in computational cancer immunology. We cite proper references in order to demonstrate the validity. The results of this analysis demonstrate, that the mutant form of an epitope has the potential to disrupt epitope recognition and that the continued data analysis of NGS data in this regard can detect such events early. 6. Please enhance the fig. esp. Fig1. and fig2. And please transform the legend in plot more intuitive. Answer: We enlarged subplots of Fig 1 and Fig 2, as well as improved the description of the Figures in the legends. Submitted filename: 20210716_Response_to_Reviewers.docx Click here for additional data file. 2 Aug 2021 Large-scale analysis of SARS-CoV-2 spike-glycoprotein mutants demonstrates the need for continuous screening of virus isolates PONE-D-21-09862R1 Dear Dr. Löwer, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Yury E Khudyakov, PhD Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: 17 Sep 2021 PONE-D-21-09862R1 Large-scale analysis of SARS-CoV-2 spike-glycoprotein mutants demonstrates the need for continuous screening of virus isolates Dear Dr. Löwer: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Yury E Khudyakov Academic Editor PLOS ONE
  36 in total

1.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

2.  Discovery of an RNA virus 3'->5' exoribonuclease that is critically involved in coronavirus RNA synthesis.

Authors:  Ekaterina Minskaia; Tobias Hertzig; Alexander E Gorbalenya; Valérie Campanacci; Christian Cambillau; Bruno Canard; John Ziebuhr
Journal:  Proc Natl Acad Sci U S A       Date:  2006-03-20       Impact factor: 11.205

3.  SARS-CoV-2 immune evasion by the B.1.427/B.1.429 variant of concern.

Authors:  Matthew McCallum; Jessica Bassi; Anna De Marco; Alex Chen; Alexandra C Walls; Julia Di Iulio; M Alejandra Tortorici; Mary-Jane Navarro; Chiara Silacci-Fregni; Christian Saliba; Kaitlin R Sprouse; Maria Agostini; Dora Pinto; Katja Culap; Siro Bianchi; Stefano Jaconi; Elisabetta Cameroni; John E Bowen; Sasha W Tilles; Matteo Samuele Pizzuto; Sonja Bernasconi Guastalla; Giovanni Bona; Alessandra Franzetti Pellanda; Christian Garzoni; Wesley C Van Voorhis; Laura E Rosen; Gyorgy Snell; Amalio Telenti; Herbert W Virgin; Luca Piccoli; Davide Corti; David Veesler
Journal:  Science       Date:  2021-07-01       Impact factor: 47.728

4.  NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data.

Authors:  Birkir Reynisson; Bruno Alvarez; Sinu Paul; Bjoern Peters; Morten Nielsen
Journal:  Nucleic Acids Res       Date:  2020-07-02       Impact factor: 16.971

5.  A pneumonia outbreak associated with a new coronavirus of probable bat origin.

Authors:  Peng Zhou; Xing-Lou Yang; Xian-Guang Wang; Ben Hu; Lei Zhang; Wei Zhang; Hao-Rui Si; Yan Zhu; Bei Li; Chao-Lin Huang; Hui-Dong Chen; Jing Chen; Yun Luo; Hua Guo; Ren-Di Jiang; Mei-Qin Liu; Ying Chen; Xu-Rui Shen; Xi Wang; Xiao-Shuang Zheng; Kai Zhao; Quan-Jiao Chen; Fei Deng; Lin-Lin Liu; Bing Yan; Fa-Xian Zhan; Yan-Yi Wang; Geng-Fu Xiao; Zheng-Li Shi
Journal:  Nature       Date:  2020-02-03       Impact factor: 69.504

6.  Deep Mutational Scanning of SARS-CoV-2 Receptor Binding Domain Reveals Constraints on Folding and ACE2 Binding.

Authors:  Tyler N Starr; Allison J Greaney; Sarah K Hilton; Daniel Ellis; Katharine H D Crawford; Adam S Dingens; Mary Jane Navarro; John E Bowen; M Alejandra Tortorici; Alexandra C Walls; Neil P King; David Veesler; Jesse D Bloom
Journal:  Cell       Date:  2020-08-11       Impact factor: 41.582

7.  Neutralization of SARS-CoV-2 lineage B.1.1.7 pseudovirus by BNT162b2 vaccine-elicited human sera.

Authors:  Alexander Muik; Ann-Kathrin Wallisch; Bianca Sänger; Kena A Swanson; Julia Mühl; Wei Chen; Hui Cai; Daniel Maurus; Ritu Sarkar; Özlem Türeci; Philip R Dormitzer; Uğur Şahin
Journal:  Science       Date:  2021-01-29       Impact factor: 47.728

8.  SARS-CoV-2-derived peptides define heterologous and COVID-19-induced T cell recognition.

Authors:  Annika Nelde; Tatjana Bilich; Jonas S Heitmann; Yacine Maringer; Helmut R Salih; Malte Roerden; Maren Lübke; Jens Bauer; Jonas Rieth; Marcel Wacker; Andreas Peter; Sebastian Hörber; Bjoern Traenkle; Philipp D Kaiser; Ulrich Rothbauer; Matthias Becker; Daniel Junker; Gérard Krause; Monika Strengert; Nicole Schneiderhan-Marra; Markus F Templin; Thomas O Joos; Daniel J Kowalewski; Vlatka Stos-Zweifel; Michael Fehr; Armin Rabsteyn; Valbona Mirakaj; Julia Karbach; Elke Jäger; Michael Graf; Lena-Christin Gruber; David Rachfalski; Beate Preuß; Ilona Hagelstein; Melanie Märklin; Tamam Bakchoul; Cécile Gouttefangeas; Oliver Kohlbacher; Reinhild Klein; Stefan Stevanović; Hans-Georg Rammensee; Juliane S Walz
Journal:  Nat Immunol       Date:  2020-09-30       Impact factor: 25.606

9.  A Novel Coronavirus from Patients with Pneumonia in China, 2019.

Authors:  Na Zhu; Dingyu Zhang; Wenling Wang; Xingwang Li; Bo Yang; Jingdong Song; Xiang Zhao; Baoying Huang; Weifeng Shi; Roujian Lu; Peihua Niu; Faxian Zhan; Xuejun Ma; Dayan Wang; Wenbo Xu; Guizhen Wu; George F Gao; Wenjie Tan
Journal:  N Engl J Med       Date:  2020-01-24       Impact factor: 91.245

10.  Serine 477 plays a crucial role in the interaction of the SARS-CoV-2 spike protein with the human receptor ACE2.

Authors:  Amit Singh; Georg Steinkellner; Katharina Köchl; Karl Gruber; Christian C Gruber
Journal:  Sci Rep       Date:  2021-02-22       Impact factor: 4.379

View more
  11 in total

1.  NGS data vectorization, clustering, and finding key codons in SARS-CoV-2 variations.

Authors:  Juhyeon Kim; Saeyeon Cheon; Insung Ahn
Journal:  BMC Bioinformatics       Date:  2022-05-17       Impact factor: 3.307

2.  Molecular insights into the differential dynamics of SARS-CoV-2 variants of concern.

Authors:  Nabanita Mandal; Aditya K Padhi; Soumya Lipsa Rath
Journal:  J Mol Graph Model       Date:  2022-04-14       Impact factor: 2.942

Review 3.  SARS-CoV-2: An Overview of the Genetic Profile and Vaccine Effectiveness of the Five Variants of Concern.

Authors:  Raluca Dumache; Alexandra Enache; Ioana Macasoi; Cristina Adriana Dehelean; Victor Dumitrascu; Alexandra Mihailescu; Roxana Popescu; Daliborca Vlad; Cristian Sebastian Vlad; Camelia Muresan
Journal:  Pathogens       Date:  2022-04-26

4.  Metagenomic pipeline for identifying co-infections among distinct SARS-CoV-2 variants of concern: study cases from Alpha to Omicron.

Authors:  Jose Arturo Molina-Mora; Estela Cordero-Laurent; Melany Calderón-Osorno; Edgar Chacón-Ramírez; Francisco Duarte-Martínez
Journal:  Sci Rep       Date:  2022-06-07       Impact factor: 4.996

Review 5.  Neutralising antibody escape of SARS-CoV-2 spike protein: Risk assessment for antibody-based Covid-19 therapeutics and vaccines.

Authors:  Daniele Focosi; Fabrizio Maggi
Journal:  Rev Med Virol       Date:  2021-03-16       Impact factor: 11.043

6.  Mutational landscape and in silico structure models of SARS-CoV-2 spike receptor binding domain reveal key molecular determinants for virus-host interaction.

Authors:  Shijulal Nelson-Sathi; P K Umasankar; E Sreekumar; R Radhakrishnan Nair; Iype Joseph; Sai Ravi Chandra Nori; Jamiema Sara Philip; Roshny Prasad; K V Navyasree; Shikha Ramesh; Heera Pillai; Sanu Ghosh; T R Santosh Kumar; M Radhakrishna Pillai
Journal:  BMC Mol Cell Biol       Date:  2022-01-07

7.  SARS-CoV-2 Amino Acid Mutations Detection in Greek Patients Infected in the First Wave of the Pandemic.

Authors:  Niki Vassilaki; Konstantinos Papadimitriou; Anastasios Ioannidis; Nikos C Papandreou; Raphaela S Milona; Vassiliki A Iconomidou; Stylianos Chatzipanagiotou
Journal:  Microorganisms       Date:  2022-07-15

8.  Considering epitopes conservity in targeting SARS-CoV-2 mutations in variants: a novel immunoinformatics approach to vaccine design.

Authors:  Mohammad Aref Bagherzadeh; Mohammad Izadi; Kazem Baesi; Mirza Ali Mofazzal Jahromi; Majid Pirestani
Journal:  Sci Rep       Date:  2022-08-18       Impact factor: 4.996

9.  SARS-CoV-2 genomes from Saudi Arabia implicate nucleocapsid mutations in host response and increased viral load.

Authors:  Tobias Mourier; Muhammad Shuaib; Sharif Hala; Sara Mfarrej; Fadwa Alofi; Raeece Naeem; Afrah Alsomali; David Jorgensen; Amit Kumar Subudhi; Fathia Ben Rached; Qingtian Guan; Rahul P Salunke; Amanda Ooi; Luke Esau; Olga Douvropoulou; Raushan Nugmanova; Sadhasivam Perumal; Huoming Zhang; Issaac Rajan; Awad Al-Omari; Samer Salih; Abbas Shamsan; Abbas Al Mutair; Jumana Taha; Abdulaziz Alahmadi; Nashwa Khotani; Abdelrahman Alhamss; Ahmed Mahmoud; Khaled Alquthami; Abdullah Dageeg; Asim Khogeer; Anwar M Hashem; Paula Moraga; Eric Volz; Naif Almontashiri; Arnab Pain
Journal:  Nat Commun       Date:  2022-02-01       Impact factor: 14.919

Review 10.  Facing the wrath of enigmatic mutations: a review on the emergence of severe acute respiratory syndrome coronavirus 2 variants amid coronavirus disease-19 pandemic.

Authors:  Jatin Chadha; Lavanya Khullar; Nidhi Mittal
Journal:  Environ Microbiol       Date:  2021-08-08       Impact factor: 5.476

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.