| Literature DB >> 35096288 |
Sara Donzelli1, Francesca Spinella2, Enea Gino di Domenico3, Martina Pontone3, Ilaria Cavallo3, Giulia Orlandi4, Stefania Iannazzo5, Giulio Maria Ricciuto6, Raul Pellini7, Paola Muti8, Sabrina Strano9, Gennaro Ciliberto10, Fabrizio Ensoli3, Stefano Zapperi11,12, Caterina A M La Porta13,14, Giovanni Blandino1, Aldo Morrone4, Fulvia Pimpinelli3.
Abstract
OBJECTIVES: Despite extensive efforts to monitor the diffusion of COVID-19, the actual wave of infection is worldwide characterized by the presence of emerging SARS-CoV-2 variants. The present study aims to describe the presence of yet undiscovered SARS-CoV-2 variants in Italy.Entities:
Keywords: D614G; Immune response; S939F; SARS-CoV-2; Spike mutations
Year: 2022 PMID: 35096288 PMCID: PMC8780065 DOI: 10.1016/j.csbj.2022.01.021
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1Genetic Variability and phylogenetic analysis of Whole-Genome Consensus Sequences. A. For the mutations analysis, sequences of viral genomes and the reference sequence (GenBank ID NC_045512.2) were aligned with Clustal Omega [31], [32] and analyzed with MEGA X [33]. Nucleotide positions are referred to Wuhan-Hu-1(reference genome MN908947). B. For phylogenetic analysis, we inferred the maximum-likelihood tree using the edge-linked partition model in IQ-TREE () [34], [35].
Fig. 2A. S939F and D614G mutations genomic locations and characteristics by 2019nCoVR. The evidence level was graded into I-III according to the number of mutations in high-quality sequences and the density distribution of mutations (population frequency of class I is greater than 0.05, which indicates it is more credible; class II variant sites fall in high-density areas; population frequency of class III is less than 0.05, indicating its low reliability). The Variance Time calculates the population frequency of each mutation site over time, evaluates the variance dispersion of the site by calculating the variance of population frequency at each time point. The Variance Area, calculates the population frequency of each mutation site, evaluates the variance dispersion of the site by calculating the variance of population frequency in each region. The Ensembl Variation - Calculated variant consequences is a prediction of the effects that each allele of the variant may have on each transcript. B-C. Time (upper panel) and Area (lower panel) frequencies of S939F (B) and D614G (C) mutations by 2019nCoVR are indicated. Isolates number is indicated in blue, variation frequency is indicated in black. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 3A-B. S939F (A) and D614G (B) mutations occurrence in the five continents of the world from 1 March 2020 to 31 January 2021 by COVID CG.
Fig. 4A-C. S939F mutation distribution (indicated in yellow) in Europe at the indicated time intervals by GSAID. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 5Mutations affect the number of likely T-cell epitopes in a HLA-dependent manner. The figure shows the number of highly ranked peptides from the reference and the mutated (D614G and S939F) SARS-CoV-2 spike protein for a set of HLA alleles, estimated with NetTepi as discussed in the Methods section. For some HLA alleles, the number of highly ranked peptides, the potential T-cell epitopes, differs for the reference and the mutated virus.
Fig. 6Difference in T-cell propensity and T-cell epitope combined score between reference and mutated peptides. A. After proteasome cleavage simulation, we obtain 1513 peptides that are common between the reference and the mutated virus. A small number of peptides are only present either in the reference virus or in the mutated virus. B. The distribution of T-cell propensities estimated by NetTepi is not affected by the D614G mutation (p = 0.99 according to the Kolmogorov-Smirnov test) while a significant change is observed for the S939F mutation (p = 0.01 according to the Kolmogorov-Smirnov test). The boxplot reports median and quartiles of the data. C. The mutations affect the T-cell epitope combined score of the peptides estimated by NetTepi in a HLA-dependent manner.
Fig. 7Effect of mutations on binding affinities for a broad range of HLA alleles. We report the binding affinities for the peptides only present in the reference and in the mutated spike protein obtained using MHCflurry 2.0. Individual peptides binding affinities are reported as dots. The boxplot reports median and quartiles of the same data. A. D614G mutation. B. S939F mutation.