| Literature DB >> 35429618 |
Stefano Gaiarsa1, Federica Giardina1, Gherard Batisti Biffignandi2, Guglielmo Ferrari1, Aurora Piazza2, Monica Tallarita1, Federica Novazzi1, Claudio Bandi3, Stefania Paolucci1, Francesca Rovida4, Giulia Campanini1, Antonio Piralla5, Fausto Baldanti4.
Abstract
Studies are needed to better understand the genomic evolution of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). This study aimed to describe viral quasispecies population of upper and lower respiratory tract by next-generation sequencing in patients admitted to intensive care unit. A deep sequencing of the S gene of SARS-CoV-2 from 109 clinical specimens, sampled from the upper respiratory tract (URT) and lower respiratory tract (LRT) of 77 patients was performed. A higher incidence of non-synonymous mutations and indels was observed in the LRT among minority variants. This might be explained by the ability of the virus to invade cells without interacting with ACE2 (e.g. exploiting macrophage phagocytosis). Minority variants are highly concentrated around the gene portion encoding for the Spike cleavage site, with a higher incidence in the URT; four mutations are highly recurring among samples and were found associated with the URT. Interestingly, 55.8% of minority variants detected in this locus were T>G and G>T transversions. Results from this study evidenced the presence of selective pressure and suggest that an evolutionary process is still ongoing in one of the crucial sites of spike protein associated with the spillover to humans.Entities:
Keywords: Deep sequencing; Deleterious mutations; Minority variants; SARS-CoV-2; Spike protein; Spillover
Mesh:
Substances:
Year: 2022 PMID: 35429618 PMCID: PMC9008095 DOI: 10.1016/j.virusres.2022.198786
Source DB: PubMed Journal: Virus Res ISSN: 0168-1702 Impact factor: 6.286
Fig. 1(A) Distribution of Cq in the URT and LRT samples. (B, C) Differences and correlation plots for Cq values in 28 paired URT and LRT samples. The statistic Spearman's correlation coefficient and linear regression R2 value are also reported.
Fig. 2Comparison of number of haplotypes and minority variants in upper vs lower respiratory tract samples. (A) Correlation between the number of minority variants and viral load expressed in cycle of quantification (Cq). (B) Correlation between the number of haplotypes and viral load expressed in Cq.
Fig. 3Distribution of the number of (A) minority variants, (B) haplotypes and (C) dN/dS ratio in the URT and LRT samples. Values are represented as a boxplot with all points inscribed. (D) Distribution of the weighted incidence of synonymous, non-synonymous, and insertions and deletions (Indel) in the URT and the LRT samples. (E) Distribution of the weighted incidence of frameshifting insertions and deletions in the URT and the LRT. Values are weighted by dividing them by the total number of minority variants in the sample.
Fig. 4(A) Distribution of the weighted incidence of minority variants in the S gene subdomains (NTD: N-Terminal Domain; RBD: Receptor-Binding Domain; SD1: Structural Domain 1; SD2: Structural Domain 2; FP: Fusion Peptide; HR1: Heptad Repeat 1; HR2: Heptad Repeat 2; TM: TransMembrane domain) in the URT and LRT samples. (B) Distribution of the weighted incidence of minority variants in the two S gene subunits in the URT and the LRT (S1: Subunit 1; S2: Subunit 2). (C) Distribution in the URT and in the LRT of the weighted incidence of minority variants, classified by mutation patterns. Values are weighted by dividing them by the total number of minority variants in the sample.
Fig. 5Graphical distribution of changes along S protein gene. (A) Number of samples containing minority variants in each position. Two separate histograms are used for URT and LRT samples, which are indicated upside down for image clarity. (B) Correlation of the presence of minority variants with URT and LRT in each position of the gene. Bar height represents the log10 (p-value) of the Fisher exact test. In the middle, a scheme of the gene subdomains and subunits is used as separator. NTD: N-Terminal Domain; RBD: Receptor-Binding Domain; SD1: Structural Domain 1; SD2: Structural Domain 2; FP: Fusion Peptide; HR1: Heptad Repeat 1; HR2: Heptad Repeat 2; TM: TransMembrane domain; S1: Subunit 1; S2: Subunit 2.
Description of the 9 Minority Variants (MVs) positions associated with either URT or LRT compared to the reference sequence (NC_045512.2). Global frequency is referred to the frequency of the mutation in the same amino acid in GISAID global database. Data are accessible at www.cov.lanl.gov .
| Codon position | Amino acid | Reference | Mutation | Mutation type | Associated with |
|---|---|---|---|---|---|
| 212 | 71 | C(S) | A(Y)/T(F) | NotSyn | LRT |
| 2055 | 685 | T(R) | G(R) | Syn | URT |
| 2058 | 686 | T(S) | G(R) | NotSyn | URT |
| 2060 | 687 | T(V) | G(G) | NotSyn | URT |
| 2100 | 700 | T(G) | G(G) | Syn | URT |
| 3005 | 1002 | A(Q) | T(L) / - | NotSyn/Del | LRT |
| 3483 | 1161 | A(S) | C(S) | Syn | URT |
| 3485 | 1162 | C(P) | T(L)/G(R) | NotSyn | LRT |
| 3596 | 1199 | A (D) | G(G)/T(G) | NotSyn | LRT |