| Literature DB >> 34030593 |
Rongsui Gao1, Wenhong Zu1, Yang Liu1, Junhua Li2, Zeyao Li3, Yanling Wen1, Haiyan Wang1, Jing Yuan4, Lin Cheng1, Shengyuan Zhang1, Yu Zhang5, Shuye Zhang6, Weilong Liu1, Xun Lan7, Lei Liu1, Feng Li8, Zheng Zhang1,9,10.
Abstract
New SARS-CoV-2 mutants have been continuously indentified with enhanced transmission ever since its outbreak in early 2020. As an RNA virus, SARS-CoV-2 has a high mutation rate due to the low fidelity of RNA polymerase. To study the single nucleotide polymorphisms (SNPs) dynamics of SARS-CoV-2, 158 SNPs with high confidence were identified by deep meta-transcriptomic sequencing, and the most common SNP type was C > T. Analyses of intra-host population diversity revealed that intra-host quasispecies' composition varies with time during the early onset of symptoms, which implicates viral evolution during infection. Network analysis of co-occurring SNPs revealed the most abundant non-synonymous SNP 22,638 in the S glycoprotein RBD region and 28,144 in the ORF8 region. Furthermore, SARS-CoV-2 variations differ in an individual's respiratory tissue (nose, throat, BALF, or sputum), suggesting independent compartmentalization of SARS-CoV-2 populations in patients. The positive selection analysis of the SARS-CoV-2 genome uncovered the positive selected amino acid G251V on ORF3a. Alternative allele frequency spectrum (AAFS) of all variants revealed that ORF8 could bear alternate alleles with high frequency. Overall, the results show the quasispecies' profile of SARS-CoV-2 in the respiratory tract in the first two months after the outbreak.Entities:
Keywords: SARS-CoV-2; Single nucleotide polymorphism (SNP); epitopes; host adaptation; intra-host single nucleotide variation (iSNV); quasispecies; selective pressure
Mesh:
Substances:
Year: 2021 PMID: 34030593 PMCID: PMC8158041 DOI: 10.1080/21505594.2021.1911477
Source DB: PubMed Journal: Virulence ISSN: 2150-5594 Impact factor: 5.882
Figure 1.Distribution of SNPs among COVID-19 patients
Figure 2.Comparison of SNPs’ alternate frequencies between mild and severe patients
Figure 3.Genomic distribution of SARS-CoV-2 SNPs
Positive selection analysis in the genes of SARS-CoV-2 based on the dataset in this study
| Gene | P value | Positive selected sites with P ≥ 0.95 | dN/dS of genes calculated by DnaSP_P |
|---|---|---|---|
| M | 0.999997 | 0.23845 | |
| N | 0.999999 | 0.378859 | |
| S | 0.2919166 | 0.438138 | |
| ORF3a | 0.1592752 | 45 W 49 G 50 V 80 V 89 T 251 G | NA |
| ORF8 | 0.2153693 | 0.362924 | |
| ORF10 | 0.999987 | 0.101222 |
Figure 4.The direct intracellular interaction between ORF8 and MHC I molecular, and ORF8 degrades MHC I in a dose-dependent manner
Figure 5.Dynamic of quasispecies’ composition identified in three COVID-19 patients
Figure 6.Co-occurring SNPs were found in S glycoprotein among COVID-19 patients
Figure 7.Functional and structural insights into S359N variant of SARS-CoV-2 S glycoprotein
ORF8 L84S variants and differential peptide binding to the HLA-A allele
| Protein | Peptide | Rank wt | Rank mut | Allele |
|---|---|---|---|---|
| HA | LLLAIVSLV | 0.3 | HLA-A*02:01 | |
| HA | GILGFVFTL | 0.8 | HLA-A*02:01 | |
| HA | KLYQNPTTYI | 0.47 | HLA-A*02:01 | |
| HA | VLLLAIVSL | 0.8 | HLA-A*02:01 | |
| HA | RLYQNPTTYI | 0.9 | HLA-A*02:01 | |
| ORF8 | NYTVSC | 0.22 | 0.3 | HLA-A*23:01 |
| ORF8 | NYTVSC | 0.33 | 0.7 | HLA-A*24:02 |
| ORF8 | YTVSC | 0.52 | 1.29 | HLA-A*02:06 |
| ORF8 | TVSC | 0.9 | 1.1 | HLA-A*68:02 |
| ORF8 | IGNYTVSC | 1.0 | 1.5 | HLA-A*23:01 |
A lower rank value designates better MHC class I; we were using the consensus percentile rank <1.0 as the cutoff value for binding. HLA-A*02:06, A*68:02, and A*23:01 accommodate the wild-type ORF8 peptides, yet not the variant L84S. The HLA-A*02:01 restricted H5N1 Flu A hemagglutinin epitopes show strong binding to HLA-A*02:01, which is predicted by the same method.