| Literature DB >> 35128175 |
Abozar Ghorbani1, Samira Samarfard2, Maziar Jajarmi3, Mahboube Bagheri4, Thomas P Karbanowicz5, Alireza Afsharifar1, Mohammad Hadi Eskandari6, Ali Niazi7, Keramatollah Izadpanah1.
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the causal agent of the coronavirus disease (COVID-19) pandemic, has infected millions of people globally. Genetic variation and selective pressures lead to the accumulation of single nucleotide polymorphism (SNP) within the viral genome that may affect virulence, transmission rate, viral recognition and the efficacy of prophylactic and interventional measures. To address these concerns at the genomic level, we assessed the phylogeny and SNPs of the SARS-CoV-2 mutant population collected to date in Iran in relation to globally reported variants. Phylogenetic analysis of mutant strains revealed the occurrence of the variants known as B.1.1.7 (Alpha), B.1.525 (Eta), and B.1.617 (Delta) that appear to have delineated independently in Iran. SNP analysis of the Iranian sequences revealed that the mutations were predominantly positioned within the S protein-coding region, with most SNPs localizing to the S1 subunit. Seventeen S1-localizing SNPs occurred in the RNA binding domain that interacts with ACE2 of the host cell. Importantly, many of these SNPs are predicted to influence the binding of antibodies and anti-viral therapeutics, indicating that the adaptive host response appears to be imposing a selective pressure that is driving the evolution of the virus in this closed population through enhancing virulence. The SNPs detected within these mutant cohorts are addressed with respect to current prophylactic measures and therapeutic interventions. CrownEntities:
Keywords: ACE2, Angiotensin-converting enzyme 2; Antiviral drugs; Bioinformatics; CSSE, Center for Systems Science and Engineering; E, Envelope; FP, Fusion peptide; HR1, Heptad repeat 1; HR2, Heptad repeat 2; IC, Intracellular domain; JHU, Johns Hopkins University; M, Membrane; Mutation detection; N, Nucleocapsid; NAG, N-acetylglucosamine; NSP, Non-structural proteins; NTD, N-terminal domain; Phylogenetic analysis; RBD, Receptor-binding domain; S, Spike glycoprotein; SARS-CoV-2; SARS-CoV-2, Severe acute respiratory syndrome coronavirus 2;; SD1, Subdomain 1; SD2, Subdomain 2; SNP, Single nucleotide polymorphism; SP, Structural proteins; TM, Transmembrane region; UTRs, Untranslated regions; Viral vaccines
Year: 2022 PMID: 35128175 PMCID: PMC8808475 DOI: 10.1016/j.genrep.2022.101537
Source DB: PubMed Journal: Gene Rep ISSN: 2452-0144
Important single-nucleotide polymorphisms (SNPs) in spike sequences of Iranian human SARS-COV-2 isolates and their effect on drug binding site.
| Spike domain | Reference | Allele | Count | Frequency | Amino acid change | Other variants within codon | Effect on drug binding site | Modified positions are the same as the variants | The mutation that influences antibody responses |
|---|---|---|---|---|---|---|---|---|---|
| NTD | C | G | 6 | 3.174603 | Thr19Arg | No | 1 drug hit | B.1.617 | + |
| NTD | G | A | 2 | 1.058201 | Arg21Lys | No | No drug hits | B.1.617 | + |
| NTD | TACATG | − | 11 | 5.820106 | His69_Val70del | Yes | No drug hits | B.1.1.7, B.1.525 | + |
| NTD | G | T | 4 | 2.116402 | Asp80Tyr | No | No drug hits | B.1.351 | + |
| NTD | TTA | − | 20 | 10.58201 | Tyr145del | No | 1 drug hit | B.1.1.7, B.1.525 | + |
| NTD | G | C | 2 | 1.058201 | Glu156Asp | Yes | No drug hits | B.1.617 | + |
| NTD | AG | CT | 2 | 1.058201 | Arg158Leu | Yes | No drug hits | B.1.617 | + |
| NTD | C | T | 4 | 2.116402 | Ala222Val | No | No drug hits | B.1.617 | + |
| RBD | G | C | 2 | 1.058201 | Val407Leu | No | 1 drug hit | − | + |
| RBD | G | A | 2 | 1.058201 | Ala411Thr | No | No drug hits | − | + |
| RBD | A | C | 3 | 1.587302 | Lys417Thr | Yes | 1 drug hit | B.1.351, B.1.1.28 | + |
| RBD (RBM) | C | G | 3 | 1.587302 | Asn439Lys | No | 1 drug hit | − | + |
| RBD(RBM) | T | A | 2 | 1.058201 | Asn448Lys | No | No drug hits | − | + |
| RBD(RBM) | T | G | 6 | 3.174603 | Leu452Arg | No | 1 drug hit | B.1.617, B.1.429 + B.1.427 | + |
| RBD(RBM) | G | A | 2 | 1.058201 | Asp467Asn | No | 1 drug hit | − | + |
| RBD(RBM) | G | A | 15 | 7.936508 | Ser477Asn | No | No drug hits | B.1.526 | + |
| RBD(RBM) | C | A | 6 | 3.174603 | Thr478Lys | No | No drug hits | B.1.617 | + |
| RBD(RBM) | G | A | 3 | 1.587302 | Glu484Lys | No | No drug hits | B.1.351, B.1.1.28, B.1.525 | + |
| RBD(RBM) | A | T | 23 | 12.16931 | Asn501Tyr | Yes | 1 drug hit | B.1.1.7, B.1.351, P.1 | + |
| RBD(RBM) | A | C | 2 | 1.058201 | Asn501Thr | Yes | 1 drug hit | B.1.351, B.1.1.28 | + |
| RBD | G | A | 2 | 1.058201 | Arg509Lys | No | 11 drug hits | − | + |
| RBD | C | A | 2 | 1.058201 | His519Asn | No | 2 drug hits | − | + |
| RBD | G | C | 2 | 1.058201 | Ala522Pro | No | 3 drug hits | − | + |
| RBD | T | G | 2 | 1.058201 | Val534Gly | Yes | 2 drug hits | − | + |
| RBD | AA | GT | 2 | 1.058201 | Lys535Val | Yes | No drug hits | − | + |
| C | A | 16 | 8.465608 | Ala570Asp | No | 1 drug hit | B.1.1.7 | + | |
| A | G | 95 | 50.26455 | Asp614Gly | No | 1 drug hit | B.1.17, B.1.351, P.1, B.1.525, B.1.429 + B.1.427, B.1.526, B.1.617, B.1.1.28 | + | |
| C | T | 4 | 2.116402 | His655Tyr | No | 1 drug hit | B.1.1.28 | + | |
| G | T | 3 | 1.587302 | Gln677His | No | No drug hits | B.1.525 | + | |
| C | A | 16 | 8.465608 | Pro681His | No | 1 drug hit | B.1.1.7 | + | |
| C | G | 6 | 3.174603 | Pro681Arg | No | 1 drug hit | B.1.617 | + | |
| C | T | 16 | 8.465608 | Thr716Ile | No | 1 drug hit | B.1.1.7 | + | |
| HR | G | A | 8 | 4.232804 | Asp950Asn | No | No drug hits | B.1.617 | + |
| HR | T | G | 17 | 8.994709 | Ser982Ala | No | No drug hits | B.1.1.7 | + |
| G | C | 16 | 8.465608 | Asp1118His | No | No drug hits | B.1.1.7 | + |
NTD; N-terminal domain (NTD), RBD; receptor-binding domain, RBM; receptor-binding motif, FP; fusion peptide, HR; heptad repeat1.
Fig. 1Position of SNPs on the 3D structure of SARS-CoV-2 Spike protein in Iranian isolates. SNPs have been shown in green color. A, B, and C show SNPs in a chain. D show SNPs in a chain that combined to other two chains. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Single-nucleotide polymorphisms (SNPs) in whole genome (except spike ORF) sequences of Iranian human SARS-COV-2 isolates and their effect on drug binding site.
| ORF | Reference | Allele | Count | Frequency | Amino acid change | Effect on putative drug-binding site (PDB drug name) |
|---|---|---|---|---|---|---|
| ORF1ab | G | T | 2 | 3.125 | Trp161Leu | – |
| ORF1ab | C | T | 16 | 25 | Arg207Cys | – |
| ORF1ab | G | C | 2 | 3.125 | Cys316Ser | – |
| ORF1ab | G | A | 36 | 56.25 | Val378Ile | – |
| ORF1ab | C | T | 6 | 9.375 | His417Tyr | – |
| ORF1ab | A | C | 2 | 3.125 | Glu489Ala | – |
| ORF1ab | C | T | 2 | 3.125 | Ala656Val | – |
| ORF1ab | C | T | 3 | 4.6875 | Ser944Leu | – |
| ORF1ab | T | C | 2 | 3.125 | Cys1114Arg | – |
| ORF1ab | C | T | 3 | 4.6875 | Leu1507Phe | – |
| ORF1ab | C | T | 2 | 3.125 | Thr1682Ile | – |
| ORF1ab | C | T | 3 | 4.6875 | Thr1760Ile | – |
| ORF1ab | G | T | 2 | 3.125 | Arg2159Leu | – |
| ORF1ab | C | T | 2 | 3.125 | Ser2488Phe | – |
| ORF1ab | A | G | 2 | 3.125 | Asn2596Ser | – |
| ORF1ab | G | T | 16 | 25 | Met2796Ile | – |
| ORF1ab | G | T | 32 | 50 | Leu3606Phe | – |
| ORF1ab | C | A | 2 | 3.125 | His3633Asn | – |
| ORF1ab | A | G | 3 | 4.6875 | Glu3909Gly | – |
| ORF1ab | C | T | 2 | 3.125 | Thr4129Ile | – |
| ORF1ab | G | T | 3 | 4.6875 | Gly4529Cys | – |
| ORF1ab | C | T | 27 | 42.1875 | Pro4715Leu | – |
| ORF1ab | C | T | 2 | 3.125 | Leu5030Phe | – |
| ORF1ab | C | T | 6 | 9.375 | Thr5193Ile | – |
| ORF1ab | C | T | 4 | 6.25 | His5614Tyr | UR7, VW4, UXG, N0E, S9S, UQS, VVD, JFM, VWD, NZG, VWJ, VWG, K34 |
| ORF1ab | C | T | 3 | 4.6875 | Thr5675Ile | VVG, STV |
| ORF1ab | T | G | 2 | 3.125 | Phe5799Val | K34 |
| ORF1ab | C | T | 2 | 3.125 | Ser5809Leu | VWM, VXG, K2P |
| ORF1ab | C | T | 2 | 3.125 | Thr6038Ile | – |
| ORF1ab | G | A | 6 | 9.375 | Gly6875Arg | – |
| ORF3a | G | T | 15 | 23.4375 | Gln57His | – |
| ORF3a | G | T | 3 | 4.6875 | Val112Phe | – |
| ORF3a | A | G | 3 | 4.6875 | Thr223Ala | – |
| ORF3a | C | T | 2 | 3.125 | Thr223Ile | – |
| ORFM | A | G | 3 | 4.6875 | Ile73Met | – |
| ORF7b | G | T | 3 | 4.6875 | Leu11Phe | – |
| ORF7b | C | T | 3 | 4.6875 | Thr40Ile | – |
| ORFN | G | T | 2 | 3.125 | Asp3Tyr | – |
| ORFN | C | A | 3 | 4.6875 | Pro13Thr | – |
| ORFN | C | T | 2 | 3.125 | Ala35Val | – |
| ORFN | C | T | 6 | 9.375 | Ser186Phe | – |
| ORFN | C | G | 2 | 3.125 | Asn192Lys | – |
| ORFN | C | T | 13 | 20.3125 | Ser194Leu | – |
| ORFN | GGG | AAC | 7 | 10.9375 | Gly204delinsLysArg | – |
| ORFN | C | T | 3 | 4.6875 | Ala220Val | – |
| ORFN | G | T | 2 | 3.125 | Met234Ile | – |
UR7: 1-(3-fluoro-4-methylphenyl)methanesulfonamide, VW4: (2S)-2-phenylpropane-1-sulfonamide, UXG: 1-(diphenylmethyl)azetidin-3-ol, N0E: N}-(4-hydroxyphenyl)-3-phenyl-propanamide, S9S: -[2-(4-fluorophenyl)ethyl]methanesulfonamide, UQS: N-[(2-fluorophenyl)methyl]-1H-pyrazol-4-amine, VVD: 5-(acetylamino)-2-fluorobenzoic acid, JFM: N-(2-phenylethyl) methanesulfonamide, VWD: (1R)-2-(methylsulfonyl)-1-phenylethan-1-ol, NZG: 3-(acetylamino)-4-fluorobenzoic acid, VWJ: N-(propan-2-yl)-1H-benzimidazol-2-amine, VWG: N-hydroxyquinoline-2-carboxamide, K34: 5-(1,3-thiazol-2-yl)-1H-1,2,4-triazole, VVG: N-(2-fluorophenyl) ethane sulfonamide, STV: -(1,3-benzodioxol-5-ylmethyl) ethane sulfonamide, VWM: (3R)-1-acetyl-3-hydroxypiperidine-3-carboxylic acid, VXG: (3S,4R)-1-acetyl-4-phenylpyrrolidine-3-carboxylic acid, K2P: 2-(trifluoromethoxy)benzoic acid.
Fig. 2Location of SNPs in the binding site of N-acetylglucosamine and polysorbate 80 with spike protein of SARS-CoV-2, Iranian isolates.
Transition/transversion and Gamma parameter based on whole-genome and spike gene sequences of Iranian isolates.
| Based on | Transition/transversion (R) | Gamma parameter for site rates |
|---|---|---|
| Whole genome | 1.33 | 0.05 |
| Spike gene | 0.64 | 0.339 |
Fig. 3The whole genome-based phylogenetic analysis of SARS-CoV-2 isolates from Iran and other countries and a candidate of new variants.
Physicochemical parameters of major new SARS-CoV-2 variants B.1.1.7 (Alpha), B.1.525 (Eta) and B.1.617 (Delta).
| Parameters | Molecular weight (kDa) | Theoretical pI | The instability index | Grand average of hydropathicity | Total number of negatively/positively charged residues |
|---|---|---|---|---|---|
| Wuhan | 141.178 | 6.24 | 33.01 | −0.079 | 110/103 |
| B.1.1.7 (Alpha) | 140.824 | 6.35 | 32.58 | −0.075 | 109/103 |
| B.1.525 (Eta) | 141.150 | 6.32 | 32.86 | −0.077 | 109/103 |
| B.1.617 (Delta) | 140.986 | 6.78 | 32.81 | −0.090 | 108/106 |