| Literature DB >> 35017872 |
Srinivasan Sivasubramanian1, Vidya Gopalan1, Kiruba Ramesh1, Padmapriya Padmanabhan1, Kiruthiga Mone1, Karthikeyan Govindan1, Selvakumar Velladurai1, Prabu Dhandapani2, Kaveri Krishnasamy1, Satish Srinivas Kitambi3.
Abstract
INTRODUCTION: The COVID-19 pandemic is associated with high morbidity and mortality, with the emergence of numerous variants. The dynamics of SARS-CoV-2 with respect to clade distribution is uneven, unpredictable and fast changing.Entities:
Keywords: COVID-19; Clade; India; SARS-CoV-2; mutations; phylogeny; spike protein
Year: 2021 PMID: 35017872 PMCID: PMC8697821 DOI: 10.4103/jgid.jgid_97_21
Source DB: PubMed Journal: J Glob Infect Dis ISSN: 0974-777X
Figure 1SARS-CoV-2 clade distribution pattern in India. (a) Pie chart showing the proportion of various clades of the genomes deposited from India in Global Initiative on Sharing All Influenza Data; (b) Schematic geographical map showing the proportion and distribution of clades from different states of India; (c) Month wise clade distribution during the year 2020
Figure 2Phylogenetic tree showing clade diversity for SARS-CoV-2 Indian isolates. These isolates fall under 7 genetic clades with the majority falling under GR clade
Amino acid substitution mutations observed across various regions of S proteins of Indian severe acute respiratory syndrome coronavirus 2 isolates
| Region | Position | Number of mutation sites | Number of mutations |
|---|---|---|---|
| Signal Peptide | 1-13 | 7 | 9 |
| N-Terminal Domain | 14-305 | 144 | 211 |
| Receptor Binding Domain | 319-541 | 53 | 63 |
| Protease Cleavage Site | 675-692 | 8 | 11 |
| Fusion Peptide | 788-806 | 6 | 6 |
| HR1 | 912-984 | 24 | 31 |
| HR2 | 1163-1213 | 15 | 18 |
| Transmembrane Domain | 1214-1237 | 13 | 16 |
| Cytoplasm Domain | 1238-1273 | 12 | 13 |
Figure 3Amino acid mutations and their frequency in different regions of S proteins of SARS-CoV-2 isolates from India. SP: Signal peptide, NTD: N-terminal domain, RBD: Receptor binding domain, PC: Protease cleavage site, FP: Fusion peptide, HR1: Heptad repeat 1, HR2: heptad repeat 2, TM: Transmembrane domain, CT: Cytoplasm domain
Figure 4Distribution and frequency of the most prevalent mutations of S protein of SARS-CoV-2 isolates circulated in India during the year 2020. D614G is predominant throughout the year with high frequency followed by L54F mutation. D614G, Q677H and P681H mutations originated during the first half of the year and their appearance was observed throughout the year; L54F, K77M and P812 L mutations emerged during the first half of the year but absent after few months of their appearance
Figure 5Frequency of amino acid mutations impacting O- and N-glycosylation patterns. Few sites such as S221, T602 and N1074 are having more than one mutation
N- and O-linked glycosylation sites of S protein of SARS-CoV-2 and amino acid mutations at these sites affecting the glycosylation pattern in Indian severe acute respiratory syndrome coronavirus 2 variants
| N-linked Glycosylation Site (NGS) | Mutation | O-linked Glycosylation Site (OGS) | Mutation |
|---|---|---|---|
| N17 | - | T73 | T73I |
| N61 | - | S161 | - |
| N74 | - | S221 | S221, S221F |
| N122 | - | T323 | T323I |
| N149 | N149G | S325 | - |
| N165 | N165S | T333 | - |
| N234 | N234Y | T345 | - |
| N282 | - | T602 | T602I, T602L |
| N331 | - | T605 | - |
| N343 | - | T618 | - |
| N603 | N603Y | T630 | - |
| N616 | - | T632 | - |
| N657 | - | T659 | - |
| N709 | N709K | S673 | - |
| N717 | - | T676 | - |
| N801 | - | S803 | - |
| N1074 | N1074D, N1074B | S810 | - |
| N1098 | - | S813 | S813I |
| N1134 | - | S982 | S982A |
| N1158 | - | T1051 | - |
| N1173 | - | T1076 | - |
| N1194 | - | T1077 | T1077I |
| T1136 | - | ||
| S1161 | - | ||
| S1170 | - | ||
| S1175 | - |
Figure 6Phylogenetic tree of isolates having distinct mutations in the gene of S protein. The tree was constructed by maximum-likelihood method with the tree having the root as SARS-CoV-2 Wuhan-Hu-1 sequence (NC_045512.2)