| Literature DB >> 33200080 |
Sofia Banu1, Bani Jolly2,3, Payel Mukherjee1, Priya Singh1, Shagufta Khan1, Lamuk Zaveri1, Sakshi Shambhavi1,3, Namami Gaur1, Shashikala Reddy4, K Kaveri5, Sivasubramanian Srinivasan5, Dhinakar Raj Gopal6, Archana Bharadwaj Siva1, Kumarasamy Thangaraj1, Karthik Bharadwaj Tallapaka1, Rakesh K Mishra1, Vinod Scaria2, Divya Tej Sowpati1.
Abstract
BACKGROUND: From an isolated epidemic, coronavirus disease 2019 has now emerged as a global pandemic. The availability of genomes in the public domain after the epidemic provides a unique opportunity to understand the evolution and spread of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus across the globe.Entities:
Keywords: COVID-19; Clade I/A3i; India; genetic epidemiology; phylogenomics
Year: 2020 PMID: 33200080 PMCID: PMC7543508 DOI: 10.1093/ofid/ofaa434
Source DB: PubMed Journal: Open Forum Infect Dis ISSN: 2328-8957 Impact factor: 3.835
Figure 1.Phylogenetic clusters and clades as generated by Nextstrain for the dataset of 1377 high-quality Indian severe acute respiratory syndrome coronavirus 2 genomes. Indian genomes were found to fall under 7 clusters with the majority of the genomes falling under clade A2a. The second largest cluster in India (purple) has been designated as clade I/A3i.
Figure 2.Shared variants among the variants that define the clusters of the Indian severe acute respiratory syndrome coronavirus 2 genomes. The size of the circle represents the allele frequencies of the respective variants, and clade-defining variants are marked with a red circle.
Figure 3.(A) Proportion of the I/A3i clade (purple) and A2a (teal) in the genomes sequenced from different states of India. The proportion of the A2a clade (teal) is also shown for comparison, whereas all other clades are shaded gray. (B) The short tree of the I/A3i clade diverging from a central point suggests a single point of introduction and spread across the different states. The 23 global genomes that were sampled before the first Indian genome from this cluster are highlighted in gray.
Nucleotide Substitution Rates of the Different Structural Protein Genes and Genome-Wide Across the Different Clusters and Clades in Indiaa
| Clade/ Cluster | S Gene | E Gene | M Gene | N Gene | Genome |
|---|---|---|---|---|---|
| All (N = 1376) | 3.55 × 10–3 | 4.57 × 10–3 | 4.69 × 10–3 | 6.94 × 10–3 | 1.76 × 10–3 |
| A2a (N = 1143) | 3.49 × 10–3 | 5.42 × 10–3 | 3.67 × 10–3 | 5.85 × 10–3 | 1.73 × 10–3 |
| I/A3i (N = 149) | 0.94 × 10–3 | 3.5 × 10–3 | 2.26 × 10–3 | 1.54 × 10–3 | 1.1 × 10–3 |
| B4 (N = 52) | 1.18 × 10–3 | 3.9 × 10–3 | 1.15 × 10–3 | 3.33 × 10–3 | 1.12 × 10–3 |
| A3 (N = 17) | 1.36 × 10–3 | 1.89 × 10–3 | 1.28 × 10–3 | 7.23 × 10–3 | 1.85 × 10–3 |
aThe estimates for A1a, B, and B1 were not computed because the clades encompassed very few genomes from India.
Functional Characteristics of the Four Variants That Define Clade I/A3i and Other Clades Across the Worlda
| Clade | Gene | Site | Mutation | PROVEAN Score/Prediction | SIFT Score/Prediction | Conservation Scores |
|---|---|---|---|---|---|---|
| A1a | ORF3a | G26144T | G251V | −8.581 | 0 | PhyloP: 4.256 |
| Deleterious | Deleterious | GERP: 1.65 | ||||
| A1a, A3, I/A3i | ORF1a | G11083T | L3606F | −1.4 | 0.01 | PhyloP: −1.32286 |
| Neutral | Deleterious | GERP: −3.3 | ||||
| A2 | S | A23403G | D614G | 0.598 | 0.3 | PhyloP: 2.25839 |
| Neutral | Tolerated | GERP: 1.65 | ||||
| A2a | ORF1b | C14408T | P314L | −0.914 | 0.31 | PhyloP: 3.30748 |
| Neutral | Tolerated | GERP: 1.65 | ||||
| A3 | ORF1a | G1397A | V378I | −0.199 | 0.62 | PhyloP: 0.227575 |
| Neutral | Tolerated | GERP: −1.81 | ||||
| A7 | ORF1a | C9924T | A3220V | −2.049 | 0.04 | PhyloP: 3.30935 |
| Neutral | Deleterious | GERP: 1.65 | ||||
| B, B1, B2, B4 | ORF8 | T28144C | L84S | 2.333 | 0.37 | PhyloP: −1.52089 |
| Neutral | Tolerated | GERP: −0.206 | ||||
| B4 | N | G28878A | S202N | −0.404 | 0 | PhyloP: 4.256 |
| Neutral | Deleterious | GERP: 1.65 | ||||
| I/A3i | N | C28311T | P13L | −1.23 | 0 | PhyloP: 3.27687 |
| Neutral | Deleterious | GERP: 1.65 | ||||
| I/A3i | RDRP/ORF1b | C13730T | A97V (A88V in ORF1b) | −3.611 | 0 | PhyloP: 3.31844 |
| Deleterious | Deleterious | GERP: 1.65 | ||||
| I/A3i | ORF1a | C6312A | T2016K | −0.352 | 0.03 | PhyloP: 3.29661 |
| Neutral | Deleterious | GERP: 1.61 |
aPROVEAN scores of less than −2.5 are considered deleterious in nature. Similarly, SIFT scores of 0 to 0.05 are considered deleterious.
Figure 4.(A) A week-wise stacked bar displaying the proportion of clades across India, starting from the week of March 1, 2020. (B) A bubble plot depicting the change in the predominant clades with time in various states. X-axis indicates the date on which the sample was collected, and color indicates the clade. Only those states with collection data across at least 2 months are plotted.