| Literature DB >> 33476804 |
Antara Sengupta1, Sk Sarif Hassan2, Pabitra Pal Choudhury3.
Abstract
Clades are monophyletic groups composed of a common ancestor and all its lineal descendants. As the propensity of virulence of a disease depends upon the type of clade the virus belongs to and it causes different fatality rates of disease in different countries, so the clade-wise analysis of SARS-CoV-2 isolates collected from different countries can illuminate the actual evolutionary relationships between them. In this study, 1566 SARS-CoV-2 genome sequences across ten Asian countries are collected, clustered, and characterized based on the clade they belong to. The isolates are compared to the Wuhan reference sequence" hCoV-19/Wuhan/WIV04/19″ to identify the mutations that occurred at different protein regions. Structural changes in amino acids due to mutations lead to functional instability of the proteins. Detailed clade-wise functional assessments are carried out to quantify the stability and vulnerability of the mutations occurring in SARS-CoV-2 genomes which can shade light on personalized prevention and treatment of the disease and encourage towards the invention of clade-specific vaccines.Entities:
Keywords: Asia; COVID-19; Clades; Deleterious mutations; SARS-CoV-2
Year: 2021 PMID: 33476804 PMCID: PMC7816605 DOI: 10.1016/j.meegid.2021.104724
Source DB: PubMed Journal: Infect Genet Evol ISSN: 1567-1348 Impact factor: 3.342
Collected SARS-CoV-2 genome sequences from COVID patients from ten different Asian Countries.
| Country | # Associated | #Isolate |
|---|---|---|
| Name | Isolate | in work |
| INDIA | 570 | 565 |
| THAILAND | 227 | 104 |
| BANGLADESH | 231 | 231 |
| IRAN | 172 | 106 |
| CHINA | 189 | 189 |
| JAPAN | 96 | 96 |
| SAUDI ARABIA | 58 | 58 |
| PAKISTAN | 10 | 9 |
| MALAYSIA | 9 | 9 |
| SRILANKA | 4 | 4 |
Fig. 1Calculate percentage of clades representing the isolates.
Clade-wise counting of isolates reported in 10 countries.
| COUNTRY | G | GH | GR | L | S | O | V |
|---|---|---|---|---|---|---|---|
| THAILAND | 12 | 0 | 0 | 1 | 39 | 51 | 1 |
| BANGLADESH | 6 | 13 | 208 | 1 | 1 | 2 | 0 |
| SRILANKA | 1 | 0 | 1 | 0 | 0 | 2 | 0 |
| CHINA | 2 | 0 | 1 | 64 | 48 | 71 | 3 |
| JAPAN | 2 | 5 | 71 | 13 | 2 | 3 | 0 |
| INDIA | 191 | 303 | 18 | 7 | 19 | 27 | 0 |
| IRAN | 32 | 1 | 0 | 4 | 0 | 69 | 0 |
| MALAYSIA | 0 | 1 | 0 | 0 | 0 | 8 | 0 |
| PAKISTAN | 0 | 1 | 5 | 2 | 0 | 1 | 0 |
| SAUDI ARABIA | 2 | 32 | 22 | 0 | 1 | 1 | 0 |
Fig. 2Calculating clade-wise percentage of mutations occurred in different countries.
Fig. 3Quantitative analysis of clade-wise non-synonymous mutation. (a) Percentage of non-synonymous mutations found in different clades; (b) Clade-wise percentage of non-synonymous mutations occurred in different protein regions.
Clade-wise segregation of mutated data.
| PROTEIN | G | GH | GR | L | S | O | V |
|---|---|---|---|---|---|---|---|
| E | 2 | 1 | 4 | 9 | 2 | 2 | 0 |
| M | 6 | 15 | 13 | 0 | 2 | 3 | 0 |
| N | 60 | 264 | 709 | 13 | 31 | 52 | 0 |
| NS3 | 30 | 376 | 709 | 12 | 10 | 24 | 3 |
| NS6 | 1 | 1 | 3 | 0 | 1 | 47 | 0 |
| NS7a | 4 | 3 | 1 | 1 | 2 | 0 | |
| NS7b | 0 | 1 | 0 | 6 | 0 | 0 | |
| NS8 | 2 | 18 | 18 | 1 | 69 | 4 | 0 |
| NSP1 | 4 | 11 | 1 | 9 | 2 | 0 | |
| NSP10 | 4 | 3 | 4 | 0 | 1 | 4 | 0 |
| NSP11 | 2 | 1 | 1 | 0 | 13 | 1 | |
| NSP12 | 241 | 383 | 366 | 9 | 18 | 45 | 3 |
| NSP13 | 24 | 21 | 26 | 7 | 3 | 12 | 0 |
| NSP14 | 49 | 109 | 20 | 4 | 5 | 10 | 0 |
| NSP15 | 22 | 10 | 31 | 5 | 5 | 11 | 0 |
| NSP16 | 20 | 23 | 9 | 11 | 1 | 11 | 0 |
| NSP2 | 52 | 82 | 245 | 9 | 11 | 20 | 4 |
| NSP3 | 109 | 114 | 156 | 27 | 21 | 82 | 0 |
| NSP4 | 50 | 8 | 10 | 16 | 9 | 11 | 0 |
| NSP5 | 16 | 24 | 24 | 1 | 4 | 3 | 0 |
| NSP6 | 3 | 25 | 32 | 4 | 4 | 26 | 7 |
| NSP7 | 0 | 1 | 1 | 0 | 3 | 1 | 0 |
| NSP8 | 2 | 7 | 8 | 1 | 14 | 7 | 0 |
| NSP9 | 4 | 4 | 1 | 0 | 1 | 0 | 0 |
| S | 317 | 583 | 384 | 18 | 34 | 599 | 1 |
Investigate the deleterious mutations in total dataset.
| PROTEIN | DELETERIOUS | PROVEAN | PROTEIN | DELETERIOUS | PROVEAN |
|---|---|---|---|---|---|
| MUTATION | SCORE | MUTATION | SCORE | ||
| S | D936Y | −2.60 | NSP2 | T85I | −4.09 |
| S | L752R | −3.84 | NSP2 | F368V | −2.69 |
| S | A288P | −2.80 | NSP2 | G339S | −3.13 |
| S | C851W | −10.16 | NSP2 | P129S | −3.17 |
| S | D820N | −4.67 | NSP2 | E309A | −3.33 |
| S | E725K | −3.75 | NSP3 | A1914D | −3.8 |
| S | G648R | −4.093 | NSP3 | P1558L | −4.532 |
| S | G669R | −3.58 | NSP4 | L111S | −1.54 |
| S | L533K | −2.62 | NSP4 | G309C | −6.73 |
| S | L916F | −3.74 | NSP5 | P108S | −4.53 |
| S | P863H | −5.12 | NSP5 | I106S | −4.41 |
| S | R905S | −5.62 | NSP5 | L75F | −3.79 |
| S | S758I | −4.13 | NSP5 | N142L | −4.3 |
| S | S875F | −3.37 | NSP5 | R279C | −6.46 |
| S | T716P | −3.347 | NSP6 | G277S | −6 |
| S | T874P | −5.59 | NSP6 | H64N | −2.97 |
| S | V534G | −3.12 | NSP12 | D303G | −6.05 |
| N | S193I | −2.76 | NSP1 | W161L | −7.8 |
| N | S194L | −4.27 | NSP13 | R392C | −2.73 |
| N | R191L | −3.27 | NSP14 | T113I | −4.25 |
| N | S180L | −3.40 | NSP15 | M330T | −5.62 |
| PROTEIN | DELETERIOUS | PROVEAN | NSP16 | R287I | −2.54 |
| MUTATION | SCORE | NSP16 | P12S | −7.73 | |
| NS3 | Q57H | −3.286 | NSP16 | G77R | −7.35 |
| NS3 | G251V | −8.58 | NSP12 | T803I | −3.55 |
| NS3 | D155Y | −6.83 | NSP12 | M124I | −3.048 |
| NS3 | G172C | −6.75 | NSP12 | P227S | −6.00 |
| NS3 | G172V | −6.76 | NSP12 | S607I | −2.83 |
| NS3 | N257D | −3.286 | NSP12 | A97V | −3.61 |
| NS3 | G251V | −8.58 | |||
| NS3 | W45R | −8.87 | |||
| NS7a | E95K | −2.61 | |||
| NS7a | P45L | −10 | |||
| NS7a | V104F | −2.83 | |||
| NS8 | P38R | −8.5 |
Clade-wise clustering of deleterious mutations in total dataset.
| MUTATION | G | GH | GR | L | S | O | V | MUTATION | G | GH | GR | L | S | O | V |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| S193I | 1 | 7 | 0 | 0 | 0 | 0 | 0 | V104F | 0 | 2 | 0 | 0 | 0 | 0 | 0 |
| S194L | 21 | 190 | 0 | 0 | 0 | 2 | 0 | P38R | 0 | 0 | 3 | 0 | 0 | 0 | 0 |
| R191L | 2 | 0 | 0 | 0 | 0 | 0 | 0 | W161L | 1 | 1 | 0 | 0 | 0 | 0 | 0 |
| S180L | 1 | 1 | 0 | 0 | 0 | 0 | 0 | R392C | 0 | 0 | 2 | 0 | 0 | 0 | 0 |
| D936Y | 1 | 1 | 0 | 0 | 0 | 0 | 0 | T113I | 0 | 0 | 0 | 1 | 0 | 4 | 0 |
| L752R | 0 | 0 | 0 | 0 | 0 | 2 | 0 | M330T | 0 | 0 | 2 | 0 | 0 | 0 | 0 |
| A288P | 0 | 0 | 0 | 0 | 0 | 3 | 0 | R287I | 0 | 0 | 3 | 0 | 0 | 0 | 0 |
| C851W | 1 | 0 | 0 | 0 | 0 | 1 | 0 | P12S | 0 | 0 | 0 | 4 | 0 | 0 | 0 |
| D820N | 0 | 0 | 0 | 0 | 0 | 2 | 0 | G77R | 0 | 0 | 0 | 3 | 0 | 1 | 0 |
| E725K | 0 | 0 | 0 | 0 | 0 | 2 | 0 | T85I | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| G648R | 1 | 0 | 0 | 0 | 0 | 3 | 0 | F368V | 1 | 1 | 0 | 0 | 0 | 0 | 0 |
| G669R | 0 | 0 | 0 | 0 | 0 | 4 | 0 | G339S | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
| L533K | 0 | 0 | 0 | 0 | 0 | 2 | 0 | P129S | 0 | 0 | 3 | 0 | 0 | 0 | 0 |
| L916F | 1 | 0 | 0 | 0 | 0 | 1 | 0 | E309A | 1 | 1 | 0 | 0 | 0 | 0 | 0 |
| P863H | 4 | 15 | 0 | 0 | 0 | 0 | 0 | A1914D | 2 | 3 | 0 | 0 | 0 | 0 | 0 |
| R905S | 2 | 0 | 0 | 0 | 0 | 0 | 0 | P1558L | 0 | 0 | 3 | 2 | 0 | 2 | 0 |
| S758I | 2 | 0 | 0 | 0 | 0 | 0 | 0 | L111S | 0 | 0 | 0 | 0 | 5 | 0 | 0 |
| S875F | 1 | 0 | 0 | 0 | 0 | 2 | 0 | G309C | 1 | 2 | 0 | 0 | 0 | 0 | 0 |
| T716P | 0 | 0 | 0 | 0 | 0 | 2 | 0 | P108S | 0 | 0 | 7 | 0 | 0 | 0 | 0 |
| T874P | 0 | 0 | 0 | 0 | 0 | 3 | 0 | I106S | 0 | 0 | 2 | 0 | 0 | 0 | 0 |
| V534G | 0 | 0 | 0 | 0 | 0 | 2 | 0 | L75F | 1 | 2 | 0 | 0 | 0 | 0 | 0 |
| Q57H | 12 | 342 | 7 | 0 | 0 | 7 | 0 | N142L | 0 | 5 | 0 | 0 | 0 | 0 | 0 |
| G251V | 0 | 0 | 0 | 0 | 0 | 5 | 4 | R279C | 0 | 2 | 0 | 0 | 0 | 0 | 0 |
| D155Y | 0 | 2 | 5 | 0 | 0 | 0 | 0 | G277S | 0 | 2 | 0 | 0 | 0 | 0 | 0 |
| G172C | 0 | 0 | 3 | 0 | 0 | 0 | 0 | H64N | 1 | 1 | 0 | 0 | 0 | 0 | 0 |
| G172V | 0 | 0 | 3 | 0 | 0 | 0 | 0 | D303G | 0 | 0 | 2 | 0 | 0 | 0 | 0 |
| N257D | 1 | 1 | 0 | 0 | 0 | 0 | 0 | T803I | 0 | 0 | 2 | 0 | 0 | 0 | 0 |
| W45R | 0 | 0 | 0 | 7 | 0 | 0 | 0 | M124I | 0 | 0 | 3 | 0 | 0 | 0 | 0 |
| E95K | 1 | 1 | 0 | 0 | 0 | 0 | 0 | P227S | 0 | 0 | 2 | 0 | 0 | 0 | 0 |
| P45L | 0 | 2 | 0 | 0 | 0 | 0 | 0 | S607I | 0 | 0 | 2 | 0 | 0 | 0 | 0 |
| A97V | 0 | 0 | 0 | 3 | 0 | 22 | 0 |
Fig. 4Quantification of clade-wise deleterious non-synonymous mutation. (a) Clade-wise percentage of deleterious non-synonymous mutations in different protein regions; (b) Clade-wise calculations of deleterious non-synonymous mutations of total data set.
Investigating Stability of deleterious mutations in total dataset.
| Mutation | Mutation | Stability | ΔΔ | Mutation | Mutation | Stability | ΔΔ |
|---|---|---|---|---|---|---|---|
| type | type | ||||||
| S194L | Deleterious | Decreased | −0.47 | G77R | Deleterious | Decreased | −0.65 |
| D936Y | Deleterious | Decreased | −0.35 | T85I | Deleterious | Decreased | −0.91 |
| L752R | Deleterious | Decreased | −1.39 | F368V | Deleterious | Decreased | −1.74 |
| C851W | Deleterious | Decreased | −0.09 | G339S | Deleterious | Decreased | −1.2 |
| D820N | Deleterious | Decreased | −1.25 | P129S | Deleterious | Decreased | −1.82 |
| E725K | Deleterious | Decreased | −0.35 | E309A | Deleterious | Decreased | −0.96 |
| G648R | Deleterious | Decreased | −0.6 | A1914D | Deleterious | Decreased | −0.68 |
| G669R | Deleterious | Decreased | −0.14 | P1558L | Deleterious | Decreased | −0.36 |
| L533K | Deleterious | Decreased | −1.81 | L111S | Deleterious | Decreased | −2.29 |
| L916F | Deleterious | Decreased | −1.12 | G309C | Deleterious | Decreased | −1.07 |
| P863H | Deleterious | Decreased | −1.44 | P108S | Deleterious | Decreased | −1.73 |
| R905S | Deleterious | Decreased | −1.18 | I106S | Deleterious | Decreased | −2.24 |
| T716P | Deleterious | Decreased | −0.8 | L75F | Deleterious | Decreased | −1.09 |
| T874P | Deleterious | Decreased | −0.22 | R279C | Deleterious | Decreased | −0.67 |
| V534G | Deleterious | Decreased | −2.029 | G277S | Deleterious | Decreased | −1.3 |
| Q57H | Deleterious | Decreased | −0.9 | H64N | Deleterious | Decreased | −0.03 |
| G251V | Deleterious | Decreased | −0.54 | D303G | Deleterious | Decreased | −0.9 |
| G172C | Deleterious | Decreased | −0.83 | M124I | Deleterious | Decreased | −0.53 |
| G172V | Deleterious | Decreased | −0.41 | P227S | Deleterious | Decreased | −1.49 |
| N257D | Deleterious | Decreased | −0.21 | S193I | Deleterious | Increased | −0.29 |
| W45R | Deleterious | Decreased | −1.05 | R191L | Deleterious | Increased | −0.26 |
| E95K | Deleterious | Decreased | −0.58 | S180L | Deleterious | Increased | −0.12 |
| P45L | Deleterious | Decreased | −0.71 | A288P | Deleterious | Increased | −0.29 |
| V104F | Deleterious | Decreased | −1.47 | S758I | Deleterious | Increased | 0.43 |
| P38R | Deleterious | Decreased | −0.9 | S875F | Deleterious | Increased | 0.66 |
| W161L | Deleterious | Decreased | −0.51 | D155Y | Deleterious | Increased | 0.21 |
| R392C | Deleterious | Decreased | −1.29 | N142L | Deleterious | Increased | −0.05 |
| T113I | Deleterious | Decreased | −0.43 | T803I | Deleterious | Increased | 0.18 |
| M330T | Deleterious | Decreased | −1 | S607I | Deleterious | Increased | 0.37 |
| R287I | Deleterious | Decreased | −0.66 | A97V | Deleterious | Increased | −0.31 |
| P12S | Deleterious | Decreased | −1.29 |
Amino Acid changed due to deleterious mutations taken place.
| MUTATION | COUNT | MUTATION | COUNT |
|---|---|---|---|
| Q > H | 368 | M > I | 3 |
| S > L | 215 | P > R | 3 |
| A > V | 25 | R > I | 3 |
| P > H | 19 | S > F | 3 |
| P > S | 16 | C > W | 2 |
| G > R | 12 | D > G | 2 |
| G > V | 12 | D > N | 2 |
| S > I | 12 | E > A | 2 |
| P > L | 9 | E > L | 2 |
| T > I | 9 | F > V | 2 |
| D > Y | 9 | H > N | 2 |
| W > R | 7 | I > S | 2 |
| G > C | 6 | L > K | 2 |
| A > D | 5 | L > R | 2 |
| L > F | 5 | L > S | 2 |
| N > L | 5 | M > T | 2 |
| T > P | 5 | N > D | 2 |
| E > K | 4 | R¿ > L | 2 |
| R > C | 4 | R > S | 2 |
| A > P | 3 | V > F | 2 |
| G > S | 3 | V > G | 2 |