| Literature DB >> 33709963 |
Saurav Mandal1, Tanmoy Roychowdhury, Alok Bhattacharya.
Abstract
SARS-CoV-2 is a member of the Coronavirus family which recently originated from the Wuhan province of China and spread very rapidly through the world infecting more than 4 million people. In the past, other Coronaviruses have also been found to cause human infection, but not as widespread as COVID-19. Since Coronavirus sequences constantly change due to mutation and recombination, it is important to understand the pattern of changes and likely path the virus can take in the future. In this study, we have used the Shewhart control chart to identify and analyze hypervariable (hotspots) and hypovariable (coldspots) regions of the virus. Our analysis shows that SARS-CoV-2 has changed in a few regions of the genome. Analysis of SARS-CoV-1 and MERS sequences suggests that over time, mutations start accumulating in different regions and most likely SARS-CoV-2 may also follow a similar path. The results suggest a possible emergence of modified viruses over some time.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33709963 PMCID: PMC7856336
Source DB: PubMed Journal: J Biosci ISSN: 0250-5991 Impact factor: 1.826
Figure 1SNVs frequency identified at each genomic location by comparing NC_045512 genome along with all the SARS-CoV-2 isolates.
Figure 2Red dots indicate the Hotspot regions of SARS-Cov-2. Using Figure 1 binning of size 250 was performed to find the SNV frequency in each bin by applying Shewhart control chart. Refer to table 1 for hotspot location details.
SARS-CoV-2 hotspot location details.
| No. | Genomic position | Protein/peptide | Number of SNVs present | Domain | Comments | |
|---|---|---|---|---|---|---|
| From | To | |||||
| 1 | 0 | 250 | 5′-UTR ORF1 | 702 | Consists of stem-loop 5 | ATG present from location 265 |
| 2 | 1000 | 1250 | nsp2 | 171 | Topological domain | Plays a role in the modulation of host cell survival signaling pathway |
| 3 | 3000 | 3250 | nsp3 | 285 | Predicted phosphoesterase transmembrane domain 1 | Responsible for the cleavages located at the N-terminus of the replicase polyprotein |
| 4 | 8500 | 8750 | nsp4 | 8 | Transmembrane domain 2 (Helical) | Cleavage by PL-PRO |
| 5 | 14250 | 14500 | Nsp12 /RNA-dependent RNA polymerase | 264 | Consists of Helix, Zinc finger and sheet | Domain 2 of the nsp12 protein |
| 6 | 17500 | 18000 | Helicase | 318 | CV ZBD (Zinc Binding Domain), RNA virus helicase ATP-binding | ZBD in N-terminus displaying RNA and DNA duplex-unwinding activities with 5’ to 3’ polarity |
| 7 | 18000 | 18250 | Proofreading exoribonuclease | 154 | Chain | Homologous to Guanine-N7 methyltransferase |
| 8 | 23500 | 25500 | S protein | 63 | Receptor-binding domain (RBD) | RBD binds to the human ACE2 |
| 9 | 25500 | 25750 | ORF3a | 229 | Transmembrane domain(helical) | Forms homotetrameric potassium sensitive ion channels (viroporin) |
| 10 | 28000 | 28250 | ns8/ORF8 | 222 | Signal peptide and chain region | Plays a role in host-virus interaction |
Figure 3Shewhart Control Chart showing the hotspots in red dots above the UCL of SARS-CoV. The hotspot details are presented in supplementary table 1.
Figure 4Hotspots identified using SCC indicated as red dots with a bin size of 250. Supplementary table 2 shows the details of the hotspot locations.