| Literature DB >> 32218151 |
Suhas Srinivasan1, Hongzhu Cui2, Ziyang Gao2, Ming Liu2, Senbao Lu2, Winnie Mkandawire2, Oleksandr Narykov3, Mo Sun2, Dmitry Korkin1,2,3.
Abstract
During its first two and a half months, the recently emerged 2019 novel coronavirus, SARS-CoV-2, has already infected over one-hundred thousand people worldwide and has taken more than four thousand lives. However, the swiftly spreading virus also caused an unprecedentedly rapid response from the research community facing the unknown health challenge of potentially enormous proportions. Unfortunately, the experimental research to understand the molecular mechanisms behind the viral infection and to design a vaccine or antivirals is costly and takes months to develop. To expedite the advancement of our knowledge, we leveraged data about the related coronaviruses that is readily available in public databases and integrated these data into a single computational pipeline. As a result, we provide comprehensive structural genomics and interactomics roadmaps of SARS-CoV-2 and use this information to infer the possible functional differences and similarities with the related SARS coronavirus. All data are made publicly available to the research community.Entities:
Keywords: 2019 novel coronavirus; 2019-nCoV; COVID-19; SARS-CoV-2; interactome; protein-protein interactions; structural genomics
Mesh:
Substances:
Year: 2020 PMID: 32218151 PMCID: PMC7232164 DOI: 10.3390/v12040360
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
The list of SARS-CoV-2 proteins analyzed and structurally characterized in this work.
| Protein | Accession | wORF1ab Region | Modeled Length | Template PDB id | Trgt-Tmplt Seq ID | Organism |
|---|---|---|---|---|---|---|
| wS, surface glycoprotein | YP_009724390 | 1273 | 6ACK | 75% | SARS-CoV | |
| wE, envelope protein | YP_009724392 | 75 | 5X29 | 89% | SARS-CoV | |
| wORF7a | YP_009724395 | 121 | 1YO4 | 90% | SARS-CoV | |
| wN, nucleocapsid phosphoprotein | YP_009724397 | 419 | 2JW8 | 96% | SARS-CoV | |
| 1SSK | 83% | SARS-CoV | ||||
| 4UD1 | 51% | MERS-CoV | ||||
| wNsp1 | YP_009725297 | 13-127 | 115 | 2HSX | 86% | SARS-CoV |
| wNsp3-domain1 | YP_009725299 | 819-926 | 107 | 2GRI | 79% | SARS-CoV |
| wNsp3-domain2 | YP_009725299 | 1024-1198 | 175 | 2ACF | 72% | SARS-CoV |
| wNsp3-domain3 | YP_009725299 | 1232-1494 | 263 | 2WCT | 76% | SARS-CoV |
| wNsp3-domain4 | YP_009725299 | 1495-1550 | 66 | 2KAF | 70% | SARS-CoV |
| wNsp3-domain5 | YP_009725299 | 1564-1878 | 315 | 3E9S | 82% | SARS-CoV |
| wNsp3-domain6 | YP_009725299 | 1908-2763 | 113 | 2K87 | 82% | SARS-CoV |
| wNsp4 | YP_009725300 | 3173-3263 | 91 | 3VC8 | 60% | MHV |
| wNsp5 | YP_009725301 | 3264-3569 | 306 | 2GT7 | 96% | SARS-CoV |
| wNsp7 | YP_009725302 | 3860-3942 | 83 | 1YSY | 67% | SARS-CoV |
| wNsp8 | YP_009725304 | 4019-4132 | 114 | 6NUR | 85% | SARS-CoV |
| wNsp9 | YP_009725305 | 4041-4253 | 113 | 3EE7 | 99% | SARS-CoV |
| wNsp10 | YP_009725306 | 4262-4382 | 121 | 2G9T | 98% | SARS-CoV |
| wNsp12 | YP_009725307 | 4542-5311 | 770 | 6NUR | 97% | SARS-CoV |
| wNsp13 | YP_009725308 | 5325-5920 | 596 | 6JYT | 100% | SARS-CoV |
| wNsp14 | YP_009725309 | 5926-6451 | 526 | 5C8U | 95% | SARS-CoV |
| wNsp15 | YP_009725310 | 6452-6797 | 346 | 2H85 | 86% | SARS-CoV |
| wNsp16 | YP_009725311 | 6800-7087 | 288 | 2XYQ | 94% | SARS-CoV |
Figure 1Structural genomics and interactomics road map. Shown are the individual proteins and protein complexes targeted for structural characterization together with PDB ID of their templates from the Protein Data Bank (PDB).
Figure 2Structurally characterized non-structural proteins of SARS-CoV-2. Highlighted in pink are mutations found when aligning the proteins against their homologs from the closest related coronaviruses: human SARS-CoV, bat coronavirus BtCoV, and another bat betacoronavirus BtRf-BetaCoV. The structurally resolved part of wNsp7 shares 100% sequence identity to its homolog.
Figure 3Structurally characterized intra-viral and host–viral protein–protein interaction complexes of SARS-CoV-2. Human proteins (colored in orange) are identified through their gene names. For each intra-viral structure, the number of subunits involved in the interaction is specified.
Figure 4Evolutionary conservation of functional sites in SARS-CoV-2 proteins. (A) Fully conserved protein binding sites (PBS, light orange) of wNsp12 in its interaction with wNsp7 and wNsp8, while other parts of the protein surface shows mutations (magenta); (B) Both the major monoclonal antibody binding site (light orange) and the ACE2 receptor binding site (dark green) of wS are heavily mutated (binding site mutations are shown in red) compared to the same binding sites in other coronaviruses; mutations not located on the two binding sites are shown in magenta; (C) Nearly intact protein binding site (light orange) of wNsp (papain-like protease PLpro domain) for its putative interaction with human ubiquitin-aldehyde (binding site mutations of the only two residues are shown in red, non-binding site mutations are shown in magenta); (D) Fully conserved inhibitor ligand binding site (LBS, green) for wNsp5; non-binding site mutations are shown in magenta.
Figure 5Structurally characterized structural proteins and an ORF of SARS-CoV-2. Highlighted in pink are mutations found when aligning the proteins against their homologs from the closest related coronaviruses: human SARS-CoV, bat coronavirus BtCoV, and another bat betacoronavirus BtRf-BetaCoV. Highlighted in yellow are novel protein inserts found in wS.
Figure 6The unified interactome with SARS-CoV-2 interactions inferred through homology. (A) The SARS-CoV intra-viral and virus–host interactions are merged to create a unified interaction network indicating the types of proteins; the size of a node reflects the node degree. The SARS-CoV-2 interactions that are inferred from SARS-CoV are represented by green edges, with stars indicating the suspected disruption of the interaction based on the evolutionary conservation of the binding sites. The loops correspond to homo-oligomeric interactions. (B) Structural modeling prediction of interaction between SARS-CoV-2 S protein and three monoclonal antibodies.
Network parameters of the SARS-CoV intra-viral, virus–host and unified networks. The table shows topological statistics for the three networks. Among the many computed statistics, the shown parameters include the number of nodes and edges in the networks, the average degree, number of components (independent networks), diameter (maximum shortest path), and clustering coefficient.
| Network Parameters | SARS-CoV Intra-viral Interactome | SARS-CoV-Host Interactome | Unified Interactome |
|---|---|---|---|
| No. of nodes | 31 | 118 | 125 |
| No. of edges | 86 | 114 | 206 |
| No. of components | 1 | 8 | 2 |
| Diameter | 4 | 14 | 7 |
| Average degree | 4.710 | 1.95 | 3.04 |
| Clustering coefficient | 0.448 | 0.0 | 0.068 |