| Literature DB >> 32913581 |
João Trigueiro-Louro1,2, Vanessa Correia1, Inês Figueiredo-Nunes2, Marta Gíria2, Helena Rebelo-de-Andrade1,2.
Abstract
There are no approved target therapeutics against SARS-CoV-2 or other beta-CoVs. The beta-CoV Spike protein is a promising target considering the critical role in viral infection and pathogenesis and its surface exposed features. We performed a structure-based strategy targeting highly conserved druggable regions resulting from a comprehensive large-scale sequence analysis and structural characterization of Spike domains across SARSr- and MERSr-CoVs. We have disclosed 28 main consensus druggable pockets within the Spike. The RBD and SD1 (S1 subunit); and the CR, HR1 and CH (S2 subunit) represent the most promising conserved druggable regions. Additionally, we have identified 181 new potential hot spot residues for the hSARSr-CoVs and 72 new hot spot residues for the SARSr- and MERSr-CoVs, which have not been described before in the literature. These sites/residues exhibit advantageous structural features for targeted molecular and pharmacological modulation. This study establishes the Spike as a promising anti-CoV target using an approach with a potential higher resilience to resistance development and directed to a broad spectrum of Beta-CoVs, including the new SARS-CoV-2 responsible for COVID-19. This research also provides a structure-based rationale for the design and discovery of chemical inhibitors, antibodies or other therapeutic modalities successfully targeting the Beta-CoV Spike protein.Entities:
Keywords: ACE2, angiotensin-converting enzyme2; Bat-SL-CoVs, bat SARS-like coronavirus; Beta-CoVs, betacoronavirus; Betacoronavirus; CC, conserved cluster; CD, connector domain; CDP, consensus druggable pocket; CDR, consensus druggable residue; CH, central helix; CP, cytoplasmic domain; CR, connecting region; CS, conservation score; CoVs, coronavirus; Coronavirus disease; DGSS, DoGSiteScorer; DPP4, dipeptidyl peptidase-4; Druggability prediction; FP, fusion peptide; HR1, heptad repeat 1; HR2, heptad repeat 2; MERS-CoVs, middle east respiratory syndrome coronavirus; MERSr-CoVs, middle east respiratory syndrome-related coronavirus; MSA, multiple sequence alignment; NTD, N-terminal domain; Novel antiviral targets; PDB, Protein Data Bank; PDS, PockDrug-Server; RBD, Receptor-Binding Domain; S, Spike; SARS-CoV-2; SARS-CoV-2, severe acute respiratory syndrome coronavirus 2; SARS-CoVs, severe acute respiratory syndrome coronavirus; SARSr-CoVs, severe acute respiratory syndrome-related coronavirus; SD1, subdomain 1; SD2, subdomain 2; SF, SiteFinder from MOE; SP, small pocket; Sequence conservation; Spike protein; Sv, shorter variant; T-RHS, top-ranked hot spots; TMPRSS2, transmembrane protease serine 2; aa, amino acid; hSARSr-CoVs, human Severe acute respiratory syndrome-related coronavirus; nts, nucleotides
Year: 2020 PMID: 32913581 PMCID: PMC7452956 DOI: 10.1016/j.csbj.2020.07.017
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Distinctive features (length variation and cleavage site) and number of sequences enclosed in the datasets of the Spike S1 and S2 subunits of each betacoronavirus analysed.
| Genus | Subgenus | Species | Lineage | Virus | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Polybasic | ||||||||||||||||
| Position in the genome (nts) a,b | Lenght | Position in the genome (nts) a | Lenght | Number of sequences | Position in the genome (nts) b | Lenght | Number of sequences | |||||||||
| nts | aa | nts | aa | nts | aa | |||||||||||
| Sarbecovirus | SARSr-Cov | B | SARS-CoV-2 | 21314–25135 | 3822 | 1273 | 21314–23368 | 2055 | 685 | 674 | 23369–25135 | 1767 | 588 | 682 | 682 | |
| SARS-CoV | 21228–24995 | 3768 | 1255 | 21228–23228 | 2001 | 667 | 114 | 23229–24995 | 1767 | 588 | 116 | – | ||||
| bat-SL-CoV | 21,264 - [24950–25073] | 3687–3810 | 1228–1269 | 21,264 - [23183–23306] | 1920–2043 | 640–681c | 50 | [23184–23307] - [24950–25073] | 1767 | 588 | 50 | – | ||||
| Merbecovirus | MERSr-Cov | C | MERS-CoV | 21178–25239 | 4062 | 1353 | 21178–23430 | 2253 | 751 | 248 | 23431–25239 | 1809 | 602 | 248 | 682 | |
| TOTAL | TOTAL | |||||||||||||||
a counting from the start codon of the first genome-encoded protein (nsp1); b includes the STOP codon; c 15 different lengths: 649 (15/50); 648 (9/50); 653 (7/50); 663 (5/50); 640, 668, and 667 (2/50); 681, 662, 657, 658, 647, 654, 646, and 671 (1/50); d SARS-CoV-2 numbering
Fig. 1Three-dimensional sequence conservation of the human SARSr-CoVs (A) and SARSr- and MERSr-CoVs (B) Spike protomers (PDB ID: 6VYB). The cs are mapped onto the S protein structures. The figure is coloured based on cs: the highest conservation positions are highlighted in shades of blue and the red regions indicate low conservation according to the cs scale. The S1 and S2 domains resolved are indicated in black and grey, respectively, in the hSARSr-CoV protomer. NTD: N-terminal domain; RBD: receptor-binding domain; SD: subdomain; FP: fusion peptide; CR: connecting region, HR: heptad repeat; CH: central helix. The figures were produced with PyMOL Molecular Graphics System [101]. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Summary of the conserved regions identified in the S1 and S2 subunits of the Spike protein. Only conserved clusters comprised by a minimum of 10 aa and with a conserved score ≥ 7 were considered for either the hSARSr-CoVs, the SARSr-CoVs (including the bat-SL-CoVs) and the SARSr- and MERSr-CoVs. Regions common to all three groups are highlighted in bold and blue.
| T51-T63 | 13 | S691-E702 | 12 | T51-T63 | 13 | S691-E702 | 12 | ||||||||||
| T95-G107 | 13 | M740-C749 | 10 | T95-G107 | 13 | M740-C749 | 10 | ||||||||||
| T274-C291 | 18 | N751-L767 | 17 | A262-P272 | 11 | N751-L767 | 17 | ||||||||||
| F797-P809 | 13 | T274-C291 | 18 | F797-P809 | 13 | ||||||||||||
| I326-F347 | 22 | – | – | ||||||||||||||
| R355-A372 | 18 | K835-I844 | 10 | I326-F347 | 22 | K835-I844 | 10 | – | – | ||||||||
| A846-A871 | 26 | R355-A372 | 18 | A846-A871 | 26 | – | – | ||||||||||
| N394-G416 | 23 | ||||||||||||||||
| I418-F429 | 12 | I418-F429 | 12 | ||||||||||||||
| G502-L518 | 17 | A1056-P1069 | 14 | G502-L518 | 17 | A1056-P1069 | 14 | ||||||||||
| Q1071-V1096 | 26 | Q1071-V1096 | 26 | ||||||||||||||
| P589-S605 | 17 | P589-S605 | 17 | ||||||||||||||
| A623-G639 | 17 | L1234-G1246 | 13 | A623-G639 | 17 | L1234-G1246 | 13 | ||||||||||
| N641-N657 | 17 | C1248-T1273 | 26 | N641-N657 | 17 | C1248-T1273 | 26 | ||||||||||
| – | – | – | – | – | – | ||||||||||||
Fig. 2Overall alignment of Spike-RBD druggability along with the conservation scores for each residue position. The druggability prediction was based on the descriptors algorithm of each pocket bioinformatics tool: SF, DGSS and PDS. The potential conserved druggable sites/residues are marked with an asterisk and the T-RHS are marked with a target. The conserved druggable pockets allocated to each site/residue along with the secondary structure elements are indicated at the top of the picture.
Fig. 3Mapping results of the highest-ranked consensus druggable pockets onto the Spike-RBD crystallographic structure of SARS-CoV-2 (PDB: 6VW1). The top-ranked pockets: CDP1 to CDP5 are highlighted in shades of pink, pale blue, orange, green, yellow and cyan, respectively. The number and aa composition of each CDP are shown in Supplemental Table S-2. The figures were produced with PyMOL Molecular Graphics System [101]. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 4Mapping results of the highest-ranked consensus druggable pockets onto the Spike trimer crystallographic structure of SARS-CoV-2 (PDB: 6VYB). The top-ranked pockets: CDP5T-S1, CDP6T-S1, CDP9T-S1, CDP2T-S2, CDP4T-S2, CDP8T-S2, CDP2T-S1S2, CDP3T-S1S2, CDP5T-S1S2 are highlighted in shades of yellow, turquoise, pale blue, cyan, orange, pink, grey, green and raspberry, respectively. The number and aa composition of each CDP are shown in Supplemental Table S-2. The figures were produced with PyMOL Molecular Graphics System [101]. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 5Overall alignment of Spike S1 and S2 druggability along with the conservation scores for each residue position. The druggability prediction was based on the descriptors algorithm of each pocket bioinformatics tool: SF and DGSS; and for all the predicted S state conformation (open, semi-open and open). The potential conserved druggable sites/residues are marked with an asterisk and the T-RHS are marked with a target. The conserved druggable residues shared by the S monomer and trimer conformations for the hSARSr-CoVs are coloured in blue; and the conserved druggable pockets allocated to each site/residue are indicated at the top of the picture. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)