| Literature DB >> 32363136 |
Sisi Kang1, Mei Yang1, Zhongsi Hong2, Liping Zhang3, Zhaoxia Huang1, Xiaoxue Chen1, Suhua He1, Ziliang Zhou1, Zhechong Zhou1, Qiuyue Chen1, Yan Yan1, Changsheng Zhang3, Hong Shan1,4,5, Shoudeng Chen1,4,6.
Abstract
The outbreak of coronavirus disease (COVID-19) caused by SARS-CoV-2 virus continually lead to worldwide human infections and deaths. Currently, there is no specific viral protein-targeted therapeutics. Viral nucleocapsid protein is a potential antiviral drug target, serving multiple critical functions during the viral life cycle. However, the structural information of SARS-CoV-2 nucleocapsid protein remains unclear. Herein, we have determined the 2.7 Å crystal structure of the N-terminal RNA binding domain of SARS-CoV-2 nucleocapsid protein. Although the overall structure is similar as other reported coronavirus nucleocapsid protein N-terminal domain, the surface electrostatic potential characteristics between them are distinct. Further comparison with mild virus type HCoV-OC43 equivalent domain demonstrates a unique potential RNA binding pocket alongside the β-sheet core. Complemented by in vitro binding studies, our data provide several atomic resolution features of SARS-CoV-2 nucleocapsid protein N-terminal domain, guiding the design of novel antiviral agents specific targeting to SARS-CoV-2.Entities:
Keywords: Antiviral targeting site; COVID-19; Coronavirus; Crystal structure; Nucleocapsid protein; RNA binding domain; SARS-CoV-2
Year: 2020 PMID: 32363136 PMCID: PMC7194921 DOI: 10.1016/j.apsb.2020.04.009
Source DB: PubMed Journal: Acta Pharm Sin B ISSN: 2211-3835 Impact factor: 11.413
Data collection and refinement statistics.
| Item | SARS-CoV-2 N-NTD |
|---|---|
| Protein Data Bank code | 6M3M |
| Wavelength (Å) | 1.5418 |
| Resolution range | 20.92–2.7 (2.796–2.7) |
| Space group | |
| Unit cell | |
| | 58.88, 92.68, 97.32 |
| | 90, 90, 90 |
| Total reflections | 98,913 (10,077) |
| Unique reflections | 15,133 (1481) |
| Multiplicity | 6.5 (6.8) |
| Completeness (%) | 99.55 (99.80) |
| Mean I/sigma (I) | 14.97 (2.9) |
| Wilson B-factor | 33.94 |
| 0.1043 (0.3172) | |
| 0.1135 (0.3436) | |
| 0.04388 (0.1303) | |
| CC1/2 | 0.991 (0.96) |
| CC | 0.998 (0.99) |
| Reflections used in refinement | 15,126 (1481) |
| Reflections used for | 1514 (138) |
| 0.2578 (0.3551) | |
| 0.2934 (0.4058) | |
| CC (work) | 0.908 (0.692) |
| CC (free) | 0.851 (0.635) |
| Number of non-hydrogen atoms | 3952 |
| Macromolecules | 3822 |
| Solvent | 130 |
| Protein residues | 499 |
| RMS (bonds) (Å) | 0.004 |
| RMS (angles) (°) | 0.72 |
| Ramachandran favored (%) | 96.48 |
| Ramachandran allowed (%) | 3.52 |
| Ramachandran outliers (%) | 0.00 |
| Rotamer outliers (%) | 0.00 |
| Clashscore | 11.15 |
| Average B-factor | 31.60 |
| Macromolecules | 31.86 |
| Solvent | 24.03 |
Statistics for the highest-resolution shell are shown in parentheses.
R-merge=∑∑|I(hkl)–|/∑∑I(hkl), where I(hkl) is the intensity measured for the i th reflection and < I(hkl) > is the average intensity of all reflections with indices hkl.
R-meas, redundancy–independent (multiplicity–weighted) Rmerge,.
R-pim, precision–indicating (multiplicity–weighted) Rmerge,.
R-work=∑||Fobs(hkl)|–|Fcalc(hkl)||/∑|Fobs(hkl)|.
R-free is calculated in an identical manner using 10% of randomly selected reflections that were not included in the refinement.
Figure 1Sequence features of SARS-CoV-2 nucleocapsid protein. (A) The complete whole genomic features of SARS-CoV-2 isolate Wuhan-Hu-1 (Genebank: MN908947). UTR: untranslated region; orf/ORF: open reading frame; TRS: transcriptional regulatory sequences; S: spike glycoprotein encoding region; E: envelope protein encoding region; M: membrane protein encoding region; N: nucleocapsid protein encoding region. The figure is illustrated by SnapGene Viewer. (B) Domain architectures of coronavirus nucleocapsid protein. NTD: N-terminal RNA-binding domain; CTD: C-terminal dimerization domain. (C) Multiple sequence alignment of SARS-CoV-2 N-NTD with SARS-CoV N-NTD (UniProtKB: P59595), MERS-CoV N-NTD (UniProtKB: R9UM87), HCoV-OC43 N-NTD (UniProtKB: P33469). Red arrows indicate conserved residues for ribonucleotide binding sites, and dash boxes indicate residues variably in the structural comparisons.
Figure 2Structural overview of SARS-CoV-2 N-NTD. (A) Ribbon representation of SARS-CoV-2 N-NTD molecules in one asymmetric unit. The four molecules are highlighted with different colors, respectively. (B) Superimpositions of four molecules in one asymmetric unit. The dash circles indicate the sandwiched structure composed of Loop region 1, β-sheet core, and Loop region 2. The β-strand is labeled with β1 to β7, and the 310 helix is labeled with 1. (C) Topological style illustration of SARS-CoV-2 N-NTD structure. (D) Electrostatic surface of the SARS-CoV-2 N-NTD. Blue denotes positive charge potential, while red indicates negative charge potential. The potential distribution was calculated by Pymol. The values range from –6kT (red) to 0 (white) and to +6kT (blue), where k is the Boltzmann constant, and T is the temperature.
Figure 3Comparison of SARS-CoV-2 N-NTD with related viral N-NTD structures. (A) Mapping the conserved surfaces of four CoV N-NTDs in SARS-CoV-2 N-NTD structure. The multiple sequence alignment used for mapping is shown in Fig. 1C. Blue denotes conserved residues among 4 CoVs N-NTD; green denotes one variation among 4 CoVs N-NTD; pink denotes two variations among 4 CoVs N-NTD; red denotes three variations among 4 CoVs N-NTD. (B) Electrostatic surface of the SARS-CoV-2 N-NTD, SARS-CoV N-NTD, MERS-CoV N-NTD, HCoV-OC43 N-NTD. Blue denotes positive charge potential, while red indicates negative charge potential. (C) Overall structural comparison of SARS-CoV-2 N-NTD with related viral N-NTD structures. Left: superimposition of SARS-CoV-2 N-NTD (blue) to SARS-CoV N-NTD (green); middle: superimposition of SARS-CoV-2 N-NTD (blue) to MERS-CoV N-NTD (brown); right: superimposition of SARS-CoV-2 N-NTD (blue) to HCoV-OC43 N-NTD (orange).
Figure 4A potential unique drug target pocket in SARS-CoV-2 N-NTD. (A) Detailed view of ribonucleotide binding pocket in superimposition structures between SARS-CoV-2 N-NTD with HCoV-OC43 N-NTD AMP complex. AMP, interacting residues and equivalents are highlighted with stick representation. (B) Electrostatic surface of the potential ribonucleotide binding pocket on SARS-CoV-2 N-NTD. (C) Electrostatic surface of the ribonucleotide binding pocket on HCoV-OC43 N-NTD. (D) Detailed view of phosphate group binding site in superimposition structures between SARS-CoV-2 N-NTD with HCoV-OC43 N-NTD AMP complex. (E) Dot representation of SARS-CoV-2 residues Thr55 and Ala56, which indicates potential steric clashes with the ribonucleotide phosphate group. (F) Dot representation of HCoV-OC43 N-NTD residues Ser67 and Gly68. (G) Detailed view of the nitrogenous base binding site in superimposition structures between SARS-CoV-2 N-NTD with HCoV-OC43 N-NTD AMP complex. (H) Electrostatic surface of the potential ribonucleotide nitrogenous base binding pocket on SARS-CoV-2 N-NTD. (I) Electrostatic surface of the ribonucleotide nitrogenous base binding pocket on HCoV-OC43 N-NTD. In electrostatic surface potential panels, blue denotes positive charge potential, while red indicates negative charge potential. The potential distribution was calculated by Pymol. The values range from –6kT (red) to 0 (white) and to +6kT (blue), where k is the Boltzmann constant, and T is the temperature.