| Literature DB >> 34665939 |
Zhenghu Jia1,2,3, Chen Liu2, Yuewen Chen2, Heng Jiang2, Zijing Wang2, Jialu Yao2, Jie Yang3, Jiaxing Zhu3, Boqing Zhang3, Zhiguang Yuchi2.
Abstract
The ongoing outbreak of COVID-19 caused by SARS-CoV-2 has resulted in a serious public health threat globally. Nucleocapsid protein is a major structural protein of SARS-CoV-2 that plays important roles in the viral RNA packing, replication, assembly, and infection. Here, we report two crystal structures of nucleocapsid protein C-terminal domain (CTD) at resolutions of 2.0 Å and 3.1 Å, respectively. These two structures, crystallized under different conditions, contain 2 and 12 CTDs in asymmetric unit, respectively. Interestingly, despite different crystal packing, both structures show a similar dimeric form as the smallest unit, consistent with its solution form measured by the size-exclusion chromatography, suggesting an important role of CTD in the dimerization of nucleocapsid proteins. By analyzing the surface charge distribution, we identified a stretch of positively charged residues between Lys257 and Arg262 that are involved in RNA-binding. Through screening a single-domain antibodies (sdAbs) library, we identified four sdAbs targeting different regions of nucleocapsid protein with high affinities that have future potential to be used in viral detection and therapeutic purposes.Entities:
Keywords: SARS-CoV-2; crystal structure; nanobodies; nucleocapsid protein
Mesh:
Substances:
Year: 2021 PMID: 34665939 PMCID: PMC8646419 DOI: 10.1111/febs.16239
Source DB: PubMed Journal: FEBS J ISSN: 1742-464X Impact factor: 5.622
Fig. 1Purification of the full‐length and truncated N‐proteins, and N‐protein targeting sdAbs. (A) A schematic picture of N‐protein domain organization (B) Sequence alignment of the six nonrepetitive sdAbs. The conserved residues are highlighted and the three CDR regions are indicated by the dashed boxes. (C) 15% SDS/PAGE showing protein marker (PM) in the left lanes and purified sdAbs in the right lanes. (D) Elution profile of the full‐length and truncated N‐proteins by SEC using a Superdex 200 16/600 column (GE Healthcare, Marlborough, MA, USA). The inset shows the plotted standard curve for this column and the representative 15% SDS/PAGE showing the purified N‐proteins.
Estimated molecular weights (MWs) and oligomeric forms of N‐protein constructs as determined by SEC using a Superdex 200 16/600 column.
| Construct | Elution volume (ml) | V/V0 | Estimated MW (kDa) | Estimated oligomeric form |
|---|---|---|---|---|
| FLN | 87.42 | 1.86 | 103.4 | 2.1 |
| NLC | 88.58 | 1.89 | 87.2 | 2.5 |
| NC | 92.27 | 1.96 | 61.6 | 2.2 |
| CTD | 100.16 | 2.13 | 26.5 | 2.1 |
| NTD | 103.10 | 2.20 | 19.0 | 1.2 |
Fig. 2sdAbs bind to the FLN. ITC binding isotherms show the interactions between six sdAbs (A–F) titrated into FLN.
Fig. 3sdAbs bind to the NLC. ITC binding isotherms show the interactions between four sdAbs (A–D) titrated into NLC.
Fig. 4sdAb‐N2 binds to the truncated N‐proteins. ITC binding isotherms show the interactions between sdAb‐N2 titrated into NTD+CTD (A) or NTD (B) or CTD (C). (D) Elution profile of the CTD+N2 complex and CTD alone by SEC using a Superdex 75 3.2/300 column and the representative 15% SDS/PAGE.
Thermodynamic parameters of binding between sdAbs and N‐proteins.
| sdAb | Construct of N‐protein | No. of sites (N) |
| ΔH (kcal·mol−1) | TΔS (kcal·mol−1) | ΔG (kcal·mol−1) |
|---|---|---|---|---|---|---|
| sdAb‐N1 | FLN | ‐ | No binding | ‐ | ‐ | ‐ |
| sdAb‐N2 | FLN | 0.457 | 1.75 ± 0.179 | 4.44 ± 0.146 | −12.3 | −7.86 |
| NLC | 0.530 | 2.24 ± 0.307 | 4.40 ± 0.206 | −12.1 | −7.71 | |
| NC | 0.495 | 1.77 ± 0.217 | 3.66 ± 0.147 | −11.5 | −7.85 | |
| CTD | 0.292 | 2.38 ± 0.243 | 5.85 ± 0.141 | −13.5 | −7.67 | |
| NTD | ‐ | No binding | ‐ | ‐ | ‐ | |
| sdAb‐N3 | FLN | 0.066 | 4.37 ± 1.68 | −80 ± 99.8 | 72.7 | −7.31 |
| NLC | 0.157 | 1.09 ± 0.485 | −2.90 ± 0.558 | −5.18 | −8.14 | |
| CTD | ‐ | No binding | ‐ | ‐ | ‐ | |
| NTD | ‐ | No binding | ‐ | ‐ | ‐ | |
| sdAb‐N4 | FLN | ‐ | No binding | ‐ | ‐ | ‐ |
| sdAb‐N5 | FLN | 0.697 | 3.97 ± 2.42 | −11.7 ± 3.22 | 4.3 | −7.37 |
| NLC | ‐ | No binding | ‐ | ‐ | ‐ | |
| sdAb‐N6 | FLN | 0.416 | 3.53 ± 0.50 | −17.8 ± 1.26 | 10.4 | −7.44 |
| NLC | ‐ | No binding | ‐ | ‐ | ‐ |
Fig. 5sdAb‐N3 binds with neither NTD nor CTD. ITC binding isotherms show the interactions between sdAb‐N3 titrated into NTD (A) or CTD (B).
Data collection and refinement statistics for the SARS‐CoV‐2 N‐CTD.
| Crystal | CTD (dimer) | CTD (dodecamer) |
|---|---|---|
| PDB ID |
|
|
| λ for data collection (Å) | 1.540562 | 0.9795 |
| Data collection | ||
| Space group | P1 | R3 |
| Cell dimension (Å) | ||
| a, b, c (Å) | 36.89, 37.21, 42.84 | 102.54, 102.54, 389.68 |
| α, β, γ, (°) | 78.68, 74.65, 65.46 | 90.00, 90.00, 120.00 |
| Total number of reflections observed | 69791 (7142) | 134323 (11777) |
| Number of unique reflections observed | 13449 (1358) | 27734 (2756) |
| Resolution limits | 28.70‐2.00 (2.07‐2.00) | 24.84‐3.10 (3.21‐3.10) |
| Rmerge | 0.062 (0.226) | 0.147 (0.589) |
| CC1/2 | 0.989 (0.475) | 0.989(0.837) |
| Average I/σ(I) | 25.26 (10.00) | 11.23 (2.46) |
| Completeness of data (%) | 99.94 (99.93) | 99.57 (99.31) |
| Data redundancy | 2.6 (2.7) | 4.8 (4.2) |
| Copies in the ASU | 2 | 12 |
| Refinement | ||
| Resolution limits | 28.70‐2.00 Å | 24.58‐3.10 Å |
| Number of reflections used | 13447 (1358) | 27675 (2754) |
|
Rfactor/Rfree (10% data) 7F2B Rfactor/Rfree (5% data) 7F2E | 0.157/0.214 | 0.268/0.297 |
| RMSD in bond‐lengths (Å) | 0.006 | 0.002 |
| RMSD in bond angles (°) | 0.80 | 0.43 |
| Number of atoms in the refined structure | ||
| Protein | 1648 | 8730 |
| Ligands | 32 | 20 |
| Solvent | 189 | 2 |
| Ramachandran plot (%) | ||
| Most favored | 98.08 | 93.72 |
| Additionally allowed | 1.92 | 5.44 |
| Average B‐factor (Å2) | 17.52 | 51.16 |
Values in parentheses refer to the highest resolution shell.
Fig. 6Crystal structures of N‐protein CTD. (A) The crystal structures of N‐protein CTD with 2 monomers in ASU (left) and 12 monomers in ASU (right). (B) Elution profile of N‐protein CTD by SEC using a Superdex 200 16/600 column (GE Healthcare). The inset shows the plotted standard curve for this column and the representative 15% SDS/PAGE showing the purified CTD protein. (C) The crystal structure of dimeric CTD shows the residues involved in dimer formation. (D) Sequence alignment of CTDs among SARS‐CoV‐1, SARS‐CoV‐2, and MERS. The residues involved in RNA‐binding are colored in yellow and involved in dimerization are colored in pink. (E) Surface view of dimeric CTD shows a positive binding pocket consisted of a stretch of positively charged residues between Lys257 and Arg262.