| Literature DB >> 28580734 |
Robert G Hammond1, Xuan Tan1, Margaret A Johnson1.
Abstract
The coronavirus nonstructural protein 3 (nsp3) is a multifunctional protein that comprises multiple structural domains. This protein assists viral polyprotein cleavage, host immune interference, and may play other roles in genome replication or transcription. Here, we report the solution NMR structure of a protein from the "SARS-unique region" of the bat coronavirus HKU9. The protein contains a frataxin fold or double-wing motif, which is an α + β fold that is associated with protein/protein interactions, DNA binding, and metal ion binding. High structural similarity to the human severe acute respiratory syndrome (SARS) coronavirus nsp3 is present. A possible functional site that is conserved among some betacoronaviruses has been identified using bioinformatics and biochemical analyses. This structure provides strong experimental support for the recent proposal advanced by us and others that the "SARS-unique" region is not unique to the human SARS virus, but is conserved among several different phylogenetic groups of coronaviruses and provides essential functions.Entities:
Keywords: NMR; SARS-unique domain; coronavirus; double-wing motif; frataxin; nonstructural protein; protein functional annotation; viral protein
Mesh:
Substances:
Year: 2017 PMID: 28580734 PMCID: PMC5563143 DOI: 10.1002/pro.3208
Source DB: PubMed Journal: Protein Sci ISSN: 0961-8368 Impact factor: 6.725
Figure 12D [15N,1H]‐HSQC spectrum of the HKU9 C domain. 1.2 mM 15N‐labeled HKU9 C in a 20 mM sodium phosphate (pH = 6.0), 150 mM NaCl, 3% d10‐DTT, 0.02% (w/v) NaN3 solution was measured on a Bruker Avance III 600MHz spectrometer. Backbone 15N–1H correlation peaks are indicated by single letter amino acid nomenclature. Arg and Trp assigned 15N‐1Hε correlation peaks are labeled. The amide side chain 15N–1H2 signals from Asn and Gln are shown with horizontal lines
Input for the Structure Calculation of HKU9 C and the Statistics of the 20 Energy‐Minimized Conformers Used to Represent the Solution Structure
| Quantity | Value |
|---|---|
| NOE upper distance limits | 1828 |
| Intraresidue (| | 311 |
| Sequential (| | 528 |
| Medium‐range (1<| | 329 |
| Long‐range (| | 660 |
| Dihedral angle constraints | |
| Talos + | 132 |
| HABAS (CYANA) | 347 |
| NOEs per residue | 23.74 |
| Long‐Range NOEs per residue | 8.57 |
| CYANA minimized target function | 1.70 ± 0.40 |
| Residual NOE Violations | |
| Number ≥ 0.2 Å | 6 |
| RMS violation | 0.0214 |
| Residual dihedral angle violations | |
| Number ≥ 5.0° | 1 |
| RMS violation | 0.3553 |
| RMSD from Ideal Geometry | |
| Bond Lengths, Å | 0.016 |
| Bond Angles, ° | 2.8 |
| RMSD to the mean coordinates, Å | |
| Backbone (574–645) | 0.34 ± 0.11 |
| Heavy Atom (574–645) | 0.72 ± 0.08 |
| Ramachandran plot statistics | |
| Most favored regions (%) | 90.2 |
| Allowed regions (%) | 6.1 |
| Disallowed regions (%) | 3.7 |
As determined by MOLPROBITY [81]. Calculated using PSVS version 1.5 [78].
Residue range used to calculate the backbone and heavy‐atom RMSDs.
Figure 2NMR solution structure of the HKU9 C domain. Wall‐eye stereo views are shown. (A) Ribbon representation of the representative conformer (nearest to the mean coordinates of the ensemble). Secondary structures are labeled. (B) Line representation of the 20‐conformer ensemble. The polypeptide backbone (blue) and selected side chains with solvent accessibility below 15% (red) are shown. (C) Topology diagram of HKU9‐C. The α‐helices (red) are indicated by rectangles and the β‐sheets (yellow) are indicated by arrows
Bioinformatics Results from HKU9 C Structure‐Based Alignment and Protein Function Prediction
| Identifier | Protein Name | RMSD | %ID | ||
|---|---|---|---|---|---|
| A | |||||
|
| PDB ID: 2KAF | SARS nsp3 C domain | 1.66 | 18 | |
| PDB ID: 4YPT | MHV nsp3 domain | 2.21 | 15 | ||
| PDB ID: 1KAF | Phage T4 MotA | 2.84 | 7 | ||
| PDB ID: 1YB3 |
| 3.12 | 1 | ||
|
| PDB ID: 2KAF | SARS nsp3 C domain | 1.8 | 18 | |
| PDB ID: 4YPT | MHV nsp3 domain | 2.3 | 15 | ||
| PDB ID: 1KAF | Phage T4 MotA | 2.7 | 9 | ||
| PDB ID: 4HS5 |
| 2.8 | 9 | ||
| PDB ID: 3T3X | Friedreich's Ataxia FTXN | 2.9 | 6 | ||
| PDB ID: 1YB3 |
| 3.3 | 1 | ||
TM‐Align[50] scales structure similarity to protein templates from residue‐specific alignment.
DALI[51] server uses structure‐based templates to provide matches in structure and function.
Predicted binding regions of the HKU9 C protein are described: residues from the α‐helices (α), residues from the solvent‐accessible β‐sheet (β), and the conserved polar face containing the Arg‐Asp‐Trp and Lys‐Arg‐Gly motif (CF).
The confidence score (C‐score) is used to evaluate the reliability of the prediction. The binding site score (BS‐score) evaluates how significant is the match between the predicted binding site and the template binding site. Alignment score values range from 0 to 1, with higher values having greater significance.
Figure 3(A) Ribbon representations of nonstructural protein 3 C domains. Conserved residues or conservative substitutions relative to HKU9 are highlighted in green (aliphatic), blue (basic), and red (acidic). Residue numbers are indicated with respect to the first residue in each protein. Left to right: HKU9 C, SARS SUD‐C (PDB ID: 2KAF),35 MHV C domain (PDB ID: 4YPT),30 human frataxin (PDB ID: 3T3X).52 (B) C domain electrostatic potential surfaces. Red areas represent positively charged regions, blue areas represent negatively charged areas, and white areas represent neutral areas. (C) Structure‐based sequence alignment of the C domain in the Rousettus bat coronavirus HKU9, related viral proteins, and with other proteins in the frataxin fold family: phage T4 MotA (PDB ID: 1KAF),53 hypothetical protein (PDB ID: 1YB3), Psychromonas ingrahamii FTXN (PDB ID: 4HS5),52 Ataxia FTXN (PDB ID: 3T3X),54 AcsD (PDB ID: 2W04).55 The alignment is based on structural alignments obtained with TM‐Align.50 PDB codes are included after each protein name. The residue numbers for HKU9‐C are indicated. Alpha helix regions are displayed in red (cylinders) and beta strands are shown in blue (arrows). Gaps are shown as dashes (‐) and insertions where additional secondary structures are present are indicated by forward slash marks (//). Residues indicated by stars (*) discussed in the text are involved in potential functional sites. The corresponding Dali scores for the pairwise alignment of each protein with HKU9 C and the percent amino acid identity between each protein and HKU9 C domain are listed. Dali scores of 2.0 and higher indicate significant sequence identity51
Figure 4Overlay of HKU9‐C (green) and SARS‐C (cyan) backbone with secondary structures shown. Sidechains from Arg 588‐Asp 589‐Trp 590, and Lys 609‐Arg 610‐Gly 611 (HKU9 C) and Arg 670‐Asp 671‐Trp 672, and Lys 687‐Arg 688‐Gly 689 (SARS C) are shown with the corresponding one‐letter amino acid code