| Literature DB >> 27577872 |
Manfred S Weiss1, Kay Diederichs2, Randy J Read3, Santosh Panjikar4, Gregory D Van Duyne5, A Gregory Matera6, Utz Fischer7, Clemens Grimm7.
Abstract
A recent publication by Seng et al. in this journal reports the crystallographic structure of refolded, full-length SMN protein and two disease-relevant derivatives thereof. Here, we would like to suggest that at least two of the structures reported in that study are incorrect. We present evidence that one of the associated crystallographic datasets is derived from a crystal of the bacterial Sm-like protein Hfq and that a second dataset is derived from a crystal of the bacterial Gab protein. Both proteins are frequent contaminants of bacterially overexpressed proteins which might have been co-purified during metal affinity chromatography. A third structure presented in the Seng et al. paper cannot be examined further because neither the atomic coordinates, nor the diffraction intensities were made publicly available. The Tudor domain protein SMN has been shown to be a component of the SMN complex, which mediates the assembly of RNA-protein complexes of uridine-rich small nuclear ribonucleoproteins (UsnRNPs). Importantly, this activity is reduced in SMA patients, raising the possibility that the aetiology of SMA is linked to RNA metabolism. Structural studies on diverse components of the SMN complex, including fragments of SMN itself have contributed greatly to our understanding of the cellular UsnRNP assembly machinery. Yet full-length SMN has so far evaded structural elucidation. The Seng et al. study claimed to have closed this gap, but based on the results presented here, the only conclusion that can be drawn is that the Seng et al. study is largely invalid and should be retracted from the literature.Entities:
Year: 2016 PMID: 27577872 PMCID: PMC5418738 DOI: 10.1093/hmg/ddw298
Source DB: PubMed Journal: Hum Mol Genet ISSN: 0964-6906 Impact factor: 6.150
Model and refinement statistics for 4NL7/Hfq
| 4NL7, Seng | 4NL7, calculated by Phenix | 4NL7, Re-refined with Phenix | Hfq model, Refined against reindexed 4NL7 dataset | |
|---|---|---|---|---|
| Space group | C2 | C2 | C2 | P21 |
| Cell constants (Å, Å, Å, | 107.1, 62.3, 57.1, | 107.1, 62.3, 57.1, | 107.1, 62.3, 57.1, | 107.1, 62.3, 57.1, |
| °,°,°) | 90.0, 95.1, 90.0 | 90.0, 95.1, 90.0 | 90.0, 95.1, 90.0 | 90.0, 95.1, 90.0 |
| Resolution (Å) | 3.0 | 14 - 3.0 (3.11 - 3.00) | 14 - 3.0 (3.11 - 3.00) | 14 - 3.0 (3.11 - 3.00) |
| No. of reflections | ||||
| In refinement | 7286 {6702} | 6702 (657) | 6702 (657) | 6702 (657) |
| For Rfree | – | 312 (30) | 312 (30) | 312 (30) |
| Dataset completeness (%) | 88.9 {96.7} | 88.9 | 88.9 | 44.5 |
| Rfree (%) | ||||
| Rwork (%) | 32.7 {29.6} | 36.1 (33.0) | 33.5 (31.7) | 28.2 (33.0) |
| Protein residues | – | 272 | 272 | 792 |
| Water molecules | – | 20 | 20 | 0 |
| RMS (bonds) (Å) | 0.009 {0.013] | 0.017 | 0.011 | 0.004 |
| RMS (angles) (°) | 1.74 {1.93} | 3.08 | 1.56 | 0.73 |
| Ramachandran favoured (%) | 100 | 85 | 66 | 95 |
| Ramachandran allowed (%) | – | 15 | 26 | 4.7 |
| Ramachandran outliers (%) | 0 | 0.37 | 8.1 | 0 |
| Clashscore | – | 63.1 | 38.1 | 7.7 |
| Average isotropic B-factor (Å2) | − {26.7} | 26.9 | 17.3 | 92.8 |
| − Protein | – | 26.9 | 17.4 | 92.8 |
| − Water | – | 20.0 | 10.4 | – |
Values in () parentheses for highest resolution shell. Values in {} parentheses given if deviating information available in 4NL7 PDB header. Model statistics calculated with Phenix.
Figure 1.Self rotation function of the 4NL7 dataset, our Hfq model and the 4NL7 model. The three self-rotation functions were scaled relative to one another so that the contour lines are at approximately the same absolute self rotation function value. The κ = 180° sections for each of the three self rotation functions are displayed in the upper row, the κ = 60° sections in the lower row.
Self-rotation function peaks at κ = 60° and κ = 180° indicative of a 622 point group symmetry as percentage of crystallographic peak height for the 4NL7 dataset, our Hfq model, and the 4NL7 model
| Resolution | 3 Å | 5 Å | 6 Å | |||
|---|---|---|---|---|---|---|
| κ (°) | 60 | 180 | 60 | 180 | 60 | 180 |
| 4NL7 dataset | 71% | 72% | 64% | 57% | 61% | 66% |
| Hfq model | 78% | 77% | 71% | 68% | 79% | 79% |
| 4NL7 model | 31% | 32% | 26% | 27% | n. d. | 30% |
Figure 2.Crystal structure of the E. coli Hfq protein solved from the 4NL7 dataset. (A) - The Hfq hexamer in ribbon depiction. One protomer is coloured in purple, the His residues are shown as pink sticks. (B) - View of the crystal packing along the c axis. (C) - View of the crystal packing along the a axis.
E. coli Hfq entries in the PDB with cell constants (axis lengths in A, cell angles in °) comparable to the 4NL7 cell reduced to P1
| PDB ID code | Space group | a | b | c | α | β | γ |
|---|---|---|---|---|---|---|---|
| Reduced to P1 | 62.0 | 62.0 | 94.4 | 57.1 | 94.4 | 60.4 | |
| P1 | 61.2 | 61.2 | 53.1 | 82.6 | 87.3 | 60.0 | |
| P1 | 61.9 | 62.2 | 81.3 | 78.6 | 86.2 | 59.9 |
Figure 3.Real space correlation coefficient plot as a function of residue number. (A) Plot for peptide chain A of our Hfq model refined against the 4NL7 data in space group P21. (B) Plot for the SMN model from Seng et al. from PDB entry 4NL7 without modifications.
Model and refinement statistics for 4NL6/Gab
| 4NL6, Seng | 4NL6, calculated by Phenix | 4NL6 dataset, reindexed and refined against bacterial Gab | |
|---|---|---|---|
| C2 | C2 | I422 | |
| 137.0, 169.8, 108.8 | 137.0, 169.8, 108.8 | 120.2, 120.2, 137.0 | |
| 90.0, 128.5, 90.0 | 90.0, 128.5, 90.0 | 90.0, 90.0, 90.0 | |
| 5.5 | 5.5 (5.649 - 5.5) | 5.5 (5.649 - 5.5) | |
|
| 7946 | 4511 (428) | 4511 (429) |
|
| – | 198 (22) | 198 (22) |
| 61.3 | 71.0 | 71.0 | |
| 30.6 {29.9} | 45.4 (49.2) | 15.3 (21.0) | |
| – | 880 | 1228 | |
| – | – | 0 | |
| – | – | 0.010 | |
| – | – | 1.25 | |
| – | – | 97 | |
| – | – | 2.7 | |
| – | – | 0.66 | |
| – | 30.7 | 10.92 | |
| – | 30.7 | 199 |
Values in () parentheses for highest resolution shell. Values in {} parentheses given if deviating information available in 4NL6 PDB header. Model statistics calculated with Phenix.