| Literature DB >> 28876235 |
Jill Trewhella1, Anthony P Duff2, Dominique Durand3, Frank Gabel4, J Mitchell Guss1, Wayne A Hendrickson5, Greg L Hura6, David A Jacques7, Nigel M Kirby8, Ann H Kwan1, Javier Pérez9, Lois Pollack10, Timothy M Ryan8, Andrej Sali11, Dina Schneidman-Duhovny12, Torsten Schwede13, Dmitri I Svergun14, Masaaki Sugiyama15, John A Tainer16, Patrice Vachette3, John Westbrook17, Andrew E Whitten2.
Abstract
In 2012, preliminary guidelines were published addressing sample quality, data acquisition and reduction, presentation of scattering data and validation, and modelling for biomolecular small-angle scattering (SAS) experiments. Biomolecular SAS has since continued to grow and authors have increasingly adopted the preliminary guidelines. In parallel, integrative/hybrid determination of biomolecular structures is a rapidly growing field that is expanding the scope of structural biology. For SAS to contribute maximally to this field, it is essential to ensure open access to the information required for evaluation of the quality of SAS samples and data, as well as the validity of SAS-based structural models. To this end, the preliminary guidelines for data presentation in a publication are reviewed and updated, and the deposition of data and associated models in a public archive is recommended. These guidelines and recommendations have been prepared in consultation with the members of the International Union of Crystallography (IUCr) Small-Angle Scattering and Journals Commissions, the Worldwide Protein Data Bank (wwPDB) Small-Angle Scattering Validation Task Force and additional experts in the field.Entities:
Keywords: DNA; RNA; SANS; SAXS; biomolecular structure; hybrid structural modelling; integrative structural biology; proteins; publication guidelines; small-angle scattering; structural modelling
Mesh:
Substances:
Year: 2017 PMID: 28876235 PMCID: PMC5586245 DOI: 10.1107/S2059798317011597
Source DB: PubMed Journal: Acta Crystallogr D Struct Biol ISSN: 2059-7983 Impact factor: 7.652
Summary of guidelines for sample details
| Source of samples, including sample-purification protocol, a measure of the final purity and how it was determined. |
| Composition of the sample, including protein or nucleic acid sequences as measured, or FASTA IDs with the relevant ranges specified, plus fusion tags, ligands, cofactors, glycosylation or other modifications and the predicted molecular mass. |
| Solvent/buffer pH and composition, including additives such as free-radical scavengers used to minimize the effects of radiation damage during SAXS data acquisition, and a statement of how the SAS-measured solvent blank was obtained ( |
| Sample concentration(s) and method(s) of determination, including extinction coefficients and wavelengths when UV absorbance measurements are used. |
| In the case of combined SEC–SAS experiments, a description (or reference) to the system, column size/type/resin, injection sample concentration and volume and flow rate. |
| In the case of SANS contrast-variation experiments, the deuteration level of each biomolecular component ( |
| Any SAS-independent assessments of monodispersity over a range of conditions ( |
Summary of guidelines for data acquisition and reduction
| Instrument type ( |
| Beam dimensions and wavelength resolution (Δλ/λ) with data-smearing parameters where appropriate, and measured |
| References to documentation for detector type and characteristics including pixel size, the basis for error estimates and propagation ( |
| Number of sample exposures and exposure times, the normalization method ( |
| In the case of SANS contrast-variation experiments, sample and buffer transmissions referenced to transmissions of pure 1H2O and 2H2O, from which deuteration of the solvent can be checked. |
| Details of the sample environment, including measurement temperature, measurement cell type and path lengths, any special parameters controlled, |
| In the case of SEC–SAS experiments, description of (or reference to) system. |
| Standards measured and controls and method for placing SAS data on an absolute scale in cm−1, |
| Data-reduction protocol and software used, including version number. |
Summary of guidelines for data presentation, analysis and validation
| Difference scattering profiles [(particle + solvent) − (solvent scattering)] corresponding to the particle form factor deposited in a publicly available archive or made available as supplementary material and presented as a plot of log |
| (i) Intensities on an absolute scale in units of cm−1 with propagated standard errors (σ). Note: for Guinier plots [ln |
| (ii) For multiple curves on the same plot, data can be offset for clarity with the offsets given in the figure caption. |
| (iii) For SANS contrast-variation experiments, data from all contrast points. |
| (iv) Guinier |
|
|
|
|
| Where applied, the magnitude of corrections for solvent subtraction applied to the data as a potential warning that something is not correct if unduly large (say 1% percent of the solvent scattering level). |
| Where relevant, the method of data desmearing to correct for beam geometry and/or polychromaticity and the original smeared data be made available. |
| For a concentration series, note if no change in |
| A dimensionless Kratky plot as a check on the degree of folding and/or flexibility in the scattering particle. Kratky and/or Porod–Debye plots might alternatively be used to assess potential flexibility. |
| For SEC–SAS data a plot of |
| Description of the data processing used to obtain the final data set for analysis and modelling [including data reduction to |
| For contrast-variation experiments the nature and number of contrast points with a plot of normalized ± [ |
| For contrast-variation experiments on assemblies of components with different mean scattering densities, the |
| Software used for data processing and analysis [ |
Summary of reporting guidelines for structure modelling
| All software, including version numbers, used for modelling; three-dimensional shape, bead or atomistic modelling. |
| All modelling assumptions clearly stated, including adjustable parameter values. In the case of imposed symmetry, especially in the case of shape models, comparison with results obtained in the absence of symmetry restraints. |
| For atomistic modelling, a description of how the starting models were obtained ( |
| Any additional experimental or bioinformatics-based evidence supporting modelling assumptions and therefore enabling modelling restraints or independent model validation. |
| For three-dimensional models, values for adjustable parameters, constant adjustments to intensity, χ2 and associated |
| Analysis of the ambiguity and precision of models, |
Figure 1SEC–SAXS results for GI (blue), BSA (red) and CaM (black). (a) Plots showing I(0) (hollow squares) and R g (filled squares) as a function of time for the SEC–SAXS run. Data frames between the vertical bars were selected for averaging to obtain I(q) versus q. (b) I(q) versus q as log-linear plots with the inset showing the Guinier fits (yellow lines) for qR g < 1.3 with open symbols indicating data beyond the Guinier region. (c) Dimensionless Kratky plots for the data in (b). (d) P(r) versus r profiles from the data in (b) normalized to equal areas [i.e. proportional to P(r)/I(0)] for ease of comparison.
Figure 2Crystal structure modelling results. FoXS-derived models (red and black solid lines) for GI (PDB entry 1oad, tetramer), BSA (PDB entry 4f5s, chain A) and CaM (PDB entry 1cll with the additional N- and C-terminal residues modelled) fitted to I(q) versus q. The upper plot shows log I(q) versus q, while the lower inset plot is the error-weighted residual difference plot Δ/σ = [I exp(q) − cI mod(q)]/σ(q) versus q. The colour key for the data plots is the same as in Fig. 1 ▸.
Figure 3MultiFoXS modelling results for BSA and CaM. (a) Model fits for BSA: I(q) versus q (red squares) for one-state (black line) and three-state (cyan line) models assuming flexible residues 183–187 and 381–384. The lower inset shows the error-weighted residual difference plots for one-state (black squares) and three-state (cyan squares) models. (b) BSA DAMMIN model (wheat spheres) overlaid with the crystal structure (PDB entry 4f5s, chain A, blue ribbon) and one-state optimized model (magenta ribbon) and representative structures from the three-state optimized model (cyan ribbon models). (c) Model fits to I(q) versus q for CaM: I(q) versus q (black squares) for one-state (red line) and two-state (cyan line) models assuming flexible residues 1–3 and 77–81; the lower inset shows the error-weighted residual difference plots for the one-state (red squares) and two-state (cyan squares) models. (d) CaM DAMMIN model (wheat spheres) overlaid with the crystal structure (PDB entry 1cll, blue ribbon) and the one-state model (magenta ribbon) with the representative two-state models to the right (pink; calcium ions are depicted as yellow spheres). Model overlays were optimized using SUPCOMB (Kozin & Svergun, 2001 ▸).
Figure 4Ensemble modelling results for CaM. (a) I(q) versus q (black squares) with the EOM model (red line) and error-weighted difference plot for the model and experimental profiles (red squares). (b) Averaged and filtered DAMMIN model (grey spheres) overlaid with representative structures from the optimized ensemble. Structures are aligned by their N-terminal domains (magenta), showing variability in the relative disposition of the C-terminal domains (cyan). The calcium ions are depicted as yellow spheres. Given the variations in the selected structures, the overlay with the DAMMIN model was performed simply by eye in PyMOL. (c, d) R g and d max distributions, respectively, from EOM for the starting pool (black line) and the optimized ensemble (red line).
(a) Sample details.
| GI (tetramer) | BSA | CaM | |
|---|---|---|---|
| Organism |
|
|
|
| Source (catalogue No. or reference) | Hampton Research (HR7-100) | Sigma–Aldrich (A3294) |
|
| UniProt sequence ID (residues in construct) | P24300 (2–388) | P02769 (25–607) | P62155 (2–149) |
| Extinction coefficient [ | 1.075 | 0.646 | 0.178 |
|
| 0.732 | 0.732 | 0.716 |
| Particle contrast from sequence and solvent constituents, | 2.87 (12.39 − 9.52) | 2.86 (12.38 − 5.92) | 3.09 (12.61 − 5.92) |
|
| 172912 | 66400 | 16842 |
| SEC–SAXS column, 5 × 150 mm Superdex S200 | |||
| Loading concentration (mg ml−1) | 6 | 25 | 20.2 |
| Injection volume (µl) | 30 | 35 | 35 |
| Flow rate (ml min−1) | 0.45 | 0.45 | 0.45 |
| Average | 0.58 (0.20–1.09) | 1.81 (1.01–2.45) | 3.09 (2.38–3.55) |
| Solvent (solvent blanks taken from SEC flowthrough prior to elution of protein) | 25 m | ||
(b) SAXS data-collection parameters.
| Instrument/data processing | Australian Synchrotron SAXS/WAXS beamline with Dectris PILATUS 1M detector (Kirby |
| Wavelength (Å) | 1.0332 |
| Beam size (µm) | 250 × 130 |
| Camera length (m) | 2.683 |
|
| 0.00663–0.3104 |
| Absolute scaling method | Comparison with scattering from 1 mm pure H2O |
| Normalization | To transmitted intensity by beam-stop counter |
| Monitoring for radiation damage | X-ray dose maintained below 210 Gy, data frame-by-frame comparison |
| Exposure time | Continuous 1 s data-frame measurements of SEC elution |
| Sample configuration | SEC–SAXS with sheath-flow cell (Kirby |
| Sample temperature (°C) | 22 |
(c) Software employed for SAXS data reduction, analysis and interpretation.
| SAXS data reduction |
|
| Extinction coefficient estimate |
|
| Calculation of Δ |
|
| Basic analyses: Guinier, |
|
| Shape/bead modelling |
|
| Atomic structure modelling |
|
|
| |
|
| |
|
| |
| Missing sequence modelling |
|
| Three-dimensional graphic model representations |
|
(d) Structural parameters.
| GI (tetramer) | BSA | CaM | |
|---|---|---|---|
| Guinier analysis | |||
|
| 0.0759 ± 0.0008 | 0.0861 ± 0.0008 | 0.0554 ± 0.00008 |
|
| 32.87 ± 0.13 | 28.33 ± 0.05 | 21.74 ± 0.06 |
|
| 0.007 | 0.007 | 0.007 |
|
| 1.3 | 1.3 | 1.3 |
| Coefficient of correlation, | 0.999 | 0.999 | 0.999 |
|
| 178312 (1.03) | 65589 (0.99) | 21944 (1.31) |
|
| |||
|
| 0.0748 ± 0.00008 | 0.0850 ± 0.00006 | 0.0533 ± 0.00006 |
|
| 32.65 ± 0.04 | 28.32 ± 0.03 | 22.2 ± 0.06 |
|
| 92 | 87 | 72 |
|
| 0.007–0.243 | 0.007–0.282 | 0.0074–0.310 |
| χ2 (total estimate from | 0.929 (0.94) | 0.858 (0.96) | 0.855 (0.91) |
|
| 180191 (1.04) | 65354 (1.00) | 21718 (1.29) |
| Porod volume (Å−3) (ratio | 229000 (1.3) | 101000 (1.5) | 25200 (1.5) |
|
| 192400, 157.9 (0.91) | 82440, 67.9 (1.02) | 21550, 17.7 (1.05) |
(e) Shape model-fitting results.
| GI (tetramer) | BSA | CaM | |
|---|---|---|---|
|
| |||
|
| 0.007–0.243 | 0.007–0.282 | 0.007–0.310 |
| Symmetry, anisotropy assumptions |
|
|
|
| NSD (standard deviation), No. of clusters | 0.62 (0.01), 1 | 0.75 (0.63), 6 | 0.77 (0.02), 4 |
| χ2 range | 2.25–2.29 | 0.96–0.99 | 1.30–1.37 |
| Constant adjustment to intensities | Skipped, unable to determine | 1.51 × 10−4 | 1.48 × 10−4 |
| Resolution (from | 37 ± 3 | 32 ± 3 | 30 ± 3 |
|
| 134000 (0.77) | 66700 (1.00) | 16300 (0.97) |
|
| |||
|
| 0.007–0.243 | 0.007–0.282 | 0.007–0.310 |
| Symmetry, anisotropy assumptions |
|
|
|
| χ2, | 0.95, 0.04 | 0.85, 0.16 | 0.844, 0.53 |
| Constant adjustment to intensities | 2.697 × 10−5 | 7.736 × 10−5 | 1.877 × 10−4 |
(f) Atomistic modelling.
| Crystal structures | PDB entry | PDB entry | PDB entry 1cll+ |
|
| 0.007–0.243 | 0.007–0.282 | 0.007–0.310 |
|
| |||
| χ2, | 1.02, 0.05 | 4.4, 0.00 | 9.2, 0.00 |
| Predicted | 31.70 | 26.75 | 21.58 |
|
| 1.03, 0.81 | 0.99, 2.39 | 0.99, 2.94 |
|
| |||
| No constant subtraction | |||
| χ2, | 1.00, 0.05 | 2.78, 0.00 | 15.95, 0.00 |
| Predicted | 32.69 | 27.89 | 22.51 |
| Vol (Å), Ra (Å), Dro (e Å−3) | 230987, 1.80, 0.0130 | 76791, 1.80, 0.035 | 20271, 1.40, 0.025 |
| Constant subtraction allowed | |||
| χ2, | 1.01, 0.05 | 2.14, 0.00 | 12.62, 0.00 |
| Predicted | 32.71 | 28.01 | 22.11 |
| Vol (Å), Ra (Å), Dro (e Å−3) | 226689, 1.40, 0.013 | 76791, 1.80, 0.037 | 22012, 1.40, 0.055 |
| Multistate/ensemble models | |||
| Starting crystal structures | PDB entry | PDB entry 1cll+ | |
| Flexible residues | 183–187 and 381–384 | 1–3 (ADQ), 77–87 (KDTDS) | |
|
| |||
| No. of states | 1 | 1 | |
| χ2, | 1.05, 0.02 | 0.85, 0.31 | |
|
| 0.99, 0.63 | 1.05, 0.99 | |
|
| 27.59 | 21.03 | |
| Weights | 1 | 1 | |
| No. of states | 2 | 2 | |
| χ2, | 0.96, 0.09 | 0.79, 0.79 | |
|
| 1.02, 1.21 | 1.02, 1.50 | |
|
| 26.42, 32.35 | 22.32, 19.47 | |
| Weights | 0.83, 0.17 | 0.70, 0.30 | |
| No. of states | 3 | 3 | |
| χ2, | 0.82, 0.17 | 0.79, 0.79 | |
|
| 1.02, 0.94 | 1.02, 1.52 | |
|
| 26.42, 30.43, 29.80 | 22.32, 30.25, 19.00 | |
| Weights | 0.74, 0.08, 0.08 | 0.68, 0.13, 0.18 | |
|
| |||
| χ2, | 0.82, 0.79 | ||
| Constant subtraction | 0 | ||
| No. of representative structures | 13 | ||
(g) SASBDB IDs for data and models.
| GI | BS | CaM |
|---|---|---|
| SASDCK2 | SASDCJ3 | SASDCQ2 |
PDB entry 1cll+ is PDB entry 1cll plus the missing ADQ at the N-terminus and the C-terminal K missing in the crystal structure.
In FoXS the adjustable parameters c 1 and c 2 are adjustments for excluded volume and hydration density. c 1 can vary by 5% (0.95–1.05) and the maximum hydration adjustment c 2 of 4.0 corresponds to ∼0.388 e Å−3 (compared with bulk solvent density ρ = 0.334 e Å−3).
In CRYSOL the adjustable parameters are excluded volume (Vol in Å3), optimal atomic radius (Ra in Å) and Dro (optimal contrast of the hydration shell in e Å−3).
In MultiFoXS c 1 and c 2 are the same for all states in a set; the scale factor c is then optimized for each state and a relative weight w for each state n is output.