| Literature DB >> 26089768 |
David W Wright1, Stephen J Perkins1.
Abstract
Small-angle X-ray and neutron scattering techniques characterize proteins in solution and complement high-resolution structural studies. They are of particular utility when large proteins cannot be crystallized or when the structure is altered by solution conditions. Atomistic models of the averaged structure can be generated through constrained modelling, a technique in which known domain or subunit structures are combined with linker models to produce candidate global conformations. By randomizing the configuration adopted by the different elements of the model, thousands of candidate structures are produced. Next, theoretical scattering curves are generated for each model for trial-and-error fits to the experimental data. From these, a small family of best-fit models is identified. In order to facilitate both the computation of theoretical scattering curves from atomistic models and their comparison with experiment, the SCT suite of tools was developed. SCT also includes programs that provide sequence-based estimates of protein volume (either incorporating hydration or not) and add a hydration layer to models for X-ray scattering modelling. The original SCT software, written in Fortran, resulted in the first atomistic scattering structures to be deposited in the Protein Data Bank, and 77 structures for antibodies, complement proteins and anionic oligosaccharides were determined between 1998 and 2014. For the first time, this software is publicly available, alongside an easier-to-use reimplementation of the same algorithms in Python. Both versions of SCT have been released as open-source software under the Apache 2 license and are available for download from https://github.com/dww100/sct.Entities:
Keywords: X-ray scattering; analytical ultracentrifugation; constrained modelling; hydration shells; neutron scattering
Year: 2015 PMID: 26089768 PMCID: PMC4453981 DOI: 10.1107/S1600576715007062
Source DB: PubMed Journal: J Appl Crystallogr ISSN: 0021-8898 Impact factor: 3.304
Figure 1Schematic representation of a scattering experiment. (a) An incident beam is shown scattering from two point scatterers (represented by the black dots) within a globular macromolecule. The diffracted rays are in phase with each other but out of step by λ at the scattering angle shown, causing constructive interference. The accumulation of these events at low angles gives rise to the scattering pattern of the macromolecule. (b) In a typical small-angle scattering experiment, diffraction from high-scattering-density macromolecules in a low-scattering-density solution gives rise to a scattering pattern on an area detector. q is the scattering vector . The radial average of the scattering pattern about the position of the direct main beam gives rise to the scattering curve in reciprocal space.
Figure 2Constrained modelling algorithm in SCT. First, candidate full atomistic structures of the target macromolecule are generated. That illustrated is for human IgA1, taken from Boehm et al. (1999 ▶). A grid transformation is performed on each structure to produce a lower-resolution (coarse-grained) sphere model, which is used to calculate a theoretical scattering curve via the Debye equation. The R factor determines if the theoretical curve reproduces the experimental curve in the same Q range. Models with low R factors are inferred to represent the average solution structure.
Tasks involved in the constrained modelling process and the programs within the SCT suite that perform them
The programs in both the classic (Fortran-based) and modern (Python-based) versions are shown for each task.
| Task | Modern | Classic |
|---|---|---|
| Volume calculation |
|
|
| Sphere model parameter optimization |
| |
| Sphere model creation |
|
|
| Sphere model hydration |
|
|
| Calculate |
|
|
| Theoretical scattering calculation |
|
|
| Calculate |
|
|
| Curve comparison |
|
|
| Workflow |
|
|
Figure 3Grid transformation algorithms in SCT. (a) A two-dimensional schematic of the grid conversion shows how coarse-grained sphere models are derived from atomistic structures. A grid of equal divisions is created that contains all atoms within the input structure. If more than a specified cutoff number of atoms is found within a division, a ‘sphere’ is added to the final model with a radius of half the grid box width. This algorithm is applied in three dimensions to create sphere models from atomistic structures. (b) This schematic shows how up to 26 hydration spheres as required are added to each existing sphere in the ‘dry’ model to produce a hydrated sphere model. Hydration spheres are located on the corners and mid-points of the sides of a cube, with a dimension of four times the sphere radius (r). The original sphere is shown in green, with the hydration locations in black. (c) A hydration layer is required when modelling X-ray scattering data. The hydration layer of water molecules at the surface is added by surrounding each green sphere in the coarse-grained sphere model of the dry protein (top view) with blue hydration spheres of the same radius as shown (middle view). Overlapping and excess blue hydration spheres are subsequently filtered out to match the hydrated volume calculated from the macromolecular sequence, as shown at the bottom.
Residue volumes for both amino acids and monosaccharides used by sluv2.py in SCT to calculate the macromolecular volumes used in constrained modelling (Perkins, 1986 ▶)
In the ‘classic’ output option, as well as the output of classic sluv, these residue volumes are labelled PER85. The non-agreement of the naming with the 1986 publication is maintained for historical reasons.
| Residue name | Residue code | Volume (103nm3) |
|---|---|---|
| Alanine | ALA | 97.1 |
| Arginine | ARG | 192.9 |
| Asparagine | ASN | 127.4 |
| Aspartic acid | ASP | 125.3 |
| Cysteine | CYS | 112.4 |
| Glutamine | GLN | 147.3 |
| Glutamic acid | GLU | 148.0 |
| Glycine | GLY | 68.2 |
| Histidine | HIS | 158.3 |
| Isoleucine | ILE | 170.1 |
| Leucine | LEU | 182.8 |
| Lysine | LYS | 184.5 |
| Methionine | MET | 176.0 |
| Phenylalanine | PHE | 203.9 |
| Proline | PRO | 129.0 |
| Serine | SER | 103.3 |
| Threonine | THR | 129.0 |
| Tryptophan | TRP | 228.9 |
| Tyrosine | TYR | 202.3 |
| Valine | VAL | 142.3 |
| Fucose | FUC | 160.8 |
| Galactose | GAL | 166.8 |
| Glucose | GLC | 171.9 |
| Mannose | MAN | 170.8 |
|
| NAG | 222.0 |
|
| NGA | 232.9 |
| Sialic acid | SIA | 326.3 |
Data output from the three new output modes introduced in sluv2.py
All data are included in the ‘classic’ output mode and output from sluv.
| Output type | Macromolecular molecular weight (103kgmol1) | Absorption coefficient | Partial specific volume (nm3kg1) | Macromolecular volume (103nm3) |
|---|---|---|---|---|
| Model | No | No | No | Yes |
| AUC | Yes | Yes | Yes | No |
| Project | Yes | Yes | Yes | Yes |
Explanation of the parameters in the YAML input to the modern version of SCT
The same parameters are required for and , consequently they are both denoted rxs?. The format of the YAML file is shown in Supplementary Figure S2.
| Parameter | Type | Meaning | |
|---|---|---|---|
|
|
| Float | Minimum |
|
|
| Float | Maximum |
|
|
| Float | Minimum |
|
|
| Float | Maximum |
|
|
| Float | Minimum |
|
|
| Float | Maximum |
|
|
| Float | Minimum |
|
|
| Float | Maximum |
|
|
| Float | Minimum |
|
|
| Float | Maximum |
|
|
| Integer | Cutoff of the number of atoms in a grid box over which a sphere is added to a sphere model |
|
|
| Float | The length of the side of the grid boxes used in sphere model creation |
|
|
| Integer | Number of positions surrounding each atom onto which a ‘hydration sphere’ should be added when creating a hydrated sphere model (see Fig. 1 |
|
|
| Integer | Cutoff used to remove excess hydration spheres when creating a hydrated sphere model |
|
|
| Float | Maximum |
|
|
| Integer | Number of points at which to calculate |
|
|
| Integer | Number of bins in the distance histogram used with the Debye equation |
|
|
| Boolean | Choice of whether to include a smearing correction in the scattering curve |
|
|
| Float | Wavelength used in smearing calculation |
|
|
| Float | Wavelength spread |
|
|
| Float | Beam divergence |
|
|
| Float | Minimum |
|
|
| Float | Maximum |
Figure 4Comparison of neutron and X-ray scattering data with models of human immunoglobulin IgG4 composed of Fab and Fc regions (Rayner et al., 2014 ▶). (a) Atomistic structures of an extended asymmetric (left) and a compact asymmetric (right) IgG4 model. In these, the Fc region is viewed in a similar orientation. (b), (c) Comparisons of the calculated scattering curves for sphere models generated by SCT from the atomistic models with neutron and X-ray experimental data, respectively. The Q range is depicted from 0.2 to 1.6 nm−1. In both cases, the compact asymmetric model (blue curve: PDB code 4pto) gives good fits to the experimental data, whereas the extended model (red curve) does not.