| Literature DB >> 27980506 |
Stephen J Perkins1, David W Wright1, Hailiang Zhang2, Emre H Brookes3, Jianhan Chen4, Thomas C Irving5, Susan Krueger2, David J Barlow6, Karen J Edler7, David J Scott8, Nicholas J Terrill9, Stephen M King10, Paul D Butler11, Joseph E Curtis2.
Abstract
The capabilities of current computer simulations provide a unique opportunity to model small-angle scattering (SAS) data at the atomistic level, and to include other structural constraints ranging from molecular and atomistic energetics to crystallography, electron microscopy and NMR. This extends the capabilities of solution scattering and provides deeper insights into the physics and chemistry of the systems studied. Realizing this potential, however, requires integrating the experimental data with a new generation of modelling software. To achieve this, the CCP-SAS collaboration (http://www.ccpsas.org/) is developing open-source, high-throughput and user-friendly software for the atomistic and coarse-grained molecular modelling of scattering data. Robust state-of-the-art molecular simulation engines and molecular dynamics and Monte Carlo force fields provide constraints to the solution structure inferred from the small-angle scattering data, which incorporates the known physical chemistry of the system. The implementation of this software suite involves a tiered approach in which GenApp provides the deployment infrastructure for running applications on both standard and high-performance computing hardware, and SASSIE provides a workflow framework into which modules can be plugged to prepare structures, carry out simulations, calculate theoretical scattering data and compare results with experimental data. GenApp produces the accessible web-based front end termed SASSIE-web, and GenApp and SASSIE also make community SAS codes available. Applications are illustrated by case studies: (i) inter-domain flexibility in two- to six-domain proteins as exemplified by HIV-1 Gag, MASP and ubiquitin; (ii) the hinge conformation in human IgG2 and IgA1 antibodies; (iii) the complex formed between a hexameric protein Hfq and mRNA; and (iv) synthetic 'bottlebrush' polymers.Entities:
Keywords: molecular dynamics (MD); molecular modelling; scattering curve fits; small-angle-X-ray scattering (SAXS); small-angle-neutron scattering (SANS)
Year: 2016 PMID: 27980506 PMCID: PMC5139988 DOI: 10.1107/S160057671601517X
Source DB: PubMed Journal: J Appl Crystallogr ISSN: 0021-8898 Impact factor: 3.304
Figure 1Upper panel: Schematic domain structures of six SASSIE-modelled macromolecules. The domains are drawn approximately to scale according to their molecular structures. The major linkers varied in SASSIE searches are denoted by arrows (Table 1); some simplification has been made. The domain names are as follows: HIV-1 Gag: MA, matrix; CA, capsid; NC, nucleocapsid. MASP: CUB, C1r/C1s, Uefg and bone morphogenetic protein-1; EGF, epidermal growth factor; SCR, short complement regulator; SP, serine protease. Ub, ubiquitin. IgG2 and IgA1: Fab, fragment antigen binding; Fc, fragment crystallizable. Lower panel: Molecular structures for these six macromolecules, all drawn to the same scale in PYMOL (Schrodinger LLC). The best-fit structures are to be described as ensembles of structures and not as the single structures as shown. The domain colours follow those in the upper panel. That for HIV-1 Gag is taken from the starting model for the simulations, where the MA, CA and NC domains are taken from crystal or NMR structures, and the p2 domain is not shown (Table 1 ▸). That for MASP-3D is taken from the crystal structure of MASP-1 and was the starting model used to initiate the fitting. That for the K27-ubiquitin dimer is taken from the isopeptide dimer formed through Lys27 (distal Ub, orange; proximal Ub, yellow; K27, magenta). That for IgA1 is the final model from the SASSIE fits, but not showing the glycan chains (Fig. 4). That for the Hfq–mRNA complex is the input file used in the Complex MC tutorial; under this is the starting crystal structure of the Hfq core protein bound with two heptamer nucleotide chains (PDB code 4ht8). The mRNA chains are shown in orange.
Figure 2The GenApp and SASSIE infrastructures. (a) The use of GenApp to generate applications. The generator (green box) reads application definitions, module definitions and chosen target language information to assemble the application instances. Examples of target languages are shown in the cyan boxes (adapted from Brookes et al., 2015 ▸). In application to SASSIE, GenApp is able to take any set of executables (created using any set of programming languages) compatible with a certain platform (e.g. Windows or Linux) and present them together in the single web interface that is shown in Fig. 3(a). (b) In the SASSIE workflow, the schematic relationships between the SASSIE framework and five of the six main modules within SASSIE are shown within the cyan box. These modules are assembled using the GenApp deployment infrastructure. The two inputs for SASSIE are shown in yellow boxes. The two outputs from SASSIE are shown in green boxes. At this point, a decision is required in terms of whether the modelling is completed (red box).
Figure 3The SASSIE-web user interface. (a) The home page at https://sassie-web.chem.utk.edu/sassie2/. The six main modules of SASSIE are shown to the left. (b) The input screen to set up a Monomer MC simulation from the Simulate module is shown. (c) The χ2 filter input screen from the Analyze module is shown.
Atomistic modelling projects completed using SASSIE
| Biological system | HIV-1 Gag | Ubiquitin dimer (Ub2) | MASP dimers | Human IgG2 | Human IgA1 | Hfq-mRNA |
|---|---|---|---|---|---|---|
| Experimental data | Neutron scattering (NG3 30 m and NG7 30 m at NIST) | 600 and 800 MHz NMR structures; neutron scattering (NG3 30 m at NIST) | X-ray crystallography; analytical ultracentrifugation; X-ray scattering (BM29 at ESRF) | Neutron scattering (NG3 30 m and NG7 30 m at NIST) | Analytical ultracentrifugation; X-ray scattering (ID02 at ESRF); neutron scattering (SANS2d at ISIS) | X-ray scattering (12-ID-B at APS); chemical footprinting |
| Starting models for | NMR structures for MA and NC; crystal structure for CA | NMR structure for the Ub monomer | 3 crystal structures for CUB1–EGF–CUB1, CUB1–SCR1 and SCR1–SCR2–SP | Crystal structure for full-length mouse IgG2a | Crystal structures for the IgA1 Fab and Fc regions | Crystal structure for the core Hfq–mRNA complex |
| Structurally varied linker(s) in | 5 flexible linkers between the MA, CA and NC domains | C-terminal residues 72–76 of the distal Ub in the Ub dimer | 2 linkers in CUB1–EGF–CUB1; 5 linkers in full-length MASP | 3 amino acids in the IgG2 upper hinge | 2 | Residues 1–5 and 66–102 in the Hfq hexamer; the 128/129 hinge in mRNA |
| Number of models used in | 4800 HIV-1 Gag models | 30 000 K27-Ub2 dimer models | 1982–4517 models for CUB1–EGF–CUB1; 6173–30 910 models for full-length MASP | 56 511 IgG2 models | 172 833 truncated IgA1 models; 146 484 full-length IgA1 models | 24 991 Hfq models; 27 427 mRNA models; 19 132 models for the complex |
| Molecular mass (kDa) | 53 | 17 (dimer) | 75 and 170 | 150 | 164 | 67, 96 and 163 |
| Experimental | 3.4 | 18.5–19.4 for the K27-Ub2 dimer | 3.79–3.87 for CUB1–EGF–CUB1; 7.54–7.93 for full-length MASP | 4.75 | 5.93 | 3.36 nm (Hfq); 6.81 nm (mRNA); 5.80 nm (complex) |
|
| 0.09–2.50 (neutrons) | 0.30–4.0 (neutrons) | 0.06–2.20 (X-rays) | 0.07–3.00 (neutrons) | 0.13–2.10 (X-rays); 0.18–1.6 (neutrons) | 0.05–10.07 (X-rays) |
| Final | 1160 HIV-1 Gag models with χ2 of 1-2 | χ2 of 1.02 – 2.36 for 5 dimer conformational clusters |
| 1160 IgG2 models with χ2 < 2 |
| 917 Hfq–mRNA models with χ2 < 1.5 |
| Reference | Datta | Castañeda, Chaturvedi | Nan | Clark | Hui | Peng |
Q is defined as 4πsinθ/λ, where 2θ is the scattering angle and λ is the wavelength.
Figure 4The SASSIE modelling workflow for monomeric human IgA1 (Hui et al., 2015 ▸). This work and that in Fig. 5 ▸ was presented at the 16th International Conference on Small-Angle Scattering at the Technische Universität Berlin, Germany, on 13–18 September 2015. (a) The goodness-of-fit R factors for the calculated I(Q) curves from 146 484 hydrated IgA1 structures were calculated relative to the I(Q) curve extrapolated to zero concentration. The R factors were plotted against the R G value calculated for each hydrated model. The experimental R G value of 5.77 ± 0.04 nm (unless otherwise stated, uncertainties are reported as one standard deviation) is shown by the vertical blue line, and a coloured band indicates the ±10% range of X-ray R G values used for filtering the best-fit models. The inset shows an expanded view for the R factors below 10%. (b) The SAXS curve fit for the median best-fit structure for full-length IgA1 identified from a cluster of 112 best-fit structures (Figs. 1 and 5). The calculated I(Q) and P(r) curves are shown in red and compared with the experimental data in black. (c) The SANS fit for the unhydrated structure corresponding to the best-fit SAXS hydrated structure is also shown.
Figure 5Final molecular modelling results for human IgA1 antibody (Hui et al., 2015 ▸). The protein main chain is shown as a yellow ribbon. The structure was taken from the median of the 112 models in the best-fit cluster. The O-glycans at Thr225, Thr228 and Ser232 in the hinge and the N-glycans at Asn263 and Asn459 in the Fc region and tailpiece, respectively, are shown as green sticks. (a) View face on to the Fc region in the best-fit Y-shaped IgA1 structure. The two FcαR sites on the Fc region are shown occupied by two FcαR receptors (blue: PDB code 1ow0; Herr et al., 2003 ▸). (b) View edge on to the Fc region in this best-fit IgA1 structure. This view was rotated by 90° about a vertical axis, and the two blue FcαR receptors were deleted. This view shows the location of the O-glycans and N-glycans in IgA1 as green sticks.