| Literature DB >> 32280325 |
Daniel Franke1, Nelly R Hajizadeh1,2, Dmitri I Svergun1.
Abstract
This article presents IMSIM, an application to simulate two-dimensional small-angle X-ray scattering patterns and, further, one-dimensional profiles from biological macromolecules in solution. IMSIM implements a statistical approach yielding two-dimensional images in TIFF, CBF or EDF format, which may be readily processed by existing data-analysis pipelines. Intensities and error estimates of one-dimensional patterns obtained from the radial average of the two-dimensional images exhibit the same statistical properties as observed with actual experimental data. With initial input on an absolute scale, [cm-1]/c[mg ml-1], the simulated data frames may also be scaled to absolute scale such that the forward scattering after subtraction of the background is proportional to the molecular weight of the solute. The effects of changes of concentration, exposure time, flux, wavelength, sample-detector distance, detector dimensions, pixel size, and the mask as well as incident beam position can be considered for the simulation. The simulated data may be used in method development, for educational purposes, and also to determine the most suitable beamline setup for a project prior to the application and use of the actual beamtime. IMSIM is available as part of the ATSAS software package (3.0.0) and is freely available for academic use (http://www.embl-hamburg.de/biosaxs/download.html). © Daniel Franke et al. 2020.Entities:
Keywords: SAXS; computer programs; data simulation; small-angle X-ray scattering; solution scattering
Year: 2020 PMID: 32280325 PMCID: PMC7133063 DOI: 10.1107/S1600576720000527
Source DB: PubMed Journal: J Appl Crystallogr ISSN: 0021-8898 Impact factor: 3.304
Figure 1Simulation of native horse spleen ferritin (PDB code 1ier; Granier et al., 1997 ▸) at 6.0 m detector distance. The theoretical scattering pattern was calculated by CRYSOL (Svergun et al., 1995 ▸) with 5000 points, in the range from 0.0 to 5.0 nm−1. (a) The expected probability density for the simulated events of the flat background (blue) and of the sample (red), adjusted for the increase in the detector area. (b) The corresponding cumulative density functions used to determine the non-uniform s position from a uniform random number in [0, 1]. (c) The resulting image at a 6.0 m position with a Pilatus 6M detector mask applied; here the count data are displayed on a logarithmic scale for improved visualization.
Figure 2Examples of simulated data from samples of varying size and shape with constant wavelength at multiple distances of the Pilatus 6M detector from the sample position. All backgrounds were generated with an exposure time of 10 s, the samples with an exposure of 1 s. (a) The simulated data of three different samples after background subtraction and scaling by concentration, offset for clarity. These are native horse spleen ferritin (1ier) at 6.0 m detector distance at a concentration of 0.5 mg ml−1 (green) and 2 mg ml−1 (light blue); bovine serum albumin (4f5s; Bujacz, 2012 ▸) at 3.0 m detector distance at a concentration of 0.5 mg ml−1 (yellow) and 2.0 mg ml−1 (violet); and myoglobin (1wla; Maurus et al., 1997 ▸) at 1.5 m detector distance at a concentration of 0.5 mg ml−1 (dark blue) and 2.0 mg ml−1 (red). The calculated scattering curves are overlaid in white. Error bars of all data are relative errors in logarithmic scale {SE(I)/[Ilog(10)]} and partially truncated for visibility. (b)–(d) The respective standardized residuals of the simulated data minus the calculated input scattering, divided by the error estimate. Standardized residuals are expected to be randomly distributed and to follow a standard normal distribution, i.e. >99% of all standardized residuals should be located in the range [−3; +3].
Simulation summary and statistics
The three samples (exposure time 1 s) are simulated together with a flat background (exposure time 10 s). Each sample is simulated at two concentrations for the given sample–detector distance (different distances are used for all samples). At all positions a 3 mm circular ‘beamstop’ was masked out at the incident beam position, resulting in a different angular range at each detector position. All generated data sets have n = 2647 data points. Reduced χ2 and Anderson–Darling test statistics and associated probabilities of obtaining a result more extreme than the ones observed (p values) are provided for each setup.
| Sample (PDB code) | 1ier | 4f5s | 1wla | |||
|---|---|---|---|---|---|---|
| Concentration (mg ml−1) | 0.5 | 2.0 | 0.5 | 2.0 | 0.5 | 2.0 |
| Detector distance (m) | 6.0 | 3.0 | 1.5 | |||
| Angular range (nm−1) | [0.0145–3.8497] | [0.0291–7.6499] | [0.0581–14.9253] | |||
| χ2/( | 1.026 | 0.993 | 0.997 | 0.975 | 1.048 | 0.958 |
|
| 0.172 | 0.600 | 0.533 | 0.816 | 0.042 | 0.940 |
| Anderson–Darling | 2.771 | 0.530 | 1.932 | 1.414 | 0.742 | 2.548 |
|
| 0.0359 | 0.716 | 0.100 | 0.198 | 0.525 | 0.047 |