| Literature DB >> 26123630 |
Michael Martinez1, Neil J Bruce1, Julia Romanowska1, Daria B Kokh1, Musa Ozboyaci1,2, Xiaofeng Yu1,3, Mehmet Ali Öztürk1,3, Stefan Richter1, Rebecca C Wade1,4,5.
Abstract
The simulation of diffusional association (SDA) Brownian dynamics software package has been widely used in the study of biomacromolecular association. Initially developed to calculate bimolecular protein-protein association rate constants, it has since been extended to study electron transfer rates, to predict the structures of biomacromolecular complexes, to investigate the adsorption of proteins to inorganic surfaces, and to simulate the dynamics of large systems containing many biomacromolecular solutes, allowing the study of concentration-dependent effects. These extensions have led to a number of divergent versions of the software. In this article, we report the development of the latest version of the software (SDA 7). This release was developed to consolidate the existing codes into a single framework, while improving the parallelization of the code to better exploit modern multicore shared memory computer architectures. It is built using a modular object-oriented programming scheme, to allow for easy maintenance and extension of the software, and includes new features, such as adding flexible solute representations. We discuss a number of application examples, which describe some of the methods available in the release, and provide benchmarking data to demonstrate the parallel performance.Entities:
Keywords: Brownian dynamics; biomacromolecular diffusion; macromolecular association; parallelization; protein adsorption; protein flexibility; protein-solid state interactions
Mesh:
Substances:
Year: 2015 PMID: 26123630 PMCID: PMC4755232 DOI: 10.1002/jcc.23971
Source DB: PubMed Journal: J Comput Chem ISSN: 0192-8651 Impact factor: 3.376
Figure 1Simplified class diagram of the solute‐grid relationship (and the memory storage of simulation data). An array contains pointers to each instance of the solute class for each solute. A single setofgrid instance is used for all identical conformations of the solutes to minimize memory requirements. In the case of flexible solutes, an intermediate flexible object containing a linked‐list of pointers to the setofgrid instances of each conformation is added to each flexible solute.
Figure 2Workflow of an SDA 7 simulation. The simulation parameters are set in a single control input file, which also defines the location of the additional files needed for the simulation. The simulation proceeds through one of two routes, depending on whether it is a bimolecular or many‐molecule simulation. With bimolecular simulations looping over many trajectories with different initial configurations, and many‐molecule simulations consisting of a single trajectory of nsteps BD moves. In both simulation types, there are three main calculations that may be computed within a simulation time step: a force calculation (F), a BD propagation move (BD), and an energy evaluation (E). The simulation types are parallelized differently (shown here as an example with four OpenMP threads), with each thread in a bimolecular simulation running an independent trajectory and the force, BD and Energy calculations parallelized across threads in many‐molecule simulations. In many‐molecule simulations, all threads must synchronize between these three calculations. Both types of simulation produce output files that are consistent with each other. [Color figure can be viewed in the online issue, which is available at wileyonlinelibrary.com.]
Figure 3Test systems used for the validation and benchmarking of SDA 7. a) The electrostatically guided bimolecular association of barnase (cyan) and barstar (green). The + 0.5 kcal/(mol ) (blue) and –0.5 kcal/(mol ) (red) isosurfaces of the electrostatic potentials surrounding each solute are shown. b) A snapshot of the simulation of 256 HEWL molecules in solution (The HEWL molecules are colored differently for clarity). c) Docking of the globular domain of the linker histone H5 (blue) to the nucleosome. The crystal conformation (magenta, 70) and a structure generated with an elastic network model (cyan, 76) of the flexible region of the nucleosome are shown.
Scaling of the Case I (barnase–barstar) benchmarks on different numbers of cores.
| Number of cores | Association | Docking | ||||
|---|---|---|---|---|---|---|
| Time | Scaling |
| Time | Scaling | RMSD | |
| 1 | 5128 ± 126 | 1.00 ± 0.02 | 2.7 ± 0.2 | 31461 ± 503 | 1.00 ± 0.02 | 4.9 ± 0.4 |
| 2 | 2705 ± 154 | 1.90 ± 0.11 | 2.7 ± 0.1 | 15886 ± 250 | 1.98 ± 0.03 | 4.9 ± 0.4 |
| 4 | 1307 ± 16 | 3.92 ± 0.05 | 2.8 ± 0.3 | 7933 ± 115 | 3.97 ± 0.06 | 4.8 ± 0.4 |
| 8 | 663 ± 9 | 7.73 ± 0.10 | 2.7 ± 0.2 | 4047 ± 36 | 7.77 ± 0.07 | 4.9 ± 0.4 |
| 12 | 446 ± 6 | 11.50 ± 0.16 | 2.8 ± 0.3 | 2732 ± 48 | 11.51 ± 0.21 | 4.9 ± 0.4 |
| 16 | 341 ± 7 | 15.04 ± 0.30 | 2.8 ± 0.3 | 2085 ± 40 | 15.09 ± 0.29 | 4.8 ± 0.4 |
| 24 | 235 ± 4 | 21.83 ± 0.35 | 2.8 ± 0.2 | 1446 ± 27 | 21.75 ± 0.41 | 4.9 ± 0.4 |
| 32 | 180 ± 3 | 28.43 ± 0.52 | 2.8 ± 0.2 | 1130 ± 20 | 27.82 ± 0.51 | 4.8 ± 0.2 |
| 48 | 129 ± 8 | 39.69 ± 2.43 | 2.9 ± 0.3 | 899 ± 157 | 34.97 ± 6.10 | 4.9 ± 0.3 |
Mean simulation wall‐time (s) with standard deviation.
Mean simulation performance scaling, relative to mean single core time, with standard deviation.
k on (108 M−1s−1) calculated with an encounter complex definition of 2 independent native contacts within 6 Å, (see methods).
Root mean squared deviation (Å) of representative of second most populated cluster of barstar to barstar in the complex in the PDB file, 1BRS.
Figure 4Scaling performance for simulations of barnase and barstar to compute a) the bimolecular association rate constant and b) the structures of docked complexes. Error bars show standard deviations across 10 independent simulations performed for each processor count. Dotted lines indicate perfect scaling.
Scaling of the Case II (HEWL) benchmarks on different numbers of cores.
| Number of cores | Single protonation state | Three protonation states | ||||
|---|---|---|---|---|---|---|
| Minimum energy algorithm | Metropolis algorithm | |||||
| Time | Scaling | Time | Scaling | Time | Scaling | |
| 1 | 6426 ± 83 | 1.00 ± 0.01 | 6480 ± 217 | 1.00 ± 0.03 | 6431 ± 126 | 1.00 ± 0.02 |
| 2 | 3273 ± 30 | 1.96 ± 0.02 | 3337 ± 53 | 1.94 ± 0.03 | 3300 ± 44 | 1.95 ± 0.03 |
| 4 | 1673 ± 18 | 3.84 ± 0.04 | 1695 ± 18 | 3.82 ± 0.04 | 1675 ± 15 | 3.84 ± 0.03 |
| 8 | 872 ± 8 | 7.37 ± 0.07 | 878 ± 3 | 7.38 ± 0.03 | 866 ± 7 | 7.43 ± 0.06 |
| 12 | 599 ± 9 | 10.74 ± 0.16 | 611 ± 5 | 10.61 ± 0.09 | 598 ± 7 | 10.75 ± 0.12 |
| 16 | 463 ± 6 | 13.89 ± 0.18 | 475 ± 4 | 13.65 ± 0.12 | 468 ± 6 | 13.76 ± 0.17 |
| 24 | 346 ± 12 | 18.56 ± 0.67 | 354 ± 5 | 18.31 ± 0.25 | 350 ± 6 | 18.38 ± 0.32 |
| 32 | 288 ± 10 | 22.30 ± 0.78 | 308 ± 9 | 21.06 ± 0.62 | 298 ± 13 | 21.55 ± 0.93 |
| 48 | 266 ± 24 | 24.13 ± 2.16 | 318 ± 22 | 20.40 ± 1.43 | 302 ± 30 | 21.26 ± 2.11 |
Mean simulation wall‐time (s) with standard deviation.
Mean simulation performance scaling relative to mean single core time with standard deviation.
Figure 5Scaling performance for 10 ns simulations of 256 HEWL molecules a) with residue protonation states calculated at pH 6, and with protonation states at pH 3, 6 and 9 with conformational switching using the b) minimum energy, and c) Metropolis algorithms. Error bars show standard deviations across ten independent simulations performed for each processor count. Dotted lines indicate perfect scaling.
Clustering of Case III (linker histone H5‐nucleosome) docking.
| Conformation | Number of solutions | First cluster population (%) | RMSD |
|---|---|---|---|
| 70 | 248,678 | 82 | 3.4 |
| 71 | 176,171 | 84 | 5.4 |
| 72 | 43,138 | 85 | 8.0 |
| 73 | 45,952 | 83 | 7.5 |
| 74 | 22,960 | 37 | 7.6 |
| 75 | 112,010 | 71 | 5.5 |
| 76 | 90,214 | 59 | 19.5 |
Backbone RMSD (Å) of gH5 in the representative of the first cluster and the structure obtained by Pachov et al.