| Literature DB >> 33775140 |
Shunzhou Wan1, Robert C Sinclair1, Peter V Coveney1,2.
Abstract
Molecular dynamics simulation is now a widespread approach for understanding complex systems on the atomistic scale. It finds applications from physics and chemistry to engineering, life and medical science. In the last decade, the approach has begun to advance from being a computer-based means of rationalizing experimental observations to producing apparently credible predictions for a number of real-world applications within industrial sectors such as advanced materials and drug discovery. However, key aspects concerning the reproducibility of the method have not kept pace with the speed of its uptake in the scientific community. Here, we present a discussion of uncertainty quantification for molecular dynamics simulation designed to endow the method with better error estimates that will enable it to be used to report actionable results. The approach adopted is a standard one in the field of uncertainty quantification, namely using ensemble methods, in which a sufficiently large number of replicas are run concurrently, from which reliable statistics can be extracted. Indeed, because molecular dynamics is intrinsically chaotic, the need to use ensemble methods is fundamental and holds regardless of the duration of the simulations performed. We discuss the approach and illustrate it in a range of applications from materials science to ligand-protein binding free energy estimation. This article is part of the theme issue 'Reliability and reproducibility in computational science: implementing verification, validation and uncertainty quantification in silico'.Entities:
Keywords: free energy calculation; molecular dynamics simulation; uncertainty quantification
Year: 2021 PMID: 33775140 PMCID: PMC8059622 DOI: 10.1098/rsta.2020.0082
Source DB: PubMed Journal: Philos Trans A Math Phys Eng Sci ISSN: 1364-503X Impact factor: 4.226
Figure 1Molecular dynamics equilibrium distributions with long-range interactions are non-Gaussian. The main figures display the Fisher–Pearson coefficient of skewness (a,c) and the excess kurtosis (b,d) for distributions of predicted binding free energies (ΔG), or binding free energy differences (ΔΔG), using the ensemble-based molecular mechanics Poisson–Boltzmann surface area approach (ESMACS) (a,b) and thermodynamics integration (TIES) (c,d) approaches, respectively. The ESMACS results are obtained from 250 ligand–protein complexes, each with 25,000 frames accumulated from an ensemble simulation with 25 independent replicas. The TIES results include alchemical transformations of 50 pairs of ligands, from ensemble simulations comprising 20 or 40 replicas each. The inset shows distributions of binding free energies for two ligands, or ligand pairs, with the most negative and most positive skewnesses or kurtoses respectively. The best-fit Gaussian distributions are shown by black solid lines. See also Wan et al. [8].
Figure 2Measuring the toughness of different materials with a reactive forcefield. (a) Three structures: (i) neat epoxy polymer; (ii) epoxy-graphene nanocomposite and (iii) epoxy polymer with a defect are strained uniaxially. (b) The stress–strain curves are shown in the plot; lines indicate the average of six replica simulations while the shaded regions correspond to the standard deviations at each strain. While each replica varies, the ensemble average shows the three materials behave similarly. (c) Displays a snapshot from sample (ii) at the point of fracture. (Online version in colour.)
Figure 3Young's modulus of an epoxy resin measured with different simulation sizes. Each point is the average of 300 simulations, which make up the pink histograms for each box size. The 95% bootstrap confidence interval for increasing ensemble size is shown in the inset plot at the bottom right. (Online version in colour.)