| Literature DB >> 28244185 |
Abstract
In the residual electron density map of a fully refined X-ray protein model, there should be no peaks arising from modeling errors or missing atoms. Any residual peaks that do occur should be contributed by random residual intensity differences between the model and the data. If the model is incomplete (i.e., some atoms are missing), there will be more positive peaks than negative ones. On the other hand, if the model includes inappropriately located atoms, there will be an excess of negative peaks. In this study, random residual peaks are quantified using the probability density function P(x), which is defined as the probability for a peak having peak height between x and x + dx. It is found that P(x) is single-exponential and symmetric for both positive and negative peaks. Thus, P(x) can be used to discriminate residual peaks contributed by random noise in complete models from residual peaks being attributable to modeling errors in incomplete models. For a number of representative structures in the PDB it is found that P(x) has far more large (greater than 5 sigma) positive peaks than large negative peaks. This excess of large positive peaks suggests that the main defect in these refined structures is the omission of ordered water molecules.Entities:
Keywords: completeness of large models; cumulative probability function; distribution function; exponential function; probability density function; protein crystallography
Mesh:
Substances:
Year: 2017 PMID: 28244185 PMCID: PMC5405434 DOI: 10.1002/pro.3145
Source DB: PubMed Journal: Protein Sci ISSN: 0961-8368 Impact factor: 6.725