| Literature DB >> 22505256 |
Pavel V Afonine1, Ralf W Grosse-Kunstleve, Nathaniel Echols, Jeffrey J Headd, Nigel W Moriarty, Marat Mustyakimov, Thomas C Terwilliger, Alexandre Urzhumtsev, Peter H Zwart, Paul D Adams.
Abstract
phenix.refine is a program within the PHENIX package that supports crystallographic structure refinement against experimental data with a wide range of upper resolution limits using a large repertoire of model parameterizations. It has several automation features and is also highly flexible. Several hundred parameters enable extensive customizations for complex use cases. Multiple user-defined refinement strategies can be applied to specific parts of the model in a single refinement run. An intuitive graphical user interface is available to guide novice users and to assist advanced users in managing refinement projects. X-ray or neutron diffraction data can be used separately or jointly in refinement. phenix.refine is tightly integrated into the PHENIX suite, where it serves as a critical component in automated model building, final structure refinement, structure validation and deposition to the wwPDB. This paper presents an overview of the major phenix.refine features, with extensive literature references for readers interested in more detailed discussions of the methods.Entities:
Mesh:
Year: 2012 PMID: 22505256 PMCID: PMC3322595 DOI: 10.1107/S0907444912001308
Source DB: PubMed Journal: Acta Crystallogr D Biol Crystallogr ISSN: 0907-4449
Figure 1Flowchart of structure refinement as implemented in phenix.refine. The execution of some steps is subject to user-defined options. The main refinement body (shown with the gray arrow) is called a macro-cycle and is repeated several times. See text for details.
Figure 2Illustration of typical scenarios for occupancy refinement that phenix.refine handles automatically. (a) Residue having several alternative conformations marked with altLoc identifiers (two in this example, A and B). It is essential that all conformers have identical chain identifiers and residue numbers, while residue names can be different as shown in example (e). All atoms within each conformer must have identical occupancies. The sum of occupancies over all conformers is constrained to 1. (b) Single atoms with occupancy not equal to 0 or 1. (c) Exchangeable H/D sites (used in refinement against neutron data collected from partially deuterated sample). (d) Single-residue molecule with identical occupancies for all atoms (but not equal to 1 or 0). A user can overwrite this behavior or/and define constraints for any number of selected atoms or groups of atoms.
Figure 3The graphical user interface (GUI) for phenix.refine. (a) Configuration tab showing the refinement strategy and commonly used restraint and optimization settings. (b) Display of results, including summary of output files, tables and graphs of statistics and links to molecular-graphics software.
R factors, Ramachandran plot outliers (RO) and MolProbity clashscores (CS; Davis et al., 2007 ▶) for selected structures extracted from the PDB (published), extracted from PDB_REDO and after refinement using phenix.refine
All data cutoffs (resolution, ) were applied as reported in the original works in order to maintain the same reflections used in the calculations.
| Extracted from REMARK records of PDB file header | Calculated with |
| Re-refined in | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Code |
| Data source |
|
| RO (%) | CS |
| RO (%) | CS |
| RO (%) | CS |
|
| 4.3 | X-ray | 42.0/45.3 | 34.2/37.0 | 0.74 | 40.0 | 23.9/31.4 | 0.74 | 25.8 | 24.2/30.9 | 0.92 | 13.3 |
|
| 4.2 | X-ray | 34.3/40.6 | 31.0/36.1 | 1.33 | 37.1 | 34.3/39.9 | 1.33 | 37.1 | 25.8/32.7 | 0.98 | 25.7 |
|
| 4.2 | X-ray | 33.1/37.8 | 30.3/33.9 | 2.79 | 17.6 | 24.1/31.1 | 2.86 | 15.9 | 25.9/28.5 | 0.98 | 15.5 |
|
| 4.1 | X-ray | 29.0/35.4 | 33.7/37.0 | 0.28 | 24.5 | 33.8/36.7 | 0.28 | 24.6 | 28.5/31.9 | 0.28 | 11.8 |
|
| 4.0 | X-ray | 38.2/42.8 | 35.1/38.4 | 1.26 | 55.4 | 35.1/40.1 | 1.01 | 58.1 | 33.3/36.3 | 0.50 | 16.1 |
|
| 4.0 | X-ray | 30.2/35.4 | 21.8/22.8 | 2.82 | 21.6 | 18.4/19.9 | 1.32 | 15.9 | 18.1/21.5 | 0.94 | 11.1 |
|
| 4.0 | X-ray | 27.3/32.3 | 26.8/28.7 | 2.04 | 24.9 | 22.5/28.5 | 1.84 | 20.4 | 21.5/26.4 | 0.79 | 14.6 |
|
| 3.6 | X-ray | 26.7/33.3 | 26.7/33.5 | 2.08 | 40.4 | 27.6/34.5 | 2.08 | 40.5 | 26.8/29.3 | 0.50 | 12.6 |
|
| 3.2 | X-ray | 29.3/34.8 | 30.2/na | 10.3 | 93.1 | na | na | na | 22.2/25.5 | 0.37 | 24.8 |
|
| 2.4 | Neutron | 27.0/30.1 | 30.0/33.9 | 0 | 6.2 | na | na | na | 20.4/25.7 | 0 | 11.2 |
|
| 2.4 | X-ray | 21.2/33.0 | 20.1/31.6 | 1.69 | 31.0 | 21.5/28.9 | 1.69 | 14.6 | 22.0/27.2 | 1.27 | 16.4 |
|
| 1.4 | X-ray | 20.1/25.4 | 20.1/25.2 | 0.82 | 21.7 | 16.0/17.7 | 0.82 | 11.9 | 12.9/15.9 | 0.82 | 20.0 |
|
| 1.0 | X-ray | 23.2/24.7 | 22.85/na | 0 | 7.3 | 14.1/16.9 | 0 | 2.4 | 14.1/16.0 | 0 | 7.3 |
|
| 1.0 | X-ray | 19.5/19.8 | 19.7/19.0 | 0.89 | 10.0 | 16.1/17.5 | 0.89 | 8.6 | 14.4/15.3 | 0.89 | 15.5 |
|
| 0.92 | X-ray | 20.9/19.9 | 20.4/19.8 | 0 | 8.1 | 15.4/16.0 | 0 | 8.6 | 13.3/14.7 | 0 | 14.6 |
|
| 0.65 | Neutron | 9.5/na | 9.1/na | 0 | 0 | na | na | na | 7.8/9.6 | 0 | 0 |
Calculated in PHENIX after applying resolution and cutoffs, as reported in the PDB file header.
R factors as reported on the PDB_REDO web site (Joosten et al., 2009 ▶; http://www.cmbi.ru.nl/pdb_redo/); RO and CS computed using PHENIX.
Re-refinement of PDB-deposited structures using phenix.refine. Refinement strategy (model parameterization and number of refinement macro-cycles) varies depending on model and data quality. See text for details.
PDB or NDB (Berman et al., 1992 ▶) code.
na, value is not available either owing to a missing cross-validation set of reflections or the entry is not available in the database.
Figure 4Polygon images (Urzhumtseva et al., 2009 ▶) before (left) and after (right) re-refinement in phenix.refine for structures 1eic, 1g2y, 2elg and 2ppn. In all cases the polygon computed for structures before re-refinement in phenix.refine indicates one or more problems, for example high R free and R work and too small bond r.m.s.d. for 1eic or very high R factors and geometry deviations for 2elg (vertices are on the furthermost end of the histogram bar). Re-refinement in phenix.refine resulted in polygon vertices moved towards the center (squeezing the polygon) in most cases, indicating improvement of the corresponding model characteristics.
Figure 5Selected examples of (2mF obs − DF model, ϕmodel) nuclear density map improvement after re-refinement of structure 1c57 (neutron data). Left, original structure; right, after re-refinement in phenix.refine. Maps are contoured at the 1.5σ level. Note the improved orientation of exchangeable H/D atoms at Ser and Tyr O atoms. The systematic lack of density around H atoms is a consequence of the negative scattering length of H atoms and related density-cancellation effects (Afonine, Mustyakimov et al., 2010 ▶).
Figure 6Structures after refinement of a severely distorted model, shown in (a), using different refinement protocols: (b) dual-space refinement, (c) refinement using minimization only and (d) combined refinement using minimization and simulated annealing. The best available refined model is shown in gray in all panels.
Figure 7(a) Ensemble of structures illustrating the outcome of 100 identical simulated-annealing refinement runs apart from the random seed. (b) The distribution of R work and R free corresponding to each structure of the ensemble.