| Literature DB >> 23251350 |
Sjoerd J de Vries1, Martin Zacharias.
Abstract
Many of the most important functions in the cell are carried out by proteins organized in large molecular machines. Cryo-electron microscopy (cryo-EM) is increasingly being used to obtain low resolution density maps of these large assemblies. A new method, ATTRACT-EM, for the computational assembly of molecular assemblies from their components has been developed. Based on concepts from the protein-protein docking field, it utilizes cryo-EM density maps to assemble molecular subunits at near atomic detail, starting from millions of initial subunit configurations. The search efficiency was further enhanced by recombining partial solutions, the inclusion of symmetry information, and refinement using a molecular force field. The approach was tested on the GroES-GroEL system, using an experimental cryo-EM map at 23.5 Å resolution, and on several smaller complexes. Inclusion of experimental information on the symmetry of the systems and the application of a new gradient vector matching algorithm allowed the efficient identification of docked assemblies in close agreement with experiment. Application to the GroES-GroEL complex resulted in a top ranked model with a deviation of 4.6 Å (and a 2.8 Å model within the top 10) from the GroES-GroEL crystal structure, a significant improvement over existing methods.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23251350 PMCID: PMC3522670 DOI: 10.1371/journal.pone.0049733
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Flowchart of the ATTRACT-EM protocol.
Figure 2Assembly results for the GroEL cis ring.
A) RMSD values of all generated models compared to the reference structure; models were fitted (energy-minimized) using a 40 Å simulated density map and ranked by the same energy. B) RMSD values of the generated models compared to the reference structure; the top 20% models from A) were rescored with GVM using a 20 Å simulated density map. C) Overlay of the best-scoring structure from B) (green) on the reference structure (cyan) (RMSD: 5.4 Å). Image was generated with PyMol [63]. D) Overlay of the best-scoring refined structure (green) on the reference structure (cyan) (RMSD: 2.5 Å). Structures from C) were refined using the ATTRACT force field and rescored with GVM. Image was generated with PyMol [63].
Top 10 RMSD values for the GroEL cis ring after the assembly stage.
| Rank | Overall RMSD | ligand RMSD |
| 1 | 57.4 | 58.3 |
| 2 | 48.3 | 33.5 |
| 3 | 56.5 | 68.1 |
| 4 | 11.9 | 8.4 |
| 5 | 17.3 | 16.6 |
| 6 | 53.8 | 64.2 |
| 7 | 27.4 | 57.0 |
| 8 | 36.0 | 49.9 |
| 9 | 58.2 | 35.6 |
| 10 | 32.8 | 54.3 |
RMSD and ligand RMSD values of the top 10 models for the GroEL cis ring (in Å), compared to the reference structure; models were fitted (energy-minimized) using a 40 Å simulated density map and ranked by the same energy.
Figure 3The gradient vector representation.
A) A simple reference shape. B) The reference shape with a Laplacian filter applied to it. C) The gradient vector representation of the reference shape. The direction of the arrow indicates the direction of the gradient vector, whereas the size of the arrow shows its magnitude.
Top 10 RMSD values for the GroEL cis ring after the scoring stage.
| Rank | Overall RMSD | ligand RMSD |
| 1 | 5.4 | 6.3 |
| 2 | 6.2 | 13.1 |
| 3 | 4.7 | 7.9 |
| 4 | 6.8 | 10.5 |
| 5 | 5.9 | 11.2 |
| 6 | 6.2 | 7.7 |
| 7 | 8.7 | 7.7 |
| 8 | 8.3 | 17.1 |
| 9 | 7.2 | 13.6 |
| 10 | 8.0 | 14.9 |
RMSD and ligand RMSD values of the top 10 models for the GroEL cis ring (in Å), compared to the reference structure; models were fitted (energy-minimized) using a 40 Å simulated density map and the top 20% structures were ranked by gradient matching using a 20 Å simulated density map.
Top 10 RMSD values for the GroEL cis ring after the refinement stage.
| Rank | Overall RMSD | ligand RMSD |
| 1 | 2.5 | 1.0 |
| 2 | 6.2 | 7.5 |
| 3 | 6.6 | 4.8 |
| 4 | 7.2 | 11.1 |
| 5 | 6.2 | 9.8 |
| 6 | 8.1 | 12.5 |
| 7 | 7.1 | 11.0 |
| 8 | 7.1 | 1.0 |
| 9 | 7.2 | 7.5 |
| 10 | 7.8 | 4.8 |
RMSD and ligand RMSD values of the top 10 models for the GroEL cis ring (in Å), compared to the reference structure; models were fitted (energy-minimized) using a 40 Å simulated density map, ranked by gradient matching using a 20 Å simulated density map, and refined using the ATTRACT force field.
Figure 4Assembly results for the full GroES-GroEL complex, using simulated data.
A) RMSD values of generated models for the full GroES-GroEL complex, compared to the reference structure; models were assembled using a 40 Å simulated density map and rescored with GVM using a 20 Å simulated density map. B) RMSD values of recombined models for the full GroES-GroEL complex, compared to the reference structure; the top 100 models from A) were subjected to ring recombination; the resulting combinations were rescored with GVM.
Top 10 RMSD values for GroES-GroEL with simulated data, after the scoring stage.
| Overall RMSD | ||||
| GroEL | GroEL | |||
| rank | all | GroES | cis | trans |
| 1 | 10.8 | 33.5 | 5.5 | 3.8 |
| 2 | 9.5 | 26.4 | 5.0 | 6.5 |
| 3 | 8.4 | 15.2 | 7.6 | 7.2 |
| 4 | 10.2 | 18.5 | 6.0 | 11.2 |
| 5 | 11.6 | 23.7 | 12.2 | 6.5 |
| 6 | 9.6 | 21.8 | 6.9 | 7.8 |
| 7 | 12.3 | 23.9 | 8.1 | 12.5 |
| 8 | 11.8 | 24.3 | 12.1 | 6.7 |
| 9 | 11.3 | 27.1 | 8.3 | 8.5 |
| 10 | 13.2 | 30.7 | 9.3 | 8.4 |
RMSD values of the top 10 models for the full GroES-GroEL complex (in Å), compared to the reference structure; models were fitted (energy-minimized) using a 40 Å simulated density map and ranked by gradient matching using a 20 Å simulated density map.
Top 10 RMSD values for GroES-GroEL with simulated data, after ring recombination.
| ligand RMSD | |||||||||
| Overall RMSD | intra-ring | inter-ring | |||||||
| GroEL | GroEL | GroEL | GroEL | cis- | cis- | ||||
| Rank | all | GroES | cis | trans | GroES | cis | trans | GroES | trans |
| 1 | 4.0 | 4.3 | 3.9 | 3.9 | 4.2 | 7.1 | 5.2 | 9.8 | 7.5 |
| 2 | 3.9 | 4.3 | 3.9 | 3.8 | 4.2 | 7.1 | 7.7 | 9.8 | 7.3 |
| 3 | 4.4 | 4.3 | 3.9 | 4.9 | 4.2 | 7.1 | 4.7 | 9.8 | 5.5 |
| 4 | 4.1 | 4.3 | 3.9 | 4.2 | 4.2 | 7.1 | 8.4 | 9.8 | 3.2 |
| 5 | 4.4 | 4.3 | 3.9 | 5.0 | 4.2 | 7.1 | 10.2 | 9.8 | 8.3 |
| 6 | 4.1 | 4.3 | 3.9 | 4.3 | 4.2 | 7.1 | 4.0 | 9.8 | 5.8 |
| 7 | 4.5 | 4.3 | 3.9 | 5.1 | 4.2 | 7.1 | 9.3 | 9.8 | 6.2 |
| 8 | 5.6 | 15.0 | 3.0 | 3.9 | 27.0 | 2.5 | 5.2 | 15.9 | 6.0 |
| 9 | 6.0 | 16.0 | 3.7 | 3.9 | 20.2 | 6.0 | 5.2 | 16.9 | 7.8 |
| 10 | 7.1 | 20.2 | 4.1 | 3.9 | 42.4 | 5.5 | 5.2 | 20.2 | 11.3 |
RMSD and ligand RMSD values of the top 10 models for the full GroES-GroEL complex (in Å), compared to the reference structure; models were fitted (energy-minimized) using a 40 Å simulated density map and ranked by gradient matching using a 20 Å simulated density map. Then, the top 100 models were subjected to ring recombination; the resulting combinations were again ranked by gradient matching.
Figure 5Assembly results for the full GroES-GroEL complex, using experimental data.
A) RMSD values of generated models for the full GroES-GroEL complex, compared to the reference structure; models were assembled using a 23.5 Å experimental density map [35] that was downsampled to 44.8 Å; then, models were rescored with GVM using the full density map. B) RMSD values of recombined models for the full GroES-GroEL complex, compared to the reference structure; the top 100 models from A) were subjected to ring recombination; the resulting combinations were again rescored with GVM. C) RMSD values after two ring recombinations, repeating the procedure on the top 100 models from B). Note the different scaling on the logarithmic Y axis. D) Best-scoring refined structure (green), overlaid onto the reference structure (cyan) (RMSD: 4.6 Å); Top-scoring structures from C) were refined using the ATTRACT force field, and rescored with GVM. Image was generated with PyMol [63].
Top 10 RMSD values for GroES-GroEL with experimental data, after the scoring stage.
| Overall RMSD | ||||
| GroEL | GroEL | |||
| Rank | all | GroES | cis | trans |
| 1 | 11.1 | 19.7 | 5.7 | 12.7 |
| 2 | 12.1 | 33.5 | 8.0 | 5.1 |
| 3 | 10.1 | 25.9 | 6.7 | 7.2 |
| 4 | 9.8 | 15.1 | 6.9 | 10.5 |
| 5 | 12.7 | 32.2 | 6.4 | 10.6 |
| 6 | 13.2 | 23.7 | 7.9 | 14.6 |
| 7 | 31.9 | 20.5 | 2.9 | 46.2 |
| 8 | 25.0 | 25.2 | 3.2 | 35.1 |
| 9 | 31.7 | 21.7 | 6.5 | 45.3 |
| 10 | 10.6 | 21.4 | 9.5 | 8.3 |
RMSD and ligand RMSD values of the top 10 models for the full GroES-GroEL complex (in Å), compared to the reference structure; models were fitted (energy-minimized) using a 23.5 Å experimental density map that was downsampled to 44.8 Å; then, the models were ranked by gradient matching using the full density map.
Top 10 RMSD values for GroES-GroEL with experimental data, after ring recombination.
| ligand RMSD | |||||||||
| Overall RMSD | intra-ring | inter-ring | |||||||
| GroEL | GroEL | GroEL | GroEL | cis- | cis- | ||||
| rank | All | GroES | cis | trans | GroES | cis | trans | GroES | trans |
| 1 | 5.0 | 7.5 | 4.3 | 5.1 | 2.7 | 1.9 | 0.9 | 5.8 | 3.3 |
| 2 | 5.0 | 7.5 | 4.1 | 5.1 | 2.7 | 5.9 | 0.9 | 7.7 | 5.1 |
| 3 | 4.9 | 7.5 | 3.8 | 5.1 | 2.7 | 2.7 | 0.9 | 6.7 | 2.5 |
| 4 | 4.7 | 7.5 | 3.2 | 5.1 | 2.7 | 6.9 | 0.9 | 10.2 | 10.6 |
| 5 | 5.5 | 7.5 | 5.3 | 5.1 | 2.7 | 3.8 | 0.9 | 3.5 | 9.5 |
| 6 | 4.4 | 7.5 | 3.8 | 4.2 | 2.7 | 2.7 | 8.2 | 6.7 | 6.6 |
| 7 | 10.9 | 32.7 | 4.3 | 5.1 | 28.4 | 1.9 | 0.9 | 31.2 | 3.3 |
| 8 | 6.5 | 15.9 | 4.3 | 5.1 | 19.6 | 1.9 | 0.9 | 16.2 | 3.3 |
| 9 | 4.5 | 7.5 | 4.1 | 4.2 | 2.7 | 5.9 | 8.2 | 7.7 | 7.3 |
| 10 | 9.7 | 29.2 | 4.3 | 5.1 | 18.8 | 1.9 | 0.9 | 27.1 | 3.3 |
RMSD and ligand RMSD values of the top 10 models for the full GroES-GroEL complex (in Å), compared to the reference structure; models were fitted (energy-minimized) using a 23.5 Å experimental density map that was downsampled to 44.8 Å; after that, the models were ranked by gradient matching using the full density map. Then, the top 100 models were subjected to ring recombination; the resulting combinations were again ranked by gradient matching. This recombination procedure was then repeated.
Top 10 RMSD values for GroES-GroEL with experimental data, after refinement.
| ligand RMSD | |||||||||
| Overall RMSD | intra-ring | inter-ring | |||||||
| GroEL | GroEL | GroEL | GroEL | cis- | cis- | ||||
| rank | All | GroES | Cis | trans | GroES | cis | trans | GroES | trans |
| 1 | 4.6 | 7.5 | 3.5 | 4.8 | 2.9 | 1.6 | 1.0 | 6.2 | 3.7 |
| 2 | 3.8 | 7.5 | 3.4 | 3.0 | 2.2 | 1.4 | 3.0 | 8.0 | 2.4 |
| 3 | 3.5 | 7.3 | 2.0 | 3.3 | 3.0 | 1.0 | 2.7 | 5.7 | 4.7 |
| 4 | 4.2 | 7.5 | 2.5 | 4.5 | 1.8 | 3.3 | 1.2 | 6.7 | 5.9 |
| 5 | 4.1 | 7.5 | 3.8 | 3.4 | 4.0 | 1.2 | 3.7 | 6.9 | 2.9 |
| 6 | 5.9 | 15.3 | 2.8 | 4.9 | 19.3 | 0.7 | 1.1 | 14.8 | 5.0 |
| 7 | 8.1 | 24.2 | 2.8 | 4.9 | 32.9 | 1.1 | 1.0 | 24.0 | 4.6 |
| 8 | 7.6 | 24.4 | 1.8 | 3.0 | 32.6 | 1.4 | 2.9 | 23.1 | 3.3 |
| 9 | 5.8 | 13.9 | 3.7 | 4.8 | 20.7 | 1.7 | 1.0 | 14.8 | 4.4 |
| 10 | 2.8 | 7.5 | 2.0 | 1.5 | 1.9 | 2.0 | 1.4 | 8.4 | 3.0 |
RMSD and ligand RMSD values of the top 10 models for the full GroES-GroEL complex (in Å), compared to the reference structure; models were fitted (energy-minimized) using a 23.5 Å experimental density map that was downsampled to 44.8 Å; after that, the models were ranked by gradient matching using the full density map. Then, the top 100 models were subjected to ring recombination; the resulting combinations were again ranked by gradient matching. This recombination procedure was then repeated. Finally, structures were refined using the ATTRACT force field, and again ranked by gradient matching.
Overview and comparison of all symmetric assembly results.
| Components | ATTRACT-EM | GMFit | IQP | |
| 1AON, cis ring | 7 (C7) | 2.5 (1.0) | - | 8.6* |
| 1AON, full | 21 (C7) | 4.6* | 14.7* | - |
| 1AFW | 2 (C2) | 6.8 (0.7) | 1.0 | 0.9 |
| 2NIC | 3 (C3) | 0.6(0.3) | 1.8 | 1.1 |
| 7CAT | 4 (D2) | 5.4 | 2.3 | - |
| 2REC | 6 (C6) | 2.9 | 2.3 | 1.0 |
Overview of all computational assembly results of ATTRACT-EM for symmetric assemblies, and comparisons to GMFit [29] and IQP [31]. All values are RMSDs in Å. Values in parentheses are ligand RMSDs. All results were obtained with 20 Å simulated maps, except the results marked with *, which were obtained with a 23.4 Å experimental map ([35], EMD code 1046). The “components” column, shows the number of components, and the symmetry of the complex within parentheses.
Overview of the assembly results for dimmers.
| assembly stage | refinement stage | integrated protocol | location restraints | |||||
| RMSD | l-RMSD | RMSD | l-RMSD | RMSD | l-RMSD | RMSD | l-RMSD | |
| 1AFW | 6.3 | 5.7 | 6.8 | 0.7 | 0.3 | 0.7 | 4.3 | 6.4 |
| top 10 | 5.9 | 4.5 | 6.8 | 0.7 | 0.3 | 0.6 | 4.0 | 3.0 |
| all | 5.9 | 3.3 | 6.8 | 0.7 | 0.3 | 0.2 | 4.0 | 1.7 |
| 1AFW | 7.3 | 17.9 | 17.3 | 26.2 | 0.9 | 0.6 | 3.0 | 6.5 |
| top 10 | 7.3 | 17.9 | 17.3 | 26.2 | 0.9 | 0.6 | 3.0 | 6.5 |
| all | 7.3 | 4.6 | 9.0 | 7.5 | 0.9 | 0.2 | 3.0 | 3.7 |
| 1AVX | 38.8 | 28.4 | - | - | 16.4 | 25.0 | 3.9 | 6.3 |
| top 10 | 22.7 | 25.8 | - | - | 10.6 | 13.5 | 3.9 | 5.0 |
| all | 13.2 | 2.9 | - | - | 10.1 | 1.8 | 3.9 | 3.2 |
| 1AY7 | 19.0 | 50.1 | - | - | 13.9 | 19.4 | 3.8 | 5.9 |
| top 10 | 13.5 | 21.7 | - | - | 13.9 | 19.4 | 2.9 | 3.5 |
| all | 10.6 | 4.7 | - | - | 7.6 | 2.0 | 2.9 | 3.2 |
| 1WQ1 | 7.8 | 11.8 | 12.3 | 21.6 | 14.4 | 24.3 | 3.4 | 6.3 |
| top 10 | 5.5 | 7.6 | 6.6 | 10.3 | 5.4 | 4.7 | 3.4 | 3.6 |
| all | 5.3 | 3.3 | 6.1 | 5.9 | 5.2 | 2.5 | 3.4 | 3.5 |
| 1WQ1 | 12.3 | 20.3 | 4.5 | 6.8 | 3.8 | 0.8 | 6.9 | 11.6 |
| top 10 | 5.3 | 7.0 | 4.5 | 6.8 | 3.8 | 0.8 | 3.1 | 4.6 |
| all | 4.9 | 4.0 | 4.5 | 1.2 | 3.8 | 0.8 | 3.1 | 1.7 |
Overview of all computational assembly results of ATTRACT-EM for symmetric assemblies. Indicated are the result for the top-scoring structure, the best result in the top 10 scoring structures, and the best result among all structures. All RMSDs are in Å. All complexes are unbound non-symmetric heterodimers unless indicated otherwise.
Note that true RMSDs were computed towards the bound complex: e.g. for 1WQ1, the theoretical best RMSD (ligand-RMSD) is 0 Å (0 Å) for the bound assembly but 1.7 Å (2.0 Å) for the unbound assembly, due to the conformational differences between bound and unbound forms.
= the complex is a symmetric homodimer; C2 symmetry restraints were used.
= the complex is a symmetric homodimer; C2 symmetry restraints were not used.
= the complex is in the bound form.