| Literature DB >> 32002441 |
Catherine L Lawson1, Helen M Berman, Wah Chiu.
Abstract
Electron cryo-microscopy (cryo-EM) is increasingly being used to determine 3D structures of a broad spectrum of biological specimens from molecules to cells. Anticipating this progress in the early 2000s, an international collaboration of scientists with expertise in both cryo-EM and structure data archiving was established (EMDataResource, previously known as EMDataBank). The major goals of the collaboration have been twofold: to develop the necessary infrastructure for archiving cryo-EM-derived density maps and models, and to promote development of cryo-EM structure validation standards. We describe how cryo-EM data archiving and validation have been developed and jointly coordinated for the Electron Microscopy Data Bank and Protein Data Bank archives over the past two decades, as well as the impact of evolving technology on data standards. Just as for X-ray crystallography and nuclear magnetic resonance, engaging the scientific community via workshops and challenging activities has played a central role in developing recommendations and requirements for the cryo-EM structure data archives.Entities:
Year: 2020 PMID: 32002441 PMCID: PMC6980868 DOI: 10.1063/1.5138589
Source DB: PubMed Journal: Struct Dyn ISSN: 2329-7778 Impact factor: 2.920
FIG. 1.Cryo-EM: contrasting early (2000) vs recent (2019). (a) Cryo-EM structure of the E. coli 70S ribosome determined by the Frank group at 11.5 Å, one of the first maps deposited in the EMDB archive (EMD-1003). It is shown here superimposed with manually fitted components deposited to PDB (1eg0). (b) Helical segment of the 1.8 Å apoferritin map used as a target in the 2019 model challenge, with a fitted model.
FIG. 2.Growth of data archives and community activity timeline. (a) Released map entries in EMDB and released EM model coordinate entries in PDB are shown, cumulative by year. Milestones (indicated with arrows) are described in the main text. Plot source: emdataresource.org. (b) Workshops (yellow circles) and challenges (orange circles) related to data standards and validation development are plotted according to the year they were held. Numbers within the circles correspond to Tables II and III rows.
Cryo-EM community data archiving and validation workshops.
| # | Year | Title/location | Organizers | Description | Key outcomes |
|---|---|---|---|---|---|
| 1 | 2002 | IIMS Workshop, UK | Kim Henrick, José-María Carazo, Stephen Fuller | Promote software development in the field of 3DEM molecular structure determination | Guidelines and release policies for the new EMDB archive. |
| 2 | 2004 | 3DEM Developers workshop, UK | Kim Henrick | Developer review of tools and software practices used in the field of cryoEM | Priorities developed for EMDB including electron tomography, PDB-EMDB cross-referencing, lossless map conversion, review of community map standards and conventions |
| 3 | 2004 | Cryo-EM Structure Deposition Workshop, NJ, USA | Helen Berman, Wah Chiu, Michael Rossmann | Develop community consensus on data items needed for deposition of maps and atomic models derived from cryoEM | Need for deposition one-stop-shop articulated. Recommendations incorporated in extended EM data dictionary |
| 4 | 2005 | 3DEM Developers Workshop, UK | Kim Henrick | Introduced EM data dictionary to software developers to facilitate its integration into 3DEM packages and electronic notebooks | Agreement to adopt a common set of conventions for maps |
| 5 | 2006 | CryoEM Standards Task Force, TX, USA | Wah Chiu, David Belnap, José-María Carazo | Gather cryoEM map conventions and formats with associated metadata used by different developers | Key conventions (e.g., Euler angles) were evaluated for interoperability and conversion tools were created |
| 6 | 2008 | Electron Crystallography Data Model Workshop, CA, USA | Wah Chiu, Cathy Lawson | Gather expert advice on metadata items in the EM dictionary pertaining to electron crystallography | Recommendations incorporated into EM data dictionary |
| 7 | 2010 | EM Validation Task Force, NJ, USA | Helen Berman, Wah Chiu, Gerard Kleywegt, Cathy Lawson | Expert review of potential validation standards for maps and models produced by 3DEM reconstruction | Recommendations summarized in white paper, |
| 8 | 2011 | Model Challenge Workshop, HI, USA | Steve Ludtke, Cathy Lawson, Gerard Kleywegt, Helen Berman, Wah Chiu | Computational groups described and compared tools they used to model a selected set of published cryoEM structures | Results published in |
| 9 | 2011 | Data Management Challenges in 3DEM, UK | Ardan Patwardhan, Gerard Kleywegt, Jason Swedlow | Gather expert advice on key topics in data archiving and validation for 3DEM data, including data model, validation, raw-data archiving | Recommendations summarized in white paper. |
| 10 | 2012 | 3DEM Modeling Workshop, TX, USA | Wah Chiu | Current challenges in creating and validating coordinate models built into cryo-EM maps | Recommendations gathered for modeling and validation standards and future model challenges |
| 11 | 2012 | 3D Cellular Context for the Macromolecular World, UK | Ardan Patwardhan, Gerard Kleywegt, Jason Swedlow | Discussions on data archiving and validation for emerging 3D cellular imaging techniques | Recommendations summarized in white paper |
| 12 | 2012 | Instruct Image Processing Center Developer Workshop, Spain | José-María Carazo | Effort to standardize information exchange in single particle reconstruction and to establish algorithm benchmarking | CTF benchmarking challenge |
| 13 | 2015 | CryoEM Model Validation Workshop, MA, USA | Wah Chiu, Cathy Lawson, Paul Adams | Modeling experts met to present and discuss challenges in modeling into cryoEM maps | Gathered recommendations |
| 14 | 2015 | Building Bridges between Cellular and Molecular Structural Biology, UK | Ardan Patwardhan, Gerard Kleywegt | Expert discussions on how to integrate structural data from a diverse range of public archives covering cellular and molecular structural biology | Recommendations to develop tools/file formats for map segmentation, and tools to support biological structure annotation described in white paper |
| 15 | 2017 | Model Challenge Assessment, LA, USA | Wah Chiu, Cathy Lawson, Paul Adams | First pass analyses of models and data submitted to the 2016 Model Challenge | Recommended metrics implemented on model challenge comparison website |
| 16 | 2017 | CryoEM Structure Joint Challenges Workshop, CA, USA | Cathy Lawson, Wah Chiu | Joint review of the 2016 Map and Model Challenge activities | Results described in |
| 17 | 2019 | Frontiers in cryo-EM Validation, UK | Gerard Kleywegt, Garib Murshudov, Elena Orlova, Ardan Patwhardhan, Alan Roseman, Peter Rosenthal, Maya Topf, Martyn Winn | Discuss current and future community needs/challenges for validation tools to support maps and models from single-particle analysis | Meeting featured in 2018 Nature editorial. |
| 18 | 2019 | Model Metrics Workshop, CA, USA | Cathy Lawson, Wah Chiu | Review modeling processes and assessment results of the 2019 Model Metrics Challenge and plan for future events | Recommendations editorial and full event manuscript are in preparation |
Cryo-EM community challenge activities.
| # | Year | Event | Organizing group(s) | Description | Key outcomes |
|---|---|---|---|---|---|
| C1 | 2004 | Particle Picking Challenge | National Resource for Automated Molecular Microscopy (La Jolla) | Compare diverse particle selection algorithms | Algorithms from 12 developer groups were compared and contrasted |
| C2 | 2010 | Model Challenge | EMDataResource, NCMI | Computational groups were asked to apply their tools to a selected set of published cryoEM structures. | Results published in |
| C3 | 2015 | CTF Challenge | Instruct Image Processing Center (Madrid), NCMI | Evaluate community/algorithm abilities to estimate key parameters of EM Contrast Transfer Function for a wide set of experimental conditions | CTF benchmarking challenge summary and results published |
| C4 | 2016 | Map Challenge | EMDataResource | Establish benchmark datasets, develop best practices, evolve criteria for validation of 3DEM reconstructions | Results described in |
| C5 | 2016 | Model Challenge | EMDataResource | Establish benchmark datasets, develop best practices, evolve criteria for validation of 3DEM map-derived models | Results described in |
| C6 | 2019 | Model Metrics Challenge | EMDataResource | Identify metrics most suitable for evaluating/comparing fit of atomic coordinate models into cryo-EM maps in 1.8–3.0 Å resolution range | Recommendations editorial and full event manuscript are in preparation |
FIG. 3.Cryo-EM experimental workflow. The major steps for determining a structure using cryo-EM single particle reconstruction are shown. The specimen shown is Helicobacter pylori vacuolating cytotoxin A oligomer.
FIG. 4.Current systems for deposition, archiving, and accessing cryo-EM structures. Worldwide, every cryo-EM structure (map, experimental metadata, and optionally coordinate model) is deposited and processed through the wwPDB OneDep system (deposit.wwpdb.org), following the same annotation and validation workflow also used for X-ray crystallography and NMR structures. Map-only depositions yield an EMDB entry, while joint map+model depositions yield both EMDB and PDB entries. Workflow metadata collected in OneDep are passed to both EMDB and PDB. EMDB holds all workflow metadata while PDB holds a subset of the metadata; see Table I. The PDB and EMDB archives are accessible by FTP and rsync at wwPDB mirror sites in the US, UK, and Japan. Released cryo-EM structure data from both archives can be accessed via EMDataResource, EMDB, and wwPDB partner websites.
PDBx/mmCIF EM Dictionary used by wwPDB OneDep. As described in Fig. 4, all workflow metadata categories are collected by the OneDep system. Most categories are archived in both PDB and EMDB; asterisked categories are archived only in EMDB.
| Image | Experimental | ||
| em_experiment | em_buffer | em_3d_reconstruction | |
| em_software | em_buffer_component | em_image_processing | em_map* |
| em_crystal_formation | em_particle_selection | em_structure_factors* | |
| em_embedding | em_volume_selection | em_layer_lines* | |
| em_entity_assembly | em_sample_support | em_ctf_correction | |
| em_entity_assembly_molwt | em_specimen | ||
| em_entity_assembly_naturalsource | em_staining | em_2d_crystal_entity | |
| em_entity_assembly_recombinant | em_vitrification | em_3d_crystal_entity | |
| em_virus_entity | em_helical_entity | ||
| em_virus_natural_host | em_single_particle_entity | ||
| em_virus_shell | em_fiducial_markers* | ||
| em_focused_ion_beam* | em_euler_angle_assignment* | ||
| em_grid_pretreatment* | em_final_classification* | ||
| em_diffraction | em_high_pressure_freezing* | em_start_model* | |
| em_diffraction_shell | em_shadowing* | ||
| em_diffraction_stats | em_support_film* | Structure | |
| em_image_recording | em_tomography* | em_3d_fitting | |
| em_image_scans | em_tomography_specimen* | em_3d_fitting_list | |
| em_imaging | em_ultramicrotomy* | em_fsc_curve* | |
| em_imaging_optics |
FIG. 5.Workshop Participant Photos. (a) 2004 Cryo-EM structure deposition workshop; (b) 2010 EM Validation Task Force (EM VTF) Workshop; (c) 2011 model challenge workshop; (d) 2017 joint challenges workshop. Image in (d) Reprinted with permission from C. L. Lawson and W. Chiu, J. Struct. Biol. 204(3), 523–526 (2018). Copyright 2018 Elsevier.
FIG. 6.Overall structure of the EM extension data dictionary circa 2005. New categories of data items recommended by participants of the 2004 Cryo-EM Structure Deposition Workshop are shown in green.
FIG. 7.Changing cryo-EM resolution landscape. Annual distribution of depositor-reported resolution for map entries released into EMDB. The sharp increase at 2–4 Å resolution is a direct consequence of the recent advances in image detection and processing. Inset: example Fourier Shell Correlation (FSC) plot, which is the current standard for estimating map resolution. The correlation between two independent half-map reconstructions (blue curve) falls with decreasing spatial frequency; the resolution estimate (in this case 3.9 Å) is read at FSC = 0.143 (dash-dotted horizontal line). Plot source: emdataresource.org. Inset FSC plot source: EMDB visual analysis.