Literature DB >> 32002441

Evolving data standards for cryo-EM structures.

Catherine L Lawson1, Helen M Berman, Wah Chiu.   

Abstract

Electron cryo-microscopy (cryo-EM) is increasingly being used to determine 3D structures of a broad spectrum of biological specimens from molecules to cells. Anticipating this progress in the early 2000s, an international collaboration of scientists with expertise in both cryo-EM and structure data archiving was established (EMDataResource, previously known as EMDataBank). The major goals of the collaboration have been twofold: to develop the necessary infrastructure for archiving cryo-EM-derived density maps and models, and to promote development of cryo-EM structure validation standards. We describe how cryo-EM data archiving and validation have been developed and jointly coordinated for the Electron Microscopy Data Bank and Protein Data Bank archives over the past two decades, as well as the impact of evolving technology on data standards. Just as for X-ray crystallography and nuclear magnetic resonance, engaging the scientific community via workshops and challenging activities has played a central role in developing recommendations and requirements for the cryo-EM structure data archives.
© 2020 Author(s).

Entities:  

Year:  2020        PMID: 32002441      PMCID: PMC6980868          DOI: 10.1063/1.5138589

Source DB:  PubMed          Journal:  Struct Dyn        ISSN: 2329-7778            Impact factor:   2.920


INTRODUCTION

Electron cryo-microscopy (cryo-EM) has very recently become a mainstream area of structural biology and medicine, enabling 3D visualization of a wide variety of biologically important complexes that were previously inaccessible to science. Early cryo-EM 3D density maps typically lacked atomic detail, yielding only the overall molecular shape, but could still sometimes be interpreted at a “pseudo-atomic” level via fitting of previously known coordinates or homology models [Fig. 1(a)]. Recent major technological advances now make it increasingly possible to directly visualize atomic details [Fig. 1(b)]. These achievements were recognized by the award of the 2017 Chemistry Nobel Prize to cryo-EM pioneers Dubochet, Frank, and Henderson.
FIG. 1.

Cryo-EM: contrasting early (2000) vs recent (2019). (a) Cryo-EM structure of the E. coli 70S ribosome determined by the Frank group at 11.5 Å, one of the first maps deposited in the EMDB archive (EMD-1003). It is shown here superimposed with manually fitted components deposited to PDB (1eg0). (b) Helical segment of the 1.8 Å apoferritin map used as a target in the 2019 model challenge, with a fitted model.

Cryo-EM: contrasting early (2000) vs recent (2019). (a) Cryo-EM structure of the E. coli 70S ribosome determined by the Frank group at 11.5 Å, one of the first maps deposited in the EMDB archive (EMD-1003). It is shown here superimposed with manually fitted components deposited to PDB (1eg0). (b) Helical segment of the 1.8 Å apoferritin map used as a target in the 2019 model challenge, with a fitted model. The development of cryo-EM is directly reflected by the growth of cryo-EM structure depositions contributed worldwide to public data archives [Fig. 2(a)]. The archiving systems and underlying data standards supporting deposition, annotation, release, and validation of cryo-EM structures and the associated metadata describing cryo-EM experiments have been developed over time to support this growth. We outline here the history of these systems and describe the process by which data standards have been developed, highlighting the role of engaging the scientific community to develop recommendations and requirements. The archiving systems and standards continue to evolve as technology drives the need for new descriptors and validation metrics.
FIG. 2.

Growth of data archives and community activity timeline. (a) Released map entries in EMDB and released EM model coordinate entries in PDB are shown, cumulative by year. Milestones (indicated with arrows) are described in the main text. Plot source: emdataresource.org. (b) Workshops (yellow circles) and challenges (orange circles) related to data standards and validation development are plotted according to the year they were held. Numbers within the circles correspond to Tables II and III rows.

Growth of data archives and community activity timeline. (a) Released map entries in EMDB and released EM model coordinate entries in PDB are shown, cumulative by year. Milestones (indicated with arrows) are described in the main text. Plot source: emdataresource.org. (b) Workshops (yellow circles) and challenges (orange circles) related to data standards and validation development are plotted according to the year they were held. Numbers within the circles correspond to Tables II and III rows.
TABLE II.

Cryo-EM community data archiving and validation workshops.

#YearTitle/locationOrganizersDescriptionKey outcomes
12002IIMS Workshop, UKKim Henrick, José-María Carazo, Stephen FullerPromote software development in the field of 3DEM molecular structure determinationGuidelines and release policies for the new EMDB archive.10,11,39 Deposition system to collect EMDB data and maps9
220043DEM Developers workshop, UKKim HenrickDeveloper review of tools and software practices used in the field of cryoEMPriorities developed for EMDB including electron tomography, PDB-EMDB cross-referencing, lossless map conversion, review of community map standards and conventions
32004Cryo-EM Structure Deposition Workshop, NJ, USAHelen Berman, Wah Chiu, Michael RossmannDevelop community consensus on data items needed for deposition of maps and atomic models derived from cryoEMNeed for deposition one-stop-shop articulated. Recommendations incorporated in extended EM data dictionary13
420053DEM Developers Workshop, UKKim HenrickIntroduced EM data dictionary to software developers to facilitate its integration into 3DEM packages and electronic notebooksAgreement to adopt a common set of conventions for maps22
52006CryoEM Standards Task Force, TX, USAWah Chiu, David Belnap, José-María CarazoGather cryoEM map conventions and formats with associated metadata used by different developersKey conventions (e.g., Euler angles) were evaluated for interoperability and conversion tools were created40,41
62008Electron Crystallography Data Model Workshop, CA, USAWah Chiu, Cathy LawsonGather expert advice on metadata items in the EM dictionary pertaining to electron crystallographyRecommendations incorporated into EM data dictionary13
72010EM Validation Task Force, NJ, USAHelen Berman, Wah Chiu, Gerard Kleywegt, Cathy LawsonExpert review of potential validation standards for maps and models produced by 3DEM reconstructionRecommendations summarized in white paper,25 and implemented in OneDep validation reports14,18
82011Model Challenge Workshop, HI, USASteve Ludtke, Cathy Lawson, Gerard Kleywegt, Helen Berman, Wah ChiuComputational groups described and compared tools they used to model a selected set of published cryoEM structuresResults published in Biopolymers journal special issue42
92011Data Management Challenges in 3DEM, UKArdan Patwardhan, Gerard Kleywegt, Jason SwedlowGather expert advice on key topics in data archiving and validation for 3DEM data, including data model, validation, raw-data archivingRecommendations summarized in white paper.23 Web-based visualization tools developed.28 EMPIAR raw data archive created.15 Extended data model implemented17
1020123DEM Modeling Workshop, TX, USAWah ChiuCurrent challenges in creating and validating coordinate models built into cryo-EM mapsRecommendations gathered for modeling and validation standards and future model challenges
1120123D Cellular Context for the Macromolecular World, UKArdan Patwardhan, Gerard Kleywegt, Jason SwedlowDiscussions on data archiving and validation for emerging 3D cellular imaging techniquesRecommendations summarized in white paper24
122012Instruct Image Processing Center Developer Workshop, SpainJosé-María CarazoEffort to standardize information exchange in single particle reconstruction and to establish algorithm benchmarkingCTF benchmarking challenge43 and EMX convention for CTF and single-particle parameters developed44
132015CryoEM Model Validation Workshop, MA, USAWah Chiu, Cathy Lawson, Paul AdamsModeling experts met to present and discuss challenges in modeling into cryoEM mapsGathered recommendations5 directly used in development of 2016 Model Challenge
142015Building Bridges between Cellular and Molecular Structural Biology, UKArdan Patwardhan, Gerard KleywegtExpert discussions on how to integrate structural data from a diverse range of public archives covering cellular and molecular structural biologyRecommendations to develop tools/file formats for map segmentation, and tools to support biological structure annotation described in white paper45
152017Model Challenge Assessment, LA, USAWah Chiu, Cathy Lawson, Paul AdamsFirst pass analyses of models and data submitted to the 2016 Model ChallengeRecommended metrics implemented on model challenge comparison website46
162017CryoEM Structure Joint Challenges Workshop, CA, USACathy Lawson, Wah ChiuJoint review of the 2016 Map and Model Challenge activitiesResults described in Journal of Structural Biology special issue;30 also featured in Nature Methods editorial47
172019Frontiers in cryo-EM Validation, UKGerard Kleywegt, Garib Murshudov, Elena Orlova, Ardan Patwhardhan, Alan Roseman, Peter Rosenthal, Maya Topf, Martyn WinnDiscuss current and future community needs/challenges for validation tools to support maps and models from single-particle analysisMeeting featured in 2018 Nature editorial.48 Recommendations white paper is in preparation
182019Model Metrics Workshop, CA, USACathy Lawson, Wah ChiuReview modeling processes and assessment results of the 2019 Model Metrics Challenge and plan for future eventsRecommendations editorial and full event manuscript are in preparation
TABLE III.

Cryo-EM community challenge activities.

#YearEventOrganizing group(s)DescriptionKey outcomes
C12004Particle Picking ChallengeNational Resource for Automated Molecular Microscopy (La Jolla)Compare diverse particle selection algorithmsAlgorithms from 12 developer groups were compared and contrasted49
C22010Model ChallengeEMDataResource, NCMIComputational groups were asked to apply their tools to a selected set of published cryoEM structures.Results published in Biopolymers journal special issue42
C32015CTF ChallengeInstruct Image Processing Center (Madrid), NCMIEvaluate community/algorithm abilities to estimate key parameters of EM Contrast Transfer Function for a wide set of experimental conditionsCTF benchmarking challenge summary and results published43
C42016Map ChallengeEMDataResourceEstablish benchmark datasets, develop best practices, evolve criteria for validation of 3DEM reconstructionsResults described in Journal of Structural Biology special issue;30 also featured in Nature Methods Editorial47
C52016Model ChallengeEMDataResourceEstablish benchmark datasets, develop best practices, evolve criteria for validation of 3DEM map-derived modelsResults described in Journal of Structural Biology special issue;30 also featured in Nature Methods Editorial47
C62019Model Metrics ChallengeEMDataResourceIdentify metrics most suitable for evaluating/comparing fit of atomic coordinate models into cryo-EM maps in 1.8–3.0 Å resolution rangeRecommendations editorial and full event manuscript are in preparation

CRYO-EM STRUCTURE DATA ARCHIVING

The Protein Data Bank (PDB), established in 1971 as a public archive for atomic coordinates of biological structures derived from X-ray crystallography, began accepting models derived from nuclear magnetic resonance spectroscopy (NMR) in 1988, and from electron microscopy (EM) and electron crystallography (EC) in 1990. In recognition of the fact that publicly available 3D density maps could accelerate discovery in structural biology and medicine, the Electron Microscopy Data Bank (EMDB) at the European Bioinformatics Institute (EBI) was launched in 2002 with support from the European Union. EMDB's launch was quickly followed by a pair of editorials in Structure and Nature Structural Biology encouraging electron microscopists to deposit their density maps. Similar to BioMagResBank (BMRB), which archives experimental data from NMR, the EMDB accepts maps determined using any cryo-EM method, including single particle reconstruction with any symmetry, helical filament reconstruction, subtomogram averaging, tomography, and electron crystallography, along with metadata describing the full experimental workflow (Fig. 3).
FIG. 3.

Cryo-EM experimental workflow. The major steps for determining a structure using cryo-EM single particle reconstruction are shown. The specimen shown is Helicobacter pylori vacuolating cytotoxin A oligomer.

Cryo-EM experimental workflow. The major steps for determining a structure using cryo-EM single particle reconstruction are shown. The specimen shown is Helicobacter pylori vacuolating cytotoxin A oligomer. In 2006, scientists in the UK (EMDB) and USA [Research Collaboratory for Structural Bioinformatics (RCSB) and the National Center for Macromolecular Imaging (NCMI)] initiated a collaboration funded by National Institutes of Health (NIH) aimed to ensure that data archiving and validation standards for cryo-EM maps and models would be coordinated internationally. The project, formerly known as EMDataBank, was recently rebranded as EMDataResource (EMDR; emdataresource.org). The EMDR project website serves as a global resource for cryo-EM structure data archiving and retrieval, news, events, software tools, data standards, validation methods, and community challenges. EMDep, designed and implemented at EBI, was the first system designed to collect and annotate maps and associated metadata for EMDB. In 2008, the EMDR team created a joint map+model deposition system for cryo-EM structures by connecting EMDep with AutoDep and ADIT (AutoDep Input Tool), the PDB data collection systems at the EBI and RCSB sites. The system that was implemented enabled a one stop shop for cryo-EM model and map depositions. Joint curation ensured that maps and models were deposited at the same physical scale and in the same coordinate frame. Journals that publish cryo-EM structures began to require authors to deposit maps to EMDB and models to PDB. This system supported the processing and release of nearly 4000 maps and 1000 models over a nine-year period (2008–2015). In 2012, the Electron Microscopy Public Image Archive (EMPIAR) was established at EBI. Supported by the UK Medical Research Council and UK Biotechnology & Biological Sciences Research Council, EMPIAR enables cryo-EM scientists to archive and share raw images and intermediate data files associated with their maps deposited to EMDB. Making recently collected image data broadly available has multiple benefits, including accelerating development of reconstruction software, and enriching resources for cryo-EM scientists in training. EMPIAR has its own deposition and curation system, but accesses metadata from the related EMDB entry. Individual entry storage sizes can be up to 15 TB. Approximately 4% of EMDB entries deposited since 2012 have associated EMPIAR entries. The Worldwide PDB (wwPDB) is the global organization that manages the PDB archive. In 2016, deposition, annotation, and release of cryo-EM structure maps and models were migrated to the wwPDB OneDep system (Fig. 4), using requirements that were initiated and developed by EMDR. At that time, it became mandatory to deposit maps to EMDB for all cryo-EM models deposited to PDB. In addition, structure validation reports, which can be provided by depositors in an official PDF format to journal editors and reviewers as part of manuscript review, began to be produced for all cryo-EM structures.
FIG. 4.

Current systems for deposition, archiving, and accessing cryo-EM structures. Worldwide, every cryo-EM structure (map, experimental metadata, and optionally coordinate model) is deposited and processed through the wwPDB OneDep system (deposit.wwpdb.org), following the same annotation and validation workflow also used for X-ray crystallography and NMR structures. Map-only depositions yield an EMDB entry, while joint map+model depositions yield both EMDB and PDB entries. Workflow metadata collected in OneDep are passed to both EMDB and PDB. EMDB holds all workflow metadata while PDB holds a subset of the metadata; see Table I. The PDB and EMDB archives are accessible by FTP and rsync at wwPDB mirror sites in the US, UK, and Japan. Released cryo-EM structure data from both archives can be accessed via EMDataResource, EMDB, and wwPDB partner websites.

Current systems for deposition, archiving, and accessing cryo-EM structures. Worldwide, every cryo-EM structure (map, experimental metadata, and optionally coordinate model) is deposited and processed through the wwPDB OneDep system (deposit.wwpdb.org), following the same annotation and validation workflow also used for X-ray crystallography and NMR structures. Map-only depositions yield an EMDB entry, while joint map+model depositions yield both EMDB and PDB entries. Workflow metadata collected in OneDep are passed to both EMDB and PDB. EMDB holds all workflow metadata while PDB holds a subset of the metadata; see Table I. The PDB and EMDB archives are accessible by FTP and rsync at wwPDB mirror sites in the US, UK, and Japan. Released cryo-EM structure data from both archives can be accessed via EMDataResource, EMDB, and wwPDB partner websites.
TABLE I.

PDBx/mmCIF EM Dictionary used by wwPDB OneDep. As described in Fig. 4, all workflow metadata categories are collected by the OneDep system. Most categories are archived in both PDB and EMDB; asterisked categories are archived only in EMDB.

Top levelSample/specimen preparationImage processing & reconstructionExperimental data
em_experimentem_bufferem_3d_reconstruction
em_softwareem_buffer_componentem_image_processingem_map*
em_crystal_formationem_particle_selectionem_structure_factors*
Sample descriptionem_embeddingem_volume_selectionem_layer_lines*
em_entity_assembly em_sample_support em_ctf_correction
em_entity_assembly_molwtem_specimen
em_entity_assembly_naturalsourceem_stainingem_2d_crystal_entity
em_entity_assembly_recombinantem_vitrificationem_3d_crystal_entity
em_virus_entityem_helical_entity
em_virus_natural_hostem_single_particle_entity
em_virus_shellem_fiducial_markers*
em_focused_ion_beam*em_euler_angle_assignment*
Data collectionem_grid_pretreatment*em_final_classification*
em_diffractionem_high_pressure_freezing*em_start_model*
em_diffraction_shellem_shadowing*
em_diffraction_statsem_support_film*Structure analysis
em_image_recordingem_tomography*em_3d_fitting
em_image_scansem_tomography_specimen*em_3d_fitting_list
em_imagingem_ultramicrotomy*em_fsc_curve*
em_imaging_optics

CREATING A DATA DICTIONARY

The foundation of any data repository is its data representation scheme. Based in part on the International Union of Crystallography dictionary for small molecule crystallography Crystallographic Information File (CIF), the Macromolecular Crystallographic Information File (mmCIF) was developed in the 1990s to support rich data content for the macromolecular crystallographic experiment and its results, with precise data type definitions, logical groupings for related data items, explicit parent-child relationships, enumerations for controlled vocabulary, extensibility, and many other features embedded in a computer-readable format. This dictionary is now the Master Format for the PDB. Particularly relevant for cryo-EM, very large complexes are readily represented, since mmCIF has no limits on the number of atoms or polymer chains. Following the lead of the crystallographic community, an mmCIF extension dictionary containing data terms for cryo-EM experiments was drafted jointly in the early 2000s based on requirements provided by the cryo-EM community. The dictionary was vetted and expanded by the scientific community via multiple workshops, and subsequently integrated by EMDR into the PDBx/mmCIF dictionary for use in the hybrid joint map+model deposition system. In 2015, based on feedback from additional workshops, the EMDR team further modified and expanded the dictionary in several ways. Hierarchical descriptions of complex specimens were enabled, and experimental descriptions for each of the cryo-EM methods were extended. The >500 term EM dictionary (Table I) is now the basis for cryo-EM depositions to both EMDB and PDB in the wwPDB OneDep system. The dictionary continues to be updated regularly to support the evolving needs of the scientific community. PDBx/mmCIF EM Dictionary used by wwPDB OneDep. As described in Fig. 4, all workflow metadata categories are collected by the OneDep system. Most categories are archived in both PDB and EMDB; asterisked categories are archived only in EMDB.

GATHERING COMMUNITY REQUIREMENTS

Developing a trusted scientific data repository requires careful attention to the interplay among science, technology, and community interest. Workshops and Challenges are two types of community outreach activities that are effective in bringing these three elements together; both have been employed multiple times to move EM data and validation standard development forward [Fig. 2(b)]. Workshops (typically 2–3 days) enable groups of experts to review current practices and develop recommendations, while Challenges (taking place over several months to a year) provide forums for experts to exercise and demonstrate current workflows and test novel procedures. Challenges can incorporate one or more workshops for planning or results review. Tables II and III list and summarize goals and outcomes of 18 international workshops and six challenges held over the past two decades. Below we provide additional descriptions of selected activities, as well as a montage of workshop photos (Fig. 5).
FIG. 5.

Workshop Participant Photos. (a) 2004 Cryo-EM structure deposition workshop; (b) 2010 EM Validation Task Force (EM VTF) Workshop; (c) 2011 model challenge workshop; (d) 2017 joint challenges workshop. Image in (d) Reprinted with permission from C. L. Lawson and W. Chiu, J. Struct. Biol. 204(3), 523–526 (2018). Copyright 2018 Elsevier.

Cryo-EM community data archiving and validation workshops. Cryo-EM community challenge activities. Workshop Participant Photos. (a) 2004 Cryo-EM structure deposition workshop; (b) 2010 EM Validation Task Force (EM VTF) Workshop; (c) 2011 model challenge workshop; (d) 2017 joint challenges workshop. Image in (d) Reprinted with permission from C. L. Lawson and W. Chiu, J. Struct. Biol. 204(3), 523–526 (2018). Copyright 2018 Elsevier.

EM extension dictionary development

The main goal of the 2004 Cryo-EM Structure Deposition Workshop [Fig. 5(a)], attended by ∼30 scientists including cryo-EM, image processing, crystallography, database, funding agency, and journal representatives, was to develop a global community consensus on data items for deposition of density maps and atomic models derived from cryo-EM studies. Terms were reviewed category-by-category in two focus groups, and recommendations for revisions and extensions were obtained (Fig. 6). Furthermore, participants unanimously requested a “one-stop shop” for deposition and retrieval of the cryo-EM map and model data. Following the workshop, the dictionary was further revised with follow-up input from attendees. The resulting dictionary was presented at the 2005 3DEM Gordon Research Conference, and EMDR's project website became the requested one-stop-shop portal.
FIG. 6.

Overall structure of the EM extension data dictionary circa 2005. New categories of data items recommended by participants of the 2004 Cryo-EM Structure Deposition Workshop are shown in green.

Overall structure of the EM extension data dictionary circa 2005. New categories of data items recommended by participants of the 2004 Cryo-EM Structure Deposition Workshop are shown in green. The EM extension dictionary was next reviewed by software developers at the 2005 3DEM Developers Workshop to facilitate its integration with major 3DEM packages and electronic notebook systems. There were two important outcomes: (a) the draft dictionary was unanimously accepted by the participants and (b) a set of proposed conventions for describing EM micrographs and density maps was developed. The conventions enable a standardized approach to image interpretation and presentation, with recommended units for common parameters, rotation and symmetry notations, and common sense principles such as “objects should have overall positive density” (early image correction procedures sometimes generated objects darker than their background depending on image processing and display software). The conventions were subsequently incorporated into the EM extension dictionary to facilitate representation of map-related data items in PDB and EMDB. Data standards for cryo-EM structures were further addressed at the 2011 Data Management Challenges in 3D Electron Microscopy Workshop and the 2015 Building Bridges between Cellular and Molecular Structural Biology Workshop. Needs for hierarchical sample description as well as extensions to cryo-EM experimental sub-method descriptions were recognized. A future archival segmentation file format, for which requirements were gathered at the 2015 meeting, will make use of the hierarchy, enabling map regions to be connected with biological annotations.

Developing validation standards

At the 2010 EM Validation Task Force (EM VTF) Workshop [Fig. 5(b)], an international group of experts explored how to assess cryo-EM maps, models, and other data deposited into EMDB and PDB. For maps, participants recognized a critical need to develop standards for assessing map resolution and accuracy. They recommended establishing two fully independent image datasets at the outset for evaluating resolution by Fourier Shell Correlation (FSC); at the time, this was not typically done, but it is now the standard procedure. However, they also advised that maps still be carefully inspected to ensure that the resolution estimate by FSC is in accordance with the map's visible features. The EM VTF's 2012 white paper notably called for the scientific community to develop new criteria for the evaluation of maps and for the evaluation of fit of the model to the experimental map density. In contrast, in 2011 the VTF for X-ray crystallography published a comprehensive and detailed set of recommendations to validate structures and experimental data determined using X-ray crystallography. The difference reflects the fact that cryo-EM is still a rapidly evolving field. Validation standards and raw image data archiving were additional topics of discussion at the 2011 Data Management Challenges in 3D Electron Microscopy Workshop. Several services were developed and implemented at EBI in response to workshop recommendations. The EMPIAR raw data archive was created, and stand-alone FSC and tilt-pair servers were developed for depositors to validate their cryo-EM maps. In addition, Visual Analysis web pages were designed to display an informative series of images and plots for every EMDB entry, and to help users assess data quality of released cryo-EM maps and models. Two EMDR-sponsored challenges subsequently aimed to address the 2010 EM VTF's call for improved metrics to evaluate both maps and fit of models to experimental data (2016 Map and Model Challenges). Following the 2017 Joint Challenges Workshop at Stanford, which had over 90 participants [Fig. 5(d)]; key results and recommendations were collated into a virtual special issue of the Journal of Structural Biology published in December 2018. The Map Challenge provided a unique forum for critically evaluating the standard method for estimating map resolution by FSC (Fig. 7, inset). A key observation was that as currently practiced, the procedure is not sufficiently standardized: a number of different variables (e.g., map box size, voxel size, filtering and masking practice, and threshold value for interpretation) can substantially impact the outcome. As a result, different expert practitioners can arrive at different resolution estimates for the same level of map details. For example, two of the apoferritin maps submitted to the challenge had practitioner-estimated resolutions of 3.1 Å and 3.5 Å, respectively, though they were indistinguishable by eye. A direct conclusion is that any “reported-resolution”-based search or ranking for maps or associated models will have limited reliability. In follow-up discussions at the 2019 Frontiers in Cryo-EM Validation Workshop, one suggestion made was to have the archives independently estimate resolution by FSC from deposited unmasked, minimally filtered half-maps. This procedure would likely make comparisons between maps less susceptible (though not completely impervious) to variations in practitioner practice.
FIG. 7.

Changing cryo-EM resolution landscape. Annual distribution of depositor-reported resolution for map entries released into EMDB. The sharp increase at 2–4 Å resolution is a direct consequence of the recent advances in image detection and processing. Inset: example Fourier Shell Correlation (FSC) plot, which is the current standard for estimating map resolution. The correlation between two independent half-map reconstructions (blue curve) falls with decreasing spatial frequency; the resolution estimate (in this case 3.9 Å) is read at FSC = 0.143 (dash-dotted horizontal line). Plot source: emdataresource.org. Inset FSC plot source: EMDB visual analysis.

Changing cryo-EM resolution landscape. Annual distribution of depositor-reported resolution for map entries released into EMDB. The sharp increase at 2–4 Å resolution is a direct consequence of the recent advances in image detection and processing. Inset: example Fourier Shell Correlation (FSC) plot, which is the current standard for estimating map resolution. The correlation between two independent half-map reconstructions (blue curve) falls with decreasing spatial frequency; the resolution estimate (in this case 3.9 Å) is read at FSC = 0.143 (dash-dotted horizontal line). Plot source: emdataresource.org. Inset FSC plot source: EMDB visual analysis. The 2017 Joint Challenges Workshop also sparked lively discussions about the potential for model-based metrics to estimate not only model quality, but also to provide one or more independent measures of map resolvability. Several procedures of this type have been proposed and tested. EMRinger evaluates whether density peaks in the map fall within the possible rotameric configuration for the carbon-β atom in a side chain. Other procedures have been developed to measure map quality. For example, Z-scores capture how much larger the cross-correlations score (CCS) is for atoms in such features at their placed location compared to the CCS at displaced positions. Another recently devised experimental metric, Q-score, measures resolvability of the individual atom(s) in reference to the model.

Changing validation goals

Looking at the distribution of reported resolution of maps released into EMDB annually over the past few years (Fig. 7), one can readily see a striking sharp recent increase in maps in the 2–4 Å range. This development is a direct result of recent technological improvements, and it changes the “goal-posts” for developing validation methods, adding urgency to the need for metrics to validate structures at near-atomic to atomic resolution. The 2019 Model Metrics Challenge and associated 2019 Model Metrics Workshop were designed with the goal of evaluating metrics for map-model fit of moderately high-resolution maps (3.1–1.8 Å). A full write-up will be published elsewhere, and we note two findings here. First, the new metrics that by some means combine both model and map quality (e.g., EMRinger and Q-score) appear to be quite useful for ranking sets of structures. Second, refined Atomic Displacement Parameters (ADPs), which were included in about half of the models submitted by challenge participants, could modestly improve fit of the model to the map, particularly for the highest resolution (1.8 Å) target map. The meaning of refined ADPs/B-factors in the context of a cryo-EM density map is less clear. Best practices (e.g., to avoid overfitting) will need to be investigated.

WHERE WE ARE, WHAT'S NEXT

The initial EM validation report format released in 2016 focused on assessment of model geometry for PDB entries. As will be reported in more detail in a future publication, additional sections covering map analysis and visualization and map-model fit analysis and visualization will become available to EMDB and PDB depositors by early 2020. The Visual Analysis web pages hosted at EBI since 2012 have served as a test-bed for the development of the new features, which will include (a) several types of orthogonal images of the deposited map and map superimposed with model; (b) FSC curves to support depositor-reported map resolution; and (c) map-model fit statistics via “atom inclusion,” the percentage of modeled atoms falling inside a map at its recommended contour level. The new features will enable scientists (depositors, annotators, journal editors, and manuscript reviewers) to make initial assessments of map features, map quality, and map-model fit, bypassing the need to first download/view files in a graphics program. A planned meeting in January 2020 at EBI organized by wwPDB will bring together cryo-EM and data archiving experts to discuss the current state of data archiving for cryo-EM structures derived from the single-particle reconstruction method, and to solicit recommendations on what data should be included and/or made mandatory in depositions and associated validation reports. The following points might be considered as part of the deliberations: Can estimation of map resolution be better standardized across the community? This would enable fairer comparisons among maps determined in different laboratories and using different software packages. Additional metrics (beyond atom inclusion) are available that describe map-model fit, including several novel procedures that effectively yield a joint assessment of map and model quality in a broad resolution range. How should map-model fit be reported as part of a structure determination and in a joint map+model deposition? What best practice recommendations can be made for refinement of ADPs in cryo-EM models at different resolutions? How should we evaluate multiple structures determined from a single specimen that may have variable quality and resolution?
  10 in total

Review 1.  Transactions from the 69th Annual Meeting of the American Crystallographic Association: Data best practices-current state and future needs.

Authors:  Nicholas K Sauter; John P Rose; Talapady N Bhat
Journal:  Struct Dyn       Date:  2020-03-24       Impact factor: 2.920

2.  Three-dimensional Structure Databases of Biological Macromolecules.

Authors:  Vaishali P Waman; Christine Orengo; Gerard J Kleywegt; Arthur M Lesk
Journal:  Methods Mol Biol       Date:  2022

3.  Objective crystallographic symmetry classifications of a noisy crystal pattern with strong Fedorov-type pseudosymmetries and its optimal image-quality enhancement.

Authors:  Peter Moeck
Journal:  Acta Crystallogr A Found Adv       Date:  2022-04-28       Impact factor: 2.331

4.  Cryo-EM model validation recommendations based on outcomes of the 2019 EMDataResource challenge.

Authors:  Catherine L Lawson; Andriy Kryshtafovych; Paul D Adams; Pavel V Afonine; Matthew L Baker; Benjamin A Barad; Paul Bond; Tom Burnley; Renzhi Cao; Jianlin Cheng; Grzegorz Chojnowski; Kevin Cowtan; Ken A Dill; Frank DiMaio; Daniel P Farrell; James S Fraser; Mark A Herzik; Soon Wen Hoh; Jie Hou; Li-Wei Hung; Maxim Igaev; Agnel P Joseph; Daisuke Kihara; Dilip Kumar; Sumit Mittal; Bohdan Monastyrskyy; Mateusz Olek; Colin M Palmer; Ardan Patwardhan; Alberto Perez; Jonas Pfab; Grigore D Pintilie; Jane S Richardson; Peter B Rosenthal; Daipayan Sarkar; Luisa U Schäfer; Michael F Schmid; Gunnar F Schröder; Mrinal Shekhar; Dong Si; Abishek Singharoy; Genki Terashi; Thomas C Terwilliger; Andrea Vaiana; Liguo Wang; Zhe Wang; Stephanie A Wankowicz; Christopher J Williams; Martyn Winn; Tianqi Wu; Xiaodi Yu; Kaiming Zhang; Helen M Berman; Wah Chiu
Journal:  Nat Methods       Date:  2021-02-04       Impact factor: 28.547

5.  State-dependent sequential allostery exhibited by chaperonin TRiC/CCT revealed by network analysis of Cryo-EM maps.

Authors:  Yan Zhang; James Krieger; Karolina Mikulska-Ruminska; Burak Kaynak; Carlos Oscar S Sorzano; José-María Carazo; Jianhua Xing; Ivet Bahar
Journal:  Prog Biophys Mol Biol       Date:  2020-08-28       Impact factor: 3.667

Review 6.  An RNA-centric historical narrative around the Protein Data Bank.

Authors:  Eric Westhof; Neocles B Leontis
Journal:  J Biol Chem       Date:  2021-03-18       Impact factor: 5.157

Review 7.  Evolution of Standardization and Dissemination of Cryo-EM Structures and Data Jointly by the Community, PDB and EMDB.

Authors:  Wah Chiu; Michael F Schmid; Grigore Pintilie; Catherine L Lawson
Journal:  J Biol Chem       Date:  2021-03-17       Impact factor: 5.157

8.  EMDA: A Python package for Electron Microscopy Data Analysis.

Authors:  Rangana Warshamanage; Keitaro Yamashita; Garib N Murshudov
Journal:  J Struct Biol       Date:  2021-12-13       Impact factor: 2.867

Review 9.  Discerning best practices in XFEL-based biological crystallography - standards for nonstandard experiments.

Authors:  Alexander Gorel; Ilme Schlichting; Thomas R M Barends
Journal:  IUCrJ       Date:  2021-06-30       Impact factor: 4.769

Review 10.  Deep Learning-Based Advances in Protein Structure Prediction.

Authors:  Subash C Pakhrin; Bikash Shrestha; Badri Adhikari; Dukka B Kc
Journal:  Int J Mol Sci       Date:  2021-05-24       Impact factor: 5.923

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.