Literature DB >> 28580912

Trends in the Electron Microscopy Data Bank (EMDB).

Ardan Patwardhan1.   

Abstract

Recent technological advances, such as the introduction of the direct electron detector, have transformed the field of cryo-EM and the landscape of molecular and cellular structural biology. This study analyses these trends from the vantage point of the Electron Microscopy Data Bank (EMDB), the public archive for three-dimensional EM reconstructions. Over 1000 entries were released in 2016, representing almost a quarter of the total number of entries (4431). Structures at better than 6 Å resolution now represent one of the fastest-growing categories, while the share of annually released tomography-related structures is approaching 20%. The use of direct electron detectors is growing very rapidly: they were used for 70% of the structures released in 2016, in contrast to none before 2011. Microscopes from FEI have an overwhelming lead in terms of usage, and the use of the RELION software package continues to grow rapidly after having attained a leading position in the field. China is rapidly emerging as a major player in the field, supplementing the US, Germany and the UK as the big four. Similarly, Tsinghua University ranks only second to the MRC Laboratory for Molecular Biology in terms of involvement in publications associated with cryo-EM structures at better than 4 Å resolution. Overall, the numbers point to a rapid democratization of the field, with more countries and institutes becoming involved.

Entities:  

Keywords:  EMDB; Electron Microscopy Data Bank; cryo-EM; direct electron detector; electron tomography; resolution

Mesh:

Year:  2017        PMID: 28580912      PMCID: PMC5458492          DOI: 10.1107/S2059798317004181

Source DB:  PubMed          Journal:  Acta Crystallogr D Struct Biol        ISSN: 2059-7983            Impact factor:   7.652


Introduction

Recent technological advances such as the introduction of the direct electron detector have transformed the field of cryo-EM and the landscape of molecular and cellular structural biology (Kühlbrandt, 2014 ▸; Bai et al., 2015 ▸; Eisenstein, 2016 ▸). Structures achieving resolutions that were once considered to be the preserve of the more established structural techniques of X-ray crystallography and nuclear magnetic resonance (NMR) are becoming a routine occurrence. At the same time, there is a greater emphasis on trying to understand the cellular context of macromolecules by placing sub-tomogram averages into tomographic reconstructions and by exploiting correlative imaging techniques (Davies et al., 2011 ▸; Mattei et al., 2016 ▸). The Electron Microscopy Data Bank (EMDB; http://emdb-empiar.org) was established in 2002 (Tagari et al., 2002 ▸) at the European Bioinformatics Institute (EMBL–EBI) and is the single global public repository for three-dimensional reconstructions derived from EM data (Lawson et al., 2016 ▸; Patwardhan & Lawson, 2016 ▸). The EMDB contains structures determined by single-particle averaging, electron crystallo­graphy and electron tomography (ET). Its entries range from high-resolution structures in which side-chain densities are resolved to low-resolution reconstructions of cellular samples in which the distributions of biomacromolecules can be studied. The EMDB is a unique international resource and enjoys overwhelming support from the EM community. In this study, I have data-mined the EMDB and PubMed (for publications related to EMDB entries) to obtain a birds-eye perspective of the trends affecting the field. This analysis will be useful for those wanting to obtain a general idea of where the field is heading and, more concretely, for informing and justifying future investments in technology.

Methods

This analysis is based primarily on the metadata included with the publicly released EMDB entries. The information is taken at face value and includes details about the sample, microscopy, image processing and validation (for example, the reported resolution). In order to obtain this information I used the EMDB advanced search that is available via a web form (http://emdb-empiar.org/emsearch) and an API, and the EMStats web service that provides dynamic interactive charts on the current state of the EMDB (http://emdb-empiar.org/emstats). The API queries are summarized in Table 1 ▸. Author affiliation information is not available from EMDB metadata directly. In order to obtain this information, a Python script was written to query PubMed for author affiliation information from publications related to EMDB entries. Manual cleanup of this data had to be performed in Excel to remove redundancy (e.g. ‘UK’ and ‘United Kingdom’). It should also be noted that there are limitations to the consistency of author affiliation information obtained from PubMed in terms of the format and comprehensiveness (prior to 2012 it is only available for corresponding authors, and even now it may not be provided for all authors), which may have some impact on the analysis presented. Moreover, no attempt is made to distinguish between the relative contributions of multiple authors, and all are treated equally.
Table 1

Summary of the search queries used in the analysis

All queries are prefixed by http://www.ebi.ac.uk/pdbe/emdb/searchResults.html/?, e.g. http://www.ebi.ac.uk/pdbe/emdb/searchResults.html/?q=status:REL AND ribosom*. For the direct electron-detector queries it should be noted that for some entries more than one detector has been used and the queries may return incorrect information. All entries with multiple detectors were checked manually and the numbers were adjusted accordingly.

QueryDescriptionRelated tables and figures
q=status:REL AND ribosom*Search for a mention of ‘ribosom’ in all metadata fields of the released entriesTable 2
q=status:REL AND (phage OR virus)Search for ‘phage’ or ‘virus’ in all metadata fields of the released entriesTable 2
q=status:REL AND resolution:[1 TO 4] AND NOT resolution:4Search for released entries with a reported resolution ≥1 Å and <4 ÅFig. 4
q=status:REL AND NOT resolution:[* TO *]Search for released entries where the reported resolution is undefinedFig. 4
q=status:REL AND detector:falconSearch for released entries where an FEI Falcon camera has been usedFig. 5
q=status:REL AND detector:("direct electron")Search for released entries where a Direct Electron camera has been usedFig. 5
q=status:REL AND detector:("gatan k2")Search for released entries where a Gatan K2 camera has been usedFig. 5

Results

There were 4431 released entries by the end of 2016, of which 1065 were released in 2016: an increase of over 50% when compared with the 640 entries released in 2015, suggesting a rapid acceleration in the pace of depositions (Fig. 1 ▸). Extrapolation of the curve (x 4 curve fitted in Excel with R 2 = 0.9994) points to around 10 000 entries by 2020.
Figure 1

EMDB released entries. The cumulative number of released entries is shown as red bars with the y axis on the right-hand side. The number of entries released annually is shown as a blue marked line with the y axis on the left-hand side.

The number of publications associated with new EMDB entries increased by 25% in 2016 to over 300 (Fig. 2 ▸ a). The number of entries per publication continues to grow, gradually reaching ∼3 in 2017 (Fig. 2 ▸ b). This is indicative of the fact that a growing number of EM experiments involve the examination of related structures with small differences obtained, for instance, by using three-dimensional classification to separate the data.
Figure 2

Analysis of publications associated with EMDB entries. (a) Count of the number of unique primary citations associated with new EMDB entries. (b) Ratio of the number of new EMDB entries with associated publications to the number of unique publications.

In Fig. 3 ▸ the number of entries released per year is split between the sub-methods single-particle, helical, sub-tomogram averaging, tomography and crystallography (two-dimensional and three-dimensional). Single-particle entries continue to be the main category, with ∼76% of the total. The numbers of tomography and sub-tomogram entries have been increasing, with a 70% year-on-year increase from 2015 to 2016; the proportion of tomography-related entries as a share of the total is increasing gradually and currently stands at ∼18% of the total. The tomography category itself (i.e. excluding sub-tomogram averaging) has more than doubled in the past two years: eight entries in 2014, 29 in 2015 and 75 in 2016. This could indicate a greater compliance from the ET community (and enforcement by journals) to follow the recommendation derived in consensus with the community that a representative tomographic reconstruction be deposited even for studies that did not involve sub-tomogram averaging (Patwardhan et al., 2012 ▸, 2014 ▸). Ribosomes and viruses continue to be the main sample categories studied, with an ∼40% share of EMDB entries (Table 2 ▸). The numbers over the past two years are in line with the long-term averages of ∼14 and 29% for ribosomes and viruses, respectively.
Figure 3

EMDB entries by EM sub-method. Stacked graph showing the number of annually released EMDB entries by sub-method category: single-particle (blue), helical (red), sub-tomogram averaging (green), tomography (purple) and crystallography (light blue).

Table 2

Numbers and percentages of ribosome and virus-related EMDB entries

YearRibosomesVirusesAll
201035 (16%)69 (31%)224
201147 (18%)69 (27%)259
201228 (7%)155 (37%)417
201363 (12%)209 (38%)544
2014106 (17%)125 (20%)615
2015101 (16%)159 (25%)640
2016137 (13%)294 (28%)1065
Sum517 (14%)1080 (29%)3764
Trends for different resolution bands based on the reported resolution are shown in Fig. 4 ▸(a). The fastest growing bands in the past two years have been the <4 Å and the 4–6 Å bands, which together comprised ∼40% of the entries in 2016. At the same time the numbers in the 8–10 and 10–15 Å bands have not experienced any appreciable growth and have decreased as a proportion of the total from an historic average of greater than 25% to ∼15%. An analysis of resolution trends for single-particle entries (Fig. 4 ▸ b) shows that the median resolution, which had been fairly constant until 2014 at ∼16 Å, is now on a downward trend, reaching ∼6 Å in 2016. The highest resolutions exhibit a stepwise trend, with a substantial drop in 2008 to 4 Å and then in 2015 to below 3 Å. The standard deviation of the resolution has been gradually increasing over the years, indicative of the fact that while the fraction of high-resolution structures is increasing, there are still many structures than can only be determined to low resolutions.
Figure 4

Reported resolutions of EMDB entries. (a) Resolution trends of released EMDB entries. Stacked graph comprised of the number of released entries separated into different resolution bands. (b) Statistical analysis of released EMDB entries from single-particle experiments.

Fig. 5 ▸ presents trends in direct electron-detector usage. Almost 70% of the entries released in 2016 were determined using direct electron detectors and this is up from 47% in 2015 (Fig. 5 ▸ a). Following a rapid rise from 2012 to 2014, over 80% of the <4 Å resolution structures in the past three years were determined using direct electron detectors (Fig. 5 ▸ b). The Gatan K2 and the FEI Falcon II were the main cameras used (Fig. 5 ▸ c), and for <4 Å resolution structures over twice as many released structures in 2016 involved Gatan K2 cameras compared with the FEI Falcon II (Fig. 5 ▸ d).
Figure 5

Direct electron-detector usage in EMDB entries. (a) Blue bars represent the number of released EMDB entries obtained using direct electron detectors (y axis on the left-hand side) and the red marked line represents the fraction of the total (y axis on the right-hand side). (b) The same as (a) but for structures at better than 4 Å resolution. (c) Trends for the three major direct electron-detector manufacturers: FEI, Direct Electron and Gatan. (d) The same as (c) but for structures at better than 4 Å resolution.

Trends for a selection of major software packages used in cryo-EM are analysed in Fig. 6 ▸. The exponential rise in RELION (Scheres, 2012 ▸) usage clearly stands out, and it was used in almost 46% of the structures determined in 2016. The the traditional stalwarts of EM image processing, IMAGIC (van Heel et al., 1996 ▸) and SPIDER (Shaikh et al., 2008 ▸) appear to be stagnant or declining over the past few years in terms of usage. Even the growth in usage of EMAN2 (Tang et al., 2007 ▸), a rising star as recently as 2013, appears to be continuing at a much more measured pace. However, it should be noted that while depositors can specify more than one software package during deposition, there is a tendency for many to only specify the main package used, leading to an underrepresentation of the use of other packages. Therefore, while it is safe to interpret the rapid rise of RELION, declining or stagnant trends should be treated with some caution. Two packages with smaller usage numbers but with longterm rising trends are IMOD (Kremer et al., 1996 ▸) and FREALIGN (Grigorieff, 2007 ▸). The former is associated with the rising number of tomography-related entries in EMDB. Trends for microscope usage (Fig. 7 ▸) show that microscopes by FEI have an overwhelming lead that has been reinforced in recent years.
Figure 6

Usage trends for a selection of major EM software packages. It should be noted that the packages are not mutually exclusive and that more than one package may have been used in the same experiment.

Figure 7

Microscope-usage trends based on microscope manufacturer. It should be noted that a substantial proportion of the ‘Other’ category are in fact FEI microscopes that have been classified incorrectly in the EMDB deposition process. There are entries where more than one microscope has been specified; however, the number of such entries is quite small and less than a handful involve the use of microscopes from different manufacturers.

In order to study the geographic reach of cryo-EM, I have exploited author affiliation information from PubMed based on the primary citations of EMDB entries as described in §2 and the results are presented in Tables 3 ▸, 4 ▸ and 5 ▸. Table 3 ▸ shows the geographic distribution of publications associated with new EMDB entries since 2010. One clear trend is that the geographic spread has widened substantially: in 2010 there were five countries with five or more publications, whereas in 2016 there were 15. China has experienced a 13-fold increase to 43 publications in this time period, is the fourth highest producer of EM publications and has supplanted Japan as the leading Asian nation. The US has consistently been the leading nation over the analysed period, followed by Germany and the UK. The trends suggest that more recent investments in high-end EM infrastructure and people by Sweden, Australia, Singapore, Austria and the Czech Republic are starting to produce results. A similar analysis for publications associated with EMDB entries at better than 4 Å resolution is presented in Table 4 ▸. The general trends are similar: the geographic spread of the technique is widening, the US has a leading position followed by the UK, Germany and China, and high-resolution cryo-EM is growing rapidly in China. Another notable trend is that the number of publications associated with the UK is substantially greater than Germany (48 compared with 34). An institute-based analysis of publications associated with EMDB entries at better than 4 Å resolution is presented in Table 5 ▸. The two leading institutes in terms of numbers have been the MRC Laboratory of Molecular Biology (MRC–LMB) and Tsinghua University. While MRC–LMB has clearly been dominating the scene, the numbers for 2015 and 2016 are fairly similar (14 and 15 publications, respectively) and the table suggests that a substantial proportion of the growth is owing to the rapidly widening usage of high-resolution EM with the proliferation of high-end EM instrumentation and accessibility to the wider structural biology community via centres such as eBIC at the Diamond Light Source (Saibil et al., 2015 ▸).
Table 3

Publications associated with new EMDB entries by country

Only countries with five or more publications over the period 2010–2016 are included in the table.

Country2010201120122013201420152016Sum
USA555086116139135172753
Germany19252022425359240
UK14102716425169229
China34310132943105
France846816162785
Japan674611112065
Spain47117871155
Canada153210141954
Switzerland22886151253
The Netherlands01139151645
Finland316615325
Sweden202028923
Australia1020531122
Singapore003325619
Austria012241717
Czech Republic000021710
Israel00031228
Russia00001247
Belgium01003026
Denmark11100216
New Zealand20000136
India00100135
Republic of Korea01200115
Taiwan00011125
Table 4

Publications associated with EMDB entries at better than 4 Å resolution by country

Only countries with three or more publications over the period 2014–2016 are included in the table.

Country201420152016Sum
USA8335293
UK8182248
Germany6101834
China281626
Switzerland15410
France0538
Japan0167
The Netherlands1135
Sweden0224
Canada1023
Czech Republic0123
Spain0123
Table 5

Institutes of authors of publications associated with EMDB entries at better than 4 Å resolution by country

Only institutes with four or more publications over the period 2014–2016 are included.

Institution201420152016Sum
MRC Laboratory of Molecular Biology8141537
Tsinghua University071118
University of California San Francisco05611
University of California Los Angeles07310
Harvard University/Medical School2439
Janelia Research Campus0459
University of Munich2349
Chinese Academy of Sciences1348
National Cancer Institute1258
Max Planck Institute for Biophysical Chemistry0167
Purdue University1157
University of California Berkeley1157
University of Texas3317
University of Virginia2327
Columbia University0246
ETH1326
Max Planck Institute for Biochemistry0066
Rockefeller University0066
University of Washington0336
The Scripps Research Institute0145
University of Oxford0325
Birkbeck College0044
European Molecular Biology Laboratory1124
Lawrence Berkeley National Laboratory1214

Discussion

The trends in the EMDB underscore the fast-paced changes currently taking place in the cryo-EM field driven by game-changing technologies such as the direct electron detector. Headline high-resolution structures in the past few years have demonstrated the potential of the technique in a wider structural context and have prompted widespread biomedical interest and even adoption by the pharmaceutical industry [for example, the Cambridge Pharmaceutical Cryo-EM Research Consortium (including Astex Pharmaceuticals, AstraZeneca, GlaxoSmithKline, Heptares Therapeutics and UCB, MRC–LMB and the University of Cambridge’s Nanoscience Centre), Novartis, Genentech and Pfizer]. Major investments are under way to set up and expand cryo-EM facilities worldwide, which are likely to substantially increase the available capacity to produce cryo-EM structures. In fact, EMDB trends show that while the US, Germany and the UK continue to maintain leadership in the field, cryo-EM activity has risen rapidly in China to the point where they rank fourth in the number of structures being produced, and that in general a democratization has been taking place, with a substantially broadened base of countries and institutes involved (Stuart et al., 2016 ▸). Furthermore, the rising number of ET structures underlines the growing importance associated with understanding biomacromolecular structure and function in the cellular context. The extrapolated value of 10 000 structures by 2020, which essentially amounts to a doubling of the number of structures being deposited yearly, therefore seems a credible prospect. The technology trends suggest an increasing consolidation towards the use of particular solutions with a proven track record. The most common mode is the use of FEI microscopes, in particular the Titan Krios, for high-resolution cryo-EM, RELION for image processing and the Gatan K2 camera for high-resolution work. However, other technological solutions have also been shown to give comparable results, for example a 3.4 Å resolution alcohol oxidase structure obtained using a Jeol 3200FSC microscope (Vonck et al., 2016 ▸), a 1.8 Å resolution glutamate dehydrogenase structure obtained using the FREALIGN software package (Merk et al., 2016 ▸) and a 2.5 Å resolution Trypanosoma cruzi 60S ribosomal subunit structure (Liu et al., 2016 ▸) obtained using a Falcon II camera. Furthermore, competitors are trying to redress the imbalance with new and improved solutions; for instance, Jeol with a new automatic specimen-loading system, a redesigned FREALIGN with a graphical user interface (cisTEM; personal communication with Timothy Grant, Janelia Research Campus), and the FEI Falcon III camera, which will even be retrofitted onto the most recently sold FEI microscope systems with a Falcon II camera. Finally, there are a number of nascent technological developments such as phase plates (Danev & Baumeister, 2016 ▸) and specimen-preparation robots [for example Spotiton (Razinkov et al., 2016 ▸) from the Carragher laboratory at NYSBC and Vitrojet from the Peters laboratory at Maastricht University] that may reach a level of maturity in the next few years to significantly impact the cryo-EM field and depositions to EMDB. It may therefore be useful to repeat the analysis of EMDB trends on an annual or biannual basis to factor in such developments.
  23 in total

1.  FREALIGN: high-resolution refinement of single particle structures.

Authors:  Nikolaus Grigorieff
Journal:  J Struct Biol       Date:  2006-06-02       Impact factor: 2.867

2.  SPIDER image processing for single-particle reconstruction of biological macromolecules from electron micrographs.

Authors:  Tanvir R Shaikh; Haixiao Gao; William T Baxter; Francisco J Asturias; Nicolas Boisset; Ardean Leith; Joachim Frank
Journal:  Nat Protoc       Date:  2008       Impact factor: 13.491

3.  Macromolecular organization of ATP synthase and complex I in whole mitochondria.

Authors:  Karen M Davies; Mike Strauss; Bertram Daum; Jan H Kief; Heinz D Osiewacz; Adriana Rycovska; Volker Zickermann; Werner Kühlbrandt
Journal:  Proc Natl Acad Sci U S A       Date:  2011-08-11       Impact factor: 11.205

4.  The structure and flexibility of conical HIV-1 capsids determined within intact virions.

Authors:  Simone Mattei; Bärbel Glass; Wim J H Hagen; Hans-Georg Kräusslich; John A G Briggs
Journal:  Science       Date:  2016-12-16       Impact factor: 47.728

5.  The democratization of cryo-EM.

Authors:  David I Stuart; Sriram Subramaniam; Nicola G A Abrescia
Journal:  Nat Methods       Date:  2016-07-28       Impact factor: 28.547

Review 6.  How cryo-EM is revolutionizing structural biology.

Authors:  Xiao-chen Bai; Greg McMullan; Sjors H W Scheres
Journal:  Trends Biochem Sci       Date:  2014-11-07       Impact factor: 13.807

7.  RELION: implementation of a Bayesian approach to cryo-EM structure determination.

Authors:  Sjors H W Scheres
Journal:  J Struct Biol       Date:  2012-09-19       Impact factor: 2.867

8.  A 3D cellular context for the macromolecular world.

Authors:  Ardan Patwardhan; Alun Ashton; Robert Brandt; Sarah Butcher; Raffaella Carzaniga; Wah Chiu; Lucy Collinson; Pascal Doux; Elizabeth Duke; Mark H Ellisman; Erik Franken; Kay Grünewald; Jean-Karim Heriche; Abraham Koster; Werner Kühlbrandt; Ingvar Lagerstedt; Carolyn Larabell; Catherine L Lawson; Helen R Saibil; Eduardo Sanz-García; Sriram Subramaniam; Paul Verkade; Jason R Swedlow; Gerard J Kleywegt
Journal:  Nat Struct Mol Biol       Date:  2014-10       Impact factor: 15.369

9.  Cryo-EM single particle analysis with the Volta phase plate.

Authors:  Radostin Danev; Wolfgang Baumeister
Journal:  Elife       Date:  2016-03-07       Impact factor: 8.140

10.  Structure of Alcohol Oxidase from Pichia pastoris by Cryo-Electron Microscopy.

Authors:  Janet Vonck; David N Parcej; Deryck J Mills
Journal:  PLoS One       Date:  2016-07-26       Impact factor: 3.240

View more
  31 in total

Review 1.  New approaches towards the understanding of integral membrane proteins: A structural perspective on G protein-coupled receptors.

Authors:  Reinhard Grisshammer
Journal:  Protein Sci       Date:  2017-06-07       Impact factor: 6.725

Review 2.  Dawning of a new era in TRP channel structural biology by cryo-electron microscopy.

Authors:  M Gregor Madej; Christine M Ziegler
Journal:  Pflugers Arch       Date:  2018-01-17       Impact factor: 3.657

3.  Structure of spliceosomal ribonucleoproteins.

Authors:  Daniel Aaron Pomeranz Krummel; Kiyoshi Nagai; Chris Oubridge
Journal:  F1000 Biol Rep       Date:  2010-05-24

4.  EMDB Web Resources.

Authors:  Sanja Abbott; Andrii Iudin; Paul K Korir; Sriram Somasundharam; Ardan Patwardhan
Journal:  Curr Protoc Bioinformatics       Date:  2018-04-09

Review 5.  Big data in cryoEM: automated collection, processing and accessibility of EM data.

Authors:  Philip R Baldwin; Yong Zi Tan; Edward T Eng; William J Rice; Alex J Noble; Carl J Negro; Michael A Cianfrocco; Clinton S Potter; Bridget Carragher
Journal:  Curr Opin Microbiol       Date:  2017-10-31       Impact factor: 7.934

6.  A New Protocol for Atomic-Level Protein Structure Modeling and Refinement Using Low-to-Medium Resolution Cryo-EM Density Maps.

Authors:  Biao Zhang; Xi Zhang; Robin Pearce; Hong-Bin Shen; Yang Zhang
Journal:  J Mol Biol       Date:  2020-08-06       Impact factor: 5.469

7.  Unifying Biological Image Formats with HDF5.

Authors:  Matthew T Dougherty; Michael J Folk; Erez Zadok; Herbert J Bernstein; Frances C Bernstein; Kevin W Eliceiri; Werner Benger; Christoph Best
Journal:  Commun ACM       Date:  2009-10-01       Impact factor: 4.654

8.  MAINMASTseg: Automated Map Segmentation Method for Cryo-EM Density Maps with Symmetry.

Authors:  Genki Terashi; Yuki Kagaya; Daisuke Kihara
Journal:  J Chem Inf Model       Date:  2020-03-30       Impact factor: 6.162

9.  The European Bioinformatics Institute in 2017: data coordination and integration.

Authors:  Charles E Cook; Mary T Bergman; Guy Cochrane; Rolf Apweiler; Ewan Birney
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

10.  Recent developments in the CCP-EM software suite.

Authors:  Tom Burnley; Colin M Palmer; Martyn Winn
Journal:  Acta Crystallogr D Struct Biol       Date:  2017-05-31       Impact factor: 7.652

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.