| Literature DB >> 34866606 |
Brinda Vallat1, Benjamin Webb2, Maryam Fayazi1, Serban Voinea3, Hongsuda Tangmunarunkit3, Sai J Ganesan2, Catherine L Lawson1, John D Westbrook1, Carl Kesselman1, Andrej Sali2, Helen M Berman4.
Abstract
Structures of many complex biological assemblies are increasingly determined using integrative approaches, in which data from multiple experimental methods are combined. A standalone system, called PDB-Dev, has been developed for archiving integrative structures and making them publicly available. Here, the data standards and software tools that support PDB-Dev are described along with the new and updated components of the PDB-Dev data-collection, processing and archiving infrastructure. Following the FAIR (Findable, Accessible, Interoperable and Reusable) principles, PDB-Dev ensures that the results of integrative structure determinations are freely accessible to everyone. open access.Entities:
Keywords: IHM-dictionary; PDB-Dev; PDBx/mmCIF; data standards; integrative modeling
Mesh:
Substances:
Year: 2021 PMID: 34866606 PMCID: PMC8647179 DOI: 10.1107/S2059798321010871
Source DB: PubMed Journal: Acta Crystallogr D Struct Biol ISSN: 2059-7983 Impact factor: 7.652
Figure 1PDB-Dev archiving system. The different components of the PDB-Dev data pipeline include methods for data collection, processing, archiving and distribution.
Figure 2Steps involved in the PDB-Dev data-harvesting workflow. Gray boxes represent steps that are managed by the depositor/curator and white boxes represent steps carried out by software agents in the data-harvesting system. Backward arrows represent steps that allow the depositor/curator to revisit the previous step and fix any errors that are detected.
Figure 3The PDB-Dev data-harvesting system consists of (a) a relational data catalog (ERMRest) created based on the PDBx/mmCIF and IHM-dictionary data standards, (b) an object-storage system (Hatrac) for user-uploaded and system-generated files, (c) a data model-driven web interface (Chaise) that automatically adapts its interface based on the underlying data model and (d) an automated pipeline to support the data-harvesting workflow. Both data depositors and curators can interact with the system using the web interface.
Figure 4PDB-Dev data processing and archiving. Steps that have been already implemented are shown in gray boxes and steps that are under development are shown in checked boxes.
Figure 5Statistics of current structures in PDB-Dev based on (a) the experimental data type used to obtain input spatial restraints and (b) the software used for modeling. The data include 66 released entries as well as 21 entries that have been processed and are kept on hold for publication as of 30 June 2021.
Figure 6Examples of integrative structures archived in PDB-Dev. (a) Multi-scale structure of the nuclear pore complex from yeast (Kim et al., 2018 ▸), (b) atomic structure of Vaccinia virus DNA polymerase catalytic subunit E9 complexed with the C-terminal region of the processivity factor component A20 (Bersch et al., 2021 ▸), (c) one of the two conformational states obtained from multi-state modeling of the HCN voltage-gated ion channel (Dai et al., 2019 ▸) and (d) one of the four ordered states of the human complement C3(H2O) from steps in a reaction pathway (Chen et al., 2016 ▸).