| Literature DB >> 31278630 |
Brinda Vallat1, Benjamin Webb2, John Westbrook1,3, Andrej Sali4,5,6,7, Helen M Berman8,9.
Abstract
Limitations in the applicability, accuracy, and precision of individual structure characterization methods can sometimes be overcome via an integrative modeling approach that relies on information from all available sources, including all available experimental data and prior models. The open-source Integrative Modeling Platform (IMP) is one piece of software that implements all computational aspects of integrative modeling. To maximize the impact of integrative structures, the coordinates should be made publicly available, as is already the case for structures based on X-ray crystallography, NMR spectroscopy, and electron microscopy. Moreover, the associated experimental data and modeling protocols should also be archived, such that the original results can easily be reproduced. Finally, it is essential that the integrative structures are validated as part of their publication and deposition. A number of research groups have already developed software to implement integrative modeling and have generated a number of structures, prompting the formation of an Integrative/Hybrid Methods Task Force. Following the recommendations of this task force, the existing PDBx/mmCIF data representation used for atomic PDB structures has been extended to address the requirements for archiving integrative structural models. This IHM-dictionary adds a flexible model representation, including coarse graining, models in multiple states and/or related by time or other order, and multiple input experimental information sources. A prototype archiving system called PDB-Dev ( https://pdb-dev.wwpdb.org ) has also been created to archive integrative structural models, together with a Python library to facilitate handling of integrative models in PDBx/mmCIF format.Entities:
Keywords: Deposition; Hybrid modeling; Integrative modeling; Model validation; PDB; mmCIF dictionary
Mesh:
Substances:
Year: 2019 PMID: 31278630 PMCID: PMC6692293 DOI: 10.1007/s10858-019-00264-2
Source DB: PubMed Journal: J Biomol NMR ISSN: 0925-2738 Impact factor: 2.835
Fig. 1The four-step modeling workflow as implemented in the Integrative Modeling Platform. The workflow is illustrated by its application to structure determination of the Nup84 heptamer (Shi et al. 2014). In this application, crystallographic structures and comparative models are used to represent the seven components of the Nup84 complex. The scoring function incorporates data extracted from CX-MS experiments and 2DEM class average images. The sampling explores both the conformations of the components and their configuration, searching for those assembly structures that satisfy the spatial restraints as accurately as possible. In this case, the result is an ensemble of many good-scoring models that satisfy the input data within acceptable thresholds. The sampling is then assessed for convergence, models are clustered, and evaluated by the degree to which they satisfy the data used to construct them as well as omitted data. The protocol can iterate through the four stages, until the models are judged to be satisfactory, most often based on their precision and the degree to which they satisfy the data. The resulting models are deposited in PDB-Dev (Burley et al. 2017; Vallat et al. 2018) with accession number PDBDEV_ 00000001
Fig. 2Number of multi-method structures archived in the PDB over the years (data as of December 6, 2018)
Combination of methods used to determine multi-method structures currently archived in the PDB and the number of PDB entries with these method combinations (data as of December 6, 2018)
| Existing experimental method combinations | Entries released in PDB |
|---|---|
| X-ray crystallography + solution NMR | 1 |
| X-ray crystallography + neutron diffraction | 81 |
| X-ray crystallography + solution scattering | 2 |
| X-ray crystallography + EPR | 7 |
| Solution NMR + solid-state NMR | 4 |
| Solution NMR + EM | 1 |
| Solution NMR + solid-state NMR + EM | 1 |
| Solution NMR + neutron diffraction | 1 |
| Solution NMR + solution scattering | 17 |
| Solution NMR + EPR | 1 |
| Solution NMR + theoretical model | 7 |
| EM + solid-state NMR | 6 |
| EM + solution scattering | 2 |
| EM + solution scattering + solid-state NMR | 1 |
| Fiber diffraction + solid-state NMR | 1 |
Fig. 3A snapshot of integrative structural models deposited in PDB-Dev. a Nup84 sub-complex (PDBDEV_ 00000001 (Shi et al. 2014)), b Nup133 sub-complex (PDBDEV_ 00000016 (Kim et al. 2014)), c Nup82 sub-complex (PDBDEV_ 00000020 (Fernandez-Martinez et al. 2016)), d Pom152 sub-complex (PDBDEV_ 00000017 (Upla et al. 2017)), e, f, g Nuclear pore complex 1-spoke, 3-spokes & 8-spokes (PDBDEV_ 00000010, PDBDEV_ 00000011, PDBDEV_ 00000012 (Kim et al. 2018)), h Mediator complex (PDBDEV_ 00000003 (Robinson et al. 2015)), i Exosome complex (PDBDEV_ 00000002 (Shi et al. 2015)), j 16 s RNA—Methyl transferase A complex (PDBDEV_ 00000014 (van Zundert et al. 2015)), k Human complement system C3(H2O) (PDBDEV_ 00000021 (Chen et al. 2016)), l Fruit fly chromosome 2L segment (PDBDEV_ 00000008 (Trussart et al. 2015)), m Ecm29 protein with 26S proteasome complex (PDBDEV_00000026 (Wang et al. 2017a)), n Pol II(G) complex (PDBDEV_00000025 (Jishage et al. 2018)), o Mitochondrial cysteine desulfurase complex (PDBDEV_ 00000015 (Cai et al. 2018)), p Diubiquitin (PDBDEV_ 00000004 (Liu et al. 2018)), q, r, s Human serum albumin domains A, B & C (PDBDEV_ 00000005, PDBDEV_ 00000006, PDBDEV_ 00000007 (Belsom et al. 2016)), t Human Rev7 dimer (PDBDEV_ 00000009 (Rizzo et al. 2018)), u E6AP-E6-p53 enzyme–substrate complex (PDBDEV_00000023 (Sailer et al. 2018))
Fig. 4Statistics of current structures in PDB-Dev (including structures released and structures on-hold for publication as of December 6, 2018). a Plot of number of entries in PDB-Dev as a function of the type of input experimental restraints. b Plot of number of entries in PDB-Dev as a function of the integrative modeling software application used