| Literature DB >> 33305318 |
Tamas Lazar1,2, Elizabeth Martínez-Pérez3,4, Federica Quaglia5, András Hatos5, Lucía B Chemes6, Javier A Iserte3, Nicolás A Méndez6, Nicolás A Garrone6, Tadeo E Saldaño7, Julia Marchetti7, Ana Julia Velez Rueda7, Pau Bernadó8, Martin Blackledge9, Tiago N Cordeiro8,10, Eric Fagerberg11, Julie D Forman-Kay12,13, Maria S Fornasari7, Toby J Gibson4, Gregory-Neal W Gomes14,15, Claudiu C Gradinaru14,15, Teresa Head-Gordon16, Malene Ringkjøbing Jensen9, Edward A Lemke17,18, Sonia Longhi19, Cristina Marino-Buslje3, Giovanni Minervini5, Tanja Mittag20, Alexander Miguel Monzon5, Rohit V Pappu21, Gustavo Parisi7, Sylvie Ricard-Blum22, Kiersten M Ruff21, Edoardo Salladini19, Marie Skepö11,23, Dmitri Svergun24, Sylvain D Vallet22, Mihaly Varadi25, Peter Tompa1,2,26, Silvio C E Tosatto5, Damiano Piovesan5.
Abstract
The Protein Ensemble Database (PED) (https://proteinensemble.org), which holds structural ensembles of intrinsically disordered proteins (IDPs), has been significantly updated and upgraded since its last release in 2016. The new version, PED 4.0, has been completely redesigned and reimplemented with cutting-edge technology and now holds about six times more data (162 versus 24 entries and 242 versus 60 structural ensembles) and a broader representation of state of the art ensemble generation methods than the previous version. The database has a completely renewed graphical interface with an interactive feature viewer for region-based annotations, and provides a series of descriptors of the qualitative and quantitative properties of the ensembles. High quality of the data is guaranteed by a new submission process, which combines both automatic and manual evaluation steps. A team of biocurators integrate structured metadata describing the ensemble generation methodology, experimental constraints and conditions. A new search engine allows the user to build advanced queries and search all entry fields including cross-references to IDP-related resources such as DisProt, MobiDB, BMRB and SASBDB. We expect that the renewed PED will be useful for researchers interested in the atomic-level understanding of IDP function, and promote the rational, structure-based design of IDP-targeting drugs.Entities:
Year: 2021 PMID: 33305318 PMCID: PMC7778965 DOI: 10.1093/nar/gkaa1021
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.PED 4.0 entry statistics. Stacked histogram of models per ensemble for different measurement methods in PED 4.0, binned based on the number of the consisting conformer models.
Figure 2.Chain compactness of PED 4.0 entries. Radius of gyration of protein chains plotted against their chain length. Each dot represents a given chain in a given ensemble. The reference curves (54) represent values specific for folded proteins (purple), random coils (blue), denatured proteins (green) and fully extended chains (red). Four long folded proteins (PED00007, PED00010, PED00014 and PED00162) with over 300 residues are omitted, but fit well to the purple trend line.
Figure 3.Example for PED’s Protein page. Protein page P04637 summarizes the human p53 ensembles currently stored in PED for both the N-terminal and C-terminal disordered region. The feature viewer also integrates intrinsic disorder evidence from DisProt.
Figure 4.Example for PED’s Entry page. Entry page is shown for the C-terminal disordered region of p53 in a tetrameric complex with Ca2+-bound S100B (PED00063). The feature viewer shows chain-specific information, while molecular graphics, Ramachandran maps and Rg distribution are presented below.