| Literature DB >> 32195305 |
Akhila Melarkode Vattekatte1,2,3,4, Tarun Jairaj Narwani1,2,4, Aline Floch2,5,6,7, Mirjana Maljković8, Soubika Bisoo1,2,4, Nicolas K Shinada1,2,4,9,10, Agata Kranjc1,2,4, Jean-Christophe Gelly1,2,4,11, Narayanaswamy Srinivasan12, Nenad Mitić8, Alexandre G de Brevern1,2,4,11.
Abstract
Intrinsic Disorder Proteins (IDPs) have become a hot topic since their characterisation in the 90s. The data presented in this article are related to our research entitled "A structural entropy index to analyse local conformations in Intrinsically Disordered Proteins" published in Journal of Structural Biology [1]. In this study, we quantified, for the first time, continuum from rigidity to flexibility and finally disorder. Non-disordered regions were also highlighted in the ensemble of disordered proteins. This work was done using the Protein Ensemble Database (PED), which is a useful database collecting series of protein structures considered as IDPs. The data set consists of a collection of cleaned protein files in classical pdb format that can be readily used as an input with most automatic analysis software. The accompanying data include the coding of all structural information in terms of a structural alphabet, namely Protein Blocks (PBs). An entropy index derived from PBs that allows apprehending the continuum between protein rigidity to flexibility to disorder is included, with information from secondary structure assignment, protein accessibility and prediction of disorder from the sequences. The data may be used for further structural bioinformatics studies of IDPs. It can also be used as a benchmark for evaluating disorder prediction methods.Entities:
Keywords: Ensembles; Entropy; Local protein conformation; PDB; Protein disorder; Structural alphabet
Year: 2020 PMID: 32195305 PMCID: PMC7078294 DOI: 10.1016/j.dib.2020.105383
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Specification Table
| Subject | Biochemistry. |
| Specific subject area | Structural Bioinformatics, proteins disorder. |
| Type of data | A collection of atom coordinates in the pdb format, tables, text files and Figures. |
| How data were acquired | A survey of the Protein Ensemble Database (PED). |
| Data format | Raw, analysed and filtered |
| Parameters for data collection | A Protein Ensemble Database survey was performed in march 2019. The data set consists of PED stores 25,473 protein structures of 60 ensembles in 24 entries in the Protein Data Bank (pdb) format. The atom coordinate files were cleaned and treated as described below and as such may be used for further automatic analysis. |
| Description of data collection | Every entry of PED was analysed, i.e. some have inconsistencies. Then, all cleaned files were used for Protein Blocks (PBs) assignment, the frequency of each PBs was calculated, and local entropy was computed. All files are provided. In a similar way, DSSP was used to assign secondary structure and solvent accessibility for each residue. The dataset also collects disorder prediction generated from DisoPred and PrDOS webserver. Flat text files are provided for simple use and some Figures for better visualisation. |
| Data source location | University of Paris, Paris, France. |
| Data accessibility | Data is given in the paper. |
| Related research article | Akhila Melarkode Vattekatte, Tarun Jairaj Narwani, Aline Floch, Mirjana Maljković, Soubika Bisoo, Nicolas K. Shinada, Agata Kranjc, Jean-Christophe Gelly, Narayanaswamy Srinivasan, Nenad Mitić & Alexandre G. de Brevern, (2020) “A structural entropy index to analyse local conformations in Intrinsically Disordered Proteins”, |