| Literature DB >> 27899601 |
Damiano Piovesan1, Francesco Tabaro1,2, Ivan Mičetić1, Marco Necci1, Federica Quaglia1, Christopher J Oldfield3, Maria Cristina Aspromonte4, Norman E Davey5,6, Radoslav Davidović7, Zsuzsanna Dosztányi8,9, Arne Elofsson10, Alessandra Gasparini4, András Hatos1,9, Andrey V Kajava11,12,13, Lajos Kalmar9,14, Emanuela Leonardi4, Tamas Lazar15,16, Sandra Macedo-Ribeiro17, Mauricio Macossay-Castillo15,16, Attila Meszaros9, Giovanni Minervini1, Nikoletta Murvai9, Jordi Pujols18, Daniel B Roche11,12, Edoardo Salladini19, Eva Schad9, Antoine Schramm19, Beata Szabo9, Agnes Tantos9, Fiorella Tonello1,20, Konstantinos D Tsirigos10, Nevena Veljković7, Salvador Ventura18, Wim Vranken15,16,21, Per Warholm10, Vladimir N Uversky22,23, A Keith Dunker3, Sonia Longhi24, Peter Tompa25,15,16, Silvio C E Tosatto26,20.
Abstract
The Database of Protein Disorder (DisProt, URL: www.disprot.org) has been significantly updated and upgraded since its last major renewal in 2007. The current release holds information on more than 800 entries of IDPs/IDRs, i.e. intrinsically disordered proteins or regions that exist and function without a well-defined three-dimensional structure. We have re-curated previous entries to purge DisProt from conflicting cases, and also upgraded the functional classification scheme to reflect continuous advance in the field in the past 10 years or so. We define IDPs as proteins that are disordered along their entire sequence, i.e. entirely lack structural elements, and IDRs as regions that are at least five consecutive residues without well-defined structure. We base our assessment of disorder strictly on experimental evidence, such as X-ray crystallography and nuclear magnetic resonance (primary techniques) and a broad range of other experimental approaches (secondary techniques). Confident and ambiguous annotations are highlighted separately. DisProt 7.0 presents classified knowledge regarding the experimental characterization and functional annotations of IDPs/IDRs, and is intended to provide an invaluable resource for the research community for a better understanding structural disorder and for developing better computational tools for studying disordered proteins.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27899601 PMCID: PMC5210544 DOI: 10.1093/nar/gkw1056
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.DisProt sample entry, human p53 protein (DP00086). Several experiments have been carried out to characterize the human p53 protein. DisProt reports literature evidence for IDRs. In particular, 11 different IDR evidences (Region Evidences) have been collected from nine different papers by two different curators. Most of these are related to the N-terminus and come from different types of experiments (Disorder Region Details). Disorder regions and the number of DisProt evidences, separated into confident and ambiguous annotations, can be compared with structural information from the Pfam and MobiDB databases in the Disorder Overview. DisProt also provides function annotation of IDRs by reporting molecular function, transition and partner terms (Functional Annotation). A literature reference is provided for each annotated IDR, linked to the relevant PubMed entry.
DisProt annotation content
| Method/function | Proteins | Regions | Residues |
|---|---|---|---|
| 333 | 592 | 32 926 | |
| 326 | 683 | 20 742 | |
| 261 | 352 | 53 935 | |
| 75 | 95 | 13 961 | |
| 62 | 67 | 12 206 | |
| 53 | 69 | 7723 | |
| 34 | 34 | 6326 | |
| 237 | 273 | 41 833 | |
| 564 | 1505 | 151 498 | |
| 489 | 1199 | 106 670 | |
| 444 | 1108 | 119 665 |
Distribution of DisProt annotation based on experimental evidence (method) and disorder function (function). As each annotated disorder region corresponds to one piece of experimental evidence, multiple regions can map to the same sequence segment. If a protein is annotated multiple times with the same type of experiment it is counted once. The number of residues is the sum of region lengths.
Figure 2.Distribution of disorder segment lengths. Segment lengths are binned in groups of 10 residues, e.g. the column 10 showing lengths between 10 and 19 residues. The current DisProt release is distinguished by experimental technique (X-ray in green, NMR in blue and other methods in red). The previous DisProt release is shown in a single gray bar as it did not have the experimental technique in a machine-readable format.
Major functional categories of the MFUN ontology of DisProt
| MFUN code | Generic functional category | Functional category |
|---|---|---|
| MFUN_01 | Entropic chain | Flexible linker/spacer |
| Entropic bristle | ||
| Entropic clock | ||
| Entropic spring | ||
| Structural mortar | ||
| Self-transport through channel | ||
| MFUN_02 | Molecular recognition: assembler | Assembler |
| Localization (targeting) | ||
| Localization (tethering) | ||
| Prion (self-assembly, polymerization) | ||
| Liquid-liquid phase separation/demixing (self-assembly) | ||
| MFUN_03 | Molecular recognition: scavenger | Neutralization of toxic molecules |
| Metal binding/metal sponge | ||
| Water storage | ||
| MFUN_04 | Molecular recognition: effector | Inhibitor |
| Disassembler | ||
| Activator | ||
| cis-regulatory elements (inhibitory modules) | ||
| DNA bending | ||
| DNA unwinding | ||
| MFUN_05 | Molecular recognition: display site | Phosphorylation |
| Acetylation | ||
| Methylation | ||
| Glycosylation | ||
| Ubiquitination | ||
| Fatty acylation (myristolation and palmitoylation) | ||
| Limited proteolysis | ||
| MFUN_06 | Molecular recognition: chaperone | Protein detergent/solvate layer |
| Space filling | ||
| Entropic exclusion | ||
| Entropy transfer |
The functional schemes are an open hierarchy. One goal of sharing information with the community through DisProt is to refine our views of the functional modes of IDPs.