| Literature DB >> 26582925 |
Rita Pancsa1, Mihaly Varadi2, Peter Tompa3, Wim F Vranken4.
Abstract
Proteins fulfil a wide range of tasks in cells; understanding how they fold into complex three-dimensional (3D) structures and how these structures remain stable while retaining sufficient dynamics for functionality is essential for the interpretation of overall protein behaviour. Since the 1950's, solvent exchange-based methods have been the most powerful experimental means to obtain information on the folding and stability of proteins. Considerable expertise and care were required to obtain the resulting datasets, which, despite their importance and intrinsic value, have never been collected, curated and classified. Start2Fold is an openly accessible database (http://start2fold.eu) of carefully curated hydrogen/deuterium exchange (HDX) data extracted from the literature that is open for new submissions from the community. The database entries contain (i) information on the proteins investigated and the underlying experimental procedures and (ii) the classification of the residues based on their exchange protection levels, also allowing for the instant visualization of the relevant residue groups on the 3D structures of the corresponding proteins. By providing a clear hierarchical framework for the easy sharing, comparison and (re-)interpretation of HDX data, Start2Fold intends to promote a better understanding of how the protein sequence encodes folding and structure as well as the development of new computational methods predicting protein folding and stability.Entities:
Mesh:
Year: 2015 PMID: 26582925 PMCID: PMC4702845 DOI: 10.1093/nar/gkv1185
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Database structure: Start2Fold is organized hierarchically into four levels: the entries (level 1) are on the top; each entry might have multiple associated protein chains (level 2); which in turn might have several experimental residue sets (level 3), with information on their associated residues (level 4). Each level records relevant information that is displayed on the accession screens or in the XML files of the entries.
Figure 2.The accession screen: every piece of information that is stored in Start2Fold is displayed on the accession screens. The top section of these screens hosts an integrated JSmol applet for interactive visualization (panel A). General information of the protein chain(s) and relevant information on the experimental sets are displayed below (panel B). The complete entry can be downloaded in XML format, while the sequences can be retrieved in FASTA format and the residue/segment-specific information in the form of a list file by clicking on the respective links.