| Literature DB >> 26615193 |
Thomas Hrabe1, Zhanwen Li1, Mayya Sedova1, Piotr Rotkiewicz1, Lukasz Jaroszewski1, Adam Godzik2.
Abstract
The PDBFlex database, available freely and with no login requirements at http://pdbflex.org, provides information on flexibility of protein structures as revealed by the analysis of variations between depositions of different structural models of the same protein in the Protein Data Bank (PDB). PDBFlex collects information on all instances of such depositions, identifying them by a 95% sequence identity threshold, performs analysis of their structural differences and clusters them according to their structural similarities for easy analysis. The PDBFlex contains tools and viewers enabling in-depth examination of structural variability including: 2D-scaling visualization of RMSD distances between structures of the same protein, graphs of average local RMSD in the aligned structures of protein chains, graphical presentation of differences in secondary structure and observed structural disorder (unresolved residues), difference distance maps between all sets of coordinates and 3D views of individual structures and simulated transitions between different conformations, the latter displayed using JSMol visualization software.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26615193 PMCID: PMC4702920 DOI: 10.1093/nar/gkv1316
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Cluster overview PDBFlex page provides information and several visualization options for each cluster of PDB chains with identical (within the threshold) sequences. The user can access this page either by providing the PDB id or a sequence of a protein to be analyzed (for instance 2hphA) or by browsing the list all available clusters (this list can be sorted either by Cα RMSD or by cluster size). The middle panel shows the cluster overview page for cluster of conformations representing the structure of the E. colid-galactose-binding periplasmic protein, an example that would be used throughout the text. The top row shows the two most diverse structures in the cluster (left and right JSmol panels) and an animation of the transition between them (central JSmol panel). Displayed below are basic numerical data about the cluster. The Distance Difference Map illustrates the details of the structural rearrangement between the two most diverse conformations and the local flexibility preview presents local Cα RMSD variation along the sequences of the cluster members.
Figure 3.PDBFlex statistics. Approximately 47% of all clusters have 5 or less members and ∼78% of all clusters have 10 or less PDBs. 25% of out of all clusters show very low variability with a maximal Cα RSMD of less than 0.5Å, while ∼15% of all clusters have maximum Cα RSMD above 3Å.
Figure 2.Detailed information about a structural diversity in a cluster representing a single protein. Left panel: Local flexibility view visualization based on master-slave, multiple-sequence-alignment of all cluster members. The top row in the view shows the secondary structure of the master sequence. Rows below correspond to aligned sequences of cluster members. The green curve below the master sequence shows average local Cα RMSD in the cluster (peaks in the curve correspond to regions of high local structural diversity in the cluster). Right panel: Global flexibility view based on Cα RMSD values calculated from all-against-all alignment of cluster members. Large, colored circles in the plot indicate centers of sub-clusters with their respective structures shown below. The user can search for and select any PDB structure on the plot, highlight structures with specific ligands or select a few structures for further visual inspection.