| Literature DB >> 27169728 |
Woonghee Lee1, Chad M Petit2, Gabriel Cornilescu3, Jaime L Stark3, John L Markley4.
Abstract
We introduce AUDANA (Automated Database-Assisted NOE Assignment), an algorithm for determining three-dimensional structures of proteins from NMR data that automates the assignment of 3D-NOE spectra, generates distance constraints, and conducts iterative high temperature molecular dynamics and simulated annealing. The protein sequence, chemical shift assignments, and NOE spectra are the only required inputs. Distance constraints generated automatically from ambiguously assigned NOE peaks are validated during the structure calculation against information from an enlarged version of the freely available PACSY database that incorporates information on protein structures deposited in the Protein Data Bank (PDB). This approach yields robust sets of distance constraints and 3D structures. We evaluated the performance of AUDANA with input data for 14 proteins ranging in size from 6 to 25 kDa that had 27-98 % sequence identity to proteins in the database. In all cases, the automatically calculated 3D structures passed stringent validation tests. Structures were determined with and without database support. In 9/14 cases, database support improved the agreement with manually determined structures in the PDB and in 11/14 cases, database support lowered the r.m.s.d. of the family of 20 structural models.Entities:
Keywords: 3D structure determination; Automated structure calculation; NOE assignment; PACSY database; PONDEROSA; Sequence-structure correlation
Mesh:
Substances:
Year: 2016 PMID: 27169728 PMCID: PMC4921114 DOI: 10.1007/s10858-016-0036-y
Source DB: PubMed Journal: J Biomol NMR ISSN: 0925-2738 Impact factor: 2.835
Fig. 1AUDANA is a new NOESY assignment algorithm for NMR based protein 3D structure determination that can be launched with a few clicks from either NMRFAM-SPARKY or Ponderosa Client. The AUDANA algorithm, which is carried out on the Ponderosa Server, employs the PACSY DB for bioinformatics, PDB files for 3D atom coordinates, TALOS-N for backbone angle constraints and order parameters from chemical shifts, and Xplor-NIH for simulated annealing. The results can be analyzed with the PyMOL and NMRFAM-SPARKY tools available as part of Ponderosa Analyzer
Fig. 2Illustration of how AUDANA extracts inter-proton contacts from structures of proteins with homologous sequences and compares them with possible distance constraints from the experimental 3D-NOE spectra. a A PDB model is chosen. b Hydrogens are added. c Interproton distances are calculated. d Distances less than 5.5 Å are tabulated. e Predicted NOE peaks are filtered against experimental data. f Those that match (blue) are retained; those that do not (red) are discarded
Fig. 3The four phases used by AUDANA in calculating NMR structure models and validating distance constraint sets. After completing each of phase I–III, constraints in the recycle bin are checked to determine if they are in agreement with the intermediate structure; if so, they are reincorporated into the constraint set but with zero endurance scores to make them susceptible to removal if they are violated in subsequent cycles. a In phase I, only distance constraints categorized as robust with high endurance scores are used for structure calculation. Highly violated constraints that score below zero after the score update go into the recycle bin. b In phase II, constraints in the intermediate category are included in the structure calculation. An initial filter based on the intermediate structure is applied to the constraints, and those that are strongly violated are removed. Constraints with endurance scores below zero after the update also go into the recycle bin. c In the phase III, constraints in the uncertain category are included; a more stringent filter against the intermediate structure is imposed. d Phase IV is explicit water refinement with the constraints from the last cycle. e Mapping of the robustness of constraints, color-coded by their endurance scores, onto the structure. a, b, c Dynamic hydrogen bond detection. During phase I, II and phase III, AUDANA detects potential hydrogen bonds from NOE cross peak patterns for secondary structures and generates idealized H-bond constraints for the calculation. After each calculation cycle, the H-bonds are reevaluated by measuring interatomic distances, and H-bond constraints that violate the structure are eliminated from use in the following cycle