Literature DB >> 29281006

PINE-SPARKY.2 for automated NMR-based protein structure research.

Woonghee Lee1, John L Markley1.   

Abstract

Summary: Nuclear magnetic resonance (NMR) spectroscopy, along with X-ray crystallography and cryoelectron microscopy, is one of the three major tools that enable the determination of atomic-level structural models of biological macromolecules. Of these, NMR has the unique ability to follow important processes in solution, including conformational changes, internal dynamics and protein-ligand interactions. As a means for facilitating the handling and analysis of spectra involved in these types of NMR studies, we have developed PINE-SPARKY.2, a software package that integrates and automates discrete tasks that previously required interaction with separate software packages. The graphical user interface of PINE-SPARKY.2 simplifies chemical shift assignment and verification, automated detection of secondary structural elements, predictions of flexibility and hydrophobic cores, and calculation of three-dimensional structural models. Availability and implementation: PINE-SPARKY.2 is available in the latest version of NMRFAM-SPARKY from the National Magnetic Resonance Facility at Madison (http://pine.nmrfam.wisc.edu/download_packages.html), the NMRbox Project (https://nmrbox.org) and to subscribers to the SBGrid (https://sbgrid.org). For a detailed description of the program, see http://www.nmrfam.wisc.edu/pine-sparky2.htm. Contact: whlee@nmrfam.wisc.edu or markley@nmrfam.wisc.edu. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29281006      PMCID: PMC5925765          DOI: 10.1093/bioinformatics/btx785

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Numerous different groups and institutions have developed the software packages in current use in the field of biomolecular NMR, and many utilize different nomenclatures, data input procedures, and even computer operating systems. These differences have impeded research progress, particularly by non-experts. The Integrative NMR package (Lee ) offered a partial solution by integrating NMRFAM-SPARKY (Lee ) with APES for peak picking (Shin ), PINE for automated assignment (Bahrami ), ARECA (Dashti ) for validation of peak assignments; TALOS-N for shift based torsion angle restraints (Shen and Bax, 2013), CS-Rosetta (Shen ), for structure determination from chemical shifts, AUDANA (Lee ) and PONDEROSA-C/S (Lee ) for automated structure determination from NOE spectra, and data visualization by NDP-PLOT and an enhanced mode of the PyMOL software package (The PyMOL Molecular Graphics System, Version 1.7.4 Schrödinger, LLC.). However, as part of this package, the original PINE-SPARKY (Lee ) was cumbersome. Users had to pick peaks from NMR spectra, generate a set of peak list files, open a web browser, visit the PINE web page and submit generated peak list files one-by-one for each experiment. To import and verify chemical shifts, the user had to wait for an email notice, download and unpack the compressed results, use the PINE2SPARKY converter to apply PINE probabilistic assignments to SPARKY projects, and use PINE-SPARKY extensions to create the actual assignment labels. Only after following these steps could the user carry out further analysis, such as validation of chemical shift referencing by LACS (Wang ), secondary structure determination by PECAN (Eghbalnia ), analysis of chemical shifts by reference to the PACSY database (Lee ) or 3 D structure determination by CS-Rosetta (Shen ). PINE-SPARKY.2, which comes as a plug-in to NMRFAM-SPARKY, integrates all of these tasks and provides, in addition, easy-to-use visual analysis tools based on probability theory. PINE-SPARKY.2 incorporates a new server and various programs written in CGI/Perl, PYTHON, and BASH scripts that integrate PINE, PECAN, LACS, PACSY, TALOS-N, and CS-ROSETTA. Typing the two-letter-code he calls up the user manual, which is also available on-line (http://www.nmrfam.wisc.edu/pine-sparky2.htm).

2 Implementation

PINE-SPARKY.2 can be launched from the automated assignment sub-menu of NMRFAM menu or by typing the two-letter-code ep in the NMRFAM-SPARKY. This opens a graphical user interface that makes all the features of NMRFAM-SPARKY simultaneously accessible. Users provide name and email information (Fig. 1A), which the plug-in uses to interact with the PINE web server. The PINE web server sends an email containing a URL where the results can be retrieved. Users can provide a sequence file directly to the plugin (Fig. 1B), otherwise the sequence in the Sequence Entry plug-in (two-letter-code sq), will be imported.
Fig. 1.

Graphical user interface (GUI) for PINE-SPARKY.2 (A)–(F) input; see text). (G) Completeness Counter. (H) Example of a structural model obtained from assigned chemical shifts by CS-Rosetta

Graphical user interface (GUI) for PINE-SPARKY.2 (A)–(F) input; see text). (G) Completeness Counter. (H) Example of a structural model obtained from assigned chemical shifts by CS-Rosetta The PINE-SPARKY.2 plug-in offers three options (Fig. 1C): (i) Use pre-assignment: This option is used to restrain already assigned resonances. (ii) Use selective labeling: This option allows specification of the amino acid types expected in a spectrum. (iii) Run CS-Rosetta with PINE outputs: This option executes 3 D structure calculations using the CS-Rosetta server (http://csrosetta.bmrb.wisc.edu/csrosetta hosted by BMRB). PINE-SPARKY.2 supports 19 different NMR experiments. The user specifies the NMR experiments with spectral data to be analyzed by clicking the Add button from the spectrum list (Fig. 1DE). Peaks need to be identified in the spectra to be analyzed, and this can be accomplished with the automated peak-picking program APES (two-letter-code ae). Then, an assignment job is submitted to the PINE web server (Supplementary Fig. S1) by clicking the Submit button. A unique Key identifier generated by the PINE-SPARKY importer (Fig. 1F) handles cross-talk between PINE-SPARKY.2 and the PINE web server. PINE-SPARKY.2 checks the status file from the URL associated with the Key, and the PINE web server updates the status of the PINE job in the status file. Predictions of secondary structures (PECAN), referencing errors (LACS), hydrophobicities (PACSY), torsion angles and flexibilities (TALOS-N), and 3 D structures (CS-ROSETTA) are executed sequentially by BASH and PYTHON scripts based on the chemical shifts with the highest probabilities given by PINE (described more fully in the manual). By clicking the Check button, the PINE-SPARKY importer retrieves the results. Replacement of previously downloaded results can be accomplished by clicking the Browse button before clicking on Check. The PINE-SPARKY importer automatically sets up visualization of the PINE results and incorporates them into the current project. It asks a series of interactive yes/no questions to determine whether the user wants to (i) download the results in the PINE sub-directory under working directory; (ii) visualize secondary structures determined by the PECAN algorithm (Supplementary Fig. S2A); (iii) visualize PINE probabilities for spin system assignments (Supplementary Fig. S2B); (iv) visualize chemical shift referencing analysis by the LACS algorithm (Supplementary Fig. S2C);(v) visualize hydrophobic core residues predicted from PACSY database (Supplementary Fig. S2D; Lee ); (vi) visualize RCI S2 (random coil index order parameter; Berjanskii and Wishart, 2005); and/or (vii) generate PINE probabilistic labels and accept the most probable ones with P > 0.5 (Supplementary Fig. S3). Then the Completeness Counter (two-letter-code cm) can be used to find unassigned resonances (Fig. 1G). As a test of PINE-SPARKY.2, we used data from three multidimensional NMR spectra (2 D 1 H, 15 N-HSQC, 3 D CBCA(CO)NH and HNCACB) from the small (110 amino acid residue) protein AeSCP-2 (BMRB Entry 16662) as inputs for automated assignment and fed the assignment results into CS-Rosetta for structure determination from chemical shifts alone. We compared the resulting structure (Fig. 1H) with that determined manually from NOE data (PDB ID 2KSH; Singarapu ). Following superposition, the pairwise backbone RMSD for the two structures was 1.21 Å and the all-heavy-atom RMSD was 2.14 Å (Supplementary Fig. S4; see the Supplementary Material for details). Click here for additional data file.
  15 in total

1.  Differences in the structure and dynamics of the apo- and palmitate-ligated forms of Aedes aegypti sterol carrier protein 2 (AeSCP-2).

Authors:  Kiran K Singarapu; James T Radek; Marco Tonelli; John L Markley; Que Lan
Journal:  J Biol Chem       Date:  2010-03-31       Impact factor: 5.157

2.  A simple method to predict protein flexibility using secondary chemical shifts.

Authors:  Mark V Berjanskii; David S Wishart
Journal:  J Am Chem Soc       Date:  2005-11-02       Impact factor: 15.419

3.  Consistent blind protein structure generation from NMR chemical shift data.

Authors:  Yang Shen; Oliver Lange; Frank Delaglio; Paolo Rossi; James M Aramini; Gaohua Liu; Alexander Eletsky; Yibing Wu; Kiran K Singarapu; Alexander Lemak; Alexandr Ignatchenko; Cheryl H Arrowsmith; Thomas Szyperski; Gaetano T Montelione; David Baker; Ad Bax
Journal:  Proc Natl Acad Sci U S A       Date:  2008-03-07       Impact factor: 11.205

Review 4.  Structural proteomics by NMR spectroscopy.

Authors:  Joon Shin; Woonghee Lee; Weontae Lee
Journal:  Expert Rev Proteomics       Date:  2008-08       Impact factor: 3.940

5.  Linear analysis of carbon-13 chemical shift differences and its application to the detection and correction of errors in referencing and spin system identifications.

Authors:  Liya Wang; Hamid R Eghbalnia; Arash Bahrami; John L Markley
Journal:  J Biomol NMR       Date:  2005-05       Impact factor: 2.835

6.  Probabilistic validation of protein NMR chemical shift assignments.

Authors:  Hesam Dashti; Marco Tonelli; Woonghee Lee; William M Westler; Gabriel Cornilescu; Eldon L Ulrich; John L Markley
Journal:  J Biomol NMR       Date:  2016-01-02       Impact factor: 2.835

7.  PACSY, a relational database management system for protein structure and chemical shift analysis.

Authors:  Woonghee Lee; Wookyung Yu; Suhkmann Kim; Iksoo Chang; Weontae Lee; John L Markley
Journal:  J Biomol NMR       Date:  2012-08-19       Impact factor: 2.835

8.  NMRFAM-SPARKY: enhanced software for biomolecular NMR spectroscopy.

Authors:  Woonghee Lee; Marco Tonelli; John L Markley
Journal:  Bioinformatics       Date:  2014-12-12       Impact factor: 6.937

9.  PINE-SPARKY: graphical interface for evaluating automated probabilistic peak assignments in protein NMR spectroscopy.

Authors:  Woonghee Lee; William M Westler; Arash Bahrami; Hamid R Eghbalnia; John L Markley
Journal:  Bioinformatics       Date:  2009-06-03       Impact factor: 6.937

10.  Integrative NMR for biomolecular research.

Authors:  Woonghee Lee; Gabriel Cornilescu; Hesam Dashti; Hamid R Eghbalnia; Marco Tonelli; William M Westler; Samuel E Butcher; Katherine A Henzler-Wildman; John L Markley
Journal:  J Biomol NMR       Date:  2016-03-29       Impact factor: 2.835

View more
  10 in total

Review 1.  Non-Uniform and Absolute Minimal Sampling for High-Throughput Multidimensional NMR Applications.

Authors:  Dawei Li; Alexandar L Hansen; Lei Bruschweiler-Li; Rafael Brüschweiler
Journal:  Chemistry       Date:  2018-06-19       Impact factor: 5.236

2.  Solution structure and dynamics of the mitochondrial-targeted GTPase-activating protein (GAP) VopE by an integrated NMR/SAXS approach.

Authors:  Kyle P Smith; Woonghee Lee; Marco Tonelli; Yeongjoon Lee; Samuel H Light; Gabriel Cornilescu; Srinivas Chakravarthy
Journal:  Protein Sci       Date:  2022-05       Impact factor: 6.993

3.  Backbone assignments and conformational dynamics in the S. typhimurium tryptophan synthase α-subunit from solution-state NMR.

Authors:  Varun V Sakhrani; Eduardo Hilario; Bethany G Caulkins; Mary E Hatcher-Skeers; Li Fan; Michael F Dunn; Leonard J Mueller
Journal:  J Biomol NMR       Date:  2020-05-15       Impact factor: 2.835

4.  Toho-1 β-lactamase: backbone chemical shift assignments and changes in dynamics upon binding with avibactam.

Authors:  Varun V Sakhrani; Rittik K Ghosh; Eduardo Hilario; Kevin L Weiss; Leighton Coates; Leonard J Mueller
Journal:  J Biomol NMR       Date:  2021-07-04       Impact factor: 2.582

5.  Solution structures of the Shewanella woodyi H-NOX protein in the presence and absence of soluble guanylyl cyclase stimulator IWP-051.

Authors:  Cheng-Yu Chen; Woonghee Lee; Paul A Renhowe; Joon Jung; William R Montfort
Journal:  Protein Sci       Date:  2020-12-10       Impact factor: 6.993

6.  Backbone and sidechain 1H, 15N and 13C resonance assignments of the free and RNA-bound tandem zinc finger domain of the tristetraprolin family member from Selaginella moellendorffii.

Authors:  Stephanie N Hicks; Ronald A Venters; Perry J Blackshear
Journal:  Biomol NMR Assign       Date:  2022-03-12       Impact factor: 0.731

7.  Solution structure of human myeloid-derived growth factor suggests a conserved function in the endoplasmic reticulum.

Authors:  Valeriu Bortnov; Marco Tonelli; Woonghee Lee; Ziqing Lin; Douglas S Annis; Omar N Demerdash; Alex Bateman; Julie C Mitchell; Ying Ge; John L Markley; Deane F Mosher
Journal:  Nat Commun       Date:  2019-12-09       Impact factor: 14.919

8.  Solution NMR Determination of the CDHR3 Rhinovirus-C Binding Domain, EC1.

Authors:  Woonghee Lee; Ronnie O Frederick; Marco Tonelli; Ann C Palmenberg
Journal:  Viruses       Date:  2021-01-22       Impact factor: 5.048

9.  POKY software tools encapsulating assignment strategies for solution and solid-state protein NMR data.

Authors:  Ira Manthey; Marco Tonelli; Lawrence Clos Ii; Mehdi Rahimi; John L Markley; Woonghee Lee
Journal:  J Struct Biol X       Date:  2022-08-28

10.  ssPINE: Probabilistic Algorithm for Automated Chemical Shift Assignment of Solid-State NMR Data from Complex Protein Systems.

Authors:  Adilakshmi Dwarasala; Mehdi Rahimi; John L Markley; Woonghee Lee
Journal:  Membranes (Basel)       Date:  2022-08-26
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.