| Literature DB >> 29752607 |
João M C Teixeira1, Simon P Skinner2, Miguel Arbesú3, Alexander L Breeze2, Miquel Pons3.
Abstract
We present Farseer-NMR ( https://git.io/vAueU ), a software package to treat, evaluate and combine NMR spectroscopic data from sets of protein-derived peaklists covering a range of experimental conditions. The combined advances in NMR and molecular biology enable the study of complex biomolecular systems such as flexible proteins or large multibody complexes, which display a strong and functionally relevant response to their environmental conditions, e.g. the presence of ligands, site-directed mutations, post translational modifications, molecular crowders or the chemical composition of the solution. These advances have created a growing need to analyse those systems' responses to multiple variables. The combined analysis of NMR peaklists from large and multivariable datasets has become a new bottleneck in the NMR analysis pipeline, whereby information-rich NMR-derived parameters have to be manually generated, which can be tedious, repetitive and prone to human error, or even unfeasible for very large datasets. There is a persistent gap in the development and distribution of software focused on peaklist treatment, analysis and representation, and specifically able to handle large multivariable datasets, which are becoming more commonplace. In this regard, Farseer-NMR aims to close this longstanding gap in the automated NMR user pipeline and, altogether, reduce the time burden of analysis of large sets of peaklists from days/weeks to seconds/minutes. We have implemented some of the most common, as well as new, routines for calculation of NMR parameters and several publication-quality plotting templates to improve NMR data representation. Farseer-NMR has been written entirely in Python and its modular code base enables facile extension.Entities:
Keywords: Chemical shift perturbations; Data analysis; Intrinsically disordered proteins; NMR spectroscopy; Paramagnetic-NMR; Proteins
Mesh:
Substances:
Year: 2018 PMID: 29752607 PMCID: PMC5986830 DOI: 10.1007/s10858-018-0182-5
Source DB: PubMed Journal: J Biomol NMR ISSN: 0925-2738 Impact factor: 2.835
Fig. 1A schematic representation of the Farseer-NMR Cube. Examples of different variables are given of each Cube’s axis. Each coloured rectangle, the first one highlighted with ‘.csv’ for representation is a 2D-NMR peaklist
Fig. 2Three examples of the implemented Farseer-NMR plotting templates. 1H and 15N chemical shifts were generated from a synthetic data set simulating seven points of a ligand titration of a 100 residues protein. a Extended bar plot template and b compacted bar plot template representing the combined chemical shift perturbations (CSP), calculated according to Williamson (2013), for the last experiment in the series (with 800 µM ligand) versus the reference experiment (ligand free). Black bars represent residues measured in that spectrum. Unassigned residues are represented in grey a × ticks or b background shade. Red bars represent missing residues, i.e. residues that have been observed previously in the series but have disappeared at a given ligand concentration; the last measured value is kept. Prolines are identified by character “P”. Blue and gold bars (labelled 1 and 2, respectively) represent two pairs of residues with alternative assignment, that after plot analysis can be swapped with confidence (14F ↔ 48R, 18Q ↔ 97M), colours and labels are representations of the user annotations present in the original peaklists. A significance threshold is represented as a red line. c Progression of the CSP parameter represented individually for each residue. Peaks that have disappeared along the series are easily identified (22F), as well as unassigned peaks. The represented plots are crops of the full pictures generated by Farseer-NMR that represent the whole series. All colours, sizes, shapes, and labels are user-configurable
Fig. 3A screenshot of the Farseer-NMR user interface. The Peaklist selection tab is shown loaded with an artificial dataset as an example. The Farseer-NMR Cube variables are set up and the corresponding data points are populated with peaklists
Fig. 4A schematic representation of the Comparative/Stacking Analysis workflow. Farseer-NMR can easily generate combined data from experiments (e.g. Intensity ratios for each residue between paramagnetic and diamagnetic samples in PRE experiments—generated along the z axis—and compare the derived results as a function of an external variable (e.g. ligand concentration, corresponding to the x axis of the Farseer-NMR’s Cube) as a single combined plot or as a PRE versus ligand concentration curves for individual residues