| Literature DB >> 32459331 |
Gabriele Orlando1,2, Daniele Raimondi3, Luciano Porto Kagami1, Wim F Vranken1,4,5.
Abstract
Nuclear magnetic resonance (NMR) spectroscopy data provides valuable information on the behaviour of proteins in solution. The primary data to determine when studying proteins are the per-atom NMR chemical shifts, which reflect the local environment of atoms and provide insights into amino acid residue dynamics and conformation. Within an amino acid residue, chemical shifts present multi-dimensional and complexly cross-correlated information, making them difficult to analyse. The ShiftCrypt method, based on neural network auto-encoder architecture, compresses the per-amino acid chemical shift information in a single, interpretable, amino acid-type independent value that reflects the biophysical state of a residue. We here present the ShiftCrypt web server, which makes the method readily available. The server accepts chemical shifts input files in the NMR Exchange Format (NEF) or NMR-STAR format, executes ShiftCrypt and visualises the results, which are also accessible via an API. It also enables the "biophysically-based" pairwise alignment of two proteins based on their ShiftCrypt values. This approach uses Dynamic Time Warping and can optionally include their amino acid code information, and has applications in, for example, the alignment of disordered regions. The server uses a token-based system to ensure the anonymity of the users and results. The web server is available at www.bio2byte.be/shiftcrypt.Entities:
Year: 2020 PMID: 32459331 PMCID: PMC7319548 DOI: 10.1093/nar/gkaa391
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Distribution of the ShiftCrypt values for proteins in urea (blue histogram) and in their native conformation (orange distribution).
Figure 2.Comparison of the ShiftCrypt profile for two proteins, GB1-SC35 and a GCN4 leucine zipper, in presence (orange) and absence (blue) of urea.
Figure 3.Distribution of the alignment scores based on ShiftCrypt values, with Denat. indicating denatured proteins (by addition of urea), and native proteins in their native conformation. The box-plots represents the distribution of the alignment scores obtained by aligning all sequences between these datasets. The higher the alignment scores, the more diverse the ShiftCrypt profiless.
Figure 4.Alignment of the Thermus thermophilus (green) and Thermotoga maritima (blue) L11 ribosomal proteins obtained using the ShiftCrypt alignment. The coloured regions represent the secondary structure elements of the T. thermophilus protein: red for beta sheet, green for alpha helix and blue for coil. The region highlighted with the black box represents a region annotated as intrinsically disordered in the homologous protein of Geobacillus stearothermophilus