Literature DB >> 30500878

TADOSS: computational estimation of tandem domain swap stability.

Aleix Lafita1, Pengfei Tian2, Robert B Best2, Alex Bateman1.   

Abstract

SUMMARY: Proteins with highly similar tandem domains have shown an increased propensity for misfolding and aggregation. Several molecular explanations have been put forward, such as swapping of adjacent domains, but there is a lack of computational tools to systematically analyze them. We present the TAndem DOmain Swap Stability predictor (TADOSS), a method to computationally estimate the stability of tandem domain-swapped conformations from the structures of single domains, based on previous coarse-grained simulation studies. The tool is able to discriminate domains susceptible to domain swapping and to identify structural regions with high propensity to form hinge loops. TADOSS is a scalable method and suitable for large scale analyses.
AVAILABILITY AND IMPLEMENTATION: Source code and documentation are freely available under an MIT license on GitHub at https://github.com/lafita/tadoss. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2018. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30500878      PMCID: PMC6612889          DOI: 10.1093/bioinformatics/bty974

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Protein misfolding and aggregation is a major problem for cells and organisms, and the cause of severe human diseases like Alzheimer’s. Recent studies have shown an increased propensity of aggregation in proteins containing identical domains in tandem (Borgia ; Wright ). One of the misfolded conformations identified in these experiments are domain swaps, i.e. part of the structure of a domain folds into its adjacent domain. Domain swapping has been associated with protein aggregation (Rousseau ), so its computational prediction is of widespread biomedical and biotechnological interest. A recent study by Tian and Best (2016) demonstrated using coarse-grained simulations of tandem pairs of identical domains that the difference in stability between the native and the domain-swapped conformations correlates with the swapping propensity. They also described an alchemical approach (i.e. simplified model) to estimate the free energy difference of the two conformations that can be generally used to predict domain swapping. Here we present an improved and fully automated version of the method, named TAndem DOmain Swap Stability predictor (TADOSS), which can be used to systematically find domains susceptible to domain swapping and identify the regions of the structure with the highest propensity to form hinge loops.

2 Description

As described originally by Tian and Best (2016), the total free energy difference between the native and swapped conformations can be split into the energy of joining the N- and C- termini of the domain () and the energy of cutting the domain (), i.e. forming a hinge loop between the swapped domains. TADOSS systematically evaluates all possible cut positions in the domain and calculates the free energy contribution of forming a hinge loop of at least three residues. The profile of is valuable to identify the regions of the domain that are more susceptible to form hinge loops in the tandem domain swaps, as shown in Figure 1.
Fig. 1.

Alchemical free energy of forming a hinge loop () centered at every residue position in an SH3 domain (ECOD: e1shgA1). Energies can be visualized as a profile along the domain sequence (A) or mapped in 3D to the domain structure (B). Higher (more positive) free energy differences (red) indicate regions of the structure more likely to form hinge loops. For the SH3 domain, the alchemical free energy of joining the termini () is −6.1 kcal/mol. The most stable domain swap conformation is predicted to be forming a hinge loop centered at Position 33, with a total of −6.8 kcal/mol. The corresponding tandem domain-swapped structure extracted from the simulations by Tian and Best (2016) is also shown (C)

Alchemical free energy of forming a hinge loop () centered at every residue position in an SH3 domain (ECOD: e1shgA1). Energies can be visualized as a profile along the domain sequence (A) or mapped in 3D to the domain structure (B). Higher (more positive) free energy differences (red) indicate regions of the structure more likely to form hinge loops. For the SH3 domain, the alchemical free energy of joining the termini () is −6.1 kcal/mol. The most stable domain swap conformation is predicted to be forming a hinge loop centered at Position 33, with a total of −6.8 kcal/mol. The corresponding tandem domain-swapped structure extracted from the simulations by Tian and Best (2016) is also shown (C) The length of the linker between adjacent domains also plays an important role in the stability of tandem domain swaps. Longer linkers allow the connection of the N- and C-termini of the domain, thereby increasing the . To account for this effect, we have introduced an optional parameter in TADOSS to reduce the effective distance between the termini of the domain proportional to the length of the linker. Finally, the total free energy difference of a tandem domain swap is obtained by summing up the free energy differences for cutting () and joining () the domain. The most susceptible domain swaps are those with the maximum (most positive). More information about the energetic model and its parameters can be found in the supplementary materials.

3 Results

The alchemical free energy difference from TADOSS correlates well with the free energies obtained in the simulations by Tian and Best (2016), although the scale differs by a factor of two approximately (Supplementary Fig. S1). We also find an agreement in the effect of the linker length between the simulated and alchemical free energy differences (Supplementary Fig. S2). Using the free energy profile from TADOSS, it is possible to reproduce experimental measurements and identify with good accuracy the folding units of a DHFR domain characterized by Iwakura (Supplementary Fig. S3). Furthermore, hinge loops of experimentally determined domain swap dimers presented by Ding correspond to maximums of the profile, as expected (Supplementary Fig. S4). We also provide a dataset of alchemical estimations for T-group (topology) representatives in the ECOD database (Cheng ). We find a significant proportion of domains susceptible to tandem domain swapping (Supplementary Fig. S5).

4 Implementation

TADOSS is written in Python and bundled in a Bash script with a simple interface to the user. The program takes the structure of a protein domain as a PDB file and generates output files with the alchemical free energy differences. The method requires BioPython (Cock ) to parse and manipulate the domain structures and either GROMACS (Abraham ) or Reduce (Word ) to add the hydrogens. The structure of the input domain is represented using a coarse-grained structure-based (Gō-like) model, as described by Karanicolas and Brooks (2002). Native interactions in the structure are attractive and the relative contact energies are set according to the Miyazawa-Jernigan matrix (Miyazawa and Jernigan, 1996). The running time for an example domain of 159 residues on a MacBook Pro 2.9 GHz Intel Core i5 with 16 GB RAM is about 4 s. The method scales quadratically with the number of residues in the input domain structure (Supplementary Fig. S6).

Funding

This work was supported by the intramural research program of the National Institute of Diabetes and Digestive and Kidney Diseases (grant number ZIA DK075104-06) of the National Institutes of Health to P.T. and R.B.B. This work was funded by the European Molecular Biology Laboratory. Conflict of Interest: none declared. Click here for additional data file.
  11 in total

1.  Systematic circular permutation of an entire protein reveals essential folding elements.

Authors:  M Iwakura; T Nakamura; C Yamane; K Maki
Journal:  Nat Struct Biol       Date:  2000-07

2.  The origins of asymmetry in the folding transition states of protein L and protein G.

Authors:  John Karanicolas; Charles L Brooks
Journal:  Protein Sci       Date:  2002-10       Impact factor: 6.725

3.  The importance of sequence diversity in the aggregation and evolution of proteins.

Authors:  Caroline F Wright; Sarah A Teichmann; Jane Clarke; Christopher M Dobson
Journal:  Nature       Date:  2005-12-08       Impact factor: 49.962

4.  Topological determinants of protein domain swapping.

Authors:  Feng Ding; Kirk C Prutzman; Sharon L Campbell; Nikolay V Dokholyan
Journal:  Structure       Date:  2006-01       Impact factor: 5.006

5.  Biopython: freely available Python tools for computational molecular biology and bioinformatics.

Authors:  Peter J A Cock; Tiago Antao; Jeffrey T Chang; Brad A Chapman; Cymon J Cox; Andrew Dalke; Iddo Friedberg; Thomas Hamelryck; Frank Kauff; Bartek Wilczynski; Michiel J L de Hoon
Journal:  Bioinformatics       Date:  2009-03-20       Impact factor: 6.937

6.  Asparagine and glutamine: using hydrogen atom contacts in the choice of side-chain amide orientation.

Authors:  J M Word; S C Lovell; J S Richardson; D C Richardson
Journal:  J Mol Biol       Date:  1999-01-29       Impact factor: 5.469

Review 7.  Implications of 3D domain swapping for protein folding, misfolding and function.

Authors:  Frederic Rousseau; Joost Schymkowitz; Laura S Itzhaki
Journal:  Adv Exp Med Biol       Date:  2012       Impact factor: 3.650

8.  ECOD: an evolutionary classification of protein domains.

Authors:  Hua Cheng; R Dustin Schaeffer; Yuxing Liao; Lisa N Kinch; Jimin Pei; Shuoyong Shi; Bong-Hyun Kim; Nick V Grishin
Journal:  PLoS Comput Biol       Date:  2014-12-04       Impact factor: 4.475

9.  Structural Determinants of Misfolding in Multidomain Proteins.

Authors:  Pengfei Tian; Robert B Best
Journal:  PLoS Comput Biol       Date:  2016-05-10       Impact factor: 4.475

10.  Transient misfolding dominates multidomain protein folding.

Authors:  Alessandro Borgia; Katherine R Kemplen; Madeleine B Borgia; Andrea Soranno; Sarah Shammas; Bengt Wunderlich; Daniel Nettels; Robert B Best; Jane Clarke; Benjamin Schuler
Journal:  Nat Commun       Date:  2015-11-17       Impact factor: 14.919

View more
  3 in total

Review 1.  Tandem domain swapping: determinants of multidomain protein misfolding.

Authors:  Aleix Lafita; Pengfei Tian; Robert B Best; Alex Bateman
Journal:  Curr Opin Struct Biol       Date:  2019-06-28       Impact factor: 6.809

2.  Periscope Proteins are variable-length regulators of bacterial cell surface interactions.

Authors:  Fiona Whelan; Aleix Lafita; James Gilburt; Clément Dégut; Samuel C Griffiths; Huw T Jenkins; Alexander N St John; Emanuele Paci; James W B Moir; Michael J Plevin; Christoph G Baumann; Alex Bateman; Jennifer R Potts
Journal:  Proc Natl Acad Sci U S A       Date:  2021-06-08       Impact factor: 11.205

3.  Exploring the sequence fitness landscape of a bridge between protein folds.

Authors:  Pengfei Tian; Robert B Best
Journal:  PLoS Comput Biol       Date:  2020-10-13       Impact factor: 4.475

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.