| Literature DB >> 32060385 |
Francesca Nerattini1, Luca Tubiana1, Chiara Cardelli1, Valentino Bianco1, Christoph Dellago1, Ivan Coluzza2,3.
Abstract
Isolating the properties of proteins that allow them to convert sequence into the structure is a long-lasting biophysical problem. In particular, studies focused extensively on the effect of a reduced alphabet size on the folding properties. However, the natural alphabet is a compromise between versatility and optimisation of the available resources. Here, for the first time, we include the impact of the relative availability of the amino acids to extract from the 20 letters the core necessary for protein stability. We present a computational protein design scheme that involves the competition for resources between a protein and a potential interaction partner that, additionally, gives us the chance to investigate the effect of the reduced alphabet on protein-protein interactions. We devise a scheme that automatically identifies the optimal reduced set of letters for the design of the protein, and we observe that even alphabets reduced down to 4 letters allow for single protein folding. However, it is only with 6 letters that we achieve optimal folding, thus recovering experimental observations. Additionally, we notice that the binding between the protein and a potential interaction partner could not be avoided with the investigated reduced alphabets. Therefore, we suggest that aggregation could have been a driving force in the evolution of the large protein alphabet.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32060385 PMCID: PMC7021711 DOI: 10.1038/s41598-020-59401-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Pictorial representation of the steps employed to enforce a competition for amino acid availability between protein and a protein , and to test its effect on the folding ability of protein in presence and absence of the artificial partner. (I) Create a Caterpillar version of the experimentally determined crystal structure of protein G (II) Shape four competing partner proteins modelled as moulds of increasing portions of the protein G. The size of the mould will influence the competition for resources, as further explained in the following sections. The larger the surface, the higher the competition. (III) Design each of the four systems considering simultaneously the proteins and . The procedure consists in searching for the ensemble of sequences that minimise the energy of both protein and while keeping the system conformation frozen in space. The competition for the amino acids is created at this stage of our simulations. (IV) After selecting the best designed sequence (see the Design subsection for details about the criterion) for each system, isolate the portion relative to the protein and test its folding ability in a single-protein folding simulation. (V) Check how the folding of the latter sequences is influenced by the presence of protein frozen in the simulation box (bearing the sequence designed concurrently to protein ).
Figure 2Folding free energy profiles F/kT of single protein (only protein , no protein ) at reduced temperature 0.55 as a function of DRMSD from the native target structure (protein G structure, PDB ID: 1pgb). Different colours correspond to protein sequences obtained via the design procedure in the presence of the protein characterised by the value specified in the key. Right hand side: configurations corresponding to the free energy minimum for each system are represented in red, compared to the native protein G (in green). for ; for ; for and .
Figure 3Folding free energy landscapes F/kT at reduced temperature 0.76 as a function of the distance from the native protein G as target and the inter-protein distance from the folded protein bound to protein (configurations depicted in the panels). The binding affinity decreases along with the protein surface size, as shown by the value of the association constants K in the plot key.