Literature DB >> 30562021

Complex Molecules That Fold Like Proteins Can Emerge Spontaneously.

Bin Liu1, Charalampos G Pappas1, Ennio Zangrando2, Nicola Demitri3, Piotr J Chmielewski4, Sijbren Otto1.   

Abstract

Folding can bestow macromolecules with various properties, as evident from nature's proteins. Until now complex folded molecules are the product either of evolution or of an elaborate process of design and synthesis. We now show that molecules that fold in a well-defined architecture of substantial complexity can emerge autonomously and selectively from a simple precursor. Specifically, we have identified a self-synthesizing macrocyclic foldamer with a complex and unprecedented secondary and tertiary structure that constructs itself highly selectively from 15 identical peptide-nucleobase subunits, using a dynamic combinatorial chemistry approach. Folding of the structure drives its synthesis in 95% yield from a mixture of interconverting molecules of different ring sizes in a one-step process. Single-crystal X-ray crystallography and NMR reveal a folding pattern based on an intricate network of noncovalent interactions involving residues spaced apart widely in the linear sequence. These results establish dynamic combinatorial chemistry as a powerful approach to developing synthetic molecules with folding motifs of a complexity that goes well beyond that accessible with current design approaches. The fact that such molecules can form autonomously implies that they may have played a role in the origin of life at earlier stages than previously thought possible.

Mesh:

Substances:

Year:  2019        PMID: 30562021      PMCID: PMC6356852          DOI: 10.1021/jacs.8b11698

Source DB:  PubMed          Journal:  J Am Chem Soc        ISSN: 0002-7863            Impact factor:   15.419


Introduction

The chemistry of life relies on biopolymers (proteins, nucleic acids) folding into specific conformations that dictate their properties. It is generally believed that the complex folded structures encountered in biology are the result of millions of years of evolution. Much of the research on synthetic foldamers[1−13] is driven by the desire to bypass evolution, go beyond the constraints of using only nature’s building blocks, and directly access structures that fold like proteins, but are based on completely synthetic subunits. The ultimate goal is achieving new and sophisticated functions that require the molecular complexity of extended folded structures. This goal is still largely out of reach and foldamers able to exhibit specific function have remained rare,[14−16] due to the huge challenge of obtaining new modes of folding in designed proteins,[17,18] molecules that mimic peptides or nucleic acids,[5,8,9] and in completely abiotic molecules.[1−4,6,7,10−13] The approach taken to accessing new synthetic foldamers has until now relied almost exclusively on design, followed by multistep synthesis. An impressive new range of backbones have been developed that fold into a variety of well-defined architectures, including secondary structures such as helices and sheets.[19] Yet, the design approach tends to be based on relatively simple and small-range assembly motifs, primarily driven by interactions between residues close to each other in the oligomeric chain of monomer units. Foldamers that rely on long-range interactions (between residues further apart in the oligomeric chain, as observed in the folding of proteins and nucleic acids) have remained difficult to access due to a lack of reliable design rules. Hence, alternative approaches are needed for accessing fundamentally new classes of foldamers that rely on long-range interactions. Dynamic combinatorial chemistry[20,21] has been suggested as a useful selection tool for accessing such new folded structures. In brief, in a dynamic combinatorial library (DCL) building blocks react with each other to give rise to a mixture of oligomeric compounds that continuously exchange these building blocks between them. When a specific library member is able to form efficient intramolecular noncovalent interactions, inducing it to fold, this compound should be more stable than other library members that are unable to engage in such interactions. Hence, the library composition should shift in favor of the foldamer. Indeed, several groups have reported folding-driven changes in library compositions.[22−29] However, until now this approach has failed to deliver fundamentally new folding motifs. This lack of success is surprising, given that dynamic combinatorial methods have proven successful in the discovery of new and often unexpected host–guest systems,[30,31] interlocked structures,[31,32] and self-replicating molecules.[33] All of these systems rely, like foldamers, on noncovalent interactions as the prime selection criterion. We now report results that should prompt a revival of the combinatorial approach to foldamers. We discovered a new macrocycle that folds into a complex but precisely defined secondary and tertiary structure, constructing itself out of 15 identical peptide-nucleobase subunits. Folding of the structure drives its synthesis in 95% yield from a mixture of interconverting molecules of different ring size in a one-step process. Single-crystal X-ray crystallography and NMR analyses reveal a new folding pattern based on an intricate network of hydrogen bonds and π-stacking interactions featuring long-range interactions.

Results and Discussion

We designed chimeric building block 1, which contains an amino-acid and a nucleobase subunit, both key structural elements involved in the folding of proteins and nucleic acids, respectively. We reasoned that the presence of these two types of subunits, which are central in nature’s folded macromolecules, should maximize the chances of accessing foldamers in our dynamic libraries. At the same time, and unlike nature, having both subunits present within a single building should give rise to fundamentally new foldamers. Building block 1 was synthesized in six steps as outlined in the Supporting Information. The building block is equipped with two thiol groups, which, upon exposure to atmospheric oxygen, oxidize to give rise to an equilibrium mixture of different macrocyclic disulfides,[30] which interconvert through thiol–disulfide exchange (Figure a). Ordinarily, in the absence of any noncovalent interactions, DCLs of building blocks with the benzenedithiol core of 1 are dominated by small trimeric and tetrameric macrocycles (the dimer is too strained to form in significant quantities). Indeed, a small DCL made from a 50 μM solution of building block 1 in borate buffer (50 mM, pH 8.2) consisted mostly of cyclic tetramer (top trace in Figure b; composition monitored by UPLC). However, repeating the same experiment at higher building block concentration (500 μM) led to the emergence of an unusually large macrocycle, consisting of 15 subunits of 1, as evident from mass spectrometric analysis of the relevant peak in the UPLC chromatogram (Figure S20). Increasing the concentration of the building block or addition of NaCl (1.0 M) to the mixture enhanced the formation of the 15mer, accounting for up to 95% of the library material (Figure b, trace 4). Experiments with other salts revealed similar effects (Figure S4), indicating that the exact nature of the salt is not important and that the formation of the 15mer benefits from a high ionic strength, suggesting that the salt acts to reduce charge repulsion within the 15mer. The 15mer is most likely the thermodynamic product, as partial reduction followed by reoxidation led to the almost quantitative re-formation of the 15mer (Figure S6). Experiments in the absence of salt but in the presence of 50% acetone (bottom trace in Figure b) or other cosolvents (Figure S5) gave only negligible amounts of 15mer, suggesting that hydrophobic interactions are important in stabilizing this compound. These observations, together with the fact that the 15mer is only poorly retained on the UPLC column, suggest that the compound adopts a folded structure. This hypothesis was confirmed when we characterized its structure using tandem mass spectrometry, circular dichroism (CD) spectroscopy, X-ray crystallography, and NMR spectroscopy.
Figure 1

Selective formation of the 15mer. (a) Dynamic combinatorial library of amino-acid nucleobase building block 1. Arrows indicate two sets of π-stacks observed between phenyls and adenines. The remaining three stacks are not shown for clarity. (b) UPLC chromatograms (absorption at 254 nm) showing library compositions after 14 days of stirring (in 50 mM borate buffer, pH = 8.2) at a building block concentration of (1) 0.050 mM; (2) 0.50 mM; (3) 2.0 mM; (4) 0.50 mM in the presence of 1.0 M NaCl; and (5) 0.50 mM in the presence of 50% acetone. (c) CD spectrum of monomer 1 (dotted line) and 15mer (solid line) in water at 298 K.

Selective formation of the 15mer. (a) Dynamic combinatorial library of amino-acid nucleobase building block 1. Arrows indicate two sets of π-stacks observed between phenyls and adenines. The remaining three stacks are not shown for clarity. (b) UPLC chromatograms (absorption at 254 nm) showing library compositions after 14 days of stirring (in 50 mM borate buffer, pH = 8.2) at a building block concentration of (1) 0.050 mM; (2) 0.50 mM; (3) 2.0 mM; (4) 0.50 mM in the presence of 1.0 M NaCl; and (5) 0.50 mM in the presence of 50% acetone. (c) CD spectrum of monomer 1 (dotted line) and 15mer (solid line) in water at 298 K. The 15mer was isolated by preparative HPLC (isolated yield: 90%) and then analyzed by means of MALDI TOF/TOF. The results (Figure S7) show that the 15mer fragments into smaller components, ranging from tetramer up to 12mer, indicating that the 15mer is a single macrocycle and not a system of interlocked rings (a catenane). The CD spectrum of the 15mer showed an intense positive band at 260 nm, which can be attributed to the absorption of the aromatic dithiol[34] and the terminal adenine. Importantly, the CD signals of the 15mer are dramatically enhanced compared to those of monomer 1 (Figure c), which suggests that the aromatic rings reside in a well-defined chiral environment, even though these rings are relatively remote from the chiral center in building block 1, located on the amino-acid residue. Detailed insights into the structure of the 15mer were obtained from single-crystal X-ray diffraction data. Crystals of the 15mer were prepared by slow diffusion of acetone into an aqueous solution of the 15mer. The crystal structure (Figure ) confirms that the 15mer is a single giant macrocycle[35] linked together by 15 disulfide bonds to give a 75-atom ring. Figure a shows the extended conformation of this ring for clarity, which, in reality, is collapsed into a compact but intricately folded structure. Figure e shows the 75-atom ring in its highly twisted conformation. Overall, the structure is characterized by a hydrophobic core and presents its hydrophilic groups on its surface, similar to what is observed in folded proteins. The most notable structural motif is the stacking of aromatic rings. Five stacks of three phenyl rings can be identified (shown in Figure f), capped with an adenine ring at the top and bottom (except where these adenines are recruited for crystal packing), as shown for one of these stacks in Figure g. The distances between the three phenyl rings in these stacks are in the range of 3.43–3.45 Å, while the distances between the phenyl and adenine rings are somewhat larger (3.46–3.49 Å), but all in the range typical for π-stacking.[36] Interestingly, the rings that end up in the same stack are spaced far apart in the extended structure; as indicated in Figure a, a stack is composed of the phenyl rings from the i, i+2, and i+4 residues, while the capping adenines belong to the i–3 and the i+7 residues. Except for the adenines recruited for crystal packing (disrupting the otherwise 5-fold symmetry of the structure), this arrangement gets repeated five times in an interdigitated fashion (the second stack is indicated by a dotted line in Figure a; the other stacks are not shown for clarity). The five stacks of rings are arranged in a tiled fashion, as shown in Figure f. Thus, not only can secondary structure elements be identified (corresponding to the stacks of rings), but since these stacks adopt well-defined orientations with respect to each other, also tertiary structure is present, which has only very recently been achieved in designed foldamers.[37] Apart from π-stacking interactions, also hydrogen bonds play an important role in stabilizing this folded structure. Between the building blocks that constitute the π-stack (residues i, i+2, and i+4), five hydrogen bonds are observed between the NH and CO groups (Figure h).
Figure 2

X-ray crystal structure of the 15mer. (a) Top view of the central cavity of the macrocycle. (b) Side view. (c) Top and (d) side view of the 15mer in space-filling representation. (e) The ring of disulfide bonds connecting the phenyl rings. (f) Core part of the foldamer, showing five stacks of three phenyl rings connected by disulfide bonds (two of the five sets of stacked phenyls are indicated by arrows in the extended structure shown in Figure a). (g) Top view of the 15mer highlighting one stack of three core phenyl rings and two adenines on the top and bottom of the stack. (h) Set of intermolecular hydrogen bonds formed between the three building blocks that constitute the stack of three phenyl rings. Solvent molecules, hydrogen atoms, and disorders are omitted for clarity. C atoms are shown in gray, N in purple, O in red, and S in yellow; except in panel f the C atoms of the macrocycle core are shown in light blue.

X-ray crystal structure of the 15mer. (a) Top view of the central cavity of the macrocycle. (b) Side view. (c) Top and (d) side view of the 15mer in space-filling representation. (e) The ring of disulfide bonds connecting the phenyl rings. (f) Core part of the foldamer, showing five stacks of three phenyl rings connected by disulfide bonds (two of the five sets of stacked phenyls are indicated by arrows in the extended structure shown in Figure a). (g) Top view of the 15mer highlighting one stack of three core phenyl rings and two adenines on the top and bottom of the stack. (h) Set of intermolecular hydrogen bonds formed between the three building blocks that constitute the stack of three phenyl rings. Solvent molecules, hydrogen atoms, and disorders are omitted for clarity. C atoms are shown in gray, N in purple, O in red, and S in yellow; except in panel f the C atoms of the macrocycle core are shown in light blue. The solution-phase 1H NMR spectrum (Figure a) of the isolated 15mer (D2O, 298 K) is consistent with the X-ray crystal structure. The presence of sharp signals is in agreement with a highly ordered compact structure. In the spectrum the monomeric unit 1 appears in three separate sets of signals, indicative of a C5 symmetry for the 15mer in solution. The signals of the protons of the phenyl rings are distributed over a wide range of chemical shifts (from 5.8 to 7.6 ppm). The large upfield shift of two of these (to 6.6 and 5.8 ppm, respectively, versus 7.2 and 7.3 ppm in monomer 1) indicates that some of them are adjacent to the face of an aromatic ring. 2D-NMR studies enabled the complete assignment of the spectrum of the 15mer as well as the assignment of several through-space interactions that are consistent with the X-ray crystal structure (Figure b). These data indicate that, also in solution, the 15mer adopts a folded structure that is very similar to that observed in the X-ray crystal structure.
Figure 3

NMR and CD characterization of the 15mer. (a) 1H NMR spectrum of the 15mer (500 MHz, D2O, 298 K), with signal assignments corresponding to the labeling shown in Figure a. (b) Repeating unit of the 15mer with observed NOEs. (c) Changes in the Cotton effect intensities at specified wavelengths in the CD spectra of a DMF solution of the 15mer observed upon cooling from 373 K to 248 K.

NMR and CD characterization of the 15mer. (a) 1H NMR spectrum of the 15mer (500 MHz, D2O, 298 K), with signal assignments corresponding to the labeling shown in Figure a. (b) Repeating unit of the 15mer with observed NOEs. (c) Changes in the Cotton effect intensities at specified wavelengths in the CD spectra of a DMF solution of the 15mer observed upon cooling from 373 K to 248 K. We then attempted to unfold the 15mer. Temperature-dependent 1H NMR experiments on aqueous solutions of the 15mer were performed, but no significant spectral changes were observed, even at temperatures up to 353 K (Figure S12). Acetone was added (which disrupts the process of formation of the 15mer; vide supra), but the NMR spectra remained essentially unchanged (Figure S13), indicating that the 15mer, once formed, does not readily unfold. Unfolding was eventually achieved by dissolving the 15mer in DMF-d7. At room temperature sharp signals corresponding to the folded 15mer were still observed (Figure S14), but these broadened significantly when the temperature was increased to 363 K, suggesting (at least partial) unfolding. At 373 K one set of relatively sharp signals was observed indicating a 15-fold symmetry of the system and, thus, complete unfolding of the macrocycle. The original spectrum was retained upon cooling the sample to room temperature. The unfolding–refolding process was further analyzed by variable-temperature CD spectroscopy in dimethylformamide (DMF) (Figure S17). Reversible attenuation and enhancement of the CD signals were observed by heating and cooling the DMF solution (Figure c). Finally, we investigated the extent to which foldamer formation depends on the structure of the building block. Foldamer formation was critically dependent on the presence of the nucleobase, but not specific for adenine, as replacement of adenine by guanine also gave the 15mer in 95% yield (Figure S18), while without any nucleobase only cyclic trimers and tetramers were detected (Figure S19).

Conclusions

These results establish dynamic combinatorial chemistry as a promising tool for identifying new foldamers with unprecedented folds of substantial structural complexity that are currently impossible to predict or even rationalize after the event. This method allows selective access to oligomers of precisely defined length in remarkably high yield and with minimal synthetic effort. Having thus established an efficient discovery tool for synthetic structures that fold like proteins, a new frontier in foldamer science is opening up, which should enable the field to become populated with fundamentally new folds. Indeed, in the months following the discovery of the 15mer described herein, we succeeded in identifying a handful of other foldamers of different ring size, based on different building blocks. Their structural characterization is currently underway and will be reported in due course. Accumulation of a sufficiently large number of such structures should, on the long-term, allow rules to be identified that may bring the holy grail in foldamer science a step closer: accessing new folds by design. Our findings are also relevant in the context of the origin of life, as they prove that folded structures of considerable complexity can emerge selectively and spontaneously from a mixture of interconverting molecules, simply as a result of their thermodynamic stability. Such folded structures could have existed and played a role in the origin of life at earlier stages than previously thought possible.
  31 in total

1.  Interconversion of single and double helices formed from synthetic molecular strands.

Authors:  V Berl; I Huc; R G Khoury; M J Krische; J M Lehn
Journal:  Nature       Date:  2000-10-12       Impact factor: 49.962

Review 2.  A field guide to foldamers.

Authors:  D J Hill; M J Mio; R B Prince; T S Hughes; J S Moore
Journal:  Chem Rev       Date:  2001-12       Impact factor: 60.622

3.  Selection and amplification of hosts from dynamic combinatorial libraries of macrocyclic disulfides.

Authors:  Sijbren Otto; Ricardo L E Furlan; Jeremy K M Sanders
Journal:  Science       Date:  2002-07-26       Impact factor: 47.728

4.  Backbone thioester exchange: a new approach to evaluating higher order structural stability in polypeptides.

Authors:  Matthew G Woll; Samuel H Gellman
Journal:  J Am Chem Soc       Date:  2004-09-15       Impact factor: 15.419

5.  Dynamic combinatorial chemistry.

Authors:  Peter T Corbett; Julien Leclaire; Laurent Vial; Kevin R West; Jean-Luc Wietor; Jeremy K M Sanders; Sijbren Otto
Journal:  Chem Rev       Date:  2006-09       Impact factor: 60.622

Review 6.  Foldamers as versatile frameworks for the design and evolution of function.

Authors:  Catherine M Goodman; Sungwook Choi; Scott Shandler; William F DeGrado
Journal:  Nat Chem Biol       Date:  2007-05       Impact factor: 15.040

7.  Amplification of acetylcholine-binding catenanes from dynamic combinatorial libraries.

Authors:  Ruby T S Lam; Ana Belenguer; Sarah L Roberts; Christoph Naumann; Thibaut Jarrosson; Sijbren Otto; Jeremy K M Sanders
Journal:  Science       Date:  2005-03-10       Impact factor: 47.728

8.  Folding-driven synthesis of oligomers.

Authors:  K Oh; K S Jeong; J S Moore
Journal:  Nature       Date:  2001 Dec 20-27       Impact factor: 49.962

9.  Foldamers with heterogeneous backbones.

Authors:  W Seth Horne; Samuel H Gellman
Journal:  Acc Chem Res       Date:  2008-07-01       Impact factor: 22.384

10.  Dynamic covalent chemistry on self-templating peptides: formation of a disulfide-linked beta-hairpin mimic.

Authors:  Yamuna Krishnan-Ghosh; Shankar Balasubramanian
Journal:  Angew Chem Int Ed Engl       Date:  2003-05-16       Impact factor: 15.336

View more
  11 in total

1.  Primitive selection of the fittest emerging through functional synergy in nucleopeptide networks.

Authors:  Anil Kumar Bandela; Nathaniel Wagner; Hava Sadihov; Sara Morales-Reina; Agata Chotera-Ouda; Kingshuk Basu; Rivka Cohen-Luria; Andrés de la Escosura; Gonen Ashkenasy
Journal:  Proc Natl Acad Sci U S A       Date:  2021-03-02       Impact factor: 11.205

2.  Dynamic Exchange of Substituents in a Prebiotic Organocatalyst: Initial Steps towards an Evolutionary System.

Authors:  Anna C Closs; Maximilian Bechtel; Oliver Trapp
Journal:  Angew Chem Int Ed Engl       Date:  2021-11-27       Impact factor: 16.823

3.  Cyclic Thiosulfonates for Thiol-Mediated Uptake: Cascade Exchangers, Transporters, Inhibitors.

Authors:  Takehiro Kato; Bumhee Lim; Yangyang Cheng; Anh-Tuan Pham; John Maynard; Dimitri Moreau; Amalia I Poblador-Bahamonde; Naomi Sakai; Stefan Matile
Journal:  JACS Au       Date:  2022-03-22

4.  Hybrid Sequences that Express both Aromatic Amide and α-Peptidic Folding Features.

Authors:  Xiaobo Hu; Pradeep K Mandal; Brice Kauffmann; Ivan Huc
Journal:  Chempluschem       Date:  2020-07       Impact factor: 2.863

5.  Spontaneous Emergence of Self-Replicating Molecules Containing Nucleobases and Amino Acids.

Authors:  Bin Liu; Charalampos G Pappas; Jim Ottelé; Gaël Schaeffer; Christoph Jurissek; Priscilla F Pieters; Meniz Altay; Ivana Marić; Marc C A Stuart; Sijbren Otto
Journal:  J Am Chem Soc       Date:  2020-02-18       Impact factor: 15.419

6.  Macrocycles of higher ortho-phenylenes: assembly and folding.

Authors:  Zacharias J Kinney; Viraj C Kirinda; C Scott Hartley
Journal:  Chem Sci       Date:  2019-08-09       Impact factor: 9.825

7.  Biomimetic selenocystine based dynamic combinatorial chemistry for thiol-disulfide exchange.

Authors:  Andrea Canal-Martín; Ruth Pérez-Fernández
Journal:  Nat Commun       Date:  2021-01-08       Impact factor: 14.919

8.  An Approach to the De Novo Synthesis of Life.

Authors:  Sijbren Otto
Journal:  Acc Chem Res       Date:  2021-12-29       Impact factor: 22.384

9.  Alternating Current Electrolysis for the Electrocatalytic Synthesis of Mixed Disulfide via Sulfur-Sulfur Bond Metathesis towards Dynamic Disulfide Libraries.

Authors:  Lars Erik Sattler; Chris Josef Otten; Gerhard Hilt
Journal:  Chemistry       Date:  2020-02-18       Impact factor: 5.236

10.  Dynamic covalent self-assembly of mechanically interlocked molecules solely made from peptides.

Authors:  Hendrik V Schröder; Yi Zhang; A James Link
Journal:  Nat Chem       Date:  2021-08-23       Impact factor: 24.427

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.