Fluorous mixture synthesis minimizes the effort to synthesize small-molecule libraries by labelling the molecules rather than the reaction vessels. Reactants are labelled with fluorinated tags and products can later be demixed based on the fluorine content. A limit in the number of available tags can be overcome by using binary encoding so that a total of four tags can label uniquely a library of 16 compounds. This strategy, however, means that separation based on fluorine content alone is not possible. Here, we solve this problem by selectively removing one tag after an initial demixing step; a second demixing provides each individual compound. The usefulness of this strategy is demonstrated by the synthesis of a library that contains all 16 diastereomers of the natural products macrosphelides A and E. Macrosphelide D was not in this library, and so its assigned structure was incorrect. We determined its constitution by using NMR spectroscopy and its configuration by synthesizing four candidate stereoisomers.
Fluorous mixture synthesis minimizes the effort to synthesize small-molecule libraries by labelling the molecules rather than the reaction vessels. Reactants are labelled with fluorinated tags and products can later be demixed based on the fluorine content. A limit in the number of available tags can be overcome by using binary encoding so that a total of four tags can label uniquely a library of 16 compounds. This strategy, however, means that separation based on fluorine content alone is not possible. Here, we solve this problem by selectively removing one tag after an initial demixing step; a second demixing provides each individual compound. The usefulness of this strategy is demonstrated by the synthesis of a library that contains all 16 diastereomers of the natural products macrosphelides A and E. Macrosphelide D was not in this library, and so its assigned structure was incorrect. We determined its constitution by using NMR spectroscopy and its configuration by synthesizing four candidate stereoisomers.
Fluorous mixture synthesis (FMS) is a powerful technique for minimizing the effort in making small libraries of complex organic molecules in individual, pure form.[1-3] Whereas the reaction vessels are labeled in traditional parallel synthesis, it is the molecules that are labeled in FMS. For example, stereoisomer libraries of natural products can be made by using the structure of a fluorous tag to encode the configuration of one or more stereocenters of the target. The resulting differentially tagged molecules are called quasiisomers.[4] Syntheses can be conducted on mixtures of quasiisomers (saving effort), then late stage demixing based on the fluorous tags separates the quasiisomers. Finally, detagging gives the true stereoisomers.Fluorous tags serve varying purposes in organic synthesis.[5-8] The fluorous tags in FMS serve two roles—they identify the tagged quasiisomers (information) and they provide for their separation by fluorous HPLC (demixing). Both roles are based on fluorine content. The second role is referred to as a “strategic separation”[9] because the separation is dictated primarily by the tag, not the molecule that is tagged.Fluorous tags usually bear simple perfluoroalkyl groups (RF, –(CF2)nF) for the encoding, so the number of tags (and therefore tagged components) in a mixture would appear to be large. However, this is not so. Groups from RF = CF3 up to about C9F19 are preferred because larger tags begin to convey undesirable teflon-like properties such as low solubility. Within this already limited tag set, precursors with even numbers of carbon atoms are less expensive and more readily available than those with odd numbers.So four groups emerge as especially practical, C2F5, C4F9, C6F13, and C8F17, and it is important to leverage these groups as much as possible. In standard single tagging, the four fluorous tags are used to encode and mix four molecules. For example, we made four-compound mixtures of pinesaw fly sex pheromones by using fluorous tags to encode the configurations at C2 and C3 with a fluorous PMB tag on C2 (Figure 1a).[10,11] Each tag has a different number of fluorines, and this forms the basis of demixing by fluorous HPLC.[12]
Figure 1
Single and double tagging strategies: (a) four individual fluorous tags provide four pinesaw fly pheromones; (b) two pairs of fluorous tags provide four lagunapyrones, but four different tags are still needed to prevent redundant fluorine content; (c) orthogonal tagging with four fluorous tags and four oligoethylene glycol (OEG) tags provides sixteen murisolins.
The approach has been extended to double tagging as illustrated with lagunapyrone.[13] Here a pair of two-compound mixtures was coupled together to make a four-compound mixture of quasiisomers (Figure 16). In this application, the final tagged compounds are demixed based on the sum of two tags. Variants of this approach were used for cytostatins[14] and passifloricins.[15]Double tagging with fluorous tags and oligoethyleneglycol (OEG) tags[16] allows 16 compounds to be made in one mixture, as illustrated with murisolin stereoisomers (Figure 6).[17,18] Here eight different tags are needed, four fluorous and four OEG, to make the “double mixture”. Both a fluorous and an OEG demixing (double demixing) are needed to separate this mixture. In this approach, the protecting groups bearing the tags are identical or very similar (here DMB and PMB) so that they can be removed together under the same conditions after demixing. For practical reasons, this approach is best applied when two large fragments are made separately by single mixture synthesis (one fluorous-tagged mixture and one OEG-tagged mixture), then coupled very late in the synthesis to make the double mixture. This was the strategy used to make the murisolin stereoisomer library.Purely from the information standpoint, the most efficient method of tagging is to use simple binary encoding with a single class of tag. In principle, sixteen compounds could be tagged at two sites with only four fluorous groups (24 = 16), thirty-two with only five groups (25 = 32), and so on. However, binary fluorous encoding seems impractical at first glance because of redundancy. For example, a binary-tagged mixture with pairs of the four fluorous RF groups would have four different tagged compounds containing 22 fluorines (C2F5/C8F17, C4F9/C6F13, C6F13/C4F9, C8F17/C2F5). There are other groups of two and three compounds in this mixture that have redundancies as well (see Figure 2 below). In short, binary encoding works for identification but not separation.
Figure 2
Binary tagging with four fluorous tags. Redundancies are lifted by sequential (rather than simultaneous) removal of the tags: (a) the encoding pattern and HPLC retention order of 16 compounds with four RF groups; (b) detagging and second demixing exemplified with fraction 4 (22 fluorine atoms) produces four individual compounds.
We recognized that all the redundancies in a binary-encoded mixture could be lifted to enable separation by sequential removal of the two tags. To prove the principle, we report here synthesis and isolation of sixteen stereoisomers of macrosphelide natural products A[19,20] and E[21-23] (1-A and 1-E, below) from a single mixture of sixteen tagged quasiisomers encoded with only four different RF groups. We also expected this library to contain macrosphelide D (1-D),[24] but it did not. We deduced the correct structure of 1-D from spectra, then proved this by synthesis.This new binary encoding capitalizes on orthogonal protecting groups,[25-27] which are commonly used in synthesis to hide or reveal functional groups at predetermined times. Here we have no need for that application; instead we retool orthogonal protecting groups for the first time for use in strategic separation.
Results
Binary tagging with selective tag removal
Figure 2a shows the tag encoding pattern for sixteen compounds that results from a 4 × 4 tagging with two fluorous tags in the “even” perfluoroalkyl series (C2F5, C4F9, C6F13, C8F17). Only two compounds, the first with 10 fluorines and the last with 34, have unique fluorine contents. The others all share the same number of fluorine atoms with one, two or three other members of the set. This is the redundancy problem.Imagine an ideal HPLC demixing of this set of compounds, which should produce seven fractions based on fluorine content. The first and last compounds are already individual and can be detagged. Now imagine that the tags are orthogonal, and that either one (but not both) of the two tags is removed from the other five mixtures. This breaks all the remaining redundancies, as illustrated for fraction 4 (22 fluorines) in Figure 2b. Demixing of the mono-detagged fractions followed by detagging now produces all sixteen target compounds as individual samples of unambiguous structure based on the tag pattern.
FMS of the macrosphelide stereoisomer library
To test this idea of double tagging/double demixing, we selected the stereoisomeric natural products macrosphelides A, D and E, because they and their congeners have high anti-cancer activity,[18-22] because their syntheses are well established,[20,28-31] and especially because they have two identical sub-units (C5–C9 and C11– C15). Accordingly four stereoisomers of this sub-unit (2, Figure 3a) will provide a stereoisomer library of macrosphelides containing all 16 diastereomers with the S configuration fixed at C3 and all possible configurations at the other four stereocenters. This library should contain macrosphelide A (1-A) and the enantiomer of macrosphelide E (ent-1-E). One of the remaining 14 isomers should be macrosphelide D (1-D), whose stereostructure was not completely assigned.
Figure 3
Fragments, tagged quasiisomers and mixture synthesis: (a) structures of the four single-isomer fragments 2; (b) tagging scheme for the fluorous mixture synthesis of the 16-quasiisomer mixture shown with the tagged quasiisomers 3a-d and 4a-d. Each isomer of 2 gets two tags (FPMB and FTIPS) with the same number of fluorine atoms; (c) fluorous mixture synthesis of final tagged quasiisomer mixture M-8a-d/a-d starting from precursors 3a-d and 4a-d. Because the substrates are four- or sixteen-compound mixtures, each of the six reactions was conducted only once. Samples with the “M” prefix are mixtures with the letter suffixes indicating the tags (as FPMB/FTIPS for the samples with both tags). In turn, the tags encode the configuration. For example, one of the sixteen components in M-8a-d/a-d is 8ab. Its FPMB group a bears C2F5 (which encodes 14R,15R) and its FTIPS group b bears C4F9 (which encodes 8S,9S).
The structures of subunits 2 and FPMB- and FTIPS-tagged pairs of quasiisomers 3 and 4 are summarized in Figure 3a,b. The four individual stereoisomeric precursors 2 are readily available in 2–5 steps by modifying the asymmetric dihydroxylation route of Omura and Smith.[20] These were each tagged with a fluorous PMB group (FPMB) and the TES group was removed in one step to give a first set of quasiisomers 3a-d.Tagging of the isomers of 2 with a fluorous silyl group (FTIPS) followed by mixing and removal of the trichloroethyl ester with zinc dust gave a second set of quasiisomers 4a-d. The same encoding was used for both groups. In other words, the (R,R)-isomer was encoded with both C2F5FPMB and FTIPS groups (“a” series), the (S,S) with C4F9 groups (“b” series), and so on (“c” and “d” series).The reaction steps of the fluorous mixture stage of the synthesis are summarized in Figure 3c. The four quasiisomers 3a-d bearing the FPMB groups were mixed and then coupled with the TBS ether of (S)-3-hydroxybutanoic acid (S-5) to give M-6a-d. The TBS group was removed and the resulting mixture was coupled with the four-component quasiisomer mixture of acids 4a-d. This gave the first 16-quasiisomer mixture M-7a-d/a-d. The TES group was removed with HCl and the 2,2,2-trichloroethyl ester was cleaved with zinc dust. The resulting hydroxy acid mixture was subjected to Yamaguchi lactonization[32] to give macrolactone M-8a-d/a-d. This 16-quasiisomer mixture exhibited a single spot on standard silica TLC analysis, so it was purified by flash chromatography to provide the sample for demixings.Table 1 summarizes the relevant information on the final mixture of tagged quasiisomers 8, including tagging pattern, configuration and HPLC fraction. After macrolactonization, the fluorous analytical HPLC trace (see Supporting Information) was more complex than the seven peaks implied by Figure 2a. In addition to the primary separation based on fluorine content, some secondary separation based on the tag structures or the structure of the macrolactone component evidently occurred. Nonetheless, this secondary separation was small relative to the tag-based separation, leaving obvious gaps between the quasiisomers of different fluorine content. In the preparative demixing of 8 we collected the first five fractions based on the fluorine content, deliberatively combining any partially resolved peaks with same fluorine content. The two components of fraction 6 were well resolved and were collected separately. Like fraction 1, fraction 7 was a single compound.
Table 1
Tagging pattern for the macrosphelide library of quasiisomers 8
double tag number
configurations
FPMB RF
FTIPS RF
total fluorine #
HPLC fraction
8aa
3S,8R,9R,14R,15R
C2F5
C2F5
10
1
8ab
3S,8S,9S,14R,15R
C2F5
C4F9
14
2
8ba
3S,8R,9R,14S,15S
C4F9
C2F5
14
2
8ac
3S,8S,9R,14R,15R
C2F5
C6F13
18
3
8bb
3S,8S,9S,14S,15S
C4F9
C4F9
18
3
8ca
3S,8R,9R,14S,15R
C6F13
C2F5
18
3
8ad
3S,8R,9S,14R,15R
C2F5
C8F17
22
4
8bc
3S,8S,9R,14S,15S
C4F9
C6F13
22
4
8cb
3S,8S,9S,14S,15R
C6F13
C4F9
22
4
8da
3S,8R,9R,14R,15S
C8F17
C2F5
22
4
8bd
3S,8R,9S,14S,15S
C4F9
C8F17
26
5
8cca
3S,8S,9R,14S,15R
C6F13
C6F13
26
5
8db
3S,8S,9S,14R,15S
C8F17
C4F9
26
5
8dc
3S,8S,9R,14R,15S
C8F17
C6F13
30
6a
8cd
3S,8R,9S,14S,15R
C6F13
C8F17
30
6b
8ddb
3S,8R,9S,14R,15S
C8F17
C8F17
34
7
precursor of ent-macrosphelide E (ent-1-E)
precursor of macrosphelide A (1-A)
A set of 19F NMR spectra confirmed that the fractions contained the expected tags. Reading the spectra of fractions 6a and 6b also identified these samples. The four single component fractions (1, 6a, 6b, 7) were directly detagged to give the first four pure isomers. The remaining four fractions had 2, 3 or 4 quasiisomers with the same fluorine content. The FTIPS group was removed from these fractions with TBAF, leaving only the FPMB group. Then these fractions were demixed into the individual components, which now have different fluorine content.This process is exemplified with the second fraction in Figure 4. The two components of fraction 2 after the first demixing each have 14 fluorine atoms. The (3S,8S,9S,14R,15R) component 8ab with the C2F5 PMB tag and the C4F9 silyl tag is mixed with its (3S,8R,9R,14S,15S)-isomer 8ba bearing the C4F9 PMB tag and the C2F5 silyl tag.
Figure 4
Double demixing illustrated with fraction 2. The first demixing produced four single-compound fractions (1, 6a, 6b (which separated even though they have the same number of fluorines) and 7) and four mixture fractions numbered 2–5. Fraction 2, for example, contains double-tagged quasiisomers 8ab and 8ba, each having 14 fluorines. Removal of the FTIPS tag provides a mixture of mono detagged adducts 9a and 9b that are readily separated in the second demixing because one (9a) has five fluorines and the other (9b) has nine.
The silyl tag was removed from this two-component mixture to provide the (3S,8S,9S,14R,15R) isomer 9a with the C2F5 PMB mixed with the (3S,8R,9R,14S,15S)-isomer 9b bearing the C4F9 PMB tag (84%). These two compounds were readily demixed by fluorous HPLC. The PMB tags were removed to give the true macrosphelide isomers in individual, pure form. The yields for PMB removal were 76% and 65%, respectively, after flash chromatography. These yields are typical of those obtained from the other 14 quasiisomers.Likewise, the other three mixed fractions were processed to give all sixteen true macrosphelide isomers (see Supplementary Figure S2). Each isomer was purified by preparative reverse phase HPLC to provide the final samples (1–2 mg each), which were fully characterized by the usual means. Copies of the 1H and 13C NMR spectra of all the isomers are contained in the Supporting Information.Despite the structural similarities, each of the 16-macrosphelide stereoisomers exhibited unique 1H (700 MHz) and 13C (175 MHz) NMR spectra. As expected, the spectra for the 3S,8R,9S,14R,15S isomer (1-A) matched those of macrosphelide A while the spectra for the 3S,8S,9R,14S,15R (ent-1-E) isomer matched those of macrosphelide E. These matches show that the double mixture synthesis succeeded, and accordingly provide proof of the principle of binary fluorous mixture synthesis with sequential tag removal. Though the principal goal was now accomplished, an unexpected problem presented itself.
Correction of the structure of macrosphelide D and confirmation of the structure of macrosphelide M
To our surprise, the spectral data reported for macrosphelide D[20] did not match any of the 16 sets of spectra from the stereoisomer library. This proves that macrosphelide D is not a stereoisomer of macrosphelides A and E. While the 1H and 13C resonances for the C1–C9 region of macrosphelide D fall within the range of those of the stereoisomer library members, there are significant differences in the C10-C15 region.The resonances H14 and H15 in the 1H NMR spectrum were especially informative (Figure 5a). In all 16 macrosphelide isomers, these fall into narrow ranges: 4.17–4.49 ppm for H14 and 4.85–5.27 ppm for H15. In contrast, at 5.05 ppm, H14 of macrosphelide D is too far downfield. And at 4.06 ppm, H15 is too far upfield. A similar problem was evident in the 13C NMR spectra. Based on these values, we hypothesized that macrosphelide D was a ring-contracted constitutional isomer of A and E with the acyl group of C1 bonded to O14 rather than O15.
Figure 5
Macrosphelide D structure candidates and syntheses: (a) analysis of key chemical shifts in 1H NMR spectra of macrosphelides A, D and E suggests four candidate structures (10-13) for macrosphelide D; (b) step-by-step summary of the synthesis of candidate 10, which proved to be macrosphelide D; (c) schematic summary of the precursors and bond connections to make candidates 11-13.
Candidate structures for macrosphelide D are shown in Figure 5a with the same carbon numbering system as the A/E series. If macrosphelide D has the same configurations at the pairs of adjacent stereocenters as macrosphelides A and E, then it could be either 10 (C3 S) or 11 (C3 R). Structure 11 is already a known natural product called macrosphelide M.[33] Unfortunately, the spectra of macrosphelide D are reported in chloroform-d1 and those of macrosphelide M are in acetone-d6, so we could not assess at this point whether the two natural samples are the same or different. However, since macrosphelides A and D were isolated from the same broth, the structure 10 seems more likely than 11.We also considered two other similar structures, 12 and 13, for macrosphelide D. Here it is C11 that is contracted down from the oxygen on C9 to the adjacent oxygen on C8. This seemed prudent since two of the hydroxy ester sub-units of these tris-lactones are so similar.The four candidates isomers 10-13 were synthesized individually from suitably protected fragments bearing standard protecting groups: TBS (t-butyldimethylsilyl) to block hydroxy groups needed for esterification, and MEM (methoxyethoxy methyl) to block free hydroxy groups. Isomer 10 proved to be macrosphelide D, and its synthesis is summarized in Figure 5b. Alcohol (S)-14 was coupled with acid 15 bearing MEM and TBS protecting groups on the oxygens of C8 and C9, respectively. Removal of the TBS group produced 16, which in turn was coupled with 17 (a constitutional isomer of 15 with the protecting groups reversed) to make the second ester at C11.Removal of the TBS group on O14 followed by reductive deesterification provided hydroxy acid 18 ready for macrolactonization. Yamaguchi reaction as above, then removal of the two MEM groups with TFA, provided target structure 10. All the reactions in this sequence worked well, except for the Yamaguchi reaction sequence, which gave only 26% yield. This yield includes the prior deesterification step; however, the macrolactonization step was clearly the problem. Nonetheless, ample 10 (8 mg) was obtained for complete characterization.Figure 5c summarizes the syntheses of the other three isomers. These sequences were generally similar to the one in Figure 5b (see Supplementary Schemes S1-S4). However, since all the suitably protected fragments were readily available, we decided to vary the order of the esterification steps. Isomer 11 was made by uniting fragments 17, 19, and R-5, first by making an ester from C11 to the oxygen on C9, then from C1 to the oxygen on C14. Finally, macrolactonization between C5 and the oxygen on C3 and deprotection provided 11. Gratifyingly, the two-step sequence of reductive deesterification and Yamaguchi macrolactonization worked in an improved 56% overall yield. Yields of the other steps were comparable to those for 10 (see Supporting Information).The bond for final macrolactonization was changed again for isomers 12 and 13. These were made by uniting fragments 17, 19, and R- or S-5, first by making an ester from C1 to the oxygen on C15, then from C5 to the oxygen on C3. Now macrolactonization between C11 and the oxygen on C8 and deprotection provided 11 and 12. The final three steps (reductive deesterification, Yamaguchi macrolactonization and removal of the MEM group) were conducted with purification of the intermediates to provide 12 and 13 in 38% and 42% overall isolated yields, respectively.We recorded 1H and 13C NMR spectra of the four candidate structures in chloroform-d1 (for comparison of the reported data for macrosphelide D) and acetone-d6 (for comparison of the reported data for macrosphelide M). These are data tabulated in Supplementary Tables S2 and S3. The first step was to compare the spectra of the new samples 10-13 to each other. As expected from the prior library spectra, the four groups of four spectra were similar. Yet each spectrum was unique, exhibiting significant differences with its three comparison spectra.Finally, by comparing these spectra to those reported, we learned 1) that the assigned structure 11 of macrosphelide M is correct, 2) that macrosphelide M and D are not the same or enantiomers, and 3) that macrosphelide D is indeed 10. So macrosphelide D and M are ring-contracted constitutional isomers of macrosphelides A and E, respectively.Translactonization can be induced by acid or base[34] and has been observed with many other hydroxy macrolactone natural products. Recent examples include apoptolidin,[35,36] dolabelide[37] and dictyostatin.[38] This raises the question of whether D/M and A/E are all natural products, or whether trans-lactonization occurs during isolation. The reported conditions for isolation of these natural products do not involve acid or base treatment, so the isomers may occur naturally. If the opportunity arises, then it would be interesting to look for the iso-D and iso-M structures 12 and 13 in extracts with macrosphelides A, D, E, and M. In addition, each constellation of stereocenters has a possible double ring contracted product in which translactonization occurs at both possible hydroxy lactone subunits.
Discussion
Binary encoding of sixteen compounds with four fluorous tags (RF) provides a unique identity for each compound but does not provide for the complete separation of the mixture based on fluorine content due to redundancies. This problem is solved by selective removal of either one of the tags after the first demixing. A second demixing of the resulting mixed fractions provides each compound individually.The principle of double tagging and double demixing has been proved by the synthesis of a library of all sixteen diastereomers of the natural products macrosphelides A and E. The core mixture stage of the synthesis (Figure 2c) required only 6 chemical reactions to make all the isomers. Contrast this to an analogous parallel synthesis approach to the stereoisomer library, which would require 72 reactions.The spectra for the natural product macrosphelide D did not match any of those of the library members. This problem was unexpected, but having the complete stereoisomer library helped us to solve it quickly. First, we knew immediately that macrosphelide D was not a stereoisomer of A and E. Working in the traditional way, if we had made one candidate stereoisomer for D and found that its spectra did not match the natural products, then we probably would have first followed by making other candidate stereoisomers. In a worst case, 14 more isomers would have had to been made serially to learn that it was the constitution and not the stereostructure that was the problem.Second, having the complete library of macrosphelide A/E spectra, we could quickly see what data fit with macrosphelide D and what did not. This led us directly to a set of four candidate isomers, one of which proved to be the natural product. Finally, since we had already learned that all the tris-lactone stereoisomers of macrosphelides A and E had unique spectra, it was unnecessary to make the complete library of D/M stereoisomers. There cannot be other stereoisomers in this library that have spectra substantially identical to those of D or M.So we set out to use the known structures of macrosphelides A and E to help prove the principle of fluorous mixture synthesis with binary tagging, then to use the other members of library to prove the stereostructure of macrosphelide D. It turned out that macrosphelide D was not a stereoisomer of A and E, yet the data from the library members still led us quickly to the correct structure.
Methods
The Supporting Information contains detailed descriptions of the synthesis and demixings, characterization data for all new individual compounds, and copies of 1H and 13C NMR spectra for the 16-stereoisomer library of 1 and the candidate stereoisomers of macrosphelide D.
Authors: M Hayashi; Y P Kim; H Hiraoka; M Natori; S Takamatsu; T Kawakubo; R Masuma; K Komiyama; S Omura Journal: J Antibiot (Tokyo) Date: 1995-12 Impact factor: 2.649
Authors: Romain Bejot; Thomas Fowler; Laurence Carroll; Sophie Boldon; Jane E Moore; Jérôme Declerck; Véronique Gouverneur Journal: Angew Chem Int Ed Engl Date: 2009 Impact factor: 15.336
Authors: Jingjing Wu; Paula Lorenzo; Siying Zhong; Muhammad Ali; Craig P Butts; Eddie L Myers; Varinder K Aggarwal Journal: Nature Date: 2017-07-26 Impact factor: 49.962