| Literature DB >> 29090689 |
E Oueis1, H Stevenson, M Jaspars, N J Westwood, J H Naismith.
Abstract
Biocatalysis is a fast developing field in which an enzyme's natural capabilities are harnessed or engineered for synthetic chemistry. The enzyme PatG is an extremely promiscuous macrocyclase enzyme tolerating both non-natural amino acids and non-amino acids within the substrate. It does, however, require a proline or thiazoline at the C-terminal position of the core peptide which means the final product must contain this group. Here, we show guided by structural insight we have identified two synthetic routes, triazole and a double cysteine, that circumvent this requirement. With the triazole, we show PatGmac can macrocyclise substrates that do not contain any amino acids in the final product.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29090689 PMCID: PMC5708355 DOI: 10.1039/c7cc06550g
Source DB: PubMed Journal: Chem Commun (Camb) ISSN: 1359-7345 Impact factor: 6.222
Fig. 1PatG substrates: (A) accepted substitutions within final product are shown in blue, but so far require proline or thiazoline (orange). (B) Triazole and double cysteine bypass this requirement. (C) Macrocycles synthesized with PatGmac so far.16d,e
Fig. 2Left: Hydrogen bonding between the side chain of Asn717 and the carbonyls of substrate. We conclude these hydrogen bonds are critical in substrate recognition and explain the requirement for an l-configured non-glycine residue at position residues 7 of substrate (Phe is shown from PDB ; 4AKT). Right: PatGmac macrocyclisation substrate requirements: a heterocycle before AYD adopting a cis- or cis-like conformation, a side chain (R ≥ Me) preceding the heterocycle. We conclude two substrate carbonyls (coloured green & pink) hydrogen bonded with Asn717 are critical.
Synthetic peptides 1–26, the resulting PatGmac reaction outcome, and isolated macrocyclic peptides and their yields
| Peptide | Substrate sequence | Product | Cyclic | Yield |
|
|
| NR | ||
|
|
| NR | ||
|
|
| NR | ||
|
|
| NR | ||
|
|
| NR | ||
|
|
| NR | ||
|
|
|
| ||
|
|
|
|
| 32 |
|
|
| NR | ||
|
|
| NR | ||
|
|
|
|
| 40 |
|
| NR | |||
|
|
| NR | ||
|
|
| NR | ||
|
| NR | |||
|
|
|
|
| 40 |
|
|
|
| ||
|
|
|
|
| 36 |
|
|
|
|
| 34 |
|
|
|
| ||
|
|
|
|
| 37 |
|
|
| L | ||
|
|
| NR | ||
|
|
| L | ||
|
|
| NR | ||
|
|
|
| ||
|
|
|
| ||
|
|
|
|
Underlined is the minimal recognition sequence AYD where the peptide is cleaved. Tz = 1,4-anti-triazole, Tz= 1,5-syn-triazole, βA = β-alanine, β2-homoA = (R)-3-amino-2-methylpropanoic acid, d-amino acids in lower case, [CC] = disulphide bond.
NR = No reaction, C = Cyclic, L = Linear; major product shown in table, minor product shown in brackets.
Isolated macrocycles.
Their yields.
Fig. 3Non-proline containing macrocycles macrocyclised by PatGmac.