| Literature DB >> 29483897 |
Muriel Gondry1, Isabelle B Jacques1, Robert Thai2, Morgan Babin1, Nicolas Canu1, Jérôme Seguin1, Pascal Belin1, Jean-Luc Pernodet1, Mireille Moutiez1.
Abstract
Cyclodipeptide synthases (CDPSs) use as substrates two amino acids activated as aminoacyl-tRNAs to synthesize cyclodipeptides in secondary metabolites biosynthetic pathways. Since the first description of a CDPS in 2002, the number of putative CDPSs in databases has increased exponentially, reaching around 800 in June 2017. They are likely to be involved in numerous biosynthetic pathways but the diversity of their products is still under-explored. Here, we describe the activity of 32 new CDPSs, bringing the number of experimentally characterized CDPSs to about 100. We detect 16 new cyclodipeptides, one of which containing an arginine which has never been observed previously. This brings to 75 the number of cyclodipeptides formed by CDPSs out of the possible 210 natural ones. We also identify several consensus sequences related to the synthesis of a specific cyclodipeptide, improving the predictive model of CDPS specificity. The improved prediction method enables to propose the main product synthesized for about 80% of the CDPS sequences available in databases and opens the way for the deciphering of CDPS-dependent pathways. Analysis of phylum distribution and predicted activity for all CDPSs identified in databases shows that the experimentally characterized set is representative of the whole family. Our work also demonstrates that some cyclodipeptides, precursors of diketopiperazines with interesting pharmacological properties and previously described as being synthesized by fungal non-ribosomal peptide synthetases, can also be produced by CDPSs in bacteria.Entities:
Keywords: activity prediction; biosynthetic pathways; cyclodipeptide MS/MS; cyclodipeptide synthase; diketopiperazine; secondary metabolites; tRNA-dependent enzymes
Year: 2018 PMID: 29483897 PMCID: PMC5816076 DOI: 10.3389/fmicb.2018.00046
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Cyclodipeptides produced by the newly characterized CDPSs.
| 62 | 2,650 | cFM | |||
| 63 | 14,802 | cWW | |||
| 64 | 9,573 | cWW | |||
| 65 | – | ||||
| 66 | 2,729 | cFY | |||
| 67 | 1,969 | cFY | |||
| 68 | 11,431 | cLW | |||
| 69 | 11,709 | cWY | |||
| 70 | 18,457 | cWW | |||
| 71 | 4,667 | cWY | |||
| 72 | – | ||||
| 73 | cWA | 138 | |||
| 74 | 15,655 | ||||
| 75 | 7,616 | ||||
| 76 | – | ||||
| 77 | 1,715 | cLV | |||
| 78 | 51 | cFG | |||
| 79 | – | ||||
| 80 | – | ||||
| 81 | – | ||||
| 82 | 5,945 | cLP | |||
| 83 | 140 | cPR | |||
| 84 | 54 | cFL | |||
| 85 | 4,220 | cLI | |||
| 86 | 926 | cLL | |||
| 87 | 3,121 | cGE | |||
| 88 | – | ||||
| 89 | 3,454 | ||||
| 90 | cHP | 688 | cHP | ||
| 91 | 2,786 | cFF | |||
| 92 | – | ||||
| 93 | – | ||||
| 94 | 12,312 | cYY | |||
| 95 | 11,078 | cYY | |||
| 96 | 10,901 | cYY | |||
| 97 | 10,095 | cYY | |||
| 98 | 6,265 | cPM | |||
| 99 | – | ||||
| 100 | 1,871 | ||||
| 101 | 3,406 | cYY | |||
| 102 | 596 | ||||
| 103 | 16,190 | cFL |
Cyclodipeptides produced by BL21 AI-pREP4 expressing the recombinant CDPS; cyclodipeptides are ranked according to the peak area on UV chromatograms recorded at 214 nm (see also Supplemental Data Set .
Quantities of cyclodipeptide produced are given in gray when standard are available for establishing of calibration curves (see also Supplemental Data Set .
cLI quantities were estimated using the calibration curve for cLL.
Figure 1Current diversity of the cyclodipeptides produced by the CDPS family. The amino acids forming the cyclodipeptide ring are indicated in the one-letter code at the top and right. Previously identified cyclodipeptides are in blue, cyclodipeptides identified with the present set of CDPSs are in orange. Light colors indicate cyclodipeptides detected in low amounts (UV peak area at 214 nm less than 200). The distribution of cyclodipeptides according to NYH or XYP subfamilies is shown in Figure S1.
Sequence motifs of the pockets P1 and P2 of characterized CDPSs, colored according to the aminoacyl mainly accommodated (inferred from sequence alignments and proximity with characterized CDPSs or enzymes synthesizing homo-cyclodipeptides).
| Y | VGITMLFN | Y | LRFDQLP | |
| Y | VGVTMLFN | Y | LAYSQLP | |
| Y | VGITMLFN | Y | LAFSQLP | |
| Y | VGVTMLFN | Y | LSFEQLP | |
| F | VGITMFFV | Y | MHFGVTP | |
| F | LGVAIFLC | Y | HFVGKLP | |
| F | LGVVLFFT | F/M | QDFDKMP | |
| F | LGLVLFLT | L | QALEKMP | |
| W | LGVALFLS | L | MSFGMAQ | |
| W | LGVALFFA | W | MLHGVMP | |
| W | LGVALFFA | W | MGFAEPL | |
| W | LGVALFFH | W | MQFKTLA | |
| W | LGIALFFS | Y | MLHGLTP | |
| W | LGVALFFS | Y | MTFPQIP | |
| W | LGIPLFFV | P/A | VRARAGF | |
| W | LGIPLFFV | P | ARAVMVP | |
| W | LGVPLLLA | P | ASVGKID | |
| P | CGHPWLLY | P | TDVHRIP | |
| P | LGFPWFMF | L | LKAGQFE | |
| P | VGLPWFFF | R | ATAGRIS | |
| P | LGLPWLFF | M | NQCNSFR | |
| L | LGFPWLVF | V/L/I | TGARRMD | |
| F/G ? | LGLAWIGC | F/G ? | VAYHRLA | |
| F/I ? | LGLPWFLF | F/I ? | IRVTRSP | |
| C | VGYPSIFF | G/C | SAIKDPT | |
| L/I ? | FGLGCVFM | L/I ? | FRYRGFS | |
| L/I ? | FGLGCIFF | L/I ? | FSYRGFG | |
| G | LFVAWIVI | E | NTYRKLT | |
| A/P ? | FAYVHAVN | A/P ? | FFYRGID | |
| H | GAYAWELY | P/E | FEWKYTR | |
| F | FGLAWSLL | F/Y | LNYSTYA | |
| Y | FPLCWISE | Y | LKLRREE | |
Orange: Y, blue: F, green: W, yellow: P, violet: L. When accommodation in P1 or P2 could not be deduced from existing data, amino acyl accommodated are indicated in both pockets and signaled by a question mark. Groups of CDPSs phylogenetically close are framed. NYH CDPSs previously grouped in the same specificity group than AlbC are labeled in blue.
Figure 2Phylogenetic tree of known and putative CDPSs (June 2017) (see also Supplemental Data Set 6 for a better resolution and aLRT values). The CDPSs experimentally characterized prior to this study are labeled with blue dots and CDPS characterized within this study with pink dots (dark for active ones, light for inactive ones). Branches corresponding to NYH CDPSs are black; those corresponding to XYP ones are gray. Labels of groups with predicted activity are colored respectively in green (cLL), blue (cCC), light pink (cWW), orange (cPX), dark pink (cW1X), gray (cW2X), red (cFX), violet (cYY) for NYH CDPSs, and light blue (cAE), brown (cLE), dark blue (cGE), yellow (cAA), light green (cGG), and fushia (cLI/cLL) for XYP CDPSs. The arrow indicates the CDPS 17, previously suspected to belong to a third CDPS subfamily (SYQ). See also Supplemental Data Set 5 for the sequence alignment of the whole set of CDPSs.
Specificity groups identified and their consensus motifs of P1 and P2 determined from characterized CDPSs.
Specificity groups are named according to the main cyclodipeptide synthesized; the family NYH or XYP of the CDPSs forming the group is indicated. The numbers in brackets (.
The consensus motif for cXE was obtained using cAE, cLE and cGE groups of CDPSs, that for cAX was obtained using cAE and cAA groups.
Figure 3Predicted activities of the 765-CDPS set. The different specificities predicted are colored using the same code as for the phylogenetic tree (Figure 2). “Characterized” refers to enzymes experimentally characterized but not belonging to a predictable group.
Figure 4Phyla distribution of putative (A) and characterized (B) CDPSs and according to their belonging to NYH (C) or XYP (D) family.