| Literature DB >> 24069909 |
Monique Royer1, Ralf Koebnik, Mélanie Marguerettaz, Valérie Barbe, Guillaume P Robin, Chrystelle Brin, Sébastien Carrere, Camila Gomez, Manuela Hügelland, Ginka H Völler, Julie Noëll, Isabelle Pieretti, Saskia Rausch, Valérie Verdier, Stéphane Poussier, Philippe Rott, Roderich D Süssmuth, Stéphane Cociancich.
Abstract
BACKGROUND: Various bacteria can use non-ribosomal peptide synthesis (NRPS) to produce peptides or other small molecules. Conserved features within the NRPS machinery allow the type, and sometimes even the structure, of the synthesized polypeptide to be predicted. Thus, bacterial genome mining via in silico analyses of NRPS genes offers an attractive opportunity to uncover new bioactive non-ribosomally synthesized peptides. Xanthomonas is a large genus of Gram-negative bacteria that cause disease in hundreds of plant species. To date, the only known small molecule synthesized by NRPS in this genus is albicidin produced by Xanthomonas albilineans. This study aims to estimate the biosynthetic potential of Xanthomonas spp. by in silico analyses of NRPS genes with unknown function recently identified in the sequenced genomes of X. albilineans and related species of Xanthomonas.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24069909 PMCID: PMC3849588 DOI: 10.1186/1471-2164-14-658
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Physical map of the genomic region containing the XaPPTase gene in strains and of the corresponding region in other sequenced species of . White arrows: gene vipA (probable UDP-glucose/GDP-mannose dehydrogenase gene) and gene pgsA (probable CDP-diacylglycerol-glycerol-3-phosphate-3-phosphatidyltransferase gene). Dark grey arrows: XaPPTase gene (encoding a 4′-phosphopantetheinyl transferase active on peptidyl carrier protein domains). Light grey arrows: NRPS (nonribosomal peptide synthesis) genes. Coloured arrows correspond to genes described in Figures 3 and 4. Hp: hypothetical protein gene (ortholog of accession Dd703_3065 of strain Ech703 of Dickeya dadantii). TA: transaminase gene (ortholog of accession Dd703_3064 of strain Ech703 of D. dadantii). DH: lactate dehydrogenase gene. DaT: gene involved in biosynthesis of 2,4-diaminobutyric acid (Dab). Length of arrows is not proportional to the length of genes. Salmon-coloured oval circle: Eight genes present in contig 112.1 of X. oryzae strain X11-5A that are conserved in other strains of X. oryzae and are not predicted to be involved in NRPS biosynthesis. : tRNA. Orientation of the tag indicates the orientation of the tRNA gene in the genomic regions. Amino acid specificity of each tRNA is indicated above or below each tag according to the orientation of the tRNA gene. ???? : undetermined sequence located between contigs. Two contigs separated by this tag may be located in two different genomic regions (they are not necessarily contiguous). : this tag indicates that corresponding genomic regions are not contiguous. Data on strain CFBP4834-R of X. axonopodis pv. phaseoli were obtained from an unpublished finished genome sequence (M.-A. Jacques, personal communication).
Figure 3Representation of NRPS sequences of strains BAI3 and X11-5A of pv. A. Representation of the NRPS locus META-B of X. oryzae pv. oryzae strain BAI3. B. Representation of contigs of X. oryzae pv. oryzae strain X11-5A that contain a nucleotide sequence encoding at least one A domain and/or a nucleotide sequence encoding non-NRPS genes similar to genes found in the locus META-B of strain X. albilineans GPE PC73. ???? : undetermined sequence located between contigs. Two contigs separated by this tag may be located in two different genomic regions (they are not necessarily contiguous). Brown arrows: staM and staM’ genes predicted to be required for β-hydroxylation of two unidentified amino acids, respectively. Purple arrow: daT gene predicted to be required for biosynthesis of Dab (2,4-diamino butyric acid). Orange arrows: genes predicted to be required for biosynthesis of Dpg (3,5-dihydroxyphenyl-glycine). Dark blue arrows: other genes conserved in all META-B loci (ABC transporter, MbtH-like protein and transcriptional regulator AraC). Green arrows and yellow arrow: genes conserved in strains X11-5A and XaS3 predicted to be required for biosynthesis of an unknown non-proteinogenic amino acid (Hp: hypothetical protein gene, TA: transaminase gene, DH: lactate dehydrogenase gene). Light blue arrow: syrP gene predicted to be required for β-hydroxylation of an unknown amino acid. Black arrows: NRPS genes. Length of arrows is not proportional to the length of genes. All information regarding characteristics and color-code for NRPS modules are as detailed in Figure 2. IS: insertion sequence. FS: frameshift mutation. Dotted box or dotted arrow: pseudogene. Dotted circle: non-functional module.
Figure 4Representation of NRPS sequences of spp. strain XaS3. A. Representation of contigs of Xanthomonas spp. strain XaS3 that contain a nucleotide sequence encoding at least one A domain and/or a nucleotide sequence encoding non-NRPS genes similar to genes found in the locus META-B of strain X. albilineans GPE PC73. B. Comparison of the locus META-A of X. albilineans strain GPE PC73 with the two contigs of Xanthomonas spp. strain XaS3 predicted to belong to a locus similar to this locus META-A. Brown arrows: staM and staM’ genes predicted to be required for β-hydroxylation of two unidentified amino acids, respectively. Purple arrow: daT gene predicted to be required for biosynthesis of Dab (2,4-diamino butyric acid). Orange arrows: genes predicted to be required for biosynthesis of Dpg (3,5-dihydroxyphenyl-glycine). Dark blue arrows: other genes conserved in all META-B loci (ABC transporter, MbtH-like protein and transcriptional regulator AraC). Green arrows and yellow arrow: genes conserved in strains X11-5A and XaS3 predicted to be required for biosynthesis of an unknown non-proteinogenic amino acid (Hp: hypothetical protein gene; TA: transaminase gene; DH: lactate dehydrogenase gene). Black arrows: NRPS genes. Length of arrows is not proportional to the length of genes. All information regarding characteristics and color-code for NRPS modules are as detailed in Figure 2. IS: insertion sequence. XALB1: albicidin biosynthesis gene cluster. Incomplete circles: incomplete nucleotide sequence located at the end or at the beginning of a contig encoding only one part of a module.
Similarity between XaPPTase and other PPTases involved in NRPS and fatty acid biosynthesis in bacteria (from [25])
| Albicidin and unknown | XabA | Pc | G | --(x)39-- | - | ||
| AlbXXI | ? | G | --(x)39-- | 100% /- | |||
| XaPPTase | ? | G | --(x)39-- | 94% / 95% | |||
| XaPPTase | ? | G | --(x)39-- | 83% / 87% | |||
| Unknown | XaPPTase | All sequenced | ? | G | --(x)38-- | 40% / 51% | |
| Unknown | XaPPTase | ? | G | --(x)38-- | 41% / 55% | ||
| Unknown | BBta_3710 | ? | AL | --(x)35-- | 25% / 36% | ||
| Enterobactin | EntD | Pd | P | --(x)36-- | 23% / 31% | ||
| Mycobactin | PptT | Pe | S | --(x)35-- | 25% / 34% | ||
| Gramicidin | Gsp | Pd | P | --(x)35-- | 14% / 21% | ||
| Surfactin | Sfp | A/Pf | P | --(x)35-- | 17% / 25% | ||
| Fatty acids | AcpS | Ad | GL | --(x)40-- | 9% / 14% | ||
| Fatty acids | AcpT | ? | E | --(x)34-- | 13% / 22% | ||
| Fatty acids | AcpS | Ae | G | --(x)41-- | 11% / 17% | ||
| Unknown | HetI | ? | RL | --(x)37-- | 25% / 31% | ||
| Unknown | HetI | ? | RL | --(x)37-- | 22% / 28% | ||
| Unknown | HetI | All sequenced | ? | RL | --(x)37 -- | 20% / 25% | |
| Consensus | (V/I)G(I/V)D** | (F/W)(S/C/TxKE(S/A)xxK** |
a str. = strain; substr. = substrain.
b A, specific for acyl carrier protein (ACP) domains; P, specific for peptidyl carrier protein (PCP) domains.
c[25].
d[12].
e[15].
f[36].
*: XaPPTase gene from strain PXO99A was mis-annotated (correct start codon is upstream of the one deposited in GenBank).
**: Consensus signature sequences of PPTase domains as defined by [14,15].
Figure 2Representation of the NRPS loci META-A, META-B and META-C of strain GPE PC73. Brown arrow: staM gene predicted to be required for β−hydroxylation of an unidentified amino acid. Purple arrow: daT gene predicted to be required for biosynthesis of Dab (2,4-diamino butyric acid). Orange arrows: genes predicted to be required for biosynthesis of Dpg (3,5-dihydroxyphenylglycine). Dark blue arrows: other genes conserved in all META-B loci (ABC transporter, MbtH-like protein and transcriptional regulator AraC). Black arrows: NRPS genes. Length of arrows is not proportional to the length of genes. NRPS modules are represented by circles. Large circles indicate complete NRPS modules (containing domains C, A and PCP). The amino acid predicted to be assembled by the corresponding module is indicated within each circle. ?X: unknown amino acid specific to the NRPS signature UnknownX (Additional file 1). Brown circles: NRPS modules predicted to be specific for tyrosine. Purple circles: NRPS modules predicted to be specific to Dab. Orange circles: NRPS modules predicted to be specific to Dpg. The sequential order of amino acid incorporation from the N- to the C-terminus is illustrated by the number above each circle. Small black circles represent TE domains. The starter module circled in red exhibits a C-domain belonging to the same phylogenetic clade as starter C-domains identified by Rausch et al. [6] (Additional file 2). Starter modules circled in green indicate a C-domain belonging to the distant phylogenetic clade containing starter C-domains of loci META-A and META-C (Additional file 2). Elongation modules circled in blue indicate a C-domain belonging to the same phylogenetic clade as dual C/E domains identified by Rausch et al. [6] (Additional file 2). Amino acids predicted to be epimerized by these domains are in blue. IS: insertion sequence. Dotted box: pseudogene.
Similarities between genes predicted to be required for biosynthesis of Dpg in strain GPE PC73 of and genes previously characterized from strain DSM 5908 of [42]and from strain NRRL15009 of [40]
| XALc_1059 | DpgA (CAC48378) | 55% / 69% | DpgA (AAM80548) | 54% / 67% | Type III chalcone synthase, generates 3,5-dihydroxyphenylacetyl-CoA from four malonyl-CoA |
| XALc_1060 | DpgB (CAC48379) | 31% / 46% | DpgB (AAM80547) | 29% / 45% | Belongs to crotonase/Enoyl-CoA hydratase superfamily, enhances DpgA activity |
| XALc_1061 | DpgC (CAC48380) | 48% / 62% | DpgC (AAM80546) | 48% / 62% | Metal- and cofactor-free 3,5-dihydroxyphenylacetyl-CoA 1,2-dioxygenase, converts 3,5-dihydroxyphenylacetyl-CoA into 3,5-dihydroxyphenylglyoxylate* |
| XALc_1062 | HpgT (CAC48367) | 43% / 58% | HpgT (AAM80549) | 42% / 58% | Transaminase, generates 3,5-dihydroxyphenyl-glycine from 3,5-dihydroxyphenylglyoxylate |
| XALc_1063 | DpgD (CAC48381) | 62% / 74% | DpgD (AAM80545) | 61% / 74% | Belongs to crotonase/Enoyl-CoA hydratase superfamily, enhances DpgA activity |
* This function was characterized for biosynthesis of vancomycin [43].
Similarities between NRPS-associated genes present in loci META-B of spp. XaS3, strain GPE PC73 and pv. strains BAI3 and X11-5A
| StaM | β-hydroxylation | 98% / 99% | 90% / 95% | 90% / 94% |
| StaM’ | β-hydroxylation | Not encoded | Not encoded | 87% / 92% |
| MbtH | Assembly of amino acids by NRPS | 95% / 96% | 94% / 96% | 94% / 96% |
| ABC | Secretion of the synthesized peptides | 91% / 93% | 78% / 83% | 77% / 81% |
| DpgD | Biosynthesis of Dpg | 94% / 96% | 86% / 90% | 86% / 90% |
| HpgT | Biosynthesis of Dpg | 90% / 93% | 80% / 88% | 83% / 91% |
| DpgC | Biosynthesis of Dpg | 92% / 95% | 80% / 87% | 81% / 88% |
| DpgB | Biosynthesis of Dpg | 78% / 86% | 69% / 79% | 69% / 79% |
| DpgA | Biosynthesis of Dpg | 92% / 94% | 85% / 89% | 86% / 90% |
| AraC | Transcriptional regulation | 93% / 96% | 57% / 72% | 57% / 71% |
| DaT | Biosynthesis of Dab | 95% / 97% | Not encoded | 76% / 84% |
| Hypothetical protein | Unknown | Not encoded | Not encoded | 76% / 84% |
| Transaminase | Unknown | Not encoded | Not encoded | 74% / 82% |
| Lactate dehydrogenase | Unknown | Not encoded | Not encoded | 79% / 89% |
Dab: 2,4-diamino butyric acid; Dpg: 3,5-dihydroxyphenyl-glycine.
Figure 5Representation of the NRPS locus present in the same region as XaPPTase in pv. strain BLS256. Comparison with the similar genomic region of X. axonopodis pv. citri strain 306. NRPS modules are represented by circles. Large circles indicate complete NRPS modules (containing domains C, A and PCP). The amino acid predicted to be assembled by the corresponding module is indicated within each circle. Small black circles represent TE domains. Black arrows: NRPS genes. The starter module circled in red exhibits a C-domain belonging to the same phylogenetic clade as starter C-domains identified by Rausch et al. [6] (Additional file 2). Elongation modules circled in blue exhibit a C-domain belonging to the same phylogenetic clade as dual C/E domains identified by Rausch et al. [6] (Additional file 2). Amino acids predicted to be epimerized by these domains are in blue. : tRNA. Orientation of the tag indicates the orientation of the tRNA gene in the genomic regions. Amino acid specificity of each tRNA is indicated above or below each tag according to the orientation of the tRNA gene. IS: insertion sequence. Dotted arrows: pseudogenes. Dotted and incomplete circles: incomplete and non-functional modules.
predicted data on non-ribosomally synthesized peptides potentially produced by strains
| - Lipopeptide synthesized by META-B with the amino acid sequence Gly-Unk2-Thr-D-Asn-Tyr-Asn-D-Tyr-Ser-D-Asn-Asp-Asn-Dpg-Dpg-Ser-D-Thr-Unk2 | - Glycine linked to a β-hydroxy-carboxylic acid and a glycosyl residue | |
| - Unknown amino acid linked to a β-hydroxycarboxylic acid and a glycosyl residue | ||
| - Lipopeptide synthesized by META-A with the amino acid sequence Dab-Ser-Ser-Thr-D-Asn-Val-D-Phe-Dab-Dpg-Unk3-Unk2-Tyr | ||
| - Lipopeptide synthesized by META-C with the amino acid sequence Unk4-Phe-Dab-Asn-Ala-D-Unk3-Gln | ||
| - Two lipopeptides synthesized by META-B and META-A, respectively.** | - Glycine linked to a β-hydroxy-carboxylic acid and a glycosyl residue | |
| -Lipopeptide synthesized by META-B with the amino acid sequence Gly-Unk7-Unk2-Thr-Asn-D-Thr-Tyr-D-Ile-Ser-Asn-D-Asn-Ser-Dpg-Dpg-Unk6-D-Dpg-Unk4 | | |
| - Two lipopeptides synthesized by META-B.** | | |
| | - A cyclolipopeptide with the amino acid sequence D-Asn-D-Val-D-Leu-D-Thr-D-Leu-Val | |
| No precise data on the number and nature of the non-ribosomally synthesized peptides produced by this strain because of the lack of information (only draft genome sequence including several unassembled contigs that contain NRPS sequences) | ||
Unk: Unknown amino acid; Dab: 2,4-diamino butyric acid; Dpg: 3,5-dihydroxyphenyl-glycine.
* The number and nature of amino acid(s) β-hydroxylated by proteins encoded by staM, staM’ or syrP remain unidentified.
** Only partial amino acid sequences of these lipopeptides are available due to insufficient information (contigs containing gene clusters META-B or META-A are unassembled).