| Literature DB >> 29467735 |
Marike Palmer1, Emma T Steenkamp1, Martin P A Coetzee2, Jochen Blom3, Stephanus N Venter1.
Abstract
Bacteriologists have strived toward attaining a natural classification system based on evolutionary relationships for nearly 100 years. In the early twentieth century it was accepted that a phylogeny-based system would be the most appropriate, but in the absence of molecular data, this approach proved exceedingly difficult. Subsequent technical advances and the increasing availability of genome sequencing have allowed for the generation of robust phylogenies at all taxonomic levels. In this study, we explored the possibility of linking biological characters to higher-level taxonomic groups in bacteria by making use of whole genome sequence information. For this purpose, we specifically targeted the genus Pantoea and its four main lineages. The shared gene sets were determined for Pantoea, the four lineages within the genus, as well as its sister-genus Tatumella. This was followed by functional characterization of the gene sets using the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. In comparison to Tatumella, various traits involved in nutrient cycling were identified within Pantoea, providing evidence for increased efficacy in recycling of metabolites within the genus. Additionally, a number of traits associated with pathogenicity were identified within species often associated with opportunistic infections, with some support for adaptation toward overcoming host defenses. Some traits were also only conserved within specific lineages, potentially acquired in an ancestor to the lineage and subsequently maintained. It was also observed that the species isolated from the most diverse sources were generally the most versatile in their carbon metabolism. By investigating evolution, based on the more variable genomic regions, it may be possible to detect biologically relevant differences associated with the course of evolution and speciation.Entities:
Keywords: Enterobacteriaceae; KEGG; Pantoea; bacterial systematics; genome-inferred biology; phenotype
Year: 2018 PMID: 29467735 PMCID: PMC5808187 DOI: 10.3389/fmicb.2018.00113
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Figure 1Experimental strategy followed for the lineages within Pantoea in the study. Lineages were identified from the subtree of Pantoea from the protein sequence maximum-likelihood tree of all the shared genes of Palmer et al. (2017). Average nucleotide identity (ANI) values were used as a measure of relatedness between species of a lineage as obtained from Palmer et al. (2017). Shared gene sets were determined from the genome sequences of species within each lineage. Gene sets were then annotated with the Kyoto Encyclopedia of Genes and Genomes (KEGG), followed by BLAST verification and locus comparisons of characterized genes. Uncharacterized genes were subjected to Blast2GO analyses. A similar strategy was followed for the generic comparisons with the exception of the locus comparisons.
Genomes analyzed in the study.
| R 190 | Apple, Korea | ||
| LMG 24248T | Onion seed, South Africa | ||
| LMG 2665T | Pineapple, Brazil | ||
| 11-2 | Hypersaline lake, Hawaii | ||
| LMG 5343T | Human, USA | ||
| LMG 25383T | Infant formula, - | ||
| LMG 24534T | Human, France | ||
| LMG 2657T | Orchid, USA | ||
| LMG 24200T | Eucalyptus, Uganda | ||
| EGD-AAK13 | Soil, India | ||
| aB | Bark beetle, USA | ||
| LMG 2781T | Human, USA | ||
| LMG 25382T | Infant formula, - | ||
| LMG 26273T | Eucalyptus, Colombia | ||
| LMG 26275T | Eucalyptus, Rwanda | ||
| LMG 5345T | Human, USA | ||
| DC 283 | Maize, USA | ||
| LMG 2632T | Fox millet, India | ||
| C9-1 | Apple, USA | ||
| LMG 26277T | Eucalyptus, South Africa | ||
| At-9b | Leaf cutter ant, USA | ||
| A4 | Rafflesia flower, Malaysia | ||
| GM01 | Poplar, USA | ||
| LMG 23360T | Pineapple, Philippines | ||
| ATCC 33301T | Human, USA | ||
| NML 06-3099T | Human, Canada |
Figure 2A bar graph indicating the number of genes for each isolate, separated into genes highly conserved in the sister genera (Pantoea + Tatumella), through to genes not shared by all closely related species or unique genes. The dendrogram was inferred for the different lineages and their relationships to each other from the amino acid based topology of the core genome of Palmer et al. (2017). The length of each bar is indicative of the size of each genome analyzed (in terms of the number of genes). The different lineages are indicated with colored blocks (blue—P. ananatis lineage; green—P. agglomerans lineage; yellow—P. rodasii lineage; red—P. dispersa lineage). All genomes analyzed encoded a similar number of genes, with the genome of P. rodasii encoding the highest number of genes (~5,800) and P. eucrina encoding the least number of genes (~3,800).
Figure 3An overview of the gene sets shared between Pantoea and Tatumella and among the various lineages of Pantoea. For perspective, these are indicated relative to the genome-based species tree for Pantoea generated by Palmer et al. (2017). The lineages examined in this study are indicated with colored branches (green—P. agglomerans lineage; blue—P. ananatis lineage; yellow—P. rodasii lineage; red—P. dispersa lineage) with the representative of each lineage indicated in bold. The inner track indicates the sizes of the gene sets (in number of genes) of the lineages within Pantoea. The second track indicates the proportion of the genes annotated with the KEGG database involved in each of the functional classes (see legend). Values in the third track depicts the percentage of annotated genes involved in each functional class. The outer track indicates the size of the shared gene sets for the genera Pantoea and Tatumella, respectively.
Intergeneric differences in multi-gene pathways.
| D-ribose utilization |
Converts D-ribose to D-ribose-1-P Ribose donor during pyrimidine cycling Involved in co-factor cycling | Giorgelli et al., | ||
| D-mannitol utilization |
Converts D-mannitol to β-D-fructose-6-P Used as carbon source | Berkowitz, | ||
| Myo-inositol utilization |
Converts 2-deoxy-5-keto-D-gluconate-6-P to myo-inositol Used as carbon source | Anderson and Magasanik, | ||
| Sucrose utilization | sacA, MGAM, SI |
Degrades sucrose to ADP-glucose and D-fructose Used as carbon source | Fouet et al., | |
| Glycogen degradation |
Converts glycogen to trehalose or amylose Used as carbon source | Baecker et al., | ||
| β-oxidation |
Cleaves fatty acids cyclically into acetyl-CoA (even numbered) or succinyl-CoA (odd-numbered) Used in citric acid cycle | Schulz, | ||
| Purine metabolism |
Converts guanine to (S)-allantoin Process involved in nitrogen cycling | Kimiyoshi et al., | ||
| Creatine degradation | E3.5.3.3, E3.5.1.59 |
Converts creatine and N-carbamoylsarcosine to sarcosine | Yoshimoto et al., | |
| Histidine degradation |
Converts L-histidinol to urocanate Use of histidine as sole carbon source | Mehler and Tabor, | ||
| L-aspartate degradation |
Converts L-aspartate to nicotinate-D-ribonucleotide Process involved in co-factor cycling | Yang et al., | ||
| Glutamine ABC Transporter |
Transports glutamine from outside environment into cell through the use of ATP | Nohno et al., | ||
| Glycine/ betaine/proline ABC Transporter |
Transports glycine, betaine and proline from outside environment into cell through the use of ATP | Gowrishankar, | ||
| Glutathione ABC Transporter |
Transports glutathione from outside environment into cell through the use of ATP | Suzuki et al., | ||
| Osmoprotectant ABC Transporter |
Transports osmoprotectants from outside environment into cell through the use of ATP | Kempf and Bremer, | ||
| Taurine ABC Transporter |
Transports taurine from outside environment into cell through the use of ATP Allows uptake for subsequent degradation of taurine for sulfur cycling | Van der Ploeg et al., | ||
| L-arabinose transporter |
Transports L-arabinose from outside environment into cell through the use of ATP Required for utilization of L-arabinose as sole carbon source | Scripture et al., | ||
| Microcin C ABC Transporter |
Transports microcin C from outside environment into cell through the use of ATP Results in susceptibility to antimicrobial microcin C | Vanneste et al., | ||
| N-acetylmuramic acid PTS |
Transports and phosphorylates N-acetylmuramic acid through the use of ATP Essential for the use of N-acetylemuramic acid (cell wall component) as carbon source | Dahl et al., | ||
| N-acetyl-D-glucosamine PTS |
Transports and phosphorylates N-acetyl-D-glucosamine through the use of ATP Essential for the use of N-acetyl-D-glucosamine (cell wall component) as carbon source | Jaeger and Mayer, | ||
| Arbutin/salicin PTS |
Transports and phosphorylates arbutin and salicin through the use of ATP Essential for the use of plant derived glycosides as sole carbon source | Hall and Xu, |
PTS refers to phosphotransferase system.
Brackets indicate the number of taxa for which the specific locus is present out of all taxa in the genus.
Distribution multi-gene pathways among the Pantoea lineages.
| Polyketide sugar unit biosynthesis | dTDP-4-oxo-6-deoxy-D-glucose <-> dTDP-4-dehydro-beta-L-rhamnose | v | v | v | v | |
| dTDP-L-rhamnose <-> dDTP-4-dehydro-beta-L-rhamnose | v | v | v | v | ||
| Biosynthesis of siderophore group non-ribosomal peptides | 2,3-Dihydroxybenzoate <-> Enterochelin | v | − | v | + | |
| 2,3-Dihydroxybenzoate <-> (2S,3S)-2,3-Dihydro-2,3-dihydroxybenzoate | v | − | v | + | ||
| (2S,3S)-2,3-Dihydro-2,3-dihydroxybenzoate <-> Isochorismate | v | − | v | + | ||
| Isochorismate <-> Chorismate | v | − | v | + | ||
| Fructose and mannose metabolism | L-Rhamnulose <-> L-Rhamnose | + | v | v | + | |
| L-Rhamnulose 1-P <-> L-Rhamnulose | + | v | v | + | ||
| Glycerone-P + S-Lactaldehyde <-> L-Rhamnulose 1-P | + | v | v | + | ||
| Lysine degradation | L-Lysine <-> N6-Hydroxy-L-Lysine | − | + | − | − | |
| N6-Hydroxy-L-lysine <-> N6-Acetyl-N6-hydroxy-L-lysine | − | + | − | − | ||
| N6-Acetyl-N6-hydroxy-L-lysine <-> N2-Citryl-N6-acetyl-N6-hydroxy-L-lysine | − | + | − | − | ||
| N2-Citryl-N6-acetyl-N6-hydroxy-L-lysine + N6-Acetyl-N6-hydroxy-L-lysine <-> Aerobactin | − | + | − | − | ||
| Chloroalkane degradation | Chloroacetic acid <-> Glycolate + Hydrochloric acid | 2-haloacid DH | + | + | − | + |
| Chloroacetaldehyde <-> Chloroacetic acid | aldehyde DH | + | + | − | + | |
| Benzoate degradation | Succinyl-CoA + 3-Oxoadipate <-> Succinate + 3-Oxoadipyl-CoA | − | + | − | − | |
| 2-Oxo-2,3-dihydrofuran-5-acetate + H2O <-> 3-Oxoadipate | − | + | − | − | ||
| 3,4-Dihydroxybenzoate + Oxygen <-> 3-Carboxy-cis,cis-muconate | − | + | − | − | ||
| Pentose and glucuronate interconVersions | D-Glucuronate <-> D-fructuronate | + | + | v | + | |
| D-Galacturonate <-> D-Tagaturonate | + | + | v | + | ||
| D-Altronate <-> D-Tagaturonate | + | + | v | + | ||
| D-Altronate <-> 2-Dehydro-3-deoxy-D-gluconate | + | + | v | + | ||
The presence or absence of each pathway was verified using BLAST searches with the relevant sequences against the respective genomes.
The lineages are indicated as follows: Ag = P. agglomerans lineage, An = P. ananatis lineage, Di = P. dispersa lineage and Ro = P. rodasii lineage. The presence of genes are indicated with “+”, their absence with “−”, while “v” is used to indicate the presence in some but not all members of a lineage.
Figure 4The gene cluster involved in “Polyketide sugar unit biosynthesis” across the lineages of Pantoea. The dendrogram was inferred from the amino acid based topology of the core genome of Palmer et al. (2017). The different lineages are indicated with colored blocks. The first locus, containing rfbA, rfbB, rfbC, and rfbD (purple), was conserved in most members of Pantoea. The normalized dN-dS values for each codon position were plotted as an indication of the selective pressures upon the codons. Both rfbA and rbfB could be observed to experience mainly purifying selection (proportion of non-synonymous substitutions < proportion of synonymous substitutions), while rfbC and rfbD evolved mainly under diversifying selection (proportion of non-synonymous substitutions > proportion of synonymous substitutions). The second locus was identified in P. ananatis, P. stewartii subsp. stewartii and P. dispersa, with a partial locus present in P. stewartii subsp. indologenes (maroon). This locus lacked an rfbB gene and evolved mainly under purifying selection.
Figure 5The gene clusters involved in “Lysine degradation” and “Biosynthesis of siderophore group non-ribosomal peptides”. The dendrogram was inferred from the species tree of Palmer et al. (2017). Lineages are indicated with colored blocks. Both these clusters encode for the biosynthesis of siderophores, namely aerobactin and enterobactin, respectively. The locus required for the production of aerobactin was conserved in members of the P. ananatis lineage, while the locus required for enterobactin biosynthesis was present in most other members of Pantoea. The enterobactin biosynthesis locus was completely absent from the genomes of the members of the P. ananatis lineage, while the aerobactin locus was lacking in all other members of Pantoea. As an indication of selective pressures on the loci, the normalized dN-dS value at each codon position was plotted across the clusters.
Figure 6Differences between the lineages in processes involved in “Environmental Information Processing”. The presence (+) or absence (−) of complete (all genes required for functional system) two component systems, ABC transporters and PTSs in the genomes of the species in the main lineages within Pantoea. The dendrogram of the relationships within and between lineages were inferred from Palmer et al. (2017). The separate lineages are indicated with colored blocks.
Figure 7The top 20 biological processes (Level 3) for the P. ananatis, P. agglomerans and P. rodasii lineages and the 16 biological processes for the P. dispersa lineage, of the 182 genes annotated with Blast2GO. The dendrogram indicating the relationships between the lineages was inferred from the species tree of Palmer et al. (2017). The GO terms for the shared processes are indicated in the same color across the lineages. Processes highlighted in bold are shared by all four lineages.
Pathogenicity-associated processes with differences between the lineages.
| O-antigen biosynthesis |
Encodes for the synthesis of polysaccharide unit of O-antigen Cause variation on outer membrane of cells Assists in host immune response evasion | Stevenson et al., | ||
|
Encodes for the synthesis of the O-antigen's polysaccharide Cause variation on outer membrane of cells Assists in host immune response evasion | Stevenson et al., | |||
| Siderophore production |
Encodes for the synthesis of aerobactin Efficient at scanvenging iron during nutrient limitation Assists in resistance against iron-dependent antimicrobials | Montgomerie et al., | ||
|
Encodes for the synthesis of enterobactin High affinity for iron, but susceptible to inactivation by lipocalin-2 produced by mammalian hosts Iron acquisitioning in hosts | Fiedler et al., | |||
| Polyamine biosynthesis |
Synthesis of spermidine from S-adenosyl-L-methionine Involved in protein and nucleic acid biosynthesis, expression regulation, membrane functioning and ROS scavenging Contributes toward pathogenesis through involvement in biofilm formation, escape from phagolysosomes and toxin production | Khan et al., |
Brackets indicate the number of taxa for which the specific locus is present out of all taxa in a lineage. For complete distribution patterns see Supplementary File .
Niche-associated processes (non-pathogenic) with differences between the lineages.
| L-rhamnose utlization |
Convert L-rhamnose to S-lactaldehyde Catabolism of L-rhamnose as carbon source Potentially contributes toward niche expansion |
| Badía et al., | |
| D-galacturonate utilization |
Involved in the utilization of glucuronate and galacturonate as carbon sources Integral parts of plant cell walls Can contribute toward colonization of plants |
| Walton, | |
| Benzoate degradation |
Involved in the utilization of protochatechuate Catabolism of protochatechuate as carbon source Potentially contributes toward niche expansion |
(note, contains deletion) | Song, | |
| Chloroalkane and chloroalkene degradation | - |
Conversion of chloroacetaldehyde to glycolate Chloroacetaldehyde binds to DNA and causes conformational changes leading to mutations Removes mutagen from environment and utilize it as carbon source via glycolate |
| Young Kim et al., |
| Methionine salvage pathway |
Salvage methionine through the conversion of 5-methylthio-D-ribose to 3-(methylthio)-propanoate Allows sulfur cycling Can survive in low-sulfur environments |
| Sekowska et al., |
Brackets indicate the number of taxa for which the specific locus is present out of all taxa in a lineage. For complete distribution patterns see Supplementary File .