| Literature DB >> 34938279 |
Nitin K Singh1, Céline Lavire2, Joseph Nesme3, Ludovic Vial2, Xavier Nesme2, Christopher E Mason4, Florent Lassalle5, Kasthuri Venkateswaran1.
Abstract
Strains of Agrobacterium genomospecies 3 (i.e., genomovar G3 of the Agrobacterium tumefaciens species complex) have been previously isolated from diverse environments, including in association with plant roots, with algae, as part of a lignocellulose degrading community, from a hospital environment, as a human opportunistic pathogen, or as reported in this study, from a surface within the International Space Station. Polyphasic taxonomic methods revealed the relationship of Agrobacterium G3 strains to other Agrobacterium spp., which supports the description of a novel species. The G3 strains tested (n = 9) were phenotypically distinguishable among the strains from other genomospecies of the genus Agrobacterium. Phylogenetic analyses of the 16S rRNA gene, gyrB gene, multi-locus sequence analysis, and 1,089-gene core-genome gene concatenate concur that tested G3 strains belong to the Agrobacterium genus and they form a clade distinct from other validly described Agrobacterium species. The distinctiveness of this clade was confirmed by average nucleotide identity (ANI) and in silico digital DNA-DNA hybridization (dDDH) comparisons between the G3 tested strains and all known Agrobacterium species type strains, since obtained values were considerably below the 95% (ANI) and 70% (dDDH) thresholds used for the species delineation. According to the core-genome phylogeny and ANI comparisons, the closest relatives of G3 strains were Agrobacterium sp. strains UGM030330-04 and K599, members of a novel genomospecies we propose to call genomovar G21. Using this polyphasic approach, we characterized the phenotypic and genotypic synapomorphies of Agrobacterium G3, showing it is a bona fide bacterial species, well separated from previously named Agrobacterium species or other recognized genomic species. We thus propose the name Agrobacterium tomkonis for this species previously referred to as Agrobacterium genomospecies 3. The type strain of A. tomkonis is IIF1SW-B1T (= LMG 32164 = NRRL B-65602). Comparative genomic analysis show A. tomkonis strains have species-specific genes associated with secretion of secondary metabolites, including an exopolysaccharide and putative adhesins and resistance to copper. A. tomkonis specific gene functions notably relate to surface adhesion and could be involved to colonize nutrient-poor and harsh habitats. The A. tomkonis strains from the ISS showed presence of a 40-kbp plasmid and several other potential mobile genetic elements detected that could also be part of conjugative elements or integrated prophages.Entities:
Keywords: Agrobacterium tomkonis; ISS; MLSA; genomovar G3; metagenomics; phylogenomics
Year: 2021 PMID: 34938279 PMCID: PMC8685578 DOI: 10.3389/fmicb.2021.765943
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Summary of the draft whole-genome sequences of several Agrobacterium tomkonis strains.
|
|
|
|
|
|
|
| NCBI WGS accession |
|
|
|
|
|
| # contigs (> = 0 bp) | 137 | 154 | 150 | 44 | 55 |
| # contigs (> = 1000 bp) | 76 | 81 | 80 | 35 | 38 |
| Total length (> = 1000 bp) | 6,234,019 | 6,273,081 | 6,273,787 | 5,480,209 | 5,440,309 |
| Largest contig | 644,466 | 644,466 | 726,576 | 1,342,491 | 928,660 |
| Total length | 6,240,486 | 6,282,603 | 6,282,746 | 5,483,735 | 5,445,056 |
| GC (%) | 59.18 | 59.17 | 59.17 | 59.12 | 59.21 |
| N50 | 350,004 | 350,004 | 354,854 | 378,069 | 320,860 |
| # of genes | 5,917 | 5,962 | 5,964 | 5,223 | 5,147 |
| # CDS | 5,863 | 5,908 | 5,910 | 5,175 | 5,098 |
FIGURE 1Phylogeny of 40 Agrobacterium spp. strains based on 16S rRNA gene, gyrB, and MLSA. Conserved marker gene phylogenies of 40 Agrobacterium distinct strains (41 when including the two published genome versions for strain CFBP 6623. (A) 16S rRNA gene phylogeny, based on 1,602 aligned positions), rooted with sequence from Rhizobium leguminosarum USDA 2370T; (B) gyrB gene phylogeny, based on 2,477 aligned positions; (C) multi-locus sequence analysis phylogeny, based on the concatenated alignments of genes parE, gyrB, recA, and rpoB, resulting into 9,968 aligned positions. All gene sequences were extracted from the 41 studied and then aligned with Clustal Omega (for coding genes, alignment was performed at the protein level and then reverse-translated into codons). Maximum-likelihood trees were inferred with RAxML-NG 1.0.0 under the model GTR + FO + G4m, taking the best of 20 independent inferences, started with 10 random trees and 10 parsimony-optimized tree. Branch supports were estimated with 200 Felsenstein bootstrap trees. Only branch support over 70% are displayed; full information on the trees are available at https://doi.org/10.6084/m9.figshare.14792148; https://doi.org/10.6084/m9.figshare.14792169; https://doi.org/10.6084/m9.figshare.14792100.
FIGURE 2Phylogeny of 40 Agrobacterium spp. strains based on the concatenation of 1,089 core-genome genes. Maximum-likelihood phylogenetic tree of 40 Agrobacterium distinct strains (41 when including the two published genome versions for strain CFBP 6623) obtained using the bioinformatic pipeline Pantagruel (data available at https://doi.org/10.6084/m9.figshare.14792178). The tree was computed with RAxML 8.1.2 based on the concatenation of 1,089 core-genome gene alignments under the GTRCATX model, with 200 rapid bootstraps and rooted with RAxML tree-balance algorithm. All branch supports are 100%, unless displayed.
List of Agrobacterium genomes used in this study and Average Nucleotide Identity (ANI), Average Amino acid Identity (AAI), and digital DNA-DNA Hybridization (dDDH) values of A. tomkonis IIF1SW-B1T compared with all other tested Agrobacterium strains.
|
|
|
|
| ||
|
|
|
| |||
|
| 100.00 | 100.00 | 100.00 | 3 | |
|
| 100.00 | 100.00 | 100.00 | 3 | |
|
| 100.00 | 100.00 | 100.00 | 3 | |
|
| 98.08 | 97.62 | 84.90 | 3 | |
|
| 97.74 | 97.25 | 83.60 | 3 | |
|
| 97.82 | 97.77 | 83.50 | 3 | |
|
| 97.64 | 97.73 | 83.90 | 3 | |
|
| 98.00 | 98.09 | 85.20 | 3 | |
|
| 98.08 | 98.15 | 85.40 | 3 | |
|
| 98.12 | 97.61 | 84.80 | 3 | |
|
| 90.58 | 93.91 | 41.70 | 21 | |
|
| 90.51 | 93.39 | 41.50 | 21 | |
|
| 88.63 | 91.71 | 36.70 | 7 | |
|
| 88.59 | 92.49 | 36.20 | 13 | |
|
| 88.66 | 92.25 | 36.30 | 7 | |
|
| 88.56 | 91.77 | 36.30 | 7 | |
|
| 88.11 | 91.62 | 35.40 | 5 | |
|
| 87.97 | 90.95 | 35.40 | 4 | |
|
| 87.79 | 91.06 | 35.30 | 4 | |
|
| 87.54 | 90.09 | 34.50 | 6 | |
|
| 87.39 | 90.88 | 33.90 | 8 | |
|
| 87.41 | 90.68 | 33.70 | 8 | |
|
| 87.76 | 91.98 | 34.20 | 9 | |
|
| 87.55 | 91.18 | 33.90 | 9 | |
|
| 87.04 | 89.42 | 33.70 |
| |
|
| 87.21 | 89.99 | 33.60 | 14 | |
|
| 86.72 | 90.34 | 32.30 | 1 | |
|
| 86.66 | 89.97 | 32.20 | 1 | |
|
| 86.60 | 89.84 | 32.30 | 1 | |
|
| 86.37 | 90.58 | 31.90 | 2 | |
|
| 86.51 | 89.89 | 32.10 | 1 | |
|
| 86.51 | 90.59 | 31.80 | 2 | |
|
| 78.42 | 77.11 | 21.80 | – | |
|
| 77.93 | 75.37 | 21.30 | – | |
|
| 77.90 | 76.96 | 21.30 | – | |
|
| 77.96 | 75.98 | 21.10 | – | |
|
| 77.42 | 75.84 | 21.00 | – | |
|
| 77.81 | 76.35 | 21.20 | – | |
|
| 77.67 | 76.62 | 21.10 | – | |
|
| 77.51 | 76.47 | 21.00 | – | |
|
| 76.04 | 66.57 | 20.60 | – | |
*
Differential biochemical characteristics between A. tomkonis and closely related species.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
| D-Raffinose | + | + | + | + | + | + | − |
| α-D-Lactose | + | + | + | + | + | + | − |
| Acetic Acid | + | + | + | + | + | + | − |
| L-Arginine | + | + | − | − | − | + | nd |
| Methyl Pyruvate | + | w | + | − | w | w | + |
| N-Acetyl-D-Galactosamine | + | + | − | w | w | + | − |
| D-Galacturonic Acid | − | + | + | + | + | + | − |
| D-Saccharic Acid | − | − | − | − | − | − | + |
| D-Glucuronic Acid | w | w | + | + | + | + | − |
| L-Lactic Acid | w | + | − | + | + | + | + |
| D-Malic Acid | w | + | − | w | w | + | nd |
| Bromosuccinic Acid | w | w | − | − | − | − | + |
| Acetoacetic Acid | w | w | + | − | − | − | nd |
| Pectin | w | + | − | w | − | − | nd |
*Data from
FIGURE 3Visualization of the Agrobacterium pangenome. Genomes are ordered as layers using a tree based on average nucleotide identity (ANI) values matrix (Euclidean distance, Ward clustering). Color hue of ANI values are ranging from 70% ANI (white) to 100% ANI (red) and displayed as a heatmap on the right-side panel. Gene clusters are represented as vertical black bars indicating their presence in each genome layer. The central tree is ordering gene clusters based on their presence/absence in genomes using Euclidean distance and Ward clustering. The “Core GC” highlight indicates the gene clusters identified in all genomes. Gene clusters that are found in a single genome are collapsed as “Singletons GC.” The “A. tomkonis ISS strains GC” highlight indicates gene clusters found exclusively on the three strains isolated from the International Space Station. The figure was created in Anvi’o v6.