Literature DB >> 35448968

Whole genome sequencing and taxonomic profiling of two Pantoea sp. isolated from environmental samples in Israel.

Yehoudit Guedj-Dana1, Inbar Cohen-Gihon1, Ofir Israeli1, Ohad Shifman1, Tamar Aminov1, Shahar Rotem1, Raphael Ber1, Anat Zvi2.   

Abstract

OBJECTIVE: As part of a research aiming at the isolation of bacteria secreting growth inhibiting compounds, cultures of Francisella tularensis were implanted in environmental samples and monitored for inhibition zones on agar. Two antibiotic-like secreting bacteria were isolated, their genomic sequence was deciphered and taxonomic profiling analysis classified them as belonging to the Pantoea genus. DATA DESCRIPTION: Two bacterial isolates exhibiting growth inhibition zones to F. tularensis (LVS) were analyzed using the Oxford Nanopore Technology (ONT). Preliminary de novo assembly of the reads was performed, followed by taxonomic profiling based on Multi Locus Sequence Analysis (MLSA) and implementation of the Average Nucleotide Identity (ANI) measure. The genomic sequences resulted in the identification of two different Pantoea species, denoted EnvD and EnvH. Subsequent de novo genome assembly generated 5 and 10 contigs for EnvD and EnvH, respectively. The largest contig (4,008,183 bps and 3,740,753 bps for EnvD and EnvH, respectively), overlaps to a major extent to the chromosome of closely related Pantoea species. ANI values calculated for both isolates revealed two apparently new species of the Pantoea genus. Our study deciphered the identity of two bacteria producing antibiotic-like compounds, and the genomic sequence revealed they represent distinct Pantoea species.
© 2022. The Author(s).

Entities:  

Keywords:  De novo assembly; Environmental samples; Oxford nanopore; Pantoea; Taxonomy; Whole genome sequencing

Mesh:

Substances:

Year:  2022        PMID: 35448968      PMCID: PMC9022320          DOI: 10.1186/s12863-022-01049-7

Source DB:  PubMed          Journal:  BMC Genom Data        ISSN: 2730-6844


Objective

Pantoea is a genus of Gram-negative bacteria belonging to the family Erwiniaceae. These bacteria were isolated from a multitude of environmental sources (such as plants, animals, water, soil and human). They are distributed in nature, and over 25 species were already documented [1-4]. In this study we present the taxonomic profiling of two environmental isolates which were identified by their ability to inhibit growth of F. tularensis (LVS) colonies. The isolates were sequenced by long read sequencing and assembled by a de novo assembly. We calculated the ANI measure to establish the species relatedness and the MLSA method was implemented to generate a phylogenetic tree showing these variants as new Pantoea species.

Data description

In this study, over 20 environmental samples were collected from distinct different geographic locations in Israel (natural sources). The samples were prepared by wiping a 20 cm2 area of urban asphalt roads with 3 sterile cotton swabs (Copan Italia SPA) moistened aseptically in sterile phosphate-buffered saline (PBS) solution. The swabs were discarded after extraction by vortex in 5 mL PBS. After 2 min. of passive sedimentation of dust and dirt, 4.2 mL were transferred to new sterile tube. An aliquot of 0.1 mL of each environmental sample was mixed with 0.1 mL of PBS containing 108 cfu/ml of indicator bacteria (F. tularensis LVS), the mix was spread on cysteine heart agar with hemoglobin (CHA, Difco), plates were incubated for 2 days at 37 °C and monitored for colonies demonstrating inhibition zone for the indicator strain. Several such colonies were further isolated to pure cultures on CHA, and their inhibition to LVS was verified. Two of the isolates showed differential capability to grow in Brain Heart Infusion broth, and were denoted herein as EnvD (from Talpiot industrial zone, Jerusalem) and EnvH (from Romema neighborhood, Haifa). For DNA purification, EnvD and EnvH colonies were grown on CHA, suspended to high density in 1 mL PBS, mixed 1:1 with ATL buffer, heated at 100 °C for 30 min before DNA purification using the QIAamp DNA blood minikit (Qiagen). Whole genome sequencing was conducted in GenoHub facility (https://genohub.com) using the Oxford Nanopore MinION Technology. Libraries for both EnvD and EnvH were prepared using the SQK-LSK109 ligation sequencing kit (Oxford Nanopore Technologies) and sequencing was conducted using an R9 flow cell. Basecalling was performed using Guppy v 4.4.2. A total of 210,000 reads for EnvD (mean average length of 3854 bps) and 190,000 reads for EnvH (mean average length of 1881 bps) were obtained, resulting in a coverage of 155x and 89x, respectively (data file 1) [5]. De novo assembly was conducted by implementing Flye [6], designed for the assembly of long reads generated by ONT. Five contigs were obtained for EnvD, the largest harboring a length of 4,008,183 bps, the N50 value being therefore 4,008,183. The EnvH generated reads were assembled to a total of 10 contigs, the longest being 3,740,753 bps (which is therefore the N50 value). A rough and preliminary estimation of the taxonomical relatedness of the two isolates conducted by a Blast analysis [7] against the nucleotide database (nt, https://www.ncbi.nlm.nih.gov/nucleotide/), disclosed similarity to the Pantoea genus. Subsequently, the longest contig of each isolate was compared to all publicly available Pantoea sequences (National Center for Biotechnology Information, NCBI), revealing that the best matching hits for EnvD and EnvH are Pantoea agglomerans and Pantoea stewartii, respectively. Accordingly, the contigs generated by the assembly were aligned to their closest species, using Mauve [8]. The alignments display a nearly complete coverage of the reference chromosome by the largest contig, both for EnvD and EnvH. In addition, two of the remaining contigs overlap plasmid regions in the reference genomes (data file 2) [9]. To further characterize whether the sequenced genomes can be associated with one of the already known Pantoea species, we used the ANI measure as a well-established whole genome similarity metrics [10-14]. The ANI value was estimated using the FastANI algorithm [12]. Pairwise ANI values of at most 82% were obtained for each of the sequenced genomes with genomes representing known Pantoea species [15]. According to the generally accepted cutoff value of 95% used as a boundary for species delineation [14, 16], it appears that EnvD and EnvH constitute new species within the Pantoea genus. To note, the pairwise ANI value between EnvD and EnvH is 83%, therefore representing two distinct lineages. To assign the taxonomic profiling of the two isolates, we implemented the Multi Locus Sequence Analysis (MLSA) typing method, tailored for phylogeny analysis of Pantoea species [17, 18], using five core genes that are effective at species-level delineation of the genus Pantoea: fusA, gyrB, leuS, pyrG and rpoB [1]. The sequence of the genes orthologous to EnvD and EnvH were extracted from the contig sequences covering the chromosome regions (contig 4 for EnvD and contig 3 for EnvH), and concatenated into a mini-gene. Alongside, the five protein-coding genes were extracted from 37 representative, reference and/or type strains of Pantoea and Tatumella species (for a complete list of species included in the set of 37 genomes, refer to [17]). The multiple alignment of the 37 concatenated sequences together with the concatenated sequences originating from EnvD and EnvH was constructed using the MAFFT algorithm (Multiple Alignment using fast Fourier Transform) [19] of the MegAlign™ Pro (©1993–2020) (DNASTAR®). Phylogenetic analysis of the aligned sequences was conducted using the PhyML tool [20], which estimates Maximum-Likelihood phylogenies (NGPhylogeny.fr [21]). Each of the new isolates, EnvD and EnvH, forms a distinct and separate branch (data file 3) [22]. While EnvH is related to Pantoea stewartii, Pantoea ananatis and Pantoea allii species, EnvD is not only branching from a separate cluster, but is also relatively distant from species in this cluster (which includes, among others, Pantoea agglomerans, corroborating with the preliminary assignment described above for EnvD). To conclude, two growth-inhibiting bacteria from environmental samples collected from two distinct areas in Israel were identified and assigned as belonging to the Pantoea genus. Their taxonomical profiling unveiled that these bacteria can be classified as new Pantoea species diverging from known Pantoea species described up to date. Further experimental characterization of the mechanism involved in the LVS growth inhibition of these Pantoea is now undertaken. Please see Table 1 for links to Data files 1-4 and Data sets 1-3. 1.
Table 1

Overview of data files/data sets

LabelName of data file/data setsFile types (file extension)Data repository and identifier (DOI or accession number)
Data file 1Sequencing and assembly metricsPortable Document Format file (.pdf)

https://doi.org/10.6084/m9.figshare.15111765.v1

[5]

Data file 2Alignment of contigs to a reference genomePortable Document Format file (.pdf)

https://doi.org/10.6084/m9.figshare.15105198.v1

[9]

Data file 3ANI values for Pantoea sp. EnvD and Pantoea sp. EnvHPortable Document Format file (.pdf)

https://doi.org/10.6084/m9.figshare.19204658.v2

[15]

Data file 4Phylogenetics analysesPortable Document Format file (.pdf)https://doi.org/10.6084/m9.figshare.15111588.v3 [22]
Data set 1Sequencing reads of Pantoea sp. EnvD and Pantoea sp. EnvHFastq file (.fastq.gz)SRP316834 [23]
Data set 2Genome assembly of Pantoea sp. EnvDFASTA / GenBank / ASN.1JAGTWO000000000.1 [24]
Data set 3Genome assembly of Pantoea sp. EnvHFASTA / GenBank / ASN.1JAGTWN000000000.1 [25]
Overview of data files/data sets https://doi.org/10.6084/m9.figshare.15111765.v1 [5] https://doi.org/10.6084/m9.figshare.15105198.v1 [9] https://doi.org/10.6084/m9.figshare.19204658.v2 [15]

Limitations

The de novo assembly resulted in a number of contigs; while the longest contig very nearly cover the chromosome region, some other regions of the EnvD and EnvH sequenced genomes are fragmentally covered. By reason of practical considerations, the phylogeny relatedness of the two Pantoea sp. EnvD and EnvH is based on a limited dataset of 37 known Pantoea species, comprising of representative, reference and/or type strains of Pantoea and Tatumella species.
  17 in total

1.  Genomic insights that advance the species definition for prokaryotes.

Authors:  Konstantinos T Konstantinidis; James M Tiedje
Journal:  Proc Natl Acad Sci U S A       Date:  2005-02-08       Impact factor: 11.205

2.  Phylogeny and identification of Pantoea species and typing of Pantoea agglomerans strains by multilocus gene sequencing.

Authors:  Alexis Delétoile; Dominique Decré; Stéphanie Courant; Virginie Passet; Jennifer Audo; Patrick Grimont; Guillaume Arlet; Sylvain Brisse
Journal:  J Clin Microbiol       Date:  2008-12-03       Impact factor: 5.948

3.  Shifting the genomic gold standard for the prokaryotic species definition.

Authors:  Michael Richter; Ramon Rosselló-Móra
Journal:  Proc Natl Acad Sci U S A       Date:  2009-10-23       Impact factor: 11.205

4.  An In-depth Analysis of a Multilocus Phylogeny Identifies leuS As a Reliable Phylogenetic Marker for the Genus Pantoea.

Authors:  James T Tambong; Renlin Xu; Cynthia-Anne Kaneza; Jean-Claude Nshogozabahizi
Journal:  Evol Bioinform Online       Date:  2014-07-27       Impact factor: 1.625

5.  Taxonomic affiliation of new genomes should be verified using average nucleotide identity and multilocus phylogenetic analysis.

Authors:  María José Figueras; Roxana Beaz-Hidalgo; Mohammad J Hossain; Mark R Liles
Journal:  Genome Announc       Date:  2014-12-04

6.  High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries.

Authors:  Chirag Jain; Luis M Rodriguez-R; Adam M Phillippy; Konstantinos T Konstantinidis; Srinivas Aluru
Journal:  Nat Commun       Date:  2018-11-30       Impact factor: 14.919

7.  NGPhylogeny.fr: new generation phylogenetic services for non-specialists.

Authors:  Frédéric Lemoine; Damien Correia; Vincent Lefort; Olivia Doppelt-Azeroual; Fabien Mareuil; Sarah Cohen-Boulakia; Olivier Gascuel
Journal:  Nucleic Acids Res       Date:  2019-07-02       Impact factor: 16.971

8.  Taxogenomics and Systematics of the Genus Pantoea.

Authors:  James T Tambong
Journal:  Front Microbiol       Date:  2019-10-30       Impact factor: 5.640

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.