Literature DB >> 33986093

Draft Genome Sequences of Legionella Presumptive Novel Species Isolated during Environmental Surveillance in Artificial Water Systems.

Luna Girolamini1, Silvano Salaris1, Massimiliano Orsini2, Maria Rosaria Pascale1, Marta Mazzotta1, Antonella Grottola3, Sandra Cristino4.   

Abstract

We present the draft genome sequences of three Legionella strains that were isolated from a hotel water distribution system. Legionella species identification was performed by macrophage infectivity potentiator (mip) and RNA polymerase β subunit (rpoB) gene sequencing. Whole-genome sequencing and average nucleotide identity results supported the hypothesis of new Legionella species isolation.
Copyright © 2021 Girolamini et al.

Entities:  

Year:  2021        PMID: 33986093      PMCID: PMC8142579          DOI: 10.1128/MRA.00307-21

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

The Legionella genus contains pathogenic Gram-negative bacteria that are ubiquitous in soil and water environments. It consists of more than 60 species, all of them potentially able to cause Legionnaires’ disease, a severe form of pneumonia (1). The Legionella sp. strains 27fs60 (S60), 30fs61 (S61), and 30cs62 (S62) were isolated from three different samples from a hotel’s hot water distribution system in the Emilia-Romagna region (Italy) during a routine Legionella surveillance program. Water sampling and Legionella isolation were performed according to ISO 19458:2006 and ISO 11731:2017, respectively (2, 3). Samples were seeded onto selective medium with glycine-vancomycin-polymyxin B-cycloheximide (GVPC) and were incubated for 15 days at 35°C ± 2°C in 2.5% CO2. Suspected colonies were subcultured on buffered charcoal yeast extract (BCYE) without l-cysteine (Thermo Fisher Scientific, Basingstoke, UK). The DNA was extracted with InstaGene matrix (Bio-Rad, Hercules, CA, USA), and identification of isolates was performed by macrophage infectivity potentiator (mip) and RNA polymerase β subunit (rpoB) gene sequencing (4, 5). Amplicons were sequenced using BigDye chemistry and analyzed on an ABI Prism 3100 genetic analyzer (Applied Biosystems, Foster City, CA, USA). The mip sequences were compared with the European Working Group for Legionella Infections (EWGLI) database. A BLAST search of the NCBI database was carried out for both mip and rpoB gene sequences. The best match returned was Legionella quateirensis reference strain ATCC 49507 (GenBank accession number GCA_001467955.1), with similarities of 98.45% and 94.8% for mip and rpoB, respectively. One hundred nanograms of genomic DNA was used for next-generation sequencing (NGS) library preparation using the Illumina Nextera XT DNA library preparation kit (New England Biolabs, Ipswich, MA, USA). Sequencing was performed on the Illumina NextSeq 500 platform (2 × 150-bp paired-end reads). Raw reads were used as input data for TORMES v.1.2.0 (6), an automated pipeline for analysis of whole bacterial genomes. TORMES includes sequence quality filtering (PRINSEQ v.0.20.4) (7) and de novo genome assembly (SPAdes v.13.4.1) (8), as well as other downstream analyses not used for our purpose. Scaffolding was performed using TORMES contigs as input for CSAR v.1.1.1 (9) with an evolutionarily related reference genome, i.e., Legionella fallonii (GenBank accession number NZ_LN614827.1) The final assemblies were further improved using Geneious Prime v.2020.2.4 software (10) and were submitted to GenBank with annotation by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) v.4.3 (11). Default parameters were used for all software tools unless otherwise noted. Table 1 summarizes results from assembly and annotation by the PGAP and the completeness of genome assembly determined by Benchmarking Universal Single-Copy Orthologs (BUSCO) v.5.0.0 (12).
TABLE 1

Genome statistics from NCBI and BUSCO quality analyses

AttributeData for strain:
27fs60 (S60)30fs61 (S61)30cs62 (S62)
No. of raw reads4,184,0623,851,7263,626,424
Avg read length (bp)149149149
Coverage (×)142131124
Total length (bp)4,211,9193,709,4974,136,543
No. of contigs233732
GC content (%)39.0039.1039.00
N50 (bp)312,097166,809176,017
No. of coding sequences3,5423,1553,491
No. of rRNAs333
No. of tRNAs413739
BUSCO results (% [no. of genes])
 Complete99.2 (123)93.5 (116)95.2 (118)
 Single-copy complete99.2 (123)93.5 (116)95.2 (118)
 Duplicated complete0.0 (0)0.0 (0)0.0 (0)
 Fragmented0.8 (1)0.8 (1)0.8 (1)
 Missing0 (0)5.7 (7)4.0 (5)
Total no. of BUSCO genes124124124
Genome statistics from NCBI and BUSCO quality analyses The FastANI tool (13) was used to compare the average nucleotide identity (ANI) of the three strains against 1,009 Legionella sequences that had been downloaded from the NCBI database using the ncbi-genome-download tool (https://github.com/kblin/ncbi-genome-download). FastANI identified the closest relative of strain S60 to be L. quateirensis NCTC 12376 (GenBank accession number GCA_900452695.1) (91.31%) and the closest relative of strains S61 and S62 to be L. quateirensis ATCC 49507 (91.45% and 91.44%, respectively). Since the assumption is that two strains showing pairwise ANI values below a given threshold (95% or 96 %) belong to different species (14), our results led us to consider these strains new species. Studying the whole genome allows investigators to better identify already known species and to discover new ones, improving the knowledge of the ecological, virulence, and resistance characteristics of Legionella.

Data availability.

The draft genome assemblies are available in the GenBank database and can be accessed with SRA and assembly accession numbers SRP292355 and JADOBG000000000 (S60), SRP295125 and JADWVM000000000 (S61), and SRP295130 and JADWVN000000000 (S62).
  11 in total

1.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

2.  TORMES: an automated pipeline for whole bacterial genome analysis.

Authors:  Narciso M Quijada; David Rodríguez-Lázaro; Jose María Eiros; Marta Hernández
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

3.  BUSCO: Assessing Genome Assembly and Annotation Completeness.

Authors:  Mathieu Seppey; Mosè Manni; Evgeny M Zdobnov
Journal:  Methods Mol Biol       Date:  2019

4.  Towards a taxonomic coherence between average nucleotide identity and 16S rRNA gene sequence similarity for species demarcation of prokaryotes.

Authors:  Mincheol Kim; Hyun-Seok Oh; Sang-Cheol Park; Jongsik Chun
Journal:  Int J Syst Evol Microbiol       Date:  2014-02       Impact factor: 2.747

5.  CSAR: a contig scaffolding tool using algebraic rearrangements.

Authors:  Kun-Tze Chen; Chia-Liang Liu; Shang-Hao Huang; Hsin-Ting Shen; Yi-Kung Shieh; Hsien-Tai Chiu; Chin Lung Lu
Journal:  Bioinformatics       Date:  2018-01-01       Impact factor: 6.937

6.  Application of RNA polymerase beta-subunit gene (rpoB) sequences for the molecular differentiation of Legionella species.

Authors:  Kwan Soo Ko; Hae Kyung Lee; Mi-Yeoun Park; Keun-Hwa Lee; Yeo-Jun Yun; So-Yon Woo; Hiroshi Miyamoto; Yoon-Hoh Kook
Journal:  J Clin Microbiol       Date:  2002-07       Impact factor: 5.948

7.  Quality control and preprocessing of metagenomic datasets.

Authors:  Robert Schmieder; Robert Edwards
Journal:  Bioinformatics       Date:  2011-01-28       Impact factor: 6.937

8.  Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data.

Authors:  Matthew Kearse; Richard Moir; Amy Wilson; Steven Stones-Havas; Matthew Cheung; Shane Sturrock; Simon Buxton; Alex Cooper; Sidney Markowitz; Chris Duran; Tobias Thierer; Bruce Ashton; Peter Meintjes; Alexei Drummond
Journal:  Bioinformatics       Date:  2012-04-27       Impact factor: 6.937

9.  High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries.

Authors:  Chirag Jain; Luis M Rodriguez-R; Adam M Phillippy; Konstantinos T Konstantinidis; Srinivas Aluru
Journal:  Nat Commun       Date:  2018-11-30       Impact factor: 14.919

10.  NCBI prokaryotic genome annotation pipeline.

Authors:  Tatiana Tatusova; Michael DiCuccio; Azat Badretdin; Vyacheslav Chetvernin; Eric P Nawrocki; Leonid Zaslavsky; Alexandre Lomsadze; Kim D Pruitt; Mark Borodovsky; James Ostell
Journal:  Nucleic Acids Res       Date:  2016-06-24       Impact factor: 16.971

View more
  1 in total

1.  Combining Traditional and Molecular Techniques Supports the Discovery of a Novel Legionella Species During Environmental Surveillance in a Healthcare Facility.

Authors:  Luna Girolamini; Maria Rosaria Pascale; Marta Mazzotta; Simona Spiteri; Federica Marino; Silvano Salaris; Antonella Grottola; Massimiliano Orsini; Sandra Cristino
Journal:  Front Microbiol       Date:  2022-06-13       Impact factor: 6.064

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.