Literature DB >> 35142544

Draft Genome Sequence of Legionella Species Isolated from Drinking Water in an Italian Industry.

Luna Girolamini1, Silvano Salaris1, Massimiliano Orsini2, Maria Rosaria Pascale1, Marta Mazzotta1, Antonella Grottola3, Sandra Cristino1.   

Abstract

We report the draft genome sequences of an environmental Legionella strain isolated from an industrial water distribution system in Italy. Macrophage infectivity potentiator (mip) and β-subunit of RNA polymerase (rpoB) genes were used to perform the species identification. Whole-genome sequencing (WGS) and average nucleotide identity (ANI) identified the isolate as belonging to a presumptive novel Legionella species, with a genome length of 3,281,851 bp.

Entities:  

Year:  2022        PMID: 35142544      PMCID: PMC8830301          DOI: 10.1128/mra.01152-21

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Legionella spp. are pathogenic Gram-negative bacteria that are ubiquitous in water and soil. Legionella includes more than 66 species and some of them are potentially able to cause a severe form of pneumonia, called Legionnaires’ disease (1). The Legionella sp. strain 31fI33 was isolated from a drinking water in a company located in the Emilia-Romagna region (Italy) during a routine Legionella surveillance program. Water samples and Legionella isolation were performed according to ISO 19458:2006 and ISO 11731:2017, respectively (2, 3). Samples were seeded onto selective medium with glycine-vancomycin-polymyxin B-cycloheximide (GVPC) (Thermo Fisher Diagnostics, Basingstoke, UK) and incubated until 15 days at 35 ± 2°C in 2.5% CO2. Suspected colonies were subcultured on buffered charcoal yeast extract (BCYE) with and without l-cysteine (Thermo Fisher). DNA isolation was performed by InstaGene Matrix (Bio-Rad, Hercules, CA, USA) and the identification of isolate was performed by macrophage infectivity potentiator (mip) and RNA polymerase β subunit (rpoB) genes sequencing (4, 5). A BigDye kit was used for the sequencing reaction and DNA sequences were analyzed on an ABI PRISM 3100 Genetic Analyzer (Applied Biosystems, Foster City, CA, USA). Sequences obtained were analyzed on the BLAST platform by the National Center for Biotechnology Information (NCBI) and European Working Group for Legionella Infections (EWGLI) databases. The best match returned was L. feeleii, reference strain ATCC 35072 (GenBank accession no. GCA_001648615.1), with similarities of 98.2% and 95.1% for mip and rpoB, respectively. An Illumina Nextera XT DNA Library Preparation kit (Illumina, New England Biolabs, Ipswich, MA, USA) was used to perform next-generation sequencing (NGS) library preparation using 100 ng of DNA. Subsequently, the Illumina NextSeq 500 platform (2 × 250 paired-end reads) was used for the sequencing. The bioinformatics workflow for the whole-genome sequencing (WGS) analysis included the following steps: through TORMES v.1.2.0 (6), an automated pipeline for analysis of the whole bacterial genome, raw reads were subjected to sequence quality filtering (PRINSEQ v.0.20.4) (7) and de novo genome assembly (SPAdes v.13.4.1) (8). The generated contigs were passed to CSAR v.1.1.1 (9) in order to build the scaffolds. Scaffolding was performed using the genomes of different evolutionarily related organisms based on taxonomic identification: Legionella hackeliae strain ATCC 35250 (GenBank accession no. LN681225.1) by using Kraken2 v.2.0.9 (10), and Legionella feeleii strain ATCC 35072 based on mip and rpoB identification. The best scaffolding result was obtained for L. hackeliae. To close or reduce the gaps contained in the CSAR output, a remapping of the reads using the scaffolds as a reference sequence was performed with Geneious Prime v.2021.2.2 software (http://www.geneious.com) (11). The obtained draft genome was submitted to GenBank requiring the annotation, performed by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP v.4.3) (12) with the following accession numbers: SRR16560654 and JAJHHJ000000000. The results, summarized in Table 1, represent the assembling and annotation by the PGAP and the completeness of the genome assembly determined by Benchmarking Universal Single-Copy Orthologs (BUSCO) v.5.0.0 (13).
TABLE 1

Genome statistics data obtained from NCBI and BUSCO quality analyses

AttributeData for strain 31fl33
No. of raw reads1,321,792
Avg read length (bp)253
Coverage (×)99
Total length (bp)3,281,851
No. of contigs3
GC content (%)41.3
N50 (bp)1,369,339
No. of coding sequences2,968
No. of rRNAs1
No. of tRNAs41
BUSCO results (% [no. of genes])
 Complete100 (124)
 Single-copy complete100 (124)
 Duplicated complete0.0 (0)
 Fragmented0.0 (0)
 Missing0 (0)
 Total no. of BUSCO genes124
Genome statistics data obtained from NCBI and BUSCO quality analyses A DFAST_QC (14) analysis was carried out using FastANI (15) to calculate average nucleotide identity (ANI) for a taxonomic identity of the genome by querying against 13,000 reference genomes from NCBI type strains. FastANI identified the L. feeleii WO-44C (ATCC 35072) (GenBank accession no. GCA_900639755.1) as the closest relative strain for our isolate 31fI33, with a similarity of 93.99%. Therefore, we can consider this strain as a new Legionella species due to the assumption that two strains belonging to different species show pairwise ANI values below a 95% identity threshold (16).

Data availability.

The draft genome assembly is available in the GenBank database and can be accessed with SRA and assembly accession numbers SRR16560654 and JAJHHJ000000000.
  13 in total

1.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

2.  TORMES: an automated pipeline for whole bacterial genome analysis.

Authors:  Narciso M Quijada; David Rodríguez-Lázaro; Jose María Eiros; Marta Hernández
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

3.  Towards a taxonomic coherence between average nucleotide identity and 16S rRNA gene sequence similarity for species demarcation of prokaryotes.

Authors:  Mincheol Kim; Hyun-Seok Oh; Sang-Cheol Park; Jongsik Chun
Journal:  Int J Syst Evol Microbiol       Date:  2014-02       Impact factor: 2.747

4.  CSAR: a contig scaffolding tool using algebraic rearrangements.

Authors:  Kun-Tze Chen; Chia-Liang Liu; Shang-Hao Huang; Hsin-Ting Shen; Yi-Kung Shieh; Hsien-Tai Chiu; Chin Lung Lu
Journal:  Bioinformatics       Date:  2018-01-01       Impact factor: 6.937

5.  Application of RNA polymerase beta-subunit gene (rpoB) sequences for the molecular differentiation of Legionella species.

Authors:  Kwan Soo Ko; Hae Kyung Lee; Mi-Yeoun Park; Keun-Hwa Lee; Yeo-Jun Yun; So-Yon Woo; Hiroshi Miyamoto; Yoon-Hoh Kook
Journal:  J Clin Microbiol       Date:  2002-07       Impact factor: 5.948

6.  Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data.

Authors:  Matthew Kearse; Richard Moir; Amy Wilson; Steven Stones-Havas; Matthew Cheung; Shane Sturrock; Simon Buxton; Alex Cooper; Sidney Markowitz; Chris Duran; Tobias Thierer; Bruce Ashton; Peter Meintjes; Alexei Drummond
Journal:  Bioinformatics       Date:  2012-04-27       Impact factor: 6.937

7.  High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries.

Authors:  Chirag Jain; Luis M Rodriguez-R; Adam M Phillippy; Konstantinos T Konstantinidis; Srinivas Aluru
Journal:  Nat Commun       Date:  2018-11-30       Impact factor: 14.919

8.  Improved metagenomic analysis with Kraken 2.

Authors:  Derrick E Wood; Jennifer Lu; Ben Langmead
Journal:  Genome Biol       Date:  2019-11-28       Impact factor: 17.906

9.  NCBI prokaryotic genome annotation pipeline.

Authors:  Tatiana Tatusova; Michael DiCuccio; Azat Badretdin; Vyacheslav Chetvernin; Eric P Nawrocki; Leonid Zaslavsky; Alexandre Lomsadze; Kim D Pruitt; Mark Borodovsky; James Ostell
Journal:  Nucleic Acids Res       Date:  2016-06-24       Impact factor: 16.971

10.  DFAST: a flexible prokaryotic genome annotation pipeline for faster genome publication.

Authors:  Yasuhiro Tanizawa; Takatomo Fujisawa; Yasukazu Nakamura
Journal:  Bioinformatics       Date:  2018-03-15       Impact factor: 6.937

View more
  1 in total

1.  Combining Traditional and Molecular Techniques Supports the Discovery of a Novel Legionella Species During Environmental Surveillance in a Healthcare Facility.

Authors:  Luna Girolamini; Maria Rosaria Pascale; Marta Mazzotta; Simona Spiteri; Federica Marino; Silvano Salaris; Antonella Grottola; Massimiliano Orsini; Sandra Cristino
Journal:  Front Microbiol       Date:  2022-06-13       Impact factor: 6.064

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.