Literature DB >> 25323728

Genome Sequence of Legionella massiliensis, Isolated from a Cooling Tower Water Sample.

Isabelle Pagnier1, Olivier Croce1, Catherine Robert1, Didier Raoult1, Bernard La Scola2.   

Abstract

We present the draft genome sequence of Legionella massiliensis strain LegA(T), recovered from a cooling tower water sample, using an amoebal coculture procedure. The strain described here is composed of 4,387,007 bp, with a G+C content of 41.19%, and its genome has 3,767 protein-coding genes and 60 predicted RNA genes.
Copyright © 2014 Pagnier et al.

Entities:  

Year:  2014        PMID: 25323728      PMCID: PMC4200166          DOI: 10.1128/genomeA.01068-14

Source DB:  PubMed          Journal:  Genome Announc


GENOME ANNOUNCEMENT

Legionella massiliensis strain LegAT was isolated from an environmental water sample from a cooling tower located in South France, using an amoebal coculture procedure (1). This is a Gram-negative and Gimenez-positive bacillus classified in the genus Legionella. Based on the sequencing of the complete 16S rRNA gene, using BLASTn (2), the closest related species is Legionella birminghamensis (accession no. NR044953), with 96.7% similarity. Regarding the macrophage infectivity potentiator (mip) gene, strain LegAT exhibits 78% similarity with Legionella feeleii (accession no. FJ009368). The strain L. massiliensis LegAT is unable to grow on Columbia with 5% sheep blood agar and is able to grow after 3 days on buffered charcoal yeast extract (BCYE) medium under a 5% CO2 atmosphere. The bacteria showed negative reactions for oxidase, cefinase, and gelatinase. L. massiliensis LegAT is deposited in the DSMZ (DSM 24804T) and CSUR (SCUR P146T) culture collections. We therefore sequenced the whole genome of L. massiliensis LegAT in order to determine the phylogenetic relationships with closely related Legionella species. The DNA genome was sequenced using two high-throughput next-generation sequencing (NGS) technologies: Roche 454 (3) and MiSeq Illumina (Illumina, Inc., San Diego, CA). A library of 5-kb paired-ends was constructed, loaded on a PicoTiterPlate (PTP), and sequenced with the Roche-GS FLX Titanium sequencing kit XLR70. MiSeq Illumina sequencing was performed using two applications, paired-end and mate-pair Nextera libraries, in a 2 × 250 bp run for each barcoded library. The reads from various sequencing technologies were first assembled separately. The reads from 454 sequencing were assembled into contigs and scaffolds using Newbler version 2.8 (Roche, 454 Life Sciences). The Illumina reads were trimmed using Trimmomatic (4) and then assembled with the SPAdes software (5, 6) while adding contigs generated from Roche 454. The obtained contigs were combined by the SSPACE (7) and Opera softwares (8) and helped by GapFiller (9) to reduce the set. Some manual refinements using CLC Genomics software (CLC bio, Aarhus, Denmark) and homemade tools improved the genome. Finally, the draft genome of L. massiliensis LegAT consists of 8 contigs without gaps, containing 4,387,007 bp and a G+C content of 41.19%. Noncoding genes and miscellaneous features were predicted using RNAmmer (10), ARAGORN (11), Rfam (12), Pfam (13), and Infernal (14). Coding DNA sequences (CDSs) were predicted using Prodigal (15), and functional annotation was achieved using BLAST+ (16) and HMMER3 (17) against the UniProtKB database (18). The genome was shown to contain at least 60 predicted RNAs, including 7 rRNAs, 40 tRNAs, 1 transfer-messenger RNA (tmRNA), and 12 miscellaneous RNAs. A total of 3,767 genes were also identified, representing a coding capacity of 3,811,533 bp (coding percentage, 86.88%). Among these genes, 174 (4.62%) were founded as putative proteins and 1,326 (35.2%) were assigned as hypothetical proteins. Moreover, 2,405 genes matched a least one sequence in the Clusters of Orthologous Groups (COGs) database (19, 20) with BLASTp default parameters.

Nucleotide sequence accession numbers.

The L. massiliensis strain LegAT genome sequence has been deposited at EMBL under the accession numbers CCVW01000001 to CCVW01000008.
  20 in total

1.  ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences.

Authors:  Dean Laslett; Bjorn Canback
Journal:  Nucleic Acids Res       Date:  2004-01-02       Impact factor: 16.971

2.  Rfam: an RNA family database.

Authors:  Sam Griffiths-Jones; Alex Bateman; Mhairi Marshall; Ajay Khanna; Sean R Eddy
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

3.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

4.  Legionella tunisiensis sp. nov. and Legionella massiliensis sp. nov., isolated from environmental water samples.

Authors:  Angélique Campocasso; Mondher Boughalmi; Ghislain Fournous; Didier Raoult; Bernard La Scola
Journal:  Int J Syst Evol Microbiol       Date:  2012-02-03       Impact factor: 2.747

5.  Assembling single-cell genomes and mini-metagenomes from chimeric MDA products.

Authors:  Sergey Nurk; Anton Bankevich; Dmitry Antipov; Alexey A Gurevich; Anton Korobeynikov; Alla Lapidus; Andrey D Prjibelski; Alexey Pyshkin; Alexander Sirotkin; Yakov Sirotkin; Ramunas Stepanauskas; Scott R Clingenpeel; Tanja Woyke; Jeffrey S McLean; Roger Lasken; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2013-10       Impact factor: 1.479

6.  Opera: reconstructing optimal genomic scaffolds with high-throughput paired-end sequences.

Authors:  Song Gao; Wing-Kin Sung; Niranjan Nagarajan
Journal:  J Comput Biol       Date:  2011-09-19       Impact factor: 1.479

7.  Prodigal: prokaryotic gene recognition and translation initiation site identification.

Authors:  Doug Hyatt; Gwo-Liang Chen; Philip F Locascio; Miriam L Land; Frank W Larimer; Loren J Hauser
Journal:  BMC Bioinformatics       Date:  2010-03-08       Impact factor: 3.169

8.  Toward almost closed genomes with GapFiller.

Authors:  Marten Boetzer; Walter Pirovano
Journal:  Genome Biol       Date:  2012-06-25       Impact factor: 13.583

9.  Ongoing and future developments at the Universal Protein Resource.

Authors: 
Journal:  Nucleic Acids Res       Date:  2010-11-04       Impact factor: 16.971

10.  The Pfam protein families database.

Authors:  Marco Punta; Penny C Coggill; Ruth Y Eberhardt; Jaina Mistry; John Tate; Chris Boursnell; Ningze Pang; Kristoffer Forslund; Goran Ceric; Jody Clements; Andreas Heger; Liisa Holm; Erik L L Sonnhammer; Sean R Eddy; Alex Bateman; Robert D Finn
Journal:  Nucleic Acids Res       Date:  2011-11-29       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.