Literature DB >> 25502663

Genome Sequence of Bacillus simplex Strain P558, Isolated from a Human Fecal Sample.

Olivier Croce1, Perrine Hugon1, Jean-Christophe Lagier1, Fehmida Bibi2, Catherine Robert1, Esam Ibraheem Azhar, Didier Raoult, Pierre-Edouard Fournier3.   

Abstract

Bacillus simplex strain P558 was isolated from a fecal sample of a 25-year-old Saudi male. We sequenced the 5.98-Mb genome of the strain and compared it to that of B. simplex strain 1NLA3E.
Copyright © 2014 Croce et al.

Entities:  

Year:  2014        PMID: 25502663      PMCID: PMC4263825          DOI: 10.1128/genomeA.01241-14

Source DB:  PubMed          Journal:  Genome Announc


GENOME ANNOUNCEMENT

Bacillus simplex was first described in 1901 by Meyer and Gottheil (1). This bacterium is an environmental microorganism, notably found in soil. To date, it has not been found in humans. Here, we sequenced the genome from B. simplex strain P558 that was isolated from a fecal sample of a 25-year-old Saudi male living in Jeddah, Saudi Arabia, as part of a culturomics study aiming to isolate all bacterial species present in the human gut (2). Based on the sequencing of the complete 16S rRNA gene, strain P558 was found to exhibit 99.93% sequence identity with B. simplex strain AM2 (GenBank accession no. JQ435679), its closest phylogenetic relative. B. simplex strain P558 was deposited in the CSUR collection under number CSUR P558. We sequenced the whole genome of B. simplex strain P558 with the MiSeq sequencer (Illumina, San Diego, CA, USA) using a mate-pair Nextera XT sample preparation kit (Illumina) in a 2× 250-bp run. The Illumina reads were trimmed using Trimmomatic (3) and then assembled with the SPAdes software (4). The obtained contigs were combined using the SSPACE (5) and Opera (6) softwares, helped by GapFiller (7) to reduce the set. Some manual refinements using the CLC Genomics software (CLC bio, Aarhus, Denmark) improved the genome assembly quality. Overall, the draft genome of B. simplex strain P558 consists of 8 scaffolds and a single gap, for a total size of 5,983,568 bp and a G+C content of 40.23%. The coding DNA sequences were predicted using Prodigal (8), and functional annotation was achieved using BLAST+ (9) and HMMER3 (10) against the UniProtKB database (11). Noncoding genes and miscellaneous features were predicted using the RNAmmer (12), ARAGORN (13), Rfam (14), Pfam (15), and Infernal (16) softwares. The genome assembly of B. simplex strain P558 consists of 8 contigs and contains 5,814 protein-coding genes and 169 predicted RNA genes, including 11 rRNAs (3 complete rRNA operons), 50 tRNAs, 1 transfer-messenger RNA (tmRNA), and 107 miscellaneous RNAs. The coding capacity was 4,862,007 bp (81.26% of the total genome). Among the predicted genes, 4,248 genes (73%) matched a least one sequence in the Clusters of Orthologous Groups database (17), with BLASTp default parameters. In addition, 296 (5.09%) and 900 (15.48%) genes were annotated as encoding putative and hypothetical proteins, respectively. In comparison with the genome of B. simplex strain 1NLA3E (GenBank accession no. NC_021171), the phylogenetically closest available sequenced genome, strain P558, is larger (5,983,568 and 4,815,602 bp, respectively) and has a higher G+C content (40.23 and 37.95%, respectively) and more protein-coding genes (5,814 and 4,410 genes, respectively), but it has a smaller ratio of genes per Mb (972 and 1,092 genes/Mb, respectively).

Nucleotide sequence accession numbers.

The genome sequence from B. simplex strain P558 has been deposited in EMBL under accession numbers CCXW01000001 to CCXW01000008.
  16 in total

1.  ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences.

Authors:  Dean Laslett; Bjorn Canback
Journal:  Nucleic Acids Res       Date:  2004-01-02       Impact factor: 16.971

2.  Rfam: an RNA family database.

Authors:  Sam Griffiths-Jones; Alex Bateman; Mhairi Marshall; Ajay Khanna; Sean R Eddy
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

3.  Opera: reconstructing optimal genomic scaffolds with high-throughput paired-end sequences.

Authors:  Song Gao; Wing-Kin Sung; Niranjan Nagarajan
Journal:  J Comput Biol       Date:  2011-09-19       Impact factor: 1.479

4.  Prodigal: prokaryotic gene recognition and translation initiation site identification.

Authors:  Doug Hyatt; Gwo-Liang Chen; Philip F Locascio; Miriam L Land; Frank W Larimer; Loren J Hauser
Journal:  BMC Bioinformatics       Date:  2010-03-08       Impact factor: 3.169

5.  Toward almost closed genomes with GapFiller.

Authors:  Marten Boetzer; Walter Pirovano
Journal:  Genome Biol       Date:  2012-06-25       Impact factor: 13.583

6.  Ongoing and future developments at the Universal Protein Resource.

Authors: 
Journal:  Nucleic Acids Res       Date:  2010-11-04       Impact factor: 16.971

7.  Accelerated Profile HMM Searches.

Authors:  Sean R Eddy
Journal:  PLoS Comput Biol       Date:  2011-10-20       Impact factor: 4.475

8.  The Pfam protein families database.

Authors:  Marco Punta; Penny C Coggill; Ruth Y Eberhardt; Jaina Mistry; John Tate; Chris Boursnell; Ningze Pang; Kristoffer Forslund; Goran Ceric; Jody Clements; Andreas Heger; Liisa Holm; Erik L L Sonnhammer; Sean R Eddy; Alex Bateman; Robert D Finn
Journal:  Nucleic Acids Res       Date:  2011-11-29       Impact factor: 16.971

9.  RNAmmer: consistent and rapid annotation of ribosomal RNA genes.

Authors:  Karin Lagesen; Peter Hallin; Einar Andreas Rødland; Hans-Henrik Staerfeldt; Torbjørn Rognes; David W Ussery
Journal:  Nucleic Acids Res       Date:  2007-04-22       Impact factor: 16.971

10.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

View more
  4 in total

Review 1.  Bacillus spp. as Bio-factories for Antifungal Secondary Metabolites: Innovation Beyond Whole Organism Formulations.

Authors:  Bruno Salazar; Aurelio Ortiz; Chetan Keswani; Tatiana Minkina; Saglara Mandzhieva; Satyendra Pratap Singh; Bhagwan Rekadwad; Rainer Borriss; Akansha Jain; Harikesh B Singh; Estibaliz Sansinenea
Journal:  Microb Ecol       Date:  2022-05-23       Impact factor: 4.552

2.  Mining the phytomicrobiome to understand how bacterial coinoculations enhance plant growth.

Authors:  Maskit Maymon; Pilar Martínez-Hidalgo; Stephen S Tran; Tyler Ice; Karena Craemer; Teni Anbarchian; Tiffany Sung; Lin H Hwang; Minxia Chou; Nancy A Fujishige; William Villella; Jérôme Ventosa; Johannes Sikorski; Erin R Sanders; Kym F Faull; Ann M Hirsch
Journal:  Front Plant Sci       Date:  2015-09-24       Impact factor: 5.753

3.  Antifungal Activity of Bacillus Species Against Fusarium and Analysis of the Potential Mechanisms Used in Biocontrol.

Authors:  Noor Khan; Pilar Martínez-Hidalgo; Tyler A Ice; Maskit Maymon; Ethan A Humm; Najmeh Nejat; Erin R Sanders; Drora Kaplan; Ann M Hirsch
Journal:  Front Microbiol       Date:  2018-10-02       Impact factor: 5.640

4.  Diversity and evolutionary dynamics of spore-coat proteins in spore-forming species of Bacillales.

Authors:  Henry Secaira-Morocho; José A Castillo; Adam Driks
Journal:  Microb Genom       Date:  2020-10-14
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.