Literature DB >> 35142540

Complete Genome Sequence of Bacillus sp. Strain NC3, Isolated from Trichonephila Spider Ground Extract.

Masahiro Ito1,2,3, Satomu Hasunuma1.   

Abstract

Bacillus sp. strain NC3, an aerobic Gram-positive bacterium, was isolated from a suspension of ground Trichonephila clavata specimens in saline in Japan. Here, we report the complete genome sequence of this bacterium, which has a 3.72-Mbp genome, containing 3,717 protein-coding sequences, 24 rRNA-coding sequences, and 81 tRNA-coding sequences.

Entities:  

Year:  2022        PMID: 35142540      PMCID: PMC8830360          DOI: 10.1128/mra.01110-21

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

We report the whole-genome sequence of Bacillus sp. strain NC3, which was isolated from ground Trichonephila clavata specimens in Japan (36.2231N, 139.6347E). The collected spiders were ground using a Beads Crusher μT-01 (TAITEC, Japan). The ground sample was suspended in sterile saline and spread onto modified neutral complex medium (15.5 g of K2HPO4, 4.5 g of KH2PO4, 0.05 g of MgSO4·7H2O, 0.34 g of citric acid, 0.5 g of peptone, 0.2 g of yeast extract, 5 g of glucose, and 11.7 g of NaCl per liter of deionized water [pH 7.0]) plates containing 400 mM cesium chloride (pH 7.0). Strain NC3 was isolated in this way. This bacterium appeared to be most closely related to Bacillus altitudinis 41KF2bT, based on 16S rRNA gene sequence identity (1). A single colony of NC3 was grown in modified neutral complex medium at 30°C for 18 h (2), and genomic DNA was prepared using a Genomic-tip 20/G (Qiagen, Japan) according to the manufacturer’s instructions. For long-read sequencing, a DNA library with barcodes added to a sample using a native barcoding expansion kit (Oxford Nanopore Technologies [ONT], UK) was prepared using a ligation sequencing kit and sequenced with a GridION sequencer (ONT) on an R9.4.1 flow cell. Raw sequence data were base called using Guppy (v4.0.11+f1071ce) (3). The adapter sequences were removed using Porechop (v0.2.3) (4) and reads of 1,000 bases or less were removed using Filtlong (v0.2.0) (5), yielding 220,304 high-quality paired-end reads. The average length of the long reads was 9,460.5 bp, and the total number of base pairs was 2,318,728,121. For short-read sequencing, a DNA library was prepared using the MGIEasy FS DNA library preparation set (MGI Tech, China) according to the manufacturer’s instructions, and the library's quality was confirmed using a fragment analyzer and double-stranded DNA (dsDNA) 915 reagent kit (Advanced Analytical Technologies Inc., USA). The reaction time for enzyme cleavage was 4 min, and fragments with an average length of 442 bp were prepared. Circularized DNA was prepared using the prepared library and the MGIEasy circularization kit according to the manual. DNBSEQ 2 × 200-bp paired-end sequencing was performed using a DNBSEQ-G400 sequencing instrument according to the manufacturer’s instructions. The adapter sequences were removed using Cutadapt (v2.7) (6). About 3.5 million read pairs (1.05 Gbp) were sampled from the sequence, from which the adapter sequences were removed using Seqkit (v0.11.0) (7); Sickle (v1.33) (8) was used to remove bases with quality scores of less than 20, and reads with a base number of fewer than 127 bases and their paired reads were discarded, yielding 6,204,960 high-quality paired-end reads. The average length of the short reads was 200 bp, and the total number of base pairs was 2,469,058,400. The N50 value was 3,715,529 bp. High-quality short-read and long-read sequence data were assembled under the default conditions of Unicycler (v0.4.7) (9), and the genome was circularized. The results of the coding graph assembled using Bandage (v0.8.1) (10) were confirmed, and the integrity of the assembled genomic data was confirmed using CheckM (v1.1.2) (11). The coverage calculated from the total bases used for assembly was 1,289×. Default parameters were used for all software except where otherwise noted. The final chromosome sequence was 3,715,529 bp (G+C content, 41.4%). Automatic annotation was performed using Annotated at DFAST (12), which predicted 3,717 coding sequences, as well as 24 rRNA genes and 81 tRNA genes.

Data availability.

The DDBJ/EMBL/GenBank accession number for the whole-genome sequence of Bacillus altitudinis strain NC3 is AP025264. Raw sequencing data were deposited in the SRA under accession number DRS209021.
  8 in total

1.  SeqKit: A Cross-Platform and Ultrafast Toolkit for FASTA/Q File Manipulation.

Authors:  Wei Shen; Shuai Le; Yan Li; Fuquan Hu
Journal:  PLoS One       Date:  2016-10-05       Impact factor: 3.240

2.  Bacillus aerius sp. nov., Bacillus aerophilus sp. nov., Bacillus stratosphericus sp. nov. and Bacillus altitudinis sp. nov., isolated from cryogenic tubes used for collecting air samples from high altitudes.

Authors:  S Shivaji; P Chaturvedi; K Suresh; G S N Reddy; C B S Dutt; M Wainwright; J V Narlikar; P M Bhargava
Journal:  Int J Syst Evol Microbiol       Date:  2006-07       Impact factor: 2.747

3.  The relationship between a coiled morphology and Mbl in alkaliphilic Bacillus halodurans C-125 at neutral pH values.

Authors:  Shun Fujinami; Takako Sato; Masahiro Ito
Journal:  Extremophiles       Date:  2011-07-24       Impact factor: 2.395

4.  Bandage: interactive visualization of de novo genome assemblies.

Authors:  Ryan R Wick; Mark B Schultz; Justin Zobel; Kathryn E Holt
Journal:  Bioinformatics       Date:  2015-06-22       Impact factor: 6.937

5.  CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes.

Authors:  Donovan H Parks; Michael Imelfort; Connor T Skennerton; Philip Hugenholtz; Gene W Tyson
Journal:  Genome Res       Date:  2015-05-14       Impact factor: 9.043

6.  Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads.

Authors:  Ryan R Wick; Louise M Judd; Claire L Gorrie; Kathryn E Holt
Journal:  PLoS Comput Biol       Date:  2017-06-08       Impact factor: 4.475

7.  Performance of neural network basecalling tools for Oxford Nanopore sequencing.

Authors:  Ryan R Wick; Louise M Judd; Kathryn E Holt
Journal:  Genome Biol       Date:  2019-06-24       Impact factor: 13.583

8.  DFAST: a flexible prokaryotic genome annotation pipeline for faster genome publication.

Authors:  Yasuhiro Tanizawa; Takatomo Fujisawa; Yasukazu Nakamura
Journal:  Bioinformatics       Date:  2018-03-15       Impact factor: 6.937

  8 in total
  1 in total

1.  Complete Genome Sequence of Bacillus sp. Strain TM2, Isolated from Ground Tetragnatha Spider Extract.

Authors:  Masahiro Ito; Satomu Hasunuma
Journal:  Microbiol Resour Announc       Date:  2022-05-10
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.