Literature DB >> 35543511

Complete Genome Sequences of Four Parageobacillus Strains Isolated from Soil in Japan.

Kentaro Miyazaki1,2,3, Kae Hosoya1,4.   

Abstract

We isolated four Parageobacillus strains from soil in Japan and completely sequenced their genomes. Three of four strains showed ≥98.9% average nucleotide identity (ANI) to Parageobacillus caldoxylosilyticus S1812T, while one strain, designated KH3-4, showed the highest ANI (91%) to Parageobacillus thermantarcticus M1T, suggesting the species novelty of KH3-4.

Entities:  

Year:  2022        PMID: 35543511      PMCID: PMC9202437          DOI: 10.1128/mra.00204-22

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Parageobacillus is a genus of betaproteobacteria in the family Burkholderiaceae that is Gram-positive and a facultatively anaerobic thermophile. Parageobacillus species have great biotechnological potential (1), for example, as a source for thermophilic enzymes (2), fuel production (3, 4), and the bioremediation of environmental pollutants (5). At the time of writing, there are six validly named species in the genus Parageobacillus (https://lpsn.dsmz.de/genus/parageobacillus). So far, seven complete genome sequences have been reported for Parageobacillus, including for Parageobacillus caldoxylosilyticus (1 strain), Parageobacillus thermoglucosidasius (4 strains), and Parageobacillus toebii (2 strains). We collected soil samples from the city of Tsukuba, Japan. The samples were suspended in distilled water and spread over Lennox LB agar (1.6% [wt/vol]) plates. After incubation at 65°C overnight, dozens of well-separated single colonies were isolated; colony PCR was conducted to analyze the 16S rRNA genes using a set of primers, Bac8f(C) and UN1542r (6). Among the colonies, four strains, designated KH1-5, KH1-6, KH3-4, and KH3-5, which were expected to belong to the genus Parageobacillus, were subjected to complete genome analysis. To prepare the genomic DNA, cells were grown in 5 mL LB broth at 65°C for 24 h with vigorous shaking (200 rpm). The genomic DNA was purified using a blood and cell culture DNA mini kit (Qiagen). For long-read sequencing, unsheared genomic DNA (1 μg) was treated using a short-read eliminator kit (Circulomics) to remove fragments of <10 Kbp, and a library was constructed using a ligation sequencing kit (Oxford Nanopore Technologies [ONT]). Sequencing was performed using a GridION X5 system on a FLO-MIN106 R9.41 revD flow cell (ONT). Base calling was conducted using Guppy v.4.0.11. The raw sequencing data (Table 1) were filtered (Q < 10; length, <1,000 bases) using NanoFilt v.2.7.1 (7). For short-read sequencing, a library was constructed using an MGIEasy FS PCR free DNA library prep set (MGI) with a ~400 to 500-bp insert. Paired-end sequencing (2 × 150 bases) was then performed on a DNBSEQ-400 instrument (MGI). The raw sequencing data (Table 1) were filtered (Q < 30; length, <20 bases) using fastp v.0.20.1 (8). The trimmed long- and short-read data were assembled using Unicycler v.0.4.8 (9), and the assembly was polished using Pilon v.1.24 (10). Each strain contained a single circular chromosome, and KH3-5 contained one circular plasmid; the circularity was confirmed using Unicycler.
TABLE 1

Sequencing metrics for the four Parageobacillus strains in this study

StrainBioSample accession no.Chromosome or plasmidDNBSEQ (short-read) data
GridION (long-read) data
Length (bp)GC content (%)GenBank accession no.
No. of paired-end readsTotal length (Mb)SRA accession no.No. of readsN50 (bp)Total length (Mb)SRA accession no.
P. caldoxylosilyticus KH1-5 SAMD00442691 Chromosome7,592,5381,139 DRR346603 136,9645,757552 DRR346607 3,850,76544.3 AP025623
P. caldoxylosilyticus KH1-6 SAMD00442692 Chromosome7,161,7691,074 DRR346604 179,5159,8371,244 DRR346608 3,850,77344.3 AP025624
Parageobacillus sp. KH3-4 SAMD00442693 Chromosome9,226,5241,384 DRR346605 981,5294,5313,090 DRR346609 3,816,93243.0 AP025627
P. caldoxylosilyticus KH3-5 SAMD00442694 Chromosome7,158,4421,074 DRR346606 1,107,1804,6443,572 DRR346610 3,832,28544.2 AP025625
Plasmid (pPcaKH3-5b)6,88951.7 AP025626
Sequencing metrics for the four Parageobacillus strains in this study Automatic annotation was conducted using DFAST v.1.2.15 (11); the genomic features are summarized in Table 1. A JSpecies analysis (12) revealed that KH1-5, KH1-6, and KH3-5 showed ≥98.9% average nucleotide identity (ANI) to each other and to the type strain of P. caldoxylosilyticus (strain S1812; GenBank accession number GCF_019272935.1), while KH3-4 showed the highest ANI (91.9%) to the type strain of P. thermantarcticus (strain M1; GCF_900111865.1), suggesting the species novelty of KH3-4 (95% ANI being the cutoff for the delineation of a species). For all software, default parameters were used.

Data availability.

All four Parageobacillus strains reported in this paper are associated with BioProject accession number PRJDB12551. The BioSample accession numbers, genome sequences, and raw sequencing data are available under the accession numbers listed in Table 1.
  11 in total

1.  Metabolic engineering of Geobacillus thermoglucosidasius for high yield ethanol production.

Authors:  R E Cripps; K Eley; D J Leak; B Rudd; M Taylor; M Todd; S Boakes; S Martin; T Atkinson
Journal:  Metab Eng       Date:  2009-08-22       Impact factor: 9.783

Review 2.  The genus Geobacillus and their biotechnological potential.

Authors:  Ali H Hussein; Beata K Lisowska; David J Leak
Journal:  Adv Appl Microbiol       Date:  2015-05-05       Impact factor: 5.086

3.  Evolutionary engineering of Geobacillus thermoglucosidasius for improved ethanol production.

Authors:  Jiewen Zhou; Kang Wu; Christopher V Rao
Journal:  Biotechnol Bioeng       Date:  2016-04-22       Impact factor: 4.530

4.  Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement.

Authors:  Bruce J Walker; Thomas Abeel; Terrance Shea; Margaret Priest; Amr Abouelliel; Sharadha Sakthikumar; Christina A Cuomo; Qiandong Zeng; Jennifer Wortman; Sarah K Young; Ashlee M Earl
Journal:  PLoS One       Date:  2014-11-19       Impact factor: 3.240

5.  DFAST and DAGA: web-based integrated genome annotation tools and resources.

Authors:  Yasuhiro Tanizawa; Takatomo Fujisawa; Eli Kaminuma; Yasukazu Nakamura; Masanori Arita
Journal:  Biosci Microbiota Food Health       Date:  2016-07-14

6.  Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads.

Authors:  Ryan R Wick; Louise M Judd; Claire L Gorrie; Kathryn E Holt
Journal:  PLoS Comput Biol       Date:  2017-06-08       Impact factor: 4.475

7.  PCR Primer Design for 16S rRNAs for Experimental Horizontal Gene Transfer Test in Escherichia coli.

Authors:  Kentaro Miyazaki; Mitsuharu Sato; Miyuki Tsukuda
Journal:  Front Bioeng Biotechnol       Date:  2017-02-28

8.  fastp: an ultra-fast all-in-one FASTQ preprocessor.

Authors:  Shifu Chen; Yanqing Zhou; Yaru Chen; Jia Gu
Journal:  Bioinformatics       Date:  2018-09-01       Impact factor: 6.937

9.  JSpeciesWS: a web server for prokaryotic species circumscription based on pairwise genome comparison.

Authors:  Michael Richter; Ramon Rosselló-Móra; Frank Oliver Glöckner; Jörg Peplies
Journal:  Bioinformatics       Date:  2015-11-16       Impact factor: 6.937

10.  NanoPack: visualizing and processing long-read sequencing data.

Authors:  Wouter De Coster; Svenn D'Hert; Darrin T Schultz; Marc Cruts; Christine Van Broeckhoven
Journal:  Bioinformatics       Date:  2018-08-01       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.