Literature DB >> 33664146

Complete and Circularized Bacterial Genome Sequence of Gordonia sp. Strain X0973.

Christopher A Gulvik1, Dhwani Batra2, Lori A Rowe2, Milli Sheth2, Sarah Nobles2,3, Justin S Lee2, John R McQuiston1, Brent A Lasker4.   

Abstract

Gordonia sp. strain X0973 is a Gram-positive, weakly acid-fast, aerobic actinomycete obtained from a human abscess with Gordonia araii NBRC 100433T as its closest phylogenetic neighbor. Here, we report using Illumina MiSeq and PacBio reads to assemble the complete and circular genome sequence of 3.75 Mbp with 3,601 predicted coding sequences.

Entities:  

Year:  2021        PMID: 33664146      PMCID: PMC7936644          DOI: 10.1128/MRA.01479-20

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

In 1971, the genus Gordonia was proposed by Tsukamura to describe a group of aerobic, Gram-positive, weakly acid-fast, nonmotile, catalase-positive, arylsulfatase-negative, coccobacillary actinomycetes with an oxidative carbohydrate metabolism (1). Members of the genus Gordonia have been isolated from a variety of environments, including soil, water, plants, animals, wastewater, and activated sludge (2, 3). Gordonia species are noted for their biodegradative and bioremediation capabilities and their ability to synthesize novel secondary metabolites and industrially relevant enzymes (2–4). A few members are opportunistic pathogens and have been isolated from human sources, including sputum, sternal wound, lung, and ear infections (5). Isolate X0973 was acquired by the Centers for Disease Control and Prevention (CDC) for identification in 2012 from the hand abscess of a patient living in Missouri. Isolate X0973 was grown aerobically on Trypticase soy agar supplemented with 5% sheep blood (TSAB) at 35°C and then identified as a member of the genus Gordonia by 16S rRNA gene BLAST analysis (GenBank accession number KR259249) of the most similar 16S rRNA gene sequence to a validated type strain, Gordonia amarae ATCC 27808. In this investigation, we report the complete genome sequence of a rare human pathogen, Gordonia sp. strain X0973. Gordonia sp. X0973 was obtained from the Special Bacteriology Reference Laboratory at the CDC and was cultured in a flask of 20 ml of Trypticase soy broth for 4 days at 35°C and 200 rpm from a single colony grown on TSAB. Cells were lysed and genomic DNA was purified using the Power Microbial DNA isolation kit. Genomic DNA libraries were prepared using the NEBNext Ultra DNA library prep kit. An Illumina MiSeq system using a v2 reagent kit generated 2 × 250-bp reads. Sequence reads (1,955,100 total) were filtered for read quality, base-called, and demultiplexed using bcl2fastq v2.20. Needle-sheared and Blue Pippin size-selected (20 kbp) DNA libraries were created with the SMRTbell template prep kit v1.0. The DNA/polymerase binding kit P6v2 and C4v2 chemistry were used for sequencing on an RS II instrument, which generated 202,665 total reads (N50, 11,603 bp). Default parameters were used for all software unless otherwise specified. Bash5tools v0.8.0 was used to extract subreads of ≥500 bp. These 101,000 long reads (876 Mbp) were scrubbed as previously described (6, 7). Long-read scrubbing removed 214.5 Mbp (24.5%) due to low-quality scores, repaired 35.5 Mbp (4.1%) of low-quality nucleotides, and discarded 212.6 Mbp (25.2%) of chimeras, and 6.0 Mbp (0.7%) of adaptamers were clipped off. The 89,627 scrubbed reads (N50, 9,878 bp; 650.0 Mbp) were assembled in Flye v2.7-b1585 with the “-g 3.8m” setting (8), which produced a 3,747,033-Mbp single circularized contig with 172× coverage. The Unicycler polish function in Unicycler v0.4.9b was used to correct assembly errors and depended on Bowtie v2.3.4.3, Pilon v1.23, and ALE v20180904 (9–12). Sequential rounds of Illumina read polishing were performed until the assembly likelihood score no longer improved and 927 variants were corrected (1 single nucleotide polymorphism [SNP] and 926 single base insertions of C or G). The chromosome was reoriented with dnaA being the start of the contig, and the GC content was calculated as 68.8% using Biopython v1.74 (13). CheckM v1.0.13 suggested that the assembly is 100% complete (14). Annotation was performed with PGAP v4.11, which predicted 3,601 coding sequences, 2 rRNA gene operons, and 46 pseudogenes (15). The relatedness of the strain X0973 genome to Gordonia type strains was determined based on average nucleotide identity (ANI) (80.9% ± 5.4%) and digital DNA-DNA hybridization (dDDH) (22.6% [20.3% to 25.0%]) with Gordonia araii NBRC 100433T (GCF_000241265.1) as the closest neighbor (16, 17).

Data availability.

The whole-genome sequence of Gordonia sp. X0973 has been deposited in the DDBJ/ENA/GenBank database under the accession number CP054691. The version described in this paper is the first version, CP054691.1. The raw sequence data were deposited in the SRA under accession numbers SRR11951410 and SRR11951409.
  15 in total

Review 1.  Biology of the metabolically diverse genus Gordonia.

Authors:  Matthias Arenskötter; Daniel Bröker; Alexander Steinbüchel
Journal:  Appl Environ Microbiol       Date:  2004-06       Impact factor: 4.792

Review 2.  The strengths and weaknesses of Gordonia: a review of an emerging genus with increasing biotechnological potential.

Authors:  Oliver Drzyzga
Journal:  Crit Rev Microbiol       Date:  2012-05-03       Impact factor: 7.624

3.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

Review 4.  An insight into the ecology, diversity and adaptations of Gordonia species.

Authors:  Harshada Sowani; Mohan Kulkarni; Smita Zinjarde
Journal:  Crit Rev Microbiol       Date:  2017-12-25       Impact factor: 7.624

5.  Biopython: freely available Python tools for computational molecular biology and bioinformatics.

Authors:  Peter J A Cock; Tiago Antao; Jeffrey T Chang; Brad A Chapman; Cymon J Cox; Andrew Dalke; Iddo Friedberg; Thomas Hamelryck; Frank Kauff; Bartek Wilczynski; Michiel J L de Hoon
Journal:  Bioinformatics       Date:  2009-03-20       Impact factor: 6.937

6.  Genome sequence-based species delimitation with confidence intervals and improved distance functions.

Authors:  Jan P Meier-Kolthoff; Alexander F Auch; Hans-Peter Klenk; Markus Göker
Journal:  BMC Bioinformatics       Date:  2013-02-21       Impact factor: 3.169

7.  Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement.

Authors:  Bruce J Walker; Thomas Abeel; Terrance Shea; Margaret Priest; Amr Abouelliel; Sharadha Sakthikumar; Christina A Cuomo; Qiandong Zeng; Jennifer Wortman; Sarah K Young; Ashlee M Earl
Journal:  PLoS One       Date:  2014-11-19       Impact factor: 3.240

8.  Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads.

Authors:  Ryan R Wick; Louise M Judd; Claire L Gorrie; Kathryn E Holt
Journal:  PLoS Comput Biol       Date:  2017-06-08       Impact factor: 4.475

9.  NCBI prokaryotic genome annotation pipeline.

Authors:  Tatiana Tatusova; Michael DiCuccio; Azat Badretdin; Vyacheslav Chetvernin; Eric P Nawrocki; Leonid Zaslavsky; Alexandre Lomsadze; Kim D Pruitt; Mark Borodovsky; James Ostell
Journal:  Nucleic Acids Res       Date:  2016-06-24       Impact factor: 16.971

10.  Complete Genome Sequence of Nocardia farcinica W6977T Obtained by Combining Illumina and PacBio Reads.

Authors:  Christopher A Gulvik; Robert A Arthur; Ben W Humrighouse; Dhwani Batra; Lori A Rowe; Brent A Lasker; John R McQuiston
Journal:  Microbiol Resour Announc       Date:  2019-01-17
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.