Literature DB >> 35980180

Complete Genome of Aeromonas encheleia Strain SOD01 Isolated from an Urban Freshwater Stream.

Jade Arruda1, Taylor Brysgel1, Elizabeth Capestro1, Isabelle Flaherty1, Grace Foltz1, Abbey Gardner1, Michaela Hunt1, Jiana Ingrassia1, Sean O'Donnell1, Gregory Pappas1, Emily Peterson1, Sarah Ramsaran1, Marcelo Rocha1, Laura E Williams1.   

Abstract

We isolated Aeromonas encheleia strain SOD01 from an urban freshwater stream in Providence, RI. De novo assembly of PacBio RSII data followed by polishing with Illumina MiSeq data generated a complete 4,450,115 bp genome with 61.8% GC content. PGAP annotation predicted 3,877 protein-coding genes, 127 tRNA, and 31 rRNA.

Entities:  

Year:  2022        PMID: 35980180      PMCID: PMC9476934          DOI: 10.1128/mra.00673-22

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

To isolate prey bacteria for studies of predatory Bdellovibrio (1), we soaked a sterile swab in water collected from an urban stream in Providence, RI (41.835 N, 71.443 W) then swabbed trypticase soy agar. After incubating plates at 28°C, we picked a colony and conducted three rounds of streaking. To establish freezer stock, we combined 0.5 mL 50% glycerol with 0.5 mL overnight culture grown in trypticase soy broth (TSB) at 28°C. Using the Wizard Genomic DNA Purification kit (Promega, Madison, WI), we obtained genomic DNA from overnight cultures grown from freezer stock in TSB at 28°C. We extracted DNA from separate cultures for long- and short-read libraries. For long reads, the University of Maryland Institute for Genome Sciences sheared genomic DNA using g-TUBE (Covaris) at 3400 rpm, and then size selected it on a Blue Pippin instrument (Sage Scientific) with an 11,000 bp cutoff. The library was prepared using SMRTbell Template Prep Kit 1.0 (PacBio) and sequenced with P6-C4 chemistry using one SMRT cell on PacBio RS II. For short reads, the University of Rhode Island Genomics and Sequencing Center sheared genomic DNA using a Covaris S220 focused ultrasonicator. The library was prepared using the PrepX DNA Library kit (TaKaRa Bio), visualized on a high-sensitivity BioAnalyzer chip (Agilent), quantified using the KAPA Illumina Quantification kit (Roche), and sequenced on an Illumina MiSeq to obtain 2 × 250 bp paired-end reads. Unless otherwise noted, default parameters were used for all software. For de novo assembly of the PacBio data (138,319 subreads at N50 12,507 bp), we compared Hierarchical Genome Assembly Process v3 (HGAP3) (2) and Canu 2.2 (3). To test HGAP3, we launched an Amazon EC2 instance of SMRT Portal v2.3.0 and set the estimated genome size at 4.5 Mbp, which yielded a 4,468,532 bp contig. After using BLASTN (4) and EMBOSS 6.6.0.0 extractseq (5) to identify and trim overlap between contig ends, the closed contig was 4,447,847 bp. To test Canu, we set the estimated genome size at 4.45 Mbp, which yielded a 4,464,381 bp contig. After using extractseq to trim overlap based on information from Canu, the closed contig was 4,448,166 bp. Comparison using dnadiff (6) identified only 20 SNVs and 2,559 indel bp between HGAP3 and Canu closed contigs, demonstrating good agreement. We proceeded with Canu output and used BLASTN comparisons and extractseq to rotate the closed contig so that the first base corresponded to the dnaA start codon. For polishing, we aligned raw Illumina MiSeq read 1 and read 2 datasets (5,253,695 reads each) separately to the contig using Burrows-Wheeler aligner “mem” (BWA-mem) 0.7.17 (7) with the option to report all possible alignments for each read. Based on samtools (8) analysis, 5,190,922 R1 reads and 5,087,523 R2 reads aligned to at least one location. We used Polypolish v0.5.0 (9) to remove alignments based on insert size and then correct the sequence. Polypolish reported 456.6× coverage, and dnadiff analysis identified correction of one SNV and 1,949 indel bp, yielding a 4,450,113 bp contig. To confirm, we aligned MiSeq R1 and R2 reads to the corrected contig using POLCA within MaSuRCA 4.0.8 (10). POLCA corrected two indel bp, yielding a final genome sequence of 4,450,115 bp and 61.8% GC. Prokaryotic Genome Annotation Pipeline (PGAP) (11) annotation predicted 3,877 protein-coding genes, 127 tRNA, and 31 rRNA. Digital DNA:DNA hybridization analysis with the Type Strain Genome Server (12) classified SOD01 as Aeromonas encheleia.

Data availability.

Aeromonas sp. strain SOD01 genome sequence was deposited in GenBank (accession no. CP099717). PacBio and MiSeq reads were deposited in the SRA (BioProject accession no. PRJNA852219 and SRA accession no. SRX15851419 and SRX15851420, respectively).
  11 in total

1.  EMBOSS: the European Molecular Biology Open Software Suite.

Authors:  P Rice; I Longden; A Bleasby
Journal:  Trends Genet       Date:  2000-06       Impact factor: 11.639

2.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

3.  The MaSuRCA genome assembler.

Authors:  Aleksey V Zimin; Guillaume Marçais; Daniela Puiu; Michael Roberts; Steven L Salzberg; James A Yorke
Journal:  Bioinformatics       Date:  2013-08-29       Impact factor: 6.937

4.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

5.  Versatile and open software for comparing large genomes.

Authors:  Stefan Kurtz; Adam Phillippy; Arthur L Delcher; Michael Smoot; Martin Shumway; Corina Antonescu; Steven L Salzberg
Journal:  Genome Biol       Date:  2004-01-30       Impact factor: 13.583

6.  Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.

Authors:  Sergey Koren; Brian P Walenz; Konstantin Berlin; Jason R Miller; Nicholas H Bergman; Adam M Phillippy
Journal:  Genome Res       Date:  2017-03-15       Impact factor: 9.043

7.  TYGS is an automated high-throughput platform for state-of-the-art genome-based taxonomy.

Authors:  Jan P Meier-Kolthoff; Markus Göker
Journal:  Nat Commun       Date:  2019-05-16       Impact factor: 14.919

8.  Variation in genome content and predatory phenotypes between Bdellovibrio sp. NC01 isolated from soil and B. bacteriovorus type strain HD100.

Authors:  Laura E Williams; Nicole Cullen; Joseph A DeGiorgis; Karla J Martinez; Justina Mellone; Molly Oser; Jing Wang; Ying Zhang
Journal:  Microbiology       Date:  2019-10-08       Impact factor: 2.777

9.  Polypolish: Short-read polishing of long-read bacterial genome assemblies.

Authors:  Ryan R Wick; Kathryn E Holt
Journal:  PLoS Comput Biol       Date:  2022-01-24       Impact factor: 4.475

10.  NCBI prokaryotic genome annotation pipeline.

Authors:  Tatiana Tatusova; Michael DiCuccio; Azat Badretdin; Vyacheslav Chetvernin; Eric P Nawrocki; Leonid Zaslavsky; Alexandre Lomsadze; Kim D Pruitt; Mark Borodovsky; James Ostell
Journal:  Nucleic Acids Res       Date:  2016-06-24       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.