Literature DB >> 35652632

Complete Genome Sequence of a Phage Infecting Sphingomonadaceae.

Shang Shen1,2, Tetsunobu Anazawa1, Tomonari Matsuda1, Yoshihisa Shimizu1.   

Abstract

We isolated a phage infecting a member of the Sphingomonadaceae family from a freshwater lake. The phage has a DNA genome of 41,771 bp, with a GC content of 61.7%. The genome harbors 50 predicted protein-coding genes and an auxiliary metabolic gene, which encodes a protein belonging to the radical S-adenosylmethionine superfamily.

Entities:  

Year:  2022        PMID: 35652632      PMCID: PMC9302152          DOI: 10.1128/mra.00366-22

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Members of the family Sphingomonadaceae play important roles in aquatic environments (1–4). However, little is known about the phages infecting Sphingomonadaceae and how they control the abundance of this family. The phage, infecting a member of the Sphingomonadaceae family, was isolated from a water sample collected from the surface of a freshwater lake (Lake Biwa, Japan; 35°23′21.0″N, 136°07′51.0″E) and from a strain of Sphingomonadaceae family (BSN-002 [deposited in GenBank under accession number CP091804]) (5). The collected lake water was filtered through 0.2-μm-pore-size polycarbonate filters. The filtrates and the preincubated host strain were incubated using the double-agar method at 25°C for 2 weeks. Virions contained in the plaques were purified twice using the same method. Phage DNA was extracted using the DNeasy blood and tissue kit (Qiagen, Hilden, Germany). The library was prepared using a Nextera XT DNA sample preparation kit (Illumina, San Diego, CA) according to the manufacturer’s protocol. The sample was then sequenced using a MiSeq sequencing system with a V3 (2× 300-bp reads) reagent kit (Illumina). Bioinformatic analysis was performed as described below, and default parameters were used for all software unless otherwise specified. Raw reads (524,664 paired-end reads) with low-quality regions were removed using Trimmomatic with default settings (v.0.39) (6), and finally, 515,345 paired-end reads were obtained. The phage genome was assembled using SPAdes with the –careful option (v.3.13.1) (7) and identified as a complete (i.e., circular) genome using ccfind (v.1.1) (8). Open reading frames (ORFs) were predicted using Prodigal with the -p meta option (v.2.6.3) (9) and annotated by eggNOG-Mapper v.1.0.3 (10) using the -m diamond option. ORFs were also annotated using HMMER v.3.1b2 software (11) against the Prokaryotic Virus Orthologous Groups databases (12) and using the highly sensitive HMM-HMM search (8) against Pfam v.31.0 databases using HHsearch (13) and JackHMMER (14). The genome of the phage (VSN-002) infecting a member of the Sphingomonadaceae family was found to comprise 41,771 bp, with a GC content of 61.7%. Of the 75 predicted ORFs, 41 were leftward oriented, 65 started with ATG, 8 started with GTG, and 2 started with TTG (Table 1). A total of 50 proteins were encoded in the phage genome, varying in length from 138 to 3,186 bp. Of the 50 proteins, 19 were classified as hypothetical or uncharacterized proteins, and 31 were assigned putative functions. The phage genome was found to encode proteins related to phage replication (e.g., terminase, DNA helicase, capsid, and tail). An auxiliary metabolic gene (AMG), which may be expressed during infection to increase viral production by promoting host metabolism, was also found in the phage genome. The phage genome was found to have a radical S-adenosylmethionine (SAM) superfamily that may be involved in critical roles in numerous biosynthetic pathways in bacteria (e.g., synthesis of protein cofactors) (15). BLASTn analysis showed that our Sphingomonadaceae phage was not homologous (more than 95% similarity) to previously reported bacteriophages, and the host of this phage was also not assigned to any existing species in the Genome Taxonomy Databases (5).
TABLE 1

Summary of predicted open reading frames and their annotation

ParameterValue
Length of genome (bp)41,771
GC content (%)61.7
No. of predicted ORFs
 Total75
 Leftward41
 Rightward34
 Start codon with ATG65
 Start codon with GTG8
 Start codon with TTG2
No. of ORFs annotated
 Total50
 Hypothetical or uncharacterized proteins19
 Putative functions31
Summary of predicted open reading frames and their annotation

Data availability.

The complete bacteriophage genome has been deposited in GenBank (accession number MZ127829). Raw reads from whole-genome sequencing data were deposited in the DDBJ (accession number DRR361504).
  15 in total

1.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

2.  Protein homology detection by HMM-HMM comparison.

Authors:  Johannes Söding
Journal:  Bioinformatics       Date:  2004-11-05       Impact factor: 6.937

Review 3.  Molecular characteristics of xenobiotic-degrading sphingomonads.

Authors:  Andreas Stolz
Journal:  Appl Microbiol Biotechnol       Date:  2008-11-11       Impact factor: 4.813

4.  Hidden Markov model speed heuristic and iterative HMM search procedure.

Authors:  L Steven Johnson; Sean R Eddy; Elon Portugaly
Journal:  BMC Bioinformatics       Date:  2010-08-18       Impact factor: 3.169

5.  Accelerated Profile HMM Searches.

Authors:  Sean R Eddy
Journal:  PLoS Comput Biol       Date:  2011-10-20       Impact factor: 4.475

6.  Prokaryotic Virus Orthologous Groups (pVOGs): a resource for comparative genomics and protein family annotation.

Authors:  Ana Laura Grazziotin; Eugene V Koonin; David M Kristensen
Journal:  Nucleic Acids Res       Date:  2016-10-26       Impact factor: 16.971

7.  Draft Genome Sequences of Sphingomonadaceae Strains Isolated from a Freshwater Lake.

Authors:  Shang Shen; Tetsunobu Anazawa; Tomonari Matsuda; Yoshihisa Shimizu
Journal:  Microbiol Resour Announc       Date:  2022-04-06

8.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

9.  Draft Genome Sequence of Two Sphingopyxis sp. Strains, Dominant Members of the Bacterial Community Associated with a Drinking Water Distribution System Simulator.

Authors:  Vicente Gomez-Alvarez; Stacy Pfaller; Randy P Revetta
Journal:  Genome Announc       Date:  2016-03-31

10.  eggNOG 4.5: a hierarchical orthology framework with improved functional annotations for eukaryotic, prokaryotic and viral sequences.

Authors:  Jaime Huerta-Cepas; Damian Szklarczyk; Kristoffer Forslund; Helen Cook; Davide Heller; Mathias C Walter; Thomas Rattei; Daniel R Mende; Shinichi Sunagawa; Michael Kuhn; Lars Juhl Jensen; Christian von Mering; Peer Bork
Journal:  Nucleic Acids Res       Date:  2015-11-17       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.