Literature DB >> 18710568

Genome sequence of the Bacteroides fragilis phage ATCC 51477-B1.

Shawn A Hawkins1, Alice C Layton, Steven Ripp, Dan Williams, Gary S Sayler.   

Abstract

The genome of a fecal pollution indicator phage, Bacteroides fragilis ATCC 51477-B1, was sequenced and consisted of 44,929 bases with a G+C content of 38.7%. Forty-six putative open reading frames were identified and genes were organized into functional clusters for host specificity, lysis, replication and regulation, and packaging and structural proteins.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18710568      PMCID: PMC2535602          DOI: 10.1186/1743-422X-5-97

Source DB:  PubMed          Journal:  Virol J        ISSN: 1743-422X            Impact factor:   4.099


Findings

Bacteriophages infecting Escherichia coli and Bacteroides fragilis serve as fecal pollution indicators (1). Bacteroides phages are more attractive fecal indicators than E. coli because Bacteroides are more abundant in fecal matter, provide higher host specificity, and are anaerobic and thus less likely to reproduce in aquatic environments than E. coli [1-4]. Phage ATCC 700786-B1, infecting B. fragilis RYC2056, serves as an ISO standard reference phage, but the host is also susceptible to phages in other animal feces [5,6]. Phage ATCC 51477-B1 infects the host B. fragilis HSP40, which is reported to be only susceptible to phages in human feces and surface water polluted with municipal/septic wastewater [7]. A drawback to using Bacteroides phages as water quality indicators results from the requirement to cultivate them on an anaerobic host [6]. This can be circumvented with direct phage detection via PCR [8], but assay design is difficult because only one gene for one Bacteroides bacteriophage is publically available. The lack of ability to detect Bacteroides bacterial phage is startlingly inadequate because the host genera is dominant in the human gut and contains important antibiotic resistant pathogens [3,9]. In this study, the genome of phage ATCC 51477-B1 was sequenced to promote fecal source tracking assay development and aid construction of a B. fragilis phage bioreporter [10]. Phage ATCC 51477-B1 was propagated on B. fragilis HSP40 (ATCC 51477) at 37°C in anaerobic Bacteroides phage recovery medium (BPRM) amended with kanamycin, nalidixic acid, bile salts, and Oxyrase (Oxyrase; Mansfield, OH) [11,12]. Phage were purified and concentrated from lysate using polyethylene glycol [13] and displayed a non-elongated icosohedral head and non-contractile tail consistent with the family Siphoviridae and other B. fragilis phages isolated from municipal wastewater (Figure 1) [14]. Structural dimensions were similar to B. fragilis phage B40-8 with respect to head diameter (60 ± 4.0 μm) and tail length (162 ± 17.0 μm), but the tail diameter was greater (13.4 ± 1.1 μm versus 9.3 ± 0.4 μm) [13].
Figure 1

Transmission electron micrographs of phage ATCC 51477. (A) Magnified view of a single phage (bar = 50 nm). (B) Four additional phage all displaying tail fibers (bar = 200 nm).

Transmission electron micrographs of phage ATCC 51477. (A) Magnified view of a single phage (bar = 50 nm). (B) Four additional phage all displaying tail fibers (bar = 200 nm). Phage DNA was extracted with Lambda minipreps (Promega, Madison, WI) and digested with HindIII. Several HindIII restriction fragments were cloned into pUC19 [15] and sequenced using primer walking. PCR reactions bridging the cloned HindIII restriction fragments were cloned and sequenced to confirm fragment order. Non-redundant areas of the genome that were difficult to clone were directly sequenced. In total, dideoxynucleotide sequencing provided 2× coverage except at the 5' and 3' ends. Confirmation of sequence data was sought by GS FLX pyrosequencing (MWG Biotech, Inc., High Point, NC) which produced 23,263 reads, 58,730 base calls, and assembly of 62 contigs using the GS De Novo Assembler. Thirty eight of the contigs with more than 3 reads were co-assembled with the dideoxynucleotide sequences using DNASTAR Lasergene 7.1 software suite (DNASTAR, Inc., Madison, WI). A total of 21,935 pyrosequencing reads in 27 contigs were maintained in the final assembly which aligned at 91% similarity for 100× coverage of the phage genome, including the 5' and 3' ends. The final assembled genome was 44,929 bp with a 38.7% G+C content (Figure 2).
Figure 2

Phage ATCC 51477-B1 genome map. The direction and size of 46 putative ORFs are illustrated with arrows. ORFs labeled in green display high similarity to proteins with known function. ORFs labeled in red display high similarity to other putative open reading frames without assigned functions. ORFs labeled in blue have low similarity (< 0.1) to other putative open reading frames. Sequences with high DNA similarity to the phage B40-8 structural proteins are shown in yellow. Putative B. fragilis promoters are shown as inverted orange triangles, the location of repeats are shown as purple boxes, and polymorphisms are indicated with black lines.

Phage ATCC 51477-B1 genome map. The direction and size of 46 putative ORFs are illustrated with arrows. ORFs labeled in green display high similarity to proteins with known function. ORFs labeled in red display high similarity to other putative open reading frames without assigned functions. ORFs labeled in blue have low similarity (< 0.1) to other putative open reading frames. Sequences with high DNA similarity to the phage B40-8 structural proteins are shown in yellow. Putative B. fragilis promoters are shown as inverted orange triangles, the location of repeats are shown as purple boxes, and polymorphisms are indicated with black lines. Open reading frames (ORFs) were identified using GeneMark [16,17] and the NCBI ORF Finder and examined for known protein functions, structures, and motifs using a conserved domain database [18] (Additional file 1, Figure 2). Ten of the 46 putative ORFs contained amino acid sequences with predictable functions or motifs (Additional file 1). The 5' ends of three ORFs (ORF39, ORF40, and ORF43) displayed translated similarity to previously published N-terminal amino acid sequences for B. fragilis phage B40-8 head (MP1 and MP3) and tail (MP2) proteins at 100, 80, and 90% similarity, respectively [13]. The B40-8 MP2 gene (the only B. fragilis phage gene present in GenBank), was present in ATCC 51477-B1 but contained a 119 bp insertion and was split into three misarranged sections, one of which was inverted (Figure 2). This suggested the putative ATCC 51477-B1 tail protein was chimeric with respect to B40-8. The ATCC 51477-B1 genome contained gene functional clusters for host specificity (tail fiber), lysis, replication and regulation, and packaging and structural proteins (Figure 2) [19]. The host specificity region contained ORF7 with translated similarity to the tail fiber of Enterococcus phage phiEF24C (Additional file 1) and a large size (1,897 amino acids) suggesting it may be a tape measure gene [20]. A putative M-15 type peptidase (ORF10) was the only gene clearly linked with the lysis region. The replication and regulation region included phage anti-repressors (ORF22 and ORF26), DNA modifying enzymes (ORF15 and ORF16), and replication proteins (ORF22, ORF31, and ORF32). Within the packaging and structural cluster, an ORF with similarity to the TerL protein was identified (ORF 38), as well as ORFs with N-terminal sequences similar to the three phage B40-8 structural proteins previously mentioned (ORF39, ORF40, and ORF43) [13]. Four potential promoters were identified on the positive strand of the genome based on similarity to promoters found in [21]. Two were in the regulation and replication region, 5' of ORFs 17 and 22. The other two potential promoters were near the beginning of the genome, 5' of ORF2, and in the lysis region, 5' of ORF12. A tandem repeats finder [22] revealed a 23 bp repeat within an 83 bp segment having two perfect and two degenerate repeats in the replication and regulation region. Another 96 bp perfect repeat sequence was identified at the beginning of the phage genome and again at the end of the replication and regulation region (Figure 2). Eight polymorphic regions occurring within ORFs and displaying sequence variability from 4% to 13% were identified with the extensive pyrosequencing data. These regions ranged in size from 200 bp to greater than 1,000 bp. ORF7, the putative tail fiber protein, displayed the most polymorphisms, including a pyrosequencing contig with a deletion relative to the final assembly. Tail fiber variability modifies the phage host range [23] and may, for this phage, reflect the antigenic variability of B. fragilis surface components [21,24]. The sequence for B. fragilis phage ATCC 51477-B1 was deposited in GenBank with accession number FJ008913.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

SAH cloned and sequenced the genome by primer walking, assisted in the pyrosequence data analysis and annotation, and drafted the manuscript. ACL analyzed pyrosequencing data, annotated the genome, and assisted in drafting the manuscript. SR organized the study and provided final editing for the manuscript. DW provided genome cloning. GSS participated in the study design and coordination. All authors read and approved the final manuscript.

Additional file 1

Phage ATCC 51477-B1 genome organization. Click here for file
  18 in total

1.  Heuristic approach to deriving models for gene finding.

Authors:  J Besemer; M Borodovsky
Journal:  Nucleic Acids Res       Date:  1999-10-01       Impact factor: 16.971

2.  Culture and decontamination methods affecting enumeration of phages infecting Bacteroides fragilis in sewage.

Authors:  C Tartera; R Araujo; T Michel; J Jofre
Journal:  Appl Environ Microbiol       Date:  1992-08       Impact factor: 4.792

3.  Tandem repeats finder: a program to analyze DNA sequences.

Authors:  G Benson
Journal:  Nucleic Acids Res       Date:  1999-01-15       Impact factor: 16.971

4.  Optimisation and standardisation of a method for detecting and enumerating bacteriophages infecting Bacteroides fragilis.

Authors:  R Araujo; M Muniesa; J Méndez; A Puig; N Queralt; F Lucena; J Jofre
Journal:  J Virol Methods       Date:  2001-04       Impact factor: 2.014

5.  Genomic structure of phage B40-8 of Bacteroides fragilis.

Authors:  Montserrat Puig; Rosina Gironés
Journal:  Microbiology       Date:  1999-07       Impact factor: 2.777

6.  Use of 16S rRNA gene-targeted group-specific primers for real-time PCR analysis of predominant bacteria in human feces.

Authors:  Takahiro Matsuki; Koichi Watanabe; Junji Fujimoto; Toshihiko Takada; Ryuichiro Tanaka
Journal:  Appl Environ Microbiol       Date:  2004-12       Impact factor: 4.792

7.  Tyrosine site-specific recombinases mediate DNA inversions affecting the expression of outer surface proteins of Bacteroides fragilis.

Authors:  Katja G Weinacht; Hazeline Roche; Corinna M Krinos; Michael J Coyne; Julian Parkhill; Laurie E Comstock
Journal:  Mol Microbiol       Date:  2004-09       Impact factor: 3.501

8.  Bacteriophages active against Bacteroides fragilis in sewage-polluted waters.

Authors:  C Tartera; J Jofre
Journal:  Appl Environ Microbiol       Date:  1987-07       Impact factor: 4.792

9.  Homogeneity of the morphological groups of bacteriophages infecting Bacteroides fragilis strain HSP40 and strain RYC2056.

Authors:  Nuria Queralt; Joan Jofre; Rosa Araujo; Maite Muniesa
Journal:  Curr Microbiol       Date:  2003-03       Impact factor: 2.188

10.  CDD: a Conserved Domain Database for protein classification.

Authors:  Aron Marchler-Bauer; John B Anderson; Praveen F Cherukuri; Carol DeWeese-Scott; Lewis Y Geer; Marc Gwadz; Siqian He; David I Hurwitz; John D Jackson; Zhaoxi Ke; Christopher J Lanczycki; Cynthia A Liebert; Chunlei Liu; Fu Lu; Gabriele H Marchler; Mikhail Mullokandov; Benjamin A Shoemaker; Vahan Simonyan; James S Song; Paul A Thiessen; Roxanne A Yamashita; Jodie J Yin; Dachuan Zhang; Stephen H Bryant
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

View more
  15 in total

1.  Twelve previously unknown phage genera are ubiquitous in global oceans.

Authors:  Karin Holmfeldt; Natalie Solonenko; Manesh Shah; Kristen Corrier; Lasse Riemann; Nathan C Verberkmoes; Matthew B Sullivan
Journal:  Proc Natl Acad Sci U S A       Date:  2013-07-15       Impact factor: 11.205

Review 2.  Horizontal gene transfers with or without cell fusions in all categories of the living matter.

Authors:  Joseph G Sinkovics
Journal:  Adv Exp Med Biol       Date:  2011       Impact factor: 2.622

3.  Tunable Expression Tools Enable Single-Cell Strain Distinction in the Gut Microbiome.

Authors:  Weston R Whitaker; Elizabeth Stanley Shepherd; Justin L Sonnenburg
Journal:  Cell       Date:  2017-04-20       Impact factor: 41.582

4.  Complete genome sequence of Croceibacter bacteriophage P2559S.

Authors:  Ilnam Kang; Dongmin Kang; Jang-Cheon Cho
Journal:  J Virol       Date:  2012-08       Impact factor: 5.103

5.  Engineering Phage Host-Range and Suppressing Bacterial Resistance through Phage Tail Fiber Mutagenesis.

Authors:  Kevin Yehl; Sébastien Lemire; Andrew C Yang; Hiroki Ando; Mark Mimee; Marcelo Der Torossian Torres; Cesar de la Fuente-Nunez; Timothy K Lu
Journal:  Cell       Date:  2019-10-03       Impact factor: 41.582

6.  Bacteroides thetaiotaomicron-Infecting Bacteriophage Isolates Inform Sequence-Based Host Range Predictions.

Authors:  Andrew J Hryckowian; Bryan D Merrill; Nathan T Porter; William Van Treuren; Eric J Nelson; Rebecca A Garlena; Daniel A Russell; Eric C Martens; Justin L Sonnenburg
Journal:  Cell Host Microbe       Date:  2020-07-10       Impact factor: 21.023

7.  Novel Siphoviridae Bacteriophages Infecting Bacteroides uniformis Contain Diversity Generating Retroelement.

Authors:  Stina Hedžet; Maja Rupnik; Tomaž Accetto
Journal:  Microorganisms       Date:  2021-04-21

8.  Comparative (meta)genomic analysis and ecological profiling of human gut-specific bacteriophage φB124-14.

Authors:  Lesley A Ogilvie; Jonathan Caplin; Cinzia Dedi; David Diston; Elizabeth Cheek; Lucas Bowler; Huw Taylor; James Ebdon; Brian V Jones
Journal:  PLoS One       Date:  2012-04-25       Impact factor: 3.240

9.  Genome signature-based dissection of human gut metagenomes to extract subliminal viral sequences.

Authors:  Lesley A Ogilvie; Lucas D Bowler; Jonathan Caplin; Cinzia Dedi; David Diston; Elizabeth Cheek; Huw Taylor; James E Ebdon; Brian V Jones
Journal:  Nat Commun       Date:  2013       Impact factor: 14.919

Review 10.  The human gut virome: a multifaceted majority.

Authors:  Lesley A Ogilvie; Brian V Jones
Journal:  Front Microbiol       Date:  2015-09-11       Impact factor: 5.640

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.