| Literature DB >> 33561243 |
Julie Zaworski1, Colleen McClung1, Cristian Ruse1, Peter R Weigele1, Roger W Hendrix2, Ching-Chung Ko2, Robert Edgar3, Graham F Hatfull2, Sherwood R Casjens4,5, Elisabeth A Raleigh1.
Abstract
Bacteriophage L, a P22-like phage of Salmonella enterica sv Typhimurium LT2, was important for definition of mosaic organization of the lambdoid phage family and for characterization of restriction-modification systems of Salmonella. We report the complete genome sequences of bacteriophage L cI-40 13-am43 and L cII-101; the deduced sequence of wildtype L is 40,633 bp long with a 47.5% GC content. We compare this sequence with those of P22 and ST64T, and predict 72 Coding Sequences, 2 tRNA genes and 14 intergenic rho-independent transcription terminators. The overall genome organization of L agrees with earlier genetic and physical evidence; for example, no secondary immunity region (immI: ant, arc) or known genes for superinfection exclusion (sieA and sieB) are present. Proteomic analysis confirmed identification of virion proteins, along with low levels of assembly intermediates and host cell envelope proteins. The genome of L is 99.9% identical at the nucleotide level to that reported for phage ST64T, despite isolation on different continents ∼35 years apart. DNA modification by the epigenetic regulator Dam is generally incomplete. Dam modification is also selectively missing in one location, corresponding to the P22 phase-variation-sensitive promoter region of the serotype-converting gtrABC operon. The number of sites for SenLTIII (StySA) action may account for stronger restriction of L (13 sites) than of P22 (3 sites).Entities:
Keywords: lambdoid phage; methylation state; mosaic genome; phage tRNA; restriction indicator; virion composition
Year: 2021 PMID: 33561243 PMCID: PMC8022706 DOI: 10.1093/g3journal/jkaa037
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Nucleotide sequence differences among phage L strains and phage ST64T
| L | L | L | ST64T nucleotide | L Gene (function) | Difference details |
|---|---|---|---|---|---|
| 6,512 | A | G | G | orf186 | Difference in codon 38; Ile in L |
| 21,362 | A | G | G | eaC | Synonymous difference in codon 193 |
| 23,697 | A | G | G | orf87 | Synonymous difference in codon 84 |
| 24,146 | A | G | G | orf91 | Synonymous difference in codon 23 |
| 24,823 | ΔΤ | ΔΤ | T | orf109/eaD | One T insertion frameshift in codon 59 |
| 27,443 | A | G | G | 17 | Synonymous difference in codon 96 |
| 28,342 | Δ | Δ | 15 bp | orf232 | ST64T has one more 15 bp repeat than L (15 imperfect 5 AA repeats in ST64T) |
| 30,260 | – | C insert | – | cII (repressor; RAST) | 1 C insertion in run of C's at codon 46 is the |
| 30,877 | G | A | A | cI (activator) | G is |
| 38,322 | T | C | C | 13 (holin) | T is |
Column 1: coordinate of the mutations in L cI 40 13am43. Columns 2, 3, and 4: nucleotide at the coordinate in L cI -40 13 43, L cII 101, and ST64T genomes respectively. Column 4: gene annotation in which the mutation is located. Column 5: consequences of the observed mutations.
Figure 1Restriction analysis of the L genome. Right panel: L genome structure was verified by restriction digestion analysis as follows: 1 μg of DNA in a 50-μl reaction was incubated for 1 h at 37˚C with each of three enzymes separately (EcoRI-HF, SapI, and PvuII-HF; New England Biolabs), and run on a 0.8% agarose electrophoresis gel with the 1 kb Extend Ladder (New England Biolabs). Asterisks (*) denote fragments with one end cut by the headful packaging. Left panel: in silico fragmentpredictions for the circular genome).
Figure 2Phage L genome organization. Linear representation of the genome. The CDS are represented by arrows, with color differentiating functional modules or regions. tRNA positions are marked by black arrows. Transcription terminators were predicted in silico; those blocking rightward are above the line and those blocking leftward transcription are below the line.
Figure 3Comparison of phage L and P22 genomes. The top panel displays the Mauve alignment of whole L and P22 genomes with coding sequences of P22 represented below. Pink regions of the Mauve alignment correspond to significantly similar stretches while white regions correspond to low homology areas. Insets A, B, C, and D display MAFFT-aligned closeups of low homology areas boxed above. For each inset, top row: the annotated CDSs of P22 are aligned with the second row, P22 reference sequence; third row, nucleotide identity between P22 and L; fourth row: L reference sequence (determined here). Black ticks in each genome reference row correspond to aligned segments, white ticks to gaps. The identity row is vertically scaled to the degree of nucleotide identity of the aligned sequences. 100% identity is dark green, charter use is shorter with lower identity, and red represents the lowest identity, including gapped regions.
S. enterica serovar Typhimurium methyl transferase recognition sites present in the L genome
| Systems | Motif | Type | L Sites | P22 sites |
|---|---|---|---|---|
| M.SenLT2I =StyLT | 5′ CAG*AG 3′ | m6A | 44 (42) | 47 |
| M.SenLT2III = StySA | 5′ GATC*AG 3′ | m6A | 13 (13) | 3 |
| M.SenLT2II = StySB |
5′ G*AGNNNNNNRTAYG 3′ 3′ CTCNNNNNNY*ATRC 5′ | m6A | 1 (1) | 1 |
| M.Dam | 5′ G*ATC 3′ | m6A | 73 (65) | 62 |
| M.Dcm | 5′ C*CWGG 3′ | (m5C) | 59 (n.d.) | 54 |
| M.SenLT2IV | 5′ ATGC*AT 3′ | m6A | 9 (0) | 6 |
Column 1: the MTase name used in REBASE with the name most encountered in the literature. Column 2: recognition sequence, in which symmetric sites are shown once; asymmetric sites methylated only on one strand are shown once; asymmetric sites methylated on both strands are shown twice. Column 3: methylation type. Columns 4 and 5: the number of methylation motif sequences present in L and P22 genomes. In parenthesis for L is the number of methylated sites observed by the PacBio sequence analysis; N.D. (not detected), m5C was not determined by this approach.