Karlene H Lynch, Paul Stothard, Jonathan J Dennis.
Abstract
BACKGROUND: Genomic analysis of bacteriophages infecting the Burkholderia cepacia complex (BCC) is an important preliminary step in the development of a phage therapy protocol for these opportunistic pathogens. The objective of this study was to characterize KL1 (vB_BceS_KL1) and AH2 (vB_BceS_AH2), two novel Burkholderia cenocepacia-specific siphoviruses isolated from environmental samples.
Year: 2012 PMID: 22676492 PMCID: PMC3483164 DOI: 10.1186/1471-2164-13-223
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1. Development and morphology of KL1 and AH2 plaques. Phages were plated in half-strength Luria-Bertani (½ LB) agar overlays with a 16 h liquid culture of Burkholderia cenocepacia C6433. Plates were incubated at 30°C or 37°C and photographed after 16, 24, and 48 h. C6433 30°C plates (center) are representative of growth at both 30°C and 37°C.
Figure 2. KL1 (A) and AH2 (B) virion morphology. Phages were stained with 2% phosphotungstic acid and visualized at 180,000-fold magnification by transmission electron microscopy. Scale bars represent 50 nm.
Figure 3. RFLP analysis of KL1 and AH2 genomic DNA. 5 μg of genomic DNA were digested overnight with EcoRI and separated on a 0.8% agarose gel. The DNA in the ambient gel (left) was not heated, while the DNA in the 80°C gel (right) was incubated 20 min at 80°C and chilled on ice prior to loading. Arrows indicate bands containing cos site DNA. L: 1 Kb Plus DNA Ladder (Invitrogen).
Figure 4. Circos plots of KL1 and AH2 PROmer comparisons. Green ribbons indicate regions of similarity between two genomes at the protein level. Each region is on the same strand in both genomes. The scale (in kbp) is shown on the periphery of the plots. PROmer parameters: breaklen = 60, maxgap = 30, mincluster = 20, minmatch = 6. A) KL1/AH2 comparison; B) KL1/Pseudomonas phage 73 (PA73) comparison; C) AH2/Burkholderia phage BcepNazgul comparison.
Figure 5. Genome maps of KL1 and AH2. Genes transcribed in the forward direction are shown above and those transcribed in the reverse direction are shown below. The scale (in kbp) is shown below the maps. Legend: light blue, lysis; purple, capsid morphogenesis and DNA packaging; pink, tail morphogenesis; red, DNA binding; green, MazG; gray, unknown function.
KL1 genome annotation
| Gene | Start | End | Putative function | Strand | Predicted ribosome binding site and start codon | Length (amino acids) | Closest relative | Alignment region (amino acids) | Percent identity | Source | GenBank accession number |
|---|---|---|---|---|---|---|---|---|---|---|---|
| | 1 | 267 | unknown | + | AGGGGCGAActtcgtATG | 88 | hypothetical protein ORF001 | 1-84/84 | 77 | | YP_001293408.1 |
| | 264 | 560 | holin | + | AAAGGGGCGGtaacGTG | 98 | hypothetical protein ORF002 | 3-88/88 | 42 | | YP_001293409.1 |
| | 514 | 1080 | lysin | + | AAAAGGGGttatcgaATG | 188 | hypothetical protein bglu_1g27070 | 2-181/188 | 47 | | YP_002912484.1 |
| | 1091 | 1408 | Rz | + | AAGTAAGGGGttcgaaATG | 105 | hypothetical protein ORF004 | 1-101/101 | 37 | | YP_001293411.1 |
| | 1329 | 1592 | Rz1 | + | GAAAGGtgccgccgATG | 87 | conserved hypothetical protein | 1-79/86 | 40 | | ZP_06842908.1 |
| | 1647 | 2138 | unknown | + | ACTAGGccgcgattATG | 163 | hypothetical protein ORF005 | 1-162/162 | 59 | | YP_001293412.1 |
| | 2116 | 3756 | terminase large subunit | + | AACAGGAAttgcttaATG | 546 | hypothetical protein ORF006 | 10-531/531 | 84 | | YP_001293413.1 |
| | 3770 | 5266 | portal protein | + | AAAGGAAAcgaaatcATG | 498 | hypothetical protein ORF007 | 3-494/501 | 85 | | YP_001293414.1 |
| | 5269 | 6384 | head morphogenesis protein | + | GGGGCGTAatcATG | 371 | hypothetical protein ORF008 | 1-364/364 | 73 | | YP_001293415.1 |
| | 6403 | 7110 | unknown | + | AAGGAGtccttgaaATG | 235 | hypothetical protein ORF009 | 1-235/239 | 82 | | YP_001293416.1 |
| | 7123 | 8097 | major capsid protein | + | AAGGAcactttatcATG | 324 | hypothetical protein ORF010 | 1-325/325 | 90 | | YP_001293417.1 |
| | 8171 | 8587 | unknown | + | AAGGAGtttcgaacATG | 138 | hypothetical protein ORF011 | 1-134/134 | 69 | | YP_001293418.1 |
| | 8656 | 9033 | unknown | + | AAAGGAGcgtcgaacATG | 125 | hypothetical protein ORF012 | 1-123/123 | 70 | | YP_001293419.1 |
| | 9047 | 9565 | unknown | + | AAGGGGcgcggcatcATG | 172 | hypothetical protein ORF013 | 1-172/172 | 83 | | YP_001293420.1 |
| | 9570 | 9944 | head-tail joining protein | + | GATAAGGGtctaacgctATG | 124 | hypothetical protein ORF014 | 1-124/126 | 59 | | YP_001293421.1 |
| | 9941 | 10399 | minor tail protein | + | ATACGGTAttgttcgcacaATG | 152 | hypothetical protein ORF015 | 5-151/151 | 68 | | YP_001293422.1 |
| | 10412 | 11965 | unknown | + | AAGGAGttacgaaaATG | 517 | hypothetical protein ORF016 | 3-511/511 | 78 | | YP_001293423.1 |
| | 12030 | 12458 | tail protein | + | GGAGTAAAccaaATG | 142 | hypothetical protein ORF017 | 1-142/142 | 79 | | YP_001293424.1 |
| | 12030 | 12823 | tail protein | + | GGAGTAAAccaaATG | 264 | hypothetical protein ORF017 | 1-142/142 | 79 | | YP_001293424.1 |
| | | | | | | | hypothetical protein ORF018 | 1-118/118 | 78 | | YP_001293425.1 |
| | 12792 | 13226 | tail protein | + | AAAAGGCGGcgcaacagaATG | 144 | hypothetical protein ORF019 | 1-144/144 | 80 | | YP_001293426.1 |
| | 13232 | 17050 | tail tape measure | + | AAGGAttagcagaaATG | 1272 | hypothetical protein ORF020 | 1-78, 131-1202/1204 | 61, 57 | | YP_001293427.1 |
| | 17069 | 18067 | unknown | + | AGGAAtacgaattATG | 332 | hypothetical protein XALc_0225 | 1-295/307 | 30 | | YP_003374757.1 |
| | 18070 | 19179 | unknown | + | GAGGAAAActaatcATG | 369 | hypothetical protein ORF033 | 1-332/333 | 25 | | YP_001294541.1 |
| | 19179 | 20870 | tail assembly protein | + | AAGAAGAtcgcataATG | 563 | hypothetical protein ORF023 | 63-565/568 | 36 | | YP_001293430.1 |
| | 20867 | 21688 | tail assembly protein | + | AAGGAcgattccagaATG | 273 | hypothetical protein ORF024 | 1-273/274 | 49 | | YP_001293431.1 |
| | 21689 | 24100 | tail assembly protein | + | AAGATGGGGtcggttaaATG | 803 | hypothetical protein ORF025 | 1-755/813 | 49 | | YP_001293432.1 |
| | 24097 | 26166 | DNA polymerase | - | AAGGAAtttgcccgATG | 689 | hypothetical protein ORF026 | 1-682/683 | 83 | | YP_001293433.1 |
| | 26179 | 27339 | DNA polymerase III β subunit | - | AAGGGGttaaaaATG | 386 | hypothetical protein ORF027 | 2-380/380 | 74 | | YP_001293434.1 |
| | 27323 | 27691 | unknown | - | GAATGGtgaaattATG | 122 | hypothetical protein Dole_2913 | 5-84/87 | 33 | | YP_001530793.1 |
| | 27696 | 29351 | superfamily II helicase/restriction enzyme | - | AAGGGttacgaATG | 551 | hypothetical protein ORF029 | 1-551/551 | 90 | | YP_001293436.1 |
| | 29344 | 30342 | exonuclease | - | GGAAGGcgaagaacgATG | 332 | hypothetical protein ORF030 | 1-365/365 | 65 | | YP_001293437.1 |
| | 30852 | 31637 | unknown | - | GAAAGGtgaaacgaacATG | 261 | hypothetical protein Isop_2441 | 1-118/151 | 37 | | YP_004179564.1 |
| | 31696 | 32412 | recombinase | - | AGGTGAAcgtATG | 238 | hypothetical protein ORF032 | 1-238/238 | 91 | | YP_001293439.1 |
| | 32471 | 32980 | unknown | - | AAGGAAccccaaaATG | 169 | hypothetical protein ORF033 | 7-146/146 | 49 | | YP_001293440.1 |
| | 33059 | 33598 | pyrophosphohydrolase | - | AGGGGcatcgtATG | 179 | hypothetical protein ORF034 | 8-185/185 | 69 | | YP_001293441.1 |
| | 33746 | 33934 | transcriptional regulator | + | GGGGcaagcATG | 62 | hypothetical protein ORF035 | 1-61/62 | 51 | | YP_001293442.1 |
| | 33924 | 36233 | primase | + | GAAGGcttgcgcaaatATG | 769 | hypothetical protein ORF036 | 1-773/773 | 85 | | YP_001293443.1 |
| | 36366 | 36668 | unknown | + | GAAGGAgttacgaacATG | 100 | hypothetical protein | 132-217/217 | 44 | | YP_003359005.1 |
| | 36735 | 37091 | unknown | + | GAAGGAGtacacgccATG | 118 | unnamed protein product | 262-336/404 | 32 | | YP_004974060.1 |
| | 37097 | 37360 | unknown | + | AGAAGAAGGAGtaagcgccATG | 87 | PREDICTED: photosystem II reaction center PSB28 protein, chloroplastic | 22-86/179 | 32 | | XP_002271666.1 |
| | 37728 | 38024 | unknown | + | AAAGGAGcgccagccATG | 98 | hypothetical protein ORF039 | 1-97/98 | 70 | | YP_001293446.1 |
| | 38060 | 38296 | unknown | + | AAGGAAccccgatcATG | 78 | hypothetical protein ORF040 | 1-80/80 | 50 | | YP_001293447.1 |
| | 38302 | 38703 | unknown | + | AAAGGGGtaattactATG | 133 | hypothetical protein ORF042 | 1-120/124 | 40 | | YP_001293449.1 |
| | 38707 | 39195 | Vsr endonuclease | + | GACGAAGttgcattaagccATG | 162 | hypothetical protein ORF043 | 1-176/179 | 61 | | YP_001293450.1 |
| | 39201 | 39458 | unknown | + | GGAAGGAGtaacccaaATG | 85 | hypothetical protein Astex_0306 | 3-81/183 | 44 | | YP_004086155.1 |
| | 39455 | 39655 | unknown | + | GGCGAAGtcgtcgaATG | 66 | monooxygenase, FAD-binding | 385-445/546 | 38 | | ZP_07309792.1 |
| | 39652 | 39840 | unknown | + | AAGGAGtacgcaccATG | 62 | hypothetical protein METUNv1_00516 | 11-65/68 | 39 | | ZP_08503515.1 |
| | 39882 | 40154 | unknown | + | AAAAGGAGtaacgaacATG | 90 | hypothetical protein Cflav_PD2164 | 58-133/172 | 30 | bacterium Ellin514 | ZP_03630603.1 |
| | 40138 | 40374 | unknown | + | GAACCGGAttacgattATG | 78 | hypothetical protein ORF047 | 2-77/77 | 67 | | YP_001293454.1 |
| | 40374 | 40550 | unknown | + | GGGTTAcgaataATG | 58 | hypothetical protein Glaag_3667 | 90-140/227 | 29 | | YP_004435864.1 |
| | 40562 | 40933 | unknown | + | GAAAGGtgaaatcATG | 123 | hypothetical protein BURMUCGD2M_4586 | 8-67/70 | 34 | | ZP_03569237.1 |
| | 40930 | 41415 | dCMP deaminase | + | GGAACGtccggcATG | 161 | hypothetical protein ORF049 | 2-153/155 | 75 | | YP_001293456.1 |
| | 41412 | 41786 | unknown | + | AAAGGctgaatcATG | 124 | hypothetical protein ORF050 | 4-125/127 | 43 | | YP_001293457.1 |
| | 41826 | 42032 | unknown | + | GGGGAtgcccacattATG | 68 | hypothetical protein ORF051 | 37-94/94 | 45 | | YP_001293458.1 |
| | 42120 | 42674 | unknown | + | AAGGAGttttacaaATG | 184 | hypothetical protein ORF052 | 9-190/190 | 66 | | YP_001293459.1 |
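
The Start, End, and Length columns can be cross-checked arithmetically. The following is a minimal Python sketch, assuming coordinates are 1-based and inclusive and that each span includes the stop codon; the table does not state this convention, but the listed values are consistent with it. The function name `expected_length` is hypothetical. A span whose nucleotide count leaves a remainder of 2 modulo 3 is treated here as consistent with a single -1 frameshift, which matches the 12030 to 12823 tail protein row.

```python
def expected_length(start, end):
    """Return (amino-acid length, note) for a gene spanning start..end.

    Assumes 1-based inclusive coordinates and a span that ends with a stop codon.
    """
    nt = end - start + 1
    remainder = nt % 3
    if remainder == 0:
        # Whole number of codons; the final codon is the stop and is not counted.
        return nt // 3 - 1, "in frame"
    if remainder == 2:
        # A -1 slip re-reads one base, so one extra nucleotide is effectively translated.
        return (nt + 1) // 3 - 1, "consistent with a -1 frameshift"
    return None, "span not explained by this sketch"


# (start, end, listed length) copied from four rows of the KL1 table.
rows = [
    (1, 267, 88),
    (264, 560, 98),
    (12030, 12458, 142),
    (12030, 12823, 264),
]

for start, end, listed in rows:
    length, note = expected_length(start, end)
    status = "matches" if length == listed else "differs"
    print(f"{start}-{end}: listed {listed}, computed {length} ({note}; {status})")
```

The same check applies to reverse-strand genes, since only the span length is used.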
AH2 genome annotation
| Gene | Start | End | Putative function | Strand | Predicted ribosome binding site and start codon | Length (amino acids) | Closest relative | Alignment region (amino acids) | Percent identity | Source | GenBank accession number |
|---|---|---|---|---|---|---|---|---|---|---|---|
| | 619 | 1035 | unknown | - | AAGGAAAcgacATG | 138 | hypothetical protein Nazgul32 | 12-130/130 | 29 | | NP_918966.1 |
| | 1073 | 1423 | unknown | - | AGGGGGGAAcggccATG | 116 | conserved hypothetical protein | 1-116/116 | 72 | | ZP_03586942.1 |
| | 1501 | 1818 | unknown | - | GGATTActgaccATG | 105 | family 2 glycosyl transferase | 292-387/387 | 32 | | YP_003404522.1 |
| | 1809 | 2024 | unknown | + | GAGAAAtagagATG | 71 | mobilization protein mbeA | 190-237/325 | 37 | | EFZ49597.1 |
| | 2021 | 2578 | unknown | - | AGGGGttacatcATG | 185 | hypothetical protein Nazgul06 | 88-158/330 | 44 | | NP_919015.1 |
| | 2728 | 2877 | unknown | - | AGGTGcaaaaATG | 49 | hypothetical protein BoklE_20935 | 6-38/38 | 48 | | ZP_02357945.1 |
| | 2874 | 3002 | unknown | - | AGGGGcgatcATG | 42 | polysaccharide deacetylase | 21-60/287 | 35 | | ZP_04156726.1 |
| | 3071 | 3325 | unknown | - | AAAGAgctATG | 84 | major facilitator superfamily MFS_1 | 131-209/467 | 37 | | YP_004349464.1 |
| | 3322 | 3579 | unknown | - | GGAGTAtccgccATG | 85 | hypothetical protein Plabr_1809 | 308-361/603 | 31 | | YP_004269441.1 |
| | 3663 | 3911 | unknown | - | GGGGGTAtgacATG | 82 | HAD-superfamily hydrolase | 70-119/268 | 38 | | YP_002465429.1 |
| | 3913 | 4314 | unknown | - | AGGGGGAGtaacggccATG | 133 | hypothetical protein Nazgul09 | 1-129/141 | 59 | | NP_919018.1 |
| | 4320 | 4805 | unknown | - | AGGGGttacatcATG | 161 | hypothetical protein Nazgul10 | 1-151/160 | 74 | | NP_919019.2 |
| | 4846 | 5454 | unknown | - | AAAAAGGGGtttttgacATG | 202 | 194 gene product | 101-187/188 | 43 | | YP_004894001.1 |
| | 6021 | 6302 | unknown | + | AAGGAGcaatcATG | 93 | hypothetical protein Nazgul13 | 3-93/93 | 41 | | NP_919022.1 |
| | 6311 | 6550 | unknown | + | AGGCGGtcgtATG | 79 | hypothetical protein BDB_mp60418 | 1-67/67 | 45 | blood disease bacterium R229 | CCA83252.1 |
| | 6707 | 7015 | unknown | + | ACACGAcaccATG | 102 | hypothetical protein MC7420_4162 | 43-84/88 | 45 | | ZP_05027813.1 |
| | 7012 | 7218 | unknown | + | GAAGGtgccggcATG | 68 | hypothetical protein Cy51472DRAFT_4929 | 53-81/152 | 45 | | ZP_08976132.1 |
| | 7215 | 8069 | unknown | + | AGGAAAGgaaATG | 284 | hypothetical protein TK90_2682 | 5-175/177 | 45 | | YP_003494636.1 |
| | 8123 | 8407 | unknown | + | GAGAAGGcacacacATG | 94 | GTP-binding protein | 150-232/1016 | 29 | | AAX07516.1 |
| | 8499 | 9128 | DNA polymerase III β subunit | + | GAACGGTGAGcttATG | 209 | hypothetical protein Nazgul21 | 24-216/237 | 24 | | NP_918955.1 |
| | 9149 | 9343 | unknown | + | AGGAGAAAGgagATG | 64 | hypothetical protein R2APBS1DRAFT_0277 | 9-63/344 | 31 | | ZP_08951135.1 |
| | 9346 | 9645 | unknown | + | GGGGGTAtctgaccATG | 99 | hypothetical protein PFL_2108 | 3-63/70 | 33 | | YP_259216.1 |
| | 9642 | 9938 | unknown | + | GGAGGGtcaTTG | 98 | aspA gene product | 38-122/317 | 32 | | YP_002297975.1 |
| | 9935 | 10171 | unknown | + | GGGGcttggcgtATG | 78 | hypothetical protein Nazgul19 | 18-97/97 | 39 | | NP_919028.2 |
| | 10256 | 10711 | pyrophosphohydrolase | + | AAGGAAAggacATG | 151 | hypothetical protein BCAS0549 | 15-139/140 | 60 | | YP_002153936.1 |
| | 10720 | 10977 | unknown | + | GAGGccggccATG | 85 | hypothetical protein AGRO_3677 | 208-273/300 | 41 | | ZP_08529674.1 |
| | 11082 | 12074 | unknown | + | AGGAGAAatcGTG | 330 | hypothetical protein | 8-95/113 | 48 | | ADE87960.1 |
| | 12101 | 13075 | transcriptional regulator | + | AAGGAAccgacATG | 324 | hypothetical protein Pnap_4317 | 25-252/342 | 45 | | YP_973341.1 |
| | 13078 | 13497 | unknown | + | GCTGACGAtctctgaccATG | 139 | hypothetical protein SCHCODRAFT_69044 | 549-631/848 | 33 | | XP_003030158.1 |
| | 13574 | 13768 | transcriptional regulator | + | AGGGAtttttcATG | 64 | hypothetical protein APT_2164 | 9-65/75 | 53 | | GAB28674.1 |
| | 13768 | 14031 | transcriptional regulator | + | AAGCGGAGccgtcctgATG | 87 | hypothetical protein Bcep1808_2468 | 2-85/86 | 73 | | YP_001120302.1 |
| | 14064 | 14450 | Vsr endonuclease | - | GGAGGAatgATG | 128 | DNA mismatch endonuclease Vsr | 15-141/141 | 65 | | YP_002360880.1 |
| | 14450 | 15025 | excinuclease | - | AACAGAGttgcagcGTG | 191 | Excinuclease ABC C subunit domain protein | 3-183/192 | 58 | | EGH83133.1 |
| | 15038 | 15892 | restriction endonuclease | - | GGCAAAGGtcgccgcATG | 284 | conserved hypothetical protein | 1-285/285 | 70 | | CBJ36134.1 |
| | 15889 | 17031 | cytosine methylase | - | AGGGGGttcgcGTG | 380 | DNA-cytosine methyltransferase | 1-385/385 | 66 | | CBJ36133.1 |
| | 17107 | 17199 | unknown | + | ACGAAGccttgcttaATG | 30 | resistance-nodulation-cell division acriflavin:proton (H+) antiporter | 850-868/1014 | 68 | | YP_001486844.1 |
| | 17511 | 18842 | integrase | + | GAAGGAGGtcttgtagcactgATG | 443 | chorismate mutase family protein | 1-362/386 | 62 | | ZP_02147383.1 |
| | 18990 | 19412 | unknown | + | AAGGAGGAatcATG | 140 | hypothetical protein Dda3937_00584 | 60-163/163 | 40 | | YP_003882998.1 |
| | 19462 | 20001 | unknown | - | GGAGAttttcATG | 179 | hypothetical protein PcarcW_20243 | 68-197/198 | 67 | | ZP_03833564.1 |
| | 20034 | 20264 | Rz1 | - | GGAGGAcgccATG | 76 | hypothetical protein BURPS668_A2333 | 27-81/81 | 62 | | YP_001063327.1 |
| | 20277 | 20588 | Rz | - | AGGGGGccgtATG | 103 | hypothetical protein ORF004 | 2-101/101 | 35 | | YP_001293411.1 |
| | 20585 | 21091 | lysin | - | AAGGAGAAGAacaGTG | 168 | hypothetical protein HMPREF0005_02034 | 1-161/163 | 60 | | EFV83908.1 |
| | 21088 | 21339 | holin | - | GAAGGGGtggacccgaccATG | 83 | conserved exported hypothetical protein | 1-83/85 | 35 | blood disease bacterium R229 | CCA83792.1 |
| | 21336 | 21665 | unknown | - | AAGGGGccagaagATG | 109 | hypothetical protein HDEF_1702 | 3-87/92 | 31 | Candidatus | YP_002924457.1 |
| | 21807 | 22121 | unknown | - | AAGGAGAAAtcacATG | 104 | hypothetical protein PPL19_05085 | 1-103/161 | 53 | | ZP_09283635.1 |
| | 22133 | 23731 | tail fiber protein | - | GGAACGtggacATG | 532 | hypothetical protein Bpse112_32291 | 69-240/282 | 45 | | ZP_02502292.1 |
| | 23809 | 26178 | tail assembly protein | - | AGAGGAAGAcaaATG | 789 | hypothetical protein HCH_05649 | 2-727/728 | 34 | | YP_436732.1 |
| | 26175 | 26375 | tail assembly protein | - | GGGGGCAAgaaATG | 66 | hypothetical protein HCH_05650 | 4-67/71 | 50 | | YP_436733.1 |
| | 26372 | 26608 | tail assembly protein | - | GAGGActgatcATG | 78 | putative transmembrane protein | 7-82/82 | 47 | | ZP_05845047.1 |
| | 26618 | 27418 | tail assembly protein | - | AGGGGGAtcaaacaATG | 266 | hypothetical protein HCH_05652 | 1-268/269 | 39 | | YP_436735.1 |
| | 27415 | 29100 | tail assembly protein | - | AAGAAGAtcacTTG | 561 | hypothetical protein HCH_05654 | 35-560/563 | 32 | | YP_436736.1 |
| | 29097 | 30158 | unknown | - | GACGAGGtttgaaATG | 353 | hypothetical protein D11S_2171 | 1-326/327 | 23 | | YP_003256741.1 |
| | 30160 | 31122 | unknown | - | GAGCGAGGcataacGTG | 320 | hypothetical protein XALc_0225 | 1-194/307 | 35 | | YP_003374757.1 |
| | 31124 | 35860 | tail tape measure | - | GGACTGAAcggaaATG | 1578 | phage tape measure protein | 1-109, 452-1680/1683 | 33 | | YP_004548730.1 |
| | 35853 | 36538 | tail protein | - | AAGGGGGCGagcATG | 228 | pre-tape measure frameshift protein G-T | 1-242/243 | 34 | | NP_918998.2 |
| | 36098 | 36538 | tail protein | - | AAGGGGGCGagcATG | 146 | hypothetical protein Sinme_1368 | 4-126/142 | 34 | | YP_004548729.1 |
| | 36549 | 37337 | unknown | - | GAGGAAtcaatcATG | 262 | hypothetical protein Sinme_1367 | 1-257/262 | 45 | | YP_004548728.1 |
| | 37385 | 37897 | minor tail protein | - | GAGGAAAGtataATG | 170 | hypothetical protein Sinme_1366 | 7-177/177 | 50 | | YP_004548727.1 |
| | 37897 | 38517 | unknown | - | GACGCAGGtttgccgacATG | 206 | hypothetical protein Nazgul55 | 5-198/205 | 49 | | NP_918988.2 |
| | 38514 | 38873 | unknown | - | GAGGCGcgtgATG | 119 | hypothetical protein Sinme_1364 | 3-120/125 | 38 | | YP_004548725.1 |
| | 38886 | 39134 | unknown | - | AAAGGAAccatcATG | 82 | hypothetical protein Nazgul57 | 1-38/85 | 47 | | NP_918990.1 |
| | 39205 | 40233 | major capsid protein | - | AAGGAGAAAGcaaaATG | 342 | capsid protein E | 2-343/346 | 50 | | NP_918991.1 |
| | 40290 | 40688 | decorator protein | - | AGGAGAAccatcATG | 132 | decorator protein D | 4-123/131 | 49 | | NP_918992.1 |
| | 40743 | 42071 | prohead protease | - | AGGACCAGAAccaATG | 442 | prohead protease ClpP | 4-427/434 | 53 | | NP_918994.2 |
| | 42068 | 43591 | portal protein | - | GGAAcccgtcgATG | 507 | phage portal protein | 57-554/559 | 59 | | ACZ55505.1 |
| | 43736 | 43960 | head-tail joining protein | - | GGACAAcactATG | 74 | head-tail joining protein Lambda W | 13-76/76 | 56 | | NP_918996.1 |
| | 44097 | 46076 | terminase large subunit | - | AAGAcctcgATG | 659 | terminase large subunit TerL | 44-677/677 | 58 | | NP_918997.2 |
| | 46210 | 46803 | terminase small subunit | - | GAAGGTGAtagcgATG | 197 | TerS | 9-179/222 | 49 | | NP_918999.1 |
| | 46796 | 46990 | transcriptional regulator | - | AGGAGTAcggtATG | 64 | aminoglycoside phosphotransferase | 423-473/487 | 29 | | ZP_06416368.1 |
| | 47047 | 47736 | repressor | - | GAAAGGCAAGGcagcagcATG | 229 | hypothetical protein Rvan_1213 | 14-180/242 | 36 | | YP_004011581.1 |
| | 47833 | 49446 | helicase | - | ACGAcctcctgcgATG | 537 | helicase | 11-507/522 | 52 | | NP_919000.2 |
| | 49443 | 49745 | resolvase | - | GAAAGGAGGAttcactGTG | 100 | conserved phage protein | 15-103/108 | 55 | | NP_919001.2 |
| | 49742 | 51796 | DNA polymerase | - | ACGTcaccATG | 684 | hypothetical protein ORF026 | 48-670/683 | 45 | | YP_001293433.1 |
| | 51875 | 52609 | single-stranded DNA binding protein | - | AAAGGTGAcaaaaATG | 244 | conserved phage protein | 4-186/198 | 35 | | ACZ55548.1 |
| | 52655 | 53995 | Cas4 superfamily exonuclease | - | GATCctctcgaccccATG | 446 | conserved phage protein | 8-448/454 | 48 | | NP_919005.2 |
| | 54140 | 54538 | unknown | - | GGAGAAatcATG | 132 | hypothetical protein RUMHYD_01446 | 1-120/122 | 26 | | ZP_03782010.1 |
| | 54718 | 55017 | Cro | + | AACGGAGAtcacaATG | 99 | hypothetical protein Nazgul73 | 5-90/97 | 31 | | NP_919007.1 |
| | 55054 | 57534 | primase | + | GGAGGGgcaATG | 826 | DR0530-like primase | 1-843/843 | 49 | | NP_919008.2 |
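
Entries in the "Predicted ribosome binding site and start codon" column appear to use letter case to separate three parts: an uppercase ribosome binding site, a lowercase spacer, and an uppercase three-base start codon. The Python sketch below splits a field under that assumed convention, which is inferred from the entries rather than stated in the table; the helper name `split_rbs_field` is hypothetical. The three example fields are copied from the AH2 rows starting at 619, 9642, and 11082.

```python
import re
from collections import Counter

# Uppercase RBS, optional lowercase spacer, uppercase 3-base start codon (assumed layout).
FIELD = re.compile(r"^([ACGT]+)([acgt]*)([ACGT]{3})$")


def split_rbs_field(field):
    """Split a table entry into (rbs, spacer, start_codon)."""
    match = FIELD.match(field)
    if match is None:
        raise ValueError(f"unexpected field layout: {field!r}")
    return match.groups()


fields = ["AAGGAAAcgacATG", "GGAGGGtcaTTG", "AGGAGAAatcGTG"]

start_codons = Counter()
for field in fields:
    rbs, spacer, start = split_rbs_field(field)
    start_codons[start] += 1
    print(f"RBS={rbs} spacer={spacer} ({len(spacer)} nt) start={start}")

print(dict(start_codons))
```

Applied to a whole table, the counter gives the usage of ATG, GTG, and TTG start codons, and the spacer lengths give the distance between each predicted binding site and its start codon.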
Figure 6. Sequences of the KL1 and AH2 predicted translational frameshift sites. For each phage, the first row shows the DNA sequence (with the predicted frameshift site underlined); the second row shows the amino acid sequence in the original frame (the KL1 gp18 stop codon is represented by an asterisk); the third row shows the amino acid sequence in the −1 frame; the fourth row shows the amino acid sequence of the frameshifted protein.
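
The relationship between the rows of such a figure (original frame, -1 frame, frameshifted product) can be illustrated on a toy sequence. The Python sketch below assumes the standard genetic code and a single slip of one nucleotide backwards at a known position. The DNA string and slip position are invented for illustration and are not the KL1 or AH2 sequences; the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate from the first base, stopping before the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def translate_minus_one(dna, slip):
    """Read in frame up to index `slip`, then back up one base and keep reading.

    Assumes `slip` is a multiple of 3 and that no stop codon occurs before it.
    """
    return translate(dna[:slip]) + translate(dna[slip - 1:])


toy = "ATGAAAAAAGCTTAA"  # hypothetical sequence, not taken from either phage
print(translate(toy))                # original frame: MKKA
print(translate(toy[8:]))            # -1 frame downstream of the slip: SL
print(translate_minus_one(toy, 9))   # frameshifted product: MKKSL
```

In the toy case the original frame ends at a stop codon, while the product that slips back by one base reads past it and ends with different residues.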