| Literature DB >> 30832293 |
Mike Dyall-Smith1,2, Peter Palm3, Gerhard Wanner4, Angela Witte5, Dieter Oesterhelt6, Friedhelm Pfeiffer7.
Abstract
The unexpected lysis of a large culture of Halobacterium salinarum strain S9 was found to be caused by a novel myovirus, designated ChaoS9. Virus purification from the culture lysate revealed a homogeneous population of caudovirus-like particles. The viral genome is linear, dsDNA that is partially redundant and circularly permuted, has a unit length of 55,145 nt, a G + C% of 65.3, and has 85 predicted coding sequences (CDS) and one tRNA (Arg) gene. The left arm of the genome (0⁻28 kbp) encodes proteins similar in sequence to those from known caudoviruses and was most similar to myohaloviruses phiCh1 (host: Natrialba magadii) and phiH1 (host: Hbt. salinarum). It carries a tail-fiber gene module similar to the invertible modules present in phiH1 and phiCh1. However, while the tail genes of ChaoS9 were similar to those of phiCh1 and phiH1, the Mcp of ChaoS9 was most similar (36% aa identity) to that of Haloarcula hispanica tailed virus 1 (HHTV-1). Provirus elements related to ChaoS9 showed most similarity to tail/assembly proteins but varied in their similarity with head/assembly proteins. The right arm (29⁻55 kbp) of ChaoS9 encoded proteins involved in DNA replication (ParA, RepH, and Orc1) but the other proteins showed little similarity to those from phiH1, phiCh1, or provirus elements, and most of them could not be assigned a function. ChaoS9 is probably best classified within the genus Myohalovirus, as it shares many characteristics with phiH1 (and phiCh1), including many similar proteins. However, the head/assembly gene region appears to have undergone a recombination event, and the inferred proteins are different to those of phiH1 and phiCh1, including the major capsid protein. This makes the taxonomic classification of ChaoS9 more ambiguous. We also report a revised genome sequence and annotation of Natrialba virus phiCh1.Entities:
Keywords: Archaea; caudovirus; genome inversion; haloarchaea; halobacteria; halovirus; transposon
Mesh:
Substances:
Year: 2019 PMID: 30832293 PMCID: PMC6471424 DOI: 10.3390/genes10030194
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Morphology of ChaoS9 particles by negative-stain electron microscopy. Purified virus was fixed with 2.5% glutaraldehyde, negatively-stained with 1% uranyl acetate and examined under a Zeiss EM 912 electron microscope. Scale bar represents 50 nm. Examples of contracted tails (labeled with a C) are shown in the top left-hand corner and the bottom right-hand corner.
Figure 2Proteins of halovirus ChaoS9. Purified virus was heated in sample buffer and separated on a 4%–12% SDS-polyacrylamide gel, and the separated proteins stained with Coomassie blue. MW, molecular weight standards (PageRuler). The sizes of the protein standards are indicated at the left (in kDa). Virus proteins (VP1 to VP7) are numbered from largest to smallest. Molecular weight estimates of the major virus proteins are shown in the adjacent table.
Figure 3Restriction digests of ChaoS9 DNA. Enzymes used are indicated above each well. The outside wells (Size M) were loaded with DNA size markers (Lambda-HindIII), and the fragment sizes (in kbp) are shown at the left edge. White triangles; terminal fragments predicted from the DNA sequence that were either underrepresented or not visible on gels. Blue triangles; bands predicted from the DNA sequence to occur only in longer than unit length genomes. See also Supplementary Figure S1.
Characteristics of ChaoS9 and related haloviruses phiH1 and phiCh1.
| Virus 1 | Head Diameter (nm) | Tail Length × Width (nm) | Morphotype | Plaque Morphology | Unit Genome Length 2 (nt) | %G + C | GenomeEnds in Virus 3 | Temperate (Genome Form) |
|---|---|---|---|---|---|---|---|---|
| ChaoS9 | 61 | 128 × 17 | myovirus | turbid | 55,145 | 65.3 | ds, linear, TR, CP, >1 unit length | ? |
| phiH1 | 64 | 170 × 18 | myovirus | turbid | 58,072 | 63.7 | ds, linear, TR, CP, >1 unit length | Yes, provirus is a plasmid (circular, ds, 1 unit length) |
| phiCh1 | 70 | 130 × 20 | myovirus | turbid | 58,487 | 61.9 | ds, linear, TR, CP, >1 unit length | Yes, provirus is a plasmid |
1 Data from this study for ChaoS9, [23] for phiH and [42] for phiCh1. 2 Data from this study (ChaoS9), Dyall-Smith et al. [26] (phiH1), and Witte et al., and this study [43] (phiCh1). 3 TR, CP; terminally redundant, circularly permuted.
Figure 4Comparison of the ChaoS9 genome with the genomes of 17 other published halovirus isolates. (a) Dot plot showing sequence similarity using the method of zPicture (Blastz) [32]. Names of viruses are shown along lower and left edges, and their lengths are indicated by the pink dashed lines. The scale, in kbp, is shown along the upper and right edges. The pink shaded box at lower left highlights the similarity between ChaoS9, phiCh1 and phiH1. (b) Average Nucleotide Identity (ANIb) heatmap. ANIb percentage values were calculated using EZbiocloud [33] and the heatmap produced using heatmapper [34]. Color scale indicates % ANIb.
Figure 5Genome map of ChaoS9 compared to phiCh1 and phiH1. (a) Cumulative AT-skew of all three viral genomes. (b) Genome map of phiCh1; (c) Genome map of ChaoS9; (d) Genome map of phiH1. Scale bar at bottom shows length, in kbp. Pink shading between genome maps indicates regions of similarity (tBLASTx, ≥30% amino acid identity). Functionally similar genes have been colored the same and are labeled nearby on one or more of the genomes (see Table 2 for details). The colors, gene labels (and encoded proteins) are: red, terL (large subunit terminase); light green, por (portal protein); brown, capsid protein genes, such as mcp, gpE and hp32 (major capsid protein), hp20, hp67, cp67; yellow, tpm (tape measure protein); blue, bpj (baseplate J family protein); pink, tail fiber genes; light purple, par (plasmid partition); dark green, orc1, (replication protein Orc1); light blue, repH (plasmid replication protein); light grey, dcm5, yhdJ, ycdA, m.II, m.III, m.I (methyltransferases); dark grey, ISH12 transposase; black, integrases (int1, int2). Genes of unknown or uncertain function are uncolored (white). The L-region of phiH1 has been described by [41].
Annotated coding sequences (CDS) of the ChaoS9 genome (accession MK450543).
| Start (nt) | Stop (nt) | Locus_Tag | Length (bp) | Direction | Gene | Product | Homologs 1: phiCh1, pNMAG03 | Homologs 2: phiH1, [Other] |
|---|---|---|---|---|---|---|---|---|
| 100 | 669 | ChaoS9_005 | 570 | + | - | HTH domain protein | PhiCh1_005, PhiCh1p02, ORF1, Nmag_4251 | PhiH1_005 |
| 656 | 2302 | ChaoS9_010 | 1647 | + |
| terminase large subunit TerL | - | [HALG_00007] |
| 2316 | 3944 | ChaoS9_015 | 1629 | + |
| portal protein Por | - | [HGTV1_7] |
| 3937 | 4083 | ChaoS9_020 | 147 | + | - | CxxC motif protein | - | ORPHAN |
| 4086 | 5273 | ChaoS9_025 | 1188 | + | - | putative phage head assembly protein, SPP1_gp7 family | - | [C478_10461] |
| 5384 | 7300 | ChaoS9_030 | 1917 | + | - | probable prohead protease protein | - | [HLASA_2034] |
| 7303 | 7755 | ChaoS9_035 | 453 | + | - | uncharacterized protein | - | [HLASA_2033] |
| 7801 | 8928 | ChaoS9_040 | 1128 | + | - | major capsid protein MCP | - | 3 [HLASA_2032; |
| 8944 | 9363 | ChaoS9_045 | 420 | + | - | uncharacterized protein | PhiCh1_055, PhiCh1p13, ORF12, Nmag_4261 | 6 PhiH1_050, |
| 9398 | 9784 | ChaoS9_050 | 387 | + | - | uncharacterized protein | - | [HLASA_2030] |
| 9781 | 10191 | ChaoS9_055 | 411 | + |
| head closure protein Hco, type 1 | PhiCh1_065, PhiCh1p15, ORF14, Nmag_4263 | 5 PhiH1_060, |
| 10188 | 10421 | ChaoS9_060 | 234 | + | - | uncharacterized protein | - | ORPHAN |
| 10414 | 10701 | ChaoS9_065 | 288 | + | - | uncharacterized protein | PhiCh1_070, PhiCh1p16, ORF15, Nmag_4264 | 6 PhiH1_065, |
| 10703 | 11149 | ChaoS9_070 | 447 | + |
| putative neck protein Nep, type 1 | PhiCh1_075, PhiCh1p17, ORF16, Nmag_4265 | PhiH1_070 |
| 11156 | 11746 | ChaoS9_075 | 591 | + |
| tail completion protein Tco, type 1 | PhiCh1_080, PhiCh1p18, ORF17, Nmag_4266 | PhiH1_075 |
| 11767 | 13071 | ChaoS9_080 | 1305 | + | - | tail sheath protein | PhiCh1_085, PhiCh1p19, ORF18, Nmag_4267 | PhiH1_080 |
| 13082 | 13480 | ChaoS9_085 | 399 | + | - | predicted tail tube protein | PhiCh1_090, PhiCh1p20, ORF19, Nmag_4268 | PhiH1_085 |
| 13492 | 13932 | ChaoS9_090 | 441 | + | - | uncharacterized protein | PhiCh1_095, PhiCh1p21, ORF20, Nmag_4269 | PhiH1_090 |
| 13935 | 14144 | ChaoS9_095 | 210 | + | - | uncharacterized protein | - | [WP_054519905] |
| 14147 | 16891 | ChaoS9_100 | 2745 | + |
| tape-measure tail protein Tpm | 7 PhiCh1_105, PhiCh1p23+PhiCh1p24, ORF22+ORF23, Nmag_4272 | PhiH1_100 |
| 16895 | 17422 | ChaoS9_105 | 528 | + | - | uncharacterized protein | PhiCh1_110, PhiCh1p25, ORF24, Nmag_4273 | PhiH1_105 |
| 17423 | 17767 | ChaoS9_110 | 345 | + | - | uncharacterized protein | PhiCh1_115, PhiCh1p26, ORF25, Nmag_4274 | PhiH1_110 |
| 17771 | 18604 | ChaoS9_115 | 834 | + | - | uncharacterized protein | 7 PhiCh1_120, PhiCh1p27+PhiCh1p28, ORF26+ORF27, Nmag_4275 | PhiH1_115 |
| 18644 | 18787 | ChaoS9_120 | 144 | + | - | CxxC motif protein | - | PhiH1_120 |
| 18784 | 19335 | ChaoS9_125 | 552 | + | - | uncharacterized protein | PhiCh1_125, PhiCh1p29, ORF28, Nmag_4276 | PhiH1_125 |
| 19338 | 19700 | ChaoS9_130 | 363 | + | - | virus-related protein | - | PhiH1_135 |
| 19697 | 20062 | ChaoS9_135 | 366 | + | - | uncharacterized protein | PhiCh1_130, PhiCh1p30, ORF29, Nmag_4277 | PhiH1_140 |
| 20069 | 21328 | ChaoS9_140 | 1260 | + |
| baseplate J family protein Bpj | PhiCh1_135, PhiCh1p31, ORF30, Nmag_4278 | PhiH1_145 |
| 21321 | 21929 | ChaoS9_145 | 609 | + | - | uncharacterized protein | PhiCh1_140, PhiCh1p32, ORF31, Nmag_4279 | PhiH1_150 |
| 21933 | 22517 | ChaoS9_150 | 585 | + | - | uncharacterized protein | PhiCh1_145, PhiCh1p33, ORF32, Nmag_4280 | - |
| 22514 | 23122 | ChaoS9_155 | 609 | + | - | uncharacterized protein | PhiCh1_150, PhiCh1p34, ORF33, Nmag_4281 | - |
| 23115 | 24698 | ChaoS9_160 | 1584 | + | - | repeat-containing tail fiber protein | 8 PhiCh1_155+ PhiCh1_175, PhiCh1p35+PhiCh1p37, ORF34+ORF36, Nmag_4282+Nmag_4286 | PhiH1_165+ |
| 24702 | 24986 | ChaoS9_165 | 285 | + | - | uncharacterized protein | 8 PhiCh1_160+ PhiCh1_170, Nmag_4285+Nmag_4283 | PhiH1_180+ |
| 25024 | 25698 | ChaoS9_170 | 675 | + |
| tyrosine integrase/recombinase Int1 | PhiCh1_165, PhiCh1p36, ORF35, Nmag_4284 | PhiH1_175 |
| 25709 | 25996 | ChaoS9_175 | 288 | - | - | uncharacterized protein | 8 PhiCh1_160+ PhiCh1_170, Nmag_4285+Nmag_4283 | PhiH1_180+ |
| 25999 | 26145 | ChaoS9_180 | 147 | - | - | repeat-containing tail fiber protein (C-term) (nonfunctional)9 | * 9 | * 9 |
| 26199 | 27455 | ChaoS9_185 | 1257 | - |
| IS1341-type transposase TnpB | - | PhiH1_340 |
| 27457 | 27849 | ChaoS9_190 | 393 | - |
| IS200-type transposase TnpA | - | PhiH1_335 |
| 27906 | 28820 | ChaoS9_195 | 915 | - | - | repeat-containing tail fiber protein (N-term) (nonfunctional) | 8 PhiCh1_155+ PhiCh1_175, PhiCh1p35+PhiCh1p37, ORF34+ORF36, Nmag_4282+Nmag_4286 | PhiH1_165+ |
| 28854 | 29579 | ChaoS9_200 | 726 | + | - | transmembrane domain protein | PhiCh1_180, PhiCh1p38, ORF37, Nmag_4287 | - |
| 29589 | 29861 | ChaoS9_205 | 273 | - | - | HTH domain protein | PhiCh1_185, PhiCh1p39, ORF38, Nmag_4288 | - |
| 29933 | 30241 | ChaoS9_210 | 309 | - | - | uncharacterized protein | PhiCh1_190, PhiCh1p40, ORF39, Nmag_4289 | PhiH1_220 |
| 30238 | 30813 | ChaoS9_215 | 576 | - | - | glutamine amidotransferase domain protein, class-II | PhiCh1_195, PhiCh1p41, ORF40, Nmag_4290 | - |
| 30818 | 31891 | ChaoS9_220 | 1074 | - | - | uncharacterized protein | 7 PhiCh1_200, PhiCh1p42+PhiCh1p43, ORF41+ORF42, Nmag_4291 | - |
| 32030 | 32266 | ChaoS9_225 | 237 | + | - | uncharacterized protein | - | ORPHAN |
| 32339 | 32584 | ChaoS9_230 | 246 | + | - | uncharacterized protein | - | PhiH1_225 |
| 32581 | 33009 | ChaoS9_235 | 429 | + | - | VapC family toxin | - | [BRC75_08225] |
| 33098 | 33448 | ChaoS9_240 | 351 | + | - | uncharacterized protein | PhiCh1_230, Nmag_4297 | PhiH1_250 |
| 33457 | 34059 | ChaoS9_245 | 603 | - |
| tyrosine integrase/recombinase Int2 | PhiCh1_215, PhiCh1p46, ORF45, Nmag_4294 | PhiH1_240 |
| 34241 | 34432 | ChaoS9_250 | 192 | - | - | uncharacterized protein | - | ORPHAN |
| 34507 | 35055 | ChaoS9_255 | 549 | - | - | uncharacterized protein | - | PhiH1_255 |
| 35048 | 35902 | ChaoS9_260 | 855 | - | - | Plasmid partition protein ParA | PhiCh1_220, PhiCh1p47, ORF46, Nmag_4295 | PhiH1_265 |
| 35976 | 36440 | ChaoS9_265 | 465 | - | - | transmembrane domain protein | - | PhiH1_210 |
| 36460 | 38241 | ChaoS9_270 | 1782 | - | - | uncharacterized protein | - | [AV929_12240] |
| 38243 | 38857 | ChaoS9_275 | 615 | - | - | uncharacterized protein | - | [CRI94_04435] |
| 38850 | 39080 | ChaoS9_280 | 231 | - | - | CxxC motif protein | - | [HALLA_11930] |
| 39073 | 39240 | ChaoS9_285 | 168 | - | - | transmembrane domain protein | - | ORPHAN |
| 39233 | 40492 | ChaoS9_290 | 1260 | - | - | uncharacterized protein | - | [DM826_07215] |
| 40485 | 40673 | ChaoS9_295 | 189 | - | - | CxxC motif protein | - | ORPHAN |
| 40663 | 41181 | ChaoS9_300 | 519 | - | - | uncharacterized protein | - | [DJ71_18565] |
| 41174 | 41611 | ChaoS9_305 | 438 | - | - | HNH-type endonuclease/MarR family transcription regulator | - | 4 [DJ70_12900; |
| 41613 | 41870 | ChaoS9_310 | 258 | - | - | HTH domain protein | - | [Natpe_3999] |
| 41997 | 42602 | ChaoS9_315 | 606 | + | - | uncharacterized protein | - | [C480_10020] |
| 42605 | 43051 | ChaoS9_320 | 447 | + | - | uncharacterized protein | - | [Natgr_3468] |
| 43044 | 43250 | ChaoS9_325 | 207 | + | - | uncharacterized protein | - | ORPHAN |
| 43250 | 43621 | ChaoS9_330 | 372 | + | - | uncharacterized protein | - | [OSG_eHP13_00215] |
| 43621 | 44127 | ChaoS9_335 | 507 | + | - | uncharacterized protein | - | ORPHAN |
| 44124 | 44255 | ChaoS9_340 | 132 | + | - | CxxC motif protein | - | [SAMN04488133_0114] |
| 44342 | 45400 | ChaoS9_345 | 1059 | + |
| Orc1-type DNA replication protein | - | [HLASA_2006] |
| 45482 | 45709 | ChaoS9_350 | 228 | - | - | uncharacterized protein | - | ORPHAN |
| 45837 | 45992 | ChaoS9_355 | 156 | + | - | uncharacterized protein | - | [HALDL1_16575] |
| 46369 | 46905 | ChaoS9_360 | 537 | + | - | uncharacterized protein | PhiCh1_295, PhiCh1p65, ORF64, Nmag_4216 | 5 [C472_00499] |
| 46902 | 47177 | ChaoS9_365 | 276 | + | - | uncharacterized protein | - | ORPHAN |
| 47179 | 47979 | ChaoS9_370 | 801 | + | - | zinc-finger domain protein | - | [DJ84_18225] |
| 47976 | 48068 | ChaoS9_375 | 93 | + | - | uncharacterized protein | - | ORPHAN |
| 48065 | 48397 | ChaoS9_380 | 333 | + | - | uncharacterized protein | - | ORPHAN |
| 48394 | 51690 | ChaoS9_385 | 3297 | + |
| plasmid replication protein RepH | PhiCh1_245, PhiCh1p55, ORF54, Nmag_4299 | PhiH1_285 |
| 51683 | 51925 | ChaoS9_390 | 243 | + | - | MarR family transcription regulator | - | [AV929_12115] |
| 51932 | 52591 | ChaoS9_395 | 660 | + | - | CxxC motif protein | - | [C443_17983] |
| 53208 | 53453 | ChaoS9_400 | 246 | + | - | transmembrane domain protein | PhiCh1_440, PhiCh1p93, ORF92, Nmag_4244 | PhiH1_460 |
| 53446 | 53769 | ChaoS9_405 | 324 | + | - | transmembrane domain protein | PhiCh1_445, PhiCh1p94, ORF93, Nmag_4245 | 10 PhiH1_465 |
| 53766 | 54455 | ChaoS9_410 | 690 | + | - | uncharacterized protein | - | 3 [halTADL_2427; |
| 54460 | 54705 | ChaoS9_415 | 246 | + | - | DUF217 domain protein | PhiCh1_460, PhiCh1p97, ORF96, Nmag_4248 | - |
| 54734 | 54922 | ChaoS9_420 | 189 | + | - | CxxC motif protein | PhiCh1_465, PhiCh1p98, ORF97, Nmag_4249 | PhiH1_480 |
| 54915> | <63 | ChaoS9_425 | 294 | + |
| terminase small subunit TerS | PhiCh1_470, PhiCh1p01, ORF98, Nmag_4250 | PhiH1_485 |
1 PhiCh1/pNMAG03 homologs of ChaoS9 proteins. For phiCh1, three codes are given: the locus tag from the revised genome (PhiCh1_), the RefSeq PhiCh1p and the originally assigned ORF codes (ORF for open reading frame). For example, PhiCh1_005, PhiCh1p02, Orf1. Codes starting with PhiCh1_ represent the revised genome sequence and annotation (Genbank accession MK450543; this publication), ORF codes represent the original annotation of the phiCh1 genome [35] (Genbank accession AF440695.1), while codes beginning with PhiCh1p represent the RefSeq version of the annotation of the same genome sequence (GB accession NC_004084). The number shift between ORF and PhiCh1p is due to the terS gene, the N-terminal part being encoded at the end of the genome, and the C-terminal part at its beginning. This gene is ORF98 in the original annotation and PhiCh1p01 in the RefSeq annotation. Codes starting with Nmag_ represent the annotation of the Nab. magadii plasmid pNMAG03 [28] (accession CP001935.1), which is the provirus state of phiCh1. The point of ring opening in pNMAG03 was set between Nmag_4303 and Nmag_4211. The absence of ORF and PhiCh1p codes indicates missing gene calls in the original annotation of phiCh1. 2 phiH1 homologs, or else “other” homologs of ChaoS9 proteins. If a homolog exists in phiH1 then the code is provided; if a homolog is lacking from phiH1 but exists in phiCh1, this is indicated by a hyphen; if a homolog is lacking in both, phiH1 and phiCh1, then an existing “other” homolog is listed in square brackets; codes are either from UniProt (locus tags) or from NCBI nr (WP numbers). The term ORPHAN indicates a complete lack of homologs. 3 Multiple homologs, separated by semicolon, are listed when a homolog is found in a halovirus, but this is significantly more distant than the closest homolog. 4 Multiple homologs indicate a ChaoS9-specific gene fusion. 5 The combination of phiH1/phiCh1 and “other” homologs is used when the homologs from phiH1 or phiCh1 are especially distant. 6 The combination of phiH1/phiCh1 and “other” homologs is used for HLASA_ codes to illustrate a longer stretch of synteny to PVHS1 from Halanaeroarchaeum sulfurireducens (see later). 7 Multiple PhiCh1p/ORF codes indicate that the gene was split by a frameshift in the originally published genome sequence of that virus. 8 Multiple PhiCh1/Nmag codes indicate the existence of paralogs. 9 This ORF represents the C-terminal fragment of a pseudogene (indicated by the term nonfunctional) which has been targeted by ISH12. The asterisks (*) indicate that corresponding pseudogene fragments do not exist as independent ORFs in phiH1 or phiCh1. 10 PhiCh1_445 and PhiH1_465, like ChaoS9_405, have three predicted transmembrane domains and are suspected to function as a holin [43].
Figure 6Provirus elements PVH3A1 and PVHS1 compared to ChaoS9. PVH3A1 from haloarchaeon strain 3A1_DGR (accession NZ_KK033114; nt 1475908-1520418), and PVHS1 from Haa. sulfurireducens strain M27-SA2 (accession NZ_CP011564; nt 1971038- 1927125), are compared to ChaoS9. Grey bands connecting genome maps represent similarity (tBLASTx, >30% identity) between the inferred proteins of ChaoS9 and each provirus. Positions of several annotated proteins and their genes are indicated by color and name; TerL, large subunit terminase (red); Por, portal protein (light green); Head, SPP1_gp7 family head assembly protein, (light brown); capsid proteins such as Mcp (major capsid protein) and Tail sheath protein (brown); Tpm, tape measure protein (yellow); Dam, adenine methyltransferase (light grey); Bpj, baseplate J family protein (blue); Orc1, replication protein Orc1 (green); RepH, plasmid replication protein (turquoise). Scale bar, in kbp, is shown at top.
Figure 7Phylogenetic tree reconstructions using (a) baseplate J family proteins, (b) major tail sheath proteins and (c), major capsid proteins. Halovirus proteins are marked by stars and bacteriophage proteins are marked by triangles. The consensus trees were produced using the Neighbor–Joining algorithm and 100 bootstrap replications. Scale bars represent the estimated number of amino acid replacements per position.