Literature DB >> 34297924

Virus-encoded histone doublets are essential and form nucleosome-like structures.

Yang Liu1, Hugo Bisio2, Chelsea Marie Toner3, Sandra Jeudy2, Nadege Philippe2, Keda Zhou3, Samuel Bowerman3, Alison White3, Garrett Edwards3, Chantal Abergel4, Karolin Luger5.   

Abstract

The organization of genomic DNA into defined nucleosomes has long been viewed as a hallmark of eukaryotes. This paradigm has been challenged by the identification of "minimalist" histones in archaea and more recently by the discovery of genes that encode fused remote homologs of the four eukaryotic histones in Marseilleviridae, a subfamily of giant viruses that infect amoebae. We demonstrate that viral doublet histones are essential for viral infectivity, localize to cytoplasmic viral factories after virus infection, and ultimately are found in the mature virions. Cryogenic electron microscopy (cryo-EM) structures of viral nucleosome-like particles show strong similarities to eukaryotic nucleosomes despite the limited sequence identify. The unique connectors that link the histone chains contribute to the observed instability of viral nucleosomes, and some histone tails assume structural roles. Our results further expand the range of "organisms" that require nucleosomes and suggest a specialized function of histones in the biology of these unusual viruses.
Copyright © 2021 The Author(s). Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Analytical Ultracentrifugation; KO fitness impact; Melbournevirus genetics; NCLDV; acidic patch; cryo-EM; doublet histone; giant virus; histone tail; non-eukaryotic nucleosome; nucleosome-like-particle; viral factory; viral nucleosome

Mesh:

Substances:

Year:  2021        PMID: 34297924      PMCID: PMC8357426          DOI: 10.1016/j.cell.2021.06.032

Source DB:  PubMed          Journal:  Cell        ISSN: 0092-8674            Impact factor:   41.582


Introduction

The organization of genomic DNA with histones into distinct complexes known as nucleosomes is a universal and highly conserved feature of all eukaryotes. The eukaryotic nucleosome core invariably comprises two copies each of the four unique histones H2A, H2B, H3, and H4. Each protein has a histone fold (HF) region that is structurally conserved between the four histones, as well as additional HF extensions and highly charged cationic tails that are unique to each (Luger and Richmond, 1998b). H2B-H2A and H4-H3 form obligate heterodimers that assemble into an octamer that wraps 147 bp DNA to form nucleosomes (Luger et al., 1997). The evolutionary origin of eukaryotic chromatin organization is widely thought to lie in the archaeal domain of life (Talbert et al., 2019). Archaeal histones are limited to the HF and are encoded by only one gene or a few closely related genes. They bind and bend DNA as homodimers or quasi-symmetric heterodimers using architectural principles also seen in eukaryotic histones but lack the ability to form defined particles and instead exist in “slinky-like” assemblies that organize between 90 and ∼600 bp DNA (Bowerman et al., 2021; Mattiroli et al., 2017). The genomes of some viruses (in particular nuclear DNA viruses and retroviruses that need to evade the host DNA damage recognition machinery) are organized into nucleosomes by appropriating eukaryotic histones and the host nucleosome assembly machinery during the latent and early lytic phase (e.g., Oh et al., 2015; reviewed in Lieberman, 2008). With the exception of SV40 (Polisky and McCarthy, 1975), viral genomes are not organized with nucleosomes in the capsid, and they do not encode viral histone homologs (Lieberman, 2008). In contrast, distinct histone-like proteins with homology to eukaryotic H2A, H2B, H3, and H4 have been identified in the genomes of some nucleocytoplasmic large DNA viruses (NCLDVs). Several members of the Marseilleviridae isolated from the Amoeba Acanthamoeba castellanii encode homologs of the four histone proteins in native doublet form, where H4 is fused to H3 and H2B is fused to H2A (Figure 1A) (Thomas et al., 2011). These proteins are present in Marseillevirus virions (Boyer et al., 2009; Fabre et al., 2017; Okamoto et al., 2018), where they might participate in the organization of the large (>300 kb) viral genomes. The arrangement of histones in doublets, the low level (<30%) of sequence similarity with eukaryotic histones, and their high degree of conservation within the family of Marseilleviridae (Figures 1A and S1A) suggest that they have evolved to fulfill viral functions other than (or in addition to) DNA compaction. Here, we show that viral histones are essential for viral infectivity, co-localize in the viral factory, and are in the same order of abundance as the major capsid protein in the viral capsids. Using cryogenic electron microscopy (cryo-EM), we demonstrate that viral histone doublets indeed interact with DNA to form nucleosome-like particles with distinct structural properties not seen in eukaryotes.
Figure 1

MV histone doublets are re-localized to viral factories (VFs)

(A) Histone dimer pairs (H2B-H2A and H4-H3) within Eukarya were aligned against the doublet Marseilleviridae histones using HHpred’s multiple sequence alignment tool (ClustalΩ). Known α helices from the histone fold (HF) domain in Eukarya are dark-colored tubes (H2B, red; H2A, yellow; H4, green; H3, blue; and additional helices, gray). Predicted α helices in MV histones were generated using HHrped’s Quick 2D prediction web server (shown in lighter coloration) within the Marseilleviridae histone doublets. Known R-T pairs and DNA binding residues are shown in Eukarya histones along with their conservation within Marseilleviridae histones; additional predicted DNA-binding residues are shown (positions demonstrated by lollipops).

(B) Light microscopy fluorescence images (scale bar, 10 μm) of A. castellanii cells transfected with GFP-A. castellanii-H2A, MV-H2B-H2A-GFP, and MV-H3-H4-GFP, non-infected and infected with MV at 4 h PI. While GFP-A. castellanii-H2A concentrates only in the nucleus (N) of the non-infected cells, MV-H2B-H2A-GFP and MV-H3-H4-GFP are scattered in the entire cell (including the nucleus). Upon virus infection, GFP-A. castellanii-H2A remains in the nucleus (yellow arrows), while MV-H2B-H2A-GFP and MV-H3-H4-GFP re-localize to the VF (cyan arrows). DAPI staining remains in the nucleus all along the infection, but the intense fluorescence in the late VF hides the staining of the nucleus at 4 h PI.

See also Figure S1.

Figure S1

Localization of fluorescently labeled MV histones in virus-infected Amoeba

(A) Complete sequence alignment of H2B-H2A (top) and H4-H3 (bottom) from Marseilleviridae, the host Acanthamoeba castellanii, and Xenopus laevis. Predicted α helices of H2B-H2A (light red and yellow) and H4-H3 (light green and blue) Melbournevirus histone doublets were generated using HHPRED’s Quick 2D prediction web server. Histone dimer pairs H2B-H2A and H4-H3 of Acanthamoeba castellanii and Xenopus laevis were each aligned against their respective Marseilleviridae histone doublets using HHPRED’s multiple sequence alignment tool, ClustalΩ. Conservation of each specific residue in each alignment is denoted by blue shading, with greater conservation being represented by darker blue. Known R-D clamp, R-T pairs, and DNA binding residues are indicated for Xenopus laevis histone pairs with their conservation within Melbournevirus histones. Marseilleviridae histones are 16%–33% conserved to Xenopus laevis histones (shown on far right of alignment). Known α helices from the histone fold domain in Xenopus laevis are shown as dark colored tubes; H2B are red, H2A are yellow, H4 are green, H3 in blue, and additional helices in gray. Logo plot demonstrating residue conservation among the alignments provided by ClustalΩ tool is shown below. Light microscopy fluorescence images (scale bar 10 μm) of A. castellanii cells transfected with (B) GFP-A. castellanii-H2A, C) MV-H2B-H2A-GFP, D) MV-H4-H3-GFP and E) MV-miniH2B-H2A-GFP, non-infected (NI) and infected with Melbournevirus at 30 min, 1h, 2h, 3h and 4h pi. B) GFP-A. castellanii-H2A concentrates only in the nucleus (N) of the non-infected cells and all along the infectious cycle. (C) MV-H2B-H2A-GFP, (D) MV-H4-H3-GFP and (E) MV-miniH2B-H2A-GFP are scattered in the entire cell (including the nucleus) in the non-infected cells. Between 1h and 2h pi, the viral histones start accumulating in the early viral factories (eVF). At 4h pi, the fluorescence is predominantly concentrated in the mature viral factory (VF). DAPI staining remains at the nucleus all along the infection but the intense fluorescence in the late VF hides the staining of the nucleus at 4h pi. (F). MV H2B-H2A-GFP and MV H4-H3-mRFP are co-localized in the viral factory, along with the viral DNA at 4h pi. (G). DAPI staining of Acanthamoeba cells infected with Melbournevirus at 4h pi. The host nucleus does not disappear upon the infectious cycle. Due to the large accumulation of viral DNA in the viral factories, exposure time for the DAPI fluorescence was optimized to visualize both the viral factory (VF) and the nucleus (N) of the cell (typically between 400ms and 1 s). (H) Schematic depicting the strategy for analysis of histone gene knockouts. Related to Figures 1 and 2.

MV histone doublets are re-localized to viral factories (VFs) (A) Histone dimer pairs (H2B-H2A and H4-H3) within Eukarya were aligned against the doublet Marseilleviridae histones using HHpred’s multiple sequence alignment tool (ClustalΩ). Known α helices from the histone fold (HF) domain in Eukarya are dark-colored tubes (H2B, red; H2A, yellow; H4, green; H3, blue; and additional helices, gray). Predicted α helices in MV histones were generated using HHrped’s Quick 2D prediction web server (shown in lighter coloration) within the Marseilleviridae histone doublets. Known R-T pairs and DNA binding residues are shown in Eukarya histones along with their conservation within Marseilleviridae histones; additional predicted DNA-binding residues are shown (positions demonstrated by lollipops). (B) Light microscopy fluorescence images (scale bar, 10 μm) of A. castellanii cells transfected with GFP-A. castellanii-H2A, MV-H2B-H2A-GFP, and MV-H3-H4-GFP, non-infected and infected with MV at 4 h PI. While GFP-A. castellanii-H2A concentrates only in the nucleus (N) of the non-infected cells, MV-H2B-H2A-GFP and MV-H3-H4-GFP are scattered in the entire cell (including the nucleus). Upon virus infection, GFP-A. castellanii-H2A remains in the nucleus (yellow arrows), while MV-H2B-H2A-GFP and MV-H3-H4-GFP re-localize to the VF (cyan arrows). DAPI staining remains in the nucleus all along the infection, but the intense fluorescence in the late VF hides the staining of the nucleus at 4 h PI. See also Figure S1. Localization of fluorescently labeled MV histones in virus-infected Amoeba (A) Complete sequence alignment of H2B-H2A (top) and H4-H3 (bottom) from Marseilleviridae, the host Acanthamoeba castellanii, and Xenopus laevis. Predicted α helices of H2B-H2A (light red and yellow) and H4-H3 (light green and blue) Melbournevirus histone doublets were generated using HHPRED’s Quick 2D prediction web server. Histone dimer pairs H2B-H2A and H4-H3 of Acanthamoeba castellanii and Xenopus laevis were each aligned against their respective Marseilleviridae histone doublets using HHPRED’s multiple sequence alignment tool, ClustalΩ. Conservation of each specific residue in each alignment is denoted by blue shading, with greater conservation being represented by darker blue. Known R-D clamp, R-T pairs, and DNA binding residues are indicated for Xenopus laevis histone pairs with their conservation within Melbournevirus histones. Marseilleviridae histones are 16%–33% conserved to Xenopus laevis histones (shown on far right of alignment). Known α helices from the histone fold domain in Xenopus laevis are shown as dark colored tubes; H2B are red, H2A are yellow, H4 are green, H3 in blue, and additional helices in gray. Logo plot demonstrating residue conservation among the alignments provided by ClustalΩ tool is shown below. Light microscopy fluorescence images (scale bar 10 μm) of A. castellanii cells transfected with (B) GFP-A. castellanii-H2A, C) MV-H2B-H2A-GFP, D) MV-H4-H3-GFP and E) MV-miniH2B-H2A-GFP, non-infected (NI) and infected with Melbournevirus at 30 min, 1h, 2h, 3h and 4h pi. B) GFP-A. castellanii-H2A concentrates only in the nucleus (N) of the non-infected cells and all along the infectious cycle. (C) MV-H2B-H2A-GFP, (D) MV-H4-H3-GFP and (E) MV-miniH2B-H2A-GFP are scattered in the entire cell (including the nucleus) in the non-infected cells. Between 1h and 2h pi, the viral histones start accumulating in the early viral factories (eVF). At 4h pi, the fluorescence is predominantly concentrated in the mature viral factory (VF). DAPI staining remains at the nucleus all along the infection but the intense fluorescence in the late VF hides the staining of the nucleus at 4h pi. (F). MV H2B-H2A-GFP and MV H4-H3-mRFP are co-localized in the viral factory, along with the viral DNA at 4h pi. (G). DAPI staining of Acanthamoeba cells infected with Melbournevirus at 4h pi. The host nucleus does not disappear upon the infectious cycle. Due to the large accumulation of viral DNA in the viral factories, exposure time for the DAPI fluorescence was optimized to visualize both the viral factory (VF) and the nucleus (N) of the cell (typically between 400ms and 1 s). (H) Schematic depicting the strategy for analysis of histone gene knockouts. Related to Figures 1 and 2.
Figure 2

MV histone doublets are essential for virus fitness

(A) Histone (MV and host) expression and host histone chaperones expression profile during the MV infectious cycle. Host actin genes expression were used for reference.

(B) Schematic representation of the strategy used to generate the recombinant mel_368 or mel_369 KO. Expected PCR product size are also shown. Primer a: HB146 or HB147 and primer b: HB145 or HB148 for mel_368 and mel_369 respectively; primer c, GFP reverse; primer d, GFP forward (Table S1). goi, gene of interest.

(C) PCR demonstrates correct integration of vectors in the locus of mel_368 or mel_369. Analysis of fitness changes associated with histone KO was analyzed by virus competition assay in wild-type or complemented amoebas. Mock cells were incubated with Superfect in the absence of plasmid. a+c, 5′ integration; d+b, 3′ integration; a+b, wild-type locus. HR, homologous recombination.

See also Figure S1 and Table S1.

Results

MV histone doublets are essential for viral propagation and accumulate in viral factories and mature virions

Melbournevirus (MV) encodes three putative histone doublets (MEL_369, MEL_368, and MEL_149, here named MV-H2B-H2A, MV-H4-H3, and MV-miniH2B-H2A). Putative histone domains in each doublet are linked by a ∼20-amino-acid connector (Figure 1A). Although these doublet histone proteins share less than 30% amino acid sequence identity with eukaryotic histones, they are highly conserved within the group of Marseilleviridae (Figure S1A). Secondary structure prediction suggests that viral histones form HFs (α1-L1-α2-L2-α3) but have either longer (MV-H2B-H2A) or shorter (MV-miniH2B-H2A) H2A C-terminal tails than their eukaryotic counterparts (Figure 1A). The H3 αN helix (which organizes the terminal turn of eukaryotic nucleosomal DNA) is predicted to be present in the viral H4-H3 doublets, while the sequence of the H2A docking domain (which tethers the H2A-H2B dimers to the (H3-H4)2 tetramer in eukaryotic nucleosomes) diverges from eukaryotic H2A (Figure S1A). Many of the “signature” amino acids that are important for DNA binding in eukaryotic histones are conserved in viral histones. These include arginine side chains that reach into the DNA minor groove and, in MV-H4-H3, a threonine from the paired L1 loop (R-T pair), as well as many other basic amino acids that are also interacting with the DNA (Figures 1A and S1A). To identify the localization of MV histone doublets during viral assembly, we imaged transfected cells expressing fluorescently (GFP) tagged MV-histone doublets as well as tagged amoeba histones at different time points post-infection (PI) and compared their localization with non-infected cells. Amoeba GFP-H2A concentrated only in nuclei, where it presumably associates with genomic DNA to form nucleosomes, regardless of MV infection (Figures 1B and S1B). In contrast, unique and distinct patterns were observed for the transfected MV-histone doublets upon virus infection, particularly in the viral factory (VF; foci in the amoeba cytoplasm, where viral transcription, replication, and assembly take place). MV-H2B-H2A-GFP and MV-H4-H3-GFP were initially dispersed throughout the whole cell, including the nucleus, but their localization started to change between 1 and 2 h PI (Figures S1C and S1D). Eventually, MV-H2B-H2A-GFP and MV-H4-H3-GFP accumulated and concentrated predominantly as foci in the viral factory (Figures 1B, S1C, and S1D, cyan arrows), although they were still present in nuclei at much lower concentrations (Figure 1B, yellow arrows). Similar localization patterns were obtained with MV-miniH2B-H2A-GFP (Figure S1E). In addition, co-localization of MV-H2B-H2A-GFP and MV-H4-H3-mRFP along with viral DNA was observed in the viral factory, indicating that MV histones are recruited for viral DNA packaging and virion production (Figure S1F). Thousand copies of the AT-rich 360-kb viral genome are produced in the viral factory to produce approximately thousands of mature virions, compared to the 24 copies of a 46.7-Mb GC-rich amoeba genome. As such, DAPI staining for the viral factory overpowers the weaker staining of the nucleus, which is not a sign of host genomic DNA disappearance during the infectious cycle (Figure S1G). Mass spectrometry proteomic analysis of the purified MV virions was performed (Fabre et al., 2017) (Table 1). MV-histone doublets are among the most abundant proteins in the MV viral proteome, with the number of copies per capsid in the range of the major capsid protein. Importantly, the amount of histone doublets (even when using the most conservative estimate), is sufficient to package the entire viral genome with nucleosomal particles with a repeat length of ∼180 bp (∼4,000 histone doublets per 360 kilo base pairs (kbp) viral genome).
Table 1

Mass spectrometry proteomics of the purified MV virions

Protein IDAnnotationFinal rankiBAQ R1Copy R1Rank R1iBAQ R2Copy R2Rank R2
MEL_236hypothetical protein11,7E+0914,08713E+0915,1071
MEL_342bhypothetical protein21,2E+099,91622E+0912,0172
MEL_368histone H4-H3 doublet39,8E+088,36242E+0910,3443
MEL_305major capsid protein41,1E+099,24032E+099,2404
MEL_274hypothetical protein55,7E+084,83551E+095,3855
MEL_369histone H2B-H2A doublet65E+084,22368E+083,8956
MEL_247papain-like cysteine protease73,5E+082,93578E+083,8127
MEL_280hypothetical protein82,2E+081,86285E+082,4298
MEL_234hypothetical protein92,1E+081,77695E+082,3549
MEL_241hypothetical protein101,8E+081,569104E+082,20810
MEL_358hypothetical protein111,4E+081,211113E+081,45411
MEL_207thioredoxin121,3E+081,083123E+081,42412
MEL_020hypothetical protein131,2E+081,053132E+081,25213
MEL_202hypothetical protein149,7E+07828152E+081,07214
MEL_260hypothetical protein159,8E+07835142E+081,00415
MEL_200transmembrane-domain-containing protein167,9E+07676162E+0885116
MEL_065hypothetical protein177E+07596172E+0877517
MEL_025peptidase186,2E+07531191E+0864019
MEL_097hypothetical protein196,3E+07534181E+0861321
MEL_282bhypothetical protein205,4E+07463241E+0865718
MEL_181ATP-dependent helicase215,7E+07482211E+0863420
MEL_354transmembrane-domain-containing protein225,8E+07493201E+0860422
MEL_365bhypothetical protein235,6E+07477221E+0860124
MEL_196hypothetical protein245,5E+07468231E+0860123
MEL_294thioredoxin255,4E+07463251E+0858325
MEL_360AAA-family ATPase265,1E+07435271E+0854327
MEL_244hypothetical protein275,1E+07437261E+0853228
MEL_185hypothetical protein284,6E+07389291E+0857226
MEL_275hypothetical protein294,9E+07420281E+0850529
MEL_269transmembrane-domain-containing protein304,2E+07354309E+0746032
MEL_228transmembrane-domain-containing protein313,9E+07333329E+0747630
MEL_168lipase323,9E+07328339E+0747631
MEL_041peptidase333,9E+07334318E+0738136
MEL_331hypothetical protein343,7E+07312348E+0739634
MEL_352lectin-domain-containing protein353,1E+07263389E+0744233
MEL_211transmembrane-domain-containing protein363,5E+07297357E+0737937
MEL_380serine/threonine protein kinase373,3E+07279378E+0739335
MEL_149mini H2B-H2A histone383,4E+07289367E+0736239
MEL_089hypothetical protein392,9E+07249397E+0736838
MEL_216helicase402,8E+07241406E+0730340
MEL_235serine/threonine protein kinase412,7E+07233416E+0728244
MEL_338hypothetical protein422,3E+07199446E+0729641
MEL_301transmembrane-domain-containing protein432,5E+07209436E+0728243
MEL_231hypothetical protein442,5E+07216425E+0726645
MEL_223bmembrane protein452,1E+07180456E+0728842
MEL_297hypothetical protein462E+07167495E+0726546
MEL_213bhypothetical protein472,1E+07178475E+0724747
MEL_215hypothetical protein482,1E+07177485E+0723748
MEL_255papain-like cysteine protease492,1E+07179465E+0723449
MEL_387hypothetical protein501,9E+07159524E+0721250
MEL_219ribonuclease III511,9E+07163504E+0720753
MEL_278disulfideoxidoreductase521,9E+07160514E+0720851

Asterisks indicate histone doublets.

Mass spectrometry proteomics of the purified MV virions Asterisks indicate histone doublets. Analysis of published transcriptomic data obtained for a member of the Marseilleviridae during its infectious cycle in A. castellanii (Rodrigues et al., 2020) sheds light on the possible contribution of the host to viral nucleosome assembly (Figure 2A). As expected, the viral DNA polymerase is expressed before the viral histones, between 1 and 2 h PI. Expression of viral histones begins 2–4 h PI and continues along the infectious cycle. Host histone expression, however, decreases after 2 h PI, as is the case for most of the A. castellanii genes (see, for reference, the actin gene in gray). The cellular histone chaperones are either never expressed (FUN_000176, FUN_002445, FUN_006481, FUN_014954, FUN_006327 lanes in gray at the bottom of the figure) or have their expression decreased after 2 h PI. No obvious homologs to the cellular histone chaperones were detected in the Marseilleviridae genome. MV histone doublets are essential for virus fitness (A) Histone (MV and host) expression and host histone chaperones expression profile during the MV infectious cycle. Host actin genes expression were used for reference. (B) Schematic representation of the strategy used to generate the recombinant mel_368 or mel_369 KO. Expected PCR product size are also shown. Primer a: HB146 or HB147 and primer b: HB145 or HB148 for mel_368 and mel_369 respectively; primer c, GFP reverse; primer d, GFP forward (Table S1). goi, gene of interest. (C) PCR demonstrates correct integration of vectors in the locus of mel_368 or mel_369. Analysis of fitness changes associated with histone KO was analyzed by virus competition assay in wild-type or complemented amoebas. Mock cells were incubated with Superfect in the absence of plasmid. a+c, 5′ integration; d+b, 3′ integration; a+b, wild-type locus. HR, homologous recombination. See also Figure S1 and Table S1. In order to assess the fitness cost associated with the impairment of potential nucleosome formation in MV capsids, we designed a strategy to knock out either mel_369 or mel_368 (Figures 2B and S1H). A. castellanii cells were transfected with linearized plasmid prior to infection with MV particles. Recombinant viruses were quickly detected by PCR upon 3–5 h PI (primers listed in Table S1). Expression of GFP could not be detected at 3h PI, likely due to the late expression profile of the histone promoters. The presence of mutant viral particles was followed during two generations by infecting wild-type amoeba compared to amoeba expressing an ectopic copy of the histones for trans-complementation (Figure 2C). Importantly, mutant viral particles were quickly outcompeted by wild-type MV in absence of trans-complementation, while growth of the mutant virus was sustained when the missing copy of the histone gene was stably expressed in the amoeba (Figure 2C). This demonstrates that both mel_369 and mel_368 are essential, and their knockout (KO) is associated with a strong fitness cost to the virus. Altogether, the distinct localization and expression patterns reveal specific targeting of MV histone doublets to the viral factory and the viral particle.

MV doublet histones form defined, unstable nucleosome-like particles

To determine what types of complexes MV-H2B-H2A, MV-miniH2B-H2A, and MV-H4-H3 doublets form with DNA, we expressed, purified, and refolded the proteins from E. coli (Figure S2A). When combined with DNA (“601” nucleosome positioning sequence) of varying lengths, and using the classic salt-gradient nucleosome reconstitution protocol (Dyer et al., 2004), MV-H2B-H2A and MV-H4-H3 form defined nucleosome-like particles (MV-NLPs) that are stable at 37°C, whereas individual histone doublets fail to form defined bands on DNA (Figure 3A). The composition of the MV-NLP was confirmed by sucrose gradient fractionation (Figure S2B). MV histones also formed defined particles on a native 181 bp DNA fragment derived from the MV genome (GC content = 45%). Irrespective of DNA sequence, and unlike X. laevis nucleosomes (a representative of eNuc), MV-NLPs dissociate upon heat treatment at 55°C (Figure S2C). MV-miniH2B-H2A alone binds DNA poorly, and no homogeneous MV-NLPs were formed with MV-H4-H3 and DNA (Figure S2D). We therefore focused on MV-NLPs with full-length MV-H2B-H2A and MV-H4-H3.
Figure S2

MV-histones form nucleosome-like particles

(A) SDS-PAGE of purified Melbournevirus (MV) histone doublets. (B) Sucrose gradient sedimentation of MV NLPs with 147 or 207 bp DNA. The compositions of MV NLPs were analyzed by native- and SDS-PAGE. (C) MV-NLP and eNuc were reconstituted on Widom ‘601’ DNA and Melbournevirus native DNA, respectively. The MV-NLP and eNuc were heat treated at 37 and 55°C. (D) Native PAGE of reconstituted MV mini-NLP (mini H2B-H2A instead of H2B-H2A) with 147 bp DNA. (E) GraFix of MV-NLPs with 207 bp DNA (top two panels) and 147 bp DNA (bottom two panels. Native- and SDS- PAGE of the crosslinked MV-NLP fractions representing successful crosslinking, compared to the native MV-NLP input. (F) Representative AFM images: Samples were diluted in TCS buffer and applied to APTES coated mica, rinsed with water and imaged on a NanoWizard Sa with a TAP300-GD cantilever. Samples include 147 bp DNA only, eNuc147, MV-H4-H3 with 147 bp DNA, MV-NLP147, MV-NLP207 and MV-NLP207 GraFix. Related to Figure 3.

Figure 3

Histone doublets in Marseilleviridae form nucleosome-like particles (MV-NLPs)

(A) Native PAGE of reconstituted MV-NLPs with “601” DNA of various lengths. Left panel: individual and combinations of histone doublets reconstituted onto 147 bp DNA; right panel: MV-H2B-H2B and MV-H4-H3 reconstituted onto 601 DNA of varying lengths.

(B) Sedimentation velocity analytical ultracentrifugation (SV-AUC) of MV-NLPs. Left: van Holde-Weischet plot of eukaryotic nucleosomes (eNuc), histone-DNA complexes with individual MV histone doublets, and native MV-NLPs with 147 bp DNA (no GraFix); right: van Holde-Weischet plot of crosslinked (GraFix-ed) MV-NLPs with 123, 147, or 207 bp DNA.

(C) Height profile of MV-NLPs with 147 or 207 bp DNA, obtained by atomic force microscopy (AFM). The average height profiles with standard deviations of the particles from each sample are shown; representative, original images are shown in Figure S2, and statistics are shown in Table S2.

MV-histones form nucleosome-like particles (A) SDS-PAGE of purified Melbournevirus (MV) histone doublets. (B) Sucrose gradient sedimentation of MV NLPs with 147 or 207 bp DNA. The compositions of MV NLPs were analyzed by native- and SDS-PAGE. (C) MV-NLP and eNuc were reconstituted on Widom ‘601’ DNA and Melbournevirus native DNA, respectively. The MV-NLP and eNuc were heat treated at 37 and 55°C. (D) Native PAGE of reconstituted MV mini-NLP (mini H2B-H2A instead of H2B-H2A) with 147 bp DNA. (E) GraFix of MV-NLPs with 207 bp DNA (top two panels) and 147 bp DNA (bottom two panels. Native- and SDS- PAGE of the crosslinked MV-NLP fractions representing successful crosslinking, compared to the native MV-NLP input. (F) Representative AFM images: Samples were diluted in TCS buffer and applied to APTES coated mica, rinsed with water and imaged on a NanoWizard Sa with a TAP300-GD cantilever. Samples include 147 bp DNA only, eNuc147, MV-H4-H3 with 147 bp DNA, MV-NLP147, MV-NLP207 and MV-NLP207 GraFix. Related to Figure 3. Histone doublets in Marseilleviridae form nucleosome-like particles (MV-NLPs) (A) Native PAGE of reconstituted MV-NLPs with “601” DNA of various lengths. Left panel: individual and combinations of histone doublets reconstituted onto 147 bp DNA; right panel: MV-H2B-H2B and MV-H4-H3 reconstituted onto 601 DNA of varying lengths. (B) Sedimentation velocity analytical ultracentrifugation (SV-AUC) of MV-NLPs. Left: van Holde-Weischet plot of eukaryotic nucleosomes (eNuc), histone-DNA complexes with individual MV histone doublets, and native MV-NLPs with 147 bp DNA (no GraFix); right: van Holde-Weischet plot of crosslinked (GraFix-ed) MV-NLPs with 123, 147, or 207 bp DNA. (C) Height profile of MV-NLPs with 147 or 207 bp DNA, obtained by atomic force microscopy (AFM). The average height profiles with standard deviations of the particles from each sample are shown; representative, original images are shown in Figure S2, and statistics are shown in Table S2. Sedimentation velocity analytical ultracentrifugation (SV-AUC) yields information on the size and shape distribution of macromolecular assemblies in solution. The diffusion-corrected sedimentation value of a particle is proportional to its mass, and inversely proportional to its viscous drag. The sedimentation behavior of MV-histone doublets assembled onto 147 bp DNA was compared to that of X. laevis histones reconstituted on the same DNA (eNuc147). The eNuc147 particle sediments at 11 S (Wang et al., 2018), while MV-NLP147 sediments at ∼8 S, with a “tail” toward lower S values, indicating dissociation of DNA at the relatively low (300 nM) concentrations used for AUC experiments (Figure 3B). Particles reconstituted with individual MV-histone doublets sediment at ∼5.5 S (Table 2). The AUC-derived molecular mass calculated for MV-NLP147 (Figure 3B, red arrow) suggests that at least one copy of the H2B-H2A doublet has been lost due to the sample dilution in AUC (Table 2). To counteract the apparent dissociation of MV-NLPs, we employed gradient fixation (GraFix) with glutaraldehyde (Kastner et al., 2008). Histone doublets were efficiently crosslinked, and the position of the main peak fraction in the gradient as well as migration on a native gel was unchanged upon crosslinking, suggesting little to no structural changes (Figure S2E). Particles reconstituted on 123, 147, and 207 bp DNA fragments sedimented between 8.8 and 9.9 S after crosslinking (Figure 3B, Table 2). The frictional ratio (f/f0) (a measure of particle “extension,” where larger values indicate increased drag, slowing sedimentation) of all MV-NLPs was higher than that of eNuc147, and f/f0 values increase in correlation with DNA length (Table 2). MV-NLP123 has the highest S-value among MV-NLP constructs, despite having the smallest mass, because of its significantly lower f/f0 value. This suggests that the additional DNA in MV-NLP147 and MV-NLP207 is extended.
Table 2

S values (S(20,W)), frictional ratios (f/f0), and calculated molecular weights (including confidence intervals) of histone-DNA complexes derived from SV-AUC

SampleS(20,W)f/f0Molecular weight (kDa) experimental/theoretical
No GraFix147bp DNA5.10 (5.09, 5.10)3.09 (3.09, 3.09)117.13 (116.9, 117.3)/97
MV-H2B-H2A 1475.68 (4.61, 6.75)2.66 (2.51, 2.81)134.61 (89.83, 179.38)/150
MV-H4-H3 1475.28 (4.59, 5.96)2.79 (2.70, 2.87)120.93 (104.14, 137.71)/146
MV-NLP1477.78 (7.74, 7.82)2.0 (1.96, 2.04)179.17 (173.55, 185.80)/206
eNuc14711.16 (11.15, 11.16)1.54 (1.52, 1.56)218.5 (214.4, 222.5)/210
GraFixMV-NLP123 (nucleosome-like)9.86 (7.99, 11.72)1.77 (1.52, 2.02)221.46 (170.93, 271.99)/190
MV-NLP147 (nucleosome-like)8.79 (7.30, 10.29)2.01 (1.86, 2.14)217.79 (140.57, 295.00)/206
MV-NLP207 (nucleosome-like)9.01 (7.48, 10.55)2.26 (2.00, 2.51)256.58 (194.54, 318.62)/245
S values (S(20,W)), frictional ratios (f/f0), and calculated molecular weights (including confidence intervals) of histone-DNA complexes derived from SV-AUC We visualized MV-NLPs by atomic force microscopy (AFM). Particles reconstituted with either MV-H4-H3 alone or the full complement of histone doublets were deposited onto mica surfaces, and their height profiles were compared to eNuc147 (Figures 3C and S2F; Table S2). The majority of eNuc147 survived the low concentration (2 nM) required for AFM imaging and presented the characteristic height profile of ∼2.1 nm (White et al., 2016). In contrast, over 30% of the observed particles heights indicative of free DNA, indicating complete MV-NLP207 disassembly, and the remaining particles had lower height profiles (∼1.3 nm) than the eNuc147 complex (Figures 3C and S2F; Table S2). This reduced height suggests that MV-NLP207 transiently disassemble at the low concentration required for AFM. Crosslinking MV-NLP prevented disassembly and resulted in particles comparable in height to eNuc147.

MV-NLPs resemble eukaryotic nucleosomes

Single-particle cryogenic electron microscopy (cryo-EM) was used to visualize crosslinked MV-NLP207. GraFix was required as the majority of untreated MV-NLPs dissociate during plunge freezing. Raw images and 2D classes show that MV-NLPs share many characteristic features of eukaryotic nucleosomes (Figure S3A), with defined segments of extending linker DNA visible in many classes. After 3D classifications and refinement, we obtained a structure of the MV-NLP207 at 6.1 Å (Figure S3A; Table S3). A second dataset obtained with GraFix-treated particles reconstituted onto 147 bp DNA (MV-NLP147) yielded an improved overall resolution of ∼4.0 Å (Figure S3B; Table S3). In both structures, electron density for the DNA and histone helices are clearly distinguishable (Videos S1 and S2), which allowed the assignment of ∼120 bp of bound DNA and all histone chains (Figure 4A; Video S3).
Figure S3

Cryo-EM data analysis of MV-NLP207 and MV-NLP147

(A) Raw micrograph of the Grafix-treated MV-NLP207 (Scale bar is 50 nm), 2D class averages generated from the dataset, 3D structure of the MV NLP, local resolution map and FSC curve, and data processing strategy flow chart. (B) Raw micrograph of the GraFix treated MV-NLP147 (Scale bar is 50 nm), 2D class averages generated from the dataset, 3D structure of the MV NLP, local resolution map and FSC curve, and data processing strategy flow chart. Related to Figure 4.

Figure 4

Cryo-EM reveals that MV-histone doublets form nucleosome-like structures with asymmetrically extending DNA

(A) Overview of MV-NLP147 and electron density. The equivalent regions of MV H3, H4, H2A, and H2B are shown in blue, green, yellow, and red, respectively.

(B) Overlay of MV-NLP147 (blue) with eNuc (gray). Only 80 bp of DNA with associated histones are shown for clarity. Superhelix locations (SHLs) are numbered from 0 to 6 starting from the nucleosome dyad (φ).

(C) Comparison of the DNA path of eNuc147 (gray ribbon diagram), MV-NLP147 (blue electron density), and MV-NLP207 (green electron density).

(D) Charged surface representation of the histones for MV-NLP147 and eNuc147. Coordinates for eNuc147 were taken from 3LZ0.

See also Figures S3A and S3B and Table S3.

Cryo-EM data analysis of MV-NLP207 and MV-NLP147 (A) Raw micrograph of the Grafix-treated MV-NLP207 (Scale bar is 50 nm), 2D class averages generated from the dataset, 3D structure of the MV NLP, local resolution map and FSC curve, and data processing strategy flow chart. (B) Raw micrograph of the GraFix treated MV-NLP147 (Scale bar is 50 nm), 2D class averages generated from the dataset, 3D structure of the MV NLP, local resolution map and FSC curve, and data processing strategy flow chart. Related to Figure 4. Cryo-EM reveals that MV-histone doublets form nucleosome-like structures with asymmetrically extending DNA (A) Overview of MV-NLP147 and electron density. The equivalent regions of MV H3, H4, H2A, and H2B are shown in blue, green, yellow, and red, respectively. (B) Overlay of MV-NLP147 (blue) with eNuc (gray). Only 80 bp of DNA with associated histones are shown for clarity. Superhelix locations (SHLs) are numbered from 0 to 6 starting from the nucleosome dyad (φ). (C) Comparison of the DNA path of eNuc147 (gray ribbon diagram), MV-NLP147 (blue electron density), and MV-NLP207 (green electron density). (D) Charged surface representation of the histones for MV-NLP147 and eNuc147. Coordinates for eNuc147 were taken from 3LZ0. See also Figures S3A and S3B and Table S3.

Video S3. Overview of the MV-NLP147 structure (model to map, Figure 3A), including a highlight of doublet connectors and H4 N-terminal domain (Figure 4), related to Figure 4

The equivalent regions of H3, H4, H2A, and H2B are shown in blue, green, yellow, and red, respectively. The overall dimensions of the DNA superhelix, the path described by the DNA around the histone core, and the overall layout of HF helices are similar between MV-NLPs and eukaryotic nucleosomes (Figures 4A and 4B; Table S4). However, DNA wrapping is incomplete and asymmetric in both MV-NLPs, as only ∼120 bp are wrapped in most particles. This is more pronounced in MV-NLP207, where well-defined electron density describes a straight path for DNA extending away from the histone core at superhelical location (SHL) 4.5, whereas the DNA extending at the other side (from SHL 5.5) appears to be more disordered (Figure 4C; Video S1). In MV-NLP147 complex slightly more DNA is bound (up to SHL 5.5 on both sides), and only one arm of extending DNA has defined density (Figure 4C; Video S2). This asymmetry is likely a consequence of the asymmetric 601 DNA sequence (Chen et al., 2017; Chua et al., 2012). Initial models of MV-histone doublets were generated though homology modeling (Waterhouse et al., 2018), where the ∼28- and ∼20-amino-acid connectors in H4-H3 and H2B-H2A were constructed through de novo methods (Webb and Sali, 2016). These were docked into the MV-NLP147 density, with good agreement with nucleosome-like configurations (Figures 4A and 4B; correlation coefficient between model and experimental density 0.766). Overall, the MV-histone core is less positively charged than the eukaryotic histone octamer (pI of ∼9.5 versus ∼11.0; Figure 4D). While basic amino acids describe a distinct path for the DNA in both histone cores, and many amino acid side chains that engage DNA are conserved between viral and eukaryotic histones (Figures 1A and S1A), positive charges along the helical path are less pronounced in MV histones, in particular in the region formed by the H3-H3′ four-helix bundle (“dyad”). This contributes to the reduced stability of MV-NLPs compared to eNuc. Additionally, the acidic patch, the primary point of interactions between eukaryotic nucleosomes and nuclear proteins (Peng et al., 2020), differs in size and charge, due to fewer acidic residues in MV-H2A (Figure 4D). The two MV-H4-H3 doublets superimpose onto the eukaryotic (H4-H3)2 tetramer with a backbone root-mean-square deviation (RMSD) of ∼2 Å (Figure 5A; Table S4). The main-chain arrangements of the MV-H3-H3′ four-helix bundles at the nucleosomal dyad are similar as in eNuc147, but the interface lacks the hallmark histidine-cysteine configuration. Each MV-H2B-H2A doublet interfaces with the H4 portion of a MV-H4-H3 doublet through a four-helix bundle of similar architecture as the H4-H2B interface in eNuc147. However, the MV-H2B α2 helix is one turn shorter than the eukaryotic sequence (due to a conserved proline at position 83), and the tyrosines that form π-stacking interactions in eNuc systems are consistently absent in both MV-H4 and MV-H2B (Figure 5B). Additionally, while the main chains of the two H2A L1 loops (38NYAE41) are close to one another in eNuc147 and form direct interactions through their side chains, the distance between L1 loops in MV-NLP147 (153GGCS156; positions 214–217 in Figure S1A) is increased and shows no direct L1-L1 interactions. L1-L1 separation is accompanied by a similar increase in the distance between H2B-H2A centers (36.3 Å in eNuc147 versus 38.6 Å in MV-NLP147; Table S4). In addition to being further apart in the MV-NLP147 complex, the H2B-H2A HFs reorient away from the nucleosome core (Figures 5C and S4A; Table S4). Conversely, the H2A docking domains of MV-H2A are angled inward, toward the dyad axis, in an altogether different arrangement than what is seen in eNuc147 (Figure 5C; Table S4; 6–8 Å backbone RMSD for chains C and G, respectively). Together, these structural changes may contribute to the reduced stability and increased propensity to form sub-NLPs (hexasomes and tetrasomes) of MV-NLPs in comparison to the eNuc complex.
Figure 5

Comparison of MV-NLP and eNuc histone structures

(A) Superposition of two MV-H4-H3 doublets (green and blue) with the eukaryotic (H3-H4)2 tetramer in gray (left), and a close-up of MV-H4-H3 doublet connector (right). Interactions of the MV-H4 N-terminal tail with the nucleosome are also shown.

(B) A comparison of the four-helix bundle structure formed by H4 and H2B.

(C) Superposition of two MV-H2B-H2A doublets (in red and yellow) with eukaryotic H2A-H2B dimers in gray (left), and a close-up of the MV-H2B-H2A doublet connector and docking domain.

See Table S4 for detailed information. Additional possible configurations for both connectors are shown in Figure S4.

Figure S4

MV-nucleosome histone dimer arrangement and histone connectors

(A) Description of ϕ, the pseudo-dihedral angle measuring the relative orientations of the H2B-H2A subunits, between MV-NLP and eNuc. Larger magnitude values correspond to a dimer arrangement that is angled further away from the dyad axis. Fictitious particle locations are shown as black spheres, and histones (H3 in blue, H4 in green, H2A in yellow, and H2B in red) and DNA (silver and gold ribbons) are semi-transparent to allow for better visibility of each particle’s location. (B) Overlay of de novo loop models with feasible conformations connecting H2B to H2A and (C) H4 to H3 in the MV-histone doublets. Loop configurations were generated using Modeler (v9.20) and then placed in the nucleosome context. After manually removing physically non-relevant (knots) conformations and clashes identified with a cutoff distance of 0.8 Å using CPPTRAJ of the Amber MD package (v18), the three best loops are shown in different colors (cross correlation range against the simulated map is 0.614-0.623). Related to Figure 5.

Comparison of MV-NLP and eNuc histone structures (A) Superposition of two MV-H4-H3 doublets (green and blue) with the eukaryotic (H3-H4)2 tetramer in gray (left), and a close-up of MV-H4-H3 doublet connector (right). Interactions of the MV-H4 N-terminal tail with the nucleosome are also shown. (B) A comparison of the four-helix bundle structure formed by H4 and H2B. (C) Superposition of two MV-H2B-H2A doublets (in red and yellow) with eukaryotic H2A-H2B dimers in gray (left), and a close-up of the MV-H2B-H2A doublet connector and docking domain. See Table S4 for detailed information. Additional possible configurations for both connectors are shown in Figure S4. MV-nucleosome histone dimer arrangement and histone connectors (A) Description of ϕ, the pseudo-dihedral angle measuring the relative orientations of the H2B-H2A subunits, between MV-NLP and eNuc. Larger magnitude values correspond to a dimer arrangement that is angled further away from the dyad axis. Fictitious particle locations are shown as black spheres, and histones (H3 in blue, H4 in green, H2A in yellow, and H2B in red) and DNA (silver and gold ribbons) are semi-transparent to allow for better visibility of each particle’s location. (B) Overlay of de novo loop models with feasible conformations connecting H2B to H2A and (C) H4 to H3 in the MV-histone doublets. Loop configurations were generated using Modeler (v9.20) and then placed in the nucleosome context. After manually removing physically non-relevant (knots) conformations and clashes identified with a cutoff distance of 0.8 Å using CPPTRAJ of the Amber MD package (v18), the three best loops are shown in different colors (cross correlation range against the simulated map is 0.614-0.623). Related to Figure 5. The connector linking the C terminus of MV-H2B with the N terminus of MV-H2A is easily accommodated, as these are near each other. A variety of favorable conformations are predicted through de novo modeling, with several different arrangements in agreement with observed weak density in MV-NLP147 (Figure 5C, inset, and Figure S4B). A conserved arginine is pointing into the minor groove, and main-chain phosphate interactions further hold the loop in place, echoing the contributions of the H2A N-terminal domain in eNuc. Connecting the MV-H4 C terminus with the N-terminal tail of MV-H3 is conceptually more difficult. In eukaryotic H4, the eight most C-terminal amino acids are engaged in contacts with H2A and H2B in the interior of the histone core. Our models show that the equivalent region in MV-H4 maintains docking domain interactions with MV-H2A. Physical constraints dictate that the missing linker segment must continue toward the DNA superhelix and project upward in the direction of MV-H3 αN, with which it connects (Figure 5A). Electron density attributed to the connector is visible between the DNA gyres on both sides and in both structures (MV-NLP207 and MV-NLP147). Although not sufficiently defined to allow modeling through real-space refinement, de novo generation of linker structures provided multiple feasible conformations (Figure S4C), and molecular dynamics flexible-fitting simulations confirmed that these orientations are physically relevant (Figure 5A). A stretch of 10 predominantly hydrophobic amino acids packs against the region connecting MV-H3αN with α3 to form a hydrophobic core (Figures 5A and S4C), and the connector is also positioned to engage in main-chain contacts with the DNA backbone at SHL ± 1.5. Mostly flexible and small amino acids (GAGSAGTGS) form the turn toward MV-H3 αN (Figure 5A). As such, the MV-H4-H3 connector (which is highly conserved among Marseilleviridae doublets) can be accommodated within the structural framework of a canonical nucleosome, although its presence likely contributes to the unwrapping of the terminal 10 bp of DNA observed in both MV-NLP densities. The other MV-histone tails that are not engaged in doublet formation also differ from the histone tails in eNuc. The C-terminal tail of MV-H2A is significantly longer than that of eukaryotic H2A (Figure S1A). The MV-H4 N-terminal domain (equivalent to the first 26 amino acids in eH4) does not extend over the DNA but takes the opposite direction to interact with MV-H4 α2 and α3 (Figure 5A). This density is as well defined as that of the main HF helices, suggesting that this domain may consistently interact with its own histone core rather than with DNA or the acidic patch in neighboring particles, as observed for the eukaryotic H4 tail (Dorigo et al., 2003; Morrison et al., 2018). The amino acid residues in this region are highly conserved among Marseilleviridae and differ from the eH4 tail (Figure S1A). A conserved proline (P34) at the tip of MV-H4 α1 might be responsible for the redirection (Figure 5A). As is the case with the eH4 tail, the first 20 amino acids of the MV-H4 tail are too disordered to be observed in the electron density map.

Discussion

The discovery that the genomes of some giant viruses encode histone-like proteins in which the dimerization partners are fused into a single chain was surprising (Thomas et al., 2011). Our results that these MV-histone doublets associate with the viral factory and indeed with the mature virus and that they form nucleosome-like particles with unique properties support the idea that they are involved in viral genome organization into the capsids. Instead of organizing DNA into nucleoprotein “core complexes” inside the viral capsid, Marseilleviridae utilize their own specialized histone doublets to form nucleosomes and possibly more compact chromatin within the viral particle. In the context of the ongoing debate regarding the role of viruses in the emergence of the eukaryotic nucleus, our findings also provide a new dimension to the diverse role of histones in genome organization of non-eukaryotic entities (Bell, 2001; Erives, 2017). Marseilleviridae exhibit intermediate dependency on their host, as they develop their viral factory in the cytoplasm but transiently recruit host nuclear proteins to the viral factory to transcribe early genes until the virally encoded RNA polymerase has been synthesized (Fabre et al., 2017). This is achieved through a reorganized leaky nucleus during the early phase. Amoeba histones are exclusively localized in the host nucleus whether the cells are infected or not, whereas transfected viral histones are present in the nucleus and cytoplasm in uninfected cells but move into viral factories upon infection to integrate into mature virions. This suggests that amoeba histones remain bound to cellular DNA, while the viral histones that made it into the nucleus apparently do not interact with amoeba genomic DNA and can leave the nucleus to associate with the viral factory. As such, amoeba histones are likely targeted to the nucleus and assembled into chromatin by histone chaperones and transporters that mostly avoid viral histones. As is the case for the other nuclear proteins transiently leaving the cell nucleus upon Marseilleviridae infection, it remains to be seen whether transfected viral histones are actively or passively recruited to the viral factory. Many “nucleosome signature features” are maintained among archaeal, eukaryotic, and now viral nucleosomes, such as the overall geometry and arrangement of the DNA superhelix, achieved by interactions between the DNA minor groove backbone and the main chain of antiparallel L1L2′ loops (from H4-H3 and H2B-H2A pairs) and the utilization of positive α helix dipole moments to bind DNA (Luger and Richmond, 1998a). Several of the “sprocket arginines” (Hodges et al., 2015) and their pairing with a threonine from the dimerization partner, as well as the highly conserved R-D clamps stabilizing the HF, are conserved among eukaryotic, archaeal, and viral histones. However, MV-doublet histones stably bind only ∼120 bp DNA as opposed to the 147 bp in eukaryotic nucleosomes. In our cryo-EM structures, the unbound DNA is characterized by distinct density indicating that it is in a defined orientation, and in this, it differs from the structures of eukaryotic variant nucleosomes that are also characterized by organizing only ∼120 bp DNA (Pentakota et al., 2017; Zhou et al., 2021). Unique to MV-NLPs is also the packing of the H4 N-terminal tail onto the surface of the histone core, the overall less positive charge of the histone core surface including the acidic patch, and, most notably, the linking of MV-H4-H3 and MV-H2B-H2A to form doublet histones. Fused histone genes are also observed in some archaea (Talbert et al., 2019) but to date not in any eukaryotic genome. The amino acid linkers connecting the histone moieties to form the doublets, which are conserved in sequence across Marseilleviridae, can be accommodated in the structure. However, the MV-H4-H3 connector (which is conserved in length and amino acid sequence between members of the Marseilleviridae) likely contributes to destabilizing the last turn of DNA. Overall, MV-NLPs are significantly destabilized compared to eNuc, and this seems to be an intrinsic property related to their virus-specific function. While this article was in revision, the structure of MV nucleosomes was published, with identical results and conclusions regarding the structural composition of the particles (Valencia-Sánchez et al., 2021). A common trait of DNA viruses is that they must actively package their DNA into capsids to protect it outside the host cell environment. On the other hand, they also need to make their genomes accessible to the transcription machinery immediately upon infection. Previous analyses of the infectious cycle of MVs revealed that the capsids appear to dissolve ∼2 h PI, leaving a spherical electron dense core in the cytoplasm (Fabre et al., 2017). Our study suggests that Marseilleviridae assemble virally encoded histones into nucleosomes to condense and protect their genome, to allow it to fit into the ∼200-nm-diameter icosahedron, also explaining the higher than expected electron density of the spherical core (Fabre et al., 2017). This process is essential for MV propagation, since mutant viral particles are rapidly lost in competition with wild-type viruses, strongly suggesting that they are noninfectious. Arguably, viral nucleosomes must be metastable to make the genome (once transferred into the cytoplasm) accessible to transcription by the host RNA polymerase recruited to the viral factory. This might explain the requirement for virus-encoded histones, as ATP-dependent remodelers utilized by eukaryotes to facilitate transcription through chromatin are localized mostly in the nucleus. The regions in MV histones that convey this metastability (such as the H4-H3 connector and the distinct makeup of the four-helix bundle regions) are highly conserved among Marseilleviridae. Why these histones exist as doublets, what determines their specificity for the viral genome, and which (if any) assembly factors they rely on for their association with the viral DNA are intriguing open questions that warrant further research. Genetic manipulation of giant viruses has been a neglected topic of research since their discovery despite the immense richness in genes with unknown functions encoded by their genome. Homology recombination has been used as a powerful tool to modify the genome of other viruses (Gowripalan et al., 2020), and we show here that it can be efficiently utilized for genetic manipulation of MV. Moreover, the essentiality of genes that encode MV histones, like mel_369 and mel_368, can be overcome by trans-complementation, introducing ectopic copies of the gene in the amoeba genome. Future efforts will be directed to identify selection systems to isolate mutant viruses for phenotypic analysis.

Limitations of study

Our study demonstrates that virally encoded histone doublets assemble into nucleosome-like particles in vitro and that they are present in the virus at amounts sufficient to package the entire viral genome into nucleosomes. However, we have no direct proof that the histones are indeed bound to viral DNA in the capsid. This is because any attempt to open the extremely stable virus capsid also destroys chromatin structure. However, the presence of strong electron density in the capsid core suggests that the genome is highly compacted. Methods such as cryo-electron tomography or correlative light electron microscopy (that do not require opening of the capsid) might be needed to visualize the organization of chromatin in the capsid. Although co-localization of the two histone doublets suggests that they are forming nucleosomes in the viral factory to which they are recruited, we have no direct proof of this. The timing of the association of viral histones with DNA in the VF (in particular when nucleosomes are formed and which chaperones, if any, assist in the process) has not yet been investigated, as it requires an infectious virus whose genome encodes tagged histones. These exciting experiments have only now become possible due to our breakthrough in being able to genetically manipulate the virus. Finally, visualizing nucleosome-like particles on the cryo-EM grid required crosslinking (as did a related study describing the structure of the MV nucleosome that was published while this article was under review; Valencia-Sánchez et al., 2021). It is possible that some aspects of the structure are more dynamic than suggested in these two structures. The high mobility of the connectors, together with the overall resolution reported here (∼4 Å), precluded a detailed description of the main chain in the connecting regions. It is therefore gratifying that the two independently determined structures are very similar, even in the ill-defined regions.

STAR★Methods

Key resources table

Resource availability

Lead contact

The lead contact is Karolin Luger (karolin.luger@colorado.edu).

Materials availability

Histone expression constructs, cell lines, viruses, as well as vectors are available from the lead contact upon request.

Data and code availability

Coordinates and density maps have been deposited in the PDB and EMDB databases (access codes PDB ID 7N8N, EMD-24238). All other data is available in the main text or supplemental data. Mass spectrometry proteomics data were deposited to the ProteomeXchange Consortium (http://proteomecentral.proteomexchange.org) via the PRIDE partner repository with accession code PXD003910.

Method details

Histone sequence alignment and secondary structure prediction

Predicted Marseilleviridae histone-like proteins were aligned with eukaryotic histone proteins with HHpred’s Multiple Alignment using Fast Fourier Transform (MAFFT) with a 1.53 gap open penalty. Lineages A, B, and C of the Marseilleviridae family were included, lineage D was excluded due to sequence similarity with lineage C (∼90%) and lineage E was excluded due to the lack of annotated histone-like proteins. Using the MAFFT alignment, protein secondary structures were predicted using HHpred’s Quick 2D structural prediction webserver to demonstrate structural conservation of the histone fold domain between Eukarya and Marseilleviridae (Zimmermann et al., 2018).

Fluorescence localization of Melbournevirus histones in infected A. castellanii cells

The three Melbournevirus (AIT54904) histones encoding genes Mel-368 (MV-H4-H3), Mel-369 (MV-H2B-H2A) and Mel-149 (MV-miniH2B-H2A) were amplified by PCR from genomic DNA and cloned into an in-house modified plasmid derived from the pGAPDH-GFP amoebal expression plasmid (Bateman, 2010). Briefly, the GAPDH promoter was replaced by the stronger EF1 gene promoter and the genes were inserted using the NdeI restriction site to yield C-terminally GFP-tagged proteins. The A. castellanii histone H2A (2) gene (ACA1_364730A) was amplified from Acanthamoeba castellanii (Douglas) Neff (ATCC 30010TM) genomic DNA and cloned into the same plasmid engineered to yield a N-terminally GFP-tagged protein. Acanthamoeba castellanii cells were transfected with 6 μg of each plasmid using Superfect (QIAGEN). Selection of transformed cells was initially performed at 30 μg/mL Neomycin and increased up to 100 μg/mL within a couple of weeks. Transfected A. castellanii cells were grown on poly-L-lysine coating coverslips in a 12-well plate and infected with Melbournevirus at a MOI of 50 except for the negative control. At 30 min, 1h, 2h, 3h and 4h post-infection (pi), cells were fixed with PBS containing 3.7% formaldehyde for 20 min at room temperature. After one wash with PBS buffer, coverslips were mounted on a glass slide with 4 μl of VECTASHIELD mounting medium with DAPI and the fluorescence was observed using a Zeiss Axio Observer Z1 inverted microscope using a 63x objective lens associated with a 1.6x Optovar for DIC, DAPI or GFP fluorescence recording. To perform the co-localization experiment, the previous plasmid was modified to replace GFP with mRFP. We screened several selection markers to allow the selection of the co-transfected cells and Nourseothricin was the only antibiotics highly efficient on Acanthamoeba cells. The Neomycin-resistance gene (neo) was thus replaced by the Nourseothricin-resistance gene (nat1) in the previous plasmid. Briefly, the plasmid was digested using NdeI and XbaI to remove the GFP and the linearized plasmid was gel purified. The mRFP gene was amplified and inserted by homologous recombination using the In-Fusion kit (Takara Bio) according to the manufacturer protocol. The subsequent plasmid was amplified by PCR to get rid of the Neomycin-resistance gene and Nourseothricin-resistance gene was inserted by homologous recombination. The Melbournevirus gene encoding the H3-H4 histone was then cloned using the NdeI site to yield a C-terminally mRFP-tagged protein. The structure of the construct was confirmed by sequencing. Acanthamoeba castellanii cells expressing the MV-H2A-H2B-GFP proteins were transfected with 6μg of the MV-H3-H4-mRFP plasmid and selection of the co-transfected cells was performed using 100μg/mL Neomycin and 30 to 50 μg/mL Nourseothricin. Cells were grown in a 96-well glass bottom plate and infected at a MOI of 50. At 4h pi, the cells were washed with PBS containing 10 μg/mL of DAPI and observed for DIC, DAPI, GFP or mRFP fluorescence recording.

Mass spectroscopy proteomic analyses

The mass spectrometry proteomic of the purified virions was performed as previously described (Fabre et al., 2017). Data are available at the ProteomeXchange Consortium (proteomecentral.proteomexchange.org) via the PRIDE partner repository with the accession code PXD003910. Based on the structure of Melbournevirus (Okamoto et al., 2018) exhibiting a triangulation number T = 309 corresponding to 9240 Major Capsid Proteins, the copy numbers for each protein in the viral proteome was estimated using the computed intensity-based absolute quantification (iBAQ) values.

Marseillevirus-Acanthamoeba castellanii transcriptome

The time-course transcriptome analysis of A. castellanii cells infected by Marseillevirus was performed using the RNA-seq data from (Rodrigues et al., 2020). We first annotated the Marseillevirus genome (accession GU071086) as described (Blanca et al., 2020), as well as the A. castellanii Neff host genome (assembly GCA_000193105.1) using the Funannotate pipeline (version 1.7; https://github.com/nextgenusfs/funannotate). We then mapped the RNA-seq reads corresponding to 30 min pi (ERR3528397), 1 h pi (ERR3528398), 2 h pi (ERR3528399), 4 h pi (ERR3528400), 5 h pi (ERR3528401), 6 h pi (ERR3528402), 8 h pi (ERR3528403), 10 h pi (ERR3528404) and 12 h pi (ERR3528405) to the host and viral transcripts using Bowtie (Langmead and Salzberg, 2012) (version 2.3.4.1). Transcript quantification was done using RSEM (Li and Dewey, 2011) (version 1.3.0) and data normalization using the edgeR R package (Robinson et al., 2010). Host and viral mapped reads were normalized separately with the “TMMwsp” method and then combined to a single matrix. Expression values were then expressed in log transformed counts per million of mapped reads (CPM). Finally, we scaled each transcript expression values to a Z-score and produced a heatmap for the genes of interest. A hierarchical clustering of the heatmap was obtained using a Pearson correlation distance and the “Wards.D2” method.

Knockout generation cloning strategies and analysis

Vectors for knockout of MEL_369 or MEL_368 were generated by inserting 500 base pairs homology arms into the pGAPDH-EGFP vector (Bateman, 2010). Homology arms were amplified from Melbournevirus genomic DNA produced by Wizard genomic DNA purification kit (PROMEGA) according to manufacturer’s specifications. Primers used are presented in Table S1 (HB112-115 and HB120-123 for amplification of homology arms of MEL_368 or MEL_369, respectively), and inserted into the NdeI and XbaI sites by infusion (Takara). The vectors were digested with NdeI and XbaI (NEB) prior to transfection into A. castellanii. A. castellanii cells were transfected with 8 μg of each linearized plasmid using Superfect (QIAGEN) and infection by Melbournevirus was performed 1-hour post transfection. One well was used to analyze the genomic integration using primers presented in Table S1 after 5-hours post-infection. Another well was kept to analyze the infectiousness of neo-synthesized viral particles. To remove the excess of viral particle used for infection, the growth medium was removed 1-hour post-infection and fresh growth medium was added to the culture. Washing of the cells was repeated 5 times. After lysis, the viral progeny was recovered and kept for additional round of infection. Vector plasmids for trans-complementation were generated by directed mutagenesis using infusion (Takara) utilizing the primers HB155-HB160 to insert untagged genes into the pEF1-mRFP-NAT target vector. 6 μg of circular vectors were transfected into A. castellanii using Superfect (QIAGEN). Selection of transformed cells was initially performed at 30 μg/mL Nourseothricin and increased up to 100 μg/mL within a week post transfection. Competition assay were performed by passaging the recovered transfected viruses subsequently. Again, the presence of recombinant DNA was performed by PCR using infected cells 5-hours post infection. Extracellular virions were removed by washing 5 times. Genomic DNA was obtained by Wizard genomic DNA purification kit (PROMEGA) according to manufacturer’s specifications.

Melbournevirus histone doublet purification

Melbournevirus ORF 369 (H2B-H2A), ORF 368 (H4-H3) and ORF 149 (miniH2B-H2A) were cloned into pET-28 plasmid for expression and purification from E.coli, using protocols adapted from eukaryotic histones (Dyer et al., 2004). 6L of E. coli cells were re-suspended and lysed with 80 mL of guanidinium lysis buffer (6 M Guanidinium HCl, 20 mM sodium phosphate pH 6.8 and 2 M NaCl) and sonicated for 30 s at 50% strength for 4 times. Lysate was spun at 16,000 rpm for 25 minutes, and the supernatant was incubated with Nickel NTA Agarose Beads (GoldBio) for approx. 45 minutes at RT. The nickel beads were pelleted, and the supernatant was discarded. The beads were resuspended in 8 M Urea lysis buffer (8 M Urea, 20 mM sodium phosphate pH 6.8, and 2 M NaCl) and incubated for approximately 1 hour at room temperature. Proteins were eluted using histone elution buffer (8 M Urea, 20 mM sodium phosphate pH 6.8, 2 M NaCl, and 500 mM Imidazole). For protein refolding, histones were dialyzed against 1 M Urea Buffer (1 M urea, 1 mM EDTA, 20 mM Tris-Cl pH 7.5 and 2 M NaCl) for 6 hr, followed by overnight dialysis against the same buffer without urea (1 mM EDTA, 20 mM Tris-Cl pH 7.5 and 2 M NaCl). Precipitated protein was removed by centrifugation, and the supernatant was concentrated and run over a size-exclusion column S200 equilibrated in 1 mM EDTA, 20 mM Tris-Cl pH 7.5 and 2 M NaCl. Samples were stored in 20% glycerol at −80°C.

Marseillevirus histone-DNA complex (nucleosome) reconstitution

Melbournevirus H2B-H2A, H4-H3 and DNA of the specified lengths were mixed at a ratio of DNA to MV-H4-H3 to MV-H2B-H2A = 1.0: 2.4: 2.4. Widom 601 DNA at 147 bp and 207bp, and 181 bp of a conserved Melbournevirus native DNA fragment selected from its genome (GC cotent = 45%) were used. Viral NLPs and X.laevis nucleosomes were both reconstituted by gradient dialysis (Dyer et al., 2004). Reconstituted products were analyzed by 5% native-PAGE.

Sucrose gradient sedimentation and gradient fixiation (GraFix) crosslinking

To form the sucrose gradient, 6 ml top solution (50 mM NaCl, 20 mM HEPES, 1 mM EDTA, pH 7.5 and 5% sucrose) was added to a tube (Beckman, 331372). Then 6 mL bottom solution (50 mM NaCl, 20 mM HEPES, 1 mM EDTA, pH 7.5 and 40% sucrose) was slowly added to the bottom until it reached the halfway mark. The tubes were placed on a gradient maker (BioComp Gradient Master) to form a continuous gradient. 200 μl of sample at 4 μM was loaded on top and spun at 4 °C for 18 h at 30,000 rpm (Beckman, Rotor SW-41Ti). Fractioned samples were dialyzed against 50 mM NaCl, 20 mM Tris-Cl, 1 mM EDTA, pH 7.5 and 1 mM DTT to remove the sucrose. For GraFix, the same procedure was applied, except that 10% to 30% of glycerol was used instead of sucrose. In addition, 0.15% of glutaraldehyde was added to the bottom solution to form the continuous density and crosslinker gradient. After GRAFIX, the samples were eluted and dialyzed against 50 mM NaCl, 20 mM Tris-Cl, 1 mM EDTA, pH 7.5 and 1 mM DTT to quench the crosslinking reaction and remove glycerol.

Sedimentation velocity analytical ultracentrifugation (SV-AUC)

SV-AUC with absorbance optics (λ = 260 nm) was used to evaluate the homogeneity of the complexes in solution and to infer their molecular size and shape. Samples (composition as indicated) at 300 nM were spun at 30–35,000 rpm at 20°C in a Beckman XL-A ultracentrifuge in 50 mM NaCl, 20 mM Tris-Cl, 1 mM EDTA, pH 7.5 and 1 mM DTT, using an An60Ti rotor. Partial specific volumes of samples were estimated using UltraScan III version 4.0; this program was also used for all data analysis (Demeler et al., 2014; Gorbet et al., 2014). Time-invariant and radial-invariant noise contributions were subtracted from the experimental sedimentation velocity data by 2-dimensional Spectrum Analysis (2DSA), followed by refinement using the Genetic Algorithm-Monte Carlo (GA-MC) approach (Brookes et al., 2010). Sedimentation coefficients and molecular weights in Table 1 were extracted from resulting GA-MC models. Modeling calculations were performed on the UltraScan LIMS clusters at the Bioinformatics Core Facility at University of Texas Health Science Center at San Antonio and Lonestar cluster at Texas Advanced Computing Center. Integral sedimentation coefficient distributions (G(s)) were obtained from noise-corrected experimental data with the Enhanced van Holde-Weischet Analysis method. G(s) plot figures were created using python (Edwards et al., 2020) (https://github.com/Luger-Lab/AUC-analysis).

Atomic force microscopy (AFM)

All samples were imaged in air on JPK/Bruker NanoWizard Sa with TAP300-Gold (Ted Pella) cantilevers using fast scan glass block. Images were usually scanned at 1.5 × 1.5 μm at 1- 3 Hz. All buffers were filtered through 0.02 μM filters (Anotop) before use. Mica was freshly cleaved, treated with APTES for 30 min, rinsed with water and then dried with nitrogen gas utilizing a 0.22 μm PES filter. Samples were serially diluted in TE (20 mM Tris-HCl pH 7.5, 1 mM EDTA) to a final concentration of 2 nM. Within 2 minutes of final dilution, 30 μl sample was placed on the ATPES mica slide for 2 minutes, rinsed with 1.0 mL of water and dried with filtered N2 gas. Images were analyzed using JPK SPM software V 6.1.158. Each 1.5 × 1.5 μm image was divided into 4 quadrants and digitally zoomed to obtain clearer visuals of separated particles. Lines were drawn and dragged through each particle with the greatest height of each recorded. Analysis of heights were completed in either MS Excel and/or GraphPad Prism.

Single particle cryo electron microscopy (cryo EM) and data processing

Complexes of MV-NLPs from GraFix were concentrated to 2-3 μM using Amicon Ultra-4 centrifugal filters (Ultracel 50K, Millipore). C-Flat 1.2/1.3 (Au) grids were glow discharged (EMItec, Lohmar, DE) at 40 mA for 30 s. 4 μl sample was applied onto the grid before manual plunge freezing into ethane. Images for MV-NLP with 147 bp “601” DNA were acquired at nominal magnification of 64000x on a FEI Titan Krios (300 kV), equipped with a Gatan K3 Summit direct detector. Pixel size was 1.065 Å. The movies were captured in super resolution mode with electron dose rate at 10.5 electrons per pixel per second for 5.4 s and 0.108 s per frame. Defocus range was −0.8 to −2.0 μm. Images for MV-NLP with 207 bp “601” DNA were acquired at nominal magnification of 29000x on a FEI Tecani F20 (200 kV), equipped with a Gatan K3 Summit direct detector. Pixel size was 1.219 Å. The movies were captured in super resolution mode with electron dose rate at 10 electrons per pixel per second for 8 s and 0.2 s per frame. Defocus range was −1.0 to −2.5 μm. Both datasets were processed (Motion correction and CTF estimation) by cryoSPARC (v2.12.4) (Zhang, 2016; Zheng et al., 2017). Images were evaluated by inspecting the CTF fitting resolution. Roughly 1000 particles were then manually picked for generating autopicking templates through 2D classification in CryoSPARC. For MV-NLP with 147bp DNA, this yielded 2,648,493 particles across 6,552 micrographs, and these particles were subjected to four iterative rounds of 2D classification in order to discard bad particles. Two ab initio models were then generated, followed by heterogeneous refinement of the nucleosome-like class (34,418 particles). Non-uniform refinement, as well as global and local CTF refinements, were performed to improve the density to the final resolution, as estimated by GFSC. For MV-NLP with 207bp DNA, template-based particle picking yielded ∼2,131,563 particles across 3,495 micrographs, and these particles were subjected to four iterative rounds of 2D classifications to get best nucleosome particles. Two ab initio models were then generated, followed by heterogeneous refinement of the nucleosome like class. Non-uniform refinement with 377,702 particles, were performed to improve the density to the final resolution, as estimated by GFSC. Initial models were built by fitting MV histone homology models (see below) into the final 3D electron maps in UCSF Chimera, yielding an initial correlation coefficient of 0.7086 (Pettersen et al., 2004). These models were then iteratively modified and locally refined in COOT (Emsley and Cowtan, 2004).Molecular dynamics flexible-fitting (MDFF, details below) was then used utilized to simultaneously balance correlation with the EM density and atomic-level energy evaluation (electrostatic configurations and removal of steric overlaps).

Homology modeling

Initial homology models of Melbournevirus ORF 369 (H2B-H2A) and ORF 368 (H4-H3) doublets were constructed using SWISSMODEL, and the histone structures from the Xenopus laevis nucleosome (PDB 1AOI) were used as a reference (Bienert et al., 2017; Guex et al., 2009; Waterhouse et al., 2018). Using Modeler (v9.20), 10,000 loops of each doublet (H2B-H2A Y152 to S192 and H4-H3 F122 to T203, with sequence numbers identified as in Figure S1A) were generated (Sali and Blundell, 1993). To eliminate improbable loops within each doublet, clash identification was performed using CPPTRAJ of the Amber MD package (v18), where a cutoff distance of 0.8 Å between potentially over-lapping atoms was used (Roe and Cheatham, 2013). Any loops identified by CPPTRAJ without a clash were manually parsed through to eliminate loops that were not sterically overlapping but also not physically relevant, such as those forming “knots” in the histone folds. While several configurations with reasonable qualitative agreement to the EM density were identified (Figure S4), the loop with the overall best correlation with the density was selected as the initial conformation for further refinement via COOT (Emsley et al., 2010), PHENIX (Adams et al., 2010), and MDFF (see below) (McGreevy et al., 2016). Figures were rendered using Chimera and VMD (Humphrey et al., 1996).

Molecular dynamics flexible fitting (MDFF) protocol

MDFF simulations were initialized from the PHENIX-refined structure based upon our initial homology modeling. This method improves fitting to EM data by introducing external forces, which bias the structure from its initial conformation into the empirical density. Simulations were conducted using the GPU-enhanced NAMD engine (v2.13) with the CHARMM36 forcefield in an implicit solvent environment (Chan et al., 2012; Huang and MacKerell, 2013), biasing forces were applied to the protein backbone and heavy atoms of the DNA, in accordance with the level of resolution provided by the density. The simulation timestep was 2 fs, and a total simulation time of 5 ns was sufficient for convergence into the empirical density. Distance- and angle-based restraints of 20 kcal/mol/Å2 and 20 kcal/mol/degree2 were applied to structured regions to prevent over-fitting of the density by breaking of domains with known secondary structure, and dihedral restraints of 50 kcal/mol/degree2 were applied to maintain proper chirality during the biasing process. Over the course of the simulation, correlation between the model and experimental density only showed a modest improvement from 0.761 to 0.766, but significant benefits to residue conformations were observed, such as the removal of drastic Φ and Ψ orientations or the repositioning of sidechains that were originally used to explain main chain densities due to overfitting by real-space refinements in PHENIX.

Structural characterization of the MV-NLP147 model

Comparisons between MV-NLP147 and eNuc147 (PDB 3LZ0; (Vasudevan et al., 2010)) were conducted using VMD, and the protein backbone (N, C, and Cα) and DNA sugar ring (C1’, C2′, C3′, C4’, and C5′) atoms were used for RMSD and center-of-geometry calculations (Table S3). To minimize discrepancies between each system, the eNuc147 molecule was least-squares fit to the MV-NLP147 using the core α helices and central 121 bp of DNA as a reference. In this way, RMSD values represent the changes in the global positioning of each constituent, rather than measuring the conformational rearrangement within each individual piece, which was altogether quite low (typically ∼1-2 Å). MV-H2B-H2A dimer reorientation, relative to the eukaryotic H2A-H2B dimer, was determined by placing fictitious particles at the geometric centers of the first and last turns of the H2B α2 helix and calculating the dihedral angle describing this imaginary atom arrangement (Figure S4A). In the eNuc147 system, the fictitious atoms describing the C-terminal end of each helix were not located at the final turn of the helix, but rather at the turn within the helix that would yield an analogous length of the H2B α2 helix in MV-NLP147, to provide the best direct comparison of the two systems. Larger magnitude values in this measurement correspond to dimers that are angled further away from the dyad-axis, as well as from one another.

Quantification and statistical analysis

Statistical analyses and software used, as well as statistical details are found in the figure and table legends, as well as in the STAR Methods.
REAGENT or RESOURCESOURCEIDENTIFIER
Bacterial and virus strains

Rosetta 2(DE3)pLysSEMD Millipore71401-4
MelbournevirusIGS labN/A, available upon request

Chemicals, peptides, and recombinant proteins

SuperFect transfection reagentQIAGENCat# 301305
GlutaraldehydeEMSCAS #111-30-8
APTES for AFM slideFischer Scientific# AC151081000
FormaldehydeSigma-Aldrich252549
Vectashield mounting medium with DAPIVector LaboratoriesH-1200

Deposited data

MV NLP147 structureThis paperPDB: 7N8N
EM map of MV NLP147This paperEMD-24238
MS proteomics dataThis paperProteomeXchange Consortium
PRIDE accession code: PXD003910
Transcriptome data
Rodrigues et al., 2020NCBI Short Read Archive https://www.ncbi.nlm.nih.gov/Traces/sra/sra.cgi
Front MicrobiolSRA accession code
https://doi.org/10.3389/fmicb.2020.00651ERP117375

Experimental models: Organisms/strains

Acanthamoba castellaniiATCC30010

Oligonucleotides

All listed in Table S1This paperThis paper Table S1

Recombinant DNA

pET-MV-H2B-H2AThis paperN/A, available upon request
pET-MV-H4-H3This paperN/A, available upon request
pET-MV-miniH2B-H2AThis paperN/A, available upon request
pEF1-Mel368-GFP (GeneticinR)This paperN/A, available upon request
pEF1-Mel369-GFP (GeneticinR)This paperN/A, available upon request
pEF1-Mel149-GFP (GeneticinR)This paperN/A, available upon request
pEF1-H2A(A.castellanii)-GFP (GeneticinR)This paperN/A, available upon request
pEF1-Mel368-mRFP (NourseothricinR)This paperN/A, available upon request
pEF1- Mel368 (NourseothricinR)This paperN/A, available upon request
pEF1-Mel369 (NourseothricinR)This paperN/A, available upon request
5′ Mel368- GFP-3′ Mel368This paperN/A, available upon request
5′ Mel369- GFP-3′ Mel369This paperN/A, available upon request

Software and algorithms

AxioVision rel. 4.8.2 SoftwareCarl Zeisshttps://www.zeiss.com/corporate/us/home.html
JPK SPM software for AFM image processingV 6.1.158https://customers.jpk.com/
GraphPad Prismversion 9.0.0https://www.graphpad.com:443/
CryoSPARCV2.1https://cryosparc.com/
UCSF ChimeraV1.14https://www.cgl.ucsf.edu/chimera/
winCOOTV0.9.4.1https://www2.mrc-lmb.cam.ac.uk/personal/pemsley/coot/
UltraScan III (ver 4.0)Demeler et al., 2014https://ultrascan3.aucsolutions.com/
UltraScan LIMS ClusterDemeler et al., 2014https://uslims4.aucsolutions.com/
SV-AUC Visualization Scripts (Python3)Edwards et al., 2020https://github.com/Luger-Lab/AUC-analysis
SWISS-MODEL webserverWaterhouse et al., 2018https://swissmodel.expasy.org/interactive
MODELER (ver 9.20)Sali and Blundell, 1993https://salilab.org/modeller/
NAMD (CUDA-accelerated, ver 2.13)Chan et al., 2012https://www.ks.uiuc.edu/Research/namd/
CHARMM36 Forcefield ParametersHuang and MacKerell, 2013http://mackerell.umaryland.edu/charmm_ff.shtml#charmm
VMD (ver 1.9.3)Humphrey et al., 1996.https://www.ks.uiuc.edu/Research/vmd/
CPPTRAJ (ver 18)Roe and Cheatham, 2013https://ambermd.org/AmberTools.php
  57 in total

1.  Reconstitution of nucleosome core particles from recombinant histones and DNA.

Authors:  Pamela N Dyer; Raji S Edayathumangalam; Cindy L White; Yunhe Bao; Srinivas Chakravarthy; Uma M Muthurajan; Karolin Luger
Journal:  Methods Enzymol       Date:  2004       Impact factor: 1.600

2.  Coot: model-building tools for molecular graphics.

Authors:  Paul Emsley; Kevin Cowtan
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2004-11-26

3.  VMD: visual molecular dynamics.

Authors:  W Humphrey; A Dalke; K Schulten
Journal:  J Mol Graph       Date:  1996-02

4.  Asymmetric unwrapping of nucleosomal DNA propagates asymmetric opening and dissociation of the histone core.

Authors:  Yujie Chen; Joshua M Tokuda; Traci Topping; Steve P Meisburger; Suzette A Pabit; Lisa M Gloss; Lois Pollack
Journal:  Proc Natl Acad Sci U S A       Date:  2016-12-27       Impact factor: 11.205

5.  Histone Sprocket Arginine Residues Are Important for Gene Expression, DNA Repair, and Cell Viability in Saccharomyces cerevisiae.

Authors:  Amelia J Hodges; Isaura J Gallegos; Marian F Laughery; Rithy Meas; Linh Tran; John J Wyrick
Journal:  Genetics       Date:  2015-05-12       Impact factor: 4.562

6.  Expression plasmids and production of EGFP in stably transfected Acanthamoeba.

Authors:  Erik Bateman
Journal:  Protein Expr Purif       Date:  2009-10-28       Impact factor: 1.650

7.  Decoding the centromeric nucleosome through CENP-N.

Authors:  Satyakrishna Pentakota; Keda Zhou; Charlotte Smith; Stefano Maffini; Arsen Petrovic; Garry P Morgan; John R Weir; Ingrid R Vetter; Andrea Musacchio; Karolin Luger
Journal:  Elife       Date:  2017-12-27       Impact factor: 8.140

8.  Structural basis of nucleosome dynamics modulation by histone variants H2A.B and H2A.Z.2.2.

Authors:  Min Zhou; Linchang Dai; Chengmin Li; Liuxin Shi; Yan Huang; Zhenqian Guo; Fei Wu; Ping Zhu; Zheng Zhou
Journal:  EMBO J       Date:  2020-10-19       Impact factor: 11.598

9.  Analysis of a Marseillevirus Transcriptome Reveals Temporal Gene Expression Profile and Host Transcriptional Shift.

Authors:  Rodrigo Araújo Lima Rodrigues; Amina Cherif Louazani; Agnello Picorelli; Graziele Pereira Oliveira; Francisco Pereira Lobo; Philippe Colson; Bernard La Scola; Jônatas Santos Abrahão
Journal:  Front Microbiol       Date:  2020-04-14       Impact factor: 5.640

10.  Rapid poxvirus engineering using CRISPR/Cas9 as a selection tool.

Authors:  Anjali Gowripalan; Stewart Smith; Tijana Stefanovic; David C Tscharke
Journal:  Commun Biol       Date:  2020-11-03
View more
  11 in total

Review 1.  Giant virus biology and diversity in the era of genome-resolved metagenomics.

Authors:  Frederik Schulz; Chantal Abergel; Tanja Woyke
Journal:  Nat Rev Microbiol       Date:  2022-07-28       Impact factor: 78.297

2.  Profile of Karolin Luger.

Authors:  Jennifer Viegas
Journal:  Proc Natl Acad Sci U S A       Date:  2022-08-08       Impact factor: 12.779

3.  A phylogenetic and proteomic reconstruction of eukaryotic chromatin evolution.

Authors:  Xavier Grau-Bové; Cristina Navarrete; Cristina Chiva; Thomas Pribasnig; Meritxell Antó; Guifré Torruella; Luis Javier Galindo; Bernd Franz Lang; David Moreira; Purificación López-Garcia; Iñaki Ruiz-Trillo; Christa Schleper; Eduard Sabidó; Arnau Sebé-Pedrós
Journal:  Nat Ecol Evol       Date:  2022-06-09       Impact factor: 19.100

4.  Eukaryogenesis: The Rise of an Emergent Superorganism.

Authors:  Philip J L Bell
Journal:  Front Microbiol       Date:  2022-05-11       Impact factor: 6.064

Review 5.  Viral histones: pickpocket's prize or primordial progenitor?

Authors:  Paul B Talbert; Karim-Jean Armache; Steven Henikoff
Journal:  Epigenetics Chromatin       Date:  2022-05-28       Impact factor: 5.465

6.  A phylogenomic framework for charting the diversity and evolution of giant viruses.

Authors:  Frank O Aylward; Mohammad Moniruzzaman; Anh D Ha; Eugene V Koonin
Journal:  PLoS Biol       Date:  2021-10-27       Impact factor: 8.029

7.  Novel Classes and Evolutionary Turnover of Histone H2B Variants in the Mammalian Germline.

Authors:  Pravrutha Raman; Mary C Rominger; Janet M Young; Antoine Molaro; Toshio Tsukiyama; Harmit S Malik
Journal:  Mol Biol Evol       Date:  2022-02-03       Impact factor: 16.240

Review 8.  Nucleosome Structures Built from Highly Divergent Histones: Parasites and Giant DNA Viruses.

Authors:  Shoko Sato; Mariko Dacher; Hitoshi Kurumizaka
Journal:  Epigenomes       Date:  2022-08-02

Review 9.  Nucleosome-Omics: A Perspective on the Epigenetic Code and 3D Genome Landscape.

Authors:  Siyuan Kong; Yuhui Lu; Shuhao Tan; Rongrong Li; Yan Gao; Kui Li; Yubo Zhang
Journal:  Genes (Basel)       Date:  2022-06-22       Impact factor: 4.141

10.  RNA Sequencing of Medusavirus Suggests Remodeling of the Host Nuclear Environment at an Early Infection Stage.

Authors:  Ruixuan Zhang; Hisashi Endo; Masaharu Takemura; Hiroyuki Ogata
Journal:  Microbiol Spectr       Date:  2021-09-29
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.