| Literature DB >> 32371601 |
Claire M Metrick1,2, Andrea L Koenigsberg1,3, Ekaterina E Heldwein4.
Abstract
Entities:
Keywords: RNA-binding protein; biomolecular condensate; conformational flexibility; herpesvirus; intrinsically disordered protein (IDP); liquid-liquid phase separation (LLPS); small‐angle X‐ray scattering (SAXS); structural model; tegument; viral assembly; viral protein
Mesh:
Substances:
Year: 2020 PMID: 32371601 PMCID: PMC7403781 DOI: 10.1128/mBio.00810-20
Source DB: PubMed Journal: mBio Impact factor: 7.867
FIG 1Alignment and characteristics of UL11 homolog sequences. (A) Sequences of UL11 homologs from herpesviruses aligned to HSV-1 UL11 with HSV-1 residue numbers marked. Human virus sequences used include HSV-1 strain 17 UL11 (RefSeq accession no. YP_009137085.1), HSV-2 strain HG52 UL11 (RefSeq YP_009137162.1), VZV strain Dumas ORF49 (RefSeq NP_040171.1), EBV strain B95-8 BBLF1 (RefSeq YP_401686.1), CMV strain AD169 UL99 (RefSeq P13200.3), HHV-6A strain Uganda-1102 U71 (RefSeq NP_042964.1), HHV-6B strain Z29 U71 (RefSeq NP_050250.1), HHV-7 strain JI U71 (RefSeq YP_073811.1), and KSHV strain GK18 ORF38 (RefSeq YP_001129391.1). Other representative animal virus sequences used include Marek’s disease virus (MDV) strain Md5 UL11 (RefSeq YP_001033939.1), pseudorabies virus (PRV) composite strain UL11 (RefSeq YP_068364.1), murine cytomegalovirus (MCMV) strain Smith UL99 (RefSeq YP_214100.1), saimiriine herpesvirus 2 (SaHV-2) ORF38 (RefSeq NP_040240.1), and equine herpesvirus 2 (EHV-2) strain 86/67 myristoylated tegument protein (RefSeq NP_042635.1). All sequences show the NCBI reference sequence accession number or code in parentheses. Conserved residues are marked with an asterisk. Myristoylated glycines and palmitoylated cysteines, experimentally determined or predicted, are shown in italicized type in cyan text. Acidic clusters, experimentally defined or predicted, are boxed in red. Groups of basic residues are boxed in blue. Beta strands and alpha helices predicted by PSIPRED (http://bioinf.cs.ucl.ac.uk/psipred/) are highlighted in yellow and light teal, respectively. Disorder-promoting residues (A/G/S/P/Q/E/R/K) are colored in magenta. Residues predicted to be disordered by DISOPRED3 (http://bioinf.cs.ucl.ac.uk/psipred/) are underlined in gray. (B) Representation of disorder by residue in HSV-1 UL11 as predicted by FoldUnfold (http://bioinfo.protres.ru/ogu/). Residues predicted to be in natively folded regions are shown in blue, and residues predicted to be in unfolded regions are shown in red. Residues scoring below the threshold (black line) but surrounded by folded residues are to be treated as folded and are shown in cyan.
FIG 2Expression and purification of UL11 constructs. (A) Schematic representation of UL11 sequence and UL11 constructs used in this work. Protease recognition sites are shown in gray. Construct names and molecular masses calculated from the sequence (MMCalc) and determined experimentally (this paper) (by SEC, MS, and SEC-SAXS) are shown. WT, wild type; H6, His6 tag; GST, glutathione S-transferase tag; StII, Strep-tag II. (B) Purification scheme for UL11 constructs. (C) Coomassie blue-stained gels of purified UL11 constructs.
FIG 3UL11 copurifies with RNA. (A) Fast protein liquid chromatography (FPLC) shows that UL11-StII is separated from copurifying nucleic acids on heparin resin. Protein, but not nucleic acids (NAs), binds heparin resin and elutes with a salt gradient (conductivity). UL11-StII is present in the eluted fraction, but not in the unbound fraction. The y axis shows absorbance or conductivity (in arbitrary units [AU]). Samples were resolved by SDS-PAGE and stained with Coomassie blue. The A260/A280 ratio shown below the SDS-PAGE confirms the presence of nucleic acids in the input fraction but not in the eluted fraction. (B) Nucleic acids that copurify with UL11-StII in acidic or neutral phenol-chloroform (P:C) are susceptible to digestion by RNase, but not DNase. The banding pattern is characteristic of E. coli rRNA, as marked. Samples were resolved on a formaldehyde agarose gel and stained with ethidium bromide. The gel in panel A was split to remove unrelated lanes, but contrast settings remain consistent between related gels.
FIG 4UL11 is an elongated monomer. (A) MALDI-TOF spectrum of UL11-StII. The x axis shows the mass-to-charge ratio (m/z) in daltons and the y axis shows relative abundance (%) scaled for the mass range in x. (B) Size exclusion chromatography (SEC) A280 traces and gels. The elution volumes of calibration standards used to calculate apparent molecular masses (kDa) are depicted in gray and with gray arrows. (C) Coomassie blue-strained gel of UL11 incubated with increasing amounts of BS3 cross-linker.
FIG 5UL11 undergoes liquid-liquid phase separation. (A) Microscopic images of UL11 constructs in solution show representative images of liquid-liquid phase separation (LLPS) seen under a variety of conditions, including purification buffer and for H6-UL11, crystallization screens (1:1 mix of protein-solution in vapor diffusion chamber). (B) Macroscopic images of UL11-StII in solution in purification buffer show reversibility of UL11-StII phase separation (cloudiness) upon temperature cycling. (C) Time lapse microscopic images of UL11-StII condensates in purification buffer display hallmark liquid-like properties of surface wetting and droplet merging (selected example in orange box). (D) LLPS (separation, orange circles; clear solution, blue triangles) occurs with increasing protein concentration, increasing salt concentration, and increasing molecular crowding.
FIG 6UL11 lacks defined secondary structure and is sensitive to proteolysis. (A) CD spectra of UL11-StII at several concentrations show primarily random-coil characteristics. UL11-StII undergoes LLPS (LLPS, orange circles; clear solution, blue triangles) with increasing concentration in CD buffer at ambient temperature. See also Table 1. (B) Limited proteolysis of UL11-StII using trypsin, chymotrypsin, V8 protease, or proteinase K with increasing amounts of protease. (C) Sequence of UL11-StII with potential proteolysis sites colored (green, chymotrypsin; pink, trypsin; purple, V8). V8 fragment identified by MALDI-TOF mass spectrometry and N-terminal sequencing (see Fig. S1 in the supplemental material) is underlined in purple. The StII sequence is double underlined.
CD structural parameters
| UL11 | Reference set | NRMSD | % of residues | |||||
|---|---|---|---|---|---|---|---|---|
| Helix1 | Helix2 | Strand1 | Strand2 | Turns | Unordered | |||
| 0.46 | SP175 | 0.043 | 0.08 | 0.12 | 0.12 | 0.11 | 0.17 | 0.4 |
| Set 6 | 0.056 | 0.03 | 0.04 | 0.21 | 0.09 | 0.17 | 0.46 | |
| Set 7 | 0.041 | 0.06 | 0.1 | 0.19 | 0.09 | 0.21 | 0.35 | |
| Avg | 0.06 | 0.09 | 0.17 | 0.10 | 0.18 | 0.40 | ||
| 0.23 | SP175 | 0.034 | 0.01 | 0.07 | 0.17 | 0.13 | 0.16 | 0.46 |
| Set 6 | 0.023 | 0.04 | 0.03 | 0.13 | 0.08 | 0.13 | 0.59 | |
| Set 7 | 0.024 | 0.02 | 0.04 | 0.12 | 0.07 | 0.14 | 0.58 | |
| Avg | 0.02 | 0.05 | 0.14 | 0.09 | 0.14 | 0.54 | ||
| 0.13 | SP175 | 0.019 | 0.02 | 0.06 | 0.15 | 0.12 | 0.17 | 0.49 |
| Set 6 | 0.022 | 0.02 | 0.04 | 0.11 | 0.07 | 0.12 | 0.64 | |
| Set 7 | 0.023 | 0.02 | 0.04 | 0.07 | 0.05 | 0.1 | 0.72 | |
| Avg | 0.02 | 0.05 | 0.11 | 0.08 | 0.13 | 0.62 | ||
The reference set is the set of curves used in CDSSTR (53) to estimate secondary structure content from experimental curves.
NRMSD, normalized root mean square deviation between calculated and experimental CD spectra.
The percentage of residues in a regular helix (93) (Helix1), distorted helix (Helix2), regular strand (Strand1), distorted strand (Strand2), and beta turns (Turns) and the percentage of residues lacking secondary structure (Helix1, Helix2, Strand1, Strand2, and Turns) (Unordered).
FIG 7SAXS analysis of UL11. (A) Trace of integrated scattering intensities for each frame collected in the UL11-StII SEC-SAXS experiment. Buffer curves (frames shaded gray) were averaged and subtracted from each scattering curve to calculate R (diamonds) frame by frame. Data from curves giving a consistent R within the major SEC peak (frames shaded blue) were averaged, and the buffer average was subtracted from this sample average to give the subtracted SAXS curve in panel B. (B) Subtracted SAXS curve for UL11-StII with Guinier plots (inset, including linear fits) at the low-angle region (qRg < 1.3). These regions were selected with AUTORG such that residuals were evenly distributed around zero. Data points in gray were excluded from the Guinier analysis. (C) Pair distance distribution functions [P(r)] for UL11-StII calculated from scattering curves in panel B, comparing curves calculated with Dmax forced to zero (inset) or not forced to zero (main panel). In the main panel, the faded blue line shows the continuation of the UL11 P(r) after the Dmax when forced to zero. (D) Normalized (dimensionless) Kratky plot was calculated from scattering curve in panel B. Parameters from these data are also summarized in Table 2. Data for UL11-StII have been deposited into the Small Angle Scattering Biological Data Bank (SASBDB) under code SASDEX4.
SAXS structural parameters
| Parameter | Value(s) for UL11-StII |
|---|---|
| Guinier analysis | |
| | 0.00477 ± 0.000041 |
| | 24.29 ± 2.0 |
| | 0.01254 |
| | 1.30239 |
| Coefficient of correlation, | 0.938 |
| | 0.004748 ± 0.00003566/0.004866 ± 0.00009504 |
| | 24.74 ± 0.1599/26.29 ± 1.712 |
| | 76/120 |
| | 0.0074 − 0.2802/0.0074 − 0.2802 |
| χ2 (total estimate from GNOM) | 1.0385 (0.8640)/0.9935 (0.7203) |
| AMBIMETER analysis of | |
| Compatible shape categories | 720/153 |
| Ambiguity score | 2.857/2.185 |
| Uniqueness | Highly ambiguous/might be ambiguous |
I(0), scattering intensity at zero angle.
R, radius of gyration.
qmin, lowest q value used in the Guinier analysis.
qR max, maximum q value used in the Guinier analysis × Rg.
R2, linear fit of the Guinier plot.
(forced to 0/not forced) = P(r) forced to 0 at Dmax in DATGNOM (91)/P(r) not forced to 0.
Dmax, maximum dimension calculated from P(r).
q range, q values used in P(r) plot.
χ2 fit of P(r) to data. The total quality estimate from GNOM (91) is shown in the parentheses.
The number of shape topologies, M, compatible with P(r) evaluated by AMBIMETER (94).
The ambiguity score is log M and correlates with probability of finding a false-positive result in a three-dimensional (3D) reconstruction (94).
Uniqueness of 3D reconstruction of P(r) evaluated by AMBIMETER (94).
Dimensional and flexibility characteristics of optimized ensembles
| Pool or ensemble | Parameter or | Proportion (%) | χ2 | |||
|---|---|---|---|---|---|---|
| Random pool | ||||||
| Optimized ensemble 1 | Compact | 20.75 | 59.61 | 44.4 | ||
| Intermediate | 27.35 | 81.195 | 44.4 | |||
| Extended | 35.95 | 111.43 | 11.1 | |||
| Optimized ensemble 2 | Compact | 21.38 | 61.47 | 54.6 | ||
| Intermediate | 28.59 | 79.97 | 36.4 | |||
| Extended | 37.55 | 109.55 | 9.1 | |||
| Optimized ensemble 3 | Compact | 21.94 | 63.63 | 55.5 | ||
| Intermediate | 27.78 | 81.53 | 33.3 | |||
| Extended | 35.79 | 107.3 | 11.1 | |||
| Avg ensemble | Compact | 21.36 | 61.57 | 51.5 | ||
| Intermediate | 27.91 | 80.90 | 38.0 | |||
| Extended | 36.43 | 109.43 | 10.4 | |||
The average values for the random pool or ensembles are shown in italic boldface type.
R, radius of gyration of structural models.
Dmax, maximum dimension of structural models.
Proportion is the percentage of curves representing each category within the pool or ensemble.
Rflex is a measurement of information entropy, correlated with conformational flexibility, in the pool or ensemble. Rflex would be 100% for a totally flexible system versus 0% for an ideal rigid system.
χ2, measurement of fit between the SAXS curve generated from the optimized ensemble of models and experimental data.
The random pool is 10,000 structural models representing UL11-StII generated by RANCH within EOM 2.0 (58, 59).
Optimized ensembles 1, 2, and 3 are from the random pool that together represent the experimental data as chosen by GAJOE within EOM 2.0 (58, 59).
Compact, category of models with R< 24 and Dmax < 71.
Intermediate, category of models with 26 < R< 31 and 77 < Dmax < 83.
Extended, category of models with R> 35 and Dmax > 107.
FIG 8Ensemble optimization modeling (EOM) of UL11-StII. (A and B) Histograms comparing the distribution of R values (A) and Dmax values (B) between a pool of 10,000 randomly generated structures (gray) and three independently optimized representative ensembles of structures (blues in panel A; purples in panel B) indicate the presence of three conformations. (C) Representative structures of the conformations found in the optimized ensembles are shown with the average R (blue) and Dmax (purple) values and the percentage of the ensemble they represent. These models suggest possible behavior of the flexible protein in solution, but they do not represent the only solution. These values are also summarized in Table 3.
FIG 9Predicted disorder in HSV-1 tegument proteins. Predicted flexibility of HSV-1 tegument proteins as calculated by FoldUnfold (http://bioinfo.protres.ru/ogu/). Folded residues are indicated in blue, while unfolded residues are indicated in red. Protein length is not shown to scale.