| Literature DB >> 32250657 |
Robert C Penner1,2.
Abstract
Earlier analysis of the Protein Data Bank derived the distribution of rotations from the plane of a protein hydrogen bond donor peptide group to the plane of its acceptor peptide group. The quasi Boltzmann formalism of Pohl-Finkelstein is employed to estimate free energies of protein elements with these hydrogen bonds, pinpointing residues with a high propensity for conformational change. This is applied to viral glycoproteins as well as capsids, where the 90th+ percentiles of free energies determine residues that correlate well with viral fusion peptides and other functional domains in known cases and thus provide a novel method for predicting these sites of importance as antiviral drug or vaccine targets in general. The method is implemented at https://bion-server.au.dk/hbonds/ from an uploaded Protein Data Bank file.Entities:
Keywords: antiviral drugs; vaccine targets; viral glycoproteins
Year: 2020 PMID: 32250657 PMCID: PMC7591380 DOI: 10.1089/cmb.2020.0120
Source DB: PubMed Journal: J Comput Biol ISSN: 1066-5277 Impact factor: 1.479
FIG. 1.Two peptide groups, P, P, are depicted on the left, participating in a hydrogen bond with donor P and acceptor P. The planes of these peptide groups are illustrated in gray. There is a unique 3d rotation carrying the (oriented) xz plane to the gray plane for P and sending the positive x-axis to the ray , and likewise for P. The composition illustrated on the right is the rotation in SO(3) associated to the pair P, P. See Section 5.2. for details.
FIG. 2.As explained in Section 5.2., SO(3) may be visualized as a 3D ball of radius π. Presented here are 81 horizontal slices of the histogram of backbone hydrogen bond (BHBs) in HQ60 in this ball from north to south pole colored by population density from Penner et al. (2014), where the R-Y-G-B color is linear in the density ranging from 19,000 to 1.
FIG. 3.Histogram of Π-values and of flanking Dictionary of Secondary Structure for Proteins (DSSP) secondary structure types across HQ60. (a) Histogram of Π(p) = ln(d(m)/d(p)) for all BHBs across HQ60. The x-axis corresponds to the indicated intervals of II-values achieved for the BHBs in HQ60, and the y-axis indicates the number of occurrences in HQ60 within each interval of size 0.18. (b) Population of flanking DSSP secondary structure types H (α helix), E (β strand), C (coil), B (β bridge), G (310 helix), 1 (π helix), S (bend), and T (turn) across the range of Π-values divided by 10 along the x-axis.
FIG. 4.Compare with Figure 6 in White et al. (2008), to which these images are aligned. Blue indicates non-exotic, and yellow, orange, and red, respectively, correspond to Π-values at least 7.5, 8.5, and 9.5. Influenza hemagglutinin (HA; a,b) prefusion and (c,d) postfusion. Paramyxovirus glycoprotein F (e,f) prefusion and (g,h) postfusion. Tick-borne encephalitis glycoprotein E (i) prefusion and (j,k) postfusion. Vesicular stomatitis glycoprotein G (l,m) prefusion and (n,o) postfusion.
FIG. 6.Paramyxovirus F prefusion, chain A on the left and chain B on the right. There is only approximate consensus on significant free energies between chains A and B and chain C in Figure 4f.
Exotic BHB Donor/Acceptor Residues Viral Glycoproteins for Test Cases
Donor/acceptor residues of BHBs in order of non-decreasing Π-values, with 7.5, 8.5, 9.5, and 9.85, respectively, corresponding to percentiles 90, 95, 99, and 100. The residues lying in generally agreed upon fusion loops are shown in bold.
Fusion loop is missing from the structure and therefore a fortiori contains no exotic BHBs.
FIG. 5.Influenza type 2 HA pre- and postfusion, both HA1 and HA2. Chains E and F are depicted on the left, and full glycoprotein on the right.
Conformationally Active and Exotic Residues in Test Cases
| Viral glycoprotein | [ | Further than one away from active | Active | One away from active | [ |
|---|---|---|---|---|---|
| Influenza glycoprotein HA chain F | 122 | 70 | 33 | 19 | 7 |
| Paramyxovirus glycoprotein F chain A | 422 | 81 | 251 | 90 | 62 |
| Tick-borne encephalitis glycoprotein E chain A | 376 | 120 | 148 | 108 | 34 |
| Vesicular stomatitis glycoprotein G chain A | 409 | 140 | 138 | 131 | 72 |
Displayed are the data upon which the p-values in Table 3 are based. For each virus, the pre- and postfusion PDB files are aligned in order to compare the change of conformational angles during reconformation.
Residues is the number of residues common to the aligned pre- and postfusion conformation PDB files, and [#]Exotic is the number of exotic prefusion residues, namely the number of predictions to be made.
Distance d to Nearest Active Residue for Exotic Residues
| Viral glycoprotein | d = 0 | d = 1 | d = 2 | d > 2 | First | Second |
|---|---|---|---|---|---|---|
| Influenza glycoprotein HA2 chain F | 2/1 | 2/1 | 0/0 | 0/1 | ||
| Paramyxovirus glycoprotein F chain A | 27/15 | 6/8 | 1/1 | 3/1 | ||
| Tick-borne encephalitis glycoprotein E chain A | 7/7 | 2/9 | 0/5 | 2/2 | ||
| Vesicular stomatitis glycoprotein G chain A | 17/4 | 12/15 | 2/10 | 3/9 |
Results presented as dissipative/conservative, where these notions are defined in Section 5.3. p-Values computed for the trinomial distribution discussed before. The first p-value tests significance of the implication: if a residue is exotic prefusion, then it is at most one residue away from an active residue, and for the second p-value, all conservative results are discarded. Vesicular stomatitis is exceptional because its glycoprotein G can oscillate between pre- and postfusion conformations evidently with conserved exotic residues. See Section 5.3. for further detail.
FIG. 7.Histogram of Π-values and flanking primary structure for all exotic BHBs across HQ60. Curves are colored by residue as indicated. Note the increasing frequency of glycine reflecting the presumably progressively contorted exotic features that the primary structure must support.