| Literature DB >> 29379064 |
Wenjuan Wei1, Nouredine Behloul1, Sarra Baha1, Zhenzhen Liu1, Mehwish Saba Aslam1, Jihong Meng2.
Abstract
Orally-transmitted viruses have evolved in a way to resist the extreme conditions of the host's gastrointestinal environment, especially the proteolysis of their structural proteins. However, the mechanisms allowing these viruses to survive these harsh conditions remain unclear. Hepatitis E virus (HEV) is an orally-transmitted human pathogen. Its capsid protein contains three domains S, P1 and P2. The latter forms a homodimer protruding from the virus shell, making it the most exposed part. By combining biochemical and computational methods, we found the trypsin digestion sites to be highly conserved among the HEV strains. Furthermore, the constructs of the HEV capsid protein that contain an extended P2 domain were digested within the extensions leaving the P2 domain intact. The trypsinization seems to occur in three possible double cleavages at R451-R619, R460-R619 or R460-R631.The dimerization disrupts the trypsin action at three main sites in the P2 domain R542, K544 and K554. These sites are very exposed in the monomeric P2 domain constructs which makes the monomeric forms very susceptible to trypsin action. Therefore, we believe that dimerization is a structural feature that has been selected by the evolutionary forces to render the HEV capsid protein resistant to the host's proteases; an evolutionary feature that could be common to some other (if not all) orally-transmitted viruses.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29379064 PMCID: PMC5788867 DOI: 10.1038/s41598-018-20137-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1(a) Non-reducing SDS-PAGE gel of p179/N562P, p179/N562Y, p146, p179, p216 and p222. (−) unheated, (+) heated. Full-length gel is presented in Supplementary Figure S3. (b) SDS-PAGE gel of the wild-type and mutant p179/N562 after trypsin-proteolysis: (+) treated with trypsin; (−) not treated with trypsin. Lanes 1, 3, 5, 7, 9, 11: heat-denatured before trypsin treatment and before loading onto the gel; Lanes 2, 4, 6, 8, 10, 12: heated-denatured only before loading onto the gel without trypsin treatment. M: protein marker in kilo-Daltons (Bio-Red, USA); Trypsin molecular weight is about 24 kDa. Full-length gel is presented in Supplementary Figure S4. (c) SDS-PAGE gel of HEV ORF2 proteins after trypsin-proteolysis. Full-length gel is presented in Supplementary Figure S5. (d) Immunoreactivity of HEV ORF2 proteins with mAb 1G10 detected by Western blot. (+): treated with trypsin; (−): not treated with trypsin. All samples were boiled only before loading on the gel. Full-length gel is presented in Supplementary Figure S6. M: protein marker in kilo-Daltons (Bio-Red, USA). Trypsin molecular weight is about 24 kDa. The grouping of gels/blots are cropped from different gels and made explicit using dividing lines.
Figure 2Illustration of the best docking poses (1–4) of trypsin (PDB ID: 418 G) and p146 homodimers. The trypsin and the p146 homodimer are shown in yellow and cyan cartoon representation, respectively. The trypsin catalytic triad is represented in blue sticks and the trypsin digestion sites depicted in red.
Figure 3Illustration of the best docking poses of trypsin (PDB ID: 418 G) and p179 (a), p216 (b) and p222 (c) homodimers. The trypsin is shown in yellow cartoon and its catalytic triad blue sticks. p179, p216 and p222 homodimers are shown in cyan surface representation with the trypsin digestion sites depicted in red.
Summary of possible trypsin proteolytic sites and comparison of the predictions with trypsin digestion assay results.
| Proteins | Proteolytic sitesa | Longest resulting peptide (aa) | Molecular weight (kDa) | Result of trypsin digestion assay |
|---|---|---|---|---|
|
|
|
|
|
|
| 554 | 100 | 11.01 | ||
| 179 |
|
|
|
|
| 451 | 172 | 18.62 | probableb | |
| 460 | 163 | 17.73 | probable | |
| 466 | 157 | 17.03 | probable | |
| 512 | 111 | 12.15 | ||
| 524 | 99 | 10.79 | ||
|
|
|
|
|
|
| 423 | 220 | 24.10 | ||
| 437 | 206 | 22.63 | ||
| 451 | 192 | 20.89 | ||
| 619 | 198 | 21.56 | ||
| 423–619c | 196 | 21.32 | ||
| 437–619 | 182 | 19.85 | probable | |
| 451–619 | 168 | 18.10 | probable | |
|
|
|
|
|
|
| 460 | 206 | 22.62 | ||
| 466 | 200 | 21.93 | ||
|
| 148 | 16.41 | ||
| 631 | 193 | 21.17 | ||
| 460–631 | 171 | 18.64 | probable | |
| 466–631 | 165 | 17.94 | probable | |
| 518–631 | 113 | 12.29 |
FL: Full length protein; aAll the proteolytic sites were predicted via protein docking analysis; bThe molecular weight of longest resulting peptide is in agreement with the results of proteolytic digestion assay; cthe protein was digested at two sites at the same time.
The residue accessible surface area (RSA), percent solvent accessibility (PSA) and protrusion index (PI) of trypsin proteolytic sites.
| Sites | p146 | p179 | p216 | p222 | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| RSA (Å2) | PSA (%) | PI | RSA (Å2) | PSA (%) | PI | RSA (Å2) | PSA (%) | PI | RSA (Å2) | PSA (%) | PI | |
|
| — | — | — | — | — | — |
|
|
| — | — | — |
|
| — | — | — | — | — | — |
|
|
| — | — | — |
|
| — | — | — |
|
|
|
|
|
|
|
|
|
| R460 | 109.3 | 47.7 | 0.8 | 142.2 | 62.0 | 0.5 | 43.6 | 19.0 | 0.4 | 130.5 | 56.9 | 0.4 |
|
|
|
|
| 159.4 | 69.5 | 0.2 | 88.7 | 38.7 | 0.1 | 159.5 | 69.5 | 0.2 |
|
|
|
|
|
|
|
|
|
|
| 96.9 | 42.2 | 0.5 |
|
|
|
|
|
|
|
|
|
|
| 146.9 | 75.5 | 0.3 |
| R524 | 111.9 | 48.8 | 0.9 |
|
|
| 118.2 | 51.5 | 0.4 | 44.9 | 19.6 | 0.3 |
| K534 | 44.7 | 23.0 | 0.6 | 61.1 | 31.4 | 0.6 | 95.9 | 49.3 | 0.5 | 46.4 | 23.8 | 0.3 |
| R542 | 21.0 | 9.1 | 0.6 | 77.4 | 33.7 | 0.3 | 50.1 | 21.9 | 0.0 | 41.8 | 18.2 | 0.4 |
| K544 | 19.4 | 10.0 | 0.4 | 0.0 | 0.0 | 0.3 | 1.5 | 0.8 | 0.1 | 25.9 | 13.3 | 0.5 |
| K554 | 89.6 | 46.1 | 0.9 | 23.0 | 11.8 | 0.9 | 5.5 | 2.8 | 0.4 | 89.6 | 46.1 | 0.8 |
| R578 | 76.6 | 33.4 | 0.3 | 91.0 | 39.7 | 0.4 | 90.9 | 39.6 | 0.3 | 18.1 | 7.9 | 0.2 |
|
| — | — | — | — | — | — |
|
|
|
|
|
|
|
| — | — | — | — | — | — | 43.9 | 19.1 | 0.1 |
|
|
|
|
| — | — | — | — | — | — | — | — | — |
|
|
|
|
| — | — | — | — | — | — | — | — | — |
|
|
|
|
|
|
|
| |||||||||
| K656 | — | — | — | — | — | — | — | — | — | 76.9 | 39.5 | 0.6 |
| R658 | — | — | — | — | — | — | — | — | — | 93.6 | 40.8 | 0.7 |
Trypsin cleavage sites with high solvent accessibility are in bold.
Figure 4Illustration of the docking results of p179 monomer and trypsin (PDB ID: 418 G). The trypsin and the monomeric p179 are shown in yellow and cyan cartoon representation, respectively. The trypsin catalytic triad is shown in blue sticks and the proteolytic sites in p179 (K554, K544 and R542) in red sticks.
The difference of percent solvent accessibility (%) of trypsin digestion sites between dimeric and monomeric HEV ORF2 proteins.
| Sites | p179 | p216 | p222 | |||
|---|---|---|---|---|---|---|
| Monomer | Dimer | Monomer | Dimer | Monomer | Dimer | |
| K423 | — | — | 90.783 | 90.783 | — | — |
| R437 | — | — | 98.394 | 99.052 | — | — |
| R451 | 80.214 | 80.214 | 67.897 | 68.116 | 60.118 | 60.118 |
| R460 | 62.004 | 62.004 | 18.996 | 18.996 | 57.106 | 56.886 |
| R466 | 69.502 | 69.502 | 38.658 | 38.658 | 69.311 | 69.523 |
| R512 | 59.636 | 58.978 | 67.519 | 67.307 | 43.937 | 42.238 |
| K518 | 76.155 | 76.155 | 81.21 | 80.967 | 75.522 | 75.522 |
| R524 | 69.927 | 69.708 | 51.539 | 51.539 | 19.562 | 19.562 |
| K534 | 31.437 | 31.437 | 49.311 | 49.311 | 23.842 | 23.842 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| R578 | 39.671 | 39.671 | 39.643 | 39.643 | 8.294 | 7.869 |
| R619 | — | — | 94.439 | 94.439 | 84.613 | 84.394 |
| R631 | — | — | 19.139 | 19.139 | 70.261 | 70.261 |
| R649 | — | — | — | — | 63.832 | 64.052 |
| K651 | — | — | — | — | 71.739 | 71.739 |
| K653 | — | — | — | — | 76.787 | 75.769 |
| K656 | — | — | — | — | 39.522 | 39.522 |
| R658 | — | — | — | — | 40.799 | 40.799 |
List of the primers used for the amplification of the different DNA coding sequences.
| Primers | sequence (5′ to 3′) |
|---|---|
| p146 (aa460–605) | F: TTT |
| R: TTT | |
| p179 (aa439–617) | F:CCC |
| R:CCC | |
| p216 (aa422–637) | F: CCC |
| R: CCC | |
| p222 (aa439–660) | F: CCC |
| R: CCC |