| Literature DB >> 31462935 |
Giuseppe Rigogliuso1, Martin L Biniossek2, John L Goodier3, Bettina Mayer2, Gavin C Pereira3, Oliver Schilling4,5, Eckart Meese1, Jens Mayer1.
Abstract
BACKGROUND: A considerable portion of the human genome derives from retroviruses inherited over millions of years. Human endogenous retroviruses (HERVs) are usually severely mutated, yet some coding-competent HERVs exist. The HERV-K(HML-2) group includes evolutionarily young proviruses that encode typical retroviral proteins. HERV-K(HML-2) has been implicated in various human diseases because transcription is often upregulated and some of its encoded proteins are known to affect cell biology. HERV-K(HML-2) Protease (Pro) has received little attention so far, although it is expressed in some disease contexts and other retroviral proteases are known to process cellular proteins.Entities:
Keywords: Endogenous retrovirus; HERV-K; Pathogenesis; Proteolysis; Retrotransposon; Retroviral protease
Year: 2019 PMID: 31462935 PMCID: PMC6707001 DOI: 10.1186/s13100-019-0178-z
Source DB: PubMed Journal: Mob DNA
Fig. 1Purification of HERV-K(HML-2) Protease. A previously established method for purification of prokaryotically expressed HML-2 Pro was employed with minor modifications (see text). Samples were taken at various steps of the procedure, such as bacterial culture before induction (“pre-ind.”), flow-through (“flow-thr.”) after binding of bacterial lysate to Pepstatin A-agarose, two wash fractions, and 4 elution fractions. Proteins were separated by SDS-PAGE in a 15% PAA-gel and visualized by staining with Coomassie Blue. Molecular mass of marker proteins (M) are indicated on the left. Purified, auto-processed HML-2 Pro migrates at approximately 12 kDa
Fig. 2Optimization of reaction conditions of HERV-K(HML-2) Protease. Purified HML-2 Pro was incubated with a fluorescent anthranilyl-substrate and fluorescence emission was measured for the indicated time periods. Influence of different buffer compositions (top), pH values (middle), and Pepstatin A concentrations (bottom) on HML-2 Pro activity are depicted. Buffer compositions were as follows: Buffer 1: 20 mM PIPES, 100 mM NaCl, 1 mM DTT, 10% [v/v] Glycerol, pH 6.5; Buffer 2: 50 mM MES, 1 M NaCl, 20% [v/v] Glycerol, 1 mM EDTA, pH 5.0; Buffer 3: 50 mM MES, 1 M NaCl, 1 mM EDTA, pH 5.0; Buffer 4: 100 mM MES-TRIS, 1.25 M NaCl, pH 6.0. Effects of pH were measured in a buffer consisting of 100 mM MES, 1 M NaCl. Note the differing glycerol concentrations of buffers 2 and 3 (see the text). Also note that reactions at pH 5.5 and pH 6 depleted the substrate after approximately 110 min due to high HML-2 Pro activity. Effects of Pepstatin A at 200 μM were measured with and without pre-incubation of protease with Pepstatin A
Fig. 3Cleavage sites in human proteins identified as substrates of HERV-K(HML-2) Protease by TAILS. a. Results of filtering of cleavage sites observed by TAILS. Results for two experiments (rep1, rep2) performed at pH 5.5 and pH 7 are each depicted. Various filters were applied, such as greater than 1.5-fold or 2-fold (fc) enrichment for the observed cleavage event compared to controls and particular amino acids in P1 and P1’ (see the paper text). Resulting numbers after applying the various filters are indicated by bars and by specific numbers when including P1 and P1’. b. Venn diagrams depicting overlap of cleavage sites and protein IDs in replicates (rep1, rep2) performed at pH 5.5 and pH 7. The overlap of protein IDs detected in all four experiments is depicted in the Venn diagram at the bottom. c. Numbers of cleavage sites in proteins identified as substrates of HERV-K(HML-2) Pro. Results are summarized for the replicate (rep1, rep2) TAILS experiments at pH 5.5 and pH 7. A single cleavage event was observed for the vast majority of proteins, fewer proteins were cleaved at more than one position, and a relatively small number of proteins were cleaved at up to 60 different positions within the particular protein. See Additional file 1: Table S2, for selected human proteins with multiple cleavage sites
Fig. 4Gene Ontology term-based characteristics of human proteins identified as substrates of HERV-K(HML-2) Protease by TAILS. Selected cellular components (a) and biological processes (b) are depicted. Numbers were compiled using PANTHER (Protein ANalysis THrough Evolutionary Relationships) GO-Slim as provided at http://geneontology.org [51, 52]. Numbers of proteins per category expected by chance are also given. Graph (b) also depicts in the bottom-most bar the overlap of proteins identified by TAILS with cancer-relevant genes as compiled by COSMIC (Catalogue Of Somatic Mutations In Cancer; https://cancer.sanger.ac.uk/cosmic) [53]. See Additional file 1: Table S3 for COSMIC cancer genes
Selected candidate proteins for verification of processing by HERV-K(HML-2) Proteasea
| Gene/Protein | Protein ID | Function |
|
|---|---|---|---|
| C15orf57 (CCDC32; Coiled-Coil Domain Containing 32) | Q9BV29 | unknown | +/+ |
| CALR (Calreticulin) | P27797 | Prevents binding of glucocorticoid receptor to glucocorticoid response element. Important modulator of gene transcription. AD: Myelofibrosis With Myeloid Metaplasia; Somatic and Essential Thrombocythemia | −/n.t. |
| CIAPIN1 (Cytokine Induced Apoptosis Inhibitor 1) | Q6FI81 | Cytokine-induced inhibitor of apoptosis. Anti-apoptotic effects in the cell. Negative control of cell death upon cytokine withdrawal | +/+ |
| DDX3X (DEAD-Box Helicase 3, X-Linked) | O00571 | Transcriptional regulation, mRNP assembly, pre-mRNA splicing, mRNA export, translation, cellular signaling, viral replication. Misregulation implicated in tumorigenesis. Cofactor for XPO1-mediated nuclear export of incompletely spliced HIV-1 Rev. RNAs. AD: Mental Retardation, X-Linked 102; Toriello-Carey Syndrome (Corpus Callosum, Agenesis Of, With Facial Anomalies And Robin Sequence) | i/n.t. |
| ENO1 (Enolase 1) | P06733 | Role in glycolysis, growth control, hypoxia tolerance, allergic responses. Shorter isoform binds to c-myc promoter and functions as a tumor suppressor. Autoantigen in Hashimoto encephalopathy. AD: Cancer-Associated Retinopathy; Non-Herpetic Acute Limbic Encephalitis | i/n.t. |
| HSP90AA1 (Heat Shock Protein 90 Alpha Family Class A Member 1) | P07900 | Proper folding of target proteins, regulation of transcription machinery. AD: Epidermolysis Bullosa Acquisita; Hypersensitivity Vasculitis | +/+ |
| HSP90AB1 (Heat Shock Protein 90 Alpha Family Class B Member 1) | P08238 | Signal transduction, protein folding and degradation; morphological evolution. AD: Bronchitis; Larynx Cancer | +/n.t. |
| MAP2K2 (Mitogen-Activated Protein Kinase Kinase 2) | P36507 | Critical role in mitogen growth factor signal transduction. AD: Cardiofaciocutaneous Syndrome 4 | +/+ |
| PDIA3 (Protein Disulfide Isomerase Family A Member 3) | P30101 | Modulates folding of newly synthesized glycoproteins. Molecular chaperone preventing protein aggregates. AD: Maxillary Sinus Squamous Cell Carcinoma; Anomalous Left Coronary Artery From The Pulmonary Artery | +/n.t. |
| PSMC4 (Proteasome 26S Subunit, ATPase 4) | P43686 | 26S proteasome assembly. AD: Cystic Fibrosis; Myotonia; Parkinson’s disease(?) | −/n.t. |
| RANBP1 (RAN Binding Protein 1) | P43487 | Participates in regulation of cell cycle by controlling transport of proteins and nucleic acids into nucleus. May control progression through the cell cycle by regulating transport of protein and nucleic acids across nuclear membrane. | +/n.t. |
| RNASEH2A (Ribonuclease H2 Subunit A) | O75792 | Component of RNase H2 complex that degrades RNA of RNA:DNA hybrids. Catalytic subunit of RNase H2. Mediates excision of single ribonucleotides from DNA:RNA duplexes. AD: Aicardi-Goutières Syndrome | n.t./+ |
| RNASEH2B (Ribonuclease H2 Subunit B) | Q5TBB1 | Component of RNase H2 complex. Non catalytic subunit of RNase H2. AD: Aicardi-Goutières Syndrome. | +/n.t. |
| S100A4 (S100 Calcium Binding Protein A4) | P26447 | Regulation of cell cycle progression, differentiation. Altered expression implicated in tumor metastasis. AD: Bile Duct Adenocarcinoma; Pancreatic Cancer | −/n.t. |
| STUB1 (STIP1 Homology And U-Box Containing Protein 1) | Q9UNE7 | Ubiquitin ligase/cochaperone. Targets misfolded chaperone substrates towards proteasomal degradation. Binds and ubiquitinates Hspa8 and Polb. Negatively regulates suppressive function of regulatory T-cells during inflammation. Negatively regulates TGF-β signaling. AD: Spinocerebellar ataxia, autosomal recessive 16; Gordon Holmes Syndrome | n.t./+ |
| TUBA1A (Tubulin Alpha 1a) | Q71U36 | Major component of microtubules. AD: Lissencephaly 3; Lissencephaly With Cerebellar Hypoplasia | +/+ |
| TRIM28 (Tripartite Motif Containing 28) | Q13263 | Nuclear corepressor for KRAB domain-containing zinc finger proteins. Mediates gene silencing. Important regulator of CDKN1A/p21. Mediates nuclear localization of KOX1, ZNF268, ZNF300. Required to maintain transcriptionally repressive state of genes in undifferentiated embryonic stem cells. | n.t./+ |
aGene/Protein as given by approved gene/protein symbols and full names. A representative protein ID is given. Selected functional characteristics as well as associated diseases (“AD”) for respective genes/proteins were compiled from information provided by GeneCards [55]. Experimental verifications (“ver.”) of processing of particular proteins by HML-2 Pro in vitro and in vivo is indicated as “+/+”, no processing observed as “–”, inconclusive evidence as “i” (see the text), n.t.: not tested in either in vitro or in vitro experiments
Fig. 5Verification of processing of human proteins by HERV-K(HML-2) Protease in vitro. Human candidate proteins were expressed in vitro using a coupled transcription/translation system. a. Results from protease incubations of various candidate proteins labeled with either 35S-methionine or a C-terminal HA-tag (“35S” and “HA”) are shown. Experiments included for each candidate protein a reaction without protease (“C”), one with protease (“+”), and one with protease and Pepstatin A (“+/P”). Reaction products were separated by SDS-PAGE in 10% PAA-gels and processed for phosphorimager analysis or HA-tag-specific Western blots depending on the label. Processing of full-length candidate proteins (indicated by an arrow) was evidenced by additional protein bands smaller than the respective full-length candidate protein (arrowheads) and/or a decrease in the amount of full-length candidate protein (see the Results section). One example of a candidate protein (PSMC4) without evidence of processing by HML-2 Pro is shown. b. Graphical depiction of candidate proteins confirmed to be processed by HML-2 Pro. The number of amino acids and corresponding molecular mass in kDa is indicated by scales at the top and by the line length for each protein. Positions of methionines and cleavage sites (grey and black arrowheads, respectively), as identified by TAILS in either one of the two replicate experiments at pH 5.5 (see the text), are indicated for each protein. Dashed lines indicate molecular masses of processing products observed experimentally for either 35S-methionine (“35S”)- or HA-tag (“HA”)-labeled candidate proteins. Note that the latter label will only detect C-terminal processing products. Processing products were not indicated for the two HSP90A proteins because observed products were difficult to assign due to too many observed cleavage sites. Processing of PDIA3 protein was supported by reduction of the amount of full-length protein, though no smaller processing products could be observed. Note that C15orf57 migrated slower in gel electrophoresis than predicted by molecular mass. See Additional file 2: Figure S2 for additional evidence of processing of candidate proteins by HML-2 Pro
Fig. 6Verification of processing of human proteins by HERV-K(HML-2) Protease in vivo. Human candidate proteins and HML-2 Pro were co-expressed in HeLa cells in vivo and detected by Western blot using antibodies as indicated. For each blot, the leftmost lane is a control co-transfected with a plasmid encoding a candidate protein and either a GFP-encoding plasmid or empty phCMV, pcDNA6 myc/his B, or pcDNA5 FRT/TO vector, depending on GFP-Pro or (sole) Pro co-expressed in the experiment (see below). Candidate protein co-expressed with wild-type Pro (pro-wt) and mutant Pro (pro-mut) were loaded in lanes 2 and 3 each. Pro was expessed as either (sole) Pro or EGFP-Pro. Blots were probed with α-HA, α-GFP, α-Pro, or an α-HSP90 antibody as indicated. Full-length candidate protein and processing products are indicated by arrows and arrowheads, respectively (see below). A Representative results from control experiments co-expressing HSP90AA1 with either HML-2 Pro or EGFP-Pro. Relevant blot regions are shown. When expressing pro-wt and pro-mut, HML-2 Pro can be detected as approximately 18 kDa and 19 kDa protein bands representing self-processed and unprocessed products, respectively, Pro (a, bottom blot). When HML-2 Pro is expressed as EGFP-Pro-wt or EGFP-Pro-mut fusion protein, proteins of approximately 30 kDa and 47 kDa, representing processed and unprocessed EGFP(−Pro) can be detected with an α-GFP antibody (b, middle blot). Unprocessed EGFP-Pro(−mut) and self-processed Pro of approximately 50 kDa and 18 kDa, respectively, can be detected when using an α-Pro antibody (b, bottom blot; c). B. Selected Western blot results from co-expression of candidate proteins and HML-2 Pro. Candidate proteins were tagged with N- or C-terminal epitopes and detected with respective epitope-specific antibodies as indicated. Note the more or less complete reduction of amounts of full-length candidate protein (arrows), and sometimes processing products (arrow heads), in lanes with co-expressed HML-2 Pro. Note in panel Aa and Ab that the same processing product was detected for HSP90AA1 in vitro and in vivo (the HSP90AA1 in vitro result is shown again in Ad for the sake of convenience). Also compare in vitro and in vivo results for C15orf57 and MAP2K2 as additional examples of similar sized processing products. Molecular masses of co-migrating marker proteins are indicated. Note that the α-Pro Western blot result shown for CIAPIN1-HA is extracted from the Western blot shown in Ac. See Additional file 2: Figure S3 for loading controls as well as more examples of proteins processed by HML-2 Pro in vivo. C. Graphical depictions of candidate proteins and predictions of processing products as observed when co-expressing candidate proteins and HML-2 Pro-wt in vivo. Numbers of amino acids and corresponding molecular mass (kDa) are indicated by scales at the top and by the length of lines for each protein. Positions of cleavage sites, as identified by TAILS experiments at pH 5.5 and pH 7, are indicated by triangles for each protein. Dashed lines indicate molecular masses of processing products and take into account whether the candidate protein was expressed with an N-terminal or a C-terminal epitope tag. Note the overlap between predictions and molecular masses of processing products observed in vivo
Fig. 7Multiple alignment of amino acid sequences of Proteases potentially encoded by HERV-K(HML-2) loci. Because HML-2 Pro is translated via a ribosomal frameshift from the Gag ORF only HML-2 Pro sequences that also harbor a full-length Gag ORF are included. Note that other HML-2 loci may also encode protease in the case of translation bypassing Gag-Pro frameshifts. The HML-2 Pro ORF also encodes an upstream dUTPase. The C-terminal “last” dUTPase motif is included in the multiple alignment. Also indicated are a previously reported N-terminal auto-processing site for HML-2 Pro [44], and DTG, FLAP and GRDLL motifs conserved in retroviral aspartyl proteases. Note the early stop codons in two sequences that partially or entirely remove the GRDLL region. The HML-2 locus designations used here are a combination of two established naming systems; the first based on the location of HML-2 loci in chromosomal bands [58] and the second based on HUGO Gene Nomenclature Committee (HGNC)-approved designations of transcribed HML-2 loci [59]. HERV-K113 and the three bottom-most sequences are HML-2 sequences not present in the human reference genome [2, 60]. Also note that locus chr3q27.2_ERVK-11 harbors a fused Gag-Pro ORF that extends approximately 700 aa in the N-terminal direction. Locus 7p22.1_ERVK-6 represents the protease sequence used for in vitro and in vivo experiments in this study