Literature DB >> 32609809

RNA aptamer capture of macromolecular complexes for mass spectrometry analysis.

Judhajeet Ray1, Angela Kruse2,3, Abdullah Ozer1, Takuya Kajitani1, Richard Johnson4, Michael MacCoss4, Michelle Heck2,3,5, John T Lis1.   

Abstract

Specific genomic functions are dictated by macromolecular complexes (MCs) containing multiple proteins. Affinity purification of these complexes, often using antibodies, followed by mass spectrometry (MS) has revolutionized our ability to identify the composition of MCs. However, conventional immunoprecipitations suffer from contaminating antibody/serum-derived peptides that limit the sensitivity of detection for low-abundant interacting partners using MS. Here, we present AptA-MS (aptamer affinity-mass spectrometry), a robust strategy primarily using a specific, high-affinity RNA aptamer against Green Fluorescent Protein (GFP) to identify interactors of a GFP-tagged protein of interest by high-resolution MS. Utilizing this approach, we have identified the known molecular chaperones that interact with human Heat Shock Factor 1 (HSF1), and observed an increased association with several proteins upon heat shock, including translation elongation factors and histones. HSF1 is known to be regulated by multiple post-translational modifications (PTMs), and we observe both known and new sites of modifications on HSF1. We show that AptA-MS provides a dramatic target enrichment and detection sensitivity in evolutionarily diverse organisms and allows identification of PTMs without the need for modification-specific enrichments. In combination with the expanding libraries of GFP-tagged cell lines, this strategy offers a general, inexpensive, and high-resolution alternative to conventional approaches for studying MCs.
© The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32609809      PMCID: PMC7470977          DOI: 10.1093/nar/gkaa542

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Characterizing macromolecular complexes (MCs) and their interactions is essential for understanding any biological process at the molecular level. With increased resolution and throughput of mass spectrometry (MS) in the last decade, MS-based proteomic analyses following co-immunoprecipitations have been extensively utilized to identify interacting partners of many proteins (1–3). These assays rely on the availability of antibodies that are well-characterized, highly specific, and high-affinity against the protein of interest (POI) or a peptide tag that allow the binding partners of the (tagged)-target protein to be co-precipitated while the antibody is immobilized on beads/resins. Even for a single antibody, significant lot-to-lot variability affects the purity, specificity and yield of (co-) immunoprecipitations. The (co-) immunoprecipitated proteins are subsequently eluted by denaturation (i.e. with heat, SDS or combinations) or by on-bead proteolytic digestion and analyzed by MS. However, contaminating peptides derived from the antibody/serum or Protein-A/G are routinely found at an order of magnitude higher abundance than the POI (1,2). This can hinder the identification of interacting partners, particularly if they are rare or sub-stoichiometric. In some cases, eluates are further fractionated by gel electrophoresis and individual protein bands are excised to exclude heavy and light chains of the antibody prior to MS. This method limits the number of proteins that can be identified and prevents analysis of proteins below the limit of detection for a given electrophoresis/protein staining technique (4) while increasing the likelihood of keratin contamination and the length of time needed for sample preparation. To provide an alternative to immunoprecipitations, we have developed an RNA aptamer-based affinity purification method, which we call AptA–MS (aptamer affinity–mass spectrometry), using the highly-specific and high-affinity Green Fluorescent Protein (GFP)-aptamer (5) to co-purify GFP-tagged target proteins and their binding partners for identification by MS. Nucleic acid aptamers can be selected against a wide variety of targets and synthesized in unlimited quantities by cost-effective methods. These properties, in addition to their high specificity and affinity, make aptamers attractive reagents for affinity purification. Indeed, aptamers have been used for affinity purification of targets from biological mixtures followed by MS, but mainly for target detection and biomarker discovery (6). These detection assays were developed with a handful of aptamers and demonstrated to work by proof-of-principle experiments with little or no biological applications. General and simple affinity-capture methods using RNA aptamers are lacking, especially those that allow for quantitative analysis of protein interactions and protein complex formation directly from cellular lysates and can be applied to address a broad array of biological questions in a wide range of species, tissues, and cell types. We reasoned that the GFP protein in combination with the high-affinity and high-specificity GFP-aptamer (5) would serve as a suitable affinity tool to study protein-protein interactions by MS and allow use with a broad collection of existing GFP-fusion proteins in human cells and other model organisms including Drosophila and yeast. Here, we demonstrate that AptA–MS is superior to conventional co-immunoprecipitations for subsequent MS analysis, because it is devoid of immunoprecipitation-derived protein contaminants, and provides a dramatic enrichment of the POI. Using AptA–MS we have identified several known and novel interactors of human Heat Shock Factor 1 (HSF1) tagged with GFP, some of which showed an increased association following heat shock (HS). In addition, we have identified post-translational modifications (PTMs) of HSF1 and the co-precipitated histones without additional tailored enrichment steps for these modifications. We have also applied AptA–MS with other aptamers (e.g. NELF-aptamer) (7,8) to enrich its target from Drosophila S2 cells. Our results indicate that in addition to purifying transiently transfected HSF1-GFP from human cells, the GFP-aptamer is capable of enriching endogenous GFP-tagged RNA polymerase II (Pol II) from yeast, as well as formaldehyde crosslinked GFP from Drosophila S2 cells, thereby making it a versatile tool for affinity purification of GFP-tagged proteins from various sources.

MATERIALS AND METHODS

Cell culture and transfection

Human HCT116 cells were grown in McCoy's 5A media (with 10% FBS + P/S) at 37°C. Around 4 million HCT116 cells were plated in McCoy's 5A media (with 10% FBS + P/S) 24 h prior to being transfected with pEGFP-N1 or pHSF1-GFPN3 (9) (gift from Stuart Calderwood, Addgene# 32538) plasmid and Fugene HD reagent at 3:1 ratio. It should be noted that in this study for HCT116 and S2 cells, GFP or GFP-fusion protein refers to protein containing enhanced (E)-GFP which has equivalent binding affinity to the GFP-aptamer (5). Transfection efficiency was monitored after 20 h (∼90% efficient as judged by GFP fluorescence) and cells were then subjected to instantaneous HS (10). Cells were scraped after 30 min HS and centrifuged at 500 × g for 5 min at 4°C and washed twice with ice-cold 1× PBS. Drosophila S2 cells were grown in M3+BPYE media (with 10% FBS) at 25°C. S. pombe cells were cultivated using standard procedures (11).

Human cellular lysate preparation & aptamer-based affinity purification

Transfected human HCT116 cells before or after HS were resuspended in 0.5 ml ice-cold cellular lysis buffer (1× PBS + 0.2% NP40 + 1× EDTA-free Protease inhibitor cocktail). Cells were incubated on ice for 30 min followed by sonication with Bioruptor Diagenode at High setting (30 s ON/ 30 s OFF) for 5 min at 4°C. The lysate was centrifuged at 20 000 × g for 10 min at 4°C and the resulting supernatant was transferred to a new tube. The cleared lysate was diluted to a final buffer containing 1× PBS, 0.05% NP40, 5.25 mM MgCl2, 187.5 ng/μl yeast RNA, 187.5 ng/μl sheared salmon sperm DNA, 200 units SUPERase IN/ ml.

RNA preparation and immobilization on beads

The polyadenylated (20 nt ‘A’) GFP or polyadenylated control (NELF)-aptamer was in vitro transcribed using T7 RNA polymerase and purified by phenol/chloroform and polyacrylamide gel extraction. DNA sequences of the GFP and control (NELF)-RNA aptamers are based on sequences used in (8) and are as follows; GFP-aptamer: GGGAGCTTCTGGACTGCGATGGGAGCACGAAACGTCGTGGCGCAATTGGGTGGGGAAAGTCCTTAAAAGAGGGCCACCACAGAAGC, NELF-aptamer: GGGAATGGATCCACATCTACGAATTCCCAACGACTGCCGAGCGAGATTACGCTTGAGCGCCCCACTGAGGATGCCCACGGGCGATTGGGGCACGGCTTCACTGCAGACTTGACGAAGCTT. 200 pmol of polyadenylated GFP or the control (NELF)- aptamer was annealed to equimolar desthiobiotin-oligodT-20 in 200 μl of 1× annealing buffer (10 mM TrisHCl pH 7.5, 50 mM NaCl) by heating at 95°C for 3 min and slow cooling to room temperature over >1 h. For each pull-down 1 mg of Dynabeads MyOne Streptavidin C1 (Thermo) magnetic beads were washed once with 1 ml and twice with 0.1 ml of Tween wash buffer (5 mM TrisHCl pH 7.5, 0.5 mM EDTA, 1 M NaCl, 0.05% Tween-20) by placing on a magnetic separator for 2 min and removing the supernatant. To eliminate possible RNase activity, beads were washed once with 0.1 ml of 0.1 M NaOH, 0.05 M NaCl followed by two washes of 0.1 ml of 0.1 M NaCl with changing tubes in between washes. The beads were resuspended in 200 μl of 2× binding buffer (10 mM TrisHCl pH 7.5, 1mM EDTA, 2 M NaCl) supplemented with 4 units/ml SUPERase IN. The resulting bead slurry was mixed with the annealed RNA aptamer and incubated on a thermomixer for 1 h at 23°C with shaking. Aptamer bound beads were washed twice with 0.4 ml of Bead wash buffer (1× PBS, 0.05% NP40, 5 mM MgCl2), supplemented with 4 units/ml SUPERase IN with changing tubes in between washes.

Binding to lysate, washing and elution

The GFP- or control-aptamer bound beads were resuspended in the diluted cellular lysate and incubated at 4°C with rotation for 2 h. The beads were placed on a magnet and supernatant was removed. The beads were washed twice with 0.5 ml of Bead wash buffer and twice with 0.5 ml of 1× PBS, 5 mM MgCl2, with changing tubes in between washes. Beads were resuspended in 50 μl of fresh Elution buffer (5 mM Biotin, 50 mM ammonium phosphate, pH 7.5) and incubated in a Thermomixer at 37°C with shaking for 1 h followed by collection of the eluate into a fresh tube. Details of the immunoprecipitation method utilizing the GFP-antibody are provided in the Supplementary Information.

Affinity purification from other organisms

Drosophila S2 cells stably expressing GFP, wild-type Drosophila S2 cells and S. pombe cells expressing endogenous GFP-Rpb3 were cultured for lysate or nuclear extract preparation, and were subjected to aptamer-based purification or Tandem affinity purification (TAP)-tag-based purification. Further details of these methods are provided in the Supplementary Information.

Proteomics workflow

Mass spectrometry sample preparation

Eluate from human HCT116 cells was solubilized in 8 M urea in 100 mM ammonium bicarbonate (ABC). The sample was reduced in 10 mM dithiothreitol (DTT) at 37°C for 1 h. Cysteines were blocked in 30 mM methyl methanethiosulfonate (MMTS) at room temperature for 1 h without light. The sample volume was adjusted to reduce the concentration of urea to below 1 M using 100 mM ABC, and proteins were digested using 1 μg of Trypsin at 37°C overnight. Salts, RNA and other contaminants were removed using mixed-mode cation exchange (MCX) columns (Oasis). Elution buffer spiked-in with biotin and analyzed before and after MCX cleanup shows that the biotin signal is drastically reduced after cleanup (Supplementary Figure S1). Samples were dried using a speed vacuum.

Mass spectrometry

All mass spectrometry with samples prepared from human HCT116 cells was performed on a Q-Exactive HF-X (Thermo Fisher Scientific) mass spectrometer with a EasyLC 1200 HPLC and autosampler (Thermo Fisher Scientific). The dried pull-downs were solubilized in 30 μl of loading buffer (0.1% trifluoroacetic acid and 2% acetonitrile in water), and 3 μl was injected via the autosampler onto a 150-μm Kasil fritted trap packed with Reprosil-Pur C18-AQ (3-μm bead diameter, Dr. Maisch) to a bed length of 2 cm at a flow rate of 2 μl/min. After loading and desalting using a total volume of 8 μl of loading buffer, the trap was brought on-line with a pulled fused-silica capillary tip (75-μm i.d.) packed to a length of 25 cm with the same Dr. Maisch beads. The column and trap were mounted to a heated microspray source (CorSolutions) at 50°C. Peptides were eluted off of the column using a gradient of 5–28% acetonitrile in 0.1% formic acid over 25 min, followed by 28–60% acetonitrile over 5 min at a flow rate of 300 nl/min. The mass spectrometer was operated using electrospray ionization (2 kV) with the heated transfer tube at 250°C using data dependent acquisition (DDA), whereby one orbitrap mass spectrum (m/z 400–1600) was acquired with up to 20 orbitrap MS/MS spectra. The resolution for MS in the orbitrap was 60 000 at m/z 200, and 15 000 for MS/MS. The automatic gain control targets for MS was 3e6, and 1e5 for MS/MS. The maximum fill times were 45 and 25 ms, respectively. The MS/MS spectra were acquired using quadrupole isolation with an isolation width of 1.6 m/z and HCD collision energy (NCE) of 28%. The precursor ion threshold intensity was set to 2e6 in order to trigger an MS/MS acquisition. Furthermore, MS/MS acquisitions were allowed for precursor charge states of 2–5. Dynamic exclusion (including all isotope peaks) was set for 10 s. Details of the mass spectrometry method associated with NELF-aptamer pull-down from Drosophila S2 cells are provided in the Supplementary Information.

Data analysis

Raw spectral files were converted to mascot generic format using MSGUI, then searched against a database containing human proteins from UniProt with the addition of the protein sequence for GFP using Mascot. The search parameters allowed for fixed cysteine methylthiolation and variable methionine oxidation modifications, with a 10 ppm peptide mass tolerance, 0.5 Da fragment mass tolerance, and one missed tryptic cleavage. Subsequent searches allowed for variable lysine acetylation and serine/threonine phosphorylation, respectively, each with fixed cysteine methylthiolation and variable methionine oxidation allowing for 20 ppm peptide mass tolerance and 0.5 Da fragment mass tolerance and a maximum of three missed tryptic cleavages. Searches were also submitted with the above parameters and a 0.02 Da fragment mass tolerance, which resulted in no substantial changes to the results. Initial analyses of abundance and enrichment were conducted using Scaffold (12). Prediction and scoring of HSF1 interacting partners was done using Significance analysis of interactome (SAINT), and data was presented using the SAINT score and fold change A values (13). Rather than performing a traditional fold change calculation, SAINT takes into account representation of each protein among biological replicates. As a control dataset to train the algorithm, we used the proteins detected in pull-downs using the GFP-aptamer from GFP cells in addition to pull-downs using the NELF-aptamer from HSF1-GFP cells. This 2-fold control strategy provided proteins that bind non-specifically to RNA and to free GFP. The resulting output was an enrichment calculation (fold change A, the most conservative option) and probability score for interaction with HSF1 (SAINT score). Prediction and assignment of posttranslational modifications was done using Scaffold PTM (14). Site assignments were confirmed using MS1 quantification in Skyline (15). All mass spectrometry proteome data were deposited to the ProteomeXchange Consortium (16) via the PRIDE repository (17) with the dataset identifier PXD015620.

Gene ontology (GO) analysis

The Protein Analysis Through Evolutionary Relationships (PANTHER) classification system (18) was used to determine the GO classification of enriched proteins. UniProt IDs from proteins statistically enriched (Fishers exact test P < 0.05) in HSF1-GFP cells compared to GFP cells with the GFP- or NELF-aptamer were used as query for the PANTHER 15.0 Gene List Analysis tool. Functional classification using the Homo sapiens reference database was performed using this tool. The percent of genes from the query matching to a specific function against the total number of queried genes with function matches was plotted using ggplot2 (19).

RESULTS

Basic strategy

The GFP-aptamer (5) with polyA-tail is immobilized on streptavidin magnetic Dynabeads via hybridization with desthiobiotin-oligodT and is incubated with a cellular lysate expressing GFP-tagged POI. After gently washing the beads, the aptamer-bound proteins can be specifically eluted with excess biotin, which has much higher affinity for streptavidin and readily competes off desthiobiotin. Eluted proteins are then analyzed by MS, leaving non-specifically bound proteins on beads (Figure 1). The MS spectra are bioinformatically analyzed to determine enrichment of the POI relative to controls, protein interactions, and PTM site assignment.
Figure 1.

AptA–MS workflow. The polyadenylated GFP-aptamer is annealed to desthiobiotin (dB) labeled oligo dT and immobilized on streptavidin (SA) coated magnetic Dynabeads. Cellular lysate containing Protein of interest (POI)-GFP is incubated with the immobilized aptamer beads that are washed and finally eluted with biotin. Eluate is subjected to MS and the data is processed through a pipeline for protein identification followed by enrichment analysis, interaction score quantification and PTM analysis. Figure partially created with BioRender.com using the GFP structure (PDB ID: 4KW4, Barnard, T.J., Yu, X., Noinaj, N., Taraska, J.W. 2014, Crystal Structure of Green Fluorescent Protein doi: 10.2210/pdb4KW4/pdb).

AptA–MS workflow. The polyadenylated GFP-aptamer is annealed to desthiobiotin (dB) labeled oligo dT and immobilized on streptavidin (SA) coated magnetic Dynabeads. Cellular lysate containing Protein of interest (POI)-GFP is incubated with the immobilized aptamer beads that are washed and finally eluted with biotin. Eluate is subjected to MS and the data is processed through a pipeline for protein identification followed by enrichment analysis, interaction score quantification and PTM analysis. Figure partially created with BioRender.com using the GFP structure (PDB ID: 4KW4, Barnard, T.J., Yu, X., Noinaj, N., Taraska, J.W. 2014, Crystal Structure of Green Fluorescent Protein doi: 10.2210/pdb4KW4/pdb).

GFP-aptamer provides a clean method of enriching GFP-tagged proteins from cellular lysates

We tested our AptA–MS method by purifying HSF1 and its interacting partners from HCT116 cells transiently expressing HSF1 fused to GFP. HSF1 is a major regulator of stress-induced transcription that binds to hundreds of Heat Shock Elements (HSEs) genome-wide upon activation (20). In non-heat shock (NHS) conditions, endogenous HSF1 is predominantly in an inactive monomeric state but converts to an active trimeric DNA-binding state upon HS (21). HSF1 is also regulated by chaperones and PTMs, and the identification of these binding partners and PTMs is critical to our understanding of HSF1’s function (22,23). We transfected HCT116 cells with plasmid vectors expressing GFP or HSF1-GFP under the same promoter, and subjected them to a 30 min heat shock at 42°C (HS) or kept them at 37°C (NHS) (Figure 2A). Total cellular lysates prepared from transfected cells were confirmed to be expressing similar levels of GFP and HSF1-GFP proteins by GFP western blot (Figure 2B). Lysates were used for protein purifications with GFP- or control-aptamer immobilized on magnetic beads. The GFP-, but not the control-aptamer, was able to purify the targeted GFP and HSF1-GFP proteins that appeared as the most abundant bands observed by a silver-stained gel (Figure 2C). In addition, fluorescent signal from the eluates indicated that GFP remained intact and functional (Figure 2C, bottom panel). The additional bands detected on the silver-stained gel indicated the co-purification of interacting proteins. In comparison, an affinity purification of HSF1-GFP utilizing the GFP antibody revealed IgG heavy chain as the most abundant protein in the eluate as detected by a silver-stained gel (Supplementary Figure S2a).
Figure 2.

Enrichment of GFP-tagged proteins by AptA–MS. (A) Schematic representation of the experimental design. (B) Cellular lysates prepared from HCT116 cells transfected with GFP or HSF1-GFP expressing plasmids were analyzed by anti-GFP (green) and anti-Actin (red, loading control) western blot. GFP (Abcam, ab290) and Actin (Sigma, MAB1501) antibodies were used at 1:2000 and 1:5000 dilutions, respectively. (C) Lysate from cells expressing GFP or HSF1-GFP were precipitated with the GFP- or Control (Ctrl)-aptamer and eluates were analyzed by gel electrophoresis and silver-staining. Bottom panel shows a fluorescence image of the eluates. (D) Enrichment analysis of HSF1 in AptA–MS samples from cells expressing GFP or HSF1-GFP, before or after heat shock pulled-down with the GFP- or the control-aptamer. Plot represents data from five independent biological replicates.

Enrichment of GFP-tagged proteins by AptA–MS. (A) Schematic representation of the experimental design. (B) Cellular lysates prepared from HCT116 cells transfected with GFP or HSF1-GFP expressing plasmids were analyzed by anti-GFP (green) and anti-Actin (red, loading control) western blot. GFP (Abcam, ab290) and Actin (Sigma, MAB1501) antibodies were used at 1:2000 and 1:5000 dilutions, respectively. (C) Lysate from cells expressing GFP or HSF1-GFP were precipitated with the GFP- or Control (Ctrl)-aptamer and eluates were analyzed by gel electrophoresis and silver-staining. Bottom panel shows a fluorescence image of the eluates. (D) Enrichment analysis of HSF1 in AptA–MS samples from cells expressing GFP or HSF1-GFP, before or after heat shock pulled-down with the GFP- or the control-aptamer. Plot represents data from five independent biological replicates. To identify the interactors of the target proteins, the eluates were processed for MS and peptide spectra were analyzed using a bioinformatics pipeline that quantifies protein enrichment and identifies interacting partners as well as PTMs (Figure 1). HSF1 was the most abundant MS-detected protein in both HSF1-GFP pull-downs (NHS and HS) (Supplementary Figure S2b). However, analyzing a conventional immunoprecipitation-MS data (1) with the same pipeline revealed Protein A and IgG as the most abundant proteins identified (Supplementary Figure S2b). Non-specifically binding proteins can pose a serious problem for any affinity-purification method. To reduce the presence of these background binders we have used competitors like yeast RNA and sheared salmon sperm DNA during our experiment. In addition, the control pull-downs from the cellular lysates provide information about the non-specific binders. Using the spectral counts from all five independent biological replicates, we determined that HSF1 spectral counts from HSF1-GFP expressing cells were >300-fold enriched in GFP-aptamer pull-downs relative to the GFP-expressing cells, in which HSF1 was generally below the limit of detection (Figure 2D, Supplementary Table S1). The full list of identified proteins can be found in Supplementary Table S1.

AptA–MS specifically identifies HSF1 interacting partners and PTMs

We identified 28 proteins that are predicted to interact with HSF1 with high confidence based on SAINT score (13), many of which are specific to the HS condition (Figure 3A, Supplementary Table S2). Most of these interactors are molecular chaperones that are expected to associate with HSF1 (24,25). We have also purified dHSF via the TAP-tag method from S2 cells, where enriched eluates were gel fractionated and individual protein bands were gel excised prior to MS. This independent assay verifies that some of these chaperone interactors are common in metazoans (Supplementary Figure S3). TAP-tagged dHSF was expressed near endogenous levels and still found to be associated with chaperones like dHsc70-3 (ortholog of human BiP) and dHsc70-4 (ortholog of human HSPA8) in NHS condition, indicating that HSF1–chaperone interactions detected in HCT116 cells by AptA–MS are not simply due to HSF1-GFP overexpression. HSF1 is bound by chaperones in normal conditions that prevent its transcriptional activity and is converted to an active state upon stress as the chaperones are released to bind to unfolded protein targets (26). Our data identifies primarily the HSP70 family of proteins to be interacting with HSF1 with high confidence relative to HSP90, supporting previous findings that report the lack of HSP90 interaction in human cells (27), although this interaction has been reported in other organisms (28). In summary, the general overlap of AptA–MS identified candidates with the previously known HSF1 interacting partners validated the capability of our method to detect protein-protein interactions with high confidence.
Figure 3.

Interaction and PTM analysis of HSF1. (A) SAINT analysis of proteins pulled-down by HSF1-GFP AptA–MS before or after HS. Dotted horizontal line represents the SAINT score cutoff (0.65). Labeled proteins above the cutoff are called as HSF1 interactors. Fold Change represents the algorithmically calculated fold change A value which takes into account representation among biological replicates. (B) Post-translational modifications (PTMs) on HSF1 residues identified in AptA–MS. Red and blue represent phosphorylation and acetylation, respectively. Asterisk denotes newly identified modification.

Interaction and PTM analysis of HSF1. (A) SAINT analysis of proteins pulled-down by HSF1-GFP AptA–MS before or after HS. Dotted horizontal line represents the SAINT score cutoff (0.65). Labeled proteins above the cutoff are called as HSF1 interactors. Fold Change represents the algorithmically calculated fold change A value which takes into account representation among biological replicates. (B) Post-translational modifications (PTMs) on HSF1 residues identified in AptA–MS. Red and blue represent phosphorylation and acetylation, respectively. Asterisk denotes newly identified modification. AptA–MS revealed new HSF1 interacting proteins not detected by our TAP purification or other published studies (Supplementary Table S3). HSF1 has been previously shown to interact with translation elongation factor eEF1A1 based on an immunoprecipitation assay, which is implicated to have a broad regulatory function in the HS response (29). Interestingly, AptA–MS identified not only eEF1A1, but also eEF1A2 (SAINT scores of 0.69 and 0.66, respectively) as HSF1-interactors. In addition, we have identified novel, high-confidence HSF1 interactions with translation elongation factor HBS1 and thymidylate kinase (a nucleotide biosynthesis enzyme) (SAINT scores of 0.92 and 0.98, respectively) during HS (Figure 3A, Supplementary Table S2a, b). We have also found HSF2 to be associated with HSF1 during NHS as evident from the high SAINT score (0.78). However, we did not detect this high-confidence interaction upon HS, confirming previous reports where HSF1/HSF2 interaction was found to be reduced during HS (30). In addition, histones H4, H2B and H3.1 were found to be enriched in HS samples over NHS samples (Figure 3A), likely reflecting the fact that upon heat stress HSF1 binding to DNA increases and is located near nucleosomes (20). Detecting histones does not necessarily indicate direct interaction with HSF1, rather they could simply be nearby HSF1 and co-precipitated with HSF1-bound DNA. We applied a stringent SAINT score cutoff of 0.65 to call high confidence HSF1 interactors. However, there are many proteins with slightly lower SAINT scores, which may be bona fide interactors and could be confirmed in future experiments (Supplementary Table S2). HSF1 is well-known to possess multiple PTMs in normal and HS conditions that include acetylation, phosphorylation and sumoylation of specific residues (31). Our pull-down strategy allowed us to identify the acetylation and phosphorylation of HSF1 and other co-precipitants without any specific enrichment for these modifications (Figure 3B, Supplementary Table S4). MS1 quantification in Skyline (15) was used to validate the quality of the modified peptides (Supplementary Figure S4a). Scaffold PTM (14) and the MS2 spectra verified the PTM assignments on HSF1 peptides (Supplementary Figure S4b–d). In addition, we were able to identify two acetylated lysine residues, K62 and K162, that were not reported previously (Figure 3B, Supplementary Figure S4c, d). AptA–MS also identified PTMs (acetylated and phosphorylated residues) on HSF1-associated histones in both NHS and HS conditions (Supplementary Figure S5, Supplementary Tables S1 and S4). These PTMs were identified consistently but with relatively low spectral counts, providing an explanation for why they were not identified previously in the presence of abundant interfering signal from contaminating peptides. This association of HSF1 with acetylated histones is consistent with the observation that HSF1 preferentially binds to sites in open chromatin, in particular those that contain acetylated histones (20,32).

Multiple classes of proteins show increased HSF1 interaction in heat shock cells

We identified 32 proteins in NHS cells expressing HSF1-GFP that are enriched compared to GFP expressing cells upon pull-down with the GFP-aptamer based on a Fisher's exact test using a P value cutoff of <0.05. In the same pull-downs from the HS cells, we identified 42 enriched proteins. Earlier interaction studies have not identified histones as co-precipitants in HSF1 immunoprecipitations or affinity purifications, potentially due to large variation in protein abundances in immunoprecipitated samples. We found two histone proteins to be enriched in NHS cells, whereas six histone proteins were enriched in HS cells (Table 1). This illustrates that the chromatin landscape changes in response to HS, and that these changes are associated with HSF1-containing complexes. This fact is further demonstrated by the increase in proteins with ‘binding’ activity in HSF1 pull-downs from HS cells, as shown by GO analysis (Figure 4A). This is in concordance with a dramatic increase in the proportion of DNA binding proteins upon HS (Figure 4B).
Table 1.

Histone proteins with enriched spectral counts (based on Fisher's exact test P < 0.05) in GFP-aptamer pull-downs from cells expressing HSF1-GFP compared to GFP. Asterisk indicates that the fold change was not calculated as the control sample had zero spectral counts.

Histone ProteinAccession P ValueConditionFold change compared to GFP cells
Histone H4UniProtKB:P62805<0.00010Heat ShockN/A*
Histone H3UniProtKB:P684310.047Heat ShockN/A*
Histone H2BUniProtKB:P588760.0016Heat Shock9
Histone H2AUniProtKB:Q8IUE60.047Heat ShockN/A*
Histone H2BUniProtKB:P068990.00063Heat Shock13
Histone H2BUniProtKB:P23527<0.00010Heat ShockN/A*
Histone H4UniProtKB:P628050.0011Non-Heat Shock2.6
Histone H3UniProtKB:P684310.037Non-Heat Shock4
Figure 4.

GO analysis of proteins enriched by AptA–MS. Proteins enriched by the GFP-aptamer from HSF1-GFP expressing cells compared to GFP expressing cells before or after heat shock are primarily classified based on; (A) molecular function, (B) nucleic acid binding property and (C) protein category.

Histone proteins with enriched spectral counts (based on Fisher's exact test P < 0.05) in GFP-aptamer pull-downs from cells expressing HSF1-GFP compared to GFP. Asterisk indicates that the fold change was not calculated as the control sample had zero spectral counts. GO analysis of proteins enriched by AptA–MS. Proteins enriched by the GFP-aptamer from HSF1-GFP expressing cells compared to GFP expressing cells before or after heat shock are primarily classified based on; (A) molecular function, (B) nucleic acid binding property and (C) protein category. AptA–MS recapitulated published observations that HSF1 engages with chaperone proteins such as HSP70. We find that AptA–MS pull-downs contain a large proportion of chaperone proteins in NHS and HS conditions (Figure 3A, Supplementary Table S2a, b). Interestingly, we see a higher proportion of chaperone proteins co-purifying with HSF1 after HS, which suggests that chaperones might have a dynamic role in both maintaining inactive HSF1 during NHS and modulating the level of activation after HS (Figure 4C). Cytoskeletal reorganization and increased levels of transcription of cytoskeletal proteins have been observed after HS in some cell types, but their role and possible interplay with active HSF1 have not been documented (33). We observe upon HS an increase in HSF1-associated proteins with ‘structural molecule activity’ (Figure 4A), which includes cytoskeletal proteins. Indeed, we identify that cytoskeletal proteins are enriched upon GFP-aptamer pulldown in our HSF1-GFP tagged HS cells, indicating that cytoskeletal protein networks may be connected more directly to modulating HSF1 activity than previously thought (Figure 4C). Microtubules are particularly enriched as interactors with HSF1 following HS (Supplementary Table S1).

RNA affinity contaminants experimental repository (RACER) provides a database of non-specifically binding proteins for AptA–MS experiments

Our rigorously controlled study identified proteins that bind to free GFP and to a control (NELF)-aptamer selected for Drosophila NELF-E (dNELF-E) (7,8) with no predicted binding specificity in humans (Supplementary Table S1). We used a Fisher's exact test to statistically compare proteins enriched by the GFP-aptamer in cells expressing free GFP compared to cells expressing HSF1-GFP (Supplementary Table S5). Similarly, we identified proteins that are statistically enriched in pull-downs from HSF1-GFP expressing cells using the control-aptamer, compared to the GFP-aptamer (Supplementary Table S5). These datasets are highly informative as they provide a list of common contaminants for an AptA–MS experiment. The first comparison generates a list of proteins that bind to GFP non-specifically that can be used for future experiments using a GFP-tag. GO analysis of proteins enriched by the NELF-aptamer shows that the majority of these proteins function in binding, primarily to nucleic acids (Supplementary Figure S6a–c). All of these nucleic acid binding proteins are RNA-binding and is not unexpected as this aptamer recognizes the RNA binding domain of dNELF-E (7) (Supplementary Figure S6b). These datasets not only provide a control for GFP pull-downs, but also a control for non-specific RNA-binding proteins, analogous to the common contaminants repository (CRAPome) used for proteomics (34). We have constructed this database, designated it in RACER, and made it available on Mascot for future AptA–MS experiments.

Broad applicability of AptA–MS

To test the broad applicability of AptA–MS, we affinity enriched GFP from Drosophila S2 cells expressing GFP with or without formaldehyde crosslinking (Supplementary Figure S7a, b). The ability of the GFP-aptamer to enrich crosslinked GFP makes it usable for identifying transient interactions. In addition, we have utilized the NELF-aptamer to purify the NELF complex from Drosophila S2 nuclear extracts and observed ∼50-fold enrichment of dNELF-E in NELF-aptamer pull-down relative to the control (GFP)- aptamer pull-down (Supplementary Figure S7c, Supplementary Table S6), thereby making AptA–MS applicable for other aptamers as well. To demonstrate that proteins expressed from their endogenous promoters (and not just overexpressed proteins) can be purified by AptA–MS, we have utilized a S. pombe line expressing endogenously GFP-tagged Rpb3, a subunit of Pol II. Affinity purification with the GFP-aptamer successfully enriched GFP-Rpb3 along with potential interactors and possibly other Pol II subunits from these cells (Supplementary Figure S8). Our results clearly indicate that AptA–MS can be of general use to molecular and cellular biology applications in different model organisms.

DISCUSSION

Conventional affinity purification strategies rely upon the use of antibodies against a specific protein or epitope tag for successful enrichment of the bait protein and identification of its interacting partners (3). Nucleic acid-dependent affinity purification methods provide critical advantages over immunoprecipitations by limiting the amount of contaminating peptides that mask the detection of low-abundant interactors. Although relatively limited, aptamer-based purification strategies have been previously implemented for protein purification from biological sources (35–37). In addition, the Thrombin, IgE and ATP aptamers have been used for target isolation and detection by MS (38–40). In one study, the EGFR and INSR aptamers were utilized to detect some of the interactors of the target proteins by western blot (41). However, aptamers have not been broadly used for exploring biological questions pertaining to protein-protein interactions utilizing MS. Furthermore, the previous aptamer-based analytical assays were restricted to aptamers against specific proteins of a particular species or small molecules, lacking the versatility and high sensitivity that is achievable by AptA–MS, as the latter takes advantage of GFP as an affinity tag. We utilized the GFP-aptamer to develop AptA–MS for the following reasons. First, GFP is a widely-used protein tag that has been applied in cellular imaging for decades. Therefore, an affinity purification approach targeting GFP would potentially serve as a common strategy for purifying hundreds of GFP-tagged proteins thereby minimizing technical variation and background. Second, GFP has no significant sequence similarity in human cells or commonly used model organisms making it less prone to non-specific interactions with other cellular components. Third, GFP has no known propensity for nucleic acids, which provides an explanation for the previous unsuccessful attempts of selecting an aptamer against it (42). Therefore, our high-affinity aptamer against GFP should make it ‘less-sticky’ to other cellular proteins. Although, aptamers are considered to be ‘high-affinity’ reagents, the ligand binding affinities range from picomolar to micromolar dissociation constants (Kd) (43). Many of these aptamers cannot be used for affinity-purification, as we believe that the target-affinity requirement for such application should be at least in the low nanomolar range. In this aspect, the GFP-aptamer stands out due to its strong binding affinity (Kd = 2.4–4.2 nM) (8) and therefore, is suitable to serve the purpose. A good affinity-purification strategy requires reagents that are not only high-affinity, but also highly-specific and with broad utility. The GFP-tag and GFP-aptamer combination satisfies all these criteria and serves as a tool to study protein-protein interactions with high confidence, and is widely applicable in different species, tissues, and cell types. To demonstrate the practical utility of our method, we targeted HSF1, a critical regulator of HS response in mammals. HSF1 mediated activation of gene expression is associated with its binding to HSEs at pre-established transcriptional regulatory elements and release of paused Pol II into productive elongation (20,44,45). In normal conditions, HSF1 is constitutively expressed and remains inactive but upon stress it is converted to a transcriptionally active state, coordinated by PTMs and interactions with other proteins (46). Epitope-tagged HSF1 has been previously expressed in human cells to identify its PTMs and interacting partners upon immunoprecipitation followed by MS (25,47). With AptA–MS we have not only verified some of the strong interactors and PTMs that have previously been detected but also have identified a few novel interacting partners/co-precipitates, clearly demonstrating the potential of this technology. Our differential protein purification analysis shows a major response to HS that is coordinated by HSF1. Like previous studies, we detect chaperone proteins that are HS responsive and interact with HSF1 to induce transcriptional events (25). The functional classes associated with this induced interaction are illustrated via GO analysis showing an increased proportion of chaperone proteins and proteins with ‘binding' activity to be interacting with HSF1 after HS. Furthermore, nucleic acid-binding proteins are more enriched after HS, and these proteins shift from primarily RNA-binding to mostly DNA-binding proteins in HS cells. This may reflect HSF1’s increased DNA binding activity upon HS. Expression of cytoskeletal genes has been shown to be up-regulated early during HS (44), and interestingly we observe an increased enrichment of cytoskeletal proteins copurifying with HSF1 in these conditions. In particular, we demonstrate high-confidence interactions between HSF1 and cytoskeletal proteins including tubulin and keratin only after HS. We speculate that these interactions could be a component of a feedback regulation that keeps the HS response modulated. The chromatin environment of cells is also dramatically changed by stress. Alteration of histone PTM levels have been shown to be associated with HSF1 occupancy on chromatin upon HS (20,48). Histone methyltransferases particularly targeting histone H3 lysine 4 have been shown to contribute to the HS response (25,49). While these histone modifiers may not directly interact with HSF1, we show a HS-induced copurification of histone H3 with HSF1, indicating that HSF1 is binding near transcriptionally active H3-containing DNA. In addition, we detect induced interactions between HSF1 and histones H2B and H4. Our method can likely pick up these low-frequency interactions by virtue of its low background signal, and shows that HSF1 is binding differentially to transcriptionally-active chromatin as previously reported (20). HSF1 has been shown to undergo extensive PTMs during its regulation. HSF1 is ubiquitinated during recovery from HS and when overexpressed in cells, and we find an interaction between HSF1 and a ubiquitin-40S protein, likely reflecting this modification process. Multiple lysines of HSF1 were found to be acetylated even in the NHS condition (25), while acetylation of specific lysine residues was shown to be critical to the HS response. Acetylation of K80 and K118, was shown to be crucial for the release of HSF1 trimers from the HSEs and inhibited chromatin binding of HSF1 (25,50). In addition, acetylation of K208 by EP300 modulates HSF1 function and protein turnover. Using AptA–MS, we have detected acetylation at each of these essential residues in both NHS and HS conditions in addition to K62 and K162, providing an opportunity for further investigation to elucidate their roles in HSF1 regulation. The HS response varies by tissue and cell type (51,52). Not only does AptA–MS provide a robust method to investigate MCs and their interactions, but it also gives a snapshot of the HSF1-mediated response to HS in HCT116 cells. These data complement previous work done in other cell lines and also give a unique insight into a cell-specific process that requires interaction of translation elongation factors and cytoskeletal proteins with HSF1 during HS response. AptA–MS provides a detailed view of HSF1 interactions and modifications before and after HS. In addition, control pull-downs also reveal essential information. Pull-downs using GFP only cells with GFP-aptamer or HSF1-GFP cells with a structured control RNA aptamer (selected against dNELF-E) with no predicted binding partners in HCT116 cells provide a proteomic profile for non-specific interactions between RNA and proteins. The proteins identified from GFP only cells subjected to GFP-aptamer pull-down are not only informative about the non-specific binders of the GFP/GFP-aptamer in human cells, but also in combination with the control RNA aptamer-enriched proteins from HSF1-GFP cells, allow us to calculate an enrichment factor for each identified protein and to identify high-confidence HSF1 interactors using the most stringent criteria (see Materials and methods). GO analysis of proteins enriched by the control aptamer shows that the vast majority of these proteins are devoted to binding nucleic acid. These nucleic acid-binding proteins are RNA-binding proteins. We predict that these proteins are likely to bind RNA non-specifically, and propose that they serve as a resource in RACER, analogous to the common contaminants repository used for antibody-based pull-downs (34). In addition, the non-specific proteins enriched from the GFP only cells by the GFP-aptamer also provide a list of contaminants for AptA–MS studies using GFP-fusions. RACER is publicly available and can be continually updated for use in further AptA–MS experiments. Knock-in cell lines with GFP-tagged proteins generated by CRISPR/Cas9 are proving to be critical for imaging macromolecules in living cells (53). They also can be used directly for analysis by AptA–MS with the GFP aptamer, as this method is capable of purifying GFP and GFP-tagged proteins from various sources. Additionally, large libraries of GFP-tagged proteins are available, and any member of such libraries could also be used directly in AptA–MS, thereby allowing identification of the associated factors along with their PTMs in a single assay (54,55). These findings would complement optical studies of cellular dynamics and co-localization with other proteins in vivo. The GFP-aptamer has been shown to bind to other derivatives of GFP, making it applicable to precipitate proteins tagged with similar fluorescent proteins (5). Aptamers provide many advantages as affinity reagents: they can be selected against toxic proteins, are amenable for chemical modifications, are cost effective to synthesize, and can be made without the use of animals in any molecular biology lab in unlimited quantities. These advantages, in addition to its broad applicability, make AptA–MS a highly sensitive and simple tool that could significantly transform protein–protein interaction studies and provide deeper and more comprehensive insights in understanding the composition of MCs. Click here for additional data file.
  50 in total

Review 1.  Post-translational modification of human heat shock factors and their functions: a recent update by proteomic approach.

Authors:  Yan-Ming Xu; Dong-Yang Huang; Jen-Fu Chiu; Andy T Y Lau
Journal:  J Proteome Res       Date:  2012-04-24       Impact factor: 4.466

2.  Mammalian Heat Shock Response and Mechanisms Underlying Its Genome-wide Transcriptional Regulation.

Authors:  Dig B Mahat; H Hans Salamanca; Fabiana M Duarte; Charles G Danko; John T Lis
Journal:  Mol Cell       Date:  2016-03-24       Impact factor: 17.970

3.  Use of conditioned media is critical for studies of regulation in response to rapid heat shock.

Authors:  Dig B Mahat; John T Lis
Journal:  Cell Stress Chaperones       Date:  2016-11-03       Impact factor: 3.667

4.  Molecular genetic analysis of fission yeast Schizosaccharomyces pombe.

Authors:  S Moreno; A Klar; P Nurse
Journal:  Methods Enzymol       Date:  1991       Impact factor: 1.600

5.  Insights into the polerovirus-plant interactome revealed by coimmunoprecipitation and mass spectrometry.

Authors:  Stacy L DeBlasio; Richard Johnson; Jaclyn Mahoney; Alexander Karasev; Stewart M Gray; Michael J MacCoss; Michelle Cilia
Journal:  Mol Plant Microbe Interact       Date:  2015-04       Impact factor: 4.171

Review 6.  Regulation of heat shock transcription factors and their roles in physiology and disease.

Authors:  Rocio Gomez-Pastor; Eileen T Burchfiel; Dennis J Thiele
Journal:  Nat Rev Mol Cell Biol       Date:  2017-08-30       Impact factor: 94.444

7.  Regulation of molecular chaperone gene transcription involves the serine phosphorylation, 14-3-3 epsilon binding, and cytoplasmic sequestration of heat shock factor 1.

Authors:  XiaoZhe Wang; Nicholas Grammatikakis; Aliki Siganou; Stuart K Calderwood
Journal:  Mol Cell Biol       Date:  2003-09       Impact factor: 4.272

8.  SAINT: probabilistic scoring of affinity purification-mass spectrometry data.

Authors:  Hyungwon Choi; Brett Larsen; Zhen-Yuan Lin; Ashton Breitkreutz; Dattatreya Mellacheruvu; Damian Fermin; Zhaohui S Qin; Mike Tyers; Anne-Claude Gingras; Alexey I Nesvizhskii
Journal:  Nat Methods       Date:  2010-12-05       Impact factor: 28.547

9.  The translation elongation factor eEF1A1 couples transcription to translation during heat shock response.

Authors:  Maria Vera; Bibhusita Pani; Lowri A Griffiths; Christian Muchardt; Catherine M Abbott; Robert H Singer; Evgeny Nudler
Journal:  Elife       Date:  2014-09-16       Impact factor: 8.140

10.  Development of an aptamer-based affinity purification method for vascular endothelial growth factor.

Authors:  Maren Lönne; Svenja Bolten; Antonina Lavrentieva; Frank Stahl; Thomas Scheper; Johanna-Gabriela Walter
Journal:  Biotechnol Rep (Amst)       Date:  2015-08-28
View more
  1 in total

Review 1.  Protein-DNA/RNA Interactions: An Overview of Investigation Methods in the -Omics Era.

Authors:  Flora Cozzolino; Ilaria Iacobucci; Vittoria Monaco; Maria Monti
Journal:  J Proteome Res       Date:  2021-05-07       Impact factor: 4.466

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.