Literature DB >> 35865489

Identifying and characterising Thrap3, Bclaf1 and Erh interactions using cross-linking mass spectrometry.

Liudmila Shcherbakova1, Mercedes Pardo1, Theodoros Roumeliotis1, Jyoti Choudhary1.   

Abstract

Background: Cross-linking mass spectrometry (XL-MS) is a powerful technology capable of yielding structural insights across the complex cellular protein interaction network. However, up to date most of the studies utilising XL-MS to characterise individual protein complexes' topology have been carried out on over-expressed or recombinant proteins, which might not accurately represent native cellular conditions.
Methods: We performed XL-MS using MS-cleavable crosslinker disuccinimidyl sulfoxide (DSSO) after immunoprecipitation of endogenous BRG/Brahma-associated factors (BAF) complex and co-purifying proteins. Data are available via ProteomeXchange with identifier PXD027611.
Results: Although we did not detect the expected enrichment of crosslinks within the BAF complex, we identified numerous crosslinks between three co-purifying proteins, namely Thrap3, Bclaf1 and Erh. Thrap3 and Bclaf1 are mostly disordered proteins for which no 3D structure is available. The XL data allowed us to map interaction surfaces on these proteins, which overlap with the non-disordered portions of both proteins. The identified XLs are in agreement with homology-modelled structures suggesting that the interaction surfaces are globular. Conclusions: Our data shows that MS-cleavable crosslinker DSSO can be used to characterise in detail the topology and interaction surfaces of endogenous protein complexes without the need for overexpression. We demonstrate that Bclaf1, Erh and Thrap3 interact closely with each other, suggesting they might form a novel complex, hereby referred to as BET complex. This data can be exploited for modelling protein-protein docking to characterise the three-dimensional structure of the complex. Endogenous XL-MS might be challenging due to crosslinker accessibility, protein complex abundance or isolation efficiency, and require further optimisation for some complexes like the BAF complex to detect a substantial number of crosslinks. Copyright:
© 2021 Shcherbakova L et al.

Entities:  

Keywords:  Bclaf1; CLMS cross-linking mass spectrometry; DSSO; Endogenous complexes; Erh; Thrap3; XL-MS

Year:  2021        PMID: 35865489      PMCID: PMC9270653          DOI: 10.12688/wellcomeopenres.17160.1

Source DB:  PubMed          Journal:  Wellcome Open Res        ISSN: 2398-502X


Introduction

Cross-linking mass spectrometry (XL-MS) is a powerful technique that enables the identification of proximal amino acid residues within a single protein as well as residues in close proximity in interacting proteins. Intramolecular crosslinks provide distance constraint parameters that can support homology-based tertiary structure modelling or even guide de novo modelling ( Adikaram ; Kahraman ; Liu ; Orbán-Németh ; O'Reilly & Rappsilber, 2018). Moreover, intermolecular cross-linking information has been used to determine spatial orientation and elucidate the topology of protein complex subunits ( Gaik ; Herzog ; Leitner ; Politis ). In the absence of a full tertiary structure model protein-protein interaction (PPI) surface information derived from XL-MS can potentially serve as a guide to direct mutation or small molecule screens to disrupt specific subunit interactions within a given complex. Most XL-MS studies to date have been performed on over-expressed or recombinant proteins or protein complexes ( O'Reilly & Rappsilber, 2018). More rarely, XL-MS has been applied to study protein interactions in a more complex endogenous setting like cell lysates ( Götze ; Klykov ; Liu ; Yugandhar ), organelles ( Bartolec ; Makepeace ; Ryl ) specific cellular compartments ( Fasci ; Ser ) or purified complexes expressed at endogenous levels ( Makowski ; Spruijt ). Recent advances in XL-MS have introduced MS-cleavable crosslinkers, which generate distinct fragment pairs in MS2 and therefore substantially reduce the complexity of data analysis and improve identification accuracy ( Matzinger & Mechtler, 2021). Disuccinimidyl sulfoxide (DSSO) ( Kao ) is the most extensively used MS-cleavable reagent, and has been applied to the characterisation of protein interactions both in isolated proteins as well as complex samples ( Adams ; Klykov ; Nguyen ; Ser ; Smith ; Wang ). Here, we performed XL-MS on affinity-purified (AP) endogenous BRG/Brahma-associated factors (BAF) complex in native conditions to define the interaction surfaces between complex subunits and with associated protein partners. The strategy was based on immunoprecipitation of Arid1a, considered to be a scaffolding/bridging component of the complex ( Han ; He ; Mashtalir ), and unlike most published XL-MS studies, involved no artificial increase of the target protein complex by overexpression. In addition to BAF purification, we concomitantly achieved a significant co-enrichment of Thrap3, Bclaf1 and Erh proteins that allowed the identification of a substantial number of crosslinks (XLs) between them, implicating this protein cluster as a native protein assembly, that we hereby refer to as BET complex. We report the interaction surfaces between Thrap3, Bclaf1 and Erh, determined through chemical crosslinking mass-spectrometry.

Results

Bclaf1, Erh and Thrap3 interact directly with each other

To investigate interaction surfaces of Arid1a and the BAF complex, we carried out five experiments where immunoprecipitation of endogenous Arid1a from mouse embryonic stem cells (mESCs) was coupled to crosslinking using MS cleavable crosslinker DSSO and MS3 mass spectrometry. Crosslink peptide identification was performed using the full mouse Uniprot database, with crosslink assignment using XlinkX ( Liu ) as a part of Proteome Discoverer at a 1% false discovery rate (FDR). Whilst we were able to detect several XLs between BAF subunits and interacting proteins, these were mostly single crosslink spectra matches (CSMs) with exception of two. We also detected a significant number of XLs between Thrap3, Erh and Bclaf1, due to a strong enrichment of these proteins in Arid1a APs. Specifically, 13% of all CSMs identified in the five experiments involved these three proteins. For comparison, 9.6% of total CSMs obtained were assigned to BAF, which, in addition to being the bait in the AP, is also a much larger complex with 27 subunits. Given the richness of XL information associated with these proteins, we used this data to gain insight into the structural topology of this protein cluster. Our data suggests that they interact closely and might represent a stable complex. We hereby refer to it as the Bclaf1-Erh-Thrap3 (BET) complex. Bclaf1 and Thrap3 are highly homologous proteins ( Figure 1). They share the Thrap3-Bclaf1 domain, which defines the THRAP3/BCLAF1 family, that contains these two proteins. There is no available crystal or cryo-electron microscopy structure for Thrap3 or Bclaf1. Interestingly, domain analysis with Pfam database (v33.1) ( Mistry ) showed that Thrap3 and Bclaf1 are both highly disordered proteins along the whole length ( Figure 2A), which limits the application of structure prediction modelling on them without additional information. Hence our XL-MS data could add useful information for structural elucidation of the complex.
Figure 1.

Protein sequence alignment of mouse Thrap3 and mouse Bclaf1.

Proteins showed ~43% homology. The alignment was performed using Clustal Omega sequence alignment tool ( Madeira ) within the Jalview v2.11.1.4 ( Waterhouse ).

Figure 2.

Structural features of BET complex members and identified cross-linked sites.

( A) Annotations of domains and disorder regions of Thrap3, Bclaf1 and Erh as reported in Pfam database ( Mistry ). The illustration was made using DOG 2.0 ( Ren ). ( B) A two dimensional visualisation of Thrap3, Bclaf1 and Erh depicting the high confidence cross-links using xiVIEW ( Graham ).

Protein sequence alignment of mouse Thrap3 and mouse Bclaf1.

Proteins showed ~43% homology. The alignment was performed using Clustal Omega sequence alignment tool ( Madeira ) within the Jalview v2.11.1.4 ( Waterhouse ).

Structural features of BET complex members and identified cross-linked sites.

( A) Annotations of domains and disorder regions of Thrap3, Bclaf1 and Erh as reported in Pfam database ( Mistry ). The illustration was made using DOG 2.0 ( Ren ). ( B) A two dimensional visualisation of Thrap3, Bclaf1 and Erh depicting the high confidence cross-links using xiVIEW ( Graham ). We summarised confident crosslinks and corresponding number of CSMs for each of five experiments in Table 1. Due to the high sequence homology between Thrap3 and Bclaf1 ( Figure 1), the crosslinks were manually checked for unambiguous assignment between the two proteins and undistinguishable pairs were removed from analysis. We detected 24 unique XL sites and a total of 121 CSMs across all five experiments. The high density of crosslinks in the regions from 496-537 amino acids (aa) for Thrap3 and 578-635aa for Bclaf1 ( Figure 2B) indicates the interaction surface between the two proteins. This region of Thrap3 also contains most of XLs to-self, suggesting that it is a potentially globular area and is used for PPI. On the contrary, no intra-links were retained for Bclaf1 upon high confidence filtering ( Figure 2B). Even though they share ~43% homology, the middle parts of these proteins are quite different between approximately 400aa and 560aa, with fragments missing in one protein or the other ( Figure 1). The Bclaf1 and Thrap3 interaction interface also contains their cross-linked sites to Erh, which extends slightly further (up to 481-708aa in Thrap3 and 534-620aa in Bclaf1). Moreover, the region of dense crosslinks in Bclaf1 lies outside of the disordered region as reported in Pfam database ( Mistry ) (561-629aa; Figure 2A), suggesting that the Bclaf1 interaction surface, like that of Thrap3, may also be a globular structure. Our data demonstrates that Bclaf1, Erh and Thrap3 interaction is direct and suggests that they might form a ternary complex.
Table 1.

BET confident crosslinks within and outside the complex.

This table reports the confident crosslinks, which are defined as being detected in more than one experiment, and its corresponding CSMs number for each of four experiments. XL-pairs ID indicate peptide that contains crosslink to the unique site pair. For each unique crosslinked peptide pair we report a crosslinked spectra match (CSMs) count for each individual experiment.

XL- pair_IDCrosslink TypeSequence AAccession AProtein Descriptions APosition ASequence BAccession BProtein Descriptions BPosition BExp.1 #CSMsExp.2 #CSMsExp.3 #CSMsExp.4 #CSMsExp.5 #CSMs
1InterEESTSGFD[K]SRQ569Z6Thrap3808LLIY[K]VSNRP01631Ig kappa chain V-II region 26–105510450
2InterY[K]DDPVDLRQ569Z6Thrap3708QAQQAG[K]P84089Erh10400110
3InterGNEQEAA[K]NKKQ569Z6Thrap3665E[K]IYVLLRP84089Erh9000110
4InterFTSYQ[K]ATEEHSTRQ8K019Bclaf1635SR[K]EERQ569Z6Thrap349620300
5InterFTSYQ[K]ATEEHSTRQ8K019Bclaf1635[K]TSESRQ569Z6Thrap353710201
6InterEQYF[K]SPAVTLNERQ8K019Bclaf1620QAQQAG[K]P84089Erh10401013
7InterEQYF[K]SPAVTLNERQ8K019Bclaf1620[K]TSESRQ569Z6Thrap353710203
8InterSIFDHI[K]LPQANKQ8K019Bclaf1591[K]TSESRQ569Z6Thrap353700110
9InterLLASTLVHSV[K]KQ8K019Bclaf1578[K]TSESRQ569Z6Thrap353700110
10IntraGDFSSG[K]SSFSITRQ569Z6Thrap3555[K]TSESRQ569Z6Thrap353700410
11InterI[K]MIASDSHRPEVKQ8K019Bclaf1534QAQQAG[K]P84089Erh10410100
12IntraI[K]MIASDSHRPEVKQ8K019Bclaf1534MIASDSHRPEV[K]LKQ8K019Bclaf1a54600006
13IntraVTAY[K]AVQEKQ569Z6Thrap3524GFVPE[K]NFRQ569Z6Thrap351600110
14IntraVTAY[K]AVQEKQ569Z6Thrap3524[K]TSESRQ569Z6Thrap353710400
15InterGFVPE[K]NFRQ569Z6Thrap3516RQAQQAG[K]P84089Erh10420403
InterGFVPE[K]NFRQ569Z6Thrap3516QAQQAG[K]P84089Erh10400050
16IntraGFVPE[K]NFRQ569Z6Thrap3516[K]TSESRQ569Z6Thrap353700120
17Inter[K]AEEMEDEPFTERQ569Z6Thrap3481QAQQAG[K]P84089Erh10401110
Inter[K]AEEMEDEPFTERQ569Z6Thrap3481RQAQQAG[K]P84089Erh10400003
18Intra[K]AEEMEDEPFTERQ569Z6Thrap3481GFVPE[K]NFRQ569Z6Thrap351600003
19IntraE[K]IYVLLRRP84089Erh90E[K]IYVLLRP84089Erh9000125
20IntraE[K]IYVLLRP84089Erh90RQAQQAG[K]P84089Erh10400221
21InterE[K]IYVLLRP84089Erh90[K]TSESRQ569Z6Thrap353700431
22InterFSGSGSGTDFTL[K]ISRP01631Ig kappa chain V-II region 26–1079EESTSGFD[K]SRQ569Z6Thrap380800344
23InterFSGSGSGTDFTL[K]ISRP01631Ig kappa chain V-II region 26–1079EE[K]DSLQPSAEQ569Z6Thrap394300111
24InterMYEEHL[K]RP84089Erh41DLVHSN[K]KQ569Z6Thrap359910100

BET confident crosslinks within and outside the complex.

This table reports the confident crosslinks, which are defined as being detected in more than one experiment, and its corresponding CSMs number for each of four experiments. XL-pairs ID indicate peptide that contains crosslink to the unique site pair. For each unique crosslinked peptide pair we report a crosslinked spectra match (CSMs) count for each individual experiment. Previous studies identified that both the N-terminus (1-190aa) and the C-terminus (359-951aa) of Thrap3 are important for its function in DNA repair of post double stranded breaks and stalled replication forks, whilst the intervening residues 190-359aa appear to be dispensable for this function ( Vohhodina ). The C-terminal fragment of Thrap3 has been shown to partially rescue the Thrap3 knockout phenotype in terms of ability to respond to ionising radiation (IR) induced DNA damage ( Vohhodina ). Since this Thrap3 C-terminal fragment covers the XL-rich region, the rescue may have been mediated by restoration of interactions with Erh and Bclaf1. Moreover, there is a number of highly frequent phosphorylation sites (>100 reports per site) reported in PhosphoSitePlus (v6.6.0.1) ( Hornbeck ) within the Thrap3 C-terminus in mice (S379, S572, S679, S924).

Mapping BET crosslinks on available structures

Erh is the only component of the BET complex with an available crystal structure ( Arai ; Hazra ; Jin ; Kwon ; Li ; Wan ; Xie ). We detected a XL for Erh that corresponds to two crosslinked peptides with overlapping sequences ( Table 1; XL-pair ID 19). Although XL-MS is not able to distinguish between intra- or inter-links in the case of homo-multimeric proteins, the overlap is indicative of the crosslink involving two molecules of the same protein. In agreement with this data, Erh is known to form a homodimer ( Arai ; Hazra ; Xie ) and therefore we concluded that the self-link at the position 90 involves two Erh molecules. We utilised available Erh homodimer protein structure PDB:1WZ7 ( Arai ) to model this crosslink within a structural context ( Figure 3). The length of the mapped crosslink is in agreement with the DSSO maximum distance constraint of 37Å threshold used in other modelling studies ( Liu ).
Figure 3.

Erh dimer inter-crosslink.

ERH inter-crosslink at position K90-K90 was mapped onto available Erh (Uniprot accession: P84089) structure PDB:1WZ7 ( Arai ). When mapped onto this homodimeric structure it appears to be <37Å.

Erh dimer inter-crosslink.

ERH inter-crosslink at position K90-K90 was mapped onto available Erh (Uniprot accession: P84089) structure PDB:1WZ7 ( Arai ). When mapped onto this homodimeric structure it appears to be <37Å.

BET crosslinks satisfy the model based on sequence homology

Since there is no available crystal or cryo-EM structure for Thrap3 or Bclaf1, we used online modelling tool Robetta for de novo modelling of putative structures ( Raman ; Song ), and five predicted structures were rendered for each protein. In all models Thrap3 appeared to be largely unfolded and flexible ( Figure 4), with roughly the same area stretching from 330-725aa folding into a number of helices that resemble a globular structure. This region overlapped with the crosslink-dense area of Thrap3 ( Figure 5). When mapped onto one of the proposed models (model 1), the majority of crosslinks we detected satisfied the maximum distance threshold of 37Å ( Liu ), hence reinforcing the proposed model.
Figure 4.

Thrap3 sequence based modelling.

Thrap3 (mouse) protein models generated using Robetta protein modelling tool ( Raman ; Song ) based solely on the protein sequence.

Figure 5.

Thrap3 crosslinks satisfy the de novo homology-based model.

Predicted structure models for Thrap3 model 1 using Robetta protein modelling tool ( Raman ; Song ). Cross-linking data mapped at a maximum distance of 37Å using Chimera ( Pettersen ).

Thrap3 sequence based modelling.

Thrap3 (mouse) protein models generated using Robetta protein modelling tool ( Raman ; Song ) based solely on the protein sequence.

Thrap3 crosslinks satisfy the de novo homology-based model.

Predicted structure models for Thrap3 model 1 using Robetta protein modelling tool ( Raman ; Song ). Cross-linking data mapped at a maximum distance of 37Å using Chimera ( Pettersen ). Surprisingly, despite the homology between Bclaf1 and Thrap3, the predicted structures for Bclaf1 appeared generally much more folded ( Figure 6) than those of Thrap3 and overall looked very different.
Figure 6.

Bclaf1 sequence based modelling.

Bclaf1 (mouse) protein model generated using Robetta protein modelling tool ( Raman ; Song ) based solely on the protein sequence.

Bclaf1 sequence based modelling.

Bclaf1 (mouse) protein model generated using Robetta protein modelling tool ( Raman ; Song ) based solely on the protein sequence.

Discussion

Thrap3 and Bclaf1 have been shown to form part of a high molecular weight complex (the SNARP complex) involved in the regulation of cyclin D1 stability together with SNIP1, SkIP and Pinin ( Bracken ). Interactions between Bclaf1 and Thrap3 with Erh have been identified through immunoprecipitation of Erh ( Kavanaugh ), but these interactions were not further validated. Our XL-MS data confirms that Bclaf1, Thrap3 and Erh are in close proximity to each other and provides evidence that they can interact directly. Furthermore, the experimental conditions of cell lysis (high salt buffer) and the number of XLs detected suggest that they form a tight complex. Bclaf1, Erh and Thrap3 are functionally related and are all involved in splicing regulation. Thrap3 and Bclaf1 have also been shown to play an important role in DNA damage repair (DDR) through regulation of splicing of ataxia telangiectasia mutated (ATM) serine/threonine kinase and export of its mRNA, as well as regulation of other transcripts involved in DDR ( Vohhodina ), while Erh has been reported to regulate splicing of ataxia telangiectasia and Rad3-related ( ATR) protein ( Kavanaugh ). Thrap3 and Bclaf1 are able to compensate for the loss of each other ( Vohhodina ) indicating partial functional redundancy. There is a discrepancy in the literature as to whether any components of BET localise at the DNA damage (DD) sites with some data suggesting that Thrap3 and Bclaf1a do not localise ( Beli ), and neither does Erh ( Kavanaugh ). In fact, Thrap3 is suggested to be excluded from double strand breaks ( Beli ). However, another study showed that Bclaf1 interacts and co-localises with H2AX after IR in damage foci ( Lee ). Bclaf1 has also been reported to form a complex with phosphorylated BRCA1, downstream of its function as ATM splicing regulator, promoting splicing of DDR proteins such as ATRIP, BACH1 and EXO1 ( Savage ). The diverse function of Bclaf1 can be partially explained by itself being a subject of splicing regulation, with one specific splicing isoform implicated in regulation of tumour growth ( Zhou ). Crosslinks can not only be used to refine low or medium-resolution structures, but also aid the generation of protein models from their amino acid sequences ( Orbán-Németh ; Liu ). The crosslinking data presented here could be incorporated as C α-C α distance restraints using I-TASSER ( Roy ; Yang ; Zhang, 2008) to refine the preliminary protein models we generated and improve their reliability. If satisfactory model structures are obtained, XL data can again be exploited through the HADDOCK platform for inter-molecular docking of the BET complex subunits ( van Zundert ). However, due to lack of confident intramolecular XLs for Bclaf1 or available resolved structure of its homologues, its modelling may not be accurate enough to perform this task successfully. In summary, we have performed XL-MS on endogenous protein complexes to derive useful topological information. We have shown that Thrap3, Bclaf1 and Erh interact directly with each other to form a tight protein assembly and we have identified their interaction interfaces though XL-MS. Abnormal splicing events are often observed in cancer cells and have been involved in many types of cancer. Hence, characterisation of the interactions between these proteins will be useful for better understanding their role in oncogenesis

Methods

Cell culture

Mouse embryonic stem cells (mESCs) were cultured by StemCell Technologies Inc. (Cambridge, UK).

Immunoprecipitation and crosslinking

Protein G-Dynabeads (#10004D, Invitrogen) were prepared by coupling to antibodies against specific target protein in Table 2 for 15 minutes at room temperature (RT). Nuclear extraction was performed with isotonic buffer containing 10mM Tris-HCl, 10mM NaCl, 1.5mM MgCl2, 0.34M Sucrose, 1mM dithiothreitol (DTT), Halt Protease inhibitors (Thermo Scientific) and 0.05% NP-40. Nuclei were lysed for 10 minutes on ice with lysis buffer composed of 50 mM TrisHCl pH 8.0, 450 mM NaCl, 1 mM EDTA, 1 mM DTT, Halt Protease inhibitors (Thermo Scientific). NaCl was diluted to 150mM for immunoprecipitation (IP) for experiments 1-3 and 5, but kept at 450mM for experiment 4. Protein concentration was measured by Quick Start™ Bradford 1x Dye Reagent (#5000205, Biorad) following the manufactures guidelines. IP was performed for 1-2 hours at 4°C. Crosslinking reagent disuccinimidyl sulfoxide (DSSO) (#A33545, ThermoFisher) was dissolved in DMSO at 50mM. The crosslinking was performed at 1mM on the IP sample while still coupled to the beads, with DMSO final concertation being 2% at RT for 1 hour. The reaction was quenched with 125mM Tris-HCl pH 8.0 at RT for 15 minutes. The beads were washed three times with IPP150. For samples that were destined for MS analysis, the beads were washed and digested as previously described in Hillier with a modified digestion schedule: Lys C digestion overnight, followed by Trypsin for approximately 4 hours, two times and a Trypsin overnight digestion.
Table 2.

The antibodies used in immunoprecipitation (IP).

TargetNameCatalogue numberRRIDTypeSpecies in which the antibody was raisedCompany
Arid1a PSG3sc-32761AB_673396MonoclonalMouseSanta Cruz Biotechnology
N/A Mouse-IgG12-371AB_145840PolyclonalMouseMillipore

Mass spectrometry

Peptides generated by trypsin digestion were fractionated with the Pierce High pH Reversed-Phase Peptide Fractionation Kit (#84868, ThermoFisher) according to manufacturer instructions, and eight fractions were collected and dried. Liquid chromatography mass spectrometry (LC-MS) analysis was performed on the Dionex UltiMate 3000 UHPLC system coupled with the Orbitrap Lumos Mass Spectrometer (Thermo Scientific). Each peptide fraction was reconstituted in 15 μL 0.1% formic acid and loaded to the Acclaim PepMap 100, 100 μm × 2 cm C18, 5 μm trapping column at 10 μL/min flow rate of 0.1% formic acid loading buffer. The sample was then subjected to a gradient elution on the EASY-Spray C18 capillary column (75 μm × 50 cm, 2 μm) at 50°C. Mobile phase A was 0.1% formic acid and mobile phase B was 80% acetonitrile, 0.1% formic acid. The gradient separation method at flow rate 300 nL/min was as follows: for 90 minutes gradient from 5%-38% B, for 10 minutes up to 95% B, for 5 minutes isocratic at 95% B, re-equilibration to 5% B in 5 minutes, for 10 minutes isocratic at 5% B. Precursors between 375-1,600 m/z and charge equal or higher than +3 were selected with mass resolution of 120,000 in the top speed mode in 5 seconds and were isolated for collision-induced dissociation (CID) fragmentation with quadrupole isolation width 1.6 Th. Collision energy was set at 25%. Fragments with targeted mass difference of 31.9721 (DSSO crosslinker) were further subjected to CID fragmentation at the MS3 level with collision energy 35%, iontrap detection and MS2 isolation window 2 Th. Two precursor groups were selected with both ions in the pair. Targeted MS precursors were dynamically excluded for further isolation and activation for 30 seconds with 10 ppm mass tolerance.

Mass spectrometry and crosslinking data analysis

MS raw data from mESCs IPs was analysed using Proteome Discoverer 2.4 (#OPTON-30945; Thermo Scientific). XLinkX nodes were used to search crosslinked peptides pairs ( Liu ) for tryptic peptides with a minimum peptide length of 5 aa and maximum of 2 miss cleavages with dynamic modifications set to as oxidation of methionine (+15.995 Da) at 1% FDR. Precursor mass tolerance set to 20ppm, FTMS fragment mass tolerance to 30ppm, and ITMS fragment tolerance to 0.5 Da. Sequest was used for general MS2 search for tryptic peptides with a minimum peptide length of 6 aa and maximum of 2 miss cleavages with DSSO modifications followed by Target Decoy PSM Validator set at 1% FDR. Dynamic modifications were set as oxidation of methionine (+15.995 Da), DSSO hydrolysed for lysine (+176.014 Da) and DSSO quenched with Tris/dead end for lysine (+279.078 Da). The searches were performed against full UniProt Mouse database (August 2019) together with cRAP contaminant database. Alternatively to Proteome Discoverer 2.4 with XLinkX and Sequest, open-access software such as MaxQuant ( Cox & Mann, 2008) and pLink ( Chen ) could be used. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE ( Perez-Riverol ) partner repository with the dataset identifier PXD027611 (see underlying data).

Crosslinking mapping, protein modelling and visualisation

Annotations of domains and disorder regions of Thrap3, Bclaf1 and Erh as reported in Pfam database (v33.1) ( Mistry ). The illustration of the domains was made using DOG 2.0 ( Ren ). Two dimensional visualisation of crosslinks was performed using xiVIEW online tool ( Graham ). The crosslinks visualised were filtered so at least one of the proteins from the cross-linked pair is a member of BET and the site of the cross-linked pair was observed in more than one experiment. Structural models of Bclaf1 and Thrap3 were predicted by Robetta online tool with TrRefineRosetta modelling method for Bclaf1, and comparative modelling and ab initio modelling for Thrap3 ( Raman ; Song ). PDB files of the model and the available structures were visualised using UCSF Chimera package ( Pettersen ) and crosslinks were mapped using a Chimera plug-in Xlink Analyzer ( Kosinski ). Chimera is developed by the Resource for Biocomputing, Visualization, and Informatics at the University of California, San Francisco (supported by NIGMS P41-GM103311).

Data availability

Underlying data

The underlying data has been deposited in the ProteomeXchange Consortium via the PRIDE partner repository, accession number PXD027611: https://identifiers.org/pride.project:PXD027611. The manuscript by Shcherbakova et al., describes the use of chemical cross-linking to study the protein interactions of proteins and protein complexes at endogenous levels following affinity purification (IP-XL-MS).  The manuscript is well written and straightforward to follow. The study sets out to investigate the interactions of the BAF complex, by immunoprecipitating Arid1a, a scaffolding protein of the BAF complex.  Unfortunately, the experiment yielded too few cross-links of the BAF complex to enable detailed topological investigation, however, three proteins, Thrap3, Bclaf1, and Erh were identified with numerous cross-links, suggesting that they may form a previously unidentified complex.  In all, five experiments were carried out yielding a total of 121 cross-links.  Of these, 24 CSMs (cross-link spectral matches) were attributed to subunits of this putative complex. The study highlights an important aspect of structural mass spectrometry, namely that chemical cross-linking enables the structural analysis of intrinsically disordered proteins and disordered regions within a protein.  Thrap3 and Bclaf1 are predicted to have extensively disordered regions, which makes it very difficult if not impossible to study the structures by crystallography or cryo-electron microscopy.  While the data here provide limited intra and inter cross-links, they are starting point for more extensive investigation of this putative new complex. Raw data underlying this work has been deposited in the ProteomeXchange Consortium via the PRIDE repository. Minor comments: The authors suggest naming the putative Bclaf1a-Erh-Thrap3 complex, the BET complex.  This is potentially confusing, as there are already the Bromodomain and Extra-Terminal domain (BET) proteins (BRD2, BRD3, BRD4, and BRDT).  Perhaps an alternative nomenclature would be better. Table 1 should additionally show the confidence score attributed to a CSM by the software used for data analysis (XLinkX). The legend also refers to 4 experiments where five are shown. The authors have employed the Robetta protein modelling tool to predict possible structures for these intrinsically disordered proteins.  It would be useful to compare these structures with those predicted by AlphaFold. Searches were performed against the full UniProt Mouse database.  Potentially an alternative approach would have been to characterise the IP obtained using Arid1a and build a more targeted database against which to search the data. The authors specify how protein concentrations were measured, but don’t say how much protein was used for the IP.  Additionally, a value of 1mM is given for the cross-linking reagent, how does this relate to a protein: cross-linker ratio?  Were several concentration ratios investigated? On page 11, the authors mention that alternative software is available for data analysis.  This list is not extensive and should at least include other applications such as Xi, StavroX, etc. The authors get slightly fixated on the distance restraint of 37Å.  This is a lenient value and may not have any particular importance for these disordered proteins. For the cross-linking data analysis, it may prove useful to include serine and threonine in the reaction specificity for DSSO. Is the work clearly and accurately presented and does it cite the current literature? Yes If applicable, is the statistical analysis and its interpretation appropriate? Yes Are all the source data underlying the results available to ensure full reproducibility? Yes Is the study design appropriate and is the work technically sound? Yes Are the conclusions drawn adequately supported by the results? Yes Are sufficient details of methods and analysis provided to allow replication by others? Yes Reviewer Expertise: Proteomics, structural mass spectrometry I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard. In this contribution, Shcherbakova et al. report the characterization of the ternary interaction between mouse Thrap3, Bclaf1, and Erh by chemical cross-linking (XL) and mass spectrometry. The cross-linking data was obtained from an experiment that initially targeted the BAF complex, which did not yield sufficient cross-links for a detailed structural interpretation. Thrap3, Bclaf1, and Erh, however, were found to be connected through a mutually supporting set of cross-links, suggesting that the observed connections specify the binding interfaces in a ternary complex. The manuscript describes data generation from the IP-XL-MS experiment, for which results from multiple, independent experiments were combined. In total, 24 non-redundant peptide pairs involving at least one of the three proteins were identified repeatedly. Thrap3 and Bclaf1 are predicted to have large unstructured/intrinsically disordered regions, which makes it difficult to study their interactions by established structural biology methods. The XL data presented here serve as a starting point for further experiments, including integrative modeling approaches to define the organization of the complex. The role of Thrap3 and Bclaf1 in some splicing events may make the complex relevant for diseases such as cancer. The manuscript is clearly written and experiments are described in sufficient detail to enable replication. XL-MS results have been deposited to the PRIDE repository, facilitating reanalysis or reuse of the data. Minor comments that could be addressed in a revised version: On page 7, the authors discuss the potential ambiguity of XL data when it comes to site pairs on the same protein. While this is true, the case described here - a cross-link that connects partially overlapping sequences - needs to be inter-molecular (or a false positive identification). I would emphasize this more in the text. Several times throughout the manuscript, the authors mention that they used an upper distance of 37 Å to assess "compatibility" of a cross-link with available PDB structures or structure predictions, citing Liu et al., 2020, as a reference. I would like to point out that 37 Å is a quite lenient threshold, and upper distances of 30-32 Å are much more commonly used. Nevertheless, an interaction between largely unstructured proteins may sample a larger conformational space. Somewhat related to this, the authors discuss that intra-molecular links on Thrap3 were compatible with one of the proposed models (page 7). Since more than one model was predicted, how did the links fit to other models? In the methods section (page 10), an "IPP150" buffer is mentioned that needs to be defined. On the same page, "precursor selection ... with a mass resolution of 120,000" should be rephrased. Precursors were detected at a resolution of 120,000 but their selection for fragmentation is only dependent on the isolation width. "miss cleavages" should read "missed cleavages". The legend in Table 1 mentions four experiments, although the results from five are displayed. Is the work clearly and accurately presented and does it cite the current literature? Yes If applicable, is the statistical analysis and its interpretation appropriate? Not applicable Are all the source data underlying the results available to ensure full reproducibility? Yes Is the study design appropriate and is the work technically sound? Yes Are the conclusions drawn adequately supported by the results? Yes Are sufficient details of methods and analysis provided to allow replication by others? Yes Reviewer Expertise: cross-linking mass spectrometry; proteomics; structural biology; protein-protein interactions, protein-RNA interactions I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.
  60 in total

1.  MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification.

Authors:  Jürgen Cox; Matthias Mann
Journal:  Nat Biotechnol       Date:  2008-11-30       Impact factor: 54.908

2.  DOG 1.0: illustrator of protein domain structures.

Authors:  Jian Ren; Longping Wen; Xinjiao Gao; Changjiang Jin; Yu Xue; Xuebiao Yao
Journal:  Cell Res       Date:  2009-02       Impact factor: 25.617

3.  Structure prediction for CASP8 with all-atom refinement using Rosetta.

Authors:  Srivatsan Raman; Robert Vernon; James Thompson; Michael Tyka; Ruslan Sadreyev; Jimin Pei; David Kim; Elizabeth Kellogg; Frank DiMaio; Oliver Lange; Lisa Kinch; Will Sheffler; Bong-Hyun Kim; Rhiju Das; Nick V Grishin; David Baker
Journal:  Proteins       Date:  2009

4.  A Simple Cross-Linking/Mass Spectrometry Workflow for Studying System-wide Protein Interactions.

Authors:  Michael Götze; Claudio Iacobucci; Christian H Ihling; Andrea Sinz
Journal:  Anal Chem       Date:  2019-07-19       Impact factor: 6.986

Review 5.  Cross-linking mass spectrometry: methods and applications in structural, molecular and systems biology.

Authors:  Francis J O'Reilly; Juri Rappsilber
Journal:  Nat Struct Mol Biol       Date:  2018-10-29       Impact factor: 15.369

6.  I-TASSER: a unified platform for automated protein structure and function prediction.

Authors:  Ambrish Roy; Alper Kucukural; Yang Zhang
Journal:  Nat Protoc       Date:  2010-03-25       Impact factor: 13.491

7.  Enhancer of Rudimentary Homolog Affects the Replication Stress Response through Regulation of RNA Processing.

Authors:  Gina Kavanaugh; Runxiang Zhao; Yan Guo; Kareem N Mohni; Gloria Glick; Monica E Lacy; M Shane Hutson; Manuel Ascano; David Cortez
Journal:  Mol Cell Biol       Date:  2015-06-22       Impact factor: 4.272

8.  Modular Organization and Assembly of SWI/SNF Family Chromatin Remodeling Complexes.

Authors:  Nazar Mashtalir; Andrew R D'Avino; Brittany C Michel; Jie Luo; Joshua Pan; Jordan E Otto; Hayley J Zullow; Zachary M McKenzie; Rachel L Kubiak; Roodolph St Pierre; Alfredo M Valencia; Steven J Poynter; Seth H Cassel; Jeffrey A Ranish; Cigall Kadoch
Journal:  Cell       Date:  2018-10-18       Impact factor: 41.582

9.  A conserved dimer interface connects ERH and YTH family proteins to promote gene silencing.

Authors:  Guodong Xie; Tommy V Vo; Gobi Thillainadesan; Sahana Holla; Beibei Zhang; Yiyang Jiang; Mengqi Lv; Zheng Xu; Chongyuan Wang; Vanivilasini Balachandran; Yunyu Shi; Fudong Li; Shiv I S Grewal
Journal:  Nat Commun       Date:  2019-01-16       Impact factor: 14.919

10.  Cross-link guided molecular modeling with ROSETTA.

Authors:  Abdullah Kahraman; Franz Herzog; Alexander Leitner; George Rosenberger; Ruedi Aebersold; Lars Malmström
Journal:  PLoS One       Date:  2013-09-17       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.