Literature DB >> 22706673

A protease for 'middle-down' proteomics.

Cong Wu1, John C Tran, Leonid Zamdborg, Kenneth R Durbin, Mingxi Li, Dorothy R Ahlf, Bryan P Early, Paul M Thomas, Jonathan V Sweedler, Neil L Kelleher.   

Abstract

We developed a method for restricted enzymatic proteolysis using the outer membrane protease T (OmpT) to produce large peptides (>6.3 kDa on average) for mass spectrometry-based proteomics. Using this approach to analyze prefractionated high-mass HeLa proteins, we identified 3,697 unique peptides from 1,038 proteins. We demonstrated the ability of large OmpT peptides to differentiate closely related protein isoforms and to enable the detection of many post-translational modifications.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22706673      PMCID: PMC3430368          DOI: 10.1038/nmeth.2074

Source DB:  PubMed          Journal:  Nat Methods        ISSN: 1548-7091            Impact factor:   28.547


The “bottom-up” and “top-down” approaches represent two strategies for proteomic studies using mass spectrometry. Bottom-up proteomics relies upon enzymatic protein digestions prior to on-line liquid chromatography-coupled tandem mass spectrometry analysis (LC-MS/MS)[1]. Top-down proteomics omits proteolysis and focuses on complete characterization of intact proteins and their post-translational modifications (PTMs)[2]. While both approaches continue to mature, they each have limitations. The tryptic peptides used in the bottom-up approach are the primary unit of measurement, but their relatively small size (typically ~8–25 residues long) leads to problems such as sample complexity, difficulties in assigning peptides to specific gene products rather than protein groups[4] and loss of single and combinatorial PTM information. The top-down approach handles these issues by characterizing intact proteins, but becomes less successful in the high-mass region. Therefore, a hybrid approach based on 2–20 kDa peptides could marry positive aspects of both bottom-up and top-down proteomics. We previously proposed a generic approach to “middle-down” proteomics to interrogate high-mass proteome with two essential features: a size-dependent protein fractionation technique and a robust but restricted proteolysis method[5] (Fig. 1a). A continuous tube-gel electrophoresis technique can now provide the size-dependent fractionation of a complex proteome[6]. Previous efforts to explore restricted proteolysis options included using alternative enzymes to trypsin (for example, Lys-C[7] and Lys-N[8]) and chemical methods (for example, microwave-assisted acid hydrolysis[9]). Nonetheless, these methods produced peptides only marginally longer than tryptic peptides in large-scale proteomic studies.
Figure 1

OmpT-based platform for middle-down proteomics and characterization of OmpT peptides from digestion of a standard protein. (a) The middle-down workflow was illustrated on proteins from a HeLa cell lysate sorted into narrow size ranges by molecular-weight based pre-fractionation (see silver stained gel, top row). A representative OmpT digestion of a fraction containing 50–75 kDa proteins (highlighted in the red box) was visualized by silver staining (left lane, bottom right) along with the control sample with no digestion (right lane). The digested samples were separated further and fractions below ~15 kDa were subjected to nanoLC-MS/MS analysis. (b) Peptide products from digestion of glyceraldehyde 3-phosphate dehydrogenase (GAPDH, 36 kDa) by OmpT were visualized on a Coomassie stained SDS-PAGE gel. Lane 1, GAPDH incubated without OmpT. Lane 2, GAPDH after OmpT digestion. Major peptide products are numbered from 1 through 4. Arrowhead indicates the intact OmpT enzyme. (c) Alignment of identified OmpT peptides by nanoLC-MS/MS with the original GAPDH sequence on top. Peptide cleavage sites are illustrated and N and C represent the protein N and C termini.

Here we present the protease OmpT to achieve a robust, yet restricted, proteolysis of a complex proteome (with its benefits described in Supplementary Fig. 1). OmpT is known to cleave between two consecutive basic amino acid residues (K/R–K/R) and is reported to have favorable kinetics with its k/K in the 104 – 108 s−1M−1 range. Derived from the Escherichia coli K12 outer membrane, OmpT belongs to the novel omptin protease family[10]. In this study, we developed OmpT into an efficient reagent to generate > 2 kDa peptides for middle-down proteomics. We first performed in silico digestions of the human proteome using various enzymatic or chemical approaches to create histograms of their predicted peptide masses (Supplementary Fig. 2). Most traditional enzymatic approaches generated predominantly small peptides (< 2 kDa). OmpT, which cleaves between less common dibasic sites, produced a distribution with a greater number of peptides > 3 kDa. We overexpressed and refolded OmpT to obtain active enzyme (see Online Methods), then optimized its digestion of four standard protein substrates. Optimal conditions for OmpT digestion were at pH 6.0 in 2–3 M urea at 22°C (Supplementary Fig. 3). We used urea to reduce the higher-order structure present in large protein substrates, an important step for the cleavage efficiency of OmpT[11]. Characterization of digestion products from the 36 kDa GAPDH standard is shown as an example (Fig. 1b–c and Supplementary Fig. 4). In addition to the predicted dibasic cleavages, we also observed a K-A cleavage, corroborating previous reports that OmpT can still cleave with aliphatic amino acids in its P1' position[12], especially under strongly denaturing conditions[13]. Although the GAPDH sequence contains a K-K site (Supplementary Fig. 3g), the cleaved product at this site (peptide 5 in Fig. 1c) was barely observable upon LC-MS/MS analysis (data not shown). This is likely because the flanking amino acids are two aspartic acids whose negative charges may prevent the binding of the nearby K-K site to the negatively charged OmpT active site[14]. We also characterized three other proteins digested with OmpT in detail (Supplementary Fig. 3b,d–f,h). We established an OmpT-based middle-down platform to analyze complex mixtures pre-sorted by protein size (Fig. 1a). Integrating data from the middle-down workflow applied to ~20–100 kDa proteins fractionated from the HeLa cell proteome, we identified 3,697 unique peptides (average size: 6.3 kDa) from 1,038 unique proteins (26% average sequence coverage at an estimated 1% false discovery rate (FDR)[2] (Supplementary Table 1). Two database search modes were used: biomarker and absolute mass (explained further below and in Online Methods). Both the forward and decoy databases for biomarker and absolute mass searches are available online using ProSightPTM 2.0 (http://prosightptm2.northwestern.edu). ProSightPC users can also download these databases using the link (ftp://prosightftp:gsX1gON@prosightpc.northwestern.edu/) and run both search modes locally. Results from an individual LC-MS/MS analysis of fractionated OmpT peptides from the middle-down workflow are provided as an example (Supplementary Fig. 5). We also performed a negative control treatment of substrate proteins in the absence of OmpT, which showed no sample auto-degradation (data not shown). Proteotypic OmpT peptides can allow differentiation of specific protein isoforms. Detailed sequence alignments between protein isoforms revealed high sequence identity, while OmpT peptides, owing to their desirably large size, covered unique regions where isoform sequences differed (Fig. 2a and Supplementary Fig. 6a–b). Long peptides can also prove beneficial for detection and identification of modified peptides. In this study, ~25% of OmpT peptides were identified with PTMs (using annotated modifications from the UniProt database[2]) and several examples of multiply modified peptides were found (Fig. 2b and Supplementary Fig. 6c–e). An additional 8% of unique peptides with unexpected mass discrepancies were confidently identified in error tolerant searching. Together, these data imply that the OmpT-based workflow can provide isoform-specific assignments, characterization of modified peptides and combinatorial PTM information complementary to traditional protease-based proteomic approaches.
Figure 2

Proteotypic OmpT peptides, peptide size distribution and iceLogo of OmpT recognition site. (a) Peptides 1 and 2 (10.8 kDa and 5.4 kDa respectively) cover a proteotypic sequence region of 37 kDa L-lactate dehydrogenase A chain isoform 1 (Uniprot number: P00338-1, 87% identify to isoform 2). Cleavage sites for OmpT peptides are shown. The schematic isoform alignment (detailed sequence alignment in Supplementary Fig. 6a), marks the region where the two isoform sequences differ between dashed lines. Peptides covering the distinct part of a certain isoform are shaded in black; peptides covering the common regions of all isoforms are in grey. (b) 84 kDa heat shock protein HSP 90-beta (Uniprot accession number: P08238) identified by peptides in grey; phosphorylation sites in peptide 3 (8.9 kDa) are indicated (survey spectrum of the singly and doubly modified species in Supplementary Fig. 6c). (c) Mass distribution of identified OmpT peptides (below ~15 kDa) in comparison with tryptic peptides[16]. (d) IceLogo of OmpT recognition sequences from P4 through P4' sites. OmpT cleaves between P1 and P1'. The y axis displays the percentage difference of amino acid frequencies between the experimental set and the reference set at each position.

The OmpT peptide size distribution was plotted in comparison with tryptic peptides (Fig. 2c). Because we only analyzed fractions below ~15 kDa, the average size of peptides identified here was 6.3 kDa, but OmpT peptides above 15 kDa were readily visible on gels (Fig. 1a and Supplementary Fig. 5c). We compared the performance between collision induced dissociation (CID) and electron transfer dissociation (ETD) using OmpT peptides (Supplementary Fig. 7 and Supplementary Table 2). The low degree of overlap between the methods indicates that both CID and ETD can serve as highly complementary fragmentation approaches to identify and characterize OmpT peptides. Although the substrate specificity of OmpT has been extensively studied, previous model substrates were mostly short peptides and unstructured protein linker regions[12,13,15]. This study helped to improve our understanding of OmpT's sequence preference under denaturing conditions (3 M urea) where whole proteins were the substrates. We searched the entire dataset in “biomarker” mode against an intact protein database. A biomarker search assumes no specific proteolytic cleavage, but rather queries every possible sub-sequence in the database within tolerance from an observed peptide mass. Confident biomarker peptide hits were then used to extract the P4 through P4' recognition sites of OmpT for the generation of an unbiased consensus sequence. From these data, we generated an iceLogo that normalizes observed amino acid frequencies at each site to a reference set of proteomic amino acid frequencies (Fig. 2d and Supplementary Fig. 8a–b). We also made a WebLogo for comparison, which illustrates amino acid frequencies at each site solely based on the input sequences without normalization (Supplementary Fig. 8c). As shown in iceLogo and WebLogo representations, the P1 site was restricted almost exclusively to lysine and arginine, while the P1' site was more permissive, allowing predominantly lysine and arginine, but also alanine and serine. The relative promiscuity of OmpT at the P1' position may be attributed to the location of P1'-substrate binding site near the loops on top of the beta-barrel[14], which could have increased flexibility under denaturing conditions. Because of OmpT's broader specificity at the P1' site, we defined the “major cleavage sites” as K/R–K/R/A/S and performed another in silico digestion of human proteome at all these major sites assuming 0 and 2 missed cleavages (Supplementary Fig. 2). The resultant peptide size distributions strongly resembled the distributions assuming only K/R–K/R cleavages. In addition to selectivities at the P1 and P1' sites, the P2' site also had a slight preference for aliphatic amino acids. Overall, OmpT favored positively charged residues across its recognition sites (with the exception of P2) and resisted negatively charged and proline residues. Selectivities outside P1–P1' have been previously reported[12,15] and might explain the average number of observed missed cleavages (0.99 ± 1.29) at the major sites. In spite of these preferences, OmpT is still a stringent protease with well-defined substrate specificities, which will be better understood with future experimentation and data mining. The stable beta-barrel structure of this membrane endopeptidase endows it with a remarkable resistance to both denaturants and surfactants, allowing extensive denaturation of large protein substrates under strongly solubilizing conditions for robust proteolysis. The analysis of widely-distributed OmpT peptides across a broad mass range will necessitate adjustments in separation protocols and LC-MS/MS methods accordingly. New and next generation instruments will further increase the routine size range accessible for sequencing, and many proteomics search engines will require modification to identify these larger peptides. With a demonstrated capacity for robust and restricted proteolysis, OmpT is as an attractive option for mass spectrometry-based interrogation of protein primary structure.
  24 in total

1.  Crystal structure of the outer membrane protease OmpT from Escherichia coli suggests a novel catalytic site.

Authors:  L Vandeputte-Rutten; R A Kramer; J Kroon; N Dekker; M R Egmond; P Gros
Journal:  EMBO J       Date:  2001-09-17       Impact factor: 11.598

2.  Informatics and multiplexing of intact protein identification in bacteria and the archaea.

Authors:  F Meng; B J Cargile; L M Miller; A J Forbes; J R Johnson; N L Kelleher
Journal:  Nat Biotechnol       Date:  2001-10       Impact factor: 54.908

3.  In vitro folding, purification and characterization of Escherichia coli outer membrane protease ompT.

Authors:  R A Kramer; D Zandwijken; M R Egmond; N Dekker
Journal:  Eur J Biochem       Date:  2000-02

4.  Statistical significance for genomewide studies.

Authors:  John D Storey; Robert Tibshirani
Journal:  Proc Natl Acad Sci U S A       Date:  2003-07-25       Impact factor: 11.205

5.  Toward efficient analysis of >70 kDa proteins with 100% sequence coverage.

Authors:  A J Forbes; M T Mazur; H M Patel; C T Walsh; N L Kelleher
Journal:  Proteomics       Date:  2001-08       Impact factor: 3.984

6.  A method for the quantitative recovery of protein in dilute solution in the presence of detergents and lipids.

Authors:  D Wessel; U I Flügge
Journal:  Anal Biochem       Date:  1984-04       Impact factor: 3.365

7.  Substrate specificity at the P1' site of Escherichia coli OmpT under denaturing conditions.

Authors:  Kazuaki Okuno; Masayuki Yabuta; Kouji Kawanishi; Kazuhiro Ohsuye; Toshihiko Ooi; Shinichi Kinoshita
Journal:  Biosci Biotechnol Biochem       Date:  2002-01       Impact factor: 2.043

8.  Lipopolysaccharide regions involved in the activation of Escherichia coli outer membrane protease OmpT.

Authors:  R Arjen Kramer; Klaus Brandenburg; Lucy Vandeputte-Rutten; Marjolein Werkhoven; Piet Gros; Niek Dekker; Maarten R Egmond
Journal:  Eur J Biochem       Date:  2002-03

9.  Substrate specificity of the Escherichia coli outer membrane protease OmpT.

Authors:  John D McCarter; Daren Stephens; Kevin Shoemaker; Steve Rosenberg; Jack F Kirsch; George Georgiou
Journal:  J Bacteriol       Date:  2004-09       Impact factor: 3.490

10.  Mapping intact protein isoforms in discovery mode using top-down proteomics.

Authors:  John C Tran; Leonid Zamdborg; Dorothy R Ahlf; Ji Eun Lee; Adam D Catherman; Kenneth R Durbin; Jeremiah D Tipton; Adaikkalam Vellaichamy; John F Kellie; Mingxi Li; Cong Wu; Steve M M Sweet; Bryan P Early; Nertila Siuti; Richard D LeDuc; Philip D Compton; Paul M Thomas; Neil L Kelleher
Journal:  Nature       Date:  2011-10-30       Impact factor: 49.962

View more
  35 in total

1.  Analysis of Monoclonal Antibody Sequence and Post-translational Modifications by Time-controlled Proteolysis and Tandem Mass Spectrometry.

Authors:  Lichao Zhang; A Michelle English; Dina L Bai; Scott A Ugrin; Jeffrey Shabanowitz; Mark M Ross; Donald F Hunt; Wei-Han Wang
Journal:  Mol Cell Proteomics       Date:  2015-11-29       Impact factor: 5.911

2.  Rapid mitogenic regulation of the mTORC1 inhibitor, DEPTOR, by phosphatidic acid.

Authors:  Mee-Sup Yoon; Christina L Rosenberger; Cong Wu; Nga Truong; Jonathan V Sweedler; Jie Chen
Journal:  Mol Cell       Date:  2015-04-30       Impact factor: 17.970

3.  Neprosin, a Selective Prolyl Endoprotease for Bottom-up Proteomics and Histone Mapping.

Authors:  Christoph U Schräder; Linda Lee; Martial Rey; Vladimir Sarpe; Petr Man; Seema Sharma; Vlad Zabrouskov; Brett Larsen; David C Schriemer
Journal:  Mol Cell Proteomics       Date:  2017-04-12       Impact factor: 5.911

Review 4.  Protein analysis by shotgun/bottom-up proteomics.

Authors:  Yaoyang Zhang; Bryan R Fonslow; Bing Shan; Moon-Chang Baek; John R Yates
Journal:  Chem Rev       Date:  2013-02-26       Impact factor: 60.622

Review 5.  New-generation mass spectrometry expands the toolbox of cell and developmental biology.

Authors:  Camille Lombard-Banek; Erika P Portero; Rosemary M Onjiko; Peter Nemes
Journal:  Genesis       Date:  2017-01       Impact factor: 2.487

6.  Thorough Performance Evaluation of 213 nm Ultraviolet Photodissociation for Top-down Proteomics.

Authors:  Luca Fornelli; Kristina Srzentić; Timothy K Toby; Peter F Doubleday; Romain Huguet; Christopher Mullen; Rafael D Melani; Henrique Dos Santos Seckler; Caroline J DeHart; Chad R Weisbrod; Kenneth R Durbin; Joseph B Greer; Bryan P Early; Ryan T Fellers; Vlad Zabrouskov; Paul M Thomas; Philip D Compton; Neil L Kelleher
Journal:  Mol Cell Proteomics       Date:  2019-12-30       Impact factor: 5.911

7.  Complete Characterization of Cardiac Myosin Heavy Chain (223 kDa) Enabled by Size-Exclusion Chromatography and Middle-Down Mass Spectrometry.

Authors:  Yutong Jin; Liming Wei; Wenxuan Cai; Ziqing Lin; Zhijie Wu; Ying Peng; Takushi Kohmoto; Richard L Moss; Ying Ge
Journal:  Anal Chem       Date:  2017-04-12       Impact factor: 6.986

8.  Top-down/Bottom-up Mass Spectrometry Workflow Using Dissolvable Polyacrylamide Gels.

Authors:  Nobuaki Takemori; Ayako Takemori; Piriya Wongkongkathep; Michael Nshanian; Rachel R Ogorzalek Loo; Frederik Lermyte; Joseph A Loo
Journal:  Anal Chem       Date:  2017-08-02       Impact factor: 6.986

9.  Mass-biased partitioning to enhance middle down proteomics analysis.

Authors:  Joe R Cannon; Nathan J Edwards; Catherine Fenselau
Journal:  J Mass Spectrom       Date:  2013-03       Impact factor: 1.982

Review 10.  The growing landscape of lysine acetylation links metabolism and cell signalling.

Authors:  Chunaram Choudhary; Brian T Weinert; Yuya Nishida; Eric Verdin; Matthias Mann
Journal:  Nat Rev Mol Cell Biol       Date:  2014-08       Impact factor: 94.444

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.