| Literature DB >> 36120566 |
Linhui Zhai1,2,3, Le Wang1,3, Hao Hu3, Quan Liu3, Sangkyu Lee4, Minjia Tan1,3, Yinan Zhang1,2.
Abstract
High-throughput profiling of protein C-termini is still a challenging task. Proteomics provides a powerful technology for systematic and high-throughput study of protein C-termini. Various C-terminal peptide enrichment strategies based on chemical derivatization and chromatography separation have been reported. However, they are still costly and time-consuming, with low enrichment efficiency for C-terminal peptides. In this study, by taking advantage of the high reaction selectivity of 2-pyridinecarboxaldehyde (2-PCA) with an α-amino group on peptide N-terminus and high affinity between biotin and streptavidin, we developed a 2-PCA- and biotin labeling-based C-terminomic (PBC) strategy for a high-efficiency and high-throughput analysis of protein C-terminome. Triplicates of PBC experiments identified a total of 1,975 C-terminal peptides corresponding to 1,190 proteins from 293 T cell line, which is 180% higher than the highest reported number of C-terminal peptides identified from mammalian cells by chemical derivatization-based C-terminomics study. The enrichment efficiency (68%) is the highest among the C-terminomics methods currently reported. In addition, we not only uncovered 50 proteins with truncated C-termini which were significantly enriched in extracellular exosome, vesicle, and ribosome by a bioinformatic analysis but also systematically characterized the whole PTMs on C-terminal in 293 T cells, suggesting PBC as a powerful tool for protein C-terminal degradomics and PTMs investigation. In conclusion, the PBC strategy would benefit high-efficiency and high-throughput profiling of protein C-terminome.Entities:
Keywords: C-terminomics; chemical derivatization; enrichment; high-efficiency; post-translation modification (PTM)
Year: 2022 PMID: 36120566 PMCID: PMC9471192 DOI: 10.3389/fcell.2022.995590
Source DB: PubMed Journal: Front Cell Dev Biol ISSN: 2296-634X
FIGURE 1C-terminal peptide enrichment workflow through the PBC strategy. The extracted whole cell proteome is digested into peptides by LysC protease. Then the LysC-digested peptides are tandemly reacted with 2-PCA and NHS-biotin. Next, the peptides with biotin-labeled lysine were captured by streptavidin beads. Finally, the C-terminal peptides are enriched by filter-aided separation from streptavidin beads and detected by LC-MS/MS.
FIGURE 22-PCA displays efficiently and high selectively labeling peptide N-terminal. (A) Pie chart of the 2-PCA labeling efficiency distribution calculated by using the spectral counting-based quantification method. (B,C) 2-PCA labeling efficiency of different length (B) and hydrophilic/hydrophobic value of peptide (C). The GRAVY scores were calculated using the online tool (https://web.expasy.org/protparam/) (Kyte and Doolittle, 1982). The p-value was calculated by paired Student t-test. (D,E) Full labeling (D) and un-labeling (E) peptide percent on different first three amino acids. (F) IceLogo analysis representation of the first six amino acids of full labeling, partial labeling, and un-labeling peptides.
FIGURE 32-PCA and biotin labeling displays high-efficiency labeling for α-amino on peptide N-termini and ε-amino group on lysine. (A) Histogram distribution of biotin labeling efficiency on amino group in lysine in three technical replicates. More than 97.7% ε-amino group on K was blocked. (B) Histogram distribution of blocking efficiency on amino group in protein N-terminal in three technical replicates. More than 99.7% α-amino group on the peptide N-terminal were blocked. (C) Percentage of the theoretic “2-PCA-labeling” and “un-labeling” human protein C-terminal peptide by in silico digestion using LysC. Only 695 C-terminal peptides with second amino acid as proline could not be labeled by 2-PCA. (D) Peptide retention time comparison among before and after 2-PCA labeling. (E) Peptide ion score comparison among before and after 2-PCA labeling. All the experiments were performed in technical triplicates from the same started proteome sample.
FIGURE 4Highly efficient and high-throughput identification of protein C termini from 293T cells by using the PBC strategy. (A,B) Overlap analysis of C-terminal peptides (A) and proteins (B) from the triplicate experiment of C-terminal peptide enriched by using the PBC strategy. (C) Pie chart analysis of the frequency of C-terminal peptides identification from technical triplicates. A total of 46.9% peptides (926 peptides) were identified in all the triplicates, and 70.8% peptides (1,398 peptides) were identified in at least two replicates. (D) Correlation of intensity analysis of identified C-terminal peptides among technical triplicates. (E) Peptide abundance dynamic range of technical triplicates through the PBC strategy. The abundance dynamic range was larger than seven orders of magnitude.
Comparison the results of human protein C-terminomics studies.
| # | This study |
|
|
|
|---|---|---|---|---|
| Cell type | 293 T | 293 T | 293 T | HeLa |
| Sample amount | 900 µg | 1.5 mg | 120 µg | 1 mg |
| Enzyme | LysC | LysargiNase | LysargiNase | Trypsin |
| Number of C-terminal peptides | 1,975 | 1,100 | 2,000 | 4,724 |
| Selective enrichment efficiency (%) | 68 | 39.1 | 22.4 | 2.7–34 |
| Number of protein C-termini | 1,190 | 924 | 1,812 | 2,219 |
| Isolation method | Two chemical reaction steps | Three chemical reaction steps | Three chemical reaction steps + SCX | SCX |
FIGURE 5Function analysis of the enriched C-terminal truncated proteins and the PTM distribution on C-terminal peptide. (A) IceLogo analysis of the last 10 amino acids of original C-terminal identified by PBC strategy. (B) Cellular components enrichment analysis of C-terminal proteins identified by PBC strategy. (C) The gene ontology and KEGG pathway enrichment analysis of the proteins with C-terminal truncated. The truncated proteins significantly enriched in extracellular exosome, vesicle, and ribosome. (D) Pie chart analysis of identified PSM with and without containing PTMs. In total, more than 30% PSM were identified with PTM-labeled peptides. (E) Distribution of PTMs modified on C-terminal peptide. (F) Distribution of identified PTMs on last amino acid of protein C-termini.
Identified Neo-N-termini for HSP90AB1 (P08238) and H4 (P62805).
| Gene name | Protein accession number | Annotated sequence |
|---|---|---|
| HAP90AB1 | P08238 | [K].LGLGIDEDEVAA.[E] |
| [K].LGLGIDEDEVAAEEPNAAVPD.[E] | ||
| [K].LGLGIDEDEVAAEEPNAAVPDEIPPLEGD.[E] | ||
| [K].LGLGIDEDEVAAEEPNAAVPDEIPPLEGDED.[A] | ||
| [K].LGLGIDEDEVAAEEPNAAVPDEIPPLEGDEDASR.[M] | ||
| [K].LGLGIDEDEVAAEEPNAAVPDEIPPLEGDEDASRMEEVD.[-] | ||
| H4 | P62805 | [K].VFLENVIR.[D] |
| [K].VFLENVIRD.[A] | ||
| [K].VFLENVIRDA.[V] | ||
| [K].VFLENVIRDAV.[T] | ||
| [K].VFLENVIRDAVTY.[T] | ||
| [K].RQGRTLYGF.[G] | ||
| [K].RQGRTLYGFG.[G] | ||
| [K].RQGRTLYGFGG.[-] |