| Literature DB >> 35945258 |
Jasmina Damnjanović1, Nana Odake2, Jicheng Fan2,3, Maurizio Camagna4, Beixi Jia2, Takaaki Kojima2,5, Naoto Nemoto6, Kiyotaka Hitomi7, Hideo Nakano2.
Abstract
cDNA display is an in vitro display technology based on a covalent linkage between a protein and its corresponding mRNA/cDNA, widely used for the selection of proteins and peptides from large libraries (1012) in a high throughput manner, based on their binding affinity. Here, we developed a platform using cDNA display and next-generation sequencing (NGS) for rapid and comprehensive substrate profiling of transglutaminase 2 (TG2), an enzyme crosslinking glutamine and lysine residues in proteins. After screening and selection of the control peptide library randomized at the reactive glutamine, a combinatorial library of displayed peptides randomized at positions - 1, + 1, + 2, and + 3 from the reactive glutamine was screened followed by NGS and bioinformatic analysis, which indicated a strong preference of TG2 towards peptides with glutamine at position - 1 (Gln-Gln motif), and isoleucine or valine at position + 3. The highly enriched peptides indeed contained the indicated sequence and showed a higher reactivity as TG2 substrates than the peptide previously selected by phage display, thus representing the novel candidate peptide probes for TG2 research. Furthermore, the obtained information on substrate profiling can be used to identify potential TG2 protein targets. This platform will be further used for the substrate profiling of other TG isozymes, as well as for the selection and evolution of larger biomolecules.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35945258 PMCID: PMC9363462 DOI: 10.1038/s41598-022-17494-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Outline of the cDNA display platform for screening and selection of preferred TG2 peptide substrate. TG2 stands for transglutaminase 2. Red dotted line represents cleavage site of RNase T1.
Figure 2The heatmaps showing the color-coded enrichment factor of each amino acid at randomized positions of (A) LibQ and (B) Lib4.
Figure 3Reactivity of the top enriched peptides, Top 1 and Top 2, and heatmap-derived sequence (Q-Q-C-Y-I-) in comparison to T26 and non-substrate peptide, T26QN, determined by the TG2 enzymatic assay. Assay was carried out in plate format using biotinylated peptides in the concentration range 0–10 μM with immobilized casein as amino donor, as described in Methods. Reaction products were detected by absorbance at 450 nm. Bars represent means of triplicate measurements with STD indicated for each measurement. All peptides were chemically synthesized with N-terminal biotin tag by Bio-Synthesis, USA and provided by Biologica, Japan.
Figure 4Analysis of the enriched peptides from Lib4. (A) Permutation importance for the amino acids used as input for a Random Forest Regression model, sorted by importance. The permutation importance indicates how important given amino acid is to accurately predict enrichment. Color of bars indicates association with enrichment: Green = important for increase of enrichment, red = important for decrease of enrichment. Bars are mean permutation importance for ten-fold cross-validated calculations. Insets represent enrichment differences at positions − 1, + 1, + 2 and + 3 when a particular amino acid is in that position (left side) and when any other amino acid is in that position (right side), and are significant as determined by independent T-test (p < 0.01; n > 7000). X denotes any amino acid except the one analyzed in the bar/s to the left in each inset. (B) Effect of hydrophobicity on enrichment. Asterisks indicate significant difference to all other plots, determined by independent T-test (p < 0.01; n = 7988–35,482). (C) Effect of polarity on enrichment. Amino acids with polar residues are denoted with P, and non-polar with NP. Asterisks indicate significant difference, determined by independent T-test (p < 0.01; n > 70.000). Orange lines in all boxplots indicate median enrichment; boxes represent first and third quantile; whiskers represent 1.5 × inter-quantile range.
Summarized results of the Blastp search with four query sequences, consisting of the first 7 a.a. residues of the peptides against the human and mouse proteins.
| Query | T26 (HQSYVDP) | Heatmap-derived sequence (QQCYIDP) |
|---|---|---|
| Match | Ig light chain junction region Ig heavy chain junction region Ig kappa light chain variable region Ankyrin-3 isoforms mCG144990 Histone-lysine N-methyltransferase SETD2 isoforms Bombesin receptor Synaptogyrin-2 hCG1820523 | Zinc transporters (solute carrier 30 family) Ig kappa light chain variable region Ig light chain junction region Heparanase 2 and 3 Coatomer subunit beta Telomere length regulation protein TEL2 Gamma-glutamyl hydrolase precursor mCG12390 IgA heavy chain variable region |
| Match | Stromal cell-derived factor 1 Fanconi anemia group A protein Ig heavy chain junction region Signal peptide, CUB and EGF-like domain-containing protein 2 mCG4615 Testase 1 FERM and PDZ domain-containing protein 1 Amiloride-sensitive sodium channel subunit alpha Olfactory receptors Seven-transmembrane helix receptor Protein C9orf135 isoform X2 Deleted in lung and esophageal cancer protein 1 C-X-C motif chemokine 6 precursor Granulocyte chemotactic protein mCG147083 | Anaphase-promoting complex subunit 1 Ig light chain junction region Chromosome 8 ORF 53 (human) mCG1045346 Vitrin isoforms Ig heavy chain junction region Triple functional domain protein Kalirin |
Only the matches with 6–7 residues including one residue mismatch, or 5 residues without a mismatch are given. If aligned amino acid residues are not identical but are similar in size and nature, this was not considered a mismatch.
DNA sequences of the cDNA display constructs used in this study.
T7P sequence is bolded, peptide sequence is colored red, and 6xHis sequence blue. Randomized sites are italicized.