| Literature DB >> 33805528 |
Nikola Winter1, Maria Novatchkova2,3, Andreas Bachmair1.
Abstract
The first amino acid of a protein has an important influence on its metabolic stability. A number of ubiquitin ligases contain binding domains for different amino-terminal residues of their substrates, also known as N-degrons, thereby mediating turnover. This review summarizes, in an exemplary way, both older and more recent findings that unveil how destabilizing amino termini are generated. In most cases, a step of proteolytic cleavage is involved. Among the over 500 proteases encoded in the genome of higher eukaryotes, only a few are known to contribute to the generation of N-degrons. It can, therefore, be expected that many processing paths remain to be discovered.Entities:
Keywords: N-degron; autophagy; endoprotease; protein turnover; proteolytic processing; ubiquitin proteasome pathway
Year: 2021 PMID: 33805528 PMCID: PMC8037982 DOI: 10.3390/ijms22073545
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Fate of proteins with the indicated amino-terminal residues.
| Exposed Residue | Exposing Protease | Predicted Fate |
|---|---|---|
| Cys | Met aminopeptidase | Oxidation, arginylation, ubiquitylation, proteasomal degradation of protein |
| Pro | Met AP | Depending on sequence context: Pro N-degron |
| Gly | Met AP | Depending on sequence context: Gly N-degron |
| Ala | Met AP/mitochondrial protease | Depending on sequence context: Ala N-degron |
| Asn | Met AP (depending on sequence context; blocked by Met acetylation), other endoproteases? | De-amidation, arginylation, ubiquitylation, proteasomal degradation of protein |
| Gln | Met AP (depending on sequence context; blocked by Met acetylation), Metacaspases, other | De-amidation, arginylation, ubiquitylation, proteasomal degradation of protein |
| Asp | Caspase-like proteases, | Arginylation, ubiquitylation, proteasomal degradation of protein |
| Glu | Signal peptidase, | Arginylation, ubiquitylation, proteasomal degradation of protein |
| Lys | Metacaspases | Ubiquitylation, proteasomal degradation of protein |
| Arg | Metacaspases | Ubiquitylation, proteasomal degradation of protein |
| Met Φ 1 | No processing | Depending on sequence context: ubiquitylation, proteasomal degradation of protein |
1 Φ symbolizes an amino acid with hydrophobic side chain.
Figure 1Position-specific amino acid probability in the first thirty residues of Arabidopsis proteins represented as a sequence logo (in single letter code). (A) Sequence logo based on the complete set of nuclear-encoded proteins with one representative per gene. (B) Sequence logo based on the subset of proteins with predicted localization exclusively in cytoplasmic or nuclear compartments. Predictions were obtained from Araport11.
Abundance of amino termini in the predicted proteome of Arabidopsis 1.
| Amino-Terminal Dipeptide | Abundance in the Compete Proteome | Abundance | Abundance in Nuclear/Cytoplasmic Proteins | Abundance | General Frequency of Amino Acid 2 in the Proteome |
|---|---|---|---|---|---|
| MA | 5891 | 21.5% | 1917 | 18.7% | 6.3% |
| ME | 3059 | 11.1% | 1469 | 14.3% | 6.7% |
| MS | 2898 | 10.6% | 1140 | 11.1% | 9.1% |
| MG | 2441 | 8.9% | 1090 | 10.6% | 6.4% |
| MD | 1874 | 6.8% | 988 | 9.6% | 5.4% |
| MK | 1629 | 5.9% | 479 | 4.7% | 6.4% |
| MV | 1519 | 5.5% | 598 | 5.8% | 6.7% |
| MT | 1216 | 4.4% | 458 | 4.5% | 5.1% |
| MN | 976 | 3.6% | 378 | 3.7% | 4.4% |
| ML | 951 | 3.5% | 225 | 2.2% | 9.5% |
| MR | 931 | 3.4% | 259 | 2.5% | 5.4% |
| MI | 738 | 2.7% | 162 | 1.6% | 5.3% |
| MM | 692 | 2.5% | 225 | 2.2% | 2.5% |
| MP | 601 | 2.2% | 285 | 2.8% | 4.8% |
| MF | 544 | 2.0% | 167 | 1.6% | 4.3% |
| MQ | 509 | 1.9% | 155 | 1.5% | 3.5% |
| MY | 340 | 1.2% | 101 | 1.0% | 2.8% |
| MH | 208 | 0.8% | 51 | 0.5% | 2.3% |
| MC | 205 | 0.7% | 68 | 0.7% | 1.9% |
| MW | 164 | 0.6% | 41 | 0.4% | 1.2% |
| Sum | 27,386 | 10,256 |
1 For each gene identifier of Araport11 version 20160703, a single representative gene model was used as provided by Arabidopsis.org (accessed on 26 February 2021); genes from ChrM and ChrC were excluded.