| Literature DB >> 35069635 |
Patrick Willems1,2, Elvis Ndah3, Veronique Jonckheere3, Frank Van Breusegem1,2, Petra Van Damme3.
Abstract
Alternative translation initiation is a widespread event in biology that can shape multiple protein forms or proteoforms from a single gene. However, the respective contribution of alternative translation to protein complexity remains largely enigmatic. By complementary ribosome profiling and N-terminal proteomics (i.e., riboproteogenomics), we provide clear-cut evidence for ~90 N-terminal proteoform pairs shaped by (alternative) translation initiation in Arabidopsis thaliana. Next to several cases additionally confirmed by directed mutagenesis, identified alternative protein N-termini follow the enzymatic rules of co-translational N-terminal protein acetylation and initiator methionine removal. In contrast to other eukaryotic models, N-terminal acetylation in plants cannot generally be considered as a proxy of translation initiation because of its posttranslational occurrence on mature proteolytic neo-termini (N-termini) localized in the chloroplast stroma. Quantification of N-terminal acetylation revealed differing co- vs. posttranslational N-terminal acetylation patterns. Intriguingly, our data additionally hints to alternative translation initiation serving as a common mechanism to supply protein copies in multiple cellular compartments, as alternative translation sites are often in close proximity to cleavage sites of N-terminal transit sequences of nuclear-encoded chloroplastic and mitochondrial proteins. Overall, riboproteogenomics screening enables the identification of (differential localized) N-terminal proteoforms raised upon alternative translation.Entities:
Keywords: Arabidopsis thaliana; N-terminal acetylation; N-terminal proteomics; alternative translation initiation; chloroplast transit peptide; riboproteogenomics; ribosome profiling; translation initiation site
Year: 2022 PMID: 35069635 PMCID: PMC8770321 DOI: 10.3389/fpls.2021.778804
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Ribosome sequencing (Ribo-seq) translation initiation site (TIS) calling in Arabidopsis thaliana. (A) Normalized ribosome footprint (RPF) density in TAIR10 Arabidopsis annotated coding sequence (CDS, representative gene model) regions after treatment with cycloheximide (CHX, red), which halts translation elongation, and lactimidomycin (LTM, blue), which halts translation initiation. (B) CHX (red) and LTM (blue) read coverage for cytochrome C-1 (AT1G22840.1). (C) Location of TIS called by PROTEOFORMER with respect to annotated TAIR10 gene models (for overview, see Supplementary Dataset S1). (D) Start codon distribution for called alternative TIS (aTIS).
Figure 2N-terminal peptide evidence for annotated and alternative TIS. (A) Number of identified Nt acetylation (NTA) or Nt-free (13C2D3-Ac) peptides matching TAIR10-annotated protein starts (position 1 or 2) in function of the identity of the ultimate N-terminal residue. (B) Distribution of aTIS (N = 81) matching the by N-terminal proteomics-identified alternative N-termini with respect to their location to TAIR10 gene models, being either intergenic, located in the 5′ UTR or downstream (dTIS) and in-frame in the CDS. (C) Peptide-to-spectrum matches (PSMs) are indicated for the 68 NME-compliant N-termini identified matching in-frame called dTIS of TAIR10 CDSs (Supplementary Dataset S2). In vivo Nt-free N-termini, labeled in vitro by 13C2D3-Ac, are colored in black, while in vivo NTA N-termini matching NatA, NatB, or NatC/E/F specificities are indicated in orange, blue, and green, respectively.
List of 68 N-terminal (Nt)-truncated proteoforms with matching riboproteogenomics evidence shaped by downstream alternative translation initiation sites (dTISs).
|
|
|
|
|
|---|---|---|---|
|
| |||
| AT1G14610 | Val-tRNA synthetase (TWN2) | Met45 | Ac-S |
| AT1G29880 | Gly-tRNA synthetase | Met40 | Ac-MD |
| AT3G11710 | Lys-tRNA synthetase (ATKRS-1) | Met17 | Ac-MD |
| AT5G26830 | Threonyl-tRNA synthetase (THRRS) | Met34 | Ac-A |
| AT1G52160 | tRNAse Z3 (TRZ3) | Met52 | Ac-ME |
| AT2G45330 | 2′ tRNA phosphotransferase | Met33 | Ac-MD |
| AT1G06560 | tRNA methyltransferase 4F (TRM4F) | Met37 | Ac-ME |
| AT1G36310 | tRNA methyltransferase 9 (TRM9) | Met29 | NH2-MR |
|
| |||
| AT1G33060 | NAC 014 (NAC014) | Met12 | Ac-T |
| AT1G49480 | Related to vernalization1 1 (RTV1) | Met3 | Ac-MD |
| AT1G72210 | Basic helix-loop-helix (bHLH96) | Met18 | Ac-ME |
| AT4G22745 | Methyl-CPG-binding domain 1 (MBD1) | Met7 | NH2-MN |
| AT5G67220 | BIM1 | Met23 | NH2-T |
|
| |||
| AT1G03360 | Ribosomal RNA processing 4 (RRP4) | Met3 | NH2-MR |
| AT1G07770; AT3G46040 | Ribosomal protein S15A (RPS15A) | Thr105(ACG)->Met | NH2-T |
| AT1G18540; AT1G74050; AT1G74060 | Ribosomal protein L6 family protein | Arg25(AGG)->Met | NH2-S |
| AT1G54270; AT3G13920 | EIF4A-2 | Arg52(AGG)->Met | NH2-G |
|
| |||
| AT1G71180 | Probable 3-hydroxyisobutyrate dehydrogenase | Met22 | Ac-ME |
| AT3G44310 | Nitrilase 1 (NIT1) | Met7 | Ac-S |
| AT1G58280 | Phosphoglycerate mutase family protein | Met43 | Ac-ME |
| AT3G60440 | Phosphoglycerate mutase family protein | Met24 | Ac-ME |
| AT5G16440 | Isopentenyl diphosphate isomerase 1 (IPP1) | Met59 | Ac-T |
| AT3G02780 | Isopentenyl diphosphate isomerase 2 (IPP2) | Met52 | Ac-T |
| AT4G37000 | Accelerated cell death (ACD2) | Met41 | Ac-ME |
| AT5G19150 | NAD(P)HX dehydratase | Met45 | Ac-S |
| AT5G24400 | 6-Phosphogluconolactonase 3 (PGL3) | Met70 | Ac-A |
| AT5G36700 | 2-Phosphoglycolate phosphatase 1 (PGLP1) | Met54 | Ac-T |
| AT3G56490 | HIS triad family protein 3 (HIT3) | Met19 | Ac-A |
| AT5G63890 | Histidinol dehydrogenase (HDH) | Met18 | NH2/Ac-MK |
| AT1G77670 | Pyridoxal phosphate-dependent transferase | Met41 | Ac-T |
| AT5G13050 | 5-Formyltetrahydrofolate cycloligase (5-FCL) | Met43 | NH2/Ac-S |
| AT5G12040 | ω-amidase | Met63 | Ac-A |
| AT4G08790 | Deaminated glutathione amidase | Met29 | Ac-A |
| AT5G03370 | Acylphosphatase family | Met66 | NH2/Ac-T |
| AT5G15870 | Glycosyl hydrolase family 81 protein | Met45 | Ac-S |
| AT5G41970 | Metal-dependent protein hydrolase | Met28 | NH2-A |
| AT3G10620 | Nudix hydrolase homolog 26 (NUDX26) | Met56 | Ac-ME |
|
| |||
| AT1G43900 | Protein phosphatase 2C family protein | Leu61(CTG)->Met | NH2/Ac-T |
| AT2G23070 | Casein kinase II subunit alpha-4 (CKA4) | Leu85(CTG)->Met | Ac-A |
| AT4G08500 | MAPK/ERK kinasekinase 1 (MEKK1) | Met8 | Ac-MK |
| AT3G12200 | NIMA-related kinase 7 (Nek7) | Met3 | Ac-ME |
| AT5G11860 | SCP1-like small phosphatase 5 (SSP5) | Met45 | NH2/Ac-MK |
|
| |||
| AT2G30110 | Ubiquitin-activating enzyme 1 (UBA1) | Met63 | NH2/Ac-A |
| AT2G36170 | Ubiquitin-60S ribosomal protein L40-1 (RPL40A) | Met84 | NH2-ML |
| AT5G46210 | Cullin4 (CUL4) | Met26 | NH2/Ac-MK |
| AT2G45170 | Autophagy 8E (ATG8E) | Met9 | Ac-MD |
| AT4G30920 | Leucine aminopeptidase 2 (LAP2) | Met57 | NH2/Ac-A |
| AT1G76140 | Prolyl endopeptidase | Met65 | Ac-G |
|
| |||
| AT1G60950 | Ferredoxin-2 (FD2) | Met52 | NH2-A |
| AT2G17420 | NADPH-dependent TRX reductase A (NTRA) | Met49 | Ac-ME |
| AT4G19880 | Glutathione S-transferase family protein | Met32 | NH2/Ac-A |
| AT5G27380 | Glutathione synthetase 2 (GSH2) | Met62 | Ac-ME |
| AT2G47730 | Glutathione S-transferase phi 8 (GSTF8) | Met49 | Ac-A |
| AT4G11600 | Glutathione peroxidase 6 (GPX6) | Met64 | Ac-A |
| AT1G66240 | Homolog of anti-oxidant 1 (ATX1) | Met31 | Ac-S |
| AT1G55805 | BolA-like family protein | Met52 | Ac-S |
|
| |||
| AT3G23100 | Homolog of X-ray repair cross complementing 4(XRCC4) | Met17 | NH2/Ac-V |
| AT2G19640 | ASH1-related protein 2 (ASHR2) | Ile2(ATA)->Met | Ac-MN |
| AT5G61140 | U5 small nuclear ribonucleoprotein helicase | Met72 | NH2/Ac-ML |
|
| |||
| AT5G66675 | Protein of unknown function (DUF677) | Met5 | Ac-MF |
| AT5G14540 | FLOE1 | Met18 | Ac-MD |
| AT3G27310 | Plant UBX domain-containing protein 1 (PUX1) | Met22 | Ac-ME |
| AT1G71840 | WD-40 repeat family protein | Met10 | Ac-MN |
| AT4G13940 | Adenosylhomocysteinase 1 (SAHH) | Leu57(TTG)->Met | NH2-S |
| AT2G39080 | NAD(P)-binding Rossmann-fold superfamily protein | Met59 | Ac-A |
| AT2G43290 | Calmodulin-like 5 (CML5) | Met47 | Ac-ML |
| AT3G47590 | Alpha/beta-Hydrolases superfamily protein | Met50 | Ac-MD |
| AT1G53280 | Protein DJ-1 homolog B (DJ1B) | Met48 | Ac-S |
For each gene locus, a description and the respective dTIS position (in the main protein isoform) were given, and in the case of translation at near-cognate start codons, the codon/position was provided. The identified NTA (Ac-) and/or Nt-free (NH.
Figure 3In vitro coupled transcription and translation (TnT) of TIS-mutagenized expression constructs confirm translation of the by riboproteogenomics-identified Nt proteoforms. Control and TIS-mutagenized pUNI51 constructs encoding the identified Nt proteoforms of (A) ISOPENTENYL DIPHOSPHATE ISOMERASE2 (IPP2, AT3G02780), (B) plant UBX DOMAIN-CONTAINING PROTEIN1 (PUX1, AT3G27310), and (C) NAC DOMAIN-CONTAINING PROTEIN14 (NAC014, AT1G33060). (Left) Genome views showing CHX and LTM strand-specific positional Ribo-seq data (red). LTM peaks corresponding to Ribo-seq-called dbTIS and dTIS were indicated by green and orange arrowheads, respectively. The vertical orange line indicates the position of the riboproteogenomic-matched dTIS. Nt peptides matching the aTIS event were plotted as red rectangles. In the case of PUX1, the Nt peptides span an exon-exon junction (right) (TIS mutagenized) Nt proteoform-encoding constructs were in vitro transcribed and translated. Following sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) and electroblotting, radiolabeled proteins were visualized by radiography. Assignments of the translation products corresponding to translation initiation at the TAIR10-annotated TIS (green flag, M1) and from the identified dTIS (orange flag) were verified by mutating their respective ATG start codons to the (near-cognate start) Leu-encoding codon TTG. In each case, theoretical molecular weights of the identified Nt proteoforms are indicated.
Figure 4Differences in N-terminal acetylation patterns of N-terminal proteoforms arising from stromal processing peptidase (SPP) processing or (alternative) translation initiation. (A) PSMs of NTA peptides identified in cell cultures in function of the deviation to the chloroplast transit peptide (cTP) P1′ cleavage site predicted by TargetP 2.0 (Almagro Armenteros et al., 2019). The orange rectangle reflects the P5–P5′ predicted region, considered in this study to assign neo-N-termini indicative of cTP cleavage. (B) Number of identified PSMs of identified NTA [and corresponding Nt-free (13C2D3-Ac), if any] Nt peptides matching considered cTP cleavages [see panel (A)] in the case of N-termini exclusively identified as Nt-free (right) or identified by at least one PSM as NTA (left). Only Ala, Ser, Thr, and Val neo-N-termini (number indicated above bar) were plotted, as these represent the majority of cTP cleavage neo-N-termini (3,739/4,322 PSMs, 87%). (C) iceLogo (Colaert et al., 2009) motifs of cleavage motifs (P5–P5′) with an Nt peptide start position deviating maximally five residues of the predicted chloroplast or mitochondrial transit peptide (cTP/mTP) cleavage site (Almagro Armenteros et al., 2019) and with a SUBA4 chloroplast consensus location (Hooper et al., 2017). In case multiple cleavage sites were identified within a 5-residue window, the most upstream neo-N-terminus was selected as representative member. (D) Degree of NTA (%) for annotated protein N-termini (left) and stromal neo-N-termini (right) was plotted for various types of N-termini according to their NAT specificity profiles. N-termini matching NatA, NatB, or NatC/E/F specificities are indicated in orange, blue, and green, respectively, while stromal protein NTA is indicated in gray. Only Nt residue(s) with at least five data points were plotted; for full overview (see Supplementary Dataset S4).
Figure 5Riboproteogenomic evidence indicative of proteolytic signal peptide processing and alternative TIS usage. Ribo-seq coverage and PSM counts of identified Nt peptides were plotted for the first exonic region of [(A), top] IPP1 (AT5G16440) and [(B), left] ω-amidase (AT5G12040). Genome view showing CHX and LTM strand-specific positional Ribo-seq data (red). Vertical lines indicate identified dTIS (orange) and the cTP predicted cleavage site (dark green, dotted line) (Almagro Armenteros et al., 2019). LTM peaks corresponding to Ribo-seq-called dbTIS and dTIS were indicated by green and orange arrowheads, respectively. The PSM counts for the in vivo NTA (yellow) and invitro13C2D3-NTA (i.e., in vivo Nt-free) (black) N-termini are plotted. MS1 spectra of uniquely identified NTA peptides originating from [(A), bottom] IPP1 (AT5G16440.1) and [(B), right] ω-amidase (AT5G12040.1). Isotopic envelopes matching the in vivo NTA (green) and in vitro Ac13C2D3 (black, +5 Da) counterparts are shown for different NTA Nt peptides identified (peptide on top, bold residues). The calculated degree of NTA (%) was indicated together with the corresponding Nt residue. Predicted cTP cleavage motifs (P2–P1) preceding the identified Nt peptide sequences as well as the initiator Met (iMet) of riboproteogenomic-mapped dTIS (orange) were indicated in the sequence. All peptide precursors had a 2+ charge, and precursor m/z values are indicated.