| Literature DB >> 19885390 |
Sandra Goetze1, Ermir Qeli, Christian Mosimann, An Staes, Bertran Gerrits, Bernd Roschitzki, Sonali Mohanty, Eva M Niederer, Endre Laczko, Evy Timmerman, Vinzenz Lange, Ernst Hafen, Ruedi Aebersold, Joël Vandekerckhove, Konrad Basler, Christian H Ahrens, Kris Gevaert, Erich Brunner.
Abstract
Protein modifications play a major role for most biological processes in living organisms. Amino-terminal acetylation of proteins is a common modification found throughout the tree of life: the N-terminus of a nascent polypeptide chain becomes co-translationally acetylated, often after the removal of the initiating methionine residue. While the enzymes and protein complexes involved in these processes have been extensively studied, only little is known about the biological function of such N-terminal modification events. To identify common principles of N-terminal acetylation, we analyzed the amino-terminal peptides from proteins extracted from Drosophila Kc167 cells. We detected more than 1,200 mature protein N-termini and could show that N-terminal acetylation occurs in insects with a similar frequency as in humans. As the sole true determinant for N-terminal acetylation we could extract the (X)PX rule that indicates the prevention of acetylation under all circumstances. We could show that this rule can be used to genetically engineer a protein to study the biological relevance of the presence or absence of an acetyl group, thereby generating a generic assay to probe the functional importance of N-terminal acetylation. We applied the assay by expressing mutated proteins as transgenes in cell lines and in flies. Here, we present a straightforward strategy to systematically study the functional relevance of N-terminal acetylations in cells and whole organisms. Since the (X)PX rule seems to be of general validity in lower as well as higher eukaryotes, we propose that it can be used to study the function of N-terminal acetylation in all species.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19885390 PMCID: PMC2762599 DOI: 10.1371/journal.pbio.1000236
Source DB: PubMed Journal: PLoS Biol ISSN: 1544-9173 Impact factor: 8.029
Figure 1Graphical representation of the datasets using Venn diagrams.
(A) Comparison of dataset generated by COFRADIC and shotgun analysis. A total of 1,102 distinct N-terminal peptides were identified with an overlap of 115 sequences. The COFRADIC approach yielded 835 N-terminal peptides among a total 4,203 distinct peptides identified from 8,402 spectra. In contrast, a classical shotgun approach on Kc cells not using COFRADIC enrichment yielded 382 N-terminal peptides among 19,915 distinct peptides (34,175 spectra) retrieved from Loevenich et al. [24]. (B) Identification of 124 distinct putative, alternative translation initiation sites identified in the COFRADIC and shotgun dataset, respectively. (C) Comparison of N-terminal peptides according to their acetylation status. From 1,226 distinct N-termini 861 were found to be acetylated, 317 non-acetylated with an overlap of 48 identical N-termini showing partial acetylation.
Figure 2Analysis of translational start sites.
(A) A Frequency analysis [54] of all predicted N-termini present in the Drosophila database BDGP_Release_3.2 revealed the presence of the Cavener consensus sequence C_A/G_A_A/C_ATG [25],[26]. The histogram shows the relative frequencies of nucleotides 5′ to the predicted (conventional) initiator codon generating acetylated as well as non-acetylated N-termini starting at position 1 or 2 of the predicted protein sequence. The nucleotide 3′ of the initiator sequence has been proposed to preferentially be a G at +4 [55] for strong initiation but has been shown not to be relevant for Drosophila [25],[26]. Interestingly the G at +4 is nevertheless predominant in all cases. (B) Relative frequencies of nucleotides 5′ to the alternative translation initiation sites. The Cavener sequence is conserved showing a shift in sequence preference at position −1 (from A to C), which fully complies with the Cavener consensus sequence (C_A_A_C_ATG) [25],[26].
Frequencies of acetylated and non-acetylated Drosophila Kc 167, yeast, and HeLa cell proteins with various N-terminal sequences (position 1 or 2).
|
|
|
| ||||||||||||||||||||||
| Ace | Free | Partial | Total | % Ace | % Free | % Partial | % AceTotal | Ace | Free | Partial | Total | % Ace | % Free | % Partial | % AceTotal | Ace | Free | Partial | Total | % Ace | % Free | % Partial | % AceTotal | |
|
| ||||||||||||||||||||||||
| Ala- | 201 | 21 | 9 | 231 | 87.0 | 9.1 | 3.9 | 90.9 | 0 | 23 | 17 | 40 | 0.0 | 57.5 | 42.5 | 42.5 | 252 | 11 | 26 | 289 | 87.2 | 3.8 | 9.0 | 96.2 |
| Cys- | 6 | 4 | 0 | 10 | 60.0 | 40.0 | 0.0 | 60.0 | 0 | 1 | 0 | 1 | 0.0 | 100.0 | 0.0 | 0.0 | 1 | 0 | 0 | 1 | 100.0 | 0.0 | 0.0 | 100.0 |
| Gly- | 15 | 35 | 1 | 51 | 29.4 | 68.6 | 2.0 | 31.4 | 0 | 13 | 0 | 13 | 0.0 | 100.0 | 0.0 | 0.0 | 6 | 16 | 5 | 27 | 22.2 | 59.3 | 19.0 | 40.7 |
| Ser- | 248 | 19 | 11 | 278 | 89.2 | 6.8 | 4.0 | 93.2 | 0 | 3 | 90 | 93 | 0.0 | 3.2 | 96.8 | 96.8 | 109 | 1 | 1 | 111 | 98.2 | 0.9 | 1.0 | 99.1 |
| Thr- | 44 | 20 | 5 | 69 | 63.8 | 29.0 | 7.2 | 71.0 | 0 | 14 | 8 | 22 | 0.0 | 63.6 | 36.4 | 36.4 | 21 | 2 | 3 | 26 | 80.8 | 7.7 | 12.0 | 92.3 |
| Val- | 9 | 47 | 5 | 61 | 14.8 | 77.0 | 8.2 | 23.0 | 0 | 22 | 0 | 22 | 0.0 | 100.0 | 0.0 | 0.0 | 1 | 25 | 5 | 31 | 3.2 | 80.6 | 16.0 | 19.4 |
|
| ||||||||||||||||||||||||
| Met-Asn- | 38 | 4 | 1 | 43 | 88.4 | 9.3 | 2.3 | 90.7 | 7 | 0 | 0 | 7 | 100.0 | 0.0 | 0.0 | 100.0 | 12 | 0 | 2 | 14 | 85.7 | 0.0 | 14.0 | 100.0 |
| Met-Asp- & Met-Glu- | 206 | 9 | 4 | 219 | 94.1 | 4.1 | 1.8 | 95.9 | 18 | 0 | 0 | 18 | 100.0 | 0.0 | 0.0 | 100.0 | 124 | 0 | 3 | 127 | 97.6 | 0.0 | 2.0 | 100.0 |
|
| ||||||||||||||||||||||||
| Met-Ile- | 9 | 6 | 1 | 16 | 56.3 | 37.5 | 6.3 | 62.5 | 0 | 2 | 0 | 2 | 0.0 | 100.0 | 0.0 | 0.0 | 2 | 1 | 1 | 4 | 50.0 | 25.0 | 25.0 | 75.0 |
| Met-Leu- | 17 | 13 | 2 | 32 | 53.1 | 40.6 | 6.3 | 59.4 | 1 | 6 | 1 | 8 | 12.5 | 75.0 | 12.5 | 25.0 | 4 | 3 | 3 | 10 | 40.0 | 30.0 | 30.0 | 70.0 |
| Met-Phe | 13 | 6 | 1 | 20 | 65.0 | 30.0 | 5.0 | 70.0 | 1 | 1 | 0 | 2 | 50.0 | 50.0 | 0.0 | 50.0 | 7 | 2 | 1 | 10 | 70.0 | 20.0 | 10.0 | 80.0 |
|
| ||||||||||||||||||||||||
| Asp- & Glu- | 0 | 0 | 0 | 0 | N.D. | N.D. | N.D. | N.D. | 0 | 0 | 0 | 0 | N.D. | N.D. | N.D. | N.D. | 2 | 0 | 0 | 2 | 100.0 | 0.0 | 0.0 | 100.0 |
| Met-Gln- | 23 | 3 | 1 | 27 | 85.2 | 11.1 | 3.7 | 88.9 | 2 | 1 | 1 | 4 | 50.0 | 25.0 | 25.0 | 75.0 | 7 | 1 | 2 | 10 | 70.0 | 10.0 | 20.0 | 90.0 |
| Met-Lys- | 3 | 25 | 3 | 31 | 9.7 | 80.6 | 9.7 | 19.4 | 0 | 10 | 0 | 10 | 0.0 | 100.0 | 0.0 | 0.0 | 2 | 11 | 6 | 19 | 10.5 | 57.9 | 32.0 | 42.1 |
| Met-Met- | 5 | 1 | 1 | 7 | 71.4 | 14.3 | 14.3 | 85.7 | 0 | 0 | 0 | 0 | N.D. | N.D. | N.D. | N.D. | 7 | 0 | 0 | 7 | 100.0 | 0.0 | 0.0 | 100.0 |
| Met-X*- | 24 | 33 | 3 | 60 | 40.0 | 55.0 | 5.0 | 45.0 | 2 | 5 | 1 | 8 | 25.0 | 62.5 | 12.5 | 37.5 | 8 | 2 | 3 | 13 | 61.5 | 15.4 | 23.1 | 84.6 |
| Pro- | 0 | 71 | 0 | 71 | 0.0 | 100.0 | 0.0 | 0.0 | 0 | 11 | 0 | 11 | 0.0 | 100.0 | 0.0 | 0.0 | 0 | 40 | 0 | 40 | 0.0 | 100.0 | 0.0 | 0.0 |
| Ile- | 0 | 0 | 0 | 0 | N.D. | N.D. | N.D. | N.D. | 0 | 1 | 0 | 1 | 0.0 | 100.0 | 0.0 | 0.0 | 0 | 1 | 0 | 1 | 0.0 | 100.0 | 0.0 | 0.0 |
| Total | 861 | 317 | 48 | 1226 | 70.2 | 25.9 | 3.9 | 74.1 | 31 | 113 | 118 | 262 | 11.8 | 43.1 | 45.0 | 56.9 | 565 | 116 | 61 | 742 | 76.1 | 15.6 | 8.2 | 84.4 |
Table 1 visualizes the Drosophila (fly) dataset according to known NAT substrates in humans and yeast (table adapted from [11]). Ace, acetylated N-termini; free, non acetylated N-termini; X* = A,G,H,P,R,S,T,V,Y.
Pfam analysis to correlate functional domains with the N-terminal acetylation status of a protein.
| Domain Name | Domain Accession | Domain Description | Domains Total | Hits Ace | Hits Free |
| Acetyltransf_1 | PF00583.16 | Acetyltransferase_(GNAT)_family |
| 0 |
|
| Actin | PF00022.11 | Actin | 15 | 2 | 7 |
| Aldedh | PF00171.14 | Aldehyde_dehydrogenase_family | 11 | 5 | 1 |
| Aldo_ket_red | PF00248.13 | Aldo/keto_reductase_family |
|
| 0 |
| AT_hook | PF02178.11 | AT_hook_motif |
|
| 0 |
| OSCP | PF00213.10 | ATP_synthase_delta_(OSCP)_subunit |
| 0 |
|
| BolA | PF01722.10 | BolA-like_protein | 2 | 2 | 1 |
| BTB | PF00651.23 | BTB/POZ_domain |
|
| 0 |
| Metallophos | PF00149.20 | Calcineurin-like_phosphoesterase | 22 | 5 | 1 |
| CH | PF00307.23 | Calponin_homology_(CH)_domain | 27 | 7 | 2 |
| CoA_binding | PF02629.11 | CoA_binding_domain |
|
| 0 |
| Cofilin_ADF | PF00241.12 | Cofilin/tropomyosin-type_actin-binding_protein | 7 | 4 | 2 |
| COG6 | PF06419.3 | Conserved_oligomeric_complex_COG6 |
|
| 0 |
| cwf21 | PF08312.4 | cwf21 |
|
| 0 |
| COX6B | PF02297.9 | Cytochrome_oxidase_c_subunit_VIb |
|
| 0 |
| dUTPase | PF00692.11 | dUTPase |
| 0 |
|
| Dynamin_N | PF00350.15 | Dynamin_family |
|
| 0 |
| efhand | PF00036.24 | EF_hand | 27 | 4 | 3 |
| ENTH | PF01417.12 | ENTH_domain |
|
| 0 |
| FERM_N | PF09379.2 | FERM_N-terminal_domain |
| 0 |
|
| Glycolytic | PF00274.11 | Fructose-bisphosphate_aldolase_class-I |
|
| 0 |
| GSHPx | PF00255.11 | Glutathione_peroxidase | 3 | 2 | 2 |
| GST_N | PF02798.12 | Glutathione_S-transferase,_N-terminal_domain | 42 | 11 | 5 |
| HATPase_c | PF02518.18 | Histidine_kinase-,_DNA_gyrase_B-,_and_HSP90-like_ATPase | 4 | 1 | 2 |
| IBN_N | PF03810.11 | Importin-beta_N-terminal_domain |
|
| 0 |
| BIR | PF00653.13 | Inhibitor_of_Apoptosis_domain |
|
| 0 |
| Iso_dh | PF00180.12 | Isocitrate/isopropylmalate_dehydrogenase |
|
| 0 |
| Kinesin | PF00225.15 | Kinesin_motor_domain |
|
| 0 |
| L27 | PF02828.8 | L27_domain |
|
| 0 |
| LSM | PF01423.14 | LSM_domain | 16 | 7 | 1 |
| Lactamase_B | PF00753.19 | Metallo-beta-lactamase_superfamily | 7 | 1 | 3 |
| MBD | PF01429.11 | Methyl-CpG_binding_domain |
|
| 0 |
| Miro | PF08477.5 | Miro-like_protein | 72 | 8 | 2 |
| MIT | PF04212.10 | MIT_(microtubule_interacting_and_transport)_domain |
|
| 0 |
| IATP | PF04568.4 | Mitochondrial_ATPase_inhibitor,_IATP | 3 | 2 | 2 |
| Mito_carr | PF00153.19 | Mitochondrial_carrier_protein | 47 | 3 | 3 |
| Myosin_N | PF02736.11 | Myosin_N-terminal_SH3-like_domain |
| 0 |
|
| NTF2 | PF02136.12 | Nuclear_transport_factor_2_(NTF2)_domain | 4 | 3 | 1 |
| HEAT_PBS | PF03130.8 | PBS_lyase_HEAT-like_repeat |
| 0 |
|
| PRA1 | PF03208.11 | PRA1_family_protein |
|
| 0 |
| Proteasome | PF00227.18 | Proteasome_A-type_and_B-type |
|
| 0 |
| Pkinase | PF00069.17 | Protein_kinase_domain |
|
| 0 |
| Pkinase_Tyr | PF07714.9 | Protein_tyrosine_kinase |
|
| 0 |
| Y_phosphatase | PF00102.19 | Protein-tyrosine_phosphatase | 5 | 2 | 2 |
| Esterase | PF00756.12 | Putative_esterase |
| 0 |
|
| PK | PF00224.13 | Pyruvate_kinase,_barrel_domain | 4 | 2 | 2 |
| Ras | PF00071.14 | Ras_family | 76 | 8 | 2 |
| Ribosomal_S6e | PF01092.11 | Ribosomal_protein_S6e | 3 | 3 | 3 |
| Rieske | PF00355.18 | Rieske_[2Fe-2S]_domain |
|
| 0 |
| RNA_pol_L | PF01193.16 | RNA_polymerase_Rpb3/Rpb11_dimerisation_domain | 5 | 1 | 3 |
| RRM_1 | PF00076.14 | RNA_recognition_motif._(a.k.a._RRM,_RBD,_or_RNP_domain) | 70 | 10 | 3 |
| SelR | PF01641.10 | SelR_domain |
|
| 0 |
| Septin | PF00735.10 | Septin |
|
| 0 |
| Serpin | PF00079.12 | Serpin_(serine_protease_inhibitor) |
|
| 0 |
| adh_short | PF00106.17 | short_chain_dehydrogenase |
|
| 0 |
| Cpn60_TCP1 | PF00118.16 | TCP-1/cpn60_chaperonin_family | 14 | 6 | 4 |
| TPR_1 | PF00515.20 | Tetratricopeptide_repeat | 11 | 3 | 2 |
| Thymosin | PF01290.12 | Thymosin_beta-4_family |
|
| 0 |
| Tmemb_18A | PF09771.1 | Transmembrane_protein_188 |
|
| 0 |
| Tubulin | PF00091.17 | Tubulin/FtsZ_family,_GTPase_domain |
| 0 |
|
| TPD52 | PF04201.7 | Tumour_protein_D52_family |
|
| 0 |
| ubiquitin | PF00240.15 | Ubiquitin_family |
| 0 |
|
| Cg6151-P | PF10233.1 | Uncharacterized_conserved_protein_CG6151-P |
|
| 0 |
| Vps35 | PF03635.9 | Vacuolar_protein_sorting-associated_protein_35 | 2 | 2 | 2 |
| V-ATPase_H | PF03224.6 | V-ATPase_subunit_H |
|
| 0 |
| VHS | PF00790.11 | VHS_domain | 4 | 1 | 2 |
| WD40 | PF00400.24 | WD_domain,_G-beta_repeat | 42 | 11 | 5 |
Table 2 shows the correlation of Pfam domains starting within the first 60 amino acids of a protein with its N-terminal acetylation status. Pfam domains that were solely associated with an acetylated N-terminus are indicated in italics. Pfam domains that were found to be exclusively associated with a free N-terminus are shown in bold. The p values for these correlations are summarized in Table S5A. Ace, acetylated N-termini; free, non-acetylated N-termini.
Figure 3Schematic drawing of the (X)PX rule.
(A) During translation of a protein with the sequence Met-Sur at its N-terminus (Sur being a small and uncharged amino acid residue, in this case Sur is equal to an Ala residue in green), the iMet will be removed by a methionine aminopeptidase (MAP, brown bubble) and the processed amino-terminus will be acetylated at the alpha amine of the Ala residue by a NAT (green oval). (B and C) A protein with the sequence Met-Pro at its N-terminus (referred to as Pro-X2, panel B) will undergo iMet cleavage by the methionine aminopeptidase (MAP, brown bubble) and the processed amino-terminus will remain unacetylated. If iMet cleavage is not taking place, proteins with the sequence Met-Pro-X3 at their mature amino-terminus (panel C) will also remain unacetylated. The Pro residue (red) thus prevents acetylation even if iMet cleavage occurs only partially. (D) Similarly, from a protein having the sequence X1-Pro-X3 (X1 being Ala or Ser, in this case Sur is equal to an Ala residue in green, Pro in red), the iMet will be removed and the processed amino-terminus will remain unacetylated (panel D). Although partial removal of the iMet is rarely observed under these circumstances, the N-terminus with the amino acid sequence M-A-P usually remains unacetylated as also observed by SRM measurements.
Figure 4Workflow to test the (X)PX rule.
(A) The cDNAs of hyrax (hyx) and cks85A were modified as follows: in the Drosophila hyx cDNA the codon for the secondary Ala was replaced by a Pro to prevent acetylation of the amino-terminus. Similarly, we replaced the codon for the secondary Pro in the Drosophila cks85A cDNA by either a Ser or an Ala to promote acetylation. In addition, all constructs were C-terminally HA-tagged and subjected to the control of the tubulin-1α promoter. The different constructs subsequently express Hyx-A2P-HA, Hyx-Wt-HA, Cks-P2A-HA, Cks-P2S-HA, and Cks-Wt-HA, respectively. (B) To test the (X)PX rule in vitro transient transfections of S2 cells were performed. The tagged proteins were isolated via immunoprecipitation and subjected to mass spectrometry analysis via SRM. As expected, the N-terminus of Hyx-Wt-HA was found to be acetylated in combination with a complete iMet removal. In contrast, the amino terminus of Hyx-A2P-HA was always unmodified but showed partial iMet cleavage (9.1%). For both kinase mutants Cks-P2A-HA as well as Cks-P2S-HA the N-Terminus is always acetylated. For Cks-P2S-HA we also detected the acetylated peptide MSADQIQYSEK caused by an incomplete iMet removal (9.4%). Cks-Wt-HA could not be detected due to toxicity effects of the expressed transgene but has initially been isolated via COFRADIC with a free N-terminus.
Figure 5In vivo analysis of genetically modified hyrax transgenes.
For an in vivo analysis, the hyrax Hyx-A2P-HA, Hyx-Wt-HA transgenes were integrated into the fly genome using the site-specific phiC31-mediated integration system [36]. Fly genotypes: tub>hyx-wt-HA/CyO; hyx (lanes labeled WT), tub>hyx-A2P-HA/CyO; hyx (lanes labeled A2P), Sp/CyO; hyx (negative controls, lanes labeled hyx). Numbers in lanes represent µg of protein loaded (A, B). (A) Total fly lysates of flies reared at 25°C were subjected to Western blot analysis and revealed identical expression levels for the two transgenes (97% Hyx-Wt-HA in comparison to Hyx-A2P-HA). Western blot analysis of total fly lysates of flies reared at 29°C revealed a 32% reduced amount of Hyx-A2P-HA protein as compared to Hyx-Wt-HA controls. (B) Confocal images of stained wing discs from 3rd instar larvae of Sp/CyO; hyx2/TM6b (negative controls, opened pinhole), Hyx-Wt-HA/CyO; hyx2/TM6b and Hyx-A2P-HA/CyO; hyx2/TM6b reared at 25°C and 29°C, respectively. At both temperatures, a similar strong nuclear staining was observed for Hyx-Wt-HA and Hyx-A2P-HA and no difference in localization could be detected.