| Literature DB >> 15003124 |
Sebastian Maurer-Stroh1, Masaki Gouda, Maria Novatchkova, Alexander Schleiffer, Georg Schneider, Fernanda L Sirota, Michael Wildpaner, Nobuhiro Hayashi, Frank Eisenhaber.
Abstract
We evaluated the evolutionary conservation of glycine myristoylation within eukaryotic sequences. Our large-scale cross-genome analyses, available as MYRbase, show that the functional spectrum of myristoylated proteins is currently largely underestimated. We give experimental evidence for in vitro myristoylation of selected predictions. Furthermore, we classify five membrane-attachment factors that occur most frequently in combination with, or even replacing, myristoyl anchors, as some protein family examples show.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15003124 PMCID: PMC395771 DOI: 10.1186/gb-2004-5-3-r21
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Numbers of analyzed sequences, experimentally verified myristoylated proteins plus their homologs and the set of additional new predictions
| Number in GenBank* | Number experimentally verified + homologs† | Number established NEW true predictions‡ | |
| Total | 600,916 | 1,122 | 2,037 |
| Leading methionine§ | 426,128 | 997 | 1,548 |
*Eukarya, non-identical sequences (nr100), February 2003. †Experimentally verified myristoylated proteins and their homologs with conserved myristoylation site (BLASTP E-value <0.005, plus manual curation). ‡Estimated NEW true predictions: (total number of predictions) minus (expected number of false-positive predictions) minus (number of experimentally verified + homologs) = number of true predictions not closely related to already experimentally verified examples (possibly additional new functions for myristoylated proteins?). §When a leading methionine is present in the sequence, it is less likely to represent a non-amino-terminal fragment in the database. However, this also excludes some true amino-terminal fragments, where a glycine becomes amino-terminal after proteolytic cleavage.
Experimentally verified myristoylated proteins and their subcellular localizations
| Experimentally verified protein type(s) | Total number in MYRbase | Subcellular localization | References | |
| GTP-binding proteins | 288 | |||
| G-protein alpha subunits (Gαi), Gαz), Gαo), Gαt)) | 201 | Type-dependent (receptor-specificity); for example, cytosol/plasma membrane (lipid rafts) | [ | |
| ADP ribosylation factors (ARF) and ARF-like (ARL) | 79 | Type-dependent; cytosol, intracellular membranes, cytoskeleton, nucleus/nucleolus (for example, ARL4, ARL5) | [ | |
| Rab5-related (Ara6) | 8 | Cytosol, endosomes, ER, plasma membrane | [ | |
| S/T-kinases | 206 | |||
| Ca2+/calmodulin-dependent kinases | 82 | Type-dependent; for example, cytosol, plasma membrane, ER membrane | [ | |
| Fen, Pto and related S/T-kinases | 60 | Cytosol, plasma membrane (~) | [ | |
| cAMP-dependent kinases (PKA catalytic subunit) | 51 | (Nucleus)/cytosol/(plasma membrane over anchoring proteins) | [ | |
| cGMP-dependent kinase II | 13 | (Cytosol)/plasma membrane | [ | |
| Ca2+-binding EF-hand proteins | 141 | |||
| Recoverin, neuronal calcium sensor, visinin, frequenin, neurocalcin, hippocalcin, | 66 | Type-dependent, triggered by Ca2+/myristoyl switch; for example, cytosol, plasma membrane, Golgi, neurofilament-rich structures | [ | |
| Calcineurin B phosphatase subunit, p22 | 43 | Cytosol, plasma membrane (phosphatidylserine-rich regions), microtubule cytoskeleton (p22) | [ | |
| Guanylate cyclase activators | 21 | Cytosol/rod outer segment membranes of photoreceptor cells | [ | |
| DNA-dependent kinase interacting protein KIP = calcium and integrin binding protein CIB = calmyrin | 11 | Cytosol, plasma membrane, ER, nuclear envelope, nucleoplasm, cytoskeleton | [ | |
| Other | ||||
| Tyrosine kinases (Abl, Blk, Fgr, Fyn, Hck, Lck, Lyn, Src, Yes) | 136 | Type-dependent; for example, cytosol/plasma membrane (lipid rafts/caveolae) | [ | |
| t-Actin (15 kDa carboxy-terminal fragment of cytoskeletal actin after caspase cleavage) | 89* | Cytosol, mitochondrial membrane | [ | |
| 26S ATP/ubiquitin-dependent protease regulatory subunit 4 | 26 | Nucleus, cytosol, microsomes | [ | |
| Endothelial nitric oxide synthase (eNOS) | 18 | Cytosol, Golgi, plasma membrane (lipid rafts/caveolae) | [ | |
| Golgi reassembly stacking protein 1 (65 kDa) and 2 (55 kDa) | 17 | Golgi membrane | [ | |
| Myristoylated alanine-rich carboxy-kinase substrate (MARCKS) | 15 | Cytosol, cytoskeleton, plasma membrane (lipid rafts) | [ | |
| Flagellar calcium-binding proteins in trypanosomes | 14 | Flagellar membrane | [ | |
| NADH cytochrome b-5 reductase (diaphorase) | 14 | Cytosol, ER membrane, mitochondrial outer membrane | [ | |
| 43 kDa acetylcholine receptor-associated protein of the synapse (RAPSYN) | 13 | Cytosol, Golgi, plasma membrane (lipid rafts/caveolae) | [ | |
| S/T protein phosphatase + EF hand (PPEF) | 12 | [ | ||
| Hydrophilic acylated surface protein B (HASPB) | 10 | Extracellular (!) face of the plasma membrane | [ | |
| Erythrocyte membrane protein band 4.2 (Pallidin) | 10 | Plasma membrane, intracellular vesicles | [ | |
| Vps15 (phosphatidylinositol 3-kinase-associated p150) | 10 | Cytosol, intracellular membranes (late Golgi) | [ | |
| Neuronal axonal membrane protein NAP-22 (brain acid soluble protein 1 BASP1; CAP23) | 9 | Golgi, plasma membrane (lipid rafts) | [ | |
| SSeCKS (A-kinase anchor protein 12) | 9 | Cytosol, cytoskeleton, plasma membrane | [ | |
| Sip2, beta subunits of 5'-AMP-activated protein kinase | 9 | Cytosol, plasma membrane | [ | |
| T-lymphoma invasion and metastasis inducing protein 1 (TIAM1 protein) | 9 | Cytosol, plasma membrane | [ | |
| PSD-Zip70, FEZ1 | 9 | Neuronal membranes (postsynaptic density, dendritic rafts), plasma membrane (lipid rafts) | [ | |
| S/T protein phosphatase Z | 8 | Nucleus | [ | |
| Annexin XIII | 6 | Cytosol, Golgi, plasma membrane (apical, lipid rafts) | [ | |
| Ezrin-binding partner PACE-1 | 6 | Cytosol, Golgi, lamellipodia | [ | |
| BH3 interacting domain death agonist (BID) | 5 | Cytosol, mitochondrial membrane | [ | |
| Calpastatin (testis-specific isoform tCAST) | 5 | Cytosol, intracellular membranes | [ | |
| A-kinase anchor protein 200 | 5 | Cytosol, plasma membrane | [ | |
| Golgi-associated PR-1 protein | 5 | Golgi, plasma membrane (lipid rafts/caveolae) | [ | |
| Vps20 (vacuolar protein sorting) | 4 | Cytosol, endosomal membranes | [ | |
| Adenylate kinase 1 | 4 | Cytosol, plasma membrane | [ | |
| A-kinase anchor protein 7 (18) | 3 | Cytosol, plasma membrane | [ | |
| 2'-5'-oligoadenylate synthetase splice form 2 | 3 | Cytosol, rough ER, ribosomes, nucleus | [ | |
| NADH ubiquinone dehydrogenase B18 | 3 | Mitochondrial membrane | [ | |
| Hisactophilin 1 and 2 (Histidine-rich actin binding protein) | 2 | Cytosol, plasma membrane, nucleus | [ | |
| Phosphoinositide-specific phospholipase C in | 2 | Cytosol, plasma membrane | [ | |
| Actin-myosin network maintenance protein Nullo | 2 | Plasma membrane (metaphase furrows during mitosis, cellularization front) | [ | |
| t-Gelsolin (fragment after caspase cleavage) | 2* | Not determined for myristoylated form, cytoskeleton(?) | [ | |
| Igloo (growth associated protein) | 1 | Cytosol, neuronal membranes | [ | |
| Flagellar creatine kinase (ATP:guanido phosphotransferase) | 1 | Sperm tail membranes | [ | |
*Not yet in MYRbase; cleavage motif is not automatically detected but is predicted when submitted as fragment to the predictor.
Figure 1Power-law distribution of family clusters in MYRbase with maximal 40% sequence identity and a minimal size of 3 (to exclude false positives). The inset table gives the values and correlation coefficients for different minimal cluster sizes. The power function distribution (without shift along the argument axis x), the Pareto distribution and the Zipf's law (if the rank is approximated by the argument x) have the common analytical form . Such distributions generally occur as the limit distribution of a multiplicative stochastic process with a lower boundary constraint (here, minimal cluster size). The common phenomenological form does not imply a unified mechanism for generating samples obeying these distribution functions. We suggest interpreting cluster size in terms of time for evolutionary divergence within the cluster (see text).
The five classes of co-occurring membrane attachment factors (coMAFs)
| coMAF class* | I | II | III | IV | V |
| Palmitoylatable cysteine† | Amino-terminal cluster of positive charges‡ | PIP2-specific binding domain§ | Transmembrane segments¶ | Other/not attributable (for example, protein-protein interactions)¥ | |
| Experimentally verified + homolog | 453 | 443/133/12 | 9 | 0 | 432 |
| Additional NEW prediction | 728 | 729/269/58 | 21 | 203 | 1,860 |
*Distribution of the five classes of coMAFs among the set of experimentally verified myristoylated proteins plus their homologs and the set of additional new predictions. Assignments are not necessarily unique, but can be combinations thereof. †At least one palmitoylatable cysteine within the first five residues (starting with the myristoylated glycine). ‡The content of positive charges (amino acids lysine (K), arginine (R) and histidine (H)) in the region of positions 6 to 35 is compared to the average composition in GenBank in the same region. The values correspond to hits occurring using threshold deviations of 1, 2 and 3σ from the GenBank average. §Significant HMMER-hit (below E-value of 0.01) with phosphatidylinositol-bisphosphate (PIP2)-specific binding domains (PH, ENTH, FERM, PX or FYVE). ¶A very conservative method was used to detect putative transmembrane segments. Besides requiring the attribute 'trusted' by the sensitive method DAS-TMfilter [151], we only list the hits that have more than three predicted TM regions and that are additionally filtered for their overall polarity measured by the content of positive charges. ¥All other proteins that did not fulfill any of the above criteria.
Figure 2Schematic representation of the membrane attachment of proteins with a myristoyl anchor (dark gray, space-filling atomic representation) in combination with different co-occurring membrane-attachment factors (coMAFs). Class I, plus palmitoyl anchor (also dark gray, space-filling); class II, plus cluster of positive charges (dark blue, spacefill); class III, plus PIP2-specific binding domain (PIP2 in space-filling, alkyl tails in cyan); class IV, plus transmembrane segments; class V, plus a domain for specific protein-membrane protein interactions. White space-filling molecules in the model membrane represent cholesterol and symbolize targeting to specialized compartments. Two different states of the calcium/myristoyl-switch of recoverin are depicted in the lower left of the figure (calcium ions in red). Visualization is with SwissPdb-Viewer [108].
Domain composition of the set of new predictions
| Number of entries | ID | Description |
| 57 | PF00097 | Zinc finger, C3HC4 type (RING finger) |
| 43 | PF00069 | Protein kinase domain |
| 35 | PF00481 | Protein phosphatase 2C |
| 26 | PF00023 | Ankyrin repeat |
| 21 | PF00931 | NB-ARC domain |
| 20 | PF00646 | F-box domain |
| 18 | PF04782 | Protein of unknown function (DUF632) |
| 18 | PF04783 | Protein of unknown function (DUF630) |
| 16 | PF00036 | EF hand |
| 14 | PF00135 | Carboxylesterase |
| 14 | PF00487 | Fatty acid desaturase |
| 13 | PF00651 | BTB/POZ domain |
| 13 | PF05049 | Interferon-inducible GTPase (IIGP) |
| 13 | PF00233 | 3'5'-cyclic nucleotide phosphodiesterase |
| 12 | PF00443 | Ubiquitin carboxyl-terminal hydrolase |
| 12 | PF00085 | Thioredoxin |
| 11 | PF00001 | 7 transmembrane receptor (rhodopsin family) |
| 11 | PF00047 | Immunoglobulin domain |
| 9 | PF00622 | SPRY domain |
| 9 | PF00628 | PHD-finger |
| 9 | PF00096 | Zinc finger, C2H2 type |
| 8 | PF01459 | Eukaryotic porin |
| 8 | PF03011 | Plasmodium falciparum erythrocyte membrane protein (PFEMP) |
| 8 | PF00400 | WD domain, G-beta repeat |
| 8 | PF00595 | PDZ domain (Also known as DHR or GLGF) |
| 8 | PF00560 | Leucine rich repeat |
| 7 | PF00520 | Ion transport protein |
| 7 | PF01135 | Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT) |
| 7 | PF00514 | Armadillo/beta-catenin-like repeat |
| 7 | PF00989 | PAS domain |
| 7 | PF00134 | Cyclin, amino-terminal domain |
| 7 | PF00169 | PH domain |
| 6 | PF00300 | Phosphoglycerate Phos Phosphoglycerate mutase family |
| 6 | PF01145 | SPFH domain / Band 7 family |
| 6 | PF01582 | TIR domain |
| 6 | PF02174 | PTB domain (IRS-1 type) |
| 6 | PF00462 | Glutaredoxin |
| 6 | PF03193 | Protein of unknown function, DUF258 |
| 6 | PF00515 | TPR domain |
| 6 | PF00240 | Ubiquitin family |
| 6 | PF01265 | Cytochrome c/c1 heme lyase |
| 6 | PF00702 | haloacid dehalogenase-like hydrolase |
| 6 | PF04641 | Protein of unknown function, DUF602 |
| 6 | PF00271 | Helicase conserved carboxy-terminal domain |
| 5 | PF00070 | Pyridine nucleotide-disulphide oxidoreductase |
| 5 | PF01417 | ENTH domain |
| 5 | PF00270 | DEAD/DEAH box helicase |
| 5 | PF02230 | Phospholipase/Carboxylesterase |
| 5 | PF00891 | O-methyltransferase |
| 5 | PF00046 | Homeobox domain |
| 5 | PF05003 | Protein of unknown function (DUF668) |
| 5 | PF00010 | Helix-loop-helix DNA-binding domain |
| 5 | PF00170 | bZIP transcription factor |
| 5 | PF00018 | SH3 domain |
| 5 | PF00255 | Glutathione peroxidase |
| 5 | PF00017 | SH2 domain |
| 5 | PF00089 | Trypsin |
| 5 | PF00153 | Mitochondrial carrier protein |
Examples of proteins predicted to be myristoylated
| Protein(s) | Total number in MYRbase | MI in MYRbase (number in cluster) | Taxonomic range |
| *Disease-resistance proteins | 32 | 25300600 (27) NB-ARC+LRR, 25453543 (5) TIR+NB-ARC+LRR | Plants |
| †S/T protein phosphatase 2C gamma (1G) | 30 | 2130393 (6)(ptc3), 26450759 (6), 4505999 (5)(1G), 25411959 (4), 22326510 (3), 6728987 (3), 6319415 (3)(ptc2/3), 23172576 (3)Dm,23615237 (2)Pf, 22326564 (2), 7488279 (2), 15239565 (1), 23483487 (1)(1Gpf), 24417194 (1), 25341907 (1), 21626866 (1)(1Gdm), 6319601 (1)(ptc4) | Mammals, insects, fungi, plants, apicomplexa |
| S/T-kinases (Crk1(cyclin-dependent kinase cdc2-related)-related) | 29 | 22327464 (11), 27817936 (10), 14532736 (4), 25402555 (3), 20805217 (1) | Plants |
| cGMP-specific 3',5'-cyclic phosphodiesterase 8A, 8B, 9A | 23 | 6166014 (10) 9A, 27479159 (9) 8AB, 21626649 (2) Dm, 2706887 (2) | Mammals, insects |
| *Thioredoxin | 20 | 15231958 (11), 1388078 (4), 28209505 (2), 28372832 (2), 28372834 (1) | Plants |
| Putative RING zinc finger proteins (ubiquitin ligases?) | 17 | 20279471 (10), 27500282 (4) Hs Mm, 7292914 (1) Dm, 3874246 (1) Ce, 23488532 (1) Pf | Plants, mammals, insects, apicomplexa |
| Interferon-inducible GTPase; 47 kDa GTPase IIGP | 16 | 25029534 (16) | Mammals |
| Mitochondrial import receptor subunit TOM40 | 16 | 12230369 (15), 6539563 (1) | Mammals, amphibians, insects, plants |
| ‡Germ cell less GCL | 15 | 21314704 (13), 7304006 (1), 6425507 (1) | Mammals, fish, insects, worms |
| Inward rectifier potassium channel Kir2.1 | 13 | 26336911 (13) | Mammals |
| Sphingolipid delta 4 desaturase DES1/DES2 | 12 | 27717299 (12) | Mammals, insects, worms |
| ARF GTPase-activating protein (PH, ArfGAP, ankyrin domains) | 12 | 22051029 (12) | Mammals |
| Naked cuticle 1,2 homolog | 12 | 22028145 (11), 27729789 (1) | Mammals |
| Protein-L-isoaspartate carboxylmethyltransferase | 12 | 27882417 (10), 27713894 (1), 3133008 (1) | Mammals, amphibians, worms |
| Cyclin-box carrying protein 1 | 11 | 1078903 (11) | Mammals, insects, worms |
Amino-terminal peptides of single representatives shown to be myristoylated in *[24] and †[152]. ‡Amino-terminal glycine to alanine mutation prevents nuclear envelope localization [153].
Diverse amino termini of selected sequences shown to undergo in vitro myristoylation
| Human protein | MYRbase identifier | Myristoylation motif |
| Immunoglobulin μ-heavy chain | 27650590 | GGTFSSYAISWV |
| Immunoglobulin λ-light chain | 4761381 | GQTASIT |
| 47 kDa GTPase IIGP | 23682869 | GQLFSS |
| Ubiquitin hydrolase Ubq-M | 5454156 | G |
| Lung cancer candidate FUS1 | 6005760 | GASGS |
| Potassium channel interacting protein KChIP1 | 7657247 | GAVMGTFSSLQT |
| Potassium channel Kir2.1 | 2282068 | GSV |
Residues indicated in bold type are positively charged; The C denoted in italic type indicates a palmitoylatable cysteine.