| Literature DB >> 20137074 |
Chi Nam Ignatius Pang1, Elisabeth Gasteiger, Marc R Wilkins.
Abstract
BACKGROUND: The methylation of eukaryotic proteins has been proposed to be widespread, but this has not been conclusively shown to date. In this study, we examined 36,854 previously generated peptide mass spectra from 2,607 Saccharomyces cerevisiae proteins for the presence of arginine and lysine methylation. This was done using the FindMod tool and 5 filters that took advantage of the high number of replicate analysis per protein and the presence of overlapping peptides.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20137074 PMCID: PMC2830191 DOI: 10.1186/1471-2164-11-92
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Finding high confidence modified peptides using 5 filtering strategies. By removing peptides that matched to both methylated peptides and unmodified peptides (filter 1) and peptides which contains D or E residues (filter 2), 2,641 peptides were left. By filtering for overlapping peptides (filter 3), and where at least one of the overlapping peptide is an unambiguous peptide match (filter 4), 163 peptides remained. These included 108 unambiguous peptide matches, 43 ambiguous peptide matches, and 12 peptides in both categories. Found in these peptides are 45 lysine methylation sites and 38 arginine methylation sites, all of which are of high confidence (filter 5).
Figure 2Filter 2: using two or more overlapping peptides to improve modification confidence. A peptide can have no missed cleavage or one missed cleavage (MC) at either the N-terminus or C-terminus of the peptide. The modification site has to be found in two or more overlapping peptides for it to be accepted for subsequent analyses. There are a few scenarios where these modification sites can be found. a) There can be one peptide with no missed cleavage overlapping with another peptide with one missed cleavage. b) The modification site can be found in two overlapping peptides each with one missed cleavage. c) In the third scenario, the modification site may be found in three peptides, one peptide with no missed cleavage, and two peptides with one missed cleavage each. In all of the above cases, at least one peptide in the overlapping peptides has to be an unambiguous peptide match. Cases where this was not seen were excluded from all subsequent analyses.
List of methylated proteins and methylation sites discovered by FindMod.
| Gene name | Swiss-Prot accession | Description | Methylated residuesa |
|---|---|---|---|
| ARC40 | Actin-related protein 2/3 complex subunit 1 | mK121 | |
| ATP2 | ATP synthase subunit beta, mitochondrial precursor | mK196 | |
| CDC11 | Cell division control protein 11 | dR35 | |
| DIP2 | U3 small nucleolar RNA-associated protein 12 | mK350 | |
| DNF1 | Probable phospholipid-transporting ATPase DNF1 | mK541 | |
| ECM29 | Proteasome component ECM29 | mR542, dR1112 | |
| EDE1 | EH domain-containing and endocytosis protein 1 | mR252 | |
| EMG1 | Essential for mitotic growth 1 | mK147 | |
| ERB1 | Ribosome biogenesis protein ERB1 | mK577, mK581 | |
| FKS1 | 1,3-beta-glucan synthase component FKS1 | mR946, dR946, dR952, mR962, dR962, dR1527 | |
| GCD10 | tRNA (adenine-N(1)-)-methyltransferase non-catalytic subunit TRM6 | mK436, mR447 | |
| GCN1 | Translational activator GCN1 | dK1446 | |
| GCN20 | Protein GCN20 | dK656 | |
| GUS1 | Glutamyl-tRNA synthetase, cytoplasmic | mR371 | |
| HAS1 | ATP-dependent RNA helicase HAS1 | dK444 | |
| IMD3 | Probable inosine-5'-monophosphate dehydrogenase IMD3 | mR168 | |
| ISW1 | ISWI chromatin-remodeling complex ATPase ISW1 | mK14 | |
| LEA1 | U2 small nuclear ribonucleoprotein A' | dR141 | |
| MPG1 | Mannose-1-phosphate guanyltransferase | dK299 | |
| MRPL17 | 54S ribosomal protein L17, mitochondrial precursor | mK70 | |
| MRPL20 | 54S ribosomal protein L20, mitochondrial precursor | mK104 | |
| NIP1 | Eukaryotic translation initiation factor 3 subunit C | mK514 | |
| NOC2 | Nucleolar complex protein 2 | mK384 | |
| NOG2 | Nucleolar GTP-binding protein 2 | dR336 | |
| NOT1 | General negative regulator of transcription subunit 1 | mR256 | |
| POL12 | DNA polymerase alpha subunit B | mK84 | |
| PRP43 | Pre-mRNA-splicing factor ATP-dependent RNA helicase PRP43 | dK662 | |
| PRT1 | Eukaryotic translation initiation factor 3 subunit B | mR572 | |
| PSD2 | Phosphatidylserine decarboxylase proenzyme 2 precursor | mR252 | |
| PYK1 | Pyruvate kinase 1 | mR216, dR216 | |
| RPA2 | DNA-directed RNA polymerase I subunit RPA2 | dK513 | |
| RPB2 | DNA-directed RNA polymerase II subunit RPB2 | mR496 | |
| RPL16A | 60S ribosomal protein L16-A | dK148 | |
| RPL18A, RPL18B | 60S ribosomal protein L18 | mR105 | |
| RPL1A, RPL1B | 60S ribosomal protein L1 | dK207 | |
| RPL20A, RPL20B | 60S ribosomal protein L20 | dK47 | |
| RPL27A, RPL27B | 60S ribosomal protein L27 | dR15, dK133 | |
| RPL2A, RPL2B | 60S ribosomal protein L2 | dR21 | |
| RPL3 | 60S ribosomal protein L3 | mR275, dK384 | |
| RPL4A | 60S ribosomal protein L4-A | dR84, mK104 | |
| RPL7A | 60S ribosomal protein L7-A | mR218 | |
| RPL7B | 60S ribosomal protein L7-B | mR218 | |
| RPL8B | 60S ribosomal protein L8-B | mK15, mK43, dK241 | |
| RPN2 | 26S proteasome regulatory subunit RPN2 | mK376 | |
| RPS11A, RPS11B | 40S ribosomal protein S11 | mR67 | |
| RPS13 | 40S ribosomal protein S13 | mK140 | |
| RPS17A | 40S ribosomal protein S17-A | dK59 | |
| RRP5 | rRNA biogenesis protein RRP5 | dR215, dK769 | |
| RSC1 | Chromatin structure-remodeling complex subunit RSC1 | mR454 | |
| RSC30 | Chromatin structure-remodeling complex protein RSC30 | mR692 | |
| RVB2 | RuvB-like protein 2 | mK412 | |
| SKI3 | Superkiller protein 3 | dK1088 | |
| SMB1 | Small nuclear ribonucleoprotein-associated protein B | mK138, mK145 | |
| SNF2 | Transcription regulatory protein SNF2 | dK1028 | |
| SNU114 | 114 kDa U5 small nuclear ribonucleoprotein component | dK356, mK935 | |
| SSB1 | Heat shock protein SSB1 | dR513 | |
| SSB2 | Heat shock protein SSB2 | dR513 | |
| TDH3 | Glyceraldehyde-3-phosphate dehydrogenase 3 | dR11 | |
| TIF32 | Eukaryotic translation initiation factor 3 subunit A | dK192 | |
| TUB2 | Tubulin beta chain | dR318 | |
| URA7 | CTP synthase 1 | dK28 | |
| USO1 | Intracellular protein transport protein USO1 | mK119 | |
| UTP22 | U3 small nucleolar RNA-associated protein 22 | mK1158 | |
| VPS52 | Vacuolar protein sorting-associated protein 52 | mR224 | |
| YKU70 | ATP-dependent DNA helicase II subunit 1 | dR549 | |
| YPR097W | PX domain-containing protein YPR097W | dK249 |
a: m, monomethylated; d, dimethylated
True positive rate at mass tolerance of 0.04 Da.
| Type of methylation | True positive rate (%) | No. of methylation sites correctly matched | No. of methylation sites tested | No. of peptides tested |
|---|---|---|---|---|
| Monomethyl-K | 100 | 27 | 466 | 1,201 |
| Dimethyl-K | 98 | 89 | 280 | 699 |
| Monomethyl-R | N.D.a | N.D.a | 3 | 6 |
| Dimeth-R | 76 | 28 | 137 | 495 |
| Monomethyl-K | 78 | 217 | 11,377 | 30,782 |
| Dimethyl-K | 89 | 3,673 | 11,377 | 30,691 |
| Monomethyl-R | 90 | 453 | 6,941 | 18,728 |
| Dimethyl-R | 90 | 2,140 | 6,902 | 18,594 |
a: N.D. - not determined
Methylated proteins identified independently by both FindMod and described in the literature.
| Ordered locus name (Swiss-Prot accession) | Protein name | Literaturea |
|---|---|---|
| YDL229W ( | Ssb1p | Wang and Lazarides [ |
| YNL209W ( | Ssb2p | Wang and Lazarides [ |
| YPR080W, YBR118W ( | Tef1p/Tef2p/eEF1αp | Cavallius |
| YBL087C, YER117W ( | Rpl23A/Rpl23B | Porras-Yakushi |
| YFL037W ( | Tub2p | Iwabata |
a: Proteins previously known to be methylated in yeast and in other organisms
Methylated proteins from yeast are enriched in specific processes, functions and components.
| Rank | Term (GO ID) | n1,1a | n1,2b | n2,1c | n2,2d | Corrected |
|---|---|---|---|---|---|---|
| 1 | Translation (6412) | 37 | 48 | 305 | 5712 | 3.57 e-23 |
| 2 | Ribosome biogenesis and assembly (42254) | 20 | 65 | 311 | 5706 | 5.03 e-7 |
| 3 | RNA metabolic process (16070) | 21 | 64 | 664 | 5353 | 0.01 |
| 4 | Organelle organization and biogenesis (6996) | 30 | 55 | 1230 | 4787 | 0.04 |
| 1 | Structural molecule activity (5198) | 31 | 54 | 304 | 5712 | 3.95 e-17 |
| 2 | Translation regulator activity (45182) | 5 | 80 | 47 | 5969 | 0.01 |
| 3 | RNA binding (3723) | 10 | 75 | 225 | 5791 | 0.03 |
| 1 | Ribosome (5840) | 33 | 52 | 307 | 5710 | 5.56 e-19 |
| 2 | Cytoplasm (5737) | 59 | 26 | 2701 | 3316 | 1.05 e-4 |
a: Number of methylated proteins with this GO slim term
b: Number of non-methylated proteins with this GO term
c: Number of methylated proteins with other GO slim term
d: Number of non-methylated proteins with other GO slim term
Figure 3Distribution of abundance of methylated proteins and half-life of lysine-methylated proteins, versus non-methylated proteins. a) The x-axis represents protein abundance, in copies per cell, as log base 10. The abundance of methylated proteins is represented with a solid line, while the abundance of non-methylated proteins is represented with a dotted line. b) The x-axis represents protein half-life, minutes, in log base 10. The half-life of lysine-methylated proteins is represented with a solid line, while the half-life of non-methylated proteins is represented with a dotted line. The arrow points to a group of proteins with very short half-life, seen only in 'other' proteins, which are likely to be unmethylated. Note that in both figures, 'other' proteins are those for which methylation was not found; this group may, however, contain some methylated proteins. Note also that abundance data and half-life data was not available for all yeast proteins in Belle et al. (2006) [53] and Ghaemmaghami et al. (2003) [50].
Lysine and arginine methylation motifs.
| Gene name | Swiss-Prot accession | Methylation sitea | Motifb |
|---|---|---|---|
| Gcd10p | meth-K4362 | RGKLHPLMT | |
| Pfk2p | meth-K1801 | HSYTDLAYR | |
| Rpl23Ap | dimeth-K1101 | GVIANPKGE | |
| Rps17Ap | dimeth-K592 | KIAGYTTHL | |
| Rvb2p | meth-K4122 | LISVAQQIA | |
| Ura7p | dimeth-K282 | VLASSTGML | |
| Uso1p | meth-K1192 | NGKYPSPLV | |
| Ecm29p | dimeth-R11122 | LAKSSALWSS | |
| Gus1p | dimeth-R3712 | IYRCNLTPHH | |
| Rpl27Bp | dimeth-R152 | LKAGKVAVVV | |
| Rpl4Ap | dimeth-R842 | IPRVGGGGTG | |
| Rps11Bp | dimeth-R672 | KCPFTGLVSI | |
| Tdh3p | dimeth-R112 | MVRVAINGFG | |
| Tub2p | dimeth-R3182 | GRYLTVAAFF | |
| Cdc11p | dimeth-R352 | VMIVGQS | |
| Ecm29p | meth-R5422 | ARLFNI | |
| Fks1p | meth-R9462, dimeth-R9462 | TLRTRI | |
| Fks1p | dimeth-R15272 | YHRNSWI | |
| Rpl4Ap | dimeth-R842 | IPRVGGG | |
| Rpl7Ap, Rpl7Bp | meth-R2182 | SNPSGG | |
| Rsc30p | meth-R6922 | SIKSFSS | |
| Tdh3p | dimeth-R112 | MVRVAIN | |
a: Evidence for the presence of the methylation site on this protein 1: Swiss-Prot, 2: methylation site confirmed by FindMod analysis
b: Methylation site matching the specified motif is underlined, the methylation site is highlighted in bold.