Literature DB >> 26158488

Enrichment and Analysis of Intact Phosphoproteins in Arabidopsis Seedlings.

Uma K Aryal1, Andrew R S Ross2, Joan E Krochko2.   

Abstract

Protein phosphorylation regulates diverse cellular functions and plays a key role in the early development of plants. To complement and expand upon previous investigations of protein phosphorylation in Arabidopsis seedlings we used an alternative approach that combines protein extraction under non-denaturing conditions with immobilized metal-ion affinity chromatography (IMAC) enrichment of intact phosphoproteins in Rubisco-depleted extracts, followed by identification using two-dimensional gel electrophoresis (2-DE) and liquid chromatography-tandem mass spectrometry (LC-MS/MS). In-gel trypsin digestion and analysis of selected gel spots identified 144 phosphorylated peptides and residues, of which only 18 phosphopeptides and 8 phosphosites were found in the PhosPhAt 4.0 and P3DB Arabidopsis thaliana phosphorylation site databases. More than half of the 82 identified phosphoproteins were involved in carbohydrate metabolism, photosynthesis/respiration or oxidative stress response mechanisms. Enrichment of intact phosphoproteins prior to 2-DE and LC-MS/MS appears to enhance detection of phosphorylated threonine and tyrosine residues compared with methods that utilize peptide-level enrichment, suggesting that the two approaches are somewhat complementary in terms of phosphorylation site coverage. Comparing results for young seedlings with those obtained previously for mature Arabidopsis leaves identified five proteins that are differentially phosphorylated in these tissues, demonstrating the potential of this technique for investigating the dynamics of protein phosphorylation during plant development.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26158488      PMCID: PMC4497735          DOI: 10.1371/journal.pone.0130763

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Seedling establishment is a critical stage in plant development, involving the transition from heterotrophic to autotrophic growth.[1] In Arabidopsis, seed germination is driven largely by the metabolism of storage products other than lipids, whereas seedling establishment involves the mobilization of seed oil reserves.[2] Triacylglycerol (TAG) is the predominant source of carbon in the seeds of Arabidopsis and related species, including Brassica napus (canola),[3] and mobilization of TAG supplies the energy and molecular building blocks required for seedling establishment.[1,4] Utilization of TAG and other seed reserves is thought to be controlled and regulated by multiple pathways,[5] and although considerable progress has been made in understanding dormancy and seed germination [6-8] the cellular mechanisms involved in seedling establishment are less well understood. Following germination the glycerol released from TAG through lipase action is converted to glyceraldehyde-3-phosphate (G-3-P) and then by isomerization to dihydroxyacetone phosphate (DHAP), which can either undergo glycolysis to pyruvate or conversion to hexose via gluconeogenesis.[9] The free fatty acids are catabolized by ß-oxidation in the glyoxysome. A more complete understanding of how this metabolic program is regulated in Arabidopsis would increase our knowledge of post-embryonic development in plants and assist in the improvement of canola and other oilseed crops. One way to achieve this is to study protein phosphorylation during early stage of seedling establishment because reversible phosphorylation of proteins regulates a wide variety of cellular processes during plant growth and development.[10] However, the analysis of protein phosphorylation can be challenging due to the low relative abundance of phosphoproteins and the possibility of phosphorylation at multiple sites within a given protein.[11,12] Affinity enrichment of phosphorylated proteins and/or the component phosphopeptides obtained by proteolysis can significantly enhance the identification of such proteins and the mapping of phosphorylation sites. However, phosphoproteome analysis of certain plant tissues is complicated by the presence of D-ribulose bisphosphate carboxylase/oxygenase (Rubisco),[11,13] an abundant phosphoprotein that inhibits the detection and analysis of other, less abundant plant proteins. Rubisco depletion columns (e.g. Seppro IgY spin columns; GenWay Biotech, San Diego CA, USA) are commercially available and have been used successfully to deplete Rubisco in total protein extracts. [14,15] Advances in phosphopeptide enrichment strategies have also enabled large-scale phosphoproteomic studies in Arabidopsis, providing new insights regarding the potential involvement of protein phosphorylation in various stages of plant development.[11,16-23] Despite these efforts, our knowledge of protein phosphorylation events during the transition from heterotrophic to photoautotrophic growth in young seedlings remains incomplete. Much of the information currently available in protein phosphorylation databases has been generated using peptide-level enrichment strategies, and although affinity purification of intact phosphoproteins has been demonstrated [11,24] the use of protein-level enrichment for phosphoproteome analysis in plants remains largely unexplored. To complement and expand upon previous investigations involving phosphopeptide enrichment we carried out a survey of protein phosphorylation in post-embryonic Arabidopsis seedlings (hereafter referred to as young seedlings) using Rubisco depletion and enrichment of intact phosphoproteins by immobilized metal-ion affinity chromatography (IMAC) combined with two-dimensional gel electrophoresis (2-DE) and liquid chromatography-tandem mass spectrometry (LC-MS/MS). The results of this study were then compared with those obtained previously using Rubisco depletion and protein-level enrichment of phosphoproteins from Arabidopsis mature leaves [11] to evaluate this approach for monitoring the dynamics of protein phosphorylation during plant development.

Materials and Methods

Materials

Seppro Rubisco IgY Spin Columns (GenWay Biotech, San Diego, CA, USA) were obtained from Sigma-Aldrich (St. Louis, MO, USA; Product No. SEP070). Acrylamide, bisacrylamide solution, IPG dry strips (pH 3–10, NL, 17 cm), carrier ampholytes, Precision Plus Protein standards, TEMED, TBP, DTT, and IAA were purchased from Bio-Rad (Hercules, CA). Urea was from Merck KGaA (Darmstadt, Germany), Tris base from Roche Diagnostics (Indianapolis, IN), PPS silent surfactant from Protein Discovery Inc. (Knoxville, TN), and trypsin (sequencing grade) from Promega (Madison, WI, USA). PHOS-Select iron affinity gel beads and SigmaPrep spin columns were purchased from Sigma-Aldrich (St.Louis, MO, USA). All other chemicals were also from Sigma-Aldrich unless otherwise stated, and were of analytical research grade.

Plant growth and protein extraction

Arabidopsis thaliana (L) Heynh (Col-0) seeds were treated with 50% bleach in MilliQ water (v/v) containing 5.25% sodium hypochlorite for 2 min and then with 50% (v/v) ethanol for 2 min before washing 4 times with sterilized Milli-Q water and cultivating in Petri dishes containing 0.5x Murashige and Skoog [25] mineral salts with BactoAgar. Seeds were allowed to imbibe at 4°C for 4 days in the dark before transferring them to a growth chamber maintained at 22°C and a 16-h light/8-h dark cycle. Week-old whole seedlings (including roots) with 2 seed leaves were collected for protein extraction. One gram of seedlings was ground to a powder in liquid nitrogen with 0.5% (w/w) PVPP and homogenized in 2 ml of ice-cold extraction buffer (pH 7.4) containing 10 mM Tris-HCl, 150 mM NaCl, the serine protease inhibitor PMSF (1 mM, freshly prepared in DMSO) and a protease inhibitor cocktail developed for plant cell and tissue extracts (Sigma P-9596, 0.2% v/v), together with phosphatase inhibitors 20 mM sodium fluoride, 1 mM sodium molybdate, 1 mM sodium orthovanadate, and 1 mM sodium β-glycerophosphate. The slurry was stirred for 30 min on ice, filtered through two layers of cheese cloth and centrifuged at 10,000 × g for 15 min at 4°C. After discarding the pellet the amount of protein in the supernatant was determined using the Bradford assay (Bio-Rad) with BSA as the standard, and the final concentration of the sample adjusted to 1 mg/ml using the extraction buffer.

Rubisco depletion

Each protein sample was filtered through a 0.45 μm spin filter (Millipore) and 500 μl of the extract, containing about 500 μg of protein, was loaded onto a Seppro IgY column. Rubisco was removed according to the manufacturer’s instructions. The protein flow-through and the bound fraction were collected separately, and each precipitated with 5 volumes of ice-cold methanol and 100 mM ammonium acetate at -20°C overnight. After centrifugation at 10,000 × g for 20 min at 4°C, the resulting pellets were thoroughly washed twice with ice-cold 100% methanol and then with 80% ice-cold methanol. Each pellet was briefly dried using a SpeedVac, re-dissolved in the column incubation buffer (6 M urea, 0.25% CHAPS, 50 mM sodium acetate, pH 4.0) to approximately 1 mg/ml, and used for phosphoprotein enrichment by immobilized metal-ion affinity chromatography (IMAC).

Phosphoprotein enrichment

Enrichment of intact phosphoproteins from Rubisco-depleted samples was carried out as previously described.[11] Briefly, a 500 μl slurry of PHOS-Select iron affinity gel beads (Sigma) was washed 3 times with 0.1% TFA in 30% acetonitrile and equilibrated 3 times with 500 μl of incubation buffer (6 M urea, 0.25% CHAPS, 50 mM sodium acetate, pH 4.0) with centrifugation at 1,000 × g for 1 min between each step before being loaded onto the column. Two ml of Rubisco-depleted protein sample in incubation buffer were loaded onto each spin column (about 2 mg total protein per 500 μl of bead slurry) and incubated for 1 h at room temperature with gentle shaking. Phosphoproteins bound to the IMAC columns were eluted three times with 200 μl of elution buffer (6 M urea, 50 mM Tris-acetate pH 7.5, 0.1 M EDTA, 0.1 M EGTA, 0.25% CHAPS), each time incubating at room temperature for 10 min with gentle shaking, and then centrifuged at 1,000 × g for 1 min. The 3 eluates were pooled and precipitated with methanol as previously described before re-suspension in lysis buffer (7 M urea, 2 M thiourea, 2% CHAPS) to obtain a total protein concentration of 1 μg/μl prior to 2-D gel electrophoresis (2-DE).

Gel electrophoresis and in-gel digestion

One-dimensional gel electrophoresis was used to resolve proteins from the bound and flow-through fractions obtained during Rubisco depletion on the Seppro column. Ten μl of each fraction containing approximately 10 μg of protein was mixed with 10 μl of gel sample buffer (0.2 M Tris-HCl, pH 6.8, 2% SDS, 10% glycerol,0.02% bromophenol blue) and separated on a 1.0 mm, 12.5% Criterion Tris/HCl gel in a Criterion Cell (Bio-Rad) (13.3 cm × 8.7 cm) at a constant voltage of 150 V. The separated proteins were visualized using Bio-Safe Coomassie Blue stain (Bio-Rad). For phosphoproteome analysis, 200 μl (200 μg) of IMAC-enriched phosphoprotein sample was mixed with 200 μl of rehydration buffer (7 M urea, 2 M thiorea, 2% CHAPS, 10 mM DTT, 0.5% IPG buffer, pH 3–10), resolved by 2-DE and visualized by silver staining 10. Gel images were recorded using an ImageScanner (GE Healthcare) and Phoretix 2D software (v2004) was used to measure the total number of protein spots visualized in each 2-DE gel image. Proteins of interest were excised manually from each gel and digested with trypsin using a MassPREP protein digestion station, according to the protocol (digestion 5.0) recommended by the manufacturer (Micromass, Manchester, UK). Preparation of tryptic peptide samples for LC-MS/MS analysis was carried out as previously described.[11]

Mass spectrometry and protein identification

Six μl of each 2-D gel protein digest was analyzed using a nanoAQUITY UPLC system (Waters, Milford, MA, USA) interfaced to a quadrupole time-of-flight (Q-TOF) Ultima Global hybrid tandem mass spectrometer (Waters, Mississauga, ON, Canada). Separations were performed using a Waters BEH130 C18 nanoAQUITY UPLC analytical column (75 μm, 1.75 mm × 100 mm) at an initial flow rate of 400 nl/min. Mobile phase solvent A was 0.2% formic acid in water and solvent B was 0.2% formic acid in 100% acetonitrile. Separations were performed using the following 55-min solvent program: 99:1 (%A:%B) for 1 min, changing to 90:10 at 16 min, 55:45 at 45 min, and 20:80 at 46 min, at which point the flow rate was changed to 800 nl/min and the gradient held until 52 min before reverting to 99:1 at 53 min. A 5 min seal wash with 10% acetonitrile in water was carried out after the completion of each run. The Q-TOF MS was operated in the positive ion mode and TOF MS spectra were acquired over the m/z range 400–1900 at the rate of one scan/s. Of the multiply charged (2+, 3+, or 4+) peptide ion peaks rising over a threshold, the three most abundant were automatically selected for CID, and product-ion spectra were acquired over the m/z range 50–1900 in TOF MS/MS mode. The CID collision energy was selected automatically according to the m/z ratio and charge state of the precursor ion. A real-time exclusion window was used to prevent precursor ions with the same m/z from being selected for CID and TOF MS/MS within 2 min of their initial acquisition. Data were also acquired using pre-programmed exclusion lists for keratin and trypsin. Data were processed using MassLynx 4.1 (Waters, Milford, MA) and searched against NCBInr protein sequence database for Arabidopsis thaliana (thale cress) using an in-house Mascot server (Version 2.2, Matrix Sciences, UK) and the following parameters: carbamidomethylation of cysteine as the fixed modification; oxidation of methionine and phosphorylation of serine, threonine and tyrosine as variable modifications; mass tolerances of 0.2 Da for MS and 0.5 Da for MS/MS data; and one missed cleavage for tryptic peptides. Peptide MS/MS spectra used for protein identification had to be of sufficient quality, with a signal-to-noise ratio of 3 or greater for annotated fragment ions, including neutral loss peaks associated with de-phosphorylation during CID. Only peptides matched with significant ion scores (P <0.05) and low expectation values (e-value <0.01) were selected. For unambiguous identification, each peptide MS/MS spectrum had to contain at least three sequential y- or b-type ions. Protein identification was regarded as positive if the Mascot score exceeded the 95% confidence threshold, the matched protein contained at least four top-ranking unique peptides, and protein sequence coverage by the matching peptides was >15%. If the same set of peptides matched multiple members of a protein family, or a protein appeared under different names and accession numbers in the database, the entry with the highest score and/or most descriptive name was reported. When protein isoforms were observed, the data were inspected manually. If several isoforms shared the same set of identified peptides the protein with the most matching peptides was accepted as the correct result. The presence of protein isoforms was confirmed and reported based on the identification of at least two unique peptides. Since the error tolerance of the MS method used (200 mDa) was greater than the mass difference between phosphorylation and sulfation (9.5 mDa), a second error-tolerant search reporting masses to 0.1 mDa was performed to allow sulfation and phosphorylation to be distinguished. Raw MS/MS spectra matched to phosphorylated peptides in the Mascot search were manually inspected and validated using MassLynx 4.1. The spectra were processed to give singly charged, monoisotopic, centroided peaks and compared with the in silico fragmentation masses for the matched peptide to confirm neutral loss of phosphoric acid for serine and threonine phosphorylation, or the mass increment of 80 Da associated with phosphorylation of tyrosine.

Results

Phosphoproteome analysis of young seedlings

A schematic representation of our analytical approach is shown in Fig 1. The molecular weight distributions of proteins in the bound and flow-through samples following Rubisco depletion were investigated by 1-DE (Fig 2). Results show that the Seppro IgY Rubisco-depletion columns are efficient at removing Rubisco from the protein extracts of young seedlings. The Rubisco protein concentrated in the bound fraction is predominantly the small subunit (SSU), whereas both small and large subunits of Rubisco were evident in a previous study of mature Arabidopsis leaves.[11] That study also found that Rubisco depletion significantly increased the number of identified phosphoproteins, even without IMAC enrichment, and that only Rubisco and other relatively abundant phosphoproteins were recovered from non-depleted extracts using IMAC, whereas IMAC enrichment more than doubled the number of phosphoproteins identified in depleted extracts. It has recently been demonstrated that the Rubisco SSU up-regulates expression of the Rubisco large subunit (LSU) at the transcriptional level. This coordinated expression of subunits may explain the relatively small amount of Rubisco LSU observed during early growth in young seedlings.[26]
Fig 1

Plant phosphoproteome analysis using Rubisco depletion, IMAC enrichment of phosphoproteins, 2-DE and liquid chromatography-tandem mass spectrometry.

Fig 2

One-dimensional gel electrophoresis of the flow-through and bound protein fractions (10 μg) obtained following depletion of plant protein extracts using Seppro Rubisco IgY spin columns.

Molecular weight markers (M) are shown on the left.

One-dimensional gel electrophoresis of the flow-through and bound protein fractions (10 μg) obtained following depletion of plant protein extracts using Seppro Rubisco IgY spin columns.

Molecular weight markers (M) are shown on the left. IMAC-purified phosphoproteins from the Rubisco-depleted flow-through fraction were subsequently resolved by 2-DE (Fig 3). The reproducibility of both 1- and 2-DE experiments was confirmed by analyzing and comparing three biological replicates (not shown). An average of 175 protein spots were detected in replicate 2-DE gels following IMAC enrichment of Rubisco-depleted extracts. These were excised, trypsinized and analyzed by LC-MS/MS, which identified 156 of the spots based on our acceptance criteria for protein identification (see above). Of these, 105 spots (i.e. 60% of the 175 detected following IMAC) were found to contain a total of 82 different phosphoproteins based on the detection of 144 tryptic phosphopeptides, not counting methionine-oxidized and non-oxidized forms of the same peptide (Table 1, S3 Fig). The spot in which each phosphoprotein had been identified with highest confidence was subsequently labeled on a representative 2-DE gel image (Fig 3). Although significantly depleted in the flow-through fraction (Fig 2) Rubisco SSU was still detectable in 2-DE gels (Fig 3, spot 65).
Fig 3

Two-dimensional gel electrophoresis of Rubisco-depleted phosphoproteins enriched by immobilized metal-ion affinity chromatography using PHOS-Select iron affinity gel beads.

Phosphoproteins identified by liquid chromatography-tandem mass spectrometry are indicated using arrows and numbers (see Table 1).

Table 1

List of identified phosphoproteins.

Spot No.*Gene locusProtein nameMW/pIFunctional groupPhosphopeptide (a) psite
1gi|15233613O-acetylserine (thiol) lyase (OASA1)33.9/5.9Amino acid biosynthesisDVpTELIGNTPLVYLNNVAEGCVGRT10
2gi|18404496Catalytic co-enzyme binding protein (b) 35.8/8.4Amino acid biosynthesisALDLApSKPEGTGTPTK (c) S302
3gi|15218373Cystidine/deoxycystidylate deaminase (b,d) 20.4/5.6Amino acid biosynthesisYpTDPTAHAEVTAIRT75
4gi|15235213Caffeoyl-CoA 3–0 methyltransferase29.3/5.1Amino acid biosynthesisTSpSTNGEDQKQSQNLR pTSSTNGEDQKQSQNLRS13 T11
5gi|30691732Aminoacylase, putative48.0/5.9Amino acid biosynthesisTpSKPEIFPASTDART387
6gi|15224470Pyridoxin biosynthesis protein PDX1.1 (b) 33.1/5.8Amino acid biosynthesis pTKGEAGTGNVVEAVRT165
7gi|15233161Peroxiglycinamidine cycloligase41.6/5.3Amino acid biosynthesisGLAHIpTGGGFTDNIPRT296
8gi|42573371Carbonic Anhydrase 2 (CA2) (b,d) 28.7/5.4Cellular metabolismGNEpSYEDAIEALKK (e) KIpTAELQAASSSDSK (e,f) VCPpSHVLDFHPGDAFVVR VLAEpSESSAFEDQCGR (e,f) S5 T35 S98 S191
9gi|7769871NAD malate dehydrogenase, mitochondrial (b) 37.2/8.5Cellular metabolismKLFGVpTTLDVVR RTQDGGpTEVVEAK KPGMpTRDDLFNINAGIVK YCPHALINMIpSNPVNSTVPIAAEIFK LNPLVSSLSLpYDIANTPGVAADVGHINTR LNPLVSSLpSLYDIANTPGVAADVGHINTR NGVEEVLDLGPLpSDFEKEGLEALKPELKT175 T251 T114 S146 Y61 S59 S325
10gi|15219721Malate dehydrogenase cytoplasmic 1 (b,d) 35.9/6.1Cellular metabolismVQpTSSGEKPVR (e,f) T203
11gi|15226185Fructose bisphosphate aldolase (b) 42.5/8.2Cellular metabolismYpSAEGENEDAKKS372
12gi|30678347Carbonic anhydrase 1 chloroplast (CA1) (b) 29.8/5.5Cellular metabolismVCPpSHVLDFQPGDAFVVR VIpSELGDSAFEDQCGRS98 S189
13gi|15232468Malate dehydrogenase (NAD), mitochondrial36.0/8.3Cellular metabolismVVILGAAGGIGQPLpSLLMKS46
14gi|16398Nucleotide diphosphate kinase (b) 16.3/7.9Cellular metabolismNVIHGpSDSVESAR (e,f) IIGApTNPAASEPGTIR KIIGApTNPAASEPGTIR MEQpTFIM*IKPDGVQRS116 T90 T90 T4
15gi|15230595Phosphoglycerate kinase 1 (PGK1) (b) 50.1/5.9Cellular metabolismVLPGVIALDEAIPVpTVT480
16gi|152201673-Isopropyl malate dehydrogenase 2 cytoplasmic43.5/5.3Cellular metabolismANPLApTILSAAM*LLKT338
17gi|15231715Fructose bisphosphate aldolase (FBA), putative (b,d) 38.8/6.1Cellular metabolismVpSPEVIAEHTVR pTVPAAVPAIVFLSGGQSEEEATR (c,f) SpSDGKLFVDILKS239 T254 S84
IGENEPpSEHSIHENNAYGLAR LGDGAAEpSLHVK ANSEApTLGTYKGDAK GILAADESpTGTIGKR (c,f) LApSINVENVETNRR ALSDHHVLLEGTLLKPNM*VpTPGpSDSPKS155 S350 T333 T33 S42 T230, S233
18gi|15222848Glyceraldehyde-3-phosphate dehydrogenase C-2 (GAPC-2) (d) 37.0/6.7Cellular metabolism pTLLFGEKPVTVFGIR SDLDIVpSNASCTTNCLAPLAK (c,f) T70 S152
19gi|15229231Glyceraldehyde-3-phosphate dehydrogenase C subunit (GAPC), cytosolic (b,d) 37.0/6.6Cellular metabolism pTLLFGEKPVTVFGIRT70
20gi|15218869Isocitrate dehydrogenase (IDH) (b) 46.1/6.1Cellular metabolism pTIEAEAAHGTVTRT302
21gi|15238559Glutamine synthatase 2 (GS2), mitochondrial (b) 47.8/6.4Cellular metabolism pTIEKPVEDPSELPK (c) GGNNILVICDTWpTPAGEPIPpTNKT302 T154, T162
22gi|15222551Phosphoribulose kinase (PRK), chloroplastic44.7/5.7Cellular metabolismHADFPGpSNNGTGLFQTIVGLKS360
23gi| 4539316Fructose bisphosphate aldolase, putative43.1/6.8Cellular metabolismRLDSIGLENpTEANRT91
24gi|15226185Fructose bisphosphate aldolase42.5/8.2Cellular metabolismYpSAEGENEDAKKS372
25gi|15227752Peroxisomal malate dehydrogenase (PMDH1)37.8/8.1Cellular metabolismKLMGVpTMLDVVR KPGM*pTRDDLFNINAGIVRT138 T127
26gi|15228198PYK 10 binding protein 1(PBP1) (b,d) 32.1/5.5Cell defense pSPEEVTGEEHGK QpTSPPFGLEAGTVFELKEEGHKS196 T254
27gi|18405982Avirulense-responsive protein19.6/5.0Cell defenseLHACIpSPSENGLINGK TVEVVLpTDTSEKKS56 T97
28gi|15236568Major latex protein related/ MLP-related17.5/5.9Cell defenseEIDDEpTKTLTLR VpYDVVYQFIPK pSLVADMGNHVSKT79 Y98 S139
29gi|15223957Major latex protein related/MLP related18.0/6.4Cell defenseFVpTSLAADMDDHILKT138
30gi|15236566Major latex-related/ MLP-related17.6/5.9Cell defenseRNDDFPEPpSGYMKS131
31gi|1755154Germin-like protein22.0/6.8Cell defenseAAVpTPAFAPAYAGINGLGVSLART72
32gi|15228199Jacalin lectin family protein (b) 32.2/5.9Cell signaling pSPEEVTGEEHGK ASELLHQFGVVM*PLpTNS195 T299
33gi|15228216Jacalin lectin family protein32.0/5.1Cell signaling pTSPPYGLETQKK KVHVGQGQDGVpSSINVVYAKT104 S37
34gi|15226403Cupin family protein (b) 55.9/5.8Cell signalingNRPQFLVGpSNSLLR pTGPFEFVGFTTSAHK GpSGSSECEDSYNIYDKK (e) S456 T433 S320
35gi|18418598Cyclase family protein30.0/5.6Cell signalingAGLpYSVHCLPLRY249
36gi|15241018 A. thaliana Ferretin 1 (ATFER1), chloroplastic (b) 28.1/5.7Electron transportADLAIPIpTSHASLAR (c) T80
37gi|9843639Rieske FeS protein24.6/8.8Electron transportGPAPLpSLALAHADIDEAGKS196
GDPpTYLVVENDK FLCPCHGpSQYNAQGRT138 S180
38gi|15231176ATP synthase D chain, mitochondrial (ATPQ)19.6/5.1Membrane transportVpTPEYKPKT96
39gi|7525040ATP synthase CF1 beta subunit53.9/5.4Membrane transportIVGEEHYEpTAQQVKT387
40gi|15231008Translocase of outer mitochondrial membrane 4034.2/6.3Membrane transportGKIDpSNGVASALLEERS269
41gi|1143394V-type proton ATPase26.2/6.0Membrane transportIDYpSMQLNASR pSNDPHGLHCSGGVVLASRS71 S178
42gi|15236722ATP synthase family (b) 23.9/5.8Membrane transportALDpSQIAALSEDIVKK (e) S203
43gi|7708276ATP synthase beta subunit52.5/5.2Membrane transportINPpTpTpSGSGVMTLEKT5, T6, S7
44gi|15227104putative ATP synthase subunit27.6/6.3Membrane transportEKIpTLDPEDPAAVKT71
45gi|15236678Ascorbate peroxidase 4 (APX4), chloroplastic38.1/8.6Oxidative stressAENEGLpSDGLSLIEEVKKS155
46gi|15223576Dehydroascorbate reductase 1 (DHAR1)23.4/5.6Oxidative stress pTPAEFASVGSNIFGTFGTFLKT91
47gi|15224582Glutathione S-transferase 10 (ATGSTF10)24.2/5.5Oxidative stressVLpTIYAPLFASSKT4
48gi|15224581Glutathione S-transferase 9 (ATGSTF9) (b,d) 24.1/5.5Oxidative stressQPAYLALQPFGpTVPAVVDGDYK LAGVLDVpYEAHLSKT52 Y146
49gi|15218640Glutathione S-transferase 6 (ATGSTF6)23.5/5.8Oxidative stressVFGHPASTApTRT15
50gi|15226610ATPDIL2-1/MEE30/UNE5 (PDI) (b) 39.8/5.8Oxidative stressGpSDYASKETER ELVAApSEDEKK AGHDYDGGRDLDDFVpSFINEKS321 S280 S243
51gi|15231718Peroxiredoxin type 2, chloroplastic24.7/9.1Oxidative stress pTILFAVPGAFTPTCSQK VLNLEEGGAFpTNSSAEDMLK VLNLEEGGAFpTNSSAEDM*LKT108 T223 T223
52gi|30693971Universal stress protein family protein17.9/5.7Oxidative stressDLKLDpSIVMGSRS125
53gi|15232567 A. thaliana thioredoxin M-type 4 (ATHM4), chloroplastic21.3/9.6Oxidative stressINpTDESPNTANR DpSIIGAVPRETLEKT144 S173
54gi|6539610Thioredoxin M2, chloroplastic (a) 20.6/9.4Oxidative stressTTLpTSSLDKFLP LNpTDESPNTPGQYGVRT178 T138
55gi|31218252-Cys peroxiredoxin, chloroplast precursor29.0/7.7Oxidative stress pSGGLGDLNYPLISDVTKS161
56gi|15223049Ascorbate peroxidase 1 (APX1), cytosolic27.8/5.7Oxidative stressQM*GLpSDKDIVALSGAHTLGRS152
57gi|15219086Protein disulfide isomerase (PDI)-like protein (b,d) 55.8/4.8Oxidative stress pSADDASEVVSDKK (c) S149
58gi|20197312Glutathione S-transferase 6 (GST6) (b) 24.1/6.1Oxidative stressAIpTQYLAEEYSEKGEKT72
59gi|184151552-Cys peroxiredoxin, chloroplastic29.9/5.6Oxidative stress pSFGVLIPDQGIALR pSGGLGDLNYPLVSDITKS189 S168
60gi|15228407Superoxide dismutase 1 (MSD1), mitochondrial (b) 25.5/8.5Oxidative stressYApSEVYEKENNS223
61gi|7658343Peroxiredoxin IIF (b) 21.3/9.0Oxidative stressLAEGpTDITSAAPGVSLQK pSLGLDKDLSAALLGPRT35 S147
62gi|15228194Sedoheptulose-1,7-bisphosphatase, chloroplastic (b,d) 42.7/6.2Photosynthesis and respirationGFPGpTHEELLLDEGKT235
63gi|15229349Ribose 5-phosphate isomerase-related (b,d) 29.4/5.7Photosynthesis and respirationLLpSSGELYDIVGIPTSK pSLGIPLVGLDTHPR (e,g) LQDLFKEFGCEpSKS86 S108 S206
64gi|414550Cytosolic triosephosphate isomerase (TPI) (b) 27.4/5.2Photosynthesis and respirationAILNEpSSEFVGDKVAYALAQGLK VApSPAQAQEVHDELRKS106 S178
65gi|13926229Rubisco small chain 1A (RBCS1A) (b) 14.9/5.7Photosynthesis and respirationEHGNpTPGYYDGR (e) KFEpTLSYLPDLTDSELAK (f) LPLFGCpTDSAQVLKS58 T14 T78
66gi|16194Rubisco small subunit (RbcS)20.6/7.6Photosynthesis and respirationEHGNpTPGYYDGR KFEpTLSYLPDLSDVELAK (f) FEpTLSYLPDLSDVELAK LPLFGCpTDSAQVLKT113 T69 T69 T133
67gi|15223217Glycine cleavage system H protein, mitochondrial(b) 18.0/5.1Photosynthesis and respirationVKPpSSPAELEALMGPK VKPpSSPAELEALM*GPKS141 S141
68gi|84468442Putative Rubisco subunit binding protein (b) 48.6/4.8Photosynthesis and respirationHEAAGDGpTTTASILART8
69gi|13926291PS II oxygen-evolving complex 1 (PSBO1) (b) 35.3/5.6Photosynthesis and respirationQLDApSGKPDSFTGKS221
70gi|20260472Glyoxylate reductase (b) 36.7/8.5Photosynthesis and respiration pSKCDPLVGLGAK pSYGLSDEDFSAVIEALKS85 S320
71gi|18416540CIP amino terminal domain containing protein26.1/9.2Protein degradation pSMNEDVDLSFKKS224
72gi|18390982ATP dependent Clp protease proteolytic subunit (CLPP), chloroplastic36.4/8.6Protein degradation pSVAYNEHRPR VPSpSGLM*PASDVLIRS107 S241
73gi|1522499320S proteasome subunit PAA227.4/5.8Protein degradationLLDQpSSVSHLFPVTK ApTSAGMKEQEAVNFLEKS64 T166
74gi|2511588Multicatalytic endopeptidase complex (b) 27.2/5.6Protein degradationLLDQpSSVTHLFPITKS63
75gi|1521931720S Proteasome alpha subunit B, putative (b) 25.7/5.8Protein degradationKLPpSILVDEASVQK LpYKEPIPVTQLVR YpTEDMELDDAIHTAILTLK RYTEDM*ELDDAIHpTAILTLKS54 Y101 T179 T194
76gi|15242045Chaperonin 20 (CPN20), chloroplastic (b,d) 26.8/8.9Protein foldingYpTSIKPLGDR pYTSIKPLGDR YTpSIKPLGDR pTLGGILLPSTAQSKPQGGEVVAVGEGRT60 Y59 S61 T80
77gi|15226314Chaperonin HSP 60A (CPN60A) (b,d) 62.2/5.1Protein foldingIpTAIKDIIPILEKT272
78gi|16221Chaperonin HSP6061.6/5.7Protein foldingVpTKDGVTVAKT80
79gi|62321455Putative cruciferin 12S seed storage protein19.9/7.9Seed storageGLPLEVIpTNGYQISPEEAK VFDQEIpSSGQLLVVPQGFSVM*KT143 S89
80gi|16667812S storage protein50.9/6.8Seed storageVFDQEIpSSGQLLVVPQGFSVM*K GLPLEVIpTNGYQISPEEAKRS366 T420
pTNENAQVNTLAGRT395
81gi|9758672Unnamed protein product29.0/5.9UnclassifiedVPELVAKpTELENIAKT149
82gi|18391006Unknown protein (b,d) 20.0/5.4UnclassifiedEIpSMPNGLLPLKS33

Phosphorylated proteins, peptides and residues (phosphosites) identified by mass spectrometry in protein extracts from young Arabidopsis seedlings after rubisco depletion, IMAC enrichment and 2-DE separation of phosphoproteins. (a) pS, pT and pY = phosphorylated serine, threonine and tyrosine residues; M* = oxidized methionine. (b) Protein reported in the PhosPhAt 4.0 database. 30,31 (c) Peptide reported in the PhoPhAt 4.0 database with a different protein phosphorylation site. 30 31 (d) Protein previously reported in Arabidopsis seedlings. 23 (e) Peptide reported in the PhosPhAt 4.0 database with the same protein phosphorylation site. 30, 31 (f) Peptide previously reported in Arabidopsis seedlings with a different protein phosphorylation site. 23 (g) Peptide previously reported in Arabidopsis seedlings with the same protein phosphorylation site. 23

Two-dimensional gel electrophoresis of Rubisco-depleted phosphoproteins enriched by immobilized metal-ion affinity chromatography using PHOS-Select iron affinity gel beads.

Phosphoproteins identified by liquid chromatography-tandem mass spectrometry are indicated using arrows and numbers (see Table 1). Phosphorylated proteins, peptides and residues (phosphosites) identified by mass spectrometry in protein extracts from young Arabidopsis seedlings after rubisco depletion, IMAC enrichment and 2-DE separation of phosphoproteins. (a) pS, pT and pY = phosphorylated serine, threonine and tyrosine residues; M* = oxidized methionine. (b) Protein reported in the PhosPhAt 4.0 database. 30,31 (c) Peptide reported in the PhoPhAt 4.0 database with a different protein phosphorylation site. 30 31 (d) Protein previously reported in Arabidopsis seedlings. 23 (e) Peptide reported in the PhosPhAt 4.0 database with the same protein phosphorylation site. 30, 31 (f) Peptide previously reported in Arabidopsis seedlings with a different protein phosphorylation site. 23 (g) Peptide previously reported in Arabidopsis seedlings with the same protein phosphorylation site. 23 The 144 detected phosphopeptides contained a total of 144 unique sites of protein phosphorylation, of which 48% (69) were serine, 48% (69) were threonine, and 4% (6) were tyrosine residues (Table 2; Fig 4A). To assess any differences in phosphorylation occupancy among the S, T and Y residues, we compared our results with those from previous studies that utilized different enrichment methods and plant tissues (Table 2). The distribution observed in this study for Arabidopsis seedlings is similar to that obtained using Rubisco depletion and IMAC enrichment of intact phosphoproteins from mature Arabidopsis leaves,[11] which contained 52% phosphoserine (pS), 40% phosphothreonine (pT), and 8% phosphotyrosine (pY) residues (Table 2). However, these results differ from those obtained using IMAC to enrich phosphopeptides generated by trypsin digestion of plant phosphoproteins. For example, previous results reported 88% pS, 11% pT and 1% pY in 22-day-old Arabidopsis seedlings [22]; 85% pS, 13% pT and 2% pY in 9-day-old Arabidopsis seedlings [23]; 85% pS, 11% pT and 4% pY in cultured Arabidopsis cells [21]; 86% pS, 13% pT and 1% pY in Medicago truncatula roots [26]; and 81% pS, 17% pT, and 2% pY in dormant poplar (Populus simonii × P. nigra) buds [27] when IMAC enrichment was performed at the phosphopeptide level. Rao and Moller [28] reported the occurrence of 77% pS, 17.5% pT and 5.5% pY in eukaryotic phosphoproteins based on a combined Uniprot, Phospho.ELM and Phosida database analysis, which also differs from the present study. By way of comparison, the average pS:pT:pY ratio observed for cellular phosphoproteins in mammals is approximately 1800:200:1,[29] corresponding to 89.95% pS, 10.00% pT and 0.05% pY. These results suggest that peptide- and protein-level enrichment strategies complement each other to some extent and that the latter provides access to a greater proportion of phosphorylated threonine and tyrosine residues, at least in plant phosphoproteins.
Table 2

Distribution of phosphorylated residues identified in plant proteins using immobilized metal-ion affinity chromatography of phosphorylated proteins or peptides.

Plant speciesTissueIMAC target% pS% pT% pYReferences
Arabidopsis thaliana seedlingspProteins48484This study
Arabidopsis thaliana leavespProteins5240811
Arabidopsis thaliana cultured cellspPeptides8511421
Arabidopsis thaliana seedlingspPeptides8811122
Arabidopsis thaliana seedlingspPeptides8513223
Medicago truncatula rootspPeptides8613125
Poplar simonii × P. nigra dormant budspPeptides8117126
Fig 4

Distribution and functional classification of identified phosphoproteins.

(A) Numbers of phosphoproteins and phosphopeptides identified in post-embryonic Arabidopsis seedlings and mature Arabidopsis leaves,[11] and of the phosphosites identified in phosphopeptides common to both tissues. (B) Functional classification of the phosphoproteins identified in Arabidopsis young seedlings according to the KEGG Pathway database (http://www.genome.jp/kegg/pathway.html). Proteins involved in carbohydrate/energy metabolism, oxidative stress/redox regulation and photosynthesis/respiration account for over 50% of the identified phosphoproteome.

Distribution and functional classification of identified phosphoproteins.

(A) Numbers of phosphoproteins and phosphopeptides identified in post-embryonic Arabidopsis seedlings and mature Arabidopsis leaves,[11] and of the phosphosites identified in phosphopeptides common to both tissues. (B) Functional classification of the phosphoproteins identified in Arabidopsis young seedlings according to the KEGG Pathway database (http://www.genome.jp/kegg/pathway.html). Proteins involved in carbohydrate/energy metabolism, oxidative stress/redox regulation and photosynthesis/respiration account for over 50% of the identified phosphoproteome. These findings are of particular significance given the emerging importance of tyrosine phosphorylation in plant processes such as germination, growth, development, and abiotic stress responses.[30] In particular, our discovery of 6 new tyrosine phosphorylation sites (Table 1) in proteins involved in the mobilization of seed reserves (NAD+ MDH), cell defence (MLP), cellular signaling (cyclase family protein), oxidative stress response (GST9), protein degradation (20S proteasome alpha subunit B) and protein folding (chaperonin 20) represents a significant contribution to the list of potential substrates for known and predicted protein tyrosine kinases in plants.[30] It also helps to address the apparent discrepancy between the predicted frequency of pY residues in the Arabidopsis proteome [13] and that observed using peptide-level affinity enrichment strategies, during which the phosphorylated residues in each protein are distributed between tryptic peptides containing only one or two such residues, of which those carrying the more abundant pS modification are likely to predominate in terms of recovery and analysis. Of the 144 phosphopeptides and 144 phosphosites reported in the present study, only 10 peptides and 1 phosphorylation site matched those identified during a recent survey of the phosphoproteome in hydroponically-grown Arabidopsis seedlings, which utilized Ti4+-IMAC enrichment of tryptic phosphopeptides from whole protein digests (Table 1).[23] Of those 144 phosphopeptides, 10 phosphopeptides and 5 phosphorylation sites were found in both the P3DB (http://www.p3db.org/) and PhosPhAt 4.0 (http://phosphat.uni-hohenheim.de/) databases, with an additional 8 peptides and 3 phosphorylation sites found only in the PhosPhAt 4.0 database. Of the identified phosphoproteins previously reported in Arabidopsis thaliana seedlings [23] (Table 1) two are isoforms of the same protein, GAPC (spots 18 and 19). Detection of the novel phosphopeptide pTLLFGEKPVTVFGIR in both isoforms indicates that both are phosphorylated at T70. However, a second phosphopeptide SDLDIVpSNASCTTNCLAPLAK, which had previously been detected in Arabidopsis seedlings [23] (though with a different site of phosphorylation), was also identified in one of the isoforms (spot 18) indicating phosphorylation at S152 (Table 1). The concomitant reduction in pI relative to the other isoform (spot 19) is consistent with horizontal separation of these two proteins on the 2-DE gel (Fig 3), demonstrating the utility of our gel-based approach for resolving differentially phosphorylated forms of a given protein. Similarly, vertical separation of two Rubisco polypeptides (Fig 3, spots 65 and 66) reflects the difference in molecular weight between the matched proteins, each of which contained the same number (3) of identified phosphorylation sites (Table 1). IMAC purification, 2-DE separation, and digestion of intact phosphoproteins to produce a mixture of phosphorylated and non-phosphorylated peptides may have contributed to the relatively small number of multiply-phosphorylated peptides identified during this study, compared with studies in which only phosphorylated peptides were enriched and analyzed by mass spectrometry. [16-18,21,26,27,31] However, the average number of phosphopeptides identified per plant protein (1.8 in young Arabidopsis seedlings and 1.9 in mature leaves [11]) compares well with studies that utilize peptide-level enrichment [27]. Furthermore, the phosphoproteins we identified in young seedlings using protein-level enrichment include basic proteins (e.g. APX1, APX4, nucleotide diphosphate kinase) and proteins previously identified as plasma membrane proteins (e.g. CA2, PGK, DHAR1) in Arabidopsis seedlings,[12,16,32] suggesting minimal bias towards proteins of a particular polarity, pI or molecular weight.[11] By enabling protein identification using both phosphorylated and non-phosphorylated peptides our approach also provides high confidence in the identification of phosphoproteins and hence, their selection as candidates for further investigation of the role of protein phosphorylation during plant development (which lies beyond the scope of the present study).

Functional classification of phosphoproteins

The identified phosphoproteins were sorted into functional groups using the KEGG Pathway database (http://www.genome.jp/kegg/pathway.html). The two largest groups were those involved in carbohydrate/energy metabolism (22%) and oxidative stress/redox regulation (20%), which together with photosynthesis and respiration (11%) accounted for more than half of the identified phosphoproteins (Fig 4). Many of these, including glyceraldehyde-3-phosphate dehydrogenase (GAPC-2), triosephosphate isomerase (TPI), phosphoglycerate kinase (PGK1), fructose bisphosphate aldolase (FBA), and malate dehydrogenase (MDH), play important role in processes such as glycolysis, gluconeogenesis and the Calvin cycle during seed germination and the early stages of seedling establishment. Identification of phosphorylated 20S proteasome subunits, proteases, chaperonins, thioredoxins, glutathione transferases (GSTs), dehydroascorbate reductase (DHAR1) and manganese superoxide dismutase (MSD1) is also consistent with the role of proteolytic events in mobilizing TAG and other seed reserves. Comparison of our experimental results with the PhosPhAt 4.0 Arabidopsis thaliana phosphorylation site database [33,34], P3DB database [35] and with supplementary information from a recently published survey of the Aradopsis seedling phosphoproteome [23] showed that 43 of the 82 phosphoproteins identified in our study have not been reported before (Table 1), and that we were able to identify new phosphorylation sites in previously characterized phosphoproteins such as FBA, GAPC-2, TPI and PMDH1 (S1 Fig), GSTs (S2A Fig and S2B Fig), PRK and IDH. New and known phosphorylation sites were also identified in 12S seed storage proteins (Table 1), further demonstrating the utility of our approach for identifying novel sites of protein phosphorylation in plant tissues.

Discussion

Phosphorylation of enzymes involved in post-embryonic development

Many of the enzymes known to be important during the early stages of plant growth were found to be phosphorylated in Arabidopsis young seedlings. The glycolytic enzyme triosephosphate isomerase (TPI), for example, plays a central role in chloroplast development [36] and other biochemical pathways by equilibrating the cytosolic pool of DHAP and G-3-P. The latter is required for 1,5-bisphosphate production in the Calvin cycle, whereas DHAP suppresses the production of chlorophyll and 1,5-bisphosphate. Phosphorylation of human TPI has been shown to reduce its activity in converting G-3-P to DHAP, and although it has been suggested that TPI can be phosphorylated at S21 there is evidence that other sites may be subject to phosphorylation.[37] Our discovery of phosphorylated S106 and S178 residues in Arabidopsis TPI (Table 1, spot 64) provides new information with which to investigate the role of protein phosphorylation in controlling the activity of this enzyme and thus regulating chloroplast development in young seedlings. NAD+ MDH, a key enzyme in carbohydrate metabolism, is responsible for regenerating NAD+ and is involved in the mobilization of seed oil reserves [4] and the photosynthetic assimilation of carbon in developing leaves. [38] We identified several sites of phosphorylation in mitochondrial NAD+ MDH (Table 1, spot 9), as well as single site of phosphorylation in cytosolic MDH (Table 1, spot 10). We also observed phosphorylation of 3-isopropyl malate dehydrogenase (spot 16), which is primarily involved in leucine biosynthesis. [39] Carbonic anhydrase (CA), a major chloroplast protein, is involved in photosynthesis [40] and the mobilization of seed reserves during the early stages of post-embryonic growth. CA1 is also known to form part of a Rubisco-containing Calvin cycle enzyme complex.[40] Identification of phosphorylation sites in CA (spots 8 and 12), ribose-5-phosphate isomerase (spot 63), Rubisco SSU (spot 66) and PRK (spot 22) may help to elucidate the role of protein phosphorylation in controlling the assimilation and utilization of carbon reserves during the early stages of seedling establishment.[41] Other identified phosphoproteins include members of the jacalin-lectin (Fig 5A), cupin, and cyclase families (spots 32 to 34), all of which are involved in cell signaling. A cupin domain protein (AtPirin1) has also been found to interact with G protein α-subunit GPA1 in Arabidopsis to regulate seed germination and seedling development.[42]
Fig 5

Identification of phosphorylation sites using tandem mass spectrometry (MS/MS).

The MS/MS spectra correspond to phosphopeptides with the following mass-to-charge (m/z) ratios, as obtained by trypsin digestion of proteins selected from the 2-DE gel shown in Fig 3 (A) m/z 689.810, showing phosphorylation of jacalin-lectin family protein (spot 32) at S195; (B) m/z 1134.612, showing phosphorylation of germin-like protein (spot 31) at T72; (C) m/z 814.872, showing phosphorylation of the Rubisco small subunit (spot 66) at T133; (D) m/z 936.502, showing phosphorylation of ribose 5-phosphate isomerase-related protein (spot 63) at S86. Peaks corresponding to sequential loss of intact amino acid residues from the C or N terminus of the peptide are labeled as b- or y-type ions, respectively.

Identification of phosphorylation sites using tandem mass spectrometry (MS/MS).

The MS/MS spectra correspond to phosphopeptides with the following mass-to-charge (m/z) ratios, as obtained by trypsin digestion of proteins selected from the 2-DE gel shown in Fig 3 (A) m/z 689.810, showing phosphorylation of jacalin-lectin family protein (spot 32) at S195; (B) m/z 1134.612, showing phosphorylation of germin-like protein (spot 31) at T72; (C) m/z 814.872, showing phosphorylation of the Rubisco small subunit (spot 66) at T133; (D) m/z 936.502, showing phosphorylation of ribose 5-phosphate isomerase-related protein (spot 63) at S86. Peaks corresponding to sequential loss of intact amino acid residues from the C or N terminus of the peptide are labeled as b- or y-type ions, respectively. Phosphorylation of 20S proteasome subunit PtrPBA1, and increased expression of 20S proteasome α-subunit B and regulatory subunit RPN10, have been observed in poplar dormant terminal buds.[27] We observed phosphorylation of 20S proteasome α-subunit B (spot 75) at S54, T179, T194 and Y101 and of the 20S proteasome subunit PAA2 (spot 73) at S64 and T166 in Arabidopsis young seedlings (Table 1). ATP dependent Clp protease proteolytic subunit (CLPP) is a highly conserved, multimeric serine protease [43] that degrades large globular proteins in the presence of an AAA ATPase complex. [44] CLPP (spot 72) was found to be phosphorylated at S107 and S241, and a Clp amino terminal domain-containing protein (spot 71) at S224. Although there is growing evidence of crosstalk between redox signaling and hormonal response pathways during seed germination,[45] the molecular components involved in this process during post-embryonic development remain elusive. We identified phosphorylated forms of several proteins known to be key regulators of stress response, including APX1 (spot 56), APX4 (spot 45), GST6 (spot 58), ATGSTF9, -10 and -6 (spots 47 to 49 and S2A Fig and S2B Fig), dehydroascorbate reductase 1 (spot 46), thioredoxins M2 and M4 (spots 53 and 54), peroxiredoxins (spots 51 and 59), and manganese superoxide dismutase (spot 60). Phosphorylation of APX1, APX4, peroxiredoxin type-2, GST6, and MSD1 was also observed in mature leaves [11] but at sites other than those observed in young seedlings (Table 3). Thioredoxins and other H2O2-scavenging enzymes help to protect plants from damage caused by the production of reactive oxygen species (ROS) during seed germination and seedling development.[46] Germin-like protein, which generates H2O2 from the oxidative breakdown of oxalate,[47] was also found to be phosphorylated in our study (Fig 5B).
Table 3

Changes in protein phosphorylation between post-embryonic seedlings and mature leaves.

Phosphorylated proteins, peptides and residues (S = serine, T = threonine, Y = tyrosine) identified in post-embryonic seedlings and mature leaves of Arabidopsis thaliana. Common phosphopeptides with conserved phosphosites are highlighted in bold and common phosphopeptides with different phosphosites are highlighted in bold and italics.

Gene locusProtein nameYoung seedlingsMature leavesPhospho-site(s)
gi|18404496 Catalytic co-enzyme binding ALDLApSKPEGTGTPTK ALDLApSKPEGTGTPTK S302
gi|15218869 Isocitrate dehydrogenasepTIEAEAAHGTVTR-T302
-LVPGWpTKPICIGRT127
gi|15219721 Malate dehydrogenase (MDH)VQpTSSGEKPVR-T203
-LSpSALSAASSACDHIRS243
NVIIWGNHpSSSQYPDVNHAKS189
gi|15222848 G3P cytosolic-2 (GAPC-2) pTLLFGEKPVTVFGIR pTLLFGEKPVTVFGIR T70
-FGIVEGLMTpTVHSITATQKT181
SDLDIVpSNASCTTNCLAPLAK-S152
gi|15231715 Fructose bisphosphate aldolase pTVPAAVPAIVFLSGGQSEEEATR pTVPAAVPAIVFLSGGQSEEEATR T254
IGENEPpSEHSIHENNAYGLAR IGENEPpSEHSIHENNAYGLAR S155
LGDGAAEpSLHVK LGDGAAEpSLHVK S350
VpSPEVIAEHTVR VpSPEVIAEHTVR S239
ANSEApTLGTYKGDAK-T333
GILAADESpTGTIGKR-T33
LApSINVENVETNRR-S42
pSSDGKLFVDILK-S83
ALSDHHVLLEGTLLKPNM*VpTPGpSDSPK-T230,S233
gi|15227752 Malate dehydrogenase (PMDH1)KLMGVpTMLDVVRT138
KPGM*pTRDDLFNINAGIVR-T127
-AIVNIIpSNPVNSTVPIAAEVFKS159
gi|15229231 G3P cytosolic (GAPC) pTLLFGEKPVTVFGIR pTLLFGEKPVTVFGIR T70
-FGIVEGLMTpTVHSITATQKT181
gi|16398 Nucleotide diphosphate kinase NVIHGpSDSVESAR NVIHGpSDSVESAR S116
KIIGApTNPAASEPGTIR-T90
IIGApTNPAASEPGTIR-T90
gi|414550 Cytosolic triose phosphate isomerase AILNEpSSEFVGDKVAYALAQGLK AILNEpSSEFVGDK S106
VApSPAQAQEVHDELRK VApSPAQAQEVHDELRK S178
-VIACVGEpTLEERT131
gi|42573371 Carbonic Anhydrase 2 (CA2) KIpTAELQAASSSDSK IpTAELQAASSSDSK T35
VCPpSHVLDFHPGDAFVVR VCPpSHVLDFHPGDAFVVR S98
VLAEpSESSAFEDQCGR VLAESEpSSAFEDQCGR S191,S193
GNEpSYEDAIEALKK-S5
-EAVNVpSLANLLTYPFVRS211
-pYAGVGAAIEYAVLHLKY126
gi|7769871 NAD-malate dehydrogenase RTQDGGpTEVVEAK pTQDGGTEVVEAK T251, T246
KPGMpTRDDLFNINAGIVK-T114
LNPLVSSLpSLYDIANTPGVAADVGHINTR-S59
LNPLVSSLSLpYDIANTPGVAADVGHINTR-Y61
NGVEEVLDLGPLpSDFEKEGLEALKPELK-S325
KLFGVpTTLDVVR-T175
YCPHALINMIpSNPVNSTVPIAAEIFK-S146
gi|15228198 PYK 10 binding protein 1(PBP1) pSPEEVTGEEHGK pSPEEVTGEEHGK S196
QpTSPPFGLEAGTVFELKEEGHK QTpSPPFGLEAGTVFELK T254,S255
-GANLWDDGpSTHDAVTKS20
-TpSDVIGSDEGTHFTLQVKS102
-VpYVGQAQDGISAVKY178
gi|1755154 Germin-like proteinAAVpTPAFAPAYAGINGLGVSLAR-T72
-GDpSMVFPQGLLHFQLNSGKS140
gi|18405982 Avirulense-responsive proteinLHACIpSPSENGLINGK-S56
TVEVVLpTDTSEKK pTVEVVLTDTSEKK T97, T91
gi|9843639 Rieske FeS protein FLCPCHGpSQYNAQGR FLCPCHGpSQYNAQGR S180
GPAPLpSLALAHADIDEAGK GPAPLpSLALAHADIDEAGK S196
GDPpTYLVVENDK-T138
gi|1143394 V-type proton ATPase IDYpSMQLNASR IDYpSMQLNASR S71
pSNDPHGLHCSGGVVLASR pSNDPHGLHCSGGVVLASR S178
gi|7525040 ATP synthase CF1 beta subunit IVGEEHYEpTAQQVK IVGEEHYEpTAQQVK T387
-TNPpTTSNPEVSIRT3
-VGLpTALTMAEYFRT252
gi|15223049 L-ascorbate peroxidase (APX1)QM*GLpSDKDIVALSGAHTLGR-S152
-ELLpSGEKEGLLQLVSDKS196
gi|15226610 ATPDIL2-1/MEE30/UNE5AGHDYDGGRDLDDFVpSFINEKDLDDFVpSFINEKS243
ELVAApSEDEKK-S280
GpSDYASKETER-S321
gi|15228407 Mn-superoxide dismutase (MSD1)YApSEVYEKENN-S223
-GpSLGSAIDAHFGSLEGLVKS124
-HHQAYVTNpYNNALEQLDQAVNKY67
-LVVDpTTANQDPLVTKT171
gi|15231718 Peroxiredoxin type 2 pTILFAVPGAFTPTCSQK pTILFAVPGAFTPTCSSQK T108
VLNLEEGGAFpTNSSAEDMLK VLNLEEGGAFpTNSSAEDMLK T223
-LPDpSTLSYLDPSTGDVKS82
VLNLEEGGAFpTNSSAEDM*LK-T223
gi|15236678 Ascorbate peroxidase 4 (APX4)AENEGLpSDGLSLIEEVKK-S155
-GGPIpSYADIIQLAGQSAVKS178
gi|20197312 Glutathione S-transferase (GST6) AIpTQYLAEEYSEKGEK AIpTQYLAEEYSEK T72
-GMFGMpTTDPAAVQELEGKT129
-QEAHLALNPFGQIPALEDGDLpTLFESRT64
gi|13926229 Rubisco small chain 1A (RBCS1A) EHGNpTPGYYDGR EHGNpTPGYYDGR T58
KFEpTLSYLPDLTDSELAK KFEpTLSYLPDLSDVELAK T14
-FEpTLpSYLPDLSDVELAKT14
-KFEpTLpSYLPDLSDVELAKT14, S16
LPLFGCpTDSAQVLK LPLFGCpTDSAQVLK T78
gi|15229349 Ribose 5-phosphate isomerase LLpSGSELYDIVGIPTSK LLSpSGELYDIVGIPTSK S86, S87
pSLGIPLVGLDTHPR pSLGIPLVGLDTHPR S108
LQDLFKEFGCEpSK-S206
gi|15226314 Chaperonin 60 alpha (CPN60A)IpTAIKDIIPILEK-T272
-HGLLpSVTSGANPVSLKS150
gi|15242045 Chaperonin 20 (CPN20) YpTSIKPLGDR YpTSIKPLGDR T60
pYTSIKPLGDR-Y59
pTLGGILLPSTAQSKPQGGEVVAVGEGR-T80
gi|16221 Chaperonin HSP60VpTKDGVTVAK-T80
-GIpSMAVDAVVTNLKS151

Changes in protein phosphorylation between post-embryonic seedlings and mature leaves.

Phosphorylated proteins, peptides and residues (S = serine, T = threonine, Y = tyrosine) identified in post-embryonic seedlings and mature leaves of Arabidopsis thaliana. Common phosphopeptides with conserved phosphosites are highlighted in bold and common phosphopeptides with different phosphosites are highlighted in bold and italics. Heat shock proteins (HSPs) are involved in bud dormancy [48] and phosphorylation of HSPs and chaperonin has been reported in Arabidopsis [21,49] and poplar.[27] Our results confirm phosphorylation of these proteins in Arabidopsis seedlings and identify sites of phosphorylation in HSP60 (T80) and chaperonin 20 (adjacent residues Y59, T60 and S61) that, to the best of our knowledge, have not been reported before (S2C Fig and S2D Fig).

Comparing protein phosphorylation at different stages of development

In a previous study we used IMAC to recover and identify 132 phosphoproteins with 252 component phosphopeptides in mature Arabidopsis leaf extracts (Fig 4A), following polyethylene glycol (PEG) fractionation to deplete Rubisco.[11] Having now used IMAC to recover and identify intact phosphoproteins in Rubisco-depleted extracts from young seedlings we decided to compare the results of the two studies. Of the 82 phosphoproteins identified in post-embryonic seedlings 28 were also identified in mature leaves, with 26 component phosphopeptides showing the same sites of phosphorylation in both tissues (Fig 4A, Table 3). For example, phosphorylation of the Rubisco small chain 1A at T58, T14 and T78 was observed in both seedlings and leaves, confirming phosphorylation of the protein at those sites. However, some of the phosphopeptides spanning the same amino acid sequence in both tissues showed a difference in protein phosphorylation state between young seedlings and mature leaves. For example, the CA2 peptide VLAESESSAFEDQCGR was identified in both tissues but was phosphorylated at S191 in seedlings and at S193 in leaves. Tryptic peptides showing differential phosphorylation of four other proteins (NAD+ MDH, PBP1, avirulence responsive protein, and ribose 5-phosphate isomerase) were also observed (Table 3), suggesting that these proteins may play a significant role in Arabidopsis development. Comparing the phosphorylation status of 12S seed storage protein (cruciferin) in young seedlings (Table 1, spot 80) and dormant Arabidopsis seeds [50] shows that certain phosphorylation sites (T395 and T420) are common to both tissues, thereby validating the current method with reference to results obtained during a previous in-depth study of cruciferin phosphorylation. However, an apparent shift in phosphorylation site from S367 in dormant seeds to S366 in post-embryonic seedlings again demonstrates the ability to detect subtle changes in phosphorylation status that may have implications for seed storage protein mobilization and other processes during plant development,[50] although further investigations are required to confirm the significance of these findings.

Conclusions

Seedling establishment involves the efficient utilization of endogenous protein reserves and external resources, requiring that developmental and metabolic programs adapt to the prevailing environmental conditions.[51] Using a combination of Rubisco depletion and IMAC enrichment of intact phosphoproteins we identified and characterized the phosphorylated forms of 82 proteins expressed in Arabidopsis young seedlings. These included enzymes involved in chloroplast development, mobilization of TAG, and other processes known to be important during the early stages of plant development. Comparison of our results for young seedlings with those obtained previously for Arabidopsis seeds [50] and mature leaves [11] shows that some of these proteins undergo differential phosphorylation during plant growth, and that protein level enrichment appears to enhance detection of pT and pY residues. Our study complements previous investigations by identifying an additional 43 proteins and 136 residues that undergo phosphorylation in Arabidopsis young seedlings. By purifying and enriching phosphorylated proteins under non-denaturing conditions our approach also lends itself to the study of phosphorylation in endogenous protein complexes and during protein-protein interactions.

Identification of novel phosphorylation sites using tandem mass spectrometry (MS/MS).

The MS/MS spectra correspond to phosphopeptides with the following mass-to-charge (m/z) ratios, as obtained by trypsin digestion of proteins selected from the 2-DE gel shown in Fig 3. (A) m/z 701.377, showing phosphorylation of FBA (spot 17) at S84; (B) m/z 878.968, showing phosphorylation of GAPC-2 (spot 18) at T70; (C) m/z 619.981, showing phosphorylation of cytosolic TPI (spot 64) at S178; and (D) m/z 721.369, showing phosphorylation of PMDH1 (spot 25) at T138. (PDF) Click here for additional data file.

Identification of novel phosphorylation sites using tandem mass spectrometry.

The MS/MS spectra correspond to phosphopeptides with the following mass-to-charge (m/z) ratios, as obtained by trypsin digestion of proteins selected from the 2-DE gel shown in Fig 3. (A) m/z 745.425, showing phosphorylation of ATGST10 (spot 47) at T4; (B) m/z 612.269, showing phosphorylation of ATGST6 (spot 49) at T15; (C) m/z 549.250, showing phosphorylation of HSP60 (spot 78) at T80; and (D) m/z 615.308, showing phosphorylation of chaperonin 20 (spot 76) at S61. (PDF) Click here for additional data file.

Phosphopeptide MS/MS spectra and MASCOT search results for selected phosphoproteins.

(PDF) Click here for additional data file.
  50 in total

1.  Large-scale analysis of phosphorylation site occupancy in eukaryotic proteins.

Authors:  R Shyama Prasad Rao; Ian Max Møller
Journal:  Biochim Biophys Acta       Date:  2011-12-10

2.  Elevated CO2 induces physiological, biochemical and structural changes in leaves of Arabidopsis thaliana.

Authors:  Nianjun Teng; Jian Wang; Tong Chen; Xiaoqin Wu; Yuhua Wang; Jinxing Lin
Journal:  New Phytol       Date:  2006       Impact factor: 10.151

3.  Utilization of Glycerol in the Tissues of the Castor Bean Seedling.

Authors:  H Beevers
Journal:  Plant Physiol       Date:  1956-11       Impact factor: 8.340

4.  Arabidopsis seedling growth, storage lipid mobilization, and photosynthetic gene expression are regulated by carbon:nitrogen availability.

Authors:  Thomas Martin; Oliver Oswald; Ian A Graham
Journal:  Plant Physiol       Date:  2002-02       Impact factor: 8.340

5.  Large-scale comparative phosphoproteomics identifies conserved phosphorylation sites in plants.

Authors:  Hirofumi Nakagami; Naoyuki Sugiyama; Keiichi Mochida; Arsalan Daudi; Yuko Yoshida; Tetsuro Toyoda; Masaru Tomita; Yasushi Ishihama; Ken Shirasu
Journal:  Plant Physiol       Date:  2010-05-13       Impact factor: 8.340

Review 6.  New insights into the ATP-dependent Clp protease: Escherichia coli and beyond.

Authors:  J Porankiewicz; J Wang; A K Clarke
Journal:  Mol Microbiol       Date:  1999-05       Impact factor: 3.501

Review 7.  Seed storage oil mobilization.

Authors:  Ian A Graham
Journal:  Annu Rev Plant Biol       Date:  2008       Impact factor: 26.379

8.  Large-scale Arabidopsis phosphoproteome profiling reveals novel chloroplast kinase substrates and phosphorylation networks.

Authors:  Sonja Reiland; Gaëlle Messerli; Katja Baerenfaller; Bertran Gerrits; Anne Endler; Jonas Grossmann; Wilhelm Gruissem; Sacha Baginsky
Journal:  Plant Physiol       Date:  2009-04-17       Impact factor: 8.340

9.  THE 1-DEOXY-D-XYLULOSE-5-PHOSPHATE PATHWAY OF ISOPRENOID BIOSYNTHESIS IN PLANTS.

Authors:  Hartmut K. Lichtenthaler
Journal:  Annu Rev Plant Physiol Plant Mol Biol       Date:  1999-06

10.  Phosphoproteomics of the Arabidopsis plasma membrane and a new phosphorylation site database.

Authors:  Thomas S Nühse; Allan Stensballe; Ole N Jensen; Scott C Peck
Journal:  Plant Cell       Date:  2004-08-12       Impact factor: 11.277

View more
  5 in total

1.  Correction: Enrichment and Analysis of Intact Phosphoproteins in Arabidopsis Seedlings.

Authors: 
Journal:  PLoS One       Date:  2015-07-29       Impact factor: 3.240

Review 2.  Protein Phosphorylation and Redox Modification in Stomatal Guard Cells.

Authors:  Kelly M Balmant; Tong Zhang; Sixue Chen
Journal:  Front Physiol       Date:  2016-02-05       Impact factor: 4.566

Review 3.  Different Stationary Phase Selectivities and Morphologies for Intact Protein Separations.

Authors:  A Astefanei; I Dapic; M Camenzuli
Journal:  Chromatographia       Date:  2016-09-23       Impact factor: 2.044

4.  Circadian Profiling of the Arabidopsis Proteome Using 2D-DIGE.

Authors:  Mani K Choudhary; Yuko Nomura; Hua Shi; Hirofumi Nakagami; David E Somers
Journal:  Front Plant Sci       Date:  2016-07-12       Impact factor: 5.753

5.  Identification of Key Proteins and Networks Related to Grain Development in Wheat (Triticum aestivum L.) by Comparative Transcription and Proteomic Analysis of Allelic Variants in TaGW2-6A.

Authors:  Dengfeng Du; Xin Gao; Juan Geng; Qingyan Li; Liqun Li; Qian Lv; Xuejun Li
Journal:  Front Plant Sci       Date:  2016-06-28       Impact factor: 5.753

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.