Tiana C White1, Zhengping Yi, Brenda G Hogue. 1. The Biodesign Institute, Center for Infectious Diseases and Vaccinology, Arizona State University, Tempe, AZ 85287-5401, USA.
Abstract
The coronavirus nucleocapsid (N) is a multifunctional phosphoprotein that encapsidates the genomic RNA into a helical nucleocapsid within the mature virion. The protein also plays roles in viral RNA transcription and/or replication and possibly viral mRNA translation. Phosphorylation is one of the most common post-translation modifications that plays important regulatory roles in modulating protein functions. It has been speculated for sometime that phosphorylation could play an important role in regulation of coronavirus N protein functions. As a first step toward positioning to address this we have identified the amino acids that are phosphorylated on the mouse hepatitis coronavirus (MHV) A59 N protein. High performance liquid chromatography coupled with electrospray ionization tandem mass spectrometry (HPLC-ESI-MS/MS) was used to identify phosphorylated sites on the N protein from both infected cells and purified extracellular virions. A total of six phosphorylated sites (S162, S170, T177, S389, S424 and T428) were identified on the protein from infected cells. The same six sites were also phosphorylated on the extracellular mature virion N protein. This is the first identification of phosphorylated sites for a group II coronavirus N protein.
The coronavirus nucleocapsid (N) is a multifunctional phosphoprotein that encapsidates the genomic RNA into a helical nucleocapsid within the mature virion. The protein also plays roles in viral RNA transcription and/or replication and possibly viral mRNA translation. Phosphorylation is one of the most common post-translation modifications that plays important regulatory roles in modulating protein functions. It has been speculated for sometime that phosphorylation could play an important role in regulation of coronavirus N protein functions. As a first step toward positioning to address this we have identified the amino acids that are phosphorylated on the mousehepatitis coronavirus (MHV) A59 N protein. High performance liquid chromatography coupled with electrospray ionization tandem mass spectrometry (HPLC-ESI-MS/MS) was used to identify phosphorylated sites on the N protein from both infected cells and purified extracellular virions. A total of six phosphorylated sites (S162, S170, T177, S389, S424 and T428) were identified on the protein from infected cells. The same six sites were also phosphorylated on the extracellular mature virion N protein. This is the first identification of phosphorylated sites for a group II coronavirus N protein.
The Coronaviridae is a large family of medically important viruses that cause primarily respiratory and enteric infections in humans and a wide range of animals. The viruses are enveloped and contain a single-stranded positive-sense RNA genome that ranges in size from 27 to 31 kb. All of the viruses have at least four structural proteins. The spike (S), membrane (M) and envelope (E) proteins are anchored in the virion envelope. The nucleocapsid (N) protein encapsidates the viral genome as a helical nucleocapsid inside the virion (Davies et al., 1981, Macnaughton et al., 1978).The focus of this report is the multifunctional N protein and its phosphorylation. Through its interactions with the viral RNA, the M protein and itself, N plays important roles in virus assembly (Escors et al., 2001, Hurst et al., 2005, Narayanan et al., 2003, Narayanan and Makino, 2001, Verma et al., 2006). The protein is also involved in viral RNA transcription and/or replication (Baric et al., 1988, Chang and Brian, 1996, Compton et al., 1987, Denison et al., 1999, van der Meer et al., 1999). Recent studies with coronavirusinfectious clones provided direct evidence for a role of the N protein in replication and/or transcription (Almazan et al., 2004, Casais et al., 2001, Yount et al., 2000). The protein may also play a role in viral mRNA translation (Tahara et al., 1994). Additionally, MHV A59 and severe acute respiratory syndrome coronavirus (SARS-CoV) N proteins are type I interferon antagonists (Kopecky-Bromberg et al., 2007, Ye et al., 2007).All coronavirus N proteins are phosphorylated and highly basic with isoelectric points (pI) of 10.3 to 10.7 (Laude and Masters, 1995). The role of phosphorylation is not known and only very recently were phosphorylated sites identified for transmissible gastroenteritis virus (TGEV) and avian infectious bronchitis virus (IBV) N proteins, group I and III viruses, respectively (Calvo et al., 2005, Chen et al., 2005).In the study reported here we used high pressure liquid chromatography coupled with electrospray ionization tandem mass spectrometry (HPLC-ESI-MS/MS) to identify the phosphorylated sites on the MHVA59 N protein from both infected cells and purified extracellular virions. High accurate mass resolution was achieved by ion cyclotron resonance (ICR) and Fourier transformation. Use of the ICR cell provides the most accurate mass data available today. A total of six phosphorylated sites (S162, S170, T177, S389, S424, and T428) were identified on the N protein taken from both infected cells and extracellular virions. This is the first identification of phosphorylated sites for a group II coronavirus N protein.
Materials and methods
Isolation of N protein from intracellular and extracellular fractions
Mouse 17Cl1 cells were infected with 0.1 pfu/ml of MHV A59. Intracellular and extracellular fractions were harvested separately at 18 h p.i. Cell culture supernatant containing extracellular virus was clarified by centrifugation to remove cell debris. Infected cells were rinsed with phosphate buffered saline chilled to 4 °C and disrupted in ice cold lysis buffer (100 mM Tris–HCl [pH 7.5], 100 mM NaCl, 0.5% Triton X-100, 1 mM phenylmethylsulfonyl fluoride, 1:100 dilution of phosphatase inhibitor cocktails 1 and 2 (Sigma), 10 μg RNase A). Nuclei were pelleted and the cytoplasmic lysates were resolved immediately by SDS-PAGE or stored at −80 °C.Virions were precipitated from the clarified extracellular supernatant by addition while mixing of polyethylene glycol (PEG) 8000 to a final concentration of ∼12.5 mM, followed by slow addition of NaCl to a final concentration of 400 mM. The precipitation was continued for 3 h at 4 °C with continuous stirring. Precipitates were collected by centrifugation at 4 °C for 20 min at 10,000 ×
g. The precipitate was resuspended in TMEN (50 mM Tris–HCl, 50 mM maleic acid, 1 mM EDTA, 100 mM NaCl) buffer [pH 6.0] and clarified at 10,000 ×
g for 5 min. Clarified precipitated virus was purified in continuous 20–60% (w/w) sucrose gradients that were centrifuged at 90,000 ×
g for 4.5 h at 4 °C. Virions from pooled gradient fractions in the 1.16–1.20 g/cm3 density range were subsequently pelleted through a 30% sucrose cushion at 30,000 rpm in a Beckman SW 50.1 rotor. The purified virus was resuspended in TMEN buffer [pH 6.0] and analyzed immediately or stored at −80 °C. Virions were disrupted in ice cold lysis buffer as described above prior to resolution of proteins by SDS-PAGE.
Isolation and preparation of the N protein for mass spectrometry
Proteins from the intracellular and extracellular fractions were separated by SDS-PAGE in 8% gels and visualized by Coomassie staining. The N protein from both fractions was excised from the gel. Positions of the N protein were confirmed by Western blotting of proteins from parallel lanes. Gel pieces containing the N protein were destained twice with 300 μl of 50% acetonitrile (ACN) in 40 mM NH4HCO3 and dehydrated with 100% ACN for 15 min. ACN was removed by aspiration and gel pieces were dried in a vacuum centrifuge at 60 °C for 20 min. Dried gel pieces were rehydrated in 20 μl of 40 mM NH4HCO3 containing 250 ng trypsin (Sigma) at 4 °C for 15 min prior to the addition of 50 μl of 40 mM NH4HCO3 containing 10 fmol/μl angiotensin II. Trypsin digests were incubated overnight at 37 °C. Protease activity was terminated by addition of 10 μl of 5% formic acid (FA) and incubation at 37 °C for 30 min. Digests were clarified for 1 min in a microcentrifuge and supernatants were removed. The extraction procedure was repeated using 40 μl of 5% FA. The resulting peptide mixtures were purified by solid-phase extraction using C18 ZipTips (Millipore) after loading in 0.05% heptafluorobutyric acid: 5% FA (v/v) and elution with 50% ACN:1% FA (v/v). The samples were dried by vacuum centrifugation and dissolved in 4 μl of 0.1% FA:2%ACN (v/v).
Mass spectrometry
HPLC-ESI-MS/MS was performed on a Thermo Finnigan (San Jose, CA) LTQ-FTICR fitted with a PicoView™ nanospray source (New Objective, Woburn, MA). On-line HPLC was performed using a Michrom BioResources Paradigm MS4 micro two-dimensional HPLC (Alburn, CA) with a PicoFrit™ column (New Objective, Woburn, MA, 75 μm i.d., packed with ProteoPep™ II C18 material, 300 Å); mobile phase, linear gradient of 2 to 27% ACN in 0.1% FA in 45 min, a hold of 5 min at 27% ACN, followed by a step to 50% ACN, a hold of 5 min and then a step to 80%; flow rate, 250 nl/min.A “top-10” data-dependent MS/MS analysis was performed (acquisition of a full scan spectrum followed by collision-induced dissociation (CID) mass spectra of the 10 most abundant ions in the survey scan) to identify N protein peptides. The survey scan was acquired using the Fourier transform ion cyclotron resonance (FTICR) mass analyzer, which offers high mass accuracy and resolution. A list of potential phosphorylated peptides was generated based on detected serine/threonine-containing peptides from the N protein. For localization of phosphorylation sites a scan protocol of 1 survey scan (FTICR), followed by 7 targeted MS/MS scans (CID spectra of specified m/z values were acquired using the LTQ mass analyzer). All uninterpreted tandem MS data were searched using Mascot (Matrix Science, London, UK). Assignments of the phosphopeptides were confirmed by manual comparison of the tandem mass spectra with the predicted fragmentation generated in silico by the MS-Product component of ProteinProspector (http://prospector.ucsf.edu). In the case of phosphopeptides containing more than one serine or threonine, we localized phosphorylation to particular residues by assigning fragment ion masses from the mass spectra that were unique to the fragmentation of a peptide phosphorylated at said residue.
Results
MHV A59N protein contains many potential phosphorylation sites
The MHVA59 N protein consists of 454 amino acids. The protein contains 41 serine, 22 threonine and 11 tyrosine residues, approximately 9, 5 and 2.4%, respectively, of the total amino acids. Of these, 24 serines, 5 threonines and 1 tyrosine are predicted by NetPhos 2.0 (http://www.cbs.dtu.dk/services/NetPhos/) (Blom et al., 1999) to be potential phosphorylation sites. To determine which of the predicted sites are indeed phosphorylated, the N proteins from both intracellular lysates and purified extracellular virions from MHV A59 infectedmouse 17Cl1 cells were analyzed by HPLC-ESI/MS/MS.
Identification of phosphorylated sites on extracellular virion N protein by HPLC-ESI-MS/MS
To identify phosphorylated sites on the N protein in extracellular mature virions, virus was purified and proteins were separated by 1D SDS-PAGE and visualized by Coomassie staining. The N protein band was excised (Fig. 1
) and in-gel digested with trypsin. Peptide pools were extracted, desalted and analyzed by on-line HPLC-ESI-MS/MS.
Fig. 1
Representative SDS-PAGE used to isolate MHV N protein and peptide map coverage. (A) Proteins were visualized by Coomassie staining (left panel). Boxes indicate the areas of gel excision. The positions of N from purified virus (V) and MHV infected cells (I) are indicated to the right of both panels. Control uninfected cells (U) are also shown. Positions of molecular weight standards in kDa are shown on the left. Western blot analysis of parallel lanes was used to verify the identity of the N protein (right panel). A rabbit polyclonal N antibody was used to identify the N protein (Cologna et al., 2000). (B) Sequence coverage of peptides that identified phosphorylated resides is shown. Double lines indicate coverage for both intracellular and extracellular N protein fractions. Single lines identify additional coverage of the intracellular fraction. Dots mark predicted phosphorylation sites.
Representative SDS-PAGE used to isolate MHV N protein and peptide map coverage. (A) Proteins were visualized by Coomassie staining (left panel). Boxes indicate the areas of gel excision. The positions of N from purified virus (V) and MHVinfected cells (I) are indicated to the right of both panels. Control uninfected cells (U) are also shown. Positions of molecular weight standards in kDa are shown on the left. Western blot analysis of parallel lanes was used to verify the identity of the N protein (right panel). A rabbit polyclonal N antibody was used to identify the N protein (Cologna et al., 2000). (B) Sequence coverage of peptides that identified phosphorylated resides is shown. Double lines indicate coverage for both intracellular and extracellular N protein fractions. Single lines identify additional coverage of the intracellular fraction. Dots mark predicted phosphorylation sites.Mass spectra were acquired using a Thermo Finnigan LTQ-FTICR. FTICR mass analyzer has the ability to obtain high mass accuracy and resolution while the LTQ mass analyzer can perform tandem mass spectrometry (MS/MS and MS3) to obtain sequence information of peptides by collision-induced dissociation (CID) of residues. It is one of the most sensitive instruments available for high mass resolution and mass accuracy for identification of protein phosphorylation and complex proteomic studies. We obtained 54% sequence coverage (by MASCOT) of the virion N protein (Fig. 1B).The first phosphorylated site identified was serine 389 (pS389). The identification was made from analysis of MS/MS spectra of the peptide N382–394 (DGGADVVpSPKPQR) which exhibited the most intense peak at mass-to-charge ratio (m/z) of 654.6, corresponding to [M+2H-98]2+ (Fig. 2
). The mass is representative of a doubly charged ion with a decrease in mass of 98 atomic mass units (amu), which is equal to the loss of one phosphoric acid group (H3PO4) from the precursor ion m/z
= 703.32+. Since this peptide contained only one serine, MS3 analysis was not necessary. Other intense peaks were assigned as b- and y-series sequence ions which result from the usual fragmentation of peptides at their amide bonds (Fig. 2). These results confirmed preliminary results from earlier MALDI analysis in our lab which indicated that S389 was phosphorylated (White and Hogue, 2006).
Fig. 2
CID MS/MS spectrum of phosphopeptide D382–R394 from tryptic digest of MHV virion N protein. Peaks correspond to fragments of the peptide containing residues 382–394 as indicated by the sequence shown with the spectrum. The most intense peak (m/z = 654.6 [M+2H-98]2+) corresponds to a precursor ion (m/z = 703.32+) with a + 2 charge and minus H3PO4. The position of phosphoserine at position 389 (pS) is indicated in the peptide sequence. The b-amino and y-carboxy series ions are indicated above peptide fragment peaks. Symbol (*) indicates loss of H3PO4 (98 amu) from a fragment.
CID MS/MS spectrum of phosphopeptide D382–R394 from tryptic digest of MHV virion N protein. Peaks correspond to fragments of the peptide containing residues 382–394 as indicated by the sequence shown with the spectrum. The most intense peak (m/z = 654.6 [M+2H-98]2+) corresponds to a precursor ion (m/z = 703.32+) with a + 2 charge and minus H3PO4. The position of phosphoserine at position 389 (pS) is indicated in the peptide sequence. The b-amino and y-carboxy series ions are indicated above peptide fragment peaks. Symbol (*) indicates loss of H3PO4 (98 amu) from a fragment.Two other phosphorylated amino acids, S424 and T428, were also identified in the carboxy end of the protein. Phosphorylated S424 was detected in phosphopeptide, N422–432 (NVpSRELTPEDR). Precursor ion m/z
= 698.3 was subjected to both MS/MS and MS3 fragmentation. The MS/MS spectrum exhibited the most intense peak at m/z 649.6, corresponding to [M+2H-98]2+ (Fig. 3A). Ions y8
2+ (m/z
= 508.4) and b6 (m/z
= 779.3) are unique ions which localized phosphorylation to serine 424. Observation of y7 (m/z
= 859.5) in the MS3 spectrum was also consistent with phosphorylation at S424 (Fig. 3B).
Fig. 3
CID MS/MS and MS3 spectra of phosphopeptide N422–R432 MHV virion N protein. Retention times (rt) for averaged spectra are indicated in each panel. (A) Precursor ion m/z = 698.3 was subjected to MS/MS. Peak m/z = 649.6 [M+2H-98]2+ corresponds to the precursor minus H3PO4 and plus a + 2 charge. Unique ions (arrows) m/z = 508.4 (y82+) and 779.3 (b6) localized phosphorylation at S424. (B) MS3 spectrum of the same peptide eluted at 28.70–28.93 min. Unique ion 859.5 y7 confirmed phosphorylation at S424. (C) MS3 spectrum of precursor ion m/z = 698.32+ eluted at 25.49–25.64 min. A unique ion 699.3 b6 confirmed phosphorylation at T428. Symbol (§) indicates loss of a H2O molecule (18 amu). Symbol (*) indicates loss of H3PO4.
CID MS/MS and MS3 spectra of phosphopeptide N422–R432 MHV virion N protein. Retention times (rt) for averaged spectra are indicated in each panel. (A) Precursor ion m/z = 698.3 was subjected to MS/MS. Peak m/z = 649.6 [M+2H-98]2+ corresponds to the precursor minus H3PO4 and plus a + 2 charge. Unique ions (arrows) m/z = 508.4 (y82+) and 779.3 (b6) localized phosphorylation at S424. (B) MS3 spectrum of the same peptide eluted at 28.70–28.93 min. Unique ion 859.5 y7 confirmed phosphorylation at S424. (C) MS3 spectrum of precursor ion m/z = 698.32+ eluted at 25.49–25.64 min. A unique ion 699.3 b6 confirmed phosphorylation at T428. Symbol (§) indicates loss of a H2O molecule (18 amu). Symbol (*) indicates loss of H3PO4.Two HPLC-MS/MS peaks corresponding to phosphopeptide N422–432 were noted. Spectra generated from peptides that contained phosphorylated S424 had retention times between 28.67 and 28.93 min (Fig. 3B), whereas the retention time for the peptides in the second peak was 25.49–25.64 min, roughly 3 min prior to the phosphorylated S424-containing peptides. Spectra of the latter identified T428 as phosphorylated. The fragment ion b6 (m/z
= 699.3) in the MS3 spectra was the unique identifier for phosphorylation at T428 (Fig. 3C).Additional sites in the amino terminal half of the virion N protein were also identified. The precursor ion for peptide N162–178 (pSDIVERDPSSHEAIPTR) was m/z
= 995.02+. A peak resulting from the neutral loss of H3PO4 was observed as the most intense peak in the MS/MS spectra at m/z
= 946.22+ (Fig. 4A). Observation of the appropriate y10 ion at m/z
= 1094.5 in the CID spectrum indicated that S162 was phosphorylated. Further analysis showed that T177 in the 162–178 peptide was also phosphorylated. Peptide N168–178 (DPpSSHEAIPTR) resulted from complete digestion of peptide 162–178 at arginine 167. The smaller peptide was fragmented and subjected to MS3 analysis which allowed us to identify S170 as also phosphorylated (Fig. 4B).
Fig. 4
CID spectra of phosphopeptides in the amino half of virion N protein. (A) MS3 spectrum of phosphopeptide S162–R178 resulting from a missed cleavage at R167 is shown. Unique ion 1094.5 y10 indicates phosphorylation at S162. Three unique ions 815.4 b7, 1423.7 b13, and 1536.7 b14 are all indicative of phosphorylation at T177. (B) MS3 spectrum of the phosphopeptide consisting of residues 168–178 is shown with its signature ions indicated about the peaks. Unique ions (arrows) 455.9 y82+ and 910.4 y8 indicate phosphorylation at S170.
CID spectra of phosphopeptides in the amino half of virion N protein. (A) MS3 spectrum of phosphopeptide S162–R178 resulting from a missed cleavage at R167 is shown. Unique ion 1094.5 y10 indicates phosphorylation at S162. Three unique ions 815.4 b7, 1423.7 b13, and 1536.7 b14 are all indicative of phosphorylation at T177. (B) MS3 spectrum of the phosphopeptide consisting of residues 168–178 is shown with its signature ions indicated about the peaks. Unique ions (arrows) 455.9 y82+ and 910.4 y8 indicate phosphorylation at S170.
Identification of N protein phosphorylation sites on MHV N protein from infectect cells by HPLC-ESI-MS/MS
To identify residues that are phosphorylated on the intracellular pool of the N protein from infected cells, cytoplasmic lysates were resolved by SDS-PAGE. A sample of purified virions was run on the gel and used as a guide to excise the N protein band from the whole cell lysates (Fig. 2). Mass spectra were again acquired using HPLC-ESI-MS/MS. We obtained 69% sequence coverage (by MASCOT) of the intracellular N protein (Fig. 1B).In the intracellular N protein phosphopeptide N162–178 was also detected. S162 and T177 were both phosphorylated. These phosphopeptides apparently coeluted and were subsequently fragmented together. Fig. 5A is an example a MS3 spectrum for N162–178. Detection of ions 797.5 y14
2+ and 1094.5 y10 is characteristic of phosphorylation at S162, while b12 and b14 ions (m/z 1352.6 and 1536.7, respectively) are consistent with phosphorylation at residue T177. Phosphopeptide N162–178 contained one site that was not cleaved by trypsin, but the fully cleaved phosphopeptide, N168–178 (DPpSSHEAIPTR), was also detected, which allowed us to localize phosphorylation at S170 as well (Fig. 5B and C).
Fig. 5
CID fragmentation spectra of N phosphopeptides from the intracellular fraction of infected cells. (A) MS3 spectrum of peptide S162–R178. Unique ions (arrows) 797.5 y142+, and 1094.5 y10, as well as 1352.6 b12, and 1536.7 b14 localized phosphorylation to S162 and T177, respectively. Other ions resulting from fragmentation are indicated. (s) and (t) indicates ions derived from either S162 and T177 phosphopeptides, respectively. (B) MS/MS spectrum of peptide N168–178. Neutral loss of H3PO4 from the precursor ion mass of 645.28 corresponds to the most intense peak at m/z = 596.5. Identification of unique ion (arrows) 455.9 y8 + 2 localized phosphorylation to S170. (C) MS3 spectrum of phosphopeptide 168–178. Identification of additional unique ions 264.1 b3-§ and 910.6 y8 confirmed the localization of phosphorylation at S170. Symbol (§) indicates masses corresponding to a loss of a H2O molecule. Symbol (*) indicates masses corresponding to a loss of H3PO4.
CID fragmentation spectra of N phosphopeptides from the intracellular fraction of infected cells. (A) MS3 spectrum of peptide S162–R178. Unique ions (arrows) 797.5 y142+, and 1094.5 y10, as well as 1352.6 b12, and 1536.7 b14 localized phosphorylation to S162 and T177, respectively. Other ions resulting from fragmentation are indicated. (s) and (t) indicates ions derived from either S162 and T177 phosphopeptides, respectively. (B) MS/MS spectrum of peptide N168–178. Neutral loss of H3PO4 from the precursor ion mass of 645.28 corresponds to the most intense peak at m/z = 596.5. Identification of unique ion (arrows) 455.9 y8 + 2 localized phosphorylation to S170. (C) MS3 spectrum of phosphopeptide 168–178. Identification of additional unique ions 264.1 b3-§ and 910.6 y8 confirmed the localization of phosphorylation at S170. Symbol (§) indicates masses corresponding to a loss of a H2O molecule. Symbol (*) indicates masses corresponding to a loss of H3PO4.Further analysis of peptides identified phosphorylation at S389, S424, and T428, as was also determined for the extracellular virion protein (Fig. 6
). Identification of S389 was straight forward due to S389 being the only possible target for phosphorylation in peptide N382–394 (Fig. 6A). Phosphopeptides containing phosphorylated S424 and T428 coeluted in the intracellular samples, in contrast to the previous separation of these peptides for the extracellular samples. However, MS3 fragmentation of the phosphopeptide indicated phosphorylation at both S424 and T428 (Fig. 6B and C).
Fig. 6
CID induced MS/MS mass spectrum of peptides containing residues 382–394 and 422–432 from intracellular N. (A) MS/MS spectrum of residues 382–394. Loss of H3PO4 from the precursor ion (m/z = 703.32+) correlates with the most intense peak in the spectrum, m/z = 654.6 [M+2H-98]2+. S389 is the only possible phosphorylation site in the peptide. (B) MS/MS spectrum of the peptide containing residues 422–432. Loss of H3PO4 from precursor (m/z = 698.32+) correlates with the most intense peak in the spectrum, m/z = 649.6 [M+2H-98]2+. Unique ions 508.5 y8 + 2, 730.4 y6, and 779.3 b6 identify S424 as phosphorylated. (C) MS3 fragmentation of the same peptide containing residues 422–432 allowed detection of unique ion 699.4 b6, indicative of phosphorylation at T428. Symbol (§) indicates loss of a H2O molecule. Symbol (*) indicates loss of H3PO4.
CID induced MS/MS mass spectrum of peptides containing residues 382–394 and 422–432 from intracellular N. (A) MS/MS spectrum of residues 382–394. Loss of H3PO4 from the precursor ion (m/z = 703.32+) correlates with the most intense peak in the spectrum, m/z = 654.6 [M+2H-98]2+. S389 is the only possible phosphorylation site in the peptide. (B) MS/MS spectrum of the peptide containing residues 422–432. Loss of H3PO4 from precursor (m/z = 698.32+) correlates with the most intense peak in the spectrum, m/z = 649.6 [M+2H-98]2+. Unique ions 508.5 y8 + 2, 730.4 y6, and 779.3 b6 identify S424 as phosphorylated. (C) MS3 fragmentation of the same peptide containing residues 422–432 allowed detection of unique ion 699.4 b6, indicative of phosphorylation at T428. Symbol (§) indicates loss of a H2O molecule. Symbol (*) indicates loss of H3PO4.
Discussion
We used HPLC-ESI/MS/MS to identify the phosphorylated sites on the N protein of MHV A59. This is the first identification of amino acids that are phosphorylated on the N protein for a group II coronavirus. Phosphorylated sites were identified on the protein from infected cells and mature extracellular virions. Six residues, S162, S170, T177, S389, S424 and T428, were found to be phosphorylated on both intracellular and extracellular virion N proteins (Fig. 7
). Potential cellular kinases that may phosphorylate the identified sites were predicted using NetPhosK 1.0 (http://www.cbs.dtu.dk/services/NetPhosK/) which produces neural network predictions of kinases specific for eukaryotic phosphorylation sites (Fig. 7) (Blom et al., 2004). It remains to be determined which are responsible for phosphorylations of the N protein, but multiple potential kinases are predicted for all but one of the identified sites.
Fig. 7
Schematic illustration of MHV N three-domain model separated by the A and B spacer domains (Parker and Masters, 1990). The relative positions of the phosphorylated sites identified on intracellular and extracellular virion N are shown. The positions of the RNA binding domain (Nelson et al., 2000) and putative dimerization domain (Yu et al., 2006) are noted. Eukaryotic kinases that may phosphorylate the identified sites were predicted by NetPhosK 1.0 (http://www.cbs.dtu.dk/services/NetPhosK/) are shown in the table below the schematic.
Schematic illustration of MHV N three-domain model separated by the A and B spacer domains (Parker and Masters, 1990). The relative positions of the phosphorylated sites identified on intracellular and extracellular virion N are shown. The positions of the RNA binding domain (Nelson et al., 2000) and putative dimerization domain (Yu et al., 2006) are noted. Eukaryotic kinases that may phosphorylate the identified sites were predicted by NetPhosK 1.0 (http://www.cbs.dtu.dk/services/NetPhosK/) are shown in the table below the schematic.Our results demonstrate that both serine and threonine residues are modified by phosphorylation, in contrast to earlier studies which suggested that MHV A59, as well as closely related MHV JHM, N proteins are phosphorylated exclusively on serine residues (Siddell et al., 1981, Stohlman and Lai, 1979). The earlier conclusions were made based on phosphoamino acid analysis of [32P]-labeled N peptides. Identification of the phosphorylated threonine residues in our study may reflect differences in the experimental approaches used to isolate and/or analyze the protein. Interestingly, based on comparative analysis of earlier HPLC of tryptic peptides derived from the N proteins of MHV A59 and a plaque variant of the JHM virus, it was suggested that serine at positions 161 and 162, respectively, were phosphorylated (Wilbur et al., 1986). Thus, S161 identified in the earlier study likely corresponds to S162 identified in our study.Alignment of the amino acid sequences for several members of the group II coronaviruses shows that, with the exception of S162, the sites are conserved in the bovine coronavirus (BCoV) and human coronavirus OC43 (HCoV OC43) N proteins (Fig. 8
). While our work was in progress the phosphorylated sites on the N protein were identified for two other coronaviruses. Phosphoserines at positions 9, 156, 254 and 256 were identified for the group I TGEV protein from virus infected cells, whereas only sites S156 and S256 were identified on the N protein present in purified virions (Fig. 8) (Calvo et al., 2005). The sites that are phosphorylated on the group III IBV N protein expressed with baculovirus in insect cells were found to be identical to the ones on the N protein expressed alone in Vero cells (Chen et al., 2005). Data from the study suggests that S190, S192, T378 and S379 are phosphorylated on intracellular IBV N (Fig. 8). Comparison of the sites for the three viruses illustrates that the location is not conserved, but phosphorylated sites are near either the amino or carboxy side of the serine/arginine (SR) rich domain that is conserved in all coronavirus N proteins. The SR rich domain is a distinguishing feature of the previously identified MHV RNA binding domain (Fig. 7, Fig. 8).
Fig. 8
Alignment of representative coronavirus N protein amino acid sequences. Positions of the phosphorylated sites on MHV N, conserved resides in the group II viruses BCoV, HCoV OC43 and SARS CoV, and recently identified sites on IBV (Chen et al., 2005) and TGEV (Calvo et al., 2005) N proteins are highlighted. The conserved S/R rich domains are boxed in each sequence. Sequences for MHV A59 (P03416), BCoV (P10527), HCov OC43 (Q696P4), SARS CoV (P59595), TGEV (P04134), HCoV 229E (P15130) and IBV CoV (P69596) were obtained from Swiss-Prot. The alignment was generated using ClustalW (Thompson et al., 1994).
Alignment of representative coronavirus N protein amino acid sequences. Positions of the phosphorylated sites on MHV N, conserved resides in the group II viruses BCoV, HCoV OC43 and SARS CoV, and recently identified sites on IBV (Chen et al., 2005) and TGEV (Calvo et al., 2005) N proteins are highlighted. The conserved S/R rich domains are boxed in each sequence. Sequences for MHV A59 (P03416), BCoV (P10527), HCov OC43 (Q696P4), SARS CoV (P59595), TGEV (P04134), HCoV 229E (P15130) and IBVCoV (P69596) were obtained from Swiss-Prot. The alignment was generated using ClustalW (Thompson et al., 1994).Recently structural information became available for IBV and SARS-CoV N proteins. The three-dimensional structures are based on NMR analysis of amino acids 45–181 (Huang et al., 2004) and X-ray crystal structures of the amino-terminal residues of IBV 19/29-160/162 (Fan et al., 2005, Jayaram et al., 2006) and SARS-CoV 47-175 (Saikatendu et al., 2007). Crystal structures have also been determined for carboxy-terminal residues 219–349 of IBV N (Jayaram et al., 2006) and residues 270–370 of SARS-CoV N (Yu et al., 2006). Based solely on amino acid alignments, this corresponds to the region encompassing roughly R45-N195 in the MHVA59 N protein (Fig. 8). No structural information is yet available for MHV N, but based on the sequence alignments we can speculate about the potential positions of the phosphorylated sites on MHV N. The sites in the amino terminal region (S162, S170, T177) potentially fall within a large loop between β-strands (β6 and β7 of SARS amino-terminus) (Saikatendu et al., 2007). These sites are adjacent to the previously identified MHV RNA binding domain that extends from T177 to P231 (Fig. 7, Fig. 8) (Nelson et al., 2000). Localization of the phosphorylation sites near this domain could influence presentation of the RNA binding domain and in turn interaction with the RNA. Clearly, much remains to be understood about, not only the structure of the MHV N protein, but also its functional domains compared with other coronavirus nucleocapsids. RNA binding domains for IBV and SARS-CoV are located in the amino-terminal domain preceding the serine-rich domain (Fan et al., 2005, Huang et al., 2004, Saikatendu et al., 2007). Mutagenesis of a positively charged β-hairpin within the amino-terminus of IBV N identified residues involved in RNA-binding in vitro, and when introduced into an infectious clone virus replication was attenuated (Tan et al., 2006). Kinetic analysis of the amino-terminus of IBV N showed that the domain binds viral RNA, but the full length protein exhibits stronger binding, suggesting that interactions with other regions of the protein are also important (Spencer and Hiscox, 2006). The conserved SR domain has been proposed to link the amino-terminal RNA binding and carboxy-terminal multimerization domains of IBV and SAR-CoV (Fan et al., 2005, Saikatendu et al., 2007). All of the identified phosphorylation sites on the IBV N protein are located on the carboxy side of its SR domain. The presence of phosphorylated residues in close proximity to the MHV SR domain may uniquely influence how the surrounding domains interact with viral RNA. Modeling of other coronavirus N proteins based on the crystal forms of SARS-CoV and IBV amino-terminal domains indicate that the proteins are similar in the overall organization of β-strands, but since the proteins differ in their surface charge distribution patterns, it was speculated that the residues involved in RNA binding and how they interact with the RNA is different (Saikatendu et al., 2007). In the case of MHV N, the presence of phosphorylated residues between the amino-end and the SR domain may contribute to this difference.Only phosphorylated S389 is included in what can be speculated to mirror on the IBV carboxy end structure. Interestingly, this would place the S389 site only a few residues beyond the carboxy terminal α5 helix in the IBV structure (Jayaram et al., 2006). Phosphorylation at this site, as well as the downstream S424 and T428 sites, could be important for multimerization since they fall just beyond the region of MHV N that has been predicted to correspond to the carboxy terminal dimerization domain on the SARS-CoV N structure (Fig. 7) (Yu et al., 2006).Recent data suggest that IBV N protein binding to viral RNA is influenced by phosphorylation since nonphosphorylated N protein bound both viral and nonviral RNAs, whereas the phosphorylated N exhibited a higher binding affinity for viral RNAs (Chen et al., 2005). The results are consistent with earlier suggestions for a potential role of N phosphorylation (Laude and Masters, 1995, Nelson et al., 2000). Affinity for viral RNA could play a role in assembly or uncoating. Dephosphorylation of MHV N protein has been suggested to facilitate infection (Mohandas and Dales, 1991). Virion associated kinase activity has been reported for MHV JHM, but whether it plays any role during infection is not known (Siddell et al., 1981). Phosphorylation of N could be important in whatever role the protein plays in viral RNA transcription and/or replication. Phosphorylation may alter the structure of the N protein and in turn presentation of RNA binding domain(s) that is important for recognition of the packaging signal, transcription regulatory sequences or other signature sequences in the viral RNA(s). It was recently demonstrated that TGEV and SARS-CoV N proteins have RNA chaperone activity (Zuniga et al., 2007). We fully agree with the suggestion that phosphorylation may have a role in this activity.Samples analyzed in this study reflect steady state levels of phosphorylation for the N protein present in virus infected cells at 18 h p.i. We did not detect any difference in phosphorylation of the protein from the infected cells and that in extracellular virions. It should be noted that coronaviruses assemble at intracellular membranes in the region of the endoplasmic reticulum Golgi complex (ERGIC) (Krijnse-Locker et al., 1994). The details of virus release are not fully understood, but virions are thought to transport through the constitutive secretory pathway after budding. Thus, the intracellular fraction in our study no doubt contained some assembled virions that had not been secreted, which would contribute to the profile of sites that were identified. Our identification of the same sites on both intracellular and virion N differs from the results from recent analysis of TGEV N as discussed above. Four sites were found to be phosphorylated on the protein from infected cells, while only two of these were identified on extracellular virion N (Calvo et al., 2005). What accounts for the difference is not known. It may reflect a basic difference between MHV and TGEV. Calvo et al. (2005) suggested the possibility that phosphorylation/dephosphorylation could play a role in TGEV assembly. We previously suggested that a more highly phosphorylated isoform of the N protein exists in BCoV (Hogue, 1995) and MHVinfected cells (Hogue, unpublished data). We also hypothesized that nucleocapsids might undergo dephosphorylation during assembly into virions. We did not attempt to identify phosphorylation sites on different isoforms in the present study. Excised gel pieces included both forms of the protein that we previously suggested might be differentially phosphorylated based solely on phosphatase digestion results. Others have reported differences in the phosphorylation status of intracellular and virions N proteins. IBV virion N protein was reported to be more phosphorylated than the protein in virus infected cells (Jayaram et al., 2005).Our identification of phosphorylation sites for MHV N protein is an important step toward deciphering the functional role(s) of the modification during the virus life cycle. Knowing where phosphorylated sites are located on the protein will help direct future molecular studies to address how the modification contributes to or modulates functions of the multifunctional N protein.
Authors: Kumar Singh Saikatendu; Jeremiah S Joseph; Vanitha Subramanian; Benjamin W Neuman; Michael J Buchmeier; Raymond C Stevens; Peter Kuhn Journal: J Virol Date: 2007-01-17 Impact factor: 5.103
Authors: Sarah A Kopecky-Bromberg; Luis Martínez-Sobrido; Matthew Frieman; Ralph A Baric; Peter Palese Journal: J Virol Date: 2006-11-15 Impact factor: 5.103
Authors: Niranjan B Butchi; David R Hinton; Stephen A Stohlman; Parul Kapil; Volker Fensterl; Ganes C Sen; Cornelia C Bergmann Journal: J Virol Date: 2013-11-06 Impact factor: 5.103
Authors: Nicholas E Grossoehme; Lichun Li; Sarah C Keane; Pinghua Liu; Charles E Dann; Julian L Leibowitz; David P Giedroc Journal: J Mol Biol Date: 2009-09-24 Impact factor: 5.469