| Literature DB >> 12775768 |
Oleg Krokhin1, Yan Li, Anton Andonov, Heinz Feldmann, Ramon Flick, Steven Jones, Ute Stroeher, Nathalie Bastien, Kumar V N Dasuri, Keding Cheng, J Neil Simonsen, Hélène Perreault, John Wilkins, Werner Ens, Frank Plummer, Kenneth G Standing.
Abstract
A new coronavirus has been implicated as the causative agent of severe acute respiratory syndrome (SARS). We have used convalescent sera from several SARS patients to detect proteins in the culture supernatants from cells exposed to lavage another SARS patient. The most prominent protein in the supernatant was identified by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) as a approximately 46-kDa species. This was found to be a novel nucleocapsid protein that matched almost exactly one predicted by an open reading frame in the recently published nucleotide sequence of the same virus isolate (>96% coverage). A second viral protein corresponding to the predicted approximately 139-kDa spike glycoprotein has also been examined by MALDI-TOF MS (42% coverage). After peptide N-glycosidase F digestion, 12 glycosylation sites in this protein were confirmed. The sugars attached to four of the sites were also identified. These results suggest that the nucleocapsid protein is a major immunogen that may be useful for early diagnostics, and that the spike glycoprotein may present a particularly attractive target for prophylactic intervention in combating SARS.Entities:
Mesh:
Substances:
Year: 2003 PMID: 12775768 PMCID: PMC7780042 DOI: 10.1074/mcp.M300048-MCP200
Source DB: PubMed Journal: Mol Cell Proteomics ISSN: 1535-9476 Impact factor: 5.911
FAnalysis of proteins from a new coronavirus associated with SARS.A, Viral particles pelleted from the supernatant of vero E6 cells exposed to samples derived from patients with SARS (lane 1) or from mock-infected cells processed in a similar fashion (lane 2) were separated by SDS-PAGE and analyzed by Western blot with convalescent sera from SARS patients (Tor 2, Tor 3, Tor 4, and BC 1) or a control serum from a noninfected donor (NML). The sera from the patients, but not the control, reacted with a 44- to 48-kDa species present in the supernatants from the infected but not the mock-treated cultures (indicated by arrowhead). B, A virus sample similar to that described in A was fractionated on a sucrose gradient. The fraction containing immunoreactive material was separated on a 4–12% bis acrylamide gradient gel and stained with colloidal Coomasie blue. A prominent band with an apparent molecular mass of 44–48 kDa was observed along with much less intense band at ∼180 kDa (indicated by arrowheads). These bands were excised and used for the mass spectrometric studies described in this report.
FA, Single MS MALDI-QqTOF spectrum of the peptide mixture obtained from tryptic digestion of the 46-kDa protein prior to HPLC fractionation. B, Expanded view of a small section of the MALDI mass spectrum in A. C, The same section of the spectrum obtained from fraction 23, after HPLC separation of the mixture. The peak labels indicate the residue numbers corresponding to the intact protein; in one case loss of 64 Da is indicated. The intense peak corresponding to T277–293 in the mixture is absent in fraction 23 (it elutes in fraction 21), but several weaker peaks that are present, like T389–405, are significantly enhanced by the HPLC. The improvement helps to identify them and is essential for high mass accuracy and for subsequent MS/MS analysis. Measured and predicted masses for all the tryptic peptides can be found in Table II; Δm is less than 10 mDa in nearly every case.
Measured m/z and calculated MH
M* represents oxidized methionine residues; Ac-SDN...is the acetylated N-terminal of the protein; Q** represents the N-terminal Gln residues converted into pyro-Glu; and N*** represents an Asn residue converted into Asp due to deamidation
| MH+ calculated (Da) | Δ | Residues start-end | Peptide sequence | |
|---|---|---|---|---|
| 601.303 | 601.305 | −3 | 204–209 | GNSPAR |
| 601.322 | 601.331 | 1 | 103–107 | ELSPR |
| 698.357 | 698.358 | −1 | 144–149 | DHIGTR |
| 708.330 | 708.335 | −5 | 96–102 | GGDGKM*K |
| 711.333 | 711.331 | 2 | 294–299 | QGTDYK |
| 746.387 | 746.383 | 4 | 356–361 | HIDAYK |
| 749.354 | 749.354 | 0 | 178–185 | GGSQASSR |
| 805.378 | 805.380 | −2 | 196–203 | NSTPGSSR |
| 831.462 | 831.457 | 5 | 227–233 | LNQLESK |
| 876.452 | 876.461 | −9 | 101–107 | M*KELSPR |
| 886.401 | 886.406 | −5 | 170–177 | GFYAEGSR |
| 916.478 | 916.478 | 0 | 362–369 | TFPPTEPK |
| 928.543 | 928.546 | −3 | 348–355 | DNVILLNK |
| 946.513 | 946.511 | 2 | 62–68 | EELRFPR |
| 1105.548 | 1105.553 | −5 | 339–347 | LDDKDPQFK |
| 1144.493 | 1144.499 | −6 | 1–10 | Ac-SDNGPQSNQR |
| 1154.579 | 1154.580 | −1 | 376–385 | TDEAQPLPQR |
| 1166.557 | 1166.559 | −2 | 267–276 | Q**YNVTQAFGR |
| 1183.589 | 1183.586 | 3 | 267–276 | QYNVTQAFGR |
| 1202.610 | 1202.613 | −3 | 238–248 | GQQQQGQTVTK |
| 1282.678 | 1282.675 | 3 | 375–385 | KTDEAQPLPQR |
| 1330.698 | 1330.708 | −10 | 238–249 | GQQQQGQTVTKK |
| 1410.774 | 1410.771 | 3 | 376–387 | KKTDEAQPLPQR |
| 1611.698 | 1611.692 | 6 | 406–421 | QLQNSM*SGASADSTQA |
| 1684.895 | 1684.891 | 4 | 128–143 | EGIVWVATEGALNTPK |
| 1687.898 | 1687.905 | −7 | 210–226 | MASGGGETALALLLLDR |
| 1703.897 | 1703.900 | −3 | 210–226 | M*ASGGGETALALLLLDR |
| 1774.838 | 1774.836 | 2 | 278–293 | GPEQTQGNFGDQDLIR |
| 1850.833 | 1850.827 | 6 | 15–32 | ITFGGPTDSTDNNQNGGR |
| 1851.814 | 1850.811 | 3 | 15–32 | ITFGGPTDSTDNNQN***GGR |
| 1875.879 | 1875.879 | 0 | 389–405 | Q**PTVTLLPAADM*DDFSR |
| 1892.905 | 1892.906 | −1 | 389–405 | QPTVTLLPAADM*DDFSR |
| 1930.944 | 1930.937 | 7 | 277–293 | RGPEQTQGNFGDQDLIR |
| 2005.008 | 2005.006 | 2 | 388–405 | KQPTVTLLPAADMDDFSR |
| 2015.081 | 2015.081 | 0 | 339–355 | LDDKDPQFKDNVILLNK |
| 2021.005 | 2021.001 | 4 | 388–405 | KQPTVTLLPAADM*DDFSR |
| 2077.048 | 2077.043 | 5 | 320–338 | IGM*EVTPSGTWLTYHGAIK |
| 2091.126 | 2091.120 | −2 | 150–169 | NPNNNAATVLQLPQGTTLPK |
| 2151.002 | 2151.010 | 6 | 69–88 | GQGVPINTNSGPDDQIGYYR |
| 2252.062 | 2252.071 | −9 | 300–319 | HWPQIAQFAPSASAFFGMSR |
| 2297.078 | 2297.092 | −14 | 108–127 | WYFYYLGTGPEASLPYGANK |
| 2307.128 | 2307.111 | 17 | 69–89 | GQGVPINTNSGPDDQIGYYRR |
| 2324.187 | 2324.190 | −3 | 41–61 | RPQGLPNNTASWFTALTQHGK |
| 2516.325 | 2516.339 | −14 | 210–233 | M*ASGGGETALALLLLDRLNQLESK |
FMS/MS spectrum of tryptic fragment of The complete spectrum is shown in A, with the amino acid sequence indicated between the y-series fragments. An example of a b-series fragment is shown in B, and of a y-series fragment in C. The signature isotopic pattern of fragments containing the C terminus is visible in C. The measured and predicted masses for all identified peaks are shown in Table I.
Calculated and measured masses for b and y ions from MS/MS measurements of the 2297.092-Da tryptic fragment
| y-ion | MH+ calculated | Δ | Residue | Δm (mDa) | MH+ calculated | b-ion | ||
|---|---|---|---|---|---|---|---|---|
| y1 | – | 147.113 | – | K | 2 | 2279.083 | 2279.081 | b20 |
| y2 | 261.138 | 261.156 | −18 | N | – | – | 2150.986 | b19 |
| y3 | 332.193 | 332.193 | 0 | A | – | – | 2036.943 | b18 |
| y4 | 389.197 | 389.215 | −18 | G | – | – | 1965.906 | b17 |
| y5 | 552.283 | 552.278 | 5 | Y | 0 | 1908.885 | 1908.885 | b16 |
| y6 | 649.328 | 649.331 | −3 | P | −13 | 1745.809 | 1745.822 | b15 |
| y7 | 762.413 | 762.415 | −2 | L | 21 | 1648.790 | 1648.769 | b14 |
| y8 | 849.438 | 849.447 | −9 | S | 8 | 1535.694 | 1535.685 | b13 |
| y9 | 920.478 | 920.484 | −6 | A | −15 | 1448.638 | 1448.653 | b12 |
| y10 | 1049.522 | 1049.526 | −4 | E | 19 | 1377.637 | 1377.616 | b11 |
| y11 | 1146.571 | 1146.579 | −8 | P | – | – | 1248.573 | b10 |
| y12 | 1203.592 | 1203.601 | −9 | G | −4 | 1151.516 | 1151.520 | b9 |
| y13 | 1304.636 | 1304.648 | −8 | T | −5 | 1094.494 | 1094.499 | b8 |
| y14 | 1361.654 | 1361.670 | −16 | G | 1 | 993.452 | 993.451 | b7 |
| y15 | 1474.739 | 1474.754 | −15 | L | −21 | 936.409 | 936.430 | b6 |
| y16 | 1637.818 | 1637.817 | 1 | Y | 11 | 823.357 | 823.346 | b5 |
| y17 | 1800.884 | 1800.881 | 3 | Y | 9 | 660.291 | 660.282 | b4 |
| y18 | 1947.970 | 1947.949 | 21 | F | 1 | 497.220 | 497.219 | b3 |
| y19 | 2111.022 | 2111.012 | 10 | Y | −5 | 350.145 | 350.150 | b2 |
| y20 | 2297.086 | 2297.092 | −6 | W | – | – | 187.087 | b1 |
FComparison of the amino acid sequence of the 2297-Da peptide (deduced The shaded regions indicate areas of identity.
Deglycosylated peptides found in PNGase F-treated tryptic digest of 139-kDa spike protein
Bold N symbols represent deglycosylated sites.
| Peptide | Sequence | Calc. [M+H]+ with no glycosylation | Δ | No. of sites | |
|---|---|---|---|---|---|
| T111–126 | SQSVIIIN | 1756.992 | 1757.973 1758.963 | 0.981 1.971 | 1 2 |
| T222–232 | LPLGI | 1257.732 | 1258.715 | 0.983 | 1 |
| T226–287 | YDE | 2453.114 | 2454.101 | 0.987 | 1 |
| T316–333 | FP | 2069.017 | 2071.001 | 1.984 | 2 |
| T778–796 | YFGGF | 2169.138 | 2170.121 | 0.983 | 1 |
| T1056–1068 | 1445.685 | 1446.672 | 0.987 | 1 | |
| T1074–1089 | EGVFVF | 1887.939 | 1888.931 | 0.992 | 1 |
| T1140–1163 | 2493.259 | 2495.23 | 1.971 | 2 | |
| T1174–1187 | NL | 1585.844 | 1586.823 | 0.979 | 1 |
Glycosylated peptides found in tryptic digest of 139-kDa spike protein without treatment with PNGase F
Peptide symbols: N, first glycosylation site; N, second glycosylation site. Carbohydrate composition symbols: M, mannose; F, fucose; G, galactose; N, N-acetylglucosamine. *, m/z values could not be determined accurately (see text). **, Δm, measured - calculated mass.
| Peptide | Sequence | Calc., [M+H]+ | Residual carbohydrate mass | Carbohydrate composition | Calculated carbohydrate mass | Δ | MS/MS glycopeptide | |
|---|---|---|---|---|---|---|---|---|
| T111–126 | SQSVIII | 1756.992 | 3201.514 | 1444.522 | M3N4F | 1444.534 | −0.012 | |
| 3404.572 | 1647.580 | M3N5F | 1647.613 | −0.033 | ✓ | |||
| IR | 3525.669 | 1768.677 | M3N4G2F | 1768.640 | 0.037 | |||
| 3566.670 | 1809.678 | M3N5GF | 1809.666 | 0.012 | ||||
| 3607.682 | 1850.690 | M3N6F | 1850.693 | −0.003 | ||||
| 3728.687 | 1971.695 | M3N5G2F | 1971.719 | −0.024 | ||||
| 3769.678 | 2012.686 | M3N6GF | 2012.745 | −0.059 | ||||
| 3890.761 | 2133.769 | M3N5G3F | 2133.772 | −0.003 | ||||
| 3931.793 | 2174.801 | M3N6G2F | 2174.798 | 0.030 | ||||
| 4093.818 | 2336.826 | M3N6G3F | 2336.859 | −0.025 | ✓ | |||
| 4255.844 | 2498.852 | M3N6G4F | 2498.904 | 0.052 | ||||
| 4296.905 | 2539.913 | M3N7G3F | 2539.930 | −0.017 | ||||
| T222–232 | LPLGI | 1257.732 | 2636.215 | 1378.483 | M6N2 | 1378.476 | 0.007 | |
| TNFR | 2798.26 | 1540.528 | M7N2 | 1540.528 | 0.000 | |||
| 2960.306 | 1702.574 | M8N2 | 1702.581 | −0.007 | ||||
| 3122.373 | 1864.641 | M9N2 | 1864.634 | 0.007 | ✓ | |||
| T226–287 | YDE | 2453.114 | 4100.666 | 1647.552 | M3N5F | 1647.613 | −0.061 | |
| ITDAVD | 4303.735 | 1850.621 | M3N6F | 1850.693 | 0.072 | ✓ | ||
| CSQNPL | 4427.1* | 1974.0 | M3N5FG2 | 1971.719 | ||||
| AELK | 4468.9* | 2015.8 | M3N6FG | 2012.745 | ||||
| 4589.1* | 2136.0 | M3N5G3F | 2133.772 | |||||
| 4792.2* | 2339.1 | M3N6G3F | 2336.859 | |||||
| 4953.9* | 2500.8 | M3N6G4F | 2498.904 | |||||
| T778–796 | YFGGF | 2169.138 | 3385.59 | 1216.452 | M5N2 | 1216.423 | −0.029 | |
| FSQILPD | 3547.644 | 1378.506 | M6N2 | 1378.476 | −0.030 | |||
| PLKPTK | 3709.738 | 1540.600 | M7N2 | 1540.529 | −0.071 | ✓ | ||
| 3871.702 | 1702.564 | M8N2 | 1702.581 | 0.017 | ||||
FSingle MS (A and B) and MS/MS (C) detection of glycosylated tryptic peptides of the 139-kDa spike protein. A, MS spectrum of HPLC fraction 21 of tryptic digest of the 139-kDa band. Labeled peaks are the monoisotopic [M+H]+ ions of glycosylated forms of T111–126 (see Table III). B, MS spectrum of HPLC fraction 25 showing glycosylated forms of T222–232. C, MS/MS spectra of the 3122.373-Da peak from B. D, Suggested high-mannose and complex N-glycan structures, emphasizing possible diglycosylation of T111–126.