| Literature DB >> 24529077 |
Lan Zhang, Bin Han, Rongli Li, Xiaoshan Lu, Aiying Nie, Lihai Guo, Yu Fang, Mao Feng, Jianke Li1.
Abstract
BACKGROUND: Royal jelly (RJ) is a proteinaceous secretion produced from the hypopharyngeal and mandibular glands of nurse bees. It plays vital roles in honeybee biology and in the improvement of human health. However, some proteins remain unknown in RJ, and mapping N-glycosylation modification sites on RJ proteins demands further investigation. We used two different liquid chromatography-tandem mass spectrometry techniques, complementary N-glycopeptide enrichment strategies, and bioinformatic approaches to gain a better understanding of novel and glycosylated proteins in RJ.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24529077 PMCID: PMC3942810 DOI: 10.1186/1471-2164-15-135
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Identification of proteins in royal jelly
| gi|58585098 | 566.72 | 96 | 742 | 166 | 48886 | Major royal jelly protein 1 precursor | - | - | |
| gi|58585108 | 527.66 | 93 | 437 | 100 | 51074 | Major royal jelly protein 2 precursor | - | - | |
| gi|58585142 | 487.03 | 88 | 354 | 12 | 61662 | Major royal jelly protein 3 precursor | - | - | |
| gi|284182838 | 357.38 | 89 | 236 | 15 | 53015 | Major royal jelly protein 4 | - | - | |
| gi|284812514 | 386.92 | 71 | 182 | 6 | 70182 | Major royal jelly protein 5 | - | - | |
| gi|58585188 | 282.69 | 58 | 57 | 7 | 49786 | Major royal jelly protein 6 precursor | - | - | |
| gi|62198227 | 422.37 | 84 | 182 | 70 | 50541 | Major royal jelly protein 7 precursor | - | - | |
| gi|58585070 | 131.13 | 15 | 3 | 3 | 46956 | Major royal jelly protein 8 precursor | - | - | |
| gi|67010041 | 221.7 | 45 | 26 | 10 | 48688 | Major royal jelly protein 9 precursor | - | - | |
| gi|148277624 | 146.46 | 19 | 6 | 6 | 48235 | Yellow-e3 precursor* | √ | # | |
| gi|328787887 | 299.74 | 26 | 40 | 37 | 188194 | Lysosomal alpha-mannosidase | - | - | |
| gi|89885579 | 290.12 | 50 | 33 | 3 | 65565 | Alpha-glucosidase | - | - | |
| gi|58585144 | 227.31 | 43 | 7 | 3 | 55947 | Alpha-amylase precursor | - | - | |
| gi|66564326 | 185.29 | 37 | 10 | 6 | 52947 | Plasma glutamate carboxypeptidase isoform 1 | - | - | |
| gi|328778095 | 116.42 | 10 | 4 | 4 | 56432 | Lysosomal Pro-X carboxypeptidase* | √ | # | |
| gi|66560290 | 102.39 | 10 | 3 | 3 | 42222 | Lysosomal aspartic protease* | √ | # | |
| gi|328782027 | 93.88 | 6 | 4 | 4 | 88720 | Membrane metallo-endopeptidase 1* | √ | # | |
| gi|48118838 | 276.66 | 56 | 32 | 31 | 58571 | Glucosylceramidase | - | - | |
| gi|66524161 | 196.85 | 59 | 15 | 14 | 25186 | Ferritin heavy chain | - | - | |
| gi|328779534 | 220.38 | 10 | 23 | 20 | 79344 | Hypothetical protein LOC552041 | - | - | |
| gi|328780642 | 167.48 | 16 | 9 | 8 | 64654 | Matrix metalloproteinase 14* | √ | # | |
| gi|328784061 | 154.24 | 27 | 6 | 5 | 34132 | Pancreatic triacylglycerol lipase* | √ | # | |
| Health promotion activity | gi|58585090 | 389.52 | 76 | 137 | 77 | 67938 | Glucose oxidase | - | - |
| gi|166795901 | 259.92 | 93 | 46 | 14 | 21348 | Apolipophorin-III protein precursor | - | - | |
| gi|328782084 | 240.41 | 41 | 7 | 4 | 59502 | Antithrombin-III | - | - | |
| gi|60115688 | 226.26 | 71 | 39 | 5 | 24819 | Icarapin precursor | - | - | |
| gi|254910938 | 214.17 | 75 | 14 | 4 | 10717 | Defensin-1 preproprotein | - | - | |
| gi|187281543 | 150.07 | 16 | 11 | 10 | 87937 | Venom dipeptidyl Peptidase 4 precursor* | √ | # | |
| gi|110755367 | 142.44 | 13 | 7 | 7 | 75706 | Toll-like receptor 13 isoform 1 | - | - | |
| gi|48101366 | 138.63 | 15 | 5 | 5 | 44639 | Venom serine protease 34* | √ | # | |
| gi|254548157 | 102.39 | 37 | 3 | 2 | 12611 | Hymenoptaecin precursor* | √ | # | |
| gi|328790726 | 243.78 | 70 | 28 | 19 | 42665 | Venom acid phosphatase Acph 1 | - | - | |
| gi|66507455 | 188.31 | 27 | 11 | 11 | 39483 | Venom protease* | √ | # | |
| gi|328792524 | 89.27 | 9 | 5 | 2 | 90763 | Hypothetical protein LOC408570* | √ | # | |
| gi|110758964 | 221.47 | 83 | 17 | 4 | 10161 | Regucalcin | - | - | |
| gi|66565246 | 83.06 | 16 | 3 | 2 | 17081 | Lysozyme isoform 1* | √ | # | |
| Developmental process | gi|66514614 | 165.33 | 19 | 8 | 4 | 48741 | Idgf4 | - | - |
| gi|110766389 | 139.29 | 22 | 5 | 5 | 30201 | Protein takeout | - | - | |
| gi|66521538 | 101.17 | 11 | 2 | 2 | 33735 | Protein CREG1* | √ | # | |
| gi|94158822 | 86.71 | 22 | 2 | 2 | 15201 | Odorant binding protein 14 precursor | - | - | |
| Unknown | gi|48094573 | 292.99 | 64 | 45 | 32 | 19434 | Hypothetical protein LOC408608 | - | - |
| gi|110763647 | 109.47 | 27 | 5 | 4 | 18478 | Hypothetical protein LOC726323 | - | - |
Note: All of the identified proteins are of Apis mellifera origin. Accession is the unique number given to mark the entry of a protein in the database of Apis (downloaded April 2012, version 4.5 of the honeybee genome) using in-house PEAKS software (version 6.0, Bioinformatics Solutions Inc.). “-10logP” is the score calculated by PEAKS software. Sequence coverage is the ratio of the number of amino acids in every peptide that matches with the mass spectrum divided by the total number of amino acids in the protein sequence. Matches are the number of experiment fragmentation spectra paired to a theoretical segment of protein. The number of unique peptides refers to the peptide sequences that are unique to an individual parent protein sequence. SignalP refers to the result researched with SignalP 4.1. PSORT refers to the result researched with PSORT II. “*” indicates the protein identified as novel in royal jelly. “√” indicates the protein identified with signal peptide by SignalP 4.1. “#” indicates the protein identified as extracellular by PSORT II. “-” indicates the protein did not be researched with SignalP 4.1 or PSORT II.
Figure 1Representative spectra of N-glycosylated peptide in royal jelly proteins. The tandem mass spectrum of the N-glycosylated site is identified in peptide GESLN(+2.99)KSLPILHEWK using 18O-water labeling.
Identification of glycosylated proteins, peptides and their glycosylation sites in royal jelly proteins
| gi|58585098 | 442.09 | 48886 | | | | | | | | | |
| | R.GESLN(+2.99)KSLPILHEWK.Fb | 57.82 | 585.32 | 3 | 8 | 24-38 | 28 | √ | | √ | |
| gi|58585108 | 398.06 | 51074 | | | | | | | | | |
| | K.TSNHLKQIEIPHDIAVN(+2.99)ATTGK.Ga | 67.92 | 797.42 | 3 | 15 | 162-183 | 178 | √ | √ | √ | √ |
| | K.IAIDKFDRLWVLDSGLVN(+2.99)R.Ta | 57.57 | 1117.1 | 2 | 9 | 128-146 | 145 | √ | √ | √ | √ |
| gi|284182838 | 298.73 | 53015 | | | | | | | | | |
| | R.KN(+2.99)LTNTLNVIHEWK.Yb | 57.71 | 856.97 | 2 | 5 | 30-41 | 31 | √ | | √ | |
| | K.M(+15.99)SNQQEN(+2.99)LTLKEVDNK.Vb | 47.23 | 637.31 | 3 | 2 | 236-251 | 242 | √ | | √ | |
| gi|58585188 | 203.5 | 49786 | | | | | | | | | |
| | K.NYPFDVDQWHN(+2.99)K.Td | 47.63 | 783.35 | 2 | 4 | 68-79 | 78 | √ | | √ | |
| | K.LLAFDLN(+2.99)TSKLLK.Qd | 47 | 739.94 | 2 | 2 | 158-170 | 164 | | √ | | |
| gi|62198227 | 312.92 | 50541 | | | | | | | | | |
| | K.QVDIPHEIAVN(+2.99)TTTEQGR.Ld | 54.49 | 1006 | 2 | 11 | 168-185 | 178 | √ | √ | √ | √ |
| | R.LWVLDSGLVN(+2.99)NTQPM(+15.99)C(+57.02)FPK.Ld | 45.55 | 1112 | 2 | 8 | 136-154 | 145 | √ | √ | √ | |
| | K.NGILFFGLVN(+2.99)NTAVGC(+57.02)WNEHQ(+0.98)TLQ(+0.98)R.Ed | 57.66 | 1447.2 | 2 | 2 | 312-316 | 321 | | | √ | |
| gi|67010041 | 107.71 | 48688 | | | | | | | | | |
| | K.IPHDIAIN(+2.99)STTGK.Rd | 40.14 | 685.37 | 2 | 2 | 170-182 | 177 | √ | | | |
| gi|148277624 | 89 | 48235 | | | | | | | | | |
| | K.YM(+15.99)SGTLNSN(+2.99)ETNFR.Ie | 50.32 | 826.86 | 2 | 2 | 384-397 | 392 | | | √ | |
| gi|328787887 | 165.89 | 188194 | | | | | | | | | |
| | R.LTQSFHYYEGM(+15.99)EGNNM(+15.99)EFKN(+2.99)R.Sd | 53.93 | 877.38 | 3 | 4 | 538-558 | 557 | √ | | √ | |
| | R.LLKDDAFGVGEALN(+2.99)ESAYGEGLVVR.Gd | 50.85 | 1313.2 | 2 | 2 | 722-746 | 735 | √ | | | |
| gi|89885579 | 193.65 | 65565 | | | | | | | | | |
| | K.N(+2.99)VSRDSN(+2.99)SSDFKK.Lb | 39.74 | 497.24 | 3 | 2 | 313-325 | 313 319 | √ | | | |
| | K.HM(+15.99)LIEAYTN(+2.99)LSM(+15.99)TM(+15.99)K.Yb | 38.65 | 917.42 | 2 | 2 | 282-296 | 290 | | √ | | |
| gi|66564326 | 178.08 | 52947 | | | | | | | | | |
| | K.ESADYGLENVHGEN(+2.99)VTVPFWVR.Gd | 63.89 | 1261.1 | 2 | 2 | 99-120 | 112 | √ | | | |
| | R.SVTPYSLYTPHTGHQSYGEN(+2.99)VTK.Id | 55.98 | 643.06 | 4 | 5 | 212-234 | 231 | √ | | √ | |
| | R.IM(+15.99)TLLSPM(+15.99)GN(+2.99)LTVR.Sd | 42.7 | 790.92 | 2 | 3 | 394-407 | 403 | √ | √ | | |
| | R.AIM(+15.99)NEALN(+2.99)GSFK.Gd | 34.47 | 657.33 | 2 | 2 | 54-65 | 61 | | √ | | |
| gi|328778095 | 71.61 | 56432 | | | | | | | | | |
| | R.YYGESM(+15.99)PYNN(+2.99)K.Se | 35.71 | 692.79 | 2 | 1 | 124-134 | 133 | | | √ | |
| gi|66560290 | 62.1 | 42222 | | | | | | | | | |
| | K.N(+2.99)GTDFAIR.Ye | 35.21 | 448.72 | 2 | 1 | 120-127 | 120 | √ | | | |
| gi|328782027 | 89.03 | 88720 | | | | | | | | | |
| | K.HNPIPDN(+.98)KVEWSEDEIKAN(+2.99)K.Te | 50.01 | 592.55 | 4 | 2 | 92-111 | 110 | √ | | | |
| | K.WYDN(+.98)SGVN(+2.99)TSTAK.Ie | 37.35 | 723.83 | 2 | 1 | 311-323 | 318 | √ | | | |
| | R.IVNTN(+2.99)DTETR.Le | 36.04 | 583.28 | 2 | 1 | 31-40 | 35 | √ | | | |
| gi|48118838 | 142.8 | 58571 | | | | | | | | | |
| | K.QFDNN(+2.99)ITYLKEEHYETYVNYLIK.Fd | 54.42 | 735.86 | 4 | 3 | 211-233 | 215 | √ | | √ | |
| | K.N(+2.99)FSLAPEDYNYK.Id | 46.71 | 732.33 | 2 | 2 | 171-182 | 171 | √ | | | |
| | K.TQANWIANYFGPILASSPFN(+2.99)K.Td | 30.3 | 781.41 | 3 | 1 | 273-293 | 292 | | √ | | |
| | R.M(+15.99)N(+2.99)VSEVKFDR.Cd | 40.85 | 622.3 | 2 | 3 | 72-81 | 73 | √ | √ | √ | |
| gi|58585090 | 277.69 | 67938 | | | | | | | | | |
| | R.SNLHVIVN(+2.99)ATVTK.Vd | 54.93 | 699.9 | 2 | 8 | 277-289 | 284 | √ | √ | √ | √ |
| | K.LVN(+2.99)TTVM(+15.99)RDLGVEFQK.Id | 52.05 | 934.99 | 2 | 7 | 499-514 | 501 | √ | √ | √ | √ |
| | R.WVQQGAFGWSWDEVM(+15.99)PYYLKSEN(+2.99)NTELSR.Vd | 40.6 | 1180.9 | 3 | 4 | 172-200 | 194 | | | √ | √ |
| | R.AFITPFEN(+2.99)R.Sd | 41.19 | 549.28 | 2 | 3 | 268-276 | 275 | | | √ | √ |
| | K.YYTTN(+2.99)ESHACLSTGGSCYWPR.Gd | 41.83 | 800.34 | 3 | 2 | 126-146 | 130 | | | √ | |
| gi|166795901 | 226.7 | 21348 | | | | | | | | | |
| | K.DQSANFVNNIQDYIKN(+2.99)VTEEVK.Td | 62.84 | 857.76 | 3 | 6 | 71-92 | 86 | √ | √ | √ | |
| gi|328782084 | 97.9 | 59502 | | | | | | | | | |
| | K.ISN(+2.99)DSAQNGERDSIYHLIER.Ld | 48.05 | 580.78 | 4 | 2 | 362-381 | 364 | √ | | | |
| gi|187281543 | 80.93 | 87937 | | | | | | | | | |
| | R.HLAFATFN(+2.99)DTNVR.Dc | 47.19 | 503.59 | 3 | 2 | 232-244 | 239 | √ | | | |
| | R.ANSFN(+2.99)GTWK.Tc | 39.89 | 514.24 | 2 | 2 | 64-72 | 68 | √ | | | |
| | K.YSWIDSN(+2.99)R.Tc | 34.35 | 522.24 | 2 | 2 | 625-632 | 631 | | | √ | |
| gi|110755367 | 127.33 | 75706 | | | | | | | | | |
| | R.M(+15.99)LEHLDLSN(+2.99)NSLSTVNR.Rd | 55.39 | 981.48 | 2 | 2 | 547-563 | 555 | √ | | | |
| | R.HLNTQFFHN(+2.99)TTNLNK.Ld | 54.76 | 611.31 | 3 | 2 | 166-180 | 174 | √ | | | |
| | K.LHTLEEGLFAN(+2.99)LTR.Ld | 50.96 | 539.62 | 3 | 2 | 432-445 | 442 | √ | | | |
| | R.LSEEAFKN(+2.99)ASK.Ld | 39.92 | 613.81 | 2 | 1 | 315-325 | 322 | √ | | | |
| gi|328790726 | 142.25 | 42665 | | | | | | | | | |
| | K.M(+15.99)PSTINFYPNDPYIN(+2.99)YTYEPAGK.Gd | 57.47 | 1357.6 | 2 | 2 | 38-60 | 52 | √ | | | |
| gi|328792524 | 89.27 | 90763 | | | | | | | | | |
| | R.WSLTPVNSN(+2.99)TTVVVK.Qe | 43.11 | 824.45 | 2 | 2 | 539-553 | 547 | √ | | | |
| | R.QN(+2.99)YTDAPPAK.Le | 41.13 | 554.27 | 2 | 2 | 590-599 | 591 | √ | | | |
| | R.IDPN(+2.99)SSFTQSNPIR.Fe | 38.14 | 789.89 | 2 | 2 | 284-297 | 287 | √ | | √ | |
| gi|66514614 | 135.8 | 48741 | | | | | | | | | |
| | R.LKDLTIGVLPHVN(+2.99)STVYYDAR.Ld | 57.43 | 595.07 | 4 | 2 | 216-236 | 228 | √ | | | |
| gi|110766389 | 90.9 | 30201 | | | | | | | | | |
| | R.ALFSN(+2.99)ITVIGAGN(+2.99)YSLTK.Sd | 56.15 | 938 | 3 | 2 | 105-122 | 109 117 | √ | | √ | |
| gi|48094573 | 292.9 | 19434 | | | | | | | | | |
| | K.GNLGTVN(+2.99)LTKVLKSVEDR.Ld | 53.12 | 641.34 | 4 | 5 | 61-78 | 67 | √ | √ | √ | √ |
| gi|110763647 | 109.47 | 18478 | | | | | | | | | |
| R.IYDPITN(+2.99)TSK.Md | 35.5 | 577.8 | 2 | 1 | 133-142 | 139 | √ | ||||
Note: All of the identified proteins are from Apis mellifera. Accession is the unique number given to mark the entry of a protein in the database of Apis (downloaded April 2012, version 4.5 of the honeybee genome). “-10logP” is the score calculated by PEAKS software (version 6.0, Bioinformatics Solutions Inc.). Charge is the number of the carrying charge of the peptide. No. of spectra is the number of the spectrum of the peptide generated by mass spectrometry. Amino acid residue No. corresponds to the position of the N-terminal and C-terminal amino acid of the peptide in the protein sequence. Glycosylation site indicates the position of the N-glycosylated amino acids of the peptide in the protein sequence. Orbitrap refers the peptides analyzed by the Q-Exactive mass spectrometry (Thermo Fisher Scientific). Triple TOF refers the peptides analyzed by Triple TOF 5600 (AB SCIEX). Lectin denotes N-glycopeptides enriched by the lectin method. Hydrazide represents N-glycopeptides enriched by hydrazide chemistry. “√” indicates that peptide is identified by the corresponding enrichment method and mass spectrometer. “a” is the known site in the known protein. “b” is the potential site (bioinformatics has predicted these potential sites in UniProt Database (updated April 2013), and it is experimentally confirmed in this study) in the known protein. “c” is the potential site in the novel protein. “d” denotes the novel site in the known protein. “e” is the novel site in the novel protein.
Figure 2Distribution of N-glycopeptides analyzed by different enriched methods and instruments of royal jelly proteins. A is the distribution of N-glycopeptides enriched by lectin and hydrazide methods using mass spectrometry of Q-Exactive (orbitrap-based MS). 21 and eight are N-glycopeptides uniquely identified by the lectin and hydrazide enrichment, respectively, and 18 are N-glycopeptides identified by both enrichment methods using orbitrap-based MS. B is the distribution of N-glycopeptides enriched by lectin and hydrazide methods using mass spectrometry of triple TOF 5600 (triple TOF-based MS). Eight and two are N-glycopeptides specifically identified by the lectin and hydrazide enrichment protocols, respectively, and six are N-glycopeptides identified by both enrichment methods using triple TOF-based MS. C is the distribution of N-glycopeptides identified by the orbitrap-based MS and triple TOF-based MS using lectin enrichment method. 29 are N-glycopeptides uniquely identified by orbitrap-based MS, and four are uniquely identified by triple TOF-based MS, and 10 are N-glycopeptides identified by both MS systems using the lectin enrichment method. D is the distribution of N-glycopeptides identified by orbitrap-based MS and triple TOF-based MS using hydrazide enrichment. 18 are N-glycopeptides identified by orbitrap-based MS alone, and eight are N-glycopeptides identified by both types of LC-MS/MS instruments with adoption of hydrazide enrichment.
Figure 3Distribution of N-glycosylated sites in royal jelly proteins. “2” is the identified two known sites in known glycoprotein. “6” is potential sites predicted in known glycoprotein, and “3” is potential glycosylation sites identified in novel glycoprotein. “33” is the novel sites identified in known glycoprotein, and “9” is the novel sites identified in novel glycoprotein.
Figure 4Distribution of N-glycosylated royal jelly proteins carrying different numbers of modification sites. “1, 2 and 3” are the N-glycosylated protein carrying 1, 2 and 3 N-linked glycosylation sites, respectvely. “> = 4” is the N-glycosylated protein carried four or more N-glycosylated sites.
Figure 5N-glycosylated site motif in royal jelly proteins.