| Literature DB >> 31717952 |
Delphine Vincent1, Vilnis Ezernieks1, Simone Rochfort1, German Spangenberg1.
Abstract
Earlier this year we published a method article aimed at optimising protein extraction from mature buds of medicinal cannabis for trypsin-based shotgun proteomics (Vincent, D., et al. Molecules 2019, 24, 659). We then developed a top-down proteomics (TDP) method (Vincent, D., et al. Proteomes 2019, 7, 33). This follow-up study aims at optimising the digestion of medicinal cannabis proteins for identification purposes by bottom-up and middle-down proteomics (BUP and MDP). Four proteases, namely a mixture of trypsin/LysC, GluC, and chymotrypsin, which target different amino acids (AAs) and therefore are orthogonal and cleave proteins more or less frequently, were tested both on their own as well as sequentially or pooled, followed by nLC-MS/MS analyses of the peptide digests. Bovine serum albumin (BSA, 66 kDa) was used as a control of digestion efficiency. With this multiple protease strategy, BSA was reproducibly 97% sequenced, with peptides ranging from 0.7 to 6.4 kD containing 5 to 54 AA residues with 0 to 6 miscleavages. The proteome of mature apical buds from medicinal cannabis was explored more in depth with the identification of 27,123 peptides matching 494 unique accessions corresponding to 229 unique proteins from Cannabis sativa and close relatives, including 130 (57%) additional annotations when the list is compared to that of our previous BUP study (Vincent, D., et al. Molecules 2019, 24, 659). Almost half of the medicinal cannabis proteins were identified with 100% sequence coverage, with peptides composed of 7 to 91 AA residues with up to 9 miscleavages and ranging from 0.6 to 10 kDa, thus falling into the MDP domain. Many post-translational modifications (PTMs) were identified, such as oxidation, phosphorylations, and N-terminus acetylations. This method will pave the way for deeper proteome exploration of the reproductive organs of medicinal cannabis, and therefore for molecular phenotyping within breeding programs.Entities:
Keywords: BSA; GluC; bottom-up proteomics; bovine serum albumin; chymotrypsin; middle-down proteomics; missed cleavage; nLC-MS/MS; protease digestion; trypsin/LysC
Year: 2019 PMID: 31717952 PMCID: PMC6888629 DOI: 10.3390/ijms20225630
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Experimental design.
Number of MS peaks, MS/MS spectra and MS/MS spectra annotated with SEQUEST for each of the bovine serum albumin (BSA) digests. For protease legend, refer to Figure 1. An arrow (->) indicates the order in which the proteases were added. A colon (:) indicates that individual digests were pooled with equimolarity.
| Tube | 1. MS | 2. all MS/MS | % MS/MS | 3. SEQUEST Annotated MS/MS | % MS/MS Annotated | % MS Annotated | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sample | Protease Mix | Rep 1 | Rep 2 | Mean | SD | % CV | Rep 1 | Rep 2 | Mean | SD | Percent | Rep 1 | Rep 2 | Mean | SD | % | % |
| BSA | T | 83,678 | 83,056 | 83,367 | 440 | 0.5 | 9769 | 9325 | 9547 | 314 | 11 | 2133 | 1875 | 2004 | 182 | 21 | 2.4 |
| BSA | G | 91,922 | 98,895 | 95,409 | 3487 | 3.7 | 9081 | 9628 | 9355 | 387 | 10 | 929 | 1363 | 1146 | 307 | 12 | 1.2 |
| BSA | C | 92,116 | 90,303 | 91,210 | 907 | 1.0 | 10,327 | 9792 | 10,060 | 378 | 11 | 1358 | 1267 | 1313 | 64 | 13 | 1.4 |
| BSA | T->G | 89,648 | 83,107 | 86,378 | 3271 | 3.8 | 11,311 | 9698 | 10,505 | 1141 | 12 | 2178 | 1978 | 2078 | 141 | 20 | 2.4 |
| BSA | T:G | 84,347 | 87,462 | 85,905 | 1558 | 1.8 | 8605 | 9720 | 9163 | 788 | 11 | 2141 | 2332 | 2237 | 135 | 24 | 2.6 |
| BSA | T->C | 87,203 | 79,616 | 83,410 | 3794 | 4.5 | 10,944 | 8810 | 9877 | 1509 | 12 | 1864 | 1549 | 1707 | 223 | 17 | 2.0 |
| BSA | T:C | 90,847 | 92,736 | 91,792 | 945 | 1.0 | 10,245 | 10,115 | 10,180 | 92 | 11 | 2428 | 1931 | 2180 | 351 | 21 | 2.4 |
| BSA | G->C | 77,085 | 82,055 | 79,570 | 2485 | 3.1 | 6450 | 5163 | 5807 | 910 | 7 | 1103 | 475 | 789 | 444 | 14 | 1.0 |
| BSA | G:C | 99,001 | 100,001 | 99,501 | 500 | 0.5 | 9980 | 9847 | 9914 | 94 | 10 | 1169 | 1065 | 1117 | 74 | 11 | 1.1 |
| BSA | T->G->C | 88,919 | 84,798 | 86,859 | 2061 | 2.4 | 9880 | 6137 | 8009 | 2647 | 9 | 1485 | 1005 | 1245 | 339 | 16 | 1.4 |
| BSA | T:G:C | 91,975 | 89,420 | 90,698 | 1278 | 1.4 | 10,201 | 9503 | 9852 | 494 | 11 | 1015 | 1616 | 1316 | 425 | 13 | 1.5 |
| BSA | mean | 88,795 | 88,314 | 88,554 | 1884 | 2 | 9708 | 8885 | 9297 | 796 | 10 | 1618 | 1496 | 1557 | 244 | 17 | 2 |
| BSA | SD | 5707 | 6752 | 5811 | 1218 | 1 | 1317 | 1648 | 1333 | 756 | 1 | 544 | 531 | 501 | 136 | 4 | 1 |
| min | 77,085 | 79,616 | 79,570 | 440 | 1 | 6450 | 5163 | 5807 | 92 | 7 | 929 | 475 | 789 | 64 | 11 | 1 | |
| max | 99,001 | 100,001 | 99,501 | 3794 | 5 | 11,311 | 10,115 | 10,505 | 2647 | 12 | 2428 | 2332 | 2237 | 444 | 24 | 3 | |
Figure 2Protease digestion tests on bovine serum albumin (BSA). (A) Principal component analysis (PCA) of the BSA identified peptides. (B) Hierarchical clustering analysis (HCA) of the BSA identified peptides. (C). Identified peptides aligned onto BSA AA sequence of the mature protein. (D). BSA sequence coverage achieved using the various proteases on their own or in combination. (E) Average mass (Da) of BSA peptides resulting from the three proteases acting on their own, sequentially, or pooled; vertical bars denote standard deviation (SD). (F) Distribution of number of identified peptides according to the number of miscleaveages. Downward arrowhead (v) denotes the minimum peptide mass and upward arrowhead (^) denotes the maximum peptide mass.
Figure 3Principal component analysis (PCA) and partial least square (PLS) analysis of medicinal cannabis digests. (A) PCA projection plot of PC1xPC2 featuring the 42 digest samples resulting from the action of one protease (T, G, or C), two (T->G, T->C, or G->C), or three proteases (T->G->C) applied sequentially; (B) PCA loading plot of PC1xPC2 featuring the 27,635 C. sativa peptides identified and coloured according to their deconvoluted masses. (C) PLS score plot of LV1xLV2 featuring the 42 digest samples using the digestion type as a response, (D) PLS loading plot of LV1xLV2 featuring the 3349 most significant peptides from the linear model testing the response to proteases described in the Methods section, and coloured according to their retention time (min) and m/z values. T, trypsin/LysC, G, GluC, C, chymotrypsin, RT, retention time.
Number of MS peaks, MS/MS spectra and MS/MS spectra annotated in SEQUEST for each of the medicinal cannabis digests. For protease legend, refer to Figure 1. An arrow (->) indicates the order in which the proteases were added.
| Tube | 1. MS | 2. all MS/MS | % MS/MS | 3. SEQUEST Annotated MS/MS | % MS/MS Annotated | % MS Annotated | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Biol rep | Protease mix | Rep 1 | Rep 2 | Mean | SD | % CV | Rep 1 | Rep 2 | Mean | SD | Percent | Rep 1 | Rep 2 | Mean | SD | % | % | |
| Bud 1 | T | 86,458 | 115,577 | 101,018 | 20,590 | 20.4 | 12,827 | 11,731 | 12,279 | 775 | 12 | 2042 | 1929 | 1986 | 80 | 16 | 2.0 | |
| Bud 2 | T | 72,907 | 113,303 | 93,105 | 28,564 | 30.7 | 10,775 | 11,160 | 10,968 | 272 | 12 | 1606 | 1740 | 1673 | 95 | 15 | 1.8 | |
| Bud 3 | T | 70,473 | 112,818 | 91,646 | 29,942 | 32.7 | 10,541 | 10,585 | 10,563 | 31 | 12 | 1513 | 1643 | 1578 | 92 | 15 | 1.7 | |
| Bud 1 | G | 106,622 | 84,761 | 95,692 | 15,458 | 16.2 | 9035 | 8501 | 8768 | 378 | 9 | 1388 | 1376 | 1382 | 8 | 16 | 1.4 | |
| Bud 2 | G | 95,761 | 88,387 | 92,074 | 5214 | 5.7 | 8032 | 7906 | 7969 | 89 | 9 | 1200 | 1146 | 1173 | 38 | 15 | 1.3 | |
| Bud 3 | G | 93,760 | 91,846 | 92,803 | 1353 | 1.5 | 8810 | 8115 | 8463 | 491 | 9 | 1326 | 1290 | 1308 | 25 | 15 | 1.4 | |
| Bud 1 | C | 93,117 | 95,399 | 94,258 | 1614 | 1.7 | 9486 | 8644 | 9065 | 595 | 10 | 2589 | 2200 | 2395 | 275 | 26 | 2.5 | |
| Bud 2 | C | 93,778 | 92,536 | 93,157 | 878 | 0.9 | 8433 | 7788 | 8111 | 456 | 9 | 2232 | 1857 | 2045 | 265 | 25 | 2.2 | |
| Bud 3 | C | 97,359 | 97,813 | 97,586 | 321 | 0.3 | 9508 | 8341 | 8925 | 825 | 9 | 2382 | 2098 | 2240 | 201 | 25 | 2.3 | |
| Bud 1 | T->G | 116,131 | 113,352 | 114,742 | 1965 | 1.7 | 11,909 | 11,406 | 11,658 | 356 | 10 | 3416 | 3163 | 3290 | 179 | 28 | 2.9 | |
| Bud 2 | T->G | 113,690 | 111,601 | 112,646 | 1477 | 1.3 | 11,511 | 10,857 | 11,184 | 462 | 10 | 3103 | 2904 | 3004 | 141 | 27 | 2.7 | |
| Bud 3 | T->G | 118,020 | 115,958 | 116,989 | 1458 | 1.2 | 12,362 | 11,811 | 12,087 | 390 | 10 | 3633 | 3405 | 3519 | 161 | 29 | 3.0 | |
| Bud 1 | T->C | 98,125 | 94,395 | 96,260 | 2638 | 2.7 | 10,963 | 9568 | 10,266 | 986 | 11 | 4066 | 3434 | 3750 | 447 | 37 | 3.9 | |
| Bud 2 | T->C | 98,455 | 97,615 | 98,035 | 594 | 0.6 | 10,622 | 9090 | 9856 | 1083 | 10 | 4024 | 3308 | 3666 | 506 | 37 | 3.7 | |
| Bud 3 | T->C | 100,667 | 97,679 | 99,173 | 2113 | 2.1 | 11,238 | 8873 | 10,056 | 1672 | 10 | 4297 | 3321 | 3809 | 690 | 38 | 3.8 | |
| Bud 1 | G->C | 92,277 | 90,930 | 91,604 | 952 | 1.0 | 8219 | 7625 | 7922 | 420 | 9 | 2786 | 2545 | 2666 | 170 | 34 | 2.9 | |
| Bud 2 | G->C | 86,056 | 83,949 | 85,003 | 1490 | 1.8 | 7160 | 6390 | 6775 | 544 | 8 | 2393 | 2190 | 2292 | 144 | 34 | 2.7 | |
| Bud 3 | G->C | 93,847 | 89,624 | 91,736 | 2986 | 3.3 | 8158 | 7398 | 7778 | 537 | 8 | 2687 | 2502 | 2595 | 131 | 33 | 2.8 | |
| Bud 1 | T->G->C | 88,886 | 56,861 | 72,874 | 22,645 | 31.1 | 9479 | 4279 | 6879 | 3677 | 9 | 4117 | 2002 | 3060 | 1496 | 44 | 4.2 | |
| Bud 2 | T->G->C | 67,123 | 49,316 | 58,220 | 12,591 | 21.6 | 6835 | 1770 | 4303 | 3581 | 7 | 3065 | 824 | 1945 | 1585 | 45 | 3.3 | |
| Bud 3 | T->G->C | 84,077 | 77,062 | 80,570 | 4960 | 6.2 | 7685 | 5570 | 6628 | 1496 | 8 | 3392 | 2524 | 2958 | 614 | 45 | 3.7 | |
| Mean | 13,559 | 17,773 | 13,095 | 9797 | 11 | 1743 | 2526 | 2047 | 992 | 1 | 991 | 787 | 836 | 439 | 10 | 1 | ||
| SD | 13,232 | 17,345 | 12,779 | 9561 | 11 | 1701 | 2465 | 1997 | 968 | 1 | 967 | 769 | 816 | 428 | 10 | 1 | ||
| Min | 67,123 | 49,316 | 58,220 | 321 | 0.33 | 6835 | 1770 | 4303 | 31.1 | 7.391 | 1200 | 824 | 1173 | 8.49 | 14.7195 | 1.27398 | ||
| Max | 118,020 | 115,958 | 116,989 | 29,942 | 32.7 | 12,827 | 11,811 | 12,279 | 3677 | 12.155 | 4297 | 3434 | 3809 | 1585 | 45.1894 | 4.19837 | ||
Figure 4Statistics on medicinal cannabis peptides. (A) Averaged peptide ion score per AA residue targeted by the proteases. Maximum are represented by triangles. Vertical bars denote SDs. (B) Distribution of the numbers of missed cleavages per protease. (C) Distribution of the average masses of the cannabis peptides according to the number of missed cleavages. Vertical bars denote SDs. (D) Minimum (circles) and maximum (triangles) masses of the peptides according to the number of missed cleavages.
Number of fixed and dynamic PTMs per protease.
| Proteases | Carbamidomethylation | Acetylation | Phosphorylation | Oxidation | Total |
|---|---|---|---|---|---|
| Trypsin/LysC | 1362 | 296 | 6213 | 2927 | 10,798 |
| Chymotrypsin | 1483 | 238 | 7683 | 3520 | 12,924 |
| GluC | 1396 | 149 | 4820 | 2789 | 9154 |
| Total | 4241 | 683 | 18,716 | 9236 | 32,876 |
Figure 5Pie chart of the pathways in which identified Cannabis proteins are involved.