| Literature DB >> 26965458 |
Aida Serra1, Xinya Hemu1, Giang K T Nguyen1, Ngan T K Nguyen1, Siu Kwan Sze1, James P Tam1.
Abstract
Cyclotides are plant cyclic cysteine-rich peptides (CRPs). The cyclic nature is reported to be gene-determined with a precursor containing a cyclization-competent domain which contains an essential C-terminal Asn/Asp (Asx) processing signal recognized by a cyclase. Linear forms of cyclotides are rare and are likely uncyclizable because they lack this essential C-terminal Asx signal (uncyclotide). Here we show that in the cyclotide-producing plant Clitoria ternatea, both cyclic and acyclic products, collectively named cliotides, can be bioprocessed from the same cyclization-competent precursor. Using an improved peptidomic strategy coupled with the novel Asx-specific endopeptidase butelase 2 to linearize cliotides at a biosynthetic ligation site for transcriptomic analysis, we characterized 272 cliotides derived from 38 genes. Several types of post-translational modifications of the processed cyclotides were observed, including deamidation, oxidation, hydroxylation, dehydration, glycosylation, methylation, and truncation. Taken together, our results suggest that cyclotide biosynthesis involves 'fuzzy' processing of precursors into both cyclic and linear forms as well as post-translational modifications to achieve molecular diversity, which is a commonly found trait of natural product biosynthesis.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26965458 PMCID: PMC4786859 DOI: 10.1038/srep23005
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Flowchart of sample preparation for LC-MS/MS analysis of cliotides.
X: N-terminal residue in the cliotide sequences is usually Gly and occasionally Asp or Ser. Ac: acetylated N-terminus. EA: ethylamine-alkylation.
Figure 2MS characterization of each chemical derivatization process.
(A) MS profile of the crude C. ternatea extraction. (B) Acetylation of free N-terminal amine groups of the linear peptides resulted in a mass increase of 42 Da. (C) Fractionation of the crude mixture by strong cation exchange chromatography (example showed MALDI-TOF spectrum of fraction #3). (D) Disulfide bonds were reduced and alkylated in a one-pot reaction using a DTT and BrEA mixture at pH 8.6, 55 °C. A mass increase of 264 Da was observed after reduction and alkylation of CRPs with 3 disulfide bonds. (E) For macrocyclic of CRPs containing one Asn residue, butelase 2-catalyzed digestion resulted in a mass increase of 18 Da.
Figure 3LC-MS/MS profile of CRPs in the fractionated extract of C. ternatea.
(A) Example LC-MS profile of SCX fraction 3. Reduced and alkylated species are labeled in each identified peak and the newly identified variant cT55 is framed. (B) Annotated MS/MS spectra of the new cT55 sequence which exhibited a single amino acid substitution (V to I) compared with CterA. *Double-charged fragment. z’ ions correspond to z + 1 ions.
Full sequences of 38 cliotides determined by both transcriptome and proteome.
| No. | Contig | Sequence | No. identified |
|---|---|---|---|
| 1 | cT1 | GIP--CGESCVFIPCITGAI-GCSCKSK-VCYRN | 14 |
| 2 | cT2 | GEFLKCGESCVQGECYTP---GCSCDWP-ICKKN | 14 |
| 3 | cT3 | GLPT CGETCTLGTCYVPD---CSCSWP-ICMKN | 18 |
| 4 | cT4 | GIP--CGESCVFIPCITAAI-GCSCKSK-VCYRN | 15 |
| 5 | cT5 | GIP--CGESCVFIPCISTVI-GCSCKN | 2 |
| 6 | cT6 | SIP--CGESCVYIPCLTTIV-GCSCKSN-VCYSN | 20 |
| 7 | cT7 | GIP--CGESCVFIPCTVTALLGCSCKDK-VCYKN | 31 |
| 8 | cT8 | GIP--CGESCVFIPCISSVV-GCSCKSK-VCYNN | 3 |
| 9 | cT9 | GIP--CGESCVFIPCLTTVV-GCSCKNK-VCYNN | 6 |
| 10 | cT10 | GIP--CGESCVYIPCTVTALLGCSCKDK-VCYKN | 27 |
| 11 | cT11 | GIP--CGESCVFIPCTITALLGCSCKDK-VCYKN | 9 |
| 12 | cT12 | GIP--CGESCVFIPCITGAI-GCSCKSK-VCYRD | 17 |
| 13 | cT15 | GLPI-CGETCFKTKCYTK---GCSCSYP-VCKRN | 12 |
| 14 | cT17 | GTVP-CGESCVFIPCITGIA-GCSCKN | 19 |
| 15 | cT18 | GLPI-CGETCFTGTCYTP---GCTCSYP-VCKKN | 2 |
| 16 | cT19 | GSVIKCGESCLLGKCYTP---GCTCSRP-ICKKN | 9 |
| 17 | cT20 | GSAIRCGESCLLGKCYTP---GCTCDRP-ICKKN | 2 |
| 18 | cT21 | DLQ--CAETCVHSPCIGP----CYCKHGLICYRN | 1 |
| 19 | cT23 | 2 | |
| 20 | cT27 | GVIP-CGESCVFIPCITGAI-GCSCKSK- | 6 |
| 21 | cT28 | GGSIPCGESCVFLPCFLP---GCSCKSS-VCYLN | 4 |
| 22 | cT29 | 2 | |
| 23 | cT32 | GDLFKCGETCFGGTCYTP---GCSCDYP-ICKNN | 6 |
| 24 | cT33 | GFN-SCSEACVYLPCFSK---GCSCFKRQ-CYKN | 3 |
| 25 | cT34 | SYIP-CGESCVYIPCTVTALLGCSCSN | 1 |
| 26 | cT40 | GIP--CGESCVFIPCTITALLGCSCKSK-VCYKN | 4 |
| 27 | cT43 | DLI--CSSTCLHTPCKASV---CYCKN | 1 |
| 28 | cT45 | -----CGESCVFLPC | 1 |
| 29 | cT54 | GIP--CGESCVYIPCTVTALLGCSCKN | 1 |
| 30 | cT55 | GVIP-CGESCVFIPCISTLI-GCSCKN | 1 |
| 31 | CterA | GVIP-CGESCVFIPCISTVI-GCSCKNK-VCYRN | 20 |
| 32 | CterB | GVP--CAESCVWIPCTVTALLGCSCKDK-VCYLN | 3 |
| 33 | CterD | G | 2 |
| 34 | CterG | GLP--CGESCVFIPCITTVV-GCSCKNK-VCYNN | 8 |
| 35 | CterH | GLP--CGESCVFIPCITTVV-GCSCKN | 7 |
| 36 | CterI | GTVP-CGESCVFIPCITGIA-GCSCKN | 17 |
| 37 | CterJ | GTVP-CGESCVFIPCITGIA-GCSCKNK-VCYID | 33 |
| 38 | CterO | GIP--CGESCVFIPCITG | 6 |
*Number of identified sequences.
**Unidentified residues are underlined (Missing due to truncation or double digestion by butelase 2).
Molecular diversity of cliotide represented by cT7.
| Peptide derived from ctc7 gene | −10logP | Mass | Error ppm | m/z | z | No. (C) | MOD |
|---|---|---|---|---|---|---|---|
| GIPCGESCVFIPCTVTALLGCSCKDKVCYKN | 148.48 | 3508.7559 | −0.7 | 702.758 | 5 | 6 | EA |
| 67.06 | 3550.7664 | −1 | 592.801 | 6 | 6 | NT-Acetylation (+42.01); EA | |
| GIPCGESCVFIPCTVTALLGCSCKDKVCY | 39.27 | 3266.6179 | −0.3 | 545.443 | 6 | 6 | EA |
| GIPCGESCVFIPCTVTALLGCS | 87.25 | 2341.1665 | 0.5 | 586.299 | 4 | 4 | EA |
| GIPCGESCVFIPCTVTALLGC | 89.92 | 2254.1345 | −0.7 | 564.541 | 4 | 4 | EA |
| GIPCGESCVFIPCTVTALLG | 83.75 | 2108.0833 | −0.5 | 528.028 | 4 | 3 | EA |
| GIPCGESCVFIPCTVTALL | 89.03 | 2051.0618 | 0.3 | 513.773 | 4 | 3 | EA |
| GIPCGESCVFIPCTVTAL | 77.69 | 1937.9777 | −0.3 | 485.502 | 4 | 3 | EA |
| 37.86 | 1979.9883 | −0.1 | 661.003 | 3 | 3 | NT-Acetylation (+42.01); EA | |
| GIPCGESCVFIPCTVTA | 105.51 | 1824.8936 | −0.1 | 457.231 | 4 | 3 | EA |
| 47.35 | 1866.9042 | −0.3 | 623.309 | 3 | 3 | NT-Acetylation (+42.01); EA | |
| GIPCGESCVFIPCTVT | 72.32 | 1753.8564 | −0.1 | 439.471 | 4 | 3 | EA |
| GIPCGESCVFIPCTV | 46.09 | 1652.8088 | −0.4 | 551.943 | 3 | 3 | EA |
| GIPCGESCVFIPCT | 59.6 | 1553.7404 | 0 | 518.921 | 3 | 3 | EA |
| GIPCGESCVFIPC | 87.7 | 1452.6927 | 0.3 | 485.238 | 3 | 3 | EA |
| GIPCGESCVFIP | 45.56 | 1306.6414 | 0.3 | 436.555 | 3 | 2 | EA |
| GIPCGESCVF | 77.6 | 1096.5045 | −0.2 | 366.509 | 3 | 2 | EA |
| IPCGESCVFIPCTVTALLGCSCKDKVCYKN | 34.93 | 3451.7344 | 5.8 | 576.3 | 6 | 6 | EA |
| PCGESCVFIPCTVTA | 42.08 | 1654.7881 | −0.1 | 552.603 | 3 | 3 | EA |
| PCGESCVFIPCTVT | 40.16 | 1583.751 | 0.7 | 528.925 | 3 | 3 | EA |
| IPCTVTALLGC | 43.84 | 2592.3147 | −0.4 | 519.47 | 5 | 4 | EA; Hexose (S) (+162.05) |
| IPCTVTALLGCSCKDKVCYK | 129.36 | 2430.262 | 0.2 | 487.06 | 5 | 4 | EA |
| CTVTALLGCSCKDKVCYK | 123.83 | 2220.125 | −0.4 | 445.032 | 5 | 4 | EA |
| TVTALLGCSCKDKVCYK | 56.02 | 2075.0576 | 5.5 | 519.775 | 4 | 3 | EA; Deamidation (N) (+.98) |
| TVTALLGC | 46.32 | 2236.1265 | −0.2 | 560.039 | 4 | 3 | EA; Hexose (S) (+162.05) |
| TVTALLGCSCKDKVCYKN | 118.62 | 2074.0737 | −0.5 | 519.526 | 4 | 3 | EA |
| VTALLGCSCKDKVCYKN | 97.33 | 1973.026 | −0.1 | 494.264 | 4 | 3 | EA |
| TALLGCSCKDKVCYKN | 103.44 | 1873.9576 | 0.2 | 469.497 | 4 | 3 | EA |
| ALLGCSCKDKVCYKN | 39.47 | 1729.8677 | 0 | 433.474 | 4 | 3 | EA |
| LLGCSCKDKVCYK | 74.15 | 3509.74 | 0.3 | 502.399 | 7 | 6 | EA; Deamidation (N) (+.98) |
aIn Peaks PTM function, the p-value is converted from the linear discriminative function score. A higher −10logP value indicated a more confident sequencing result. −10lgP value > 25 is equivalent to false discovery rate (FDR) < 0.03%.
bMOD: side-chain modifications and chemical derivatization.
cAll Cys residues were alkylated with ethylamine and labeled as EA.
dResidues with modifications are bold and underlined.
Figure 4Biosynthesis and diversification mechanisms of cliotides through fuzzy processing and post-translational modifications.