| Literature DB >> 26399495 |
Roland Hellinger1, Johannes Koehbach1,2, Douglas E Soltis3, Eric J Carpenter4, Gane Ka-Shu Wong4,5,6, Christian W Gruber1,2.
Abstract
Cyclotides are plant-derived mini proteins. They are genetically encoded as precursor proteins that become post-translationally modified to yield circular cystine-knotted molecules. Because of this structural topology cyclotides resist enzymatic degradation in biological fluids, and hence they are considered as promising lead molecules for pharmaceutical applications. Despite ongoing efforts to discover novel cyclotides and analyze their biodiversity, it is not clear how many individual peptides a single plant specimen can express. Therefore, we investigated the transcriptome and cyclotide peptidome of Viola tricolor. Transcriptome mining enabled the characterization of cyclotide precursor architecture and processing sites important for biosynthesis of mature peptides. The cyclotide peptidome was explored by mass spectrometry and bottom-up proteomics using the extracted peptide sequences as queries for database searching. In total 164 cyclotides were discovered by nucleic acid and peptide analysis in V. tricolor. Therefore, violaceous plants at a global scale may be the source to as many as 150 000 individual cyclotides. Encompassing the diversity of V. tricolor as a combinatorial library of bioactive peptides, this commercially available medicinal herb may be a suitable starting point for future bioactivity-guided screening studies.Entities:
Keywords: 1kp; Violaceae; circular peptides; cystine-knot; mass spectrometry; natural products; peptidomics; ribosomally synthesized and post-translationally modified peptides; transcriptome
Mesh:
Substances:
Year: 2015 PMID: 26399495 PMCID: PMC4642221 DOI: 10.1021/acs.jproteome.5b00681
Source DB: PubMed Journal: J Proteome Res ISSN: 1535-3893 Impact factor: 4.466
Figure 1Viola tricolor cyclotide precursor analysis. (A) A generic cyclotide precursor is ribosomally synthesized and contains an ER signal, followed by a N-pro-region, N-terminal repeats (NTR), the cyclotide domain, and a C-terminal tail region. Cyclotide precursors may encode up to three peptide domains as it has been demonstrated for the V. tricolor tricyclon A precursor. Novel vitri precursors discovered by transcriptome mining contain one, two, or three cyclotide domains. Exemplarily shown are vitri precursors 2 and 9. As indicated each precursor domain may vary in length. For instance, vitri precursor 24 comprises an extended C-tail, whereas vitri precursor 52 completely lacks this domain. (B) Sequence comparison of vitri precursor processing site has been provided by a sequence logo. Three adjacent residues downstream of the C-terminus of the mature cyclotide domain in positions P′1, P′2, and P′3, and upstream of the N-terminus of the mature cyclotide domain in positions P1, P2, and P3 have been considered for being essential for precursor processing and cyclization in planta. Loop six comprises the highly conserved ligation sites, namely, glycine and asparagine or aspartic acid as highlighted in green.
Figure 2Mass spectrometry analysis of Viola tricolor cyclotide extract. (A) Plant peptide extract from V. tricolor was analyzed via LC–MS. A base peak chromatogram of mass signals ranging from 800–2200 Da is shown. Cyclotide masses were found to carry +2, +3, and rarely +4 H-ions. Signals >5.0% intensity threshold were assigned to mass and charge deconvolution tool from Data Analysis software (Bruker Daltonics). The determined cyclotide mass signals were consecutively numbered by increasing retention times. (B) Accordingly a MALDI−TOF spectrum is shown between 2750 and 3500 Da. Mass signals with an intensity threshold of >0.5% are labeled. For cyclotide names, retention times and molecular weight (monoisotopic masses) refer to Supplementary Table S-2.
Identification and Discovery of Vitri Cyclotides by LC−MS/MS Peptide Sequencing of Viola tricolor Extract and Fractions
| name | sequence | calc mass (monoiso.) [Da] | LC-MS | MALDI-MS | transcriptome | Mascot score | full/partial sequence | PTM |
|---|---|---|---|---|---|---|---|---|
| cycloviolacin O2 | GIPCGESCVWIPCISSAIGCSCKSKVCYRN | 3138.37 | x | x | 104.9 | partial | Dea (NQ) | |
| cO4 | GIPCGESCVWIPCISSAIGCSCKNKVCYRN | 3165.38 | x | 113.3 | full | Dea (NQ) | ||
| cO13 | GIPCGESCVWIPCISAAIGCSCKSKVCYRN | 3122.37 | x | x | 91.6 | partial | Dea (NQ) | |
| cO20 | GIPCGESCVWIPCLTSAIGCSCKSKVCYRD | 3153.37 | 90.2 | partial | ||||
| cO22 | GLPICGETCVGGTCNTPGCTCSWPVCTRN | 2904.16 | x | x | 118.8 | full | Dea (NQ), Hkyn(W), | |
| cO22 (linear) | acyclo-GLPICGETCVGGTCNTPGCTCSWPVCTRN | 2922.16 | x | x | 93.7 | partial | ||
| cO28 | GLPVCGETCVGGTCNTPGCSCSWPVCFRD | 2923.13 | x | 103.3 | full | Methyl (D) | ||
| chacur 1 | GLPVCGETCVGGTCNTPGCTCSWPICTRN | 2904.16 | x | x | 76.8 | partial | Hkyn(W) | |
| vitri peptide 1 | GLIPCGESCVWIPCISSVIGCSCKSKVCYKN | 3251.48 | x | 58.2 | partial | |||
| vitri peptide 2 | GSIPCGESCVWIPCISGIAGCSCSNKVCYLN | 3138.32 | x | x | x | 63.4 | partial | |
| vitri peptide 3 | GSWPCGESCVYIPCITSIAGCECSKNVCYKN | 3289.35 | x | 86.8 | partial | |||
| vitri peptide 4 | GTPCGESCIYVPCISAVFGCWCQSKVCYKD | 3221.32 | x | x | 74.3 | partial | ||
| vitri peptide 8 (linear) | acyclo-PTPCGETCIWISCVTAAIGCYCHESICYR | 3172.31 | x | 91.4 | partial | |||
| vitri peptide 9 | acyclo-GTIFDCGETCLLGKCYTPGCSCGSWALCYGQN | 3343.11 | x | 106.4 | full | |||
| vitri peptide 14 | GSSCGETCEVFSCFITRCACIDGLCYRN | 3012.18 | x | x | 62.0 | partial | ||
| vitri peptide 17 | GSDDQVACGESCAMTPCFMHVVGCVCSQKVCYR | 3488.37 | x | 81.5 | partial | Ox (M) | ||
| vitri peptide 18a | GVPICGETCFQGTCNTPGCTCKWPICERN | 3092.25 | x | x | 121.1 | full | Dea (NQ), Methyl (D) | |
| vitri peptide 20 | GDLVPCGESCVYIPCLTTVLGCSCSENVCYRN | 3372.41 | x | x | x | 71.1 | partial | |
| vitri peptide 21 | GGPLDCQETCTLSDRCYTKGCTCNWPICYKN | 3447.39 | x | 60.2 | partial | |||
| vitri peptide 22a | GAPVCGETCFTGLCYSSGCSCIYPVCNRN | 2979.16 | x | x | x | 153.0 | full | Dea (NQ) |
| vitri peptide 22a (linear) | acyclo-GAPVCGETCFTGLCYSSGCSCIYPVCNRN | 2997.16 | x | 75.9 | partial | |||
| vitri peptide 94b | GVAVCGETCTLGTCYTPGCSCDWPICKRN | 3012.22 | x | x | x | 139.0 | partial | Hkyn(W), Methyl (D) |
| vitri peptide 23 | GLPTCGETCTLGTCYTPGCTCSWPLCTKN | 2985.19 | x | x | 53.8 | partial | ||
| vitri peptide 24/28 | GEPVCGDSCVFFGCDDEGCTCGPWSLCYRN | 3194.14 | x | 57.9 | partial | |||
| vitri peptide 27a | GAFTPCGETCLTGECHTEGCSCVGQTFCVKK | 3171.27 | x | 67.0 | partial | Dea (NQ) | ||
| vitri peptide 29 | GVPSSDCLETCFGGKCNAHRCTCSQWPLCAKN | 3390.39 | x | 158.8 | partial | |||
| vitri peptide 30 | GFACGETCIFTSCFITGCTCNSSLCFRN | 2960.15 | x | 86.6 | partial | |||
| vitri peptide 36/37 | GGTIFSCGESCFQGTCYTKGCACGDWKLCYGEN | 3463.32 | x | 101.9 | partial | Dea (NQ) | ||
| vitri peptide 38 | GDTCYETCFTGFCFIGGCKCDFPVCVKN | 3032.18 | x | 63.0 | full | |||
| vitri peptide 39 | GAPICGESCFTGTCYTVQCSCSWPVCTRN | 3048.18 | x | x | x | 207.0 | full | Dea (NQ) |
| vitri peptide 39 (linear) | acyclo-GAPICGESCFTGTCYTVQCSCSWPVCTRN | 3066.18 | x | n.a. | partial | |||
| vitri peptide 24a | GGTIFNCGESCFQGTCYTKGCACGDWKLCYGEN | 3490.33 | x | 170.4 | partial | Dea (NQ), Ethyl (E) | ||
| vitri peptide 50 | GDIPCGESCVYIPCITGVLGCSCSHNVCYYN | 3244.29 | x | 76.9 | full | Dea (NQ) | ||
| vitri peptide 18b | GSVFNCGETCVFGTCFTSGCSCVYRVCSKD | 3134.26 | x | x | 163.8 | full | Dea (NQ) | |
| kalata B1 | GLPVCGETCVGGTCNTPGCTCSWPVCTRN | 2890.14 | x | x | 161.7 | full | Dea (NQ), Hkyn(W), | |
| kalata B1 (linear) | acyclo-GLPVCGETCVGGTCNTPGCTCSWPVCTRN | 2908.14 | x | x | n.a. | partial | ||
| kalata S | GLPVCGETCVGGTCNTPGCSCSWPVCTRN | 2876.13 | x | x | 307.0 | full | Dea (NQ), Hkyn(W), | |
| kalata S (linear) | acyclo-GLPVCGETCVGGTCNTPGCSCSWPVCTRN | 2894.13 | x | x | n.a. | partial | ||
| mram 8 | GIPCGESCVFIPCLTSAIGCSCKSKVCYRN | 3113.37 | 100.2 | partial | ||||
| oak6 cyclotide 1 | GLPVCGETCFGGTCNTPGCACDPWPVCTRN | 3033.18 | 75.4 | full | Hkyn(W), Dea (NQ) | |||
| tricyclon A | GGTIFDCGESCFLGTCYTKGCSCGEWKLCYGTN | 3478.35 | x | x | 331.3 | full | Dea (NQ) | |
| tricyclon A (linear) | acyclo-GTIFDCGESCFLGTCYTKGCSCGEWKLCYGTN | 3496.35 | x | n.a. | partial | Dea (NQ) | ||
| tricyclon B | GGTIFDCGESCFLGTCYTKGCSCGEWKLCYGEN | 3506.35 | 200.0 | full | Dea (NQ), Ethyl (E) | |||
| vaby C | GLPVCGETCAGGRCNTPGCSCSWPVCTRN | 2903.15 | x | x | 122.6 | full | ||
| vaby E | GLPVCGETCFGGTCNTPGCSCDPWPVCTRN | 3049.17 | 91.9 | full | Dea (NQ) | |||
| varv peptide E/vigno 3 (linear) | acycloGLPLCGETCVGGTCNTPGCSCSWPVCTRN | 2908.15 | x | x | n.a. | partial | ||
| varv B | GLPVCGETCFGGTCNTPGCSCDPWPMCSRN | 3067.13 | x | 229.0 | full | Me(Ox), Dea (NQ) | ||
| varv B (linear) | acyclo-GLPVCGETCFGGTCNTPGCSCDPWPMCSRN | 3085.13 | x | x | 77.4 | partial | Ox (M) | |
| varv C | GVPICGETCVGGTCNTPGCSCSWPVCTRN | 2876.13 | x | x | 253.7 | full | Dea (NQ) | |
| varv C (linear) | acyclo-GVPICGETCVGGTCNTPGCSCSWPVCTRN | 2894.13 | x | n.a. | partial | |||
| varv D | GLPICGETCVGGSCNTPGCSCSWPVCTRN | 2876.13 | x | x | 189 | full | Dea (NQ) | |
| varv D (linear) | acyclo-GLPICGETCVGGSCNTPGCSCSWPVCTRN | 2894.13 | x | 50.1 | partial | Hkyn(W), | ||
| varv E/vigno 3 | GLPICGETCVGGTCNTPGCSCSWPVCTRN | 2890.14 | x | x | 251.3 | full | Dea (NQ), Hkyn(W), | |
| varv F | GVPICGETCTLGTCYTAGCSCSWPVCTRN | 2957.17 | x | x | 244.5 | full | Dea (NQ), Hkyn(W) | |
| varv F (linear) | acyclo-GVPICGETCTLGTCYTAGCSCSWPVCTRN | 2975.37 | x | n.a. | partial | |||
| varv G | GVPVCGETCFGGTCNTPGCSCDPWPVCSRN | 3021.14 | x | x | 211.8 | full | Dea (NQ), Hkyn(W), | |
| varv G (linear) | acyclo-GVPVCGETCFGGTCNTPGCSCDPWPVCSRN | 3039.14 | x | n.a. | partial | |||
| varv H | GLPVCGETCFGGTCNTPGCSCETWPVCSRN | 3053.17 | x | 151.1 | full | Dea (NQ), Ethyl (E), Methyl (D) | ||
| viba 11 | GIPCGESCVWIPCISGAIGCSCKSKVCYRN | 3108.36 | 60.8 | full | Dea (NQ) | |||
| viba 15 | GLPVCGETCVGGTCNTPGCACSWPVCTRN | 2880.13 | x | 99.7 | full | Hkyn(W), Dea (NQ) | ||
| viba 19 | GLPVCGETCFGGTCNTPGCSCEWPVCTRN | 2966.14 | 77.3 | partial | Dea (NQ) | |||
| viba 30 | GPPVCGETCVGGTCNTPGCSCSWPVCTRN | 2860.10 | x | 191.0 | full | Dea (NQ), Kyn(W), | ||
| viba 30 (linear) | acyclo-GPPVCGETCVGGTCNTPGCSCSWPVCTRN | 2878.10 | x | x | n.a. | partial | ||
| viba 32 | GLPVCGEACVGGTCNTPGCSCSWPVCTRN | 2846.12 | x | 110.9 | full | Dea (NQ), Ethyl (E) | ||
| viba 9 | GIPCGESCVWIPCISSAIGCSCKNKVCYRK | 3179.43 | x | x | 106.5 | full | ||
| vibi C | GLPVCGETCAFGSCYTPGCSCSWPVCTRN | 2973.15 | 52.8 | full | ||||
| vibi G | GTFPCGESCVFIPCLTSAIGCSCKSKVCYKN | 3220.40 | 74.1 | partial | ||||
| vigno 10 | GTIPCGESCVWIPCISSVVGCSCKSKVCYKD | 3226.41 | 68.9 | partial | Dea (NQ) | |||
| vigno 4 | GLPLCGETCVGGTCNTPACSCSWPVCTRN | 2904.16 | x | x | 141.7 | full | Dea (NQ), Hkyn(W), | |
| vigno 5 | GLPLCGETCVGGTCNTPGCSCGWPVCVRN | 2858.15 | x | x | 152.2 | full | ||
| vigno 5 (linear) | acyclo-GLPLCGETCVGGTCNTPGCSCGWPVCVRN | 2876.20 | x | x | 152.3 | full | ||
| vigno 6 | GIPCGESCVWIPCISSAIGCSCKGSKVCYRN | 3195.39 | x | 78.8 | partial | |||
| vigno 7 | GTLPCGESCVWIPCISSVVGCSCKNKVCYKN | 3252.44 | 72.3 | partial | Dea (NQ) | |||
| vigno 9 | GIPCGESCVWIPCISSALGCSCKSKVCYRN | 3138.37 | x | x | 90.8 | partial | ||
| vitri A/cO3 | GIPCGESCVWIPCLTSAIGCSCKSKVCYRN | 3152.38 | x | x | 93.6 | partial | Dea (NQ) | |
| vitri D | GLPVCGETCFTGSCYTPGCSCNWPVCNRN | 3043.16 | x | x | x | 69.6 | full | |
| vitri E | GLPVCGETCVGGTCNTPGCSCSWPVCFRN | 2922.15 | x | x | 192.9 | full | Dea (NQ) | |
| vitri E (linear) | acyclo-GLPVCGETCVGGTCNTPGCSCSWPVCFRN | 2940.13 | n.a. | partial | ||||
| vitri F | GLTPCGESCVWIPCISSVVGCACKSKVCYKD | 3210.41 | x | 67.3 | partial | |||
| vodo M | GAPICGESCFTGKCYTVQCSCSWPVCTRN | 3075.23 | x | 162.8 | full | |||
| vodo N | GLPVCGETCTLGKCYTAGCSCSWPVCYRN | 3046.24 | x | 201.4 | full | Dea (NQ) | ||
| vodo N (linear) | acyclo-GLPVCGETCTLGKCYTAGCSCSWPVCYRN | 3066.24 | n.a. | partial |
Identified in trypsin digest.
Identified in endoproteinase GluC digest.
Identified in trypsin/endoproteinase GluC double digest.
Best score out of two independent analysis of each of the three digest experiments.
Modifications recorded in at least one peptide of assigned peptides of in total six independent analysis.
Figure 3Evaluation of post-translational modifications in cyclotides identified by MS/MS sequencing and database analysis. Database searching of MS/MS spectra of cyclotides revealed a number of peptides with posttranslational modifications. The most abundant ones have been illustrated exemplarily: (A) Aspartate methylation in vitri peptide 94b; (B) tryptophan degradation product kynurenine in varv peptide E; (C) tryptophan oxidation to 3-hydroxy-tryptophan in kalata B1; and (D) glutamate ethylation and glutamine or asparagine deamidation in vitri peptide 42a. (E) The MS/MS fragment spectrum of a C-terminal fragment of endoproteinase GluC processed vitri peptide 22a has been shown to confirm the presence of acyclic cyclotides in V. tricolor. The major b- and y-ion series have been labeled in all spectra. Furthermore, peptide mass, cleavage protease, mass error, and Mascot score of the illustrated peptides have been provided.
Figure 4Comparative analysis of vitri cyclotides identified by transcriptome mining, mass deconvolution, and MS/MS sequencing. (A) An overview of vitri peptides discovered in this study is summarized by a Venn diagram. The number of identified cyclotides by each approach (transcriptome mining, mass deconvolution or MS/MS sequencing) is indicated by the numbers in each of the three interwheeling circles. The overlapping regions represent the numbers of identified cyclotides by combination of two or all three approaches. (B) A pie chart illustrates coverage for confirmation of predicted vitri cyclotides by MS. All predicted 108 vitri cyclotides identified by transcriptome mining were grouped in cohorts: (i) confirmed by sequence or molecular weight (28%), (ii) partial sequences from transcriptome mining (50%), (iii) cyclotide-like molecules (2%) and unverified full-length cyclotides (20%).