| Literature DB >> 30458731 |
Rong Zeng1, Shigang Gao1, Lihui Xu1, Xin Liu1, Fuming Dai2.
Abstract
BACKGROUND: Gray leaf spot is a devastating disease caused by Stemphylium lycopersici that threatens tomato-growing areas worldwide. Typically, many pathogenesis-related and unrelated secreted proteins can be predicted in genomes using bioinformatics and computer-based prediction algorithms, which help to elucidate the molecular mechanisms of pathogen-plant interactions.Entities:
Keywords: CAZymes; Effector; Pathogen-host interaction; SCRSPs; Signal peptide; Tomato leaf gray spot disease
Mesh:
Substances:
Year: 2018 PMID: 30458731 PMCID: PMC6247510 DOI: 10.1186/s12866-018-1329-y
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
Fig. 1Symptoms of S. lycopersici in field on tomato leave and fruit
The bioinformatics tools adopted for the prediction of secreted proteins from S. lycopersici
| Prediction algorithms | Objects predicted | References |
|---|---|---|
| SignalP v4.1 | Signal peptides |
|
| TMHMM v2.0 | Transmembrane domains |
|
| big-PI Fungal Predictor | GPI-anchor site |
|
| ProtComp v9.0 | sub-cellular localization |
|
| TargetP v1.1 | localization |
|
Fig. 2Methods used to predict secreted proteins in S. lycopersici
Fig. 3Analysis length of the predicted secreted proteins amino acid residues in S. lycopersici
Fig. 4Analysis of the candidate secreted proteins with different length of signal peptide in S. lycopersici
Fig. 5Percentage of 20 amino acid residues in S. lycopersici candidate secreted protein signal peptides
Amino acids frequency and distribution in cleavage site of signal peptide of the predicted secreted proteins in S. lycopersici
| Kinds of | −3 | −2 | −1 | 1 | 2 | 3 | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Amino acid | No. | pct.(%) | No. | pct.(%) | No. | pct.(%) | No. | pct.(%) | No. | pct.(%) | No. | pct.(%) |
| A | 252 | 49.32% | 72 | 14.09% | 426 | 83.37% | 123 | 24.07% | 29 | 5.68% | 44 | 8.61% |
| V | 133 | 26.03% | 21 | 4.11% | 0 | 0.00% | 20 | 3.91% | 30 | 5.87% | 47 | 9.20% |
| T | 48 | 9.39% | 35 | 6.85% | 6 | 1.17% | 28 | 5.48% | 42 | 8.22% | 66 | 12.92% |
| S | 33 | 6.46% | 104 | 20.35% | 28 | 5.48% | 47 | 9.20% | 34 | 6.65% | 49 | 9.59% |
| I | 20 | 3.91% | 15 | 2.94% | 0 | 0.00% | 18 | 3.52% | 8 | 1.57% | 48 | 9.39% |
| G | 11 | 2.15% | 6 | 1.17% | 38 | 7.44% | 23 | 4.50% | 21 | 4.11% | 17 | 3.33% |
| C | 7 | 1.37% | 10 | 1.96% | 5 | 0.98% | 4 | 0.78% | 5 | 0.98% | 11 | 2.15% |
| L | 5 | 0.98% | 92 | 18.00% | 0 | 0.00% | 36 | 7.05% | 22 | 4.31% | 51 | 9.98% |
| H | 1 | 0.20% | 16 | 3.13% | 0 | 0.00% | 42 | 8.22% | 7 | 1.37% | 14 | 2.74% |
| R | 1 | 0.20% | 6 | 1.17% | 0 | 0.00% | 17 | 3.33% | 5 | 0.98% | 13 | 2.54% |
| D | 0 | 0.00% | 5 | 0.98% | 0 | 0.00% | 7 | 1.37% | 29 | 5.68% | 17 | 3.33% |
| E | 0 | 0.00% | 15 | 2.94% | 0 | 0.00% | 9 | 1.76% | 13 | 2.54% | 6 | 1.17% |
| F | 0 | 0.00% | 33 | 6.46% | 0 | 0.00% | 15 | 2.94% | 9 | 1.76% | 25 | 4.89% |
| K | 0 | 0.00% | 1 | 0.20% | 0 | 0.00% | 10 | 1.96% | 5 | 0.98% | 6 | 1.17% |
| M | 0 | 0.00% | 12 | 2.35% | 0 | 0.00% | 7 | 1.37% | 8 | 1.57% | 6 | 1.17% |
| N | 0 | 0.00% | 21 | 4.11% | 0 | 0.00% | 14 | 2.74% | 26 | 5.09% | 18 | 3.52% |
| Q | 0 | 0.00% | 30 | 5.87% | 3 | 0.59% | 76 | 14.87% | 18 | 3.52% | 20 | 3.91% |
| W | 0 | 0.00% | 1 | 0.20% | 0 | 0.00% | 7 | 1.37% | 5 | 0.98% | 6 | 1.17% |
| Y | 0 | 0.00% | 16 | 3.13% | 0 | 0.00% | 8 | 1.57% | 13 | 2.54% | 10 | 1.96% |
| P | 0 | 0.00% | 0.00% | 5 | 0.98% | 0 | 0.00% | 182 | 35.62% | 37 | 7.24% | |
pct percentage, A alanine, C cysteine, D aspartic acid, E glutamic acid, F phenylalanine, G glycine, H histidine, I isoleucine, K lysine, L leucine, M methionine, N asparagine, P proline, Q glutamine, R arginine, S serine, T threonine, V valine, W tryptophan, Y tyrosine
The highest frequency of amino acid residue around the cleavage site of signal peptide in S. lycopersici and other organisms [36]
| Organisms | −3 | −2 | −1 | + 1 | + 2 | + 3 |
|---|---|---|---|---|---|---|
| A | S | A | A | P | T | |
|
| A | S | A | A | P | T |
|
| A | S | A | A | P | T |
|
| A | L | A | A | D | L |
|
| A | A | A | A | A | A |
|
| A | L | A | A | P | – |
|
| A | S | A | A | P | L |
|
| A | N | A | A | S | S/W |
|
| V | S | A | Q | P | I |
|
| V | S | A | Q | P | L |
-: No statistics
Fig. 6Gene ontology (GO) annotation of the predicted secreted proteins of S. lycopersici. The best 305 identified proteins hits were aligned to the GO database and assigned to GO term
Fig. 7Predicted PHI proteins of the candidate secreted proteins of S. lycopersici
A summary of the carbohydrate-active proteins predicted in S. lycopesici and comparison of these data with those from P. parasitica and P. infestans
|
|
|
| ||||
|---|---|---|---|---|---|---|
| CAZyme families | CAZyme modules | CAZyme families | CAZyme modules | CAZyme families | CAZyme modules | |
| GH | 38 | 98 | 37 | 293 | 36 | 275 |
| GT | 1 | 1 | 29 | 169 | 30 | 158 |
| PL | 5 | 22 | 3 | 47 | 5 | 67 |
| CE | 8 | 40 | 12 | 113 | 12 | 90 |
| CBM | 17 | 42 | 14 | 85 | 16 | 74 |
| AA | 7 | 56 | 4 | 43 | 8 | 42 |
conserved domains search of the predicted small cysteine-rich secreted proteins (SCRSPs) against CDD database
| Gene code | Hit type | Length | Math site | E-Value | Accession | Description of SCRSP |
|---|---|---|---|---|---|---|
| KNG51615.1 | specific | 156 | 28–151 | 8.60E-49 | cd12798 | alt_a1 major allergen |
| KNG51272.1 | superfamily | 138 | 38–137 | 2.27E-54 | cl03937 | polysaccharide lyase family 3 protein |
| KNG49607.1 | specific | 90 | 21–79 | 0.000154 | pfam05730 | hypothetical protein TW65_03780 |
| KNG48427.1 | specific | 153 | 46–116 | 2.38E-05 | pfam05730 | hypothetical protein TW65_04690 |
| KNG46663.1 | specific | 143 | 42–139 | 4.84E-43 | cd00606 | guanyl-specific ribonuclease f1 |
| KNG46057.1 | superfamily | 170 | 73–155 | 4.97E-07 | cl09109 | snoal-like polyketide cyclase family protein |
| KNG45713.1 | specific | 147 | 6–128 | 5.00E-53 | COG1956 | gaf domain nucleotide-binding protein |
| KNG50949.1 | specific | 154 | 44–147 | 1.45E-36 | cd00448 | translation initiation inhibitor |
| KNG48201.1 | superfamily | 141 | 22–140 | 5.36E-65 | cl06331 | eliciting plant response protein |
| KNG47345.1 | superfamily | 160 | 24–153 | 5.19E-31 | cl17157 | major allergen alt |
| KNG47745.1 | specific | 36 | 51–86 | 1.75E-07 | cd00118 | hypothetical protein TW65_05484 |
| KNG46314.1 | specific | 101 | 29–89 | 1.04E-20 | pfam06766 | hypothetical protein TW65_86951 |