| Literature DB >> 32375409 |
Karla Gisel Carreón-Anguiano1, Ignacio Islas-Flores2, Julio Vega-Arreguín3, Luis Sáenz-Carbonell1, Blondy Canto-Canché1.
Abstract
Pathogens are able to deliver small-secreted, cysteine-rich proteins into plant cells to enable infection. The computational prediction of effector proteins remains one of the most challenging areas in the study of plant fungi interactions. At present, there are several bioinformatic programs that can help in the identification of these proteins; however, in most cases, these programs are managed independently. Here, we present EffHunter, an easy and fast bioinformatics tool for the identification of effectors. This predictor was used to identify putative effectors in 88 proteomes using characteristics such as size, cysteine residue content, secretion signal and transmembrane domains.Entities:
Keywords: computational prediction; effector proteins; fungal secretome; host-pathogen interaction
Mesh:
Substances:
Year: 2020 PMID: 32375409 PMCID: PMC7277995 DOI: 10.3390/biom10050712
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Bioinformatics tools integrated into EffHunter.
| Program | Features | Website | Reference |
|---|---|---|---|
| Perl/Bioperl | International association of users and developers of open-source Perl tools for bioinformatics, genomics and life science. |
| [ |
| SignalP 4.1 | Predicts the presence of signal peptides and the location of their cleavage sites in proteins from gram-positive bacteria, gram-negative bacteria and eukarya. |
| [ |
| Phobius | This server is for the prediction of transmembrane topology and signal peptides from the amino acid sequence of a protein. |
| [ |
| WoLFPSORT | Converts protein amino acid sequences into numerical localization features, based on sorting signals, amino acid composition and functional motifs, to predict protein subcellular location. |
| [ |
| TMHMM 2.0 | Predicts trans-membrane (TM) domain helices in proteins. |
| [ |
Positive data set of effector proteins used in this work.
| Species | Effector Proteins |
|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Avrk1, Avra1, Avra13, | |
| AvrPm2 | |
|
| |
|
|
|
|
|
|
|
|
|
|
| Avr2, Avr4, Avr4E, Avr5, Avr9, Ecp1, Ecp2, Ecp4, Ecp5, Ecp6 |
|
|
|
|
| CgEP1, Cgfl |
|
| FGL1 |
|
|
|
| Six1, Six2, Six3, Six4, Six5, Six6, Six7, Six8 | |
|
|
|
|
|
|
|
| MiSSP7 |
|
| AvrLm1, AvrLm4–7, AvrLm6, AvrLm11, |
|
| Avr1-CO39, Avr-Pia, AvrPib, Avr-Pita, Avr-Pii, Avr-Pik, AvrPi9, AvrPiz-t, Bas1, Bas2, Bas3, Bas4, Bas107, Bas162, |
|
| AvrL2-A, AvrL567-A, AvrM, AvrM14, AvrP4, AvrP123, |
|
|
|
|
|
|
|
|
|
|
| PpEC23 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| AvrSr50, PGTAUSPE-10-1 | |
| Pec6, PstSCR1 | |
|
| ToxB |
|
| NIP1, NIP2, NIP3 |
|
| SsSSVP1 |
|
| ToxA, Tox1, Tox3 |
|
|
|
|
| RTP1 |
|
| UhAvr1 |
|
| Cmu1, eff1-1, |
|
| Ave1, PevD1, Vdlsc1, VdSCP7 |
|
| AvrStb6, Zt6 |
* One hundred fifty effector proteins were pooled from the list (94) collected by Sperschneider et al. [15] and 56 effectors retrieved from the PHI database (labeled in bold). All these effector proteins have been published in peer-reviewed journals.
Validation of EffHunter for prediction of effector proteins and comparison with EffectorP 2.0.
|
| ||||||||||||||
| Data* | Proteins in Data set | Total Proteins | Length (30—400aas) | >4 Cysteine | Signal peptide by SignalP/Phobius | Proteins without TMD with TMHMM | Total prediction | Results | Sen/Rec | Spe | PPV/Prec | ACC | FPR | F1 score |
| Set 1 | 150 | 4680 | 765 | 435 | 107 | 105 | 105 | 105 | 70% | 100% | 100% | 99% | 0.00% | 0.82 |
| Set 2 | 2329 | 0 | ||||||||||||
| Set 3 | 476 | 0 | ||||||||||||
| Set 4 | 1725 | 0 | ||||||||||||
|
| ||||||||||||||
| Data* | Proteins in data set | Total proteins | Total prediction | Prediction | Sen/Rec | Spe | PPV/Prec | ACC | FPR | F1 score | ||||
| Set 1 | 150 | 4680 | 105 | 105 | 70% | 100% | 100% | 99% | 0.00% | 0.82 | ||||
| Set 2 | 2329 | 0 | ||||||||||||
| Set 3 | 476 | 0 | ||||||||||||
| Set 4 | 1725 | 0 | ||||||||||||
|
| ||||||||||||||
| Data* | Proteins in data set | Total proteins | Total prediction | Prediction | Sen/Rec | Spe | PPV/Prec | ACC | FPR | F1 score | ||||
| Set 1 | 150 | 4680 | 166 | 102 | 68% | 98% | 61% | 97% | 1.41% | 0.64 | ||||
| Set 2 | 2329 | 41 | ||||||||||||
| Set 3 | 476 | 22 | ||||||||||||
| Set 4 | 1725 | 1 | ||||||||||||
|
| ||||||||||||||
| Data* | Proteins in data set | Total proteins | Total prediction | Prediction | Sen/Rec | Spe | PPV/Prec | ACC | FPR | F1 score | ||||
| Set 1 | 150 | 4680 | 164 | 91 | 60% | 98% | 55% | 97% | 1.6% | 0.57 | ||||
| Set 2 | 2329 | 49 | ||||||||||||
| Set 3 | 476 | 20 | ||||||||||||
| Set 4 | 1725 | 4 | ||||||||||||
| Data* | Proteins in data set | Total proteins | Total prediction | Prediction | Sen/Rec | Spe | PPV/Prec | ACC | FPR | F1 score | ||||
| Set 1 | 150 | 4680 | 72 | 72 | 48% | 100% | 100% | 98% | 0.00% | 0.64 | ||||
| Set 2 | 2329 | 0 | ||||||||||||
| Set 3 | 476 | 0 | ||||||||||||
| Set 4 | 1725 | 0 | ||||||||||||
* Set 1: Positive dataset of true effectors (positive dataset comprises effectors retrieved from pathogen-host interaction (PHI) and the list collected by Sperschneider et al. [15]); Set 2: ABC transporters; Set 3: cytochrome P450; Set 4: proteins classified as major facilitator transporter superfamily. Sen/Rec: Sensitivity/Recall; Spe: Specificity; PPV/Prec: Positive Predictive Value/Precision; ACC: Accuracy; FPR: False positive rate; F1 score: Measure of the success of binary classifier (score reaches its best value at 1, and worst score at 0).
Data analysis of the set of fungal effector proteins between EffHunter and other reports.
| Species | Criteria in Reference | Genome Size (Mbp) | Total Proteome | Secretome | Effectors Prediction in Reference | EffHunter Prediction | Shared | Difference in: | Observations for Effectors Predicted by Reference or by EffHunter | Summary of True or False Positives or Negatives, or Ambiguous in the Specific Sets of Effectors (Considering Both Predictions) |
|---|---|---|---|---|---|---|---|---|---|---|
| Secretion signal with SignalP 4.1 and SecretomeP, no TMD with TMHMM 2.0, no hits outside powdery mildews with Blastp; subcellular localization with TargetP 1.1 and GPI anchors by Big-PI. | 124.49 | 7118 | 726 | 494 | 490 | 408 | Reference | |||
| E = 82 | EffHunter | |||||||||
| Secretion signal with SignalP 4.1, one or no TM domains with TMHMM 2.0, subcellular localization with TargetP 1.1 and WoLFPSORT, no GPI anchor with PredGPI, length <250aas, >2% cysteine residues | 74.1 | 13,107 | 584 | 105 | 136 | 78 | Reference | |||
| E = 58 | EffHunter | |||||||||
| Size <200 amino acids; secretion signal with SignalP 4.1, one or no TM domain with TMHMM 2.0, secreted by TargetP 1.1, no GPI-anchor with big-PI, subcellular localization with WoLFPSORT and ProtComp, and no functional information | 39.7 | 10933 | 492 | 171 | 183 | 110 | Reference | |||
| E = 74 | EffHunter |
* Ambiguous: Those candidates that meet criteria from one prediction (positive for this analysis), but do not meet criteria of the other analysis and criteria from one or the other are not definitive for assigning them as positive or negative. Databases analyzed in the references and by EffHunter were the same, except for M. graminicola. The authors did not provide that database; the nonredundant protein models from M. graminicola at JGI were downloaded in that case.
Figure 1Complete EffHunter workflow for the prediction of effectors in fungal proteomes. Red square, custom tools (user parameters); blue square, pre-installed prediction tools. RFS, retrieved FASTA sequences. The pipeline works correctly either on total proteomes or secretomes.
Figure 2Effector prediction of positive control set. (a) Venn diagram showing the distribution of shared and non-shared predicted effectors by EffHunter and EffectorP 2.0. (b) Pie chart summarizing the characteristics of the 41 non-shared effectors protein predicted by EffectorP 2.0.
Effector prediction data from the different predictions tool across 12 proteomes.
| Species | Lifestyle | Genome | Total Proteins | Effector Predictions | Effectors in Reference | Reference Genome | |||
|---|---|---|---|---|---|---|---|---|---|
| Mb | Coverage | EffHunter | EffectorP 2.0 | *SECRETOOL | |||||
|
| Necrotroph | 31.03 | 120× | 10688 | 227 | 113 | 228 | 139 | [ |
|
| Biotroph | 158.94 | 13× | 6526 | 255 | 109 | 143 | 437 | [ |
|
| Biotroph | 61.11 | 21× | 14127 | 342 | 151 | 296 | 271 | [ |
|
| Hemibiotroph | 51.6 | 9× | 12006 | 364 | 159 | 352 | 177 | [ |
|
| Hemibiotroph | 55.72 | 186.1× | 17726 | 474 | 256 | 361 | 364 | [ |
|
| Hemibiotroph | 44.81 | 8.31× | 12469 | 290 | 162 | 263 | 529 | [ |
|
| Hemibiotroph | 41.7 | 7× | 12755 | 273 | 368 | 528 | 163 | [ |
|
| Hemibiotroph | 39.7 | 8.9× | 10933 | 286 | 166 | 235 | NS | [ |
|
| Hemibiotroph | 228.54 | 7.6× | 17787 | 355 | 404 | 343 | 563 | [ |
| Biotroph | 88.64 | 6.9× | 15979 | 659 | 605 | 612 | 1106 | [ | |
|
| Necrotroph | 37.84 | 98× | 12169 | 322 | 182 | 328 | 317 | [ |
|
| Biotroph | 19.66 | 10× | 6785 | 113 | 107 | 142 | 426 | [ |
* Prediction in Sonah et al. [10]; NS: Not specified
Comparison of prediction between EffHunter and EffectorP 2.0 on noncanonical effectors.
| Species | Effector | Length | No. of Cysteine | Signal Peptide | *TMD | EffHunter | EffectorP 2.0 |
|---|---|---|---|---|---|---|---|
|
| SAD1 | 626 | 4 | No | 0 | Non-effector | Effector |
|
| Mg3LysM | 232 | 9 | Yes | 0 | Effector | Non-effector |
| BEC1054 | 118 | 2 | Yes | 0 | Non-effector | Effector | |
| BEC1011 | 118 | 3 | Yes | 0 | Non-effector | Effector | |
| BEC1019 | 316 | 8 | Yes | 0 | Effector | Non-effector | |
| CSEP0055 | 122 | 3 | Yes | 0 | Non-effector | Effector | |
| Bcg1 | 146 | 2 | Yes | 0 | Non-effector | Effector | |
| CSEP0105 | 128 | 6 | Yes | 0 | Effector | Non-effector | |
|
| SIS1 | 149 | 2 | Yes | 1 | Non-effector | Effector |
|
| Xyla | 231 | 1 | Yes | 0 | Non-effector | Effector |
|
| PIIN 08944 | 120 | 0 | Yes | 0 | Non-effector | Non-effector |
| AvrPm3 | 130 | 2 | Yes | 0 | Non-effector | Effector | |
| AvrSr35 | 577 | 3 | Yes | 0 | Non-effector | Non-effector |
* TMD: Transmembrane Domain.
Figure 3Effectoromes in fungal and oomycetes phytopathogens predicted by EffHunter.
Figure 4Predicted effectors in fungal and oomycetes proteomes using EffHunter.