| Literature DB >> 20146824 |
Jaeyoung Choi1, Jongsun Park, Donghan Kim, Kyongyong Jung, Seogchan Kang, Yong-Hwan Lee.
Abstract
BACKGROUND: Fungi secrete various proteins that have diverse functions. Prediction of secretory proteins using only one program is unsatisfactory. To enhance prediction accuracy, we constructed Fungal Secretome Database (FSD). DESCRIPTION: A three-layer hierarchical identification rule based on nine prediction programs was used to identify putative secretory proteins in 158 fungal/oomycete genomes (208,883 proteins, 15.21% of the total proteome). The presence of putative effectors containing known host targeting signals such as RXLX [EDQ] and RXLR was investigated, presenting the degree of bias along with the species. The FSD's user-friendly interface provides summaries of prediction results and diverse web-based analysis functions through Favorite, a personalized repository.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20146824 PMCID: PMC2836287 DOI: 10.1186/1471-2164-11-105
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
List of prediction programs used in FSD
| Prediction Program | Description | Ref |
|---|---|---|
| SignalP 3.0 | A program to predict whether a protein has the signal peptidase site I or not | [ |
| SigCleave | A program to predict whether a protein has signal peptides or not | [ |
| SigPred | A program to predict whether a protein has signal peptides or not | [ |
| RPSP | A program to predict whether a protein has signal peptides or not | [ |
| TMHMM 2.0c | A program to predict whether a protein has trans-membrane helix(es) or not | [ |
| TargetP 1.1b | A program to predict a site where a protein probably resides | [ |
| PSort II | A program to predict a site where a protein probably resides | [ |
| SecretomeP 1.0f | A program to predict whether a protein is secreted by non-classical pathways or not | [ |
| predictNLS | A program to predict whether a protein has nuclear localization signal or not | [ |
List of references and annotation results of characterized fungal secretory proteins
| Title | Total Identified Proteins | Class SP | Class SP3 | Class SL | Putative Secretome | Ref |
|---|---|---|---|---|---|---|
| Crucial Role of Antioxidant Proteins and Hydrolytic Enzymes in Pathogenicity of | 21 | 5 | 1 | 0 | 6 | [ |
| Crucial Role of Antioxidant Proteins and Hydrolytic Enzymes in Pathogenicity of | 21 | 1 | 2 | 0 | 3 | [ |
| The | 49 | 25 | 5 | 0 | 30 | [ |
| An analysis of the | 46 | 28 | 19 | 2 | 49 | [ |
| An analysis of the | 45 | 0 | 5 | 1 | 6 | [ |
| The secretome of the maize pathogen | 386 | 352 | 18 | 10 | 380 | [ |
| The secretome of the maize pathogen | 168 | 147 | 15 | 5 | 167 | [ |
| A Catalogue of the Effector Secretome of Plant Pathogenic Oomycetes | 25 | 22 | 1 | 0 | 23 | [ |
| Fungal degradation of wood: initial proteomic analysis of extra cellular proteins of | 11 | 8 | 0 | 0 | 8 | [ |
| Comparative proteomics of extracellular proteins in vitro and in planta from the pathogenic fungus | 120 | 63 | 8 | 0 | 71 | [ |
| Expression analysis of extracellular proteins from | 27 | 16 | 4 | 0 | 20 | [ |
| Dandruff-associated Malassezia genomes reveal convergent and divergent virulence traits shared with plant and human fungal pathogens | 34 | 28 | 0 | 0 | 28 | [ |
| Adaptive Evolution Has Targeted the C-Terminal Domain of the RXLR Effectors of Plant Pathogenic Oomycetes | 79 | 79 | 0 | 0 | 79 | [ |
| Genome, transcriptome, and secretome analysis of wood decay fungus Postia placenta supports unique mechanisms of lignocellulose conversion. | 47 | 29 | 3 | 1 | 33 | [ |
| Host-Microbe Interactions: Shaping the Evolution of the Plant Immune Response | 14 | 12 | 0 | 1 | 13 | [ |
| 1,093 | 815 | 81 | 20 | 916 | - | |
Figure 1FSD class definitions and the FSD pipeline. (A) Definitions of four FSD classes. The gray round rectangle indicates the total set of proteins, and the light blue arrows going outside the rectangle show the filtering out processes of the pipeline. The black rectangles show the names of the classes, the yellow arrows indicate expansion of the putative secretome boundary, and the white-bordered blue cross indicates additional information on the putative secretome. (B) Structure of the FSD pipeline. The two parallelograms are input data for the FSD pipeline. The rectangle in the middle indicates the process for identifying putative secretory proteins. The round rectangles indicate the four FSD classes. The gray square on the right represents the thirteen different analysis functions in Favorite.
Class definitions used in FSD
| Class | Description* |
|---|---|
| Class SP | Proteins which are predicted by SignalP 3.0 |
| Class SP3 | Proteins which are predicted by SigPred, SigCleave, or RPSP |
| Class SL | Proteins which are predicted by PSort II or TargetP 1.1b, but are not predicted by SignalP 3.0, SigPred, SigCleave, RPSP, or SecretomeP 1.0f |
| Class NS | Proteins which are predicted by SecretomeP 1.0f, but are not predicted by SignalP 3.0, SigPred, SigCleave, or RPSP |
* Proteins as follows were removed from all four classes described in this table: proteins which i) contain more than one trans-membrane helixes, ii) have ER retention signals, iii) predicted as mitochondrial proteins by PSort II and TargetP 1.1b, and iv) predicted as nuclear proteins by TargetP 1.1b and predictNLS.
List and distribution of secretion-associated proteins of the fungal genomes belonging to the subphylum Pezizomycotina archived in FSD
| Species | Size (Mb) | # of ORFs | Class SP | Class SP3 | Class SL | Putative Secretome | Ref |
|---|---|---|---|---|---|---|---|
| | |||||||
| | |||||||
| | 27.9 | 9,121 | 754 | 732 | 81 | 1,567 | [ |
| | 36.8 | 12,604 | 1,200 | 990 | 142 | 2,332 | [ |
| | 29.2 | 9,929 | 807 | 878 | 67 | 1,752 | [ |
| | 29.4 | 9,887 | 781 | 909 | 84 | 1,774 | [ |
| | 30.1 | 10,568 | 922 | 877 | 96 | 1,895 | [ |
| | 37.2 | 11,200 | 860 | 883 | 88 | 1,831 | N |
| | 34.0 | 14,086 | 1,142 | 1,320 | 154 | 2,616 | [ |
| | 37.1 | 12,063 | 1,060 | 1,064 | 145 | 2,269 | [ |
| | 29.3 | 10,406 | 934 | 916 | 81 | 1,931 | [ |
| | 42.7 | 16,448 | 1,163 | 1,287 | 182 | 2,632 | N |
| | 34.9 | 11,124 | 1,121 | 923 | 99 | 2,143 | N |
| | 27.7 | 10,663 | 548 | 957 | 80 | 1,585 | N |
| | 28.8 | 10,408 | 575 | 920 | 66 | 1,561 | N |
| | 27.6 | 10,465 | 539 | 892 | 65 | 1,496 | N |
| | 28.9 | 10,457 | 476 | 855 | 102 | 1,433 | [ |
| | 28.1 | 9,964 | 546 | 838 | 95 | 1,479 | N |
| | 27.5 | 10,125 | 558 | 869 | 91 | 1,518 | N |
| | 34.9 | 9,633 | 932 | 725 | 83 | 1,740 | N |
| | 43.9 | 11,184 | 1,040 | 951 | 93 | 2,084 | N |
| | 15.1 | 6,694 | 373 | 386 | 47 | 806 | [ |
| | 36.1 | 13,920 | 1,370 | 1,072 | 118 | 2,560 | N |
| | 36.6 | 13,339 | 1,282 | 1,004 | 118 | 2,404 | [ |
| | 61.4 | 17,608 | 1,613 | 1,297 | 147 | 3,057 | N |
| | 51.3 | 15,707 | 1,381 | 1,242 | 155 | 2,778 | [ |
| | 41.9 | 14,199 | 1,347 | 1,071 | 116 | 2,534 | N |
| | 29.9 | 7,454 | 357 | 578 | 96 | 1,031 | N |
| | 41.3 | 8,038 | 393 | 583 | 103 | 1,079 | N |
| | 39.0 | 9,547 | 468 | 842 | 87 | 1,397 | N |
| | 37.9 | 9,445 | 492 | 832 | 99 | 1,423 | N |
| | 33.0 | 9,349 | 398 | 736 | 79 | 1,213 | [ |
| | 41.7 | 11,069 | 1,573 | 833 | 64 | 2,470 | [ |
| | 23.3 | 8,777 | 564 | 702 | 88 | 1,354 | N |
| | 23.3 | 8,876 | 629 | 669 | 52 | 1,350 | N |
| | 73.4 | 10,327 | 770 | 778 | 81 | 1,629 | N |
| | 41.9 | 11,395 | 979 | 913 | 81 | 1,973 | N |
| | 32.6 | 10,403 | 959 | 818 | 84 | 1,861 | [ |
| | 39.2 | 9,842 | 817 | 788 | 61 | 1,666 | [ |
| | 34.2 | 9,572 | 788 | 749 | 78 | 1,615 | N |
| | 37.3 | 9,948 | 823 | 800 | 88 | 1,711 | N |
| | 37.8 | 10,640 | 849 | 895 | 73 | 1,817 | N |
| | 33.0 | 9,136 | 402 | 808 | 71 | 1,281 | N |
| | 29.1 | 9,264 | 470 | 823 | 92 | 1,385 | N |
| | 30.0 | 8,741 | 425 | 743 | 55 | 1,223 | N |
| | 32.2 | 12,791 | 947 | 1,008 | 127 | 2,082 | [ |
| | 28.6 | 10,638 | 713 | 792 | 109 | 1,614 | N |
| | 35.7 | 10,596 | 1,127 | 893 | 124 | 2,144 | [ |
| | 38.0 | 12,169 | 1,228 | 912 | 123 | 2,263 | N |
| | 38.3 | 14,522 | 971 | 1,109 | 147 | 2,227 | N |
| | 38.7 | 8,806 | 697 | 658 | 66 | 1,421 | N |
| | 37.2 | 15,983 | 1,511 | 1,309 | 142 | 2,962 | [ |
| | 35.7 | 13,252 | 748 | 1,116 | 114 | 1,978 | N |
| | 37.0 | 9,815 | 877 | 855 | 67 | 1,799 | N |
| | 36.1 | 11,100 | 907 | 935 | 86 | 1,928 | N |
| | 33.5 | 9,129 | 738 | 766 | 70 | 1,574 | [ |
| | 38.8 | 11,643 | 933 | 1,009 | 93 | 2,035 | N |
| | 24.2 | 8,576 | 571 | 699 | 69 | 1,339 | N |
| | 22.3 | 7,798 | 485 | 626 | 64 | 1,175 | [ |
| | 32.9 | 10,239 | 1,074 | 815 | 73 | 1,962 | N |
| | 33.9 | 10,575 | 1,157 | 861 | 77 | 2,095 | N |
| 2,059.4 | 641,257 | 50,164 | 52,111 | 5,578 | 107,853 | - | |
a Taxonomy based on [69]
b Insufficient exon/intron information
c Incomplete coverage of genome information
List and distribution of secretion-associated proteins of the fungal genomes belonging to the subphylum Saccharomycotina and Taphrinomycotina archived in FSD
| Species | Size (Mb) | # of ORFs | Class SP | Class SP3 | Class SL | Putative Secretome | Ref |
|---|---|---|---|---|---|---|---|
| | |||||||
| | |||||||
| | 14.3 | 6,185 | 321 | 405 | 87 | 813 | [ |
| | 14.5 | 6,160 | 310 | 385 | 78 | 773 | [ |
| | 14.5 | 6,027 | 308 | 340 | 71 | 719 | N |
| | 12.3 | 5,165 | 231 | 290 | 49 | 570 | [ |
| | 10.6 | 5,920 | 279 | 400 | 63 | 742 | [ |
| | 12.1 | 5,941 | 310 | 482 | 50 | 842 | [ |
| | 13.1 | 5,733 | 308 | 321 | 83 | 712 | [ |
| | 14.6 | 6,258 | 360 | 373 | 76 | 809 | [ |
| | 12.2 | 6,354 | 254 | 357 | 74 | 685 | [ |
| | 8.8 | 4,717 | 204 | 333 | 35 | 572 | [ |
| | 10.7 | 5,327 | 248 | 304 | 60 | 612 | [ |
| | 14.7 | 5,367 | 219 | 276 | 58 | 553 | [ |
| | 10.9 | 4,935 | 187 | 280 | 41 | 508 | [ |
| | 15.5 | 5,802 | 253 | 351 | 50 | 654 | [ |
| | 15.4 | 5,839 | 263 | 374 | 58 | 695 | [ |
| | 11.9 | 4,966 | 200 | 275 | 44 | 519 | [ |
| | 11.5 | 9,385 | 663 | 767 | 141 | 1571 | [ |
| | 11.4 | 4,677 | 177 | 240 | 46 | 463 | [ |
| | 12.5 | 5,354 | 223 | 261 | 51 | 535 | [ |
| | 12.5 | 5,382 | 224 | 290 | 53 | 567 | [ |
| | 12.5 | 5,400 | 232 | 267 | 53 | 552 | [ |
| | 11.2 | 5,451 | 220 | 364 | 63 | 647 | N |
| | 12.5 | 5,332 | 226 | 263 | 47 | 536 | [ |
| | 12.5 | 5,318 | 225 | 253 | 52 | 530 | [ |
| | 12.4 | 5,349 | 229 | 260 | 48 | 537 | [ |
| | 12.4 | 5,347 | 227 | 263 | 46 | 536 | [ |
| | 12.5 | 5,359 | 224 | 265 | 51 | 540 | [ |
| | 12.6 | 5,364 | 221 | 271 | 50 | 542 | [ |
| | 12.5 | 5,890 | 224 | 268 | 48 | 540 | [ |
| | 12.2 | 5,377 | 230 | 263 | 48 | 541 | [ |
| | 12.5 | 5,375 | 228 | 270 | 52 | 550 | [ |
| | 12.4 | 5,346 | 225 | 264 | 55 | 544 | [ |
| | 12.4 | 5,346 | 227 | 258 | 48 | 533 | [ |
| | 10.8 | 6,755 | 249 | 399 | 62 | 710 | [ |
| | 12.5 | 5,408 | 226 | 264 | 57 | 547 | [ |
| | 12.6 | 5,360 | 228 | 261 | 49 | 538 | [ |
| | 11.7 | 5,696 | 264 | 283 | 63 | 610 | N |
| | 12.2 | 6,692 | 394 | 425 | 99 | 918 | [ |
| | 12.4 | 5,433 | 233 | 269 | 55 | 557 | [ |
| | 12.6 | 5,329 | 218 | 268 | 51 | 537 | [ |
| | 12.6 | 5,350 | 217 | 264 | 47 | 528 | [ |
| | 12.6 | 5,334 | 220 | 266 | 51 | 537 | [ |
| | 12.6 | 5,392 | 225 | 269 | 51 | 545 | [ |
| | 12.6 | 5,368 | 226 | 266 | 56 | 548 | [ |
| | 12.4 | 5,467 | 237 | 271 | 52 | 560 | [ |
| | 12.6 | 5,370 | 223 | 268 | 57 | 548 | [ |
| | 12.3 | 5,415 | 239 | 262 | 60 | 561 | [ |
| | 12.6 | 5,377 | 223 | 271 | 49 | 543 | [ |
| | 12.5 | 5,376 | 227 | 265 | 47 | 539 | [ |
| | 12.0 | 5,903 | 293 | 303 | 59 | 655 | [ |
| | 12.4 | 5,341 | 223 | 255 | 45 | 523 | [ |
| | 12.4 | 5,353 | 224 | 258 | 47 | 529 | [ |
| | 12.5 | 5,351 | 224 | 256 | 54 | 534 | [ |
| | 12.4 | 5,364 | 230 | 269 | 54 | 553 | [ |
| | 10.7 | 6,648 | 229 | 368 | 67 | 664 | [ |
| | 12.5 | 5,354 | 224 | 270 | 51 | 545 | [ |
| | 12.6 | 5,383 | 221 | 254 | 50 | 525 | [ |
| | 12.5 | 5,398 | 215 | 267 | 54 | 536 | [ |
| | 12.6 | 5,373 | 226 | 265 | 51 | 542 | [ |
| | 11.0 | 2,968 | 120 | 180 | 29 | 329 | [ |
| | 11.2 | 3,768 | 187 | 195 | 28 | 410 | [ |
| | 11.5 | 9,016 | 575 | 630 | 154 | 1359 | [ |
| | 10.8 | 3,100 | 161 | 154 | 24 | 339 | [ |
| | 11.9 | 8,939 | 581 | 615 | 138 | 1334 | [ |
| | 20.5 | 6,524 | 409 | 464 | 75 | 948 | [ |
| | |||||||
| | 6.3 | 4,020 | 129 | 333 | 35 | 497 | N |
| | 11.3 | 5,172 | 207 | 312 | 25 | 544 | N |
| | 11.2 | 4,925 | 190 | 263 | 26 | 479 | N |
| | 12.6 | 5,058 | 192 | 288 | 36 | 516 | [ |
| 853.1 | 383,828 | 17,389 | 21,403 | 3,937 | 42,729 | - | |
a Taxonomy based on [69]
b Insufficient exon/intron information
c Incomplete coverage of genome information
List and distribution of secretion-associated proteins of the fungal genomes belonging to the phyla Basidiomycota, Chytridiomycota, and Microsporidia, the subphylum Mucoromycotina, and the phylum Peronosporomycota (oomycetes) archived in FSD
| Species | Size (Mb) | # of ORFs | Class SP | Class SP3 | Class SL | Putative Secretome | Ref |
|---|---|---|---|---|---|---|---|
| | |||||||
| | |||||||
| | 36.3 | 13,410 | 1,189 | 1,032 | 119 | 2,340 | N |
| | 18.9 | 6,980 | 377 | 549 | 56 | 982 | N |
| | 19.0 | 6,870 | 331 | 529 | 44 | 904 | N |
| | 18.5 | 6,431 | 342 | 523 | 39 | 904 | [ |
| | 19.1 | 6,475 | 344 | 541 | 38 | 923 | [ |
| | 64.9 | 20,614 | 1,190 | 2,024 | 256 | 3,470 | [ |
| | 26.7 | 13,560 | 843 | 1,127 | 126 | 2,096 | N |
| | 35.1 | 10,048 | 793 | 933 | 83 | 1,809 | [ |
| | 34.3 | 11,603 | 1,039 | 1,058 | 106 | 2,203 | N |
| | 90.9 | 17,173 | 1,057 | 1,808 | 202 | 3,067 | [ |
| | 38.5 | 13,181 | 975 | 1,175 | 119 | 2,269 | N |
| | |||||||
| | 21.9 | 16,694 | 1305 | 1483 | 233 | 3,021 | N |
| | 88.7 | 20,567 | 1,931 | 2,020 | 230 | 4,181 | N |
| | 21.2 | 5,536 | 187 | 592 | 43 | 822 | N |
| | |||||||
| | 9.0 | 4,286 | 211 | 378 | 37 | 626 | [ |
| | 19.7 | 6,689 | 789 | 583 | 10 | 1382 | [ |
| | 19.3 | 6,950 | 481 | 717 | 34 | 1232 | [ |
| | 19.7 | 6,787 | 574 | 687 | 34 | 1295 | N |
| | |||||||
| | 24.3 | 8,732 | 806 | 750 | 108 | 1,664 | N |
| | 23.9 | 8,818 | 650 | 785 | 91 | 1,526 | N |
| | |||||||
| | 36.6 | 10,930 | 580 | 623 | 83 | 1286 | N |
| | 55.9 | 14,792 | 642 | 1,085 | 221 | 1,948 | N |
| | 46.1 | 17,482 | 750 | 994 | 202 | 1,946 | [ |
| | |||||||
| | 6.1 | 2,606 | 166 | 208 | 62 | 436 | N |
| | 2.5 | 1,996 | 90 | 135 | 34 | 259 | [ |
| | |||||||
| | |||||||
| | 18.0 | 12,175 | 844 | 554 | 569 | 1,967 | N |
| | 16.9 | 15,007 | 1,027 | 643 | 661 | 2,331 | N |
| | 21.0 | 5,387 | 212 | 283 | 267 | 762 | [ |
| | 23.5 | 5,103 | 305 | 280 | 81 | 666 | N |
| | |||||||
| | |||||||
| | 83.6 | 14,789 | 868 | 1,235 | 132 | 2,235 | N |
| | 107.8 | 17,414 | 1,485 | 1,179 | 136 | 2,800 | N |
| | 228.5 | 22,658 | 1,668 | 1,923 | 153 | 3,744 | [ |
| | 66.7 | 15,743 | 1,670 | 1,372 | 91 | 3,133 | [ |
| | 86.0 | 19,027 | 2,040 | 1,662 | 96 | 3,798 | [ |
| 1,449.1 | 386,513 | 27,761 | 31,470 | 4,796 | 64,027 | - | |
a Taxonomy based on [69]
b Insufficient exon/intron information
c Incomplete coverage of genome information
Figure 2Distribution of three classes at the phylum/subphylum level. The average ratios of the classes to the total ORFs at the subphylum and phylum levels are described. The orange circular arc represents the fungal kingdom, and the four light blue round boxes represent phyla or kingdoms. Inside the chart, the blue line represents the ratio of class SP; the red line, class SP3; and the green line, class SL.
Figure 3Composition of RXLR/RXLX [EDQ] pattern in fungi, oomycetes, and . Composition of the RXLX [EDQ] (blue) and the RXLR (red) under class SP (A), class SP3 (B), and class SL (C) with the relative ratio in fungi, oomycetes, and Plasmodium species, respectively.
Figure 4Screenshot of genome-level analysis functions for an example fungal genome. This screenshot shows the ORF numbers and ratios of each class through the pie chart in the left and the table in the right. The numbers in the table provide links to the list of putative secretory proteins belonging to each group. This figure shows the result from M. oryzae.
Figure 5One page summary for a protein. The web page shows a one page summary of amino acid sequence, exon structure, and genome context via the SNUGB [15], along with 12 predictions, including signal peptides and subcellular localization.
Figure 6SNU Genome Browser implemented in the FSD. The SNUGB (http://genomebrowser.snu.ac.kr/)[15] displays i) four types of signal peptides predicted by SignalP 3.0, SigCleave, SigPred, and RPSP, ii) amino acid patterns, iii) nucleotide localization signals predicted by predictNLS, iv) transmembrane helixes predicted by TMHMM 2.0c, and v) hydropathy plots.
Figure 7Thirteen analysis functions in the Favorite browser. Six different pages of analyses, connected to the Favorite browser, are displayed. "Prediction distribution" provides a list of predicted secretory proteins with their proportion to all proteins. "Class distribution" shows the composition of the classes, with the protein numbers belonging to each class. "Frequency/Position distribution" gives a bar or pie graph and numerical values linking to proteins listed for each item. "Hydropathy plots" draws the two graphs with window sizes of 11 and 19. "Amino acid distribution" presents consensus amino acids around the cleavage sites. "Functional domain distribution" lists the domains and their architecture diagrams based on InterPro terms.