| Literature DB >> 21300622 |
Abstract
The Fungal Secretome KnowledgeBase (FunSecKB) provides a resource of secreted fungal proteins, i.e. secretomes, identified from all available fungal protein data in the NCBI RefSeq database. The secreted proteins were identified using a well evaluated computational protocol which includes SignalP, WolfPsort and Phobius for signal peptide or subcellular location prediction, TMHMM for identifying membrane proteins, and PS-Scan for identifying endoplasmic reticulum (ER) target proteins. The entries were mapped to the UniProt database and any annotations of subcellular locations that were either manually curated or computationally predicted were included in FunSecKB. Using a web-based user interface, the database is searchable, browsable and downloadable by using NCBI's RefSeq accession or gi number, UniProt accession number, keyword or by species. A BLAST utility was integrated to allow users to query the database by sequence similarity. A user submission tool was implemented to support community annotation of subcellular locations of fungal proteins. With the complete fungal data from RefSeq and associated web-based tools, FunSecKB will be a valuable resource for exploring the potential applications of fungal secreted proteins. Database URL: http://proteomics.ysu.edu/secretomes/fungi.php.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21300622 PMCID: PMC3263735 DOI: 10.1093/database/bar001
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Overview of FunSecKB. To search the database users can enter NCBI RefSeq gi or accession number, UniProt accession number, keywords or species. The database consists of information generated using seven prediction tools and subcellular location annotated in UniProtKB and our own manual curation. Users can browse through the results using the web user-interface. Links to external databases and resources are also provided for further exploration. Whole secretome sequences can be downloaded and BLAST utility can be accessed from the database interface.
Summary of genome size, proteome size, secretome size in different fungi
| Species | Phylum | Genome (Mb) | Predicted Proteome | Predicted Secretome | Curated Secretome | GPI-anchored Secretome | Soluble Secretome | Secretome (%) | GPI-anchored Portion (%) |
|---|---|---|---|---|---|---|---|---|---|
| Ascomycota | 31 | 9313 | 224 | 0 | 25 | 199 | 2.4 | 11.2 | |
| Ascomycota | 74 | 9587 | 335 | 0 | 51 | 284 | 3.5 | 15.2 | |
| Ascomycota | 8 | 4725 | 93 | 2 | 21 | 72 | 2.0 | 22.6 | |
| Ascomycota | 28 | 9121 | 571 | 17 | 71 | 500 | 6.3 | 12.4 | |
| Ascomycota | 36 | 13 487 | 951 | 25 | 100 | 851 | 7.1 | 10.5 | |
| Ascomycota | 29 | 9630 | 624 | 58 | 74 | 550 | 6.5 | 11.9 | |
| Ascomycota | 30 | 9541 | 704 | 29 | 76 | 628 | 7.4 | 10.8 | |
| Ascomycota | 34 | 14 102 | 832 | 253 | 82 | 750 | 5.9 | 9.9 | |
| Ascomycota | 37 | 12 074 | 843 | 28 | 85 | 758 | 7.0 | 10.1 | |
| Ascomycota | 29 | 10 401 | 774 | 23 | 70 | 704 | 7.4 | 9.0 | |
| Ascomycota | 39 | 16 389 | 755 | 4 | 92 | 663 | 4.6 | 12.2 | |
| Ascomycota | 28 | 14 633 | 449 | 41 | 117 | 332 | 3.1 | 26.1 | |
| Ascomycota | 16 | 5860 | 184 | 0 | 55 | 129 | 3.1 | 29.9 | |
| Ascomycota | 12 | 5192 | 121 | 7 | 48 | 73 | 2.3 | 39.7 | |
| Ascomycota | 15 | 6254 | 212 | 1 | 64 | 148 | 3.4 | 30.2 | |
| Ascomycota | 34 | 11 048 | 862 | 1 | 67 | 795 | 7.8 | 7.8 | |
| Ascomycota | 16 | 5936 | 169 | 0 | 40 | 129 | 2.8 | 23.7 | |
| Ascomycota | 29 | 10 440 | 263 | 2 | 41 | 222 | 2.5 | 15.6 | |
| Ascomycota | 12 | 6335 | 148 | 1 | 38 | 110 | 2.3 | 25.7 | |
| Ascomycota | 36 | 11 690 | 900 | 1 | 102 | 798 | 7.7 | 11.3 | |
| Ascomycota | 11 | 5357 | 113 | 5 | 37 | 76 | 2.1 | 32.7 | |
| Ascomycota | 10 | 5091 | 128 | 0 | 29 | 99 | 2.5 | 22.7 | |
| Ascomycota | 16 | 5799 | 139 | 0 | 34 | 105 | 2.4 | 24.5 | |
| Ascomycota | 40 | 14 010 | 1471 | 3 | 127 | 1344 | 10.5 | 8.6 | |
| Ascomycota | 33 | 10 406 | 751 | 21 | 78 | 673 | 7.2 | 10.4 | |
| Ascomycota | 39 | 9844 | 592 | 10 | 76 | 516 | 6.0 | 12.8 | |
| Ascomycota | 32 | 12 791 | 703 | 5 | 102 | 601 | 5.5 | 14.5 | |
| Ascomycota | 29 | 10 663 | 538 | 0 | 79 | 459 | 5.0 | 14.7 | |
| Ascomycota | 37 | 16 002 | 1103 | 1 | 101 | 1002 | 6.9 | 9.2 | |
| Ascomycota | 11 | 5920 | 159 | 0 | 33 | 126 | 2.7 | 20.8 | |
| Ascomycota | 9 | 5040 | 105 | 0 | 31 | 74 | 2.1 | 29.5 | |
| Ascomycota | 15 | 5816 | 144 | 0 | 35 | 109 | 2.5 | 24.3 | |
| Ascomycota | 33 | 10 272 | 789 | 1 | 89 | 700 | 7.7 | 11.3 | |
| Ascomycota | 37 | 12 169 | 942 | 0 | 93 | 849 | 7.7 | 9.9 | |
| Ascomycota | 12 | 5885 | 156 | 101 | 41 | 115 | 2.7 | 26.3 | |
| Ascomycota | 11 | 4824 | 109 | 0 | 7 | 102 | 2.3 | 6.4 | |
| Ascomycota | 13 | 5001 | 112 | 43 | 7 | 105 | 2.2 | 6.3 | |
| Ascomycota | 38 | 14 446 | 623 | 1 | 88 | 535 | 4.3 | 14.1 | |
| Ascomycota | 36 | 13 252 | 580 | 0 | 65 | 515 | 4.4 | 11.2 | |
| Ascomycota | 22 | 7760 | 312 | 0 | 45 | 267 | 4.0 | 14.4 | |
| Ascomycota | 15 | 5376 | 116 | 0 | 28 | 88 | 2.2 | 24.1 | |
| Ascomycota | 22 | 6472 | 299 | 5 | 78 | 221 | 4.6 | 26.1 | |
| Ascomycota | 12 | 4994 | 120 | 0 | 33 | 87 | 2.4 | 27.5 | |
| Basidiomycota | 36 | 13 546 | 917 | 8 | 106 | 811 | 6.8 | 11.6 | |
| Basidiomycota | 19 | 6578 | 186 | 0 | 34 | 152 | 2.8 | 18.3 | |
| Basidiomycota | 21 | 6594 | 181 | 0 | 30 | 151 | 2.7 | 16.6 | |
| Basidiomycota | 59 | 18 215 | 650 | 0 | 99 | 551 | 3.6 | 15.2 | |
| Basidiomycota | 9 | 4286 | 134 | 0 | 8 | 126 | 3.1 | 6.0 | |
| Basidiomycota | 27 | 13 649 | 465 | 0 | 39 | 426 | 3.4 | 8.4 | |
| Basidiomycota | 69 | 9083 | 391 | 0 | 22 | 369 | 4.3 | 5.6 | |
| Basidiomycota | 20 | 6548 | 431 | 2 | 21 | 410 | 6.6 | 4.9 | |
| Microsporidia | 3 | 1996 | 17 | 2 | 0 | 17 | 0.9 | 0.0 | |
| Microsporidia | 4 | 3632 | 21 | 0 | 0 | 21 | 0.6 | 0.0 | |
| Other species | 998 | 367 | 366 | ||||||
| Total | 478 073 | 23 878 | 1067 | 3014 |
Figure 2.Relationship between genome size, proteome size and secretome size in fungi. (a) genome size and proteome size; (b) proteome size and secretome size; (c) proteome size and GPI-anchored secreted proteins and (d) proteome size and soluble secreted proteins.
Highly encoded secreted protein families in fungi
| CDD functional domains | Numbers |
|---|---|
| pfam00135, COesterase, Carboxylesterase | 314 |
| pfam03443, Glyco hydro 61, Glycosyl hydrolase family 61 | 301 |
| COG0277, GlcD, FAD/FMN-containing dehydrogenases | 287 |
| cd04077, Peptidases S8 PCSK9 ProteinaseK like: Peptidase S8 family domain in ProteinaseK-like proteins | 223 |
| pfam00450, Peptidase S10, Serine carboxypeptidase | 215 |
| pfam00295, Glyco hydro 28, Glycosyl hydrolases family 28 | 207 |
| pfam00067, p450, Cytochrome P450 | 160 |
| pfam00933, Glyco hydro 3, Glycosyl hydrolase family 3 N terminal domain | 156 |
| cd05474, pepsin-like proteinases secreted from pathogens to degrade host proteins | 154 |
| COG2303, BetA, Choline dehydrogenase and related flavoproteins | 152 |
| pfam01083, Cutinase | 139 |
| pfam09362, DUF1996, Domain of unknown function (DUF1996) | 136 |
| pfam00264, Tyrosinase, Common central domain of tyrosinase | 130 |
| TIGR03388, ascorbase, L-ascorbate oxidase, plant type | 128 |
| cd04056, Peptidases S53, Peptidase domain in the S53 family | 124 |
| pfam04389, Peptidase M28, Peptidase family M28 | 122 |
| COG5309, COG5309, Exo-beta-1,3-glucanase | 121 |
| pfam04616, Glyco hydro 43, Glycosyl hydrolases family 43 | 114 |
| cd00519, Lipase 3, Lipase (class 3) | 106 |
| PRK02106, PRK02106, choline dehydrogenase | 100 |
| COG2730, BglC, Endoglucanase | 99 |
| pfam00328, Acid phosphat A, Histidine acid phosphatase | 98 |
| pfam03856, SUN, Beta-glucosidase (SUN family) | 97 |
| pfam07519, Tannase, Tannase and feruloyl esterase | 97 |
| smart00656, Amb all, Amb all domain | 94 |
| pfam00457, Glyco hydro 11, Glycosyl hydrolases family 11 | 92 |
| cd06097, Aspergillopepsin like: Aspergillopepsin like, aspartic proteases of fungal origin | 91 |
| cd02877, GH18 hevamine XipI class III | 88 |
| pfam00331, Glyco hydro 10, Glycosyl hydrolase family 10 | 88 |
| pfam01565, FAD binding 4, FAD binding domain | 87 |
| pfam03583, LIP, Secretory lipase | 87 |
| pfam03659, Glyco hydro 71, Glycosyl hydrolase family 71 | 87 |
| pfam01185, Hydrophobin, Fungal hydrophobin | 85 |
| pfam01532, Glyco hydro 47, Glycosyl hydrolase family 47 | 79 |
| cd02181, GH16 MLG1 glucanase | 78 |
| cd05471, Pepsin-like aspartic proteases, bilobal enzymes that cleave bonds in peptides at acidic pH | 77 |
| cd05384, SCP PRY1 like, SCP-like extracellular protein domain, PRY1-like sub-family restricted to fungi | 75 |
| cd07203, Fungal Phospholipase B-like; cPLA2 GrpIVA homologs; catalytic domain | 71 |
| pfam00840, Glyco hydro 7, Glycosyl hydrolase family 7 | 71 |
| pfam00150, Cellulase, Cellulase (glycosyl hydrolase family 5) | 70 |
| pfam11790, Glyco hydro cc, Glycosyl hydrolase catalytic core | 70 |
| pfam01522, Polysacc deac 1, Polysaccharide deacetylase | 69 |
| pfam07971, Glyco hydro 92, Glycosyl hydrolase family 92 | 68 |
| smart00636, Glyco 18, Glycosyl hydrolase family 18 | 68 |
| cd00842, MPP ASMase, acid sphingomyelinase and related proteins | 67 |
| cd03457, intradiol dioxygenase like, Intradiol dioxygenase supgroup | 67 |
| pfam03663, Glyco hydro 76, Glycosyl hydrolase family 76 | 67 |
| pfam05577, Peptidase S28, Serine carboxypeptidase S28 | 67 |
| pfam12296, HsbA, Hydrophobic surface binding protein A | 65 |
| cd02183, GH16 GPI glucanosyltransferase | 64 |
| COG0654, 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases | 63 |
| pfam01055, Glyco hydro 31, Glycosyl hydrolases family 31 | 62 |
| cd06248, Peptidase M14 Carboxypeptidase A/B-like subfamily | 61 |
| pfam02128, Peptidase M36, Fungalysin metallopeptidase (M36) | 61 |
| pfam04185, Phosphoesterase, Phosphoesterase family | 61 |
| pfam11765, Hyphal reg CWP, Hyphally regulated cell wall protein | 60 |
| pfam01328, Peroxidase 2, Peroxidase, family 2 | 59 |
| pfam01828, Peptidase A4, Peptidase A4 family | 58 |
| pfam03198, Glyco hydro 72, Glycolipid anchored surface protein | 57 |
| cd01846, Fatty acyltransferase-like subfamily of the SGNH hydrolases, a diverse family of lipases and esterases | 56 |
| pfam02102, Peptidase M35, Deuterolysin metalloprotease (M35) | 56 |
| pfam00723, Glyco hydro 15, Glycosyl hydrolases family 15 | 54 |
| pfam00128, Alpha-amylase, Alpha amylase, catalytic domain | 53 |
| cd08588, Catalytic domain of Arabidopsis thaliana PI-PLC X domain-containing protein | 52 |
| PHA03247, PHA03247, large tegument protein UL36; Provisional | 52 |
| pfam01301, Glyco hydro 35, Glycosyl hydrolases family 35 | 51 |
| pfam11937, DUF3455, Protein of unknown function (DUF3455) | 51 |
Figure 3.Molecular functional classification of fungal secreted proteins using GOSlimViewer.
Comparison of the two independently developed fungal secretome databases
| FSD | FunSecKB | |
|---|---|---|
| Data source | Fungal genomes | Fungal proteins in RefSeq |
| Prediction tools | SignalP3.0; SigCleave; SigPred; RPSP; TMHMM2.0c; TargetP1.1b; PsortII; PredictNLS; SecretomeP1.0f | SignalP 3.0; Phobius1.01; WolfPsort0.2; TargetP1.1b, TMHMM2.0c; PS-Scan |
| Data access | Sequence locus name; BLAST | Keywords, RefSeq gi or accession, UniProt accession; BLAST |
| Community curation tool | Not available | Available |