| Literature DB >> 29450146 |
Carla C Uranga1, Majid Ghassemian2, Rufina Hernández-Martínez1.
Abstract
Many basic science questions remain regarding protein functions in the pathogen: host interaction, especially in the trunk disease fungi family, the Botryosphaeriaceae, which are a global problem for economically important plants, especially fruiting trees. Proteomics is a highly useful technology for studying protein expression and for discovering novel proteins in unsequenced and poorly annotated organisms. Current fungal proteomics approaches involve 2D SDS-PAGE and extensive, complex, protein extraction methodologies. In this work, a modified Folch extraction was applied to protein extraction to perform both de novo peptide sequencing and peptide fragmentation analysis/protein identification of the plant and human fungal pathogen Lasiodiplodia theobromae. Both bioinformatics approaches yielded novel peptide sequences from proteins produced by L. theobromae in the presence of exogenous triglycerides and glucose. These proteins and the functions they may possess could be targeted for further functional characterization and validation efforts, due to their potential uses in biotechnology and as new paradigms for understanding fungal biochemistry, such as the finding of allergenic enolases, as well as various novel proteases, including zinc metalloproteinases homologous to those found in snake venom. This work contributes to genomic annotation efforts, which, hand in hand with genomic sequencing, will help improve fungal bioinformatics databases for future studies of Botryosphaeriaceae. All data, including raw data, are available via the ProteomeXchange data repository with identifier PXD005283. This is the first study of its kind in Botryosphaeriaceae.Entities:
Keywords: Bioinformatics; Gene ontology; Peptide fragmentation analysis; Trunk-disease fungi; de novo peptide sequencing
Year: 2017 PMID: 29450146 PMCID: PMC5802045 DOI: 10.1016/j.biopen.2017.03.001
Source DB: PubMed Journal: Biochim Open ISSN: 2214-0085
Fig. 1Venn diagrams of gene ontology data from Cytoscape BiNGO showing enriched biological processes in the proteome from Lasiodiplodia theobromae using an all-Uniprot ontology database, compared to A; An all-Saccharomyces cerevisiae ontology database, and B; an all-Candida albicans ontology database. Also included are enriched molecular functions of the proteome from L. theobromae assessed with an all-Uniprot ontology database compared to molecular functions from C; an all-S. cerevisiae ontology database, as well as compared to molecular functions of D; an all-C. albicans ontology database. Venn diagrams were created with the “R”-based program VennDiagram.
Proteins identified as “hypothetical” by the MSGF, X! Tandem and OMSSA algorithms, further searched with NCBI BLAST. The top-scoring homologous protein is reported for each search in the last column.
| Main accession | Description | Coverage [%] | Spectrum counting | #Validated unique | Confidence [%] | Validation | Homologous protein sequence (BLAST) |
|---|---|---|---|---|---|---|---|
| Hypothetical protein MPH_00684 [ | 6.0 | 30.1 | 2 | 100 | Confident | ||
| Hypothetical protein MPH_00885 [ | 17.1 | 228.8 | 3 | 100 | Confident | ||
| Hypothetical protein MPH_02256 [ | 9.1 | 124.1 | 4 | 100 | Confident | ||
| Hypothetical protein MPH_06104 [ | 11.6 | 108.6 | 2 | 100 | Confident | ||
| Hypothetical protein MPH_07279 [ | 9.9 | 242.1 | 2 | 100 | Confident | ||
| Hypothetical protein MPH_07998 [ | 4.9 | 11.8 | 2 | 100 | Confident | ||
| Hypothetical protein MPH_08297 [ | 12.3 | 74.8 | 3 | 100 | Confident | ||
| Hypothetical protein MPH_08717 [ | 23.4 | 128.1 | 7 | 100 | Confident | ||
| Hypothetical protein MPH_13123 [ | 21.2 | 581.6 | 4 | 100 | Confident |
Novel de novo peptide sequences from the Novor and PepNovo algorithms and BLASTp search results using an all-Uniprot database of the peptidome from Lasiodiplodia theobromae incubated in 5% grapeseed oil + 5% glucose for 20 days, showing the most probable homolog.
| Query sequence | Peptide Score | Charge | E-value | Uniprot ID | Organism | Protein name | |
|---|---|---|---|---|---|---|---|
| ELGFLASLGWNAPPAFPGPELTALDKLSAALSNR | 28.7 | 1175.2 | 3+ | 2 | Serine protease/ABC transporter B family protein tagD (EC 3.4.21.-) (Serine protease/ABC transporter tagD) | ||
| DDGSMDTCEDDGAGR | 43.1 | 808.8 | 2+ | 8.6 | Serine protease 41 (EC 3.4.21.-) (Testis serine protease 1) (TESSP-1) | ||
| QETSVPGDSQATLDALNEWR | 66.5 | 739.7 | 3+ | 1.6 | Snake venom metalloproteinase-disintegrin-like mocarhagin (MOC) (Mocarhagin-1) (SVMP) (EC 3.4.24.-) (Zinc metalloproteinase) | ||
| LLGTAMHVAGHLCAMMGWYRLTPSLHK | 17.0 | 1033.8 | 3+ | 4.6 | Snake venom serine proteinase 12 (SVSP) (EC 3.4.21.-) | ||
| WLSAKGWAQGVAEGPGDPR | 35.9 | 991.5 | 2+ | 7.7 | Snake venom vascular endothelial growth factor toxin VR-1' (svVEGF) (VEGF-F) | ||
| DYPPVLEHLLAWEEEQR | 19.0 | 708.7 | 3+ | 7 | Venom allergen 5 (Antigen 5) (Cysteine-rich venom protein) (CRVP) | ||
| VLTVKGSCNLTMLVSFWTFSGNLTNGASTGSHK | 37.4 | 1172.3 | 3+ | 5.7 | Venom allergen 5 (Antigen 5) (Cysteine-rich venom protein) (CRVP) (allergen Pac c 3) (Fragment) | ||
| MLPAFLVDVHHSKKVGVPPNYYYQHSK | 15.10 | 793.4 | 4+ | 7.1 | Venom prothrombin activator hopsarin-D (vPA) (EC 3.4.21.6) (Venom coagulation factor Xa-like protease) [Cleaved into: Hopsarin-D light chain; Hopsarin-D heavy chain] | ||
| APTPALDGGKLMFVSAK | 38.6 | 568.3 | 3+ | 0.63 | Zinc metalloprotease RasP (EC 3.4.24.-) (Regulating alternative sigma factor protease) (Regulating anti-sigma-W factor activity protease) | ||
| LASHKPNCFLAPPLGTHDVVPK | 34.5 | 800.1 | 3+ | 6 | Zinc metalloproteinase-disintegrin-like daborhagin-K (EC 3.4.24.-) (Haemorrhagic metalloproteinase russelysin) (Snake venom metalloproteinase) (SVMP) | ||
| VDSDFTVGLAAQWAETDLYK | 49.3 | 743.7 | 3+ | 7 | Zinc metalloproteinase-disintegrin-like lachestatin-2 (EC 3.4.24.-) (Snake venom metalloprotease) (SVMP) (Vascular apoptosis-inducing protein-like) (VAP-like) | ||
| YGMKWSLLMLAAGGGR | 54.8 | 863.9 | 2+ | 5.4 | Ankyrin repeat and SAM domain-containing protein 6 (Ankyrin repeat domain-containing protein 14) (SamCystin) | ||
| LMAAWGQNPGQAALYVEWNPEVMSAVWK | 37.10 | 1054.8 | 3+ | 1.2 | Ankyrin repeat and SAM domain-containing protein 6 (Ankyrin repeat domain-containing protein 14) (SamCystin) | ||
| LGAKWCHHCFAVPMK | 19.9 | 929.4 | 2+ | 0.95 | POTE ankyrin domain family member E (ANKRD26-like family C member 1A) (Prostate, ovary, testis-expressed protein on chromosome 2) (POTE-2) | ||
| ALKGFYLMNPPAPVWWTR | 49.6 | 716.4 | 3+ | 5.6 | ATP-dependent zinc metalloprotease FtsH (EC 3.4.24.-) | ||
| LAAPNPYLWWHDLVPR | 40.5 | 974.5 | 2+ | 6.7 | ATP-dependent zinc metalloprotease FtsH homolog | ||
| ELKAKKVGAMDVASSEFYK | 52.7 | 701.04 | 3+ | 1.5 | Enolase 1 (EC 4.2.1.11) (2-phospho- | ||
| TLVAGFFFLAASLTCLLVPSEAPTFSETTR | 45.0 | 812.4 | 4+ | 1.8 | Enolase (EC 4.2.1.11) (2-phospho- | ||
| ELGAHTLGFLESNPKFDPGSYEQLADLYK | 53.2 | 1080.5 | 3+ | 1.7 | Enolase (EC 4.2.1.11) (2-phospho- | ||
| HCLAAVPCMGQSPAAAALLNSSEVQNMAMGFK | 40.4 | 1025.5 | 2+, 3+, 4+ | 1.6 | Phospholipase D (EC 3.1.4.4) (Choline phosphatase) (PLD-A) | ||
| VFRLKDSGNSKPDKAAPPPGPLPR | 29.2 | 848.8 | 3+ | 8.3 | Phospholipase D2 (PLD 2) (hPLD2) (EC 3.1.4.4) (Choline phosphatase 2) (PLD1C) (Phosphatidylcholine-hydrolyzing phospholipase D2) | ||
| LHMGTDCAELDCVRSAPK | 33.2 | 687.3 | 3+ | 4.5 | Phospholipid-transporting ATPase 10 (AtALA10) (EC 3.6.3.1) (Aminophospholipid flippase 10) | ||
| DLADALKAAMGAPKHAVDHVAEPMDVAQLAPGNK | 36.9 | 867.7 | 4+ | 4.2 | Complement C2 (EC 3.4.21.43) (C3/C5 convertase) [Cleaved into: Complement C2b fragment; Complement C2a fragment] | ||
| NTLHAPEGTVVHLNAHK | 54.2 | 919.5 | 2+ | 8.7 | Complement C3 [Cleaved into: Complement C3 beta chain; Complement C3 alpha chain; C3a anaphylatoxin; Complement C3 gamma chain] (Fragment) | ||
| HNDRCPCMCNCQNDDCRCDTSNVPAK | 16.2 | 825.8 | 4+ | 6.3 | Complement C3 [Cleaved into: Complement C3 beta chain; Complement C3 alpha chain; C3a anaphylatoxin; Complement C3 gamma chain] (Fragment) | ||
| SWFVNVPNCDGALAGMTASFVR | 30.9 | 800.4 | 3+ | 9.7 | Complement C5 (C3 and PZP-like alpha-2-macroglobulin domain-containing protein 4) [Cleaved into: Complement C5 beta chain; Complement C5 alpha chain; C5a anaphylatoxin; Complement C5 alpha chain] | ||
| KDLKYLDVYYWRYFWKWRQVDYLVAPAK | 22.6 | 928.9 | 4+ | 4.1 | Hormone-sensitive lipase (HSL) (EC 3.1.1.79) | ||
| DPGFLGTLATASGKFGEGDPHAKTSGLNRK | 40.0 | 1010.5 | 3+ | 0.21 | Toxin CfTX-1 (Toxin 1) | ||
| TLMGYLLAVDGCALGCFFVMPGLGLVAALSK | 28.3 | 815.9 | 4+ | 3.2 | Toxin TdII-3 (Fragment) | ||
| SAPDPGTGDEVVMGQMDPR | 37.1 | 658.9 | 3+ | 4.8 | Aspartic proteinase (EC 3.4.23.-) | ||
| PAYHDLGVVPMADTR | 41.7 | 821.4 | 2+ | 9.2 | Aspartyl protease family protein 1 (EC 3.4.23.-) |
Fig. 2Taxonomic order distribution of peptide hits using the entire Uniprot database from both peptidome analysis approaches, showing the top ten most represented orders in each peptide identification approach with most homology to the peptidome from Lasiodiplodia theobromae.