| Literature DB >> 28284182 |
Lan Jing1, Dandan Guo2, Wenjie Hu2, Xiaofan Niu3.
Abstract
BACKGROUND: Many plant pathogen secretory proteins are known to be elicitors or pathogenic factors,which play an important role in the host-pathogen interaction process. Bioinformatics approaches make possible the large scale prediction and analysis of secretory proteins from the Puccinia helianthi transcriptome. The internet-based software SignalP v4.1, TargetP v1.01, Big-PI predictor, TMHMM v2.0 and ProtComp v9.0 were utilized to predict the signal peptides and the signal peptide-dependent secreted proteins among the 35,286 ORFs of the P. helianthi transcriptome.Entities:
Keywords: Bioinformatics; Prediction algorithm; Puccinia helianthi Schw.; Secretory protein; Signal peptide
Mesh:
Substances:
Year: 2017 PMID: 28284182 PMCID: PMC5346188 DOI: 10.1186/s12859-017-1577-0
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The bioinformatic tools adopted for the prediction of secretory proteins from Puccinia helianthi transcriptome
| Prediction algorithms | Objects predicted | References |
|---|---|---|
| SignalP v4.1 | N-terminal signal peptides | http://www.cbs.dtu.dk/services/SignalP/ |
| TMHMM v2.0 | Transmembrane domains | http://www.cbs.dtu.dk/services/TMHMM/ |
| Big-PI predictor | GPI-anchor site | http://mendel.imp.ac.at/gpi/fungi_server.html |
| TargetP v1.1 | Secretion pathway and position |
|
| ProtComp v9.0 | Localization sequences |
|
| LipoP v1.0 | Lipoprotein signal peptides |
|
| TatP v1.0 | Signal peptide with RR-motif |
|
| Clustal Omega | Proteins homology prediction |
|
Fig. 1Length distribution of Puccinia helianthi ORFs coding secretory proteins
Fig. 2Length distribution of Puccinia helianthi signal peptides
Fig. 3Percentage of 20 amino acid residues in Puccinia helianthi secretory protein signal peptides
Amino acids frequency and distribution in cleavage sites of signal peptide of secretory proteins
| Kinds of aa | 20 amino acid residues at the cleavage position from - 3 to + 3 of the signal peptides | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| −3 | −2 | −1 | +1 | +2 | +3 | |||||||
| No. | Percentage (%) | No. | Percentage (%) | No. | Percentage (%) | No. | Percentage (%) | No. | Percentage (%) | No. | Percentage (%) | |
| V | 242 | 26.7 | 34 | 3.7 | 3 | 0.3 | 50 | 5.5 | 34 | 3.7 | 72 | 7.9 |
| A | 147 | 16.2 | 40 | 4.4 | 446 | 49.1 | 94 | 10.4 | 18 | 2.0 | 53 | 5.8 |
| S | 135 | 14.9 | 150 | 16.5 | 148 | 16.3 | 96 | 10.6 | 97 | 10.7 | 72 | 7.9 |
| T | 103 | 11.3 | 42 | 4.6 | 47 | 5.2 | 44 | 4.8 | 57 | 6.3 | 68 | 7.5 |
| I | 81 | 8.9 | 27 | 3.0 | 1 | 0.1 | 30 | 3.3 | 36 | 4.0 | 76 | 8.4 |
| C | 72 | 7.9 | 14 | 1.5 | 70 | 7.7 | 15 | 1.7 | 22 | 2.4 | 43 | 4.7 |
| L | 45 | 5.0 | 137 | 15.1 | 18 | 2.0 | 84 | 9.3 | 59 | 6.5 | 89 | 9.8 |
| G | 34 | 3.7 | 9 | 1.0 | 123 | 13.5 | 33 | 3.6 | 29 | 3.2 | 36 | 4.0 |
| F | 6 | 0.7 | 53 | 5.8 | 3 | 0.3 | 30 | 3.3 | 25 | 2.8 | 35 | 3.9 |
| H | 6 | 0.7 | 48 | 5.3 | 0 | 0.0 | 33 | 3.6 | 22 | 2.4 | 24 | 2.6 |
| R | 6 | 0.7 | 26 | 2.9 | 6 | 0.7 | 41 | 4.5 | 34 | 3.7 | 13 | 1.4 |
| K | 6 | 0.7 | 24 | 2.6 | 0 | 0.0 | 30 | 3.3 | 50 | 5.5 | 38 | 4.2 |
| N | 5 | 0.6 | 57 | 6.3 | 3 | 0.3 | 40 | 4.4 | 60 | 6.6 | 49 | 5.4 |
| E | 5 | 0.6 | 77 | 8.5 | 5 | 0.6 | 66 | 7.3 | 86 | 9.5 | 39 | 4.3 |
| M | 4 | 0.4 | 15 | 1.7 | 4 | 0.4 | 11 | 1.2 | 4 | 0.4 | 12 | 1.3 |
| Y | 4 | 0.4 | 31 | 3.4 | 0 | 0.0 | 26 | 2.9 | 18 | 2.0 | 16 | 1.8 |
| Q | 3 | 0.3 | 73 | 8.0 | 10 | 1.1 | 117 | 12.9 | 42 | 4.6 | 45 | 5.0 |
| W | 2 | 0.2 | 15 | 1.7 | 3 | 0.3 | 3 | 0.3 | 4 | 0.4 | 13 | 1.4 |
| D | 1 | 0.1 | 31 | 3.4 | 5 | 0.6 | 58 | 6.4 | 59 | 6.5 | 43 | 4.7 |
| P | 1 | 0.1 | 5 | 0.6 | 13 | 1.4 | 7 | 0.8 | 152 | 16.7 | 72 | 7.9 |
Fig. 4Gene ontology annotation of the secretory proteins of Puccinia helianthi. The best hits were aligned to the GO database, and 143 putative secretory proteins were assigned to at least one GO term. Most consensus sequences were grouped into three major functional categories and 27 sub-categories
The 10 top GO terms significantly enriched for secretory proteins
| GO term | GO-ID | Category | % of input genes in GO-term |
|
|---|---|---|---|---|
| Hydrolase activity, hydrolyzing O-glycosyl compounds | GO:0004553 | Molecular function | 14.9 | 3.41E-20 |
| Hydrolase activity | GO:0016787 | Molecular function | 3.7 | 7.09E-20 |
| Hydrolase activity, acting on glycosyl bonds | GO:0016798 | Molecular function | 14.0 | 1.68E-19 |
| Carbohydrate metabolic process | GO:0005975 | Biological process | 7.4 | 1.65E-16 |
| Peptidase activity, acting on L-amino acid peptides | GO:0070011 | Molecular function | 7.1 | 5.09E-12 |
| Extracellular region | GO:0005576 | Cellular component | 17.6 | 6.93E-12 |
| Peptidase activity | GO:0008233 | Molecular function | 6.6 | 2.31E-11 |
| Serine-type endopeptidase activity | GO:0004252 | Molecular function | 16.9 | 7.50E-11 |
| Serine-type peptidase activity | GO:0008236 | Molecular function | 12.7 | 4.59E-10 |
| Serine hydrolase activity | GO:0017171 | Molecular function | 12.7 | 4.59E-10 |
GO enrichment analysis was carried out using the hypergeometric test with a value threshold of 0.05. Most significantly enriched terms were selected according to their p-value
Pathway categorization of the secretory proteins from Puccinia helianthi
| Parent KEGG pathway | No. of ESPs | KEGG pathway in the category |
|---|---|---|
| Metabolism: | 21 | |
| Amino Acid Metabolism | 1 | Arginine and proline metabolism |
| Biosynthesis of Other Secondary Metabolites | 2 | Phenylpropanoid biosynthesis |
| Carbohydrate Metabolism | 3 | Galactose metabolism |
| 1 | Propanoate metabolism | |
| 4 | Starch and sucrose metabolism | |
| Energy Metabolism | 1 | Oxidative phosphorylation |
| Enzyme Families | 3 | Peptidases |
| Lipid Metabolism | 2 | Steroid biosynthesis |
| 1 | Sphingolipid metabolism | |
| Metabolism of Cofactors and Vitamins | 2 | Riboflavin metabolism |
| 1 | Porphyrin and chlorophyll metabolism | |
| Genetic Information Processing | 17 | |
| Folding, Sorting and Degradation | 3 | Chaperones and folding catalysts |
| 2 | Protein processing in endoplasmic reticulum | |
| 1 | Ubiquitin system | |
| 1 | Ubiquitin mediated proteolysis | |
| Replication and Repair | 1 | Base excision repair |
| 1 | Chromosome | |
| Transcription | 4 | Spliceosome |
| 1 | Transcription factors | |
| Translation | 3 | Ribosome Biogenesis |
| Environmental Information Processing | 7 | |
| Signal Transduction | 1 | Jak-STAT signaling pathway |
| 1 | MAPK signaling pathway - yeast | |
| 1 | mTOR signaling pathway | |
| 1 | Notch signaling pathway | |
| Signaling Molecules and Interaction | 1 | CAM ligands |
| 1 | Cytokines | |
| 1 | Neuroactive ligand-receptor interaction | |
| Cellular Processes | 26 | |
| Cell Communication | 4 | Focal adhesion |
| 1 | Adherens junction | |
| Transport and Catabolism | 20 | Lysosome |
| 1 | Phagosome | |
| Organismal Systems | 7 | |
| Digestive System | 1 | Salivary secretion |
| Endocrine System | 2 | PPAR signaling pathway |
| Immune System | 2 | Antigen processing and presentation |
| 2 | Complement and coagulation | |
| Human Diseases | 4 | |
| Infectious Diseases | 1 | African trypanosomiasis |
| 1 | Tuberculosis | |
| 1 | Staphylococcus aureus infection | |
| Neurodegenerative Diseases | 1 | Alzheimer’s disease |
Fig. 5COG classifications of predicted secretory proteins in the transcriptome of Puccinia helianthi. All 80 putative proteins showing significant homology to those in the COG database were functionally classified into 14 families. Note: P, Inorganic ion transport and metabolism; L, Replication, recombination and repair; K, Transcription; E, Amino acid transport and metabolism; C, Energy production and transformation; U, Intracellular trafficking, secretion, and vesicular transport; S, Function unknown; M, Biosynthesis of cell and outer membrane; J, Translation, ribosomal structure and biogenesis; Q, The biosynthesis of secondary metabolites, transport and catabolism; R, General function prediction; G, The transport and metabolism of carbohydrates; O, Post-translational modification, protein turnover and molecular chaperones; I, Lipoid metabolism
Hydrolases among predicted secreted proteins of Puccinia helianthi
| Classification | Gene code | Blastx InterPro |
|---|---|---|
| Peptidase | KU994901 | Cysteine peptidase |
| KU994902 | Peptidase S1 | |
| KU994903 | Peptidase S8/S53 domain | |
| KU994904 | Peptidase S8/S53 domain | |
| KU994905 | Peptidase S8/S53 domain | |
| KU994906 | Gamma-glutamyl transpeptidase | |
| KU994907 | Peptidase S1 | |
| KU994908 | Peptidase S1 | |
| KU994909 | Cysteine peptidase | |
| KU994910 | Peptidase C1A | |
| KU994942 | Peptidase S1 | |
| KU994952 | Peptidase M36 | |
| KU994953 | Peptidase S10 | |
| KU994954 | Cysteine peptidase | |
| KU994955 | Peptidase S1 | |
| KU994982 | Peptidase M28 | |
| KU994983 | Peptidase S1 | |
| KU994984 | Peptidase S8/S53 domain | |
| KU994985 | Cysteine peptidase | |
| Glycoside hydrolase | KU994943 | Glycoside hydrolase |
| KU994944 | Glycoside hydrolase, family 31 | |
| KU994945 | Glycoside hydrolase, family 5 | |
| KU994946 | Glycoside hydrolase, family 31 | |
| KU994947 | Glycoside hydrolase, family 47 | |
| KU994948 | Glycoside hydrolase, family 10 | |
| KU994949 | Glycoside hydrolase, family 18 | |
| KU994950 | Glycoside hydrolase, family 30 | |
| KU994951 | Glycoside hydrolase, family 22 | |
| KU994956 | Glycoside hydrolase, family 61 | |
| KU994957 | Glycoside hydrolase, family 28 | |
| KU994958 | Glycoside hydrolase, family 30 | |
| KU994959 | Glycoside hydrolase | |
| KU994960 | Glycoside hydrolase, family 65 | |
| KU994971 | Glycoside hydrolase, family 32 | |
| Esterase | KU994921 | Cholinesterase |
| KU994975 | Cholinesterase | |
| KU994976 | Palmitoyl protein thioesterase | |
| KU994977 | Pectinesterase, catalytic | |
| KU994978 | Palmitoyl protein thioesterase | |
| KU994979 | Carboxylesterase, type B | |
| KU994980 | Calcineurin-like phosphoesterase domain | |
| Phosphatase | KU994928 | Histidine phosphatase superfamily |
| KU994929 | Inorganic pyrophosphatase | |
| KU994930 | Survival protein SurE-like Phosphatase/nucleotidase | |
| KU994931 | Histidine phosphatase superfamily | |
| KU994932 | Protein-tyrosine phosphatase | |
| Nuclease | KU994922 | Ribonuclease H-like domain |
| KU994923 | Deoxyribonuclease II | |
| KU994924 | Ribonuclease H-like domain | |
| KU994925 | Ribonuclease T2-like | |
| Polysaccharide deacetylase | KU994986 | Polysaccharide deacetylase |
| KU994987 | Polysaccharide deacetylase | |
| KU994988 | Polysaccharide deacetylase | |
| KU994989 | Polysaccharide deacetylase | |
| Alpha/beta hydrolase | KU994972 | Alpha/beta hydrolase fold-1 |
| KU994973 | Alpha/Beta hydrolase fold | |
| KU994974 | Alpha/beta hydrolase fold-1 | |
| Glucanase | KU994933 | Glucanases superfamily |
| KU994934 | Glucanases superfamily | |
| Glucoamylase | KU994926 | Glucoamylase |
| Ceramidase | KU994927 | Neutral/alkaline nonlysosomal ceramidase |
| Lipase | KU994935 | Lipase |
Pathogen Host Interaction database classification of secretory proteins of Puccinia helianthi
| Category | Num of PHI | Proportion (%) |
|---|---|---|
| Reduced virulence | 21 | 48.84 |
| Unaffected pathogenicity | 13 | 30.23 |
| Loss of pathogenicity | 4 | 9.30 |
| Effector (plant avirulence determinant) | 3 | 6.98 |
| Mixed outcome | 2 | 4.65 |
Functional classes of the secretory proteins of Puccinia helianthi
| PHI Category | Gene Code | Blastx NCBI Nr | Blastx InterPro |
|---|---|---|---|
| KU994910 | Hypothetical protein | Peptidase S8/S53 domain | |
| KU994911 | Hypothetical protein | ||
| Reduced virulence | KU994912 | Putative polysaccharide lyase family 1 | |
| KU994925 | Hypothetical protein | Ribonuclease T2-like | |
| KU994936 | L-ascorbate oxidase | Multicopper oxidase | |
| KU994937 | Hypothetical protein | Glucanosyl transferase | |
| KU994938 | Hypothetical protein | ||
| KU994939 | Hypothetical protein | ||
| KU994940 | Predicted protein | ||
| KU994945 | Hypothetical protein | Glycoside hydrolase | |
| KU994949 | Chitinase | Glycoside hydrolase | |
| KU994961 | Hypothetical protein | ||
| KU994962 | Hypothetical protein | Pectate lyase/Amb allergen | |
| KU994963 | Aspartic protease | ||
| KU994964 | Retrotransposable element | ||
| KU994965 | PR1 protein precursor | Cysteine-rich secretory protein | |
| KU994966 | Niemann-Pick C1 protein | ||
| KU994967 | Hypothetical protein | ||
| KU994968 | Pathogenesis-related protein 1 | ||
| KU994969 | Hypothetical protein | ||
| KU994970 | Hypothetical protein | Thioredoxin | |
| Unaffected pathogenicity | KU994905 | Probable serine carboxypeptidase CPVL | Serine carboxypeptidase |
| KU994913 | Beta glucosidase precursor | ||
| KU994914 | Putative chaperone protein | DnaJ domain | |
| KU994915 | Aspartic peptidase A1 | ||
| KU994916 | Beta glucosidase precursor | ||
| KU994943 | Hypothetical protein | Glycoside hydrolase | |
| KU994947 | Hypothetical protein | Glycoside hydrolase | |
| KU994957 | Predicted protein | Glycoside hydrolase | |
| KU994972 | Gastric triacylglycerol lipase-like | Alpha/beta hydrolase fold-1 | |
| KU994974 | Hypothetical protein | Alpha/beta hydrolase fold-1 | |
| KU994975 | Esterase | Cholinesterase | |
| KU994976 | Esterase 10 | Cholinesterase | |
| KU994980 | Esterase 9 | Carboxylesterase | |
| Loss of pathogenicity | KU994917 | Copper/zinc superoxide dismutase | |
| KU994918 | Ras-like C3 botulinum toxin substrate 1 | Small GTPase superfamily | |
| KU994944 | Hypothetical protein | Glycoside hydrolase | |
| KU994946 | Hypothetical protein | Glycoside hydrolase | |
| Effector | KU994907 | Limulus factor D | Peptidase S1 |
| KU994919 | Ovochymase-1 | ||
| KU994955 | Transmembrane protease serine 9-like | Peptidase S1 | |
| Mixed outcome | KU994920 | Guanine nucleotide-binding protein | WD40 repeat |
| KU994929 | Inorganic pyrophosphatase | Inorganic pyrophosphatase |