| Literature DB >> 26307970 |
Atsushi Kurotani1, Tetsuya Sakurai2.
Abstract
Recent proteome analyses have reported that intrinsically disordered regions (IDRs) of proteins play important roles in biological processes. In higher plants whose genomes have been sequenced, the correlation between IDRs and post-translational modifications (PTMs) has been reported. The genomes of various eukaryotic algae as common ancestors of plants have also been sequenced. However, no analysis of the relationship to protein properties such as structure and PTMs in algae has been reported. Here, we describe correlations between IDR content and the number of PTM sites for phosphorylation, glycosylation, and ubiquitination, and between IDR content and regions rich in proline, glutamic acid, serine, and threonine (PEST) and transmembrane helices in the sequences of 20 algae proteomes. Phosphorylation, O-glycosylation, ubiquitination, and PEST preferentially occurred in disordered regions. In contrast, transmembrane helices were favored in ordered regions. N-glycosylation tended to occur in ordered regions in most of the studied algae; however, it correlated positively with disordered protein content in diatoms. Additionally, we observed that disordered protein content and the number of PTM sites were significantly increased in the species-specific protein clusters compared to common protein clusters among the algae. Moreover, there were specific relationships between IDRs and PTMs among the algae from different groups.Entities:
Keywords: algae; glycosylation; phosphorylation; post-translational modifications; protein disorder; ubiquitination
Mesh:
Substances:
Year: 2015 PMID: 26307970 PMCID: PMC4581327 DOI: 10.3390/ijms160819812
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Evaluation of global protein disorder in algae proteomes. (A) Predicted disordered protein content in the studied algae proteomes computed with DISOPRED; (B) Results of a pairwise correlation analysis between the disorder content and the total number of amino acids in a proteome.
Figure 2Correlations between disordered protein content and the number of phosphorylation sites. (A) Normalized predicted Ser (S), Thr (T) and Tyr (Y) phosphorylation sites per 400 amino acids in the studied algae proteomes; The relative percentage of disordered protein content with different numbers of predicted S, T and Y phosphorylation sites are presented in (B–D), respectively.
Statistics of the correlation between protein disorder and proteomic parameters.
| Species | S-pho | T-pho | Y-pho | O-gly | N-gly | PEST | Ubi | TM | |
|---|---|---|---|---|---|---|---|---|---|
| Green algae | 0.976 | 0.961 | 0.867 | 0.968 | −0.705 | 0.953 | 0.993 | −0.830 | |
| 2 × 10−5 | 3 × 10−4 | 3 × 10−3 | 4 × 10−5 | 3 × 10−2 | 2 × 10−3 | 4 × 10−7 | 5 × 10−3 | ||
| 3 × 10−5 | 1 × 10−3 | 5 × 10−3 | 4 × 10−5 | 3 × 10−2 | 2 × 10−3 | 9 × 10−7 | 6 × 10−3 | ||
| 0.967 | 0.958 | 0.895 | 0.973 | −0.333 | 0.999 | 0.994 | −0.803 | ||
| 4 × 10−5 | 3 × 10−4 | 1 × 10−3 | 3 × 10−5 | 2 × 10−1 | 6 × 10−7 | 3 × 10−7 | 8 × 10−3 | ||
| 4 × 10−5 | 1 × 10−3 | 3 × 10−3 | 3 × 10−5 | 2 × 10−1 | 1 × 10−5 | 7 × 10−7 | 9 × 10−3 | ||
| 0.987 | 0.977 | 0.891 | 0.992 | −0.869 | 0.982 | 0.992 | −0.958 | ||
| 3 × 10−6 | 8 × 10−5 | 1 × 10−3 | 5 × 10−7 | 3 × 10−3 | 2 × 10−4 | 8 × 10−7 | 9 × 10−5 | ||
| 2 × 10−5 | 5 × 10−4 | 3 × 10−3 | 3 × 10−6 | 4 × 10−3 | 4 × 10−4 | 2 × 10−6 | 2 × 10−4 | ||
| 0.985 | 0.979 | 0.701 | 0.992 | −0.970 | 0.994 | 0.990 | −0.976 | ||
| 4 × 10−6 | 6 × 10−5 | 3 × 10−2 | 7 × 10−7 | 3 × 10−5 | 3 × 10−5 | 1 × 10−6 | 2 × 10−5 | ||
| 2 × 10−5 | 6 × 10−4 | 3 × 10−2 | 3 × 10−6 | 2 × 10−4 | 1 × 10−4 | 2 × 10−6 | 5 × 10−5 | ||
| 0.980 | 0.935 | 0.831 | 0.988 | −0.980 | 0.970 | 0.968 | −0.950 | ||
| 1 × 10−5 | 1 × 10−3 | 5 × 10−3 | 2 × 10−6 | 1 × 10−5 | 7 × 10−4 | 4 × 10−5 | 2 × 10−4 | ||
| 3 × 10−5 | 1 × 10−3 | 8 × 10−3 | 5 × 10−6 | 2 × 10−4 | 9 × 10−4 | 5 × 10−5 | 3 × 10−4 | ||
| 0.974 | 0.929 | 0.846 | 0.986 | −0.967 | 0.971 | 0.976 | −0.966 | ||
| 2 × 10−5 | 1 × 10−3 | 4 × 10−3 | 3 × 10−6 | 4 × 10−5 | 6 × 10−4 | 2 × 10−5 | 5 × 10−5 | ||
| 3 × 10−5 | 1 × 10−3 | 6 × 10−3 | 7 × 10−6 | 2 × 10−4 | 9 × 10−4 | 2 × 10−5 | 1 × 10−4 | ||
| 0.982 | 0.908 | 0.578 | 0.958 | −0.883 | 0.965 | 0.910 | −0.985 | ||
| 7 × 10−6 | 2 × 10−3 | 7 × 10−2 | 9 × 10−5 | 2 × 10−3 | 9 × 10−4 | 8 × 10−4 | 4 × 10−6 | ||
| 2 × 10−5 | 2 × 10−3 | 7 × 10−2 | 9 × 10−5 | 4 × 10−3 | 1 × 10−3 | 8 × 10−4 | 3 × 10−5 | ||
| 0.976 | 0.886 | 0.879 | 0.987 | −0.805 | 0.962 | 0.967 | −0.987 | ||
| 2 × 10−5 | 4 × 10−3 | 2 × 10−3 | 2 × 10−6 | 8 × 10−3 | 1 × 10−3 | 4 × 10−5 | 3 × 10−6 | ||
| 3 × 10−5 | 4 × 10−3 | 4 × 10−3 | 5 × 10−6 | 1 × 10−2 | 1 × 10−3 | 5 × 10−5 | 3 × 10−5 | ||
| Oomycetes | 0.974 | 0.967 | 0.903 | 0.991 | −0.971 | 0.995 | 0.995 | −0.941 | |
| 2 × 10−5 | 2 × 10−4 | 1 × 10−3 | 1 × 10−6 | 3 × 10−5 | 2 × 10−5 | 1 × 10−7 | 2 × 10−4 | ||
| 3 × 10−5 | 9 × 10−4 | 3 × 10−3 | 3 × 10−6 | 3 × 10−4 | 9 × 10−5 | 6 × 10−7 | 3 × 10−4 | ||
| 0.975 | 0.932 | 0.976 | 0.985 | −0.890 | 0.996 | 0.994 | −0.948 | ||
| 2 × 10−5 | 1 × 10−3 | 2 × 10−5 | 5 × 10−6 | 2 × 10−3 | 1 × 10−5 | 2 × 10−7 | 2 × 10−4 | ||
| 3 × 10−5 | 1 × 10−3 | 2 × 10−4 | 8 × 10−6 | 3 × 10−3 | 6 × 10−5 | 7 × 10−7 | 3 × 10−4 | ||
| 0.972 | 0.947 | 0.982 | 0.995 | −0.533 | 0.984 | 0.996 | −0.960 | ||
| 3 × 10−5 | 6 × 10−4 | 8 × 10−6 | 1 × 10−7 | 9 × 10−2 | 2 × 10−4 | 7 × 10−8 | 8 × 10−5 | ||
| 3 × 10−5 | 1 × 10−3 | 2 × 10−4 | 1 × 10−6 | 1 × 10−1 | 4 × 10−4 | 5 × 10−7 | 2 × 10−4 | ||
| 0.971 | 0.953 | 0.923 | 0.990 | −0.462 | 0.966 | 0.995 | −0.943 | ||
| 3 × 10−5 | 4 × 10−4 | 5 × 10−4 | 1 × 10−6 | 1 × 10−1 | 9 × 10−4 | 2 × 10−7 | 2 × 10−4 | ||
| 3 × 10−5 | 1 × 10−3 | 2 × 10−3 | 3 × 10−6 | 1 × 10−1 | 1 × 10−3 | 7 × 10−7 | 3 × 10−4 | ||
| Diatom | 0.977 | 0.950 | 0.849 | 0.996 | 0.900 | 0.998 | 0.996 | −0.935 | |
| 1 × 10−5 | 5 × 10−4 | 4 × 10−3 | 6 × 10−8 | 1 × 10−3 | 2 × 10−6 | 9 × 10−8 | 3 × 10−4 | ||
| 3 × 10−5 | 1 × 10−3 | 6 × 10−3 | 1 × 10−6 | 3 × 10−3 | 2 × 10−5 | 4 × 10−7 | 4 × 10−4 | ||
| 0.977 | 0.940 | 0.927 | 0.982 | 0.919 | 0.972 | 0.998 | −0.977 | ||
| 1 × 10−5 | 8 × 10−4 | 5 × 10−4 | 7 × 10−6 | 6 × 10−4 | 6 × 10−4 | 2 × 10−8 | 2 × 10−5 | ||
| 3 × 10−5 | 1 × 10−3 | 2 × 10−3 | 1 × 10−5 | 2 × 10−3 | 1 × 10−3 | 4 × 10−7 | 6 × 10−5 | ||
| 0.979 | 0.947 | 0.789 | 0.993 | 0.765 | 0.997 | 0.991 | −0.981 | ||
| 1 × 10−5 | 6 × 10−4 | 1 × 10−2 | 5 × 10−7 | 1 × 10−2 | 7 × 10−6 | 9 × 10−7 | 9 × 10−6 | ||
| 3 × 10−5 | 1 × 10−3 | 1 × 10−2 | 3 × 10−6 | 2 × 10−2 | 5 × 10−5 | 2 × 10−6 | 4 × 10−5 | ||
| Yellow algae | 0.975 | 0.937 | 0.386 | 0.979 | −0.951 | 0.989 | 0.948 | −0.991 | |
| 2 × 10−5 | 9 × 10−4 | 2 × 10−1 | 1 × 10−5 | 1 × 10−4 | 9 × 10−5 | 2 × 10−4 | 9 × 10−7 | ||
| 3 × 10−5 | 1 × 10−3 | 2 × 10−1 | 2 × 10−5 | 6 × 10−4 | 2 × 10−4 | 2 × 10−4 | 2 × 10−5 | ||
| 0.993 | 0.965 | 0.381 | 0.978 | −0.875 | 0.991 | 0.982 | −0.931 | ||
| 5 × 10−7 | 2 × 10−4 | 2 × 10−1 | 1 × 10−5 | 2 × 10−3 | 6 × 10−5 | 7 × 10−6 | 4 × 10−4 | ||
| 5 × 10−6 | 8 × 10−4 | 2 × 10−1 | 2 × 10−5 | 4 × 10−3 | 2 × 10−4 | 1 × 10−5 | 5 × 10−4 | ||
| Red algae | 0.971 | 0.989 | 0.965 | 0.968 | −0.086 | 0.970 | 0.967 | −0.936 | |
| 3 × 10−5 | 1 × 10−5 | 5 × 10−5 | 4 × 10−5 | 4 × 10−1 | 7 × 10−4 | 4 × 10−5 | 3 × 10−4 | ||
| 3 × 10−5 | 3 × 10−4 | 3 × 10−4 | 4 × 10−5 | 4 × 10−1 | 9 × 10−4 | 5 × 10−5 | 4 × 10−4 | ||
| 0.995 | 0.937 | 0.907 | 0.980 | −0.942 | 0.948 | 0.947 | −0.731 | ||
| 1 × 10−7 | 9 × 10−4 | 9 × 10−4 | 1 × 10−5 | 2 × 10−4 | 2 × 10−3 | 2 × 10−4 | 2 × 10−2 | ||
| 2 × 10−6 | 1 × 10−3 | 3 × 10−3 | 2 × 10−5 | 8 × 10−4 | 2 × 10−3 | 2 × 10−4 | 2 × 10−2 | ||
| 0.983 | 0.923 | 0.827 | 0.973 | −0.378 | 0.984 | 0.997 | −0.951 | ||
| 6 × 10−6 | 1 × 10−3 | 6 × 10−3 | 2 × 10−5 | 2 × 10−1 | 2 × 10−4 | 2 × 10−8 | 1 × 10−4 | ||
| 2 × 10−5 | 2 × 10−3 | 8 × 10−3 | 3 × 10−5 | 2 × 10−1 | 4 × 10−4 | 2 × 10−7 | 3 × 10−4 | ||
Pearson correlation coefficients, their associated p values, and false discovery rates (Benjamini–Hochberg procedure) are presented in the upper, middle and lower rows, respectively, for all analyzed correlations between IDRs and proteomic parameters. Shaded values indicate p values greater than 0.05 that were not considered statistically significant. The column names stand for S phosphorylation, T phosphorylation, Y phosphorylation, N-linked glycosylation, O-linked glycosylation, region enriched in proline (P), glutamic acid (E), serine (S), and threonine (T) residues (PEST), ubiquitination, and transmembrane helices.
Figure 3Correlation between disordered protein content and glycosylation sites. Normalized predicted O-glycosylation and N-glycosylation sites per 400 amino acids in the studied algae proteomes are presented in (A,C), respectively; The relative percentage of disordered protein content with different numbers of predicted O- and N-glycosylation sites are presented in (B,D), respectively.
Figure 4Correlation between disordered protein content and ubiquitination sites or PEST regions. Normalized predicted ubiquitination sites and PEST regions per 400 amino acids in the studied algae proteomes are presented in (A,C), respectively; Relative percentage of disordered protein content with different numbers of predicted sites of ubiquitination and predicted regions of PEST are presented in (B,D), respectively.
Figure 5Correlation between disordered protein content and transmembrane regions. Normalized predicted transmembrane helices per 400 amino acids in the studied algae proteomes are presented in (A); Relative percentage of disordered protein content with different numbers of predicted transmembrane helices are presented in (B).
Relative post-translational modification (PTM) content in ordered and disordered regions of algae proteomes.
| Category | S-pho | T-pho | Y-pho | O-gly | N-gly | Ubi |
|---|---|---|---|---|---|---|
| Green algae | 15.1 | 8.4 | 1.1 | 3.2 | 0.9 | 11.0 |
| Oomycetes | 25.9 | 14.5 | 1.7 | 3.6 | 1.2 | 16.0 |
| Diatom | 19.8 | 10.3 | 1.3 | 2.9 | 1.9 | 11.9 |
| Yellow algae | 17.4 | 9.9 | 1.0 | 3.4 | 1.2 | 10.5 |
| Red algae | 19.2 | 11.3 | 1.6 | 2.6 | 1.2 | 16.6 |
| Total | 18.8 | 10.5 | 1.3 | 3.3 | 1.2 | 12.5 |
Ratios of normalized PTM contents (Rd/o) were calculated as described in the Experimental Section 4.3.
Preference of protein disorder and PTM in species-specific protein clusters and common protein clusters of algae proteomes.
| Category | Disorder | S-pho | T-pho | Y-pho | O-gly | N-gly | Ubi |
|---|---|---|---|---|---|---|---|
| Specific | 34% | 2.1 | 1.0 | 0.7 | 1.4 | 2.1 | 1.0 |
| Common | 20% | 0.6 | 0.3 | 0.5 | 0.8 | 1.3 | 0.6 |
| Ratio of s/c | 1.7 | 3.4 | 3.2 | 1.4 | 1.8 | 1.6 | 1.8 |
Total values of each parameter in species-specific protein clusters and common protein clusters are presented. The one-tailed p values as determined with t-tests between species-specific protein clusters and common protein clusters were less than 0.05. Ratio of s/c = the value of specific/the value of common.