Literature DB >> 26307970

In Silico Analysis of Correlations between Protein Disorder and Post-Translational Modifications in Algae.

Atsushi Kurotani1, Tetsuya Sakurai2.   

Abstract

Recent proteome analyses have reported that intrinsically disordered regions (IDRs) of proteins play important roles in biological processes. In higher plants whose genomes have been sequenced, the correlation between IDRs and post-translational modifications (PTMs) has been reported. The genomes of various eukaryotic algae as common ancestors of plants have also been sequenced. However, no analysis of the relationship to protein properties such as structure and PTMs in algae has been reported. Here, we describe correlations between IDR content and the number of PTM sites for phosphorylation, glycosylation, and ubiquitination, and between IDR content and regions rich in proline, glutamic acid, serine, and threonine (PEST) and transmembrane helices in the sequences of 20 algae proteomes. Phosphorylation, O-glycosylation, ubiquitination, and PEST preferentially occurred in disordered regions. In contrast, transmembrane helices were favored in ordered regions. N-glycosylation tended to occur in ordered regions in most of the studied algae; however, it correlated positively with disordered protein content in diatoms. Additionally, we observed that disordered protein content and the number of PTM sites were significantly increased in the species-specific protein clusters compared to common protein clusters among the algae. Moreover, there were specific relationships between IDRs and PTMs among the algae from different groups.

Entities:  

Keywords:  algae; glycosylation; phosphorylation; post-translational modifications; protein disorder; ubiquitination

Mesh:

Substances:

Year:  2015        PMID: 26307970      PMCID: PMC4581327          DOI: 10.3390/ijms160819812

Source DB:  PubMed          Journal:  Int J Mol Sci        ISSN: 1422-0067            Impact factor:   5.923


1. Introduction

The intrinsically disordered regions (IDRs) of proteins have been extensively investigated over the past several years, and their numbers are significantly higher in eukaryotes than in bacteria or archaea [1,2]. More than one-third of eukaryotic proteins contain IDRs of more than 30 residues in length [3]. IDRs are characterized by a low abundance of order-promoting and bulky hydrophobic aromatic amino acids such as Ile, Leu, Val, Cys, Asn, Trp, Tyr, and Phe, a high abundance of disorder-promoting amino acids such as Ala, Arg, Gly, Gln, Ser, Glu, Lys and Pro, and a high net charge at neutral pH [1,4,5,6,7,8,9]. While IDRs form unstructured domains, predictions on which regions of a protein are disordered based on protein sequences can be used to select target regions for 3D structural analysis. For example, the disordered C-terminal 100 amino acid deletion mutant of the human nei-like DNA glycosylase (NEIL) protein was successfully expressed in Escherichia coli, purified, and crystallized [10]. More than 50 methods have been developed since the early 2000s for protein disorder predictions [6,11,12], including the development of structure-related databases and the design of IDR prediction algorithms. IDRs often contain sites for various post-translational modifications (PTMs) such as phosphorylation, glycosylation, ubiquitination, acylation, and methylation, suggesting that PTMs favor the easily accessible and flexible disordered regions [13,14]. PTMs are involved in enzyme activity, protein localization, protein-protein interactions, signaling cascades, DNA repair, and cell division. PTMs commonly cause only small changes on the surface of the disordered region of a protein, whereas PTM-based protein-protein interactions often cause large-scale three-dimensional structural changes such as disorder-to-order transitions of IDRs [15]. Importantly, the larger structural changes are generally considered to have significant effects on protein functional diversity. PTM prediction methods are well established. Several studies have reported that protein phosphorylation on Ser, Thr, and Tyr predominantly occurs in the disordered regions of animal proteins [16,17,18,19]. Proteome-wide analyses have also shown that there are correlations between IDRs and other PTMs such as glycosylation, acetylation, and methylation [20,21]. Proteins with IDRs have also been examined in plant proteomes [1,16,22]. For example, Arabidopsis thaliana (A. thaliana) proteins were found to be generally less disordered than human proteins. However, certain functional classes of A. thaliana proteins are significantly more enriched in disordered regions in response to their environment compared to human proteins [23]. This suggests that plants may use disorder independently as an easy and fast mechanism for introducing versatility into protein interaction networks that underlie complex biological processes to quickly and efficiently adapt to environmental changes of which they cannot escape [23]. Additionally, the relationships between PTMs and other cellular features are increasingly being explored in plant studies. For instance, phosphoproteomics and genetics analyses of A. thaliana have revealed insight into signaling pathways [24]. However, proteome information in many plant species is lacking. For example, A. thaliana is one of the most commonly studied plants and detailed genomic data are available. Conversely, one-third of the proteins in A. thaliana still lack functional annotations in terms of biological processes in the Gene Ontology database [24,25,26,27]. Recently, several large-scale experimental and computational approaches have been used to enhance our knowledge of plant species proteomes [27,28,29]. One such higher plant study was a proteome-wide computational analysis of the correlation between protein disorder and PTMs including Ser/Thr/Tyr-phosphorylation and O/N-linked protein glycosylation [19,21]. However, sufficient numbers of this type of study have not yet been performed, and plant researchers are currently developing better resources for omics analyses of plant species. In order to enrich the gene annotation in plants and to understand gene and protein functions, we performed a proteome-wide correlation analysis of protein disorder with the major types of eukaryotic PTMs including Ser/Thr/Tyr-phosphorylation, O/N-linked glycosylation, and ubiquitination in 20 typical eukaryotic algae. We further analyzed sequences enriched in proline (P), glutamic acid (E), serine (S), and threonine (T) residues (PEST), as well as transmembrane helices. Phosphorylation, glycosylation, and ubiquitination were analyzed with a preference for occurrence in disordered regions using proteome sequence sets. We further investigated whether PEST regions and transmembrane helices preferentially occurred in disordered regions. In the process, it was revealed that protein disorder and multiple PTMs are favored by species-specific proteins among the algae.

2. Results

2.1. Comparison of Data Sets and Global Intrinsic Disorder in Algae Proteomes

We prepared non-redundant sequence sets from 20 algae whole-protein sequences using the OrthoMCL tool [30] to remove sequences with >90% identity (for details, see Experimental Section 4.1). The number of sequences in the analyzed algae proteomes varied greatly from approximately 4800 in Cyanidioschyzon merolae to 20,000 in Fragilariopsis cylindrus (Table S1). Disordered protein content was determined using DISOPRED [31], which ranged from approximately 20% in Pyropia yezoensis to 35% in Chlamydomonas reinhardtii (Figure 1A). A similar trend was observed with another disorder prediction tool, RONN [32] (Figure S1). Additionally, we found that there was a correlation between disordered protein content and the total number of amino acids among the algae proteomes (Figure 1B).
Figure 1

Evaluation of global protein disorder in algae proteomes. (A) Predicted disordered protein content in the studied algae proteomes computed with DISOPRED; (B) Results of a pairwise correlation analysis between the disorder content and the total number of amino acids in a proteome.

Evaluation of global protein disorder in algae proteomes. (A) Predicted disordered protein content in the studied algae proteomes computed with DISOPRED; (B) Results of a pairwise correlation analysis between the disorder content and the total number of amino acids in a proteome.

2.2. Correlation between Intrinsic Disorder and Phosphorylation

Reversible protein phosphorylation, principally on serine, threonine, or tyrosine residues, is considered the most important PTM because of its role as a major regulatory mechanism in every living cell. Many studies have reported that protein phosphorylation sites are predominantly observed in the disordered regions of eukaryotic proteins, including those of plants [17,18,21]. Furthermore, the amino acid composition, sequence complexity, hydrophobicity, charge, and other sequence attributes of regions adjacent to phosphorylation sites are very similar to those of IDRs [18]. Recently, the bioinformatics tool Musite was developed for large-scale prediction of phosphorylation sites from protein sequences as a stand-alone application [17]. Therefore, the Musite tool was used to predict phosphorylation sites in the target algae proteomes. The number of predicted phosphorylation sites was normalized to the uniform length of 400 amino acids, rather than per sequence, to account for the difference in the average protein lengths in the datasets. Similarly, we used normalized values in the analyses of the number of glycosylation sites, ubiquitination sites, PEST regions, and transmembrane helices. No trend in the average abundance of phosphorylation sites in algae proteomes was found (Figure 2A). However, a strong positive correlation was observed between protein disorder and phosphorylation. For every algae proteome analyzed, there was a positive correlation between the predicted number (from 0 to ≥7) of Ser, Thr, or Tyr phosphorylation sites and disordered protein content (Figure 2B–D). The correlations between the number of Ser and Thr phosphorylation sites to disordered protein content were statistically significant as determined by one-tailed probability values and false discovery rates controlled by the Benjamini–Hochberg procedure [33] (Table 1). The positive correlation between disordered protein content and the number of phosphorylation sites was confirmed by using the alternative disorder prediction tool, RONN (Figure S3).
Figure 2

Correlations between disordered protein content and the number of phosphorylation sites. (A) Normalized predicted Ser (S), Thr (T) and Tyr (Y) phosphorylation sites per 400 amino acids in the studied algae proteomes; The relative percentage of disordered protein content with different numbers of predicted S, T and Y phosphorylation sites are presented in (B–D), respectively.

Table 1

Statistics of the correlation between protein disorder and proteomic parameters.

SpeciesS-phoT-phoY-phoO-glyN-glyPESTUbiTM
Green algaeO. sp. RCC8090.9760.9610.8670.968−0.7050.9530.993−0.830
2 × 10−53 × 10−43 × 10−34 × 10−53 × 10−22 × 10−34 × 10−75 × 10−3
3 × 10−51 × 10−35 × 10−34 × 10−53 × 10−22 × 10−39 × 10−76 × 10−3
O. tauri0.9670.9580.8950.973−0.3330.9990.994−0.803
4 × 10−53 × 10−41 × 10−33 × 10−52 × 10−16 × 10−73 × 10−78 × 10−3
4 × 10−51 × 10−33 × 10−33 × 10−52 × 10−11 × 10−57 × 10−79 × 10−3
M. sp. RCC2990.9870.9770.8910.992−0.8690.9820.992−0.958
3 × 10−68 × 10−51 × 10−35 × 10−73 × 10−32 × 10−48 × 10−79 × 10−5
2 × 10−55 × 10−43 × 10−33 × 10−64 × 10−34 × 10−42 × 10−62 × 10−4
M. pusilla0.9850.9790.7010.992−0.9700.9940.990−0.976
4 × 10−66 × 10−53 × 10−27 × 10−73 × 10−53 × 10−51 × 10−62 × 10−5
2 × 10−56 × 10−43 × 10−23 × 10−62 × 10−41 × 10−42 × 10−65 × 10−5
C. sp. C-1690.9800.9350.8310.988−0.9800.9700.968−0.950
1 × 10−51 × 10−35 × 10−32 × 10−61 × 10−57 × 10−44 × 10−52 × 10−4
3 × 10−51 × 10−38 × 10−35 × 10−62 × 10−49 × 10−45 × 10−53 × 10−4
C. vulgaris0.9740.9290.8460.986−0.9670.9710.976−0.966
2 × 10−51 × 10−34 × 10−33 × 10−64 × 10−56 × 10−42 × 10−55 × 10−5
3 × 10−51 × 10−36 × 10−37 × 10−62 × 10−49 × 10−42 × 10−51 × 10−4
V. carteri0.9820.9080.5780.958−0.8830.9650.910−0.985
7 × 10−62 × 10−37 × 10−29 × 10−52 × 10−39 × 10−48 × 10−44 × 10−6
2 × 10−52 × 10−37 × 10−29 × 10−54 × 10−31 × 10−38 × 10−43 × 10−5
C. reinhardtii0.9760.8860.8790.987−0.8050.9620.967−0.987
2 × 10−54 × 10−32 × 10−32 × 10−68 × 10−31 × 10−34 × 10−53 × 10−6
3 × 10−54 × 10−34 × 10−35 × 10−61 × 10−21 × 10−35 × 10−53 × 10−5
OomycetesP. ramorum0.9740.9670.9030.991−0.9710.9950.995−0.941
2 × 10−52 × 10−41 × 10−31 × 10−63 × 10−52 × 10−51 × 10−72 × 10−4
3 × 10−59 × 10−43 × 10−33 × 10−63 × 10−49 × 10−56 × 10−73 × 10−4
P. sojae0.9750.9320.9760.985−0.8900.9960.994−0.948
2 × 10−51 × 10−32 × 10−55 × 10−62 × 10−31 × 10−52 × 10−72 × 10−4
3 × 10−51 × 10−32 × 10−48 × 10−63 × 10−36 × 10−57 × 10−73 × 10−4
P. infestans0.9720.9470.9820.995−0.5330.9840.996−0.960
3 × 10−56 × 10−48 × 10−61 × 10−79 × 10−22 × 10−47 × 10−88 × 10−5
3 × 10−51 × 10−32 × 10−41 × 10−61 × 10−14 × 10−45 × 10−72 × 10−4
P. capsici0.9710.9530.9230.990−0.4620.9660.995−0.943
3 × 10−54 × 10−45 × 10−41 × 10−61 × 10−19 × 10−42 × 10−72 × 10−4
3 × 10−51 × 10−32 × 10−33 × 10−61 × 10−11 × 10−37 × 10−73 × 10−4
DiatomP. tricornutum0.9770.9500.8490.9960.9000.9980.996−0.935
1 × 10−55 × 10−44 × 10−36 × 10−81 × 10−32 × 10−69 × 10−83 × 10−4
3 × 10−51 × 10−36 × 10−31 × 10−63 × 10−32 × 10−54 × 10−74 × 10−4
F. cylindrus0.9770.9400.9270.9820.9190.9720.998−0.977
1 × 10−58 × 10−45 × 10−47 × 10−66 × 10−46 × 10−42 × 10−82 × 10−5
3 × 10−51 × 10−32 × 10−31 × 10−52 × 10−31 × 10−34 × 10−76 × 10−5
T. pseudonana0.9790.9470.7890.9930.7650.9970.991−0.981
1 × 10−56 × 10−41 × 10−25 × 10−71 × 10−27 × 10−69 × 10−79 × 10−6
3 × 10−51 × 10−31 × 10−23 × 10−62 × 10−25 × 10−52 × 10−64 × 10−5
Yellow algaeA. anophagefferens0.9750.9370.3860.979−0.9510.9890.948−0.991
2 × 10−59 × 10−42 × 10−11 × 10−51 × 10−49 × 10−52 × 10−49 × 10−7
3 × 10−51 × 10−32 × 10−12 × 10−56 × 10−42 × 10−42 × 10−42 × 10−5
E. siliculosus0.9930.9650.3810.978−0.8750.9910.982−0.931
5 × 10−72 × 10−42 × 10−11 × 10−52 × 10−36 × 10−57 × 10−64 × 10−4
5 × 10−68 × 10−42 × 10−12 × 10−54 × 10−32 × 10−41 × 10−55 × 10−4
Red algaeC. merolae0.9710.9890.9650.968−0.0860.9700.967−0.936
3 × 10−51 × 10−55 × 10−54 × 10−54 × 10−17 × 10−44 × 10−53 × 10−4
3 × 10−53 × 10−43 × 10−44 × 10−54 × 10−19 × 10−45 × 10−54 × 10−4
P. yezoensis0.9950.9370.9070.980−0.9420.9480.947−0.731
1 × 10−79 × 10−49 × 10−41 × 10−52 × 10−42 × 10−32 × 10−42 × 10−2
2 × 10−61 × 10−33 × 10−32 × 10−58 × 10−42 × 10−32 × 10−42 × 10−2
P. purpureum0.9830.9230.8270.973−0.3780.9840.997−0.951
6 × 10−61 × 10−36 × 10−32 × 10−52 × 10−12 × 10−42 × 10−81 × 10−4
2 × 10−52 × 10−38 × 10−33 × 10−52 × 10−14 × 10−42 × 10−73 × 10−4

Pearson correlation coefficients, their associated p values, and false discovery rates (Benjamini–Hochberg procedure) are presented in the upper, middle and lower rows, respectively, for all analyzed correlations between IDRs and proteomic parameters. Shaded values indicate p values greater than 0.05 that were not considered statistically significant. The column names stand for S phosphorylation, T phosphorylation, Y phosphorylation, N-linked glycosylation, O-linked glycosylation, region enriched in proline (P), glutamic acid (E), serine (S), and threonine (T) residues (PEST), ubiquitination, and transmembrane helices.

Correlations between disordered protein content and the number of phosphorylation sites. (A) Normalized predicted Ser (S), Thr (T) and Tyr (Y) phosphorylation sites per 400 amino acids in the studied algae proteomes; The relative percentage of disordered protein content with different numbers of predicted S, T and Y phosphorylation sites are presented in (B–D), respectively. Statistics of the correlation between protein disorder and proteomic parameters. Pearson correlation coefficients, their associated p values, and false discovery rates (Benjamini–Hochberg procedure) are presented in the upper, middle and lower rows, respectively, for all analyzed correlations between IDRs and proteomic parameters. Shaded values indicate p values greater than 0.05 that were not considered statistically significant. The column names stand for S phosphorylation, T phosphorylation, Y phosphorylation, N-linked glycosylation, O-linked glycosylation, region enriched in proline (P), glutamic acid (E), serine (S), and threonine (T) residues (PEST), ubiquitination, and transmembrane helices.

2.3. Correlation between Intrinsic Disorder and Glycosylation

Protein glycosylation is one of the most common PTMs where the attachment of glycans to specific residues affects protein folding, localization, and stability. Glycosylated proteins play an important role in essential biological functions such as immunogenicity, catalytic activity, viral clearance, and ligand-receptor interactions [34,35,36,37]. Two major types of protein glycosylation exist: O-linked and N-linked. O-glycosylation occurs frequently in eukaryotic cells and is involved in many basic cellular functions. However, significant differences in O-glycosylation patterns between plants and animals have been described, including the sites of glycan addition and glycan composition. O-glycosylation in plants mainly occurs on hydroxylproline (Hyp) residues [38]. The freely available NetOGlyc tool can be used to predict O-glycosylation sites in mammalian proteins [39]. However, there is currently no available prediction tool for O-glycosylation sites for plant proteins. Therefore, in the present study, we designed an original program to predict Hyp-O-linked glycosylation sites utilizing the consensus sequence for Hyp-O-linked glycosylation previously reported for plants [40]. The average abundance of O-glycosylations sites varied in the studied algae proteomes from approximately 0.8 to more than 1.7 sites per protein (Figure 3A). Additionally, we observed that the disordered protein content was positively correlated with the number of O-glycosylation sites (from 0 to ≥7) in every algae proteome analyzed (Figure 3B). The correlation between IDRs and O-glycosylation had high correlation coefficients and were found to be statistically significant (Table 1). The positive correlation between disordered protein content and the number of O-glycosylation sites was confirmed by using the alternative disorder prediction tool, RONN (Figure S4A).
Figure 3

Correlation between disordered protein content and glycosylation sites. Normalized predicted O-glycosylation and N-glycosylation sites per 400 amino acids in the studied algae proteomes are presented in (A,C), respectively; The relative percentage of disordered protein content with different numbers of predicted O- and N-glycosylation sites are presented in (B,D), respectively.

Correlation between disordered protein content and glycosylation sites. Normalized predicted O-glycosylation and N-glycosylation sites per 400 amino acids in the studied algae proteomes are presented in (A,C), respectively; The relative percentage of disordered protein content with different numbers of predicted O- and N-glycosylation sites are presented in (B,D), respectively. Another major type of glycosylation, N-linked glycosylation, is arguably the most conserved form of protein glycosylation in eukaryotes [41]. As in other eukaryotic cells, N-linked glycosylation occurs frequently in plant cells and it has been shown that eukaryotic N-glycoproteins have invariant sequence recognition patterns, structural constraints, and subcellular localization [42]. N-linked glycosylation occurs on many secreted and membrane-bound glycoproteins as a co-translational process through the attachment to an asparagine (N) at the consensus motif asparagine-X-serine/threonine (NXS/T), in which X is any amino acid except proline [43]. For N-glycosylation site prediction, we used the prediction tool NetNglyc1.0 [44], which is noted to predict N-glycosylation sites in human proteins. However, the N-linked glycosylation pathway in algae shares a high degree of homology with that in other eukaryotic organisms [40,45]. Therefore we predicted N-linked glycosylation sites in the 20 algae proteomes by combining the results of the NetNglyc1.0 algorithm, and the existence of signal peptides by SignalP [46] and transmembrane regions by TMHMM [47] (Figure 3C). N-glycosylation correlated negatively with the disordered protein content in most of the studied algae proteomes. However, the results of the diatoms showed positive correlations between disordered protein content and N-glycosylation (Figure 3D, Table 1). These results were confirmed using the alternative disorder prediction tool, RONN (Figure S4B).

2.4. Correlation between Intrinsic Disorder and Ubiquitination and PEST (Region Enriched in Proline (P), Glutamic Acid (E), Serine (S), and Threonine (T) Residues)

Ubiquitination sites and PEST regions within a protein sequence are related to protein degradation [48]. We used the freely available UbPred tool for ubiquitination site prediction and epestfind tool for PEST region prediction [49,50]. The content of ubiquitination sites varied from approximately 0.4 to 1.4 sites per 400 amino acids, and the content of PEST regions varied from approximately 0.2 to 0.5 sites per 400 amino acids in the analyzed algae proteomes (Figure 4A,C). We observed that the predicted presence of both ubiquitination sites and PEST regions were statistically positively correlated to disordered protein content in all algae proteomes (Figure 4B,D) with high correlation coefficients (Table 1). The observed correlations of ubiquitination and PEST with protein disorder were confirmed with the alternative disorder prediction tool, RONN (Figure S5).
Figure 4

Correlation between disordered protein content and ubiquitination sites or PEST regions. Normalized predicted ubiquitination sites and PEST regions per 400 amino acids in the studied algae proteomes are presented in (A,C), respectively; Relative percentage of disordered protein content with different numbers of predicted sites of ubiquitination and predicted regions of PEST are presented in (B,D), respectively.

Correlation between disordered protein content and ubiquitination sites or PEST regions. Normalized predicted ubiquitination sites and PEST regions per 400 amino acids in the studied algae proteomes are presented in (A,C), respectively; Relative percentage of disordered protein content with different numbers of predicted sites of ubiquitination and predicted regions of PEST are presented in (B,D), respectively.

2.5. Correlation of Intrinsic Disorder with Transmembrane Helices

Transmembrane helices in proteins play an important role in the transport of various substances across biological membranes. In the present study, we used the freely available TMHMM tool for transmembrane helix prediction. The content of transmembrane helices varied in the analyzed algae proteomes from approximately 0.6 to 0.8 regions per 400 amino acids (Figure 5A). A negative correlation was observed between the number of predicted transmembrane helices and disordered protein content in all algae proteomes (Figure 5B). The observed correlations of the number of transmembrane helices with disordered protein content were confirmed with the alternative disorder prediction tool, RONN (Figure S6).
Figure 5

Correlation between disordered protein content and transmembrane regions. Normalized predicted transmembrane helices per 400 amino acids in the studied algae proteomes are presented in (A); Relative percentage of disordered protein content with different numbers of predicted transmembrane helices are presented in (B).

Correlation between disordered protein content and transmembrane regions. Normalized predicted transmembrane helices per 400 amino acids in the studied algae proteomes are presented in (A); Relative percentage of disordered protein content with different numbers of predicted transmembrane helices are presented in (B).

2.6. Relative Content of PTMs (Post-Translational Modifications) in Ordered and Disordered Regions of Algae Proteins

To further analyze the correlation between PTM sites and disordered regions, we determined the relative abundance of specific PTM sites including those for phosphorylation, O-glycosylation, and ubiquitination, in ordered or disordered regions of the algae proteomes. In accordance with a previous analysis [21], we used the relative abundance values for analysis, which are presented as a ratio of normalized PTM content in disordered and ordered segments (Rd/o) of algae proteomes (for details, see Experimental Section 4.3). This approach allowed us to estimate the robustness of the observed correlations between PTM sites and disordered regions. This analysis focused on site-specific PTMs rather than region-specific PTMs such as PEST and transmembrane helix content. Higher values of the Rd/o parameter were obtained for S- and T-phosphorylation as well as ubiquitination in the algae proteomes as a whole (18.8, 10.5, and 12.5, respectively). In contrast, the Rd/o parameter for Y-phosphorylation and N-glycosylation in total algae proteomes (1.3 and 1.2, respectively) indicated less robust relationships (Table 2). Incidentally, we found that oomycetes had the strongest correlation between phosphorylation and protein disorder compared to those of the other algae such as green algae, diatom, yellow algae, and red algae (Table 2).
Table 2

Relative post-translational modification (PTM) content in ordered and disordered regions of algae proteomes.

CategoryS-phoT-phoY-phoO-glyN-glyUbi
Green algae15.18.41.13.20.911.0
Oomycetes25.914.51.73.61.216.0
Diatom19.810.31.32.91.911.9
Yellow algae17.49.91.03.41.210.5
Red algae19.211.31.62.61.216.6
Total18.810.51.33.31.212.5

Ratios of normalized PTM contents (Rd/o) were calculated as described in the Experimental Section 4.3.

Relative post-translational modification (PTM) content in ordered and disordered regions of algae proteomes. Ratios of normalized PTM contents (Rd/o) were calculated as described in the Experimental Section 4.3.

2.7. Relative Disordered Protein Content and the Number of PTMs (Post-Translational Modifications) in Specific and Common Protein Clusters of Algae Proteomes

In a previous study, protein disordered regions were associated with higher amino acids substitution rates [51]. Therefore, disordered proteins may evolve faster and hence are less conserved than structured proteins [52,53]. As such, there should be differences in the disordered protein content and the number of PTMs between species-specific protein clusters and common protein clusters of algae proteomes. For these calculations, we used the OrthoMCL tool to classify common protein clusters involving all 20 algae species used in this study, and species-specific protein clusters involving just one algae species. In total, we analyzed 15,205 common protein clusters and 66,858 species-specific protein clusters. The content of each PTM parameter such as protein disorder, phosphorylation, glycosylation, and ubiquitination in species-specific protein clusters or in common protein clusters are shown in Table 3. The content of PTM parameters was 1.4 to 3.4 times higher in species-specific protein clusters than in common protein clusters, as expected. Notably, in green algae, diatom, and yellow algae, the occurrence of S/T-phosphorylation in species-specific protein clusters was more than three times higher than in common protein clusters (Table S2).
Table 3

Preference of protein disorder and PTM in species-specific protein clusters and common protein clusters of algae proteomes.

CategoryDisorderS-phoT-phoY-phoO-glyN-glyUbi
Specific34%2.11.00.71.42.11.0
Common20%0.60.30.50.81.30.6
Ratio of s/c1.73.43.21.41.81.61.8

Total values of each parameter in species-specific protein clusters and common protein clusters are presented. The one-tailed p values as determined with t-tests between species-specific protein clusters and common protein clusters were less than 0.05. Ratio of s/c = the value of specific/the value of common.

Preference of protein disorder and PTM in species-specific protein clusters and common protein clusters of algae proteomes. Total values of each parameter in species-specific protein clusters and common protein clusters are presented. The one-tailed p values as determined with t-tests between species-specific protein clusters and common protein clusters were less than 0.05. Ratio of s/c = the value of specific/the value of common.

3. Discussion

It was previously reported that disordered protein content was dependent on the length of the protein [54], while at the same time, disordered protein content is independent of proteome size [22]. In the current study, the degree of protein disorder was not associated with algae proteome size (Figure 1B), which was similar to a previous analysis [21]. Sequence redundancies before and after filtering were examined for every algae proteome (Figure S2), which were calculated as the number of removed sequences after filtering (subtracting the number of redundant sequences after filtering from that of before filtering) divided by the number before filtering. The numbers of sequences before and after filtering are given in Table S1. A strong positive correlation was observed between the content of redundant sequences and proteome size similar to previous results [21]. Additionally, it was found that the redundancy of amino acid sequences in the algae proteomes, which was approximately 1% to 27% (Figure S2), was lower as a whole than that in higher plants, which has been reported to be approximately 7% to 37% [21]. This implies that gene duplication occurs more often in higher plants than in algae, and that it is possible to obtain inaccurate results when sequences are not filtered for redundancy. In the current study, we systematically investigated disordered protein content, sites of multiple PTMs, and regions of PEST and transmembrane helices in algae proteomes. In addition, we investigated the correlation between disordered protein content and the number of PTM sites or regions of PEST and transmembrane helices in algae proteomes. Previous studies have reported on the content of protein disorder in many species. For example, in prokaryotes, varying disordered protein content in different thermal groups has been observed [55]. However, disordered protein content is much higher in eukaryotes than in prokaryotes [2,56,57]. In higher plants, disordered protein content has been reported to be higher in monocot proteomes than that in dicot proteomes [21]. However, in the algae proteome analysis herein, a trend in disordered protein content among the algae groups such as green algae, oomycetes, diatom, yellow algae, and red algae was not found. This may be due to the inclusion of algae from a wide range of groups. The disordered protein content ranged from approximately 20% to 35% (Figure 1A). Thus, a more detailed comparison of integral algae characteristics should be performed to understand the differences of disordered protein content. A previous study has suggested that surface accessibility of enzyme-mediated reversible PTMs is closely related to protein-protein interactions in disordered regions [58]. Accordingly, we found that disordered protein content positively correlated with the presence of predicted PTM sites in the algae proteomes (Table 2). Specifically, there were strong correlations between disordered protein content and Ser- and Thr-phosphorylation, O-glycosylation, ubiquitination, and PEST regions (Table 1). Prior human and plant proteome-wide analyses have revealed that phosphorylation sites are significantly enriched in disordered proteins [16,19,23]. Thus, the results of our proteome-wide analysis of algae are similar to those of other eukaryotic proteome-wide analysis. This suggests that the present algae analysis can be applied to other eukaryotic data sets to determine common features of protein disorder and PTMs. No trend was observed for the average abundance of phosphorylation sites between algae groups (Figure 2A). However, a positive correlation between the predicted number (from 0 to ≥7) of Ser, Thr, and Tyr phosphorylation sites and disordered protein content was observed for every algae proteome (Figure 2B–D). These results are in agreement with past analyses of higher plant species [21]. Therefore, we suggest that the abundance of phosphorylation in disordered proteins is common in various algae and land plants. We found that there was a low correlation between disordered protein content and Y-phosphorylation in algae, especially in yellow algae (Table 1). Similarly, small ratio values for Y-phosphorylation are presented in Table 2. Many PTMs including phosphorylation have been shown to occur in disordered regions, because IDRs are easily accessible and flexible. However, a recent study demonstrated that Y-phosphorylation does not tend to occur in disordered regions, which is in contrast to S- and T-phosphorylation [59]. Similarly, we found that Y-phosphorylation was predicted to occur preferentially in ordered regions. It has previously been reported that the O-linked glycosylation sites are predominantly located in the IDRs of many eukaryotic proteins such those in as Homo sapiens, Drosophila melanogaster, Caenorhabditis elegans, A. thaliana, Oryza sativa, and Schizosaccharomyces pombe [5,16]. In the present study, a strong positive correlation between protein disorder and O-glycosylation was also observed in algae (Figure 3B, Table 1 and Table 2), which is similar to a previous report [21]. The positive correlation between protein disorder and O-glycosylation was expected because O-glycosylation in plants occurs on the Pro in consensus sequences [A/S/T/V]-P(1,4)-X(0,10)-[A/S/T/V]-P(1,4) [40], in which amino acids tend to form disorder regions [1,4,5,6,7,8]. Hence, the positive correlation between protein disorder and O-glycosylation can be considered as a universal trend in eukaryotes, including algae. In contrast to O-glycosylation, it is known that N-glycosylation does not strongly correlate with IDR content because N-glycosylation generally occurs co-translationally before a protein is fully folded. As a result, there is a lack of structural preference for N-glycosylation. Accordingly, no clear structural preference has been reported for N-glycosylation in proteins [29,60]. The correlation analysis in this study also failed to reveal a strong association between N-glycosylation and protein disorder in algae (Figure 3D, Table 1 and Table 2). However, the analysis results of the diatoms showed positive correlations between disordered protein content and N-glycosylation. (Figure 3D, Table 1). N-glycosylation is thought to be related to folding and stability of proteins, the extracellular matrix and cell-adhesion molecules [61]. In addition, most diatoms exude polymers from a slit or apical pore field in the siliceous cell wall. The exuded polymers consist of extracellular matrices, and are assembled into a variety of structures, such as trails (material left behind during motility), sheaths (organic matrices tightly associated with the cell wall), capsules (organic matrices loosely associated with the cell walls), and stalks (permanent attachment structures) [62]. Therefore, the result of correlation analysis between N-glycosylation and protein disorder implies that the living environment and lifestyle of diatoms may be related to N-glycosylation and protein disorder. Ubiquitination is well known to be involved in protein degradation in eukaryotic cells. Similarly, PEST sequences represent a universal target for proteolytic degradation [48,63]. In several studies, IDR-related ubiquitination sites and regions of PEST within IDRs have been identified [4,64,65]. Therefore, we investigated the correlation between IDRs and the abundance of sequence motifs for ubiquitination and PEST regions. A strong positive correlation was observed between protein disorder and the predicted presence of both ubiquitination sites and PEST regions in all algae proteomes (Figure 4B,D and Table 1). These results provide further evidence that ubiquitination sites and PEST motifs preferentially occur in the flexible disordered regions of proteins including those in algae. Transmembrane helices in proteins are important in a variety of critical and diverse biological processes. It has been reported that disordered protein content and the abundance of transmembrane helices has an influence on protein expression and solubility [4,64,65]. Therefore, we investigated the relation between the IDR content and the presence of transmembrane helices. A strong negative correlation was observed between the predicted presence of transmembrane helices and protein disorder in all algae proteomes (Figure 5B). This result was expected because transmembrane helices commonly consist of hydrophobic residues, whereas disordered regions primarily contain hydrophilic residues. Thus, the function of transmembrane helices is associated with structural stability rather than flexibility. To further analyze correlations between PTMs sites and IDRs, we determined the relative abundances of site-specific PTMs sites including phosphorylation, O-glycosylation, and ubiquitination in ordered regions and disordered regions of algae proteomes. As a result, we observed that most values of Rd/o are over 1.0 indicating that PTMs primarily occur in the disordered regions (Table 2). However, Y-phosphorylation and N-glycosylation did not have a preference for disordered regions whereas S- and T-phosphorylation and ubiquitination had a strong preference for disordered regions. Previous studies have reported that disordered regions are associated with higher amino acids substitution rates [51]. By comparing protein-protein interactions that occur in humans, flies, and yeast, it was found that interactions in disordered regions were significantly less conserved than in ordered regions [52,53]. Additionally, while evolutionarily conserved sequences within disordered regions are important for function, these conserved sequences account for only 5% of the disordered regions [66]. Therefore, disordered regions are less conserved than structured regions. In the present study, the association between protein disorder and PTM sites was higher in species-specific protein clusters than in common protein clusters in the algae proteomes as a whole (Table 3). Taken together, these results indicate that disordered regions, PTMs sites, and species-specific protein sequences are significantly related in algae proteomes.

4. Experimental Section

4.1. Data Sets

We used the following proteome sequence sets: for green algae, Ostreococcus sp. RCC809 (O. sp. RCC809) [67], Ostreococcus tauri (O. tauri) [67], Micromonas sp. RCC299 (M. sp. RCC299) [68], Micromonas pusilla CCMP1545 (M. pusilla) [68], Coccomyxa sp. C-169 (C. sp. C-169) [69], Chlorella vulgaris (C. vulgaris) [70], Volvox carteri (V. carteri) [71], and Chlamydomonas reinhardtii (C. reinhardtii) [72]; for oomycetes, Phytophthora ramorum (P. ramorum) [73], Phytophthora sojae (P. sojae) [73], Phytophthora infestans T30-4 (P. infestans) [74], and Phytophthora capsici (P. capsici) [75]; for diatom, Phaeodactylum tricornutum CCAP1055/1 (P. tricornutum) [76], Fragilariopsis cylindrus CCMP1102 (F. cylindrus), and Thalassiosira pseudonana CCMP1335 (T. pseudonana) [77]; for yellow algae, Aureococcus anophagefferens (A. anophagefferens) [78] and Ectocarpus siliculosus (E. siliculosus) [79]; and for red algae, Cyanidioschyzon merolae (C. merolae) [80], Pyropia yezoensis (P. yezoensis) [81], and Porphyridium purpureum (P. purpureum) [82]. The proteome sequences of M. sp. RCC299, M. pusilla, C. sp. C-169, and V. carteri were retrieved from Phytozome (http://www.phytozome.net) [83]. O. sp. RCC809, O. tauri, C. vulgaris, C. reinhardtii, P. ramorum, P. sojae, P. infestans, P. capsici, P. tricornutum, F. cylindrus, T. pseudonana, A. anophagefferens, E. siliculosus, P. yezoensis, and P. purpureum were retrieved from the Genome Portal of the Department of Energy Joint Genome Institute (http://genome.jgi-psf.org) [84]. C. merolae proteome sequence was obtained from the Cyanidioschyzon merolae Genome Project (http://merolae.biol.s.u-tokyo.ac.jp) [80]. Subsequently, we prepared non-redundant protein sequence sets. First, to construct the protein sequence datasets, amino acid sequences of less than 50 and more than 4000 amino acids in length were filtered out for impartial analysis. Next, we made protein clusters calculated with the OrthoMCL tool (version 1.4) [30] for the above 20 algae whole protein sequence sets. The threshold options of sequence alignments in OrthoMCL are “pi_cut off = 90%”, “pmatch_cut off = 90%”, and “pv_cut off = 1 × 10−30”. Finally, we obtained non-redundant sequence sets from the clustering result of OrthoMCL to extract representative sequences of each cluster and singlet sequences.

4.2. Prediction of Multiple Properties of Protein Sequences

We used prediction tools for multiple calculations on the proteome sequences of the above 20 algae. To calculate intrinsic disorder in proteins, DISOPRED (version 2.4.2) [31] and RONN (version 3) [32] were employed. The disordered protein content of each protein was obtained by dividing the number of disordered amino acid residues by the protein length. We analyzed the sites of Ser, Thr, and Tyr phosphorylation, N-linked Asn glycosylation, O-linked Pro glycosylation, and ubiquitination. Moreover, the regions of transmembrane helix domains and PEST regions were investigated. These PTMs and regions were predicted using the following bioinformatics algorithm or tools. Sites of phosphorylation were predicted with Musite (version 1.0.1) [17] downloaded from http://musite.sourceforge.net, which was used to predict phosphorylation sites in the target algae proteomes with “Eukaryote-General-Ser-Thr; Eukaryote-General-Tyr” rather than with “A. thaliana-General-Ser-Thr” as a prediction model. This was because oomycetes, which are regarded as colorless algae [85], were included in the algae species set in this analysis and the A. thaliana model did not contain Tyr-phosphorylation information. Sites of O-glycosylation were predicted based on the previously reported consensus sequence [A/S/T/V]-P(1,4)-X(0,10)-[A/S/T/V]-P(1,4) [40,45] for plant proteins containing sites of O-linked Pro glycosylation. While there is currently no prediction tool to detect O-glycosylation sites in plants, we made an original Perl script utilizing the aforementioned consensus sequences. Sites of N-glycosylation were predicted with the NetNGlyc tool (version 1.0) [44]. The N-glycosylation sites were predicted by combining the results of the NetNglyc tool, and the existence of signal peptides by SignalP (version 4.0) [46] and transmembrane regions by TMHMM (version 2.0) [47]. Sites of ubiquitination were predicted with the UbPred tool [86] downloaded from http://ubpred.org. Sites predicted with medium confidence (score range 0.69 ≤ s ≤ 0.84) were considered valid ubiquitination sites from the UbPred prediction results. Transmembrane helix regions were predicted with TMHMM (version 2.0) [47], and PEST regions were predicted with the epestfind tool of EMBOSS [49,50]. We applied the TMHMM and epestfind tools with default runtime parameters. The numbers of predicted PTM sites or regions (transmembrane helix and PEST regions) in proteins were normalized to the uniform length of 400 amino acids, rather than per sequence, to account for the difference in the average protein lengths in the datasets.

4.3. Relative Content of PTMs in Ordered and Disordered Segments

The tendency of specific PTM sites to occur in disordered and ordered region of each algae proteome was calculated by using appropriate algorithms following described methods [21]. Specifically, in accordance with a past analysis, the relative abundance of a specific PTM in the ordered and disordered segments of algae proteomes was analyzed using the following ratio: Rd/o = Nd/Ld:No/Lo, where No is the total number of PTM sites in the ordered segment of a proteome, Lo is the length of the ordered proteome segment, Nd is the total number of PTM sites in the disordered segment of a proteome, and Ld is the length of the disordered proteome segment. By this definition, the Rd/o value equals 1 if the relative abundances of a PTM in the ordered and disordered regions are the same. Furthermore, a value of >1 indicates when a PTM preferentially occurs in disordered regions, and a value of <1 indicates if a PTM tends to occur in ordered regions.

4.4. Classification of Species-Specific Proteins and Common Proteins of Algae Proteomes

The protein sequences were analyzed using the OrthoMCL tool [30] to classify species-specific and common proteins among the 20 algae proteomes. Pairwise sequence similarities between all protein sequences were calculated with BLASTP using an e-value cutoff of 1 × 10−5. Using these results, protein clusters that were equivalent to orthologous groups were estimated using the Markov clustering algorithm employed in OrthoMCL with the default runtime parameters. In this study, a singlet and a cluster consisting of only one species were regarded as a species-specific protein. In contrast, a cluster which consisted of all 20 species, was regarded as a common protein.

5. Conclusions

This analysis is the first large-scale bioinformatics study of IDRs in algae proteomes, which are considered common ancestors of plants. Our findings regarding the correlation between disordered protein content and multiple PTM sites, PEST, and transmembrane helices in algae proteomes may be useful for investigations into the unsolved biological roles of these proteins, and for understanding algae evolution. Information resources are currently being developed for various plant species from comprehensive analyses such as those in genetics and metabolomics [87,88,89,90,91,92,93]. As such, we aim to perform an integrative analysis of protein properties combined with the above proteomic data, and expect to reveal relationships between proteomics and other biological features in the future.
  89 in total

1.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes.

Authors:  A Krogh; B Larsson; G von Heijne; E L Sonnhammer
Journal:  J Mol Biol       Date:  2001-01-19       Impact factor: 5.469

2.  Musite, a tool for global prediction of general and kinase-specific phosphorylation sites.

Authors:  Jianjiong Gao; Jay J Thelen; A Keith Dunker; Dong Xu
Journal:  Mol Cell Proteomics       Date:  2010-08-11       Impact factor: 5.911

3.  Niche of harmful alga Aureococcus anophagefferens revealed through ecogenomics.

Authors:  Christopher J Gobler; Dianna L Berry; Sonya T Dyhrman; Steven W Wilhelm; Asaf Salamov; Alexei V Lobanov; Yan Zhang; Jackie L Collier; Louie L Wurch; Adam B Kustka; Brian D Dill; Manesh Shah; Nathan C VerBerkmoes; Alan Kuo; Astrid Terry; Jasmyn Pangilinan; Erika A Lindquist; Susan Lucas; Ian T Paulsen; Theresa K Hattenrath-Lehmann; Stephanie C Talmage; Elyse A Walker; Florian Koch; Amanda M Burson; Maria Alejandra Marcoval; Ying-Zhong Tang; Gary R Lecleir; Kathryn J Coyne; Gry M Berg; Erin M Bertrand; Mak A Saito; Vadim N Gladyshev; Igor V Grigoriev
Journal:  Proc Natl Acad Sci U S A       Date:  2011-02-23       Impact factor: 11.205

4.  Complete nucleotide sequence of the chloroplast genome from the green alga Chlorella vulgaris: the existence of genes possibly involved in chloroplast division.

Authors:  T Wakasugi; T Nagai; M Kapoor; M Sugita; M Ito; S Ito; J Tsudzuki; K Nakashima; T Tsudzuki; Y Suzuki; A Hamada; T Ohta; A Inamura; K Yoshinaga; M Sugiura
Journal:  Proc Natl Acad Sci U S A       Date:  1997-05-27       Impact factor: 11.205

5.  Extracellular Matrix Assembly in Diatoms (Bacillariophyceae) (I. A Model of Adhesives Based on Chemical Characterization and Localization of Polysaccharides from the Marine Diatom Achnanthes longipes and Other Diatoms).

Authors:  B. A. Wustman; M. R. Gretz; K. D. Hoagland
Journal:  Plant Physiol       Date:  1997-04       Impact factor: 8.340

6.  Prediction and functional analysis of native disorder in proteins from the three kingdoms of life.

Authors:  J J Ward; J S Sodhi; L J McGuffin; B F Buxton; D T Jones
Journal:  J Mol Biol       Date:  2004-03-26       Impact factor: 5.469

7.  Surface accessibility of protein post-translational modifications.

Authors:  Chi Nam Ignatius Pang; Andrew Hayen; Marc Ronald Wilkins
Journal:  J Proteome Res       Date:  2007-04-12       Impact factor: 4.466

8.  Phytophthora genome sequences uncover evolutionary origins and mechanisms of pathogenesis.

Authors:  Brett M Tyler; Sucheta Tripathy; Xuemin Zhang; Paramvir Dehal; Rays H Y Jiang; Andrea Aerts; Felipe D Arredondo; Laura Baxter; Douda Bensasson; Jim L Beynon; Jarrod Chapman; Cynthia M B Damasceno; Anne E Dorrance; Daolong Dou; Allan W Dickerman; Inna L Dubchak; Matteo Garbelotto; Mark Gijzen; Stuart G Gordon; Francine Govers; Niklaus J Grunwald; Wayne Huang; Kelly L Ivors; Richard W Jones; Sophien Kamoun; Konstantinos Krampis; Kurt H Lamour; Mi-Kyung Lee; W Hayes McDonald; Mónica Medina; Harold J G Meijer; Eric K Nordberg; Donald J Maclean; Manuel D Ospina-Giraldo; Paul F Morris; Vipaporn Phuntumart; Nicholas H Putnam; Sam Rash; Jocelyn K C Rose; Yasuko Sakihama; Asaf A Salamov; Alon Savidor; Chantel F Scheuring; Brian M Smith; Bruno W S Sobral; Astrid Terry; Trudy A Torto-Alalibo; Joe Win; Zhanyou Xu; Hongbin Zhang; Igor V Grigoriev; Daniel S Rokhsar; Jeffrey L Boore
Journal:  Science       Date:  2006-09-01       Impact factor: 47.728

9.  Single-residue posttranslational modification sites at the N-terminus, C-terminus or in-between: To be or not to be exposed for enzyme access.

Authors:  Fernanda L Sirota; Sebastian Maurer-Stroh; Birgit Eisenhaber; Frank Eisenhaber
Journal:  Proteomics       Date:  2015-07       Impact factor: 3.984

10.  Genome-wide analysis of protein disorder in Arabidopsis thaliana: implications for plant environmental adaptation.

Authors:  Natalia Pietrosemoli; Juan A García-Martín; Roberto Solano; Florencio Pazos
Journal:  PLoS One       Date:  2013-02-07       Impact factor: 3.240

View more
  10 in total

Review 1.  The Structural and Functional Diversity of Intrinsically Disordered Regions in Transmembrane Proteins.

Authors:  Rajeswari Appadurai; Vladimir N Uversky; Anand Srivastava
Journal:  J Membr Biol       Date:  2019-05-28       Impact factor: 1.843

2.  Pipeline to Identify Hydroxyproline-Rich Glycoproteins.

Authors:  Kim L Johnson; Andrew M Cassin; Andrew Lonsdale; Antony Bacic; Monika S Doblin; Carolyn J Schultz
Journal:  Plant Physiol       Date:  2017-04-26       Impact factor: 8.340

3.  Insights into the Evolution of Hydroxyproline-Rich Glycoproteins from 1000 Plant Transcriptomes.

Authors:  Kim L Johnson; Andrew M Cassin; Andrew Lonsdale; Gane Ka-Shu Wong; Douglas E Soltis; Nicholas W Miles; Michael Melkonian; Barbara Melkonian; Michael K Deyholos; James Leebens-Mack; Carl J Rothfels; Dennis W Stevenson; Sean W Graham; Xumin Wang; Shuangxiu Wu; J Chris Pires; Patrick P Edger; Eric J Carpenter; Antony Bacic; Monika S Doblin; Carolyn J Schultz
Journal:  Plant Physiol       Date:  2017-04-26       Impact factor: 8.340

4.  Presence and structure-activity relationship of intrinsically disordered regions across mucins.

Authors:  Joseph Carmicheal; Pranita Atri; Sunandini Sharma; Sushil Kumar; Ramakanth Chirravuri Venkata; Prakash Kulkarni; Ravi Salgia; Dario Ghersi; Sukhwinder Kaur; Surinder K Batra
Journal:  FASEB J       Date:  2020-01-05       Impact factor: 5.191

5.  Polymorphism and structure of style-specific arabinogalactan proteins as determinants of pollen tube growth in Nicotiana.

Authors:  Andrzej K Noyszewski; Yi-Cheng Liu; Koichiro Tamura; Alan G Smith
Journal:  BMC Evol Biol       Date:  2017-08-10       Impact factor: 3.260

Review 6.  Intrinsic Disorder and Posttranslational Modifications: The Darker Side of the Biological Dark Matter.

Authors:  April L Darling; Vladimir N Uversky
Journal:  Front Genet       Date:  2018-05-04       Impact factor: 4.599

7.  How Honey Bee Vitellogenin Holds Lipid Cargo: A Role for the C-Terminal.

Authors:  Vilde Leipart; Øyvind Halskau; Gro V Amdam
Journal:  Front Mol Biosci       Date:  2022-06-09

8.  Unfoldomics of prostate cancer: on the abundance and roles of intrinsically disordered proteins in prostate cancer.

Authors:  Kevin S Landau; Insung Na; Ryan O Schenck; Vladimir N Uversky
Journal:  Asian J Androl       Date:  2016 Sep-Oct       Impact factor: 3.285

9.  Intrinsic protein disorder reduces small-scale gene duplicability.

Authors:  Sanghita Banerjee; Felix Feyertag; David Alvarez-Ponce
Journal:  DNA Res       Date:  2017-08-01       Impact factor: 4.458

10.  Exploring intrinsically disordered proteins in Chlamydomonas reinhardtii.

Authors:  Yizhi Zhang; Hélène Launay; Antoine Schramm; Régine Lebrun; Brigitte Gontero
Journal:  Sci Rep       Date:  2018-05-01       Impact factor: 4.379

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.