| Literature DB >> 27046079 |
Brett Trost1,2, Anthony Kusalik1, Scott Napper2,3.
Abstract
Protein kinase-mediated phosphorylation is among the most important post-translational modifications. However, few phosphorylation sites have been experimentally identified for most species, making it difficult to determine the degree to which phosphorylation sites are conserved. The goal of this study was to use computational methods to characterize the conservation of human phosphorylation sites in a wide variety of eukaryotes. Using experimentally-determined human sites as input, homologous phosphorylation sites were predicted in all 432 eukaryotes for which complete proteomes were available. For each pair of species, we calculated phosphorylation site conservation as the number of phosphorylation sites found in both species divided by the number found in at least one of the two species. A clustering of the species based on this conservation measure was concordant with phylogenies based on traditional genomic measures. For a subset of the 432 species, phosphorylation site conservation was compared to conservation of both protein kinases and proteins in general. Protein kinases exhibited the highest degree of conservation, while general proteins were less conserved and phosphorylation sites were least conserved. Although preliminary, these data tentatively suggest that variation in phosphorylation sites may play a larger role in explaining phenotypic differences among organisms than differences in the complements of protein kinases or general proteins.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27046079 PMCID: PMC4821552 DOI: 10.1371/journal.pone.0152809
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Conservation of phosphorylation sites (upper triangle) and protein kinases (lower triangle) among 20 species from a diverse range of lineages.
For a given pair of species, the values represent the percentage of phosphorylation sites or protein kinases found in at least one of the two species that were found in both species (the values C(S, S) described in the text). The cells are colored based on the value within; the closer the value is to 100, the brighter the shade of red.
Fig 2Comparison between the taxonomy of the 20 species described in the Materials and Methods section according to the National Center for Biotechnology Information (NCBI) Taxonomy Browser (panel A), and the dendrogram generated based on the phosphorylation site conservation of pairs of species (panel B).
The scale applies to panel B only, with the branch lengths representing values of C(S, S). The species names are color-coded based on lineage: red, mammals; pink, insects; blue, fish; green, plants; purple, birds; orange, arachnids; yellow, nematodes; black, others (single-celled organisms of different lineages).
Comparison between the values of C(S, S) generated using Method 1 and those generated using the other three methods (X = 2, 3, and 4).
| Degree | Directionality | |||||||
|---|---|---|---|---|---|---|---|---|
| Method # ( | Mean | Median | Range | Sthev | Mean | Median | Range | Sthev |
| 2 | 2.7 | 3 | [0, 19] | 1.9 | 2.1 | 2 | [−12, 19] | 2.6 |
| 3 | 3.8 | 4 | [0, 18] | 2.6 | 0.8 | 1 | [−11, 18] | 4.5 |
| 4 | 4.8 | 5 | [0, 26] | 3.0 | 1.9 | 3 | [−12, 26] | 5.3 |
See the Materials and Methods for descriptions of each method. The “degree” section of the table represents the degree to which the values of C(S, S) obtained using Method 1 differ from those obtained using Methods 2, 3, and 4, and is calculated as |C1(S, S)−C(S, S)|. The “directionality” section represents whether Method 1 tends to indicate more or less phosphorylation site conservation between pairs of species than the other methods (a positive mean and median indicates greater conservation, and vice versa), and is calculated as C1(S, S)−C(S, S).
Fig 3Conservation of proteins among 20 species from a diverse range of lineages.
For a given pair of species, the values represent the percentage of proteins found in at least one of the two species that were found in both species. The cells are colored based on the value within; the closer the value is to 100, the brighter the shade of red.