| Literature DB >> 19033273 |
Francesca Diella1, Sophie Chabanis, Katja Luck, Claudia Chica, Chenna Ramu, Claus Nerlov, Toby J Gibson.
Abstract
MOTIVATION: We noted that the sumoylation site in C/EBP homologues is conserved beyond the canonical consensus sequence for sumoylation. Therefore, we investigated whether this pattern might define a more general protein motif.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19033273 PMCID: PMC2638927 DOI: 10.1093/bioinformatics/btn594
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.The KEPE motif in C/EBP transcription factors. (A) The IUPred plot predicts human C/EBPα to be almost entirely natively disordered (the higher the peak, the more disordered). Like the KEPE motif, the leucine zipper (BRLZ) is also predicted as natively disordered (correctly so, since it must dimerise as coiled coil to acquire a stable folded structure). (B) C/EBP KEPE motif: an alignment of the RDM motif from Drosophila C/EBP and four vertebrate paralogues C/EBPα, -β, -δ and -ε (CEBPA, CEBPB, CEBPD and CEBPE) show the conservation of the motif (K is the sumoylated residue).
Enrichment of KEPE motif matches with various term combinations from the KW and/or GO term fields in Swiss-Prot entries
| Keywords | No. | Total | Total | ||
|---|---|---|---|---|---|
| KEPE | H | S | H | S | |
| Homo sapiens | 331 | 18 054 | 7707 | – | – |
| GO=cytoplasm*!nucleus | 35 | 2303 | 1146 | 2.45E−01 | 2.67E–02 |
| or KW=cytoplasm*!nucleus* | |||||
| GO=nuclear*|nucleus| | 168 | 3608 | 1789 | 2.10E−36 | 9.41E–29 |
| nucleolus!cytoplasm* or | |||||
| KW=nuclear*!cytoplasm* | |||||
| |nucleus!cytoplasm* | |||||
| Link=znf* | 79 | 1577 | 799 | 6.82E−17 | 4.01E–13 |
| Link=bzip* | 17 | 9 | 35 | 7.58E−18 | 8.62E–15 |
| GO=transcription* | 90 | 1239 | 627 | 7.43E−31 | 2.36E–26 |
| GO=chromatin* or | 21 | 213 | 139 | 4.12E−10 | 4.08E–07 |
| KW=chromatin* | |||||
| GO:0003700 | 36 | 506 | 264 | 3.55E−12 | 4.65E–10 |
| (transcription factor | |||||
| activity) | |||||
| GO:0008270 | 9 | 164 | 74 | 3.32E−03 | 4.20E–03 |
| (zinc ion binding) | |||||
| GO:0006355 | 17 | 253 | 131 | 4.60E−06 | 4.11E–05 |
| (Regulation of transcription, | |||||
| DNA-dependent) |
aSearch terms which have been used to retrieve the KEPE sequences.
bIn Swiss-Prot the annotation ‘cytoplasm’ is used (incorrectly) as a synonym for ‘cytosol’.
cNumber of sequences matching the KEPE pattern [MLIVTF]K.EP.{1,4}[DE] used in combination with the various search terms.
dTotal number of sequences (as obtained with the SIRW search tool, in Swiss-Prot release 54.7) matching the search terms shown in the left column. (H): all human sequences; (S): human sequences with the SUMO motif [VILMTF]K.E.
eThe P-value for the relative enrichment was calculated by the Fisher's exact test from the R package (for total ‘H’ and total ‘S’).
Fig. 2.Conservation score distributions for the KEPE motif and the six permutations comparing the nuclear and cytoplasmic compartments. KEPE bearing proteins were retrieved from UniProt/Swiss-Prot with the compartment keyword expressions in Table 1, processed for IUPred disorder prediction and evaluated for the ELM CS score. To enable comparison, the sets have been normalized into percentages and sorted into five CS score bins. The table in (A) shows that the KEPE matches have a significantly different conservation distribution in the nucleus compared with the controls. n=number of instances, D=maximum difference between the cumulative distributions, P-value=significance of the difference D, according to the KS test. KEPE matches also show a peak of strong conservation (unmatched by the controls) in the nucleus (B) but not in the cytoplasm (C). These results are consistent with a lack of functionality of the KEPE motif in the cytosol (since this is the implied meaning of ‘cytoplasm’ in Swiss-Prot).