| Literature DB >> 19432952 |
Yvonne J K Edwards1, Anna E Lobley, Melissa M Pentony, David T Jones.
Abstract
BACKGROUND: Disordered proteins need to be expressed to carry out specified functions; however, their accumulation in the cell can potentially cause major problems through protein misfolding and aggregation. Gene expression levels, mRNA decay rates, microRNA (miRNA) targeting and ubiquitination have critical roles in the degradation and disposal of human proteins and transcripts. Here, we describe a study examining these features to gain insights into the regulation of disordered proteins.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19432952 PMCID: PMC2718516 DOI: 10.1186/gb-2009-10-5-r50
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Properties of highly ordered and highly disordered proteins. (a) Box-plot distributions of the average expression levels for the transcripts encoding the highly ordered and the highly disordered proteins. (b) Box-plot of mRNA decay rates for the highly ordered and highly disordered proteins. (c) Box-plot of protein stability values. (d) The percentage of transcripts likely to be regulated by miRNA (y-axis) for the transcripts encoding the highly ordered and the highly disordered proteins. (e) The percentage of the proteins with one or more predicted ubiquitination sites (principal y-axis, burgundy bar chart) in the highly ordered and the highly disordered datasets; and the percentage of residues predicted as ubiquitination sites (secondary y-axis, navy line plot) versus different amounts of disorder.
Bioinformatics analysis of expression of human genes across 207 samples from 75 different types of normal tissues and cell lines
| Dataset | Description | Samples | Cel file sample replicates | References |
| [GEO:GSE1133] | Normal tissues and cell lines | 144 | 72 × 2 | [ |
| [GEO:GSE2361] | Normal human tissues | 36 | 36 × 1 | [ |
| [GEO:GSE2004] | Normal spleen | 22 | 3 × 3 (spleen) | - |
| liver and kidney | 2 × 3 (liver) | |||
| 1 × 3 (liver) | ||||
| 1 × 4 (kidney) | ||||
| [GEO:GSE781] | Normal kidney samples | 5 | 1 × 5 | [ |
| Total | 207 | 75 |
Figure 2Correlation of features with percentage of disorder in the proteome. (a) Variation in absolute transcript expression as the percentage of disorder increases in the proteome (yellow bars). The bar charts represent the average sample expression for the groups of transcripts separated according to the percentage range (x-axis) of the total disordered residues in the encoded proteins. The y-axis scale represents log2 absolute expression. Expression levels for the transcripts with MF and BP GO terms at level 4 are shown as light green and dark green bars, respectively. (b) Variation of mRNA decay rate as disorder increases in the proteome. mRNA decay rates versus the percentage bins of disordered residues are shown. (c) Variation of protein stability as disorder increases in the proteome. The stability index versus the percentage bins of disordered residues are shown. (d) The proportion of protein coding transcripts targeted by miRNA (y-axis) as the percentage of disorder increases in the proteome. The datasets for the transcripts encoding the disordered proteins (burgundy) and ordered proteins (mauve) and the proteome (yellow) are shown. (e) The percentage of the proteins with one or more predicted ubiquitination sites against the percentage of disorder (principal y-axis, bar charts); and the percentage of residues predicted as ubiquitination sites against the percentage of disorder (secondary y-axis, line plots). The transcripts encoding the disordered proteins, the ordered proteins and the proteome are shown in burgundy, mauve and yellow (respectively).
Percentage of transcripts encoding disordered proteins predicted to be targeted by miRNA
| Total* | Unique† | Match‡ | Percentage§ | |
| Highly disordered | 877 | 827 | 257 | 31.08 |
| Highly ordered | 5,693 | 5,351 | 782 | 14.61 |
| Disordered | 15,095 | 14,282 | 5,056 | 35.40 |
| Ordered | 18,774 | 17,766 | 3,433 | 19.32 |
| All proteins | 33,869 | 32,010 | 8,468 | 26.45 |
| Disordered | ||||
| [0,20] | 4,271 | 4,055 | 1,402 | 34.57 |
| (20,40] | 6,957 | 6,603 | 2,300 | 34.83 |
| (40,60] | 3,036 | 2,866 | 1,119 | 39.04 |
| (60,80] | 679 | 644 | 233 | 36.18 |
| (80,100] | 152 | 143 | 20 | 13.99 |
| Total | 15,095 | 14,311 | 5,074 | 35.45 |
| Ordered | ||||
| [0,20] | 16,341 | 15,503 | 3,037 | 19.59 |
| (20,40] | 2,173 | 2,024 | 362 | 17.89 |
| (40,60] | 214 | 207 | 35 | 16.91 |
| (60,80] | 33 | 31 | 4 | 12.9 |
| (80,100] | 13 | 9 | 0 | 0 |
| Total | 18,774 | 17,774 | 3,438 | 19.34 |
| Proteome | ||||
| [0,20] | 20,612 | 19,536 | 4,429 | 22.67 |
| (20,40] | 9,130 | 8,618 | 2,658 | 30.84 |
| (40,60] | 3,250 | 3,073 | 1,154 | 37.55 |
| (60,80] | 712 | 675 | 237 | 35.11 |
| (80,100] | 165 | 152 | 20 | 13.16 |
| Total | 33,869 | 32,010 | 8,468 | 26.45 |
For each data set, the *total number of transcripts encoding proteins and the †number of unique protein sequences encoded by transcripts are given. ‡A match occurs when a transcript of a protein sequence matches an mRNA targeted by a miRNA. §The percentage calculations are described in the Materials and methods. Values according to the category of disorder (Figures 1c, 2c) and the percentages of disordered residues (Figure 3c) are given.
Figure 3A summary of expression profiles for the highly disordered proteins. (a) The heat map displays four distinct transcript groups; constitutively expressed ribosomal subunits (light blue), high expressors (dark blue), medium expressors (green) and tissue specific expressors (gold). The clustering method was Ward's hierarchical clustering using Euclidean distances calculated over the absolute expression data matrix. Red colors indicate significantly high expression values (P < 0.001) within a sample tissue or cell line. (b). Summary of expression-function trends for highly disordered transcripts. Log10 of the number of tissues in which the transcript is expressed (x-axis); log10 expression of the average magnitude of expression within each tissue (y-axis). The points have been jittered for overlap using a normally distributed noise value of 0.05 on the log10 scale.
miRNA targeting of disordered proteins with different gene expression profiles (Figure 4)
| Expressor type | Total transcripts (frequency value) | Percentage of transcripts with different expression profiles | Transcripts with miRNA (frequency value) | Transcripts with no miRNA (frequency value) | Transcripts with miRNA (%) |
| Tissue specific | 50 (47) | 19.31 | 32 | 15 | 68.09 |
| High | 43 (41) | 16.60 | 27 | 14 | 65.85 |
| Medium | 31 (31) | 11.97 | 15 | 16 | 48.39 |
| Constitutive | 4 (1) | 1.54 | 0 | 1 | 0 |
| Transient or low | 131 (129) | 50.58 | 62 | 67 | 48.06 |
| Total | 259 |
Subsets of GO terms (biological process, molecular function and cellular component) over-represented for co-regulated transcripts encoding highly disordered proteins
| Term | Description | Disorder (60,80]% | Disorder (80,100]% |
| [GO:0005769] | Early endosome | Down | Down |
| [GO:0005770] | Late endosome | Down | Down |
| [GO:0005838] | Proteasome regulatory particle | Down | Down |
| [GO:0016272] | Prefoldin complex | Down | |
| [GO:0031371] | Ubiquitin conjugating enzyme complex | Down | |
| [GO:0000145] | Exocyst | Down | |
| [GO:0000502] | Proteasome complex | Down | |
| [GO:0032991] | Macromolecular complex | Up | |
| [GO:0043234] | Protein complex | Up | |
| [GO:0019872] | Small conjugating protein ligase activity | Up | |
| [GO:0042803] | Protein homodimerization activity | Up | |
| [GO:0051131] | Chaperone-mediated protein complex assembly | Up | |
| [GO:0008639] | Small protein conjugating enzyme activity | Up | |
| [GO:0004842] | Ubiquitin-protein ligase activity | Up | |
| [GO:0016874] | Ligase activity | Up | |
| [GO:0006512] | Ubiquitin cycle | Up | |
| [GO:0004869] | Cysteine protease inhibitor activity | Up | Up |
| [GO:0004866] | Endopeptidase inhibitor activity | Up | Up |
| [GO:0030414] | Protease inhibitor activity | Up | Up |
| [GO:0051082] | Unfolded protein binding | Up | Up |
| [GO:0046983] | Protein dimerization activity | Up | Up |
| [GO:0051291] | Protein hetero-oligomerization | Up | |
| [GO:0007032] | Endosome organization and biogenesis | Up | |
| [GO:0006983] | ER overload response | Up | |
| [GO:0051087] | Chaperone binding | Up | |
| [GO:0031579] | Lipid raft organization and biogenesis | Up | |
| [GO:0016235] | Aggresome | Up | |
| [GO:0016234] | Inclusion body | Up | |
| [GO:0016926] | Protein desumoylation | Up | |
| [GO:0008581] | Ubiquitin specific protease 5 activity | Up | |
| [GO:0006622] | Protein targeting to lysosome | Up | |
| [GO:0019783] | Small conjugating protein-specific protease activity | Down | |
| [GO:0008219] | Cell death | Down | |
| [GO:0007049] | Cell death | Down | |
| [GO:0051603] | Proteolysis involved in cellular protein catabolic process | Down | Down |
| [GO:0004221] | Ubiquitin thiolesterase activity | Down | Down |
| [GO:0016197] | Endosome transport | Down | Down |
| [GO:0016874] | Ligase activity | Down | Down |
| [GO:0004843] | Ubiquitin-specific protease activity | Down | Down |
| [GO:0051082] | Unfolded protein binding | Down | Down |
| [GO:0000209] | Protein polyubiquitination | Down | Down |
| [GO:0006511] | Ubiquitin-dependent protein catabolic process | Down | |
| [GO:0006512] | Ubiquitin cycle | Down | |
| [GO:0051087] | Chaperone binding | Down | |
| [GO:0030968] | Unfolded protein response | Down | |
| [GO:0030100] | Regulation of endocytosis | Down | |
| [GO:0043488] | Regulation of mRNA stability | Down | |
| [GO:0031396] | Regulation of protein ubiquitination | Down |
Up, up-regulation; down, down-regulation.
Figure 4Summary of transcripts encoding highly disordered proteins as putative miRNA targets associated with expression profiles. (a) The percentage of the transcripts as predicted targets of miRNA (y-axis) versus the different datasets (x-axis) that comprise transcripts with different patterns of gene expression (Table 3). The error bars represent the confidence in the percent value according to different sample sizes for the different groups. (b) The log10 odds-ratio (y-axis) discriminates categories as under- and over-represented in relation to being a predicted miRNA target.