| Literature DB >> 30308971 |
Gustav Arvidsson1, Anthony P H Wright2.
Abstract
Conformational protein properties are coupled to protein functionality and could provide a useful parameter for functional annotation of differentially expressed genes in transcriptome studies. The aim was to determine whether predicted intrinsic protein disorder was differentially associated with proteins encoded by genes that are differentially regulated in lymphoma cells upon interaction with stromal cells, an interaction that occurs in microenvironments, such as lymph nodes that are protective for lymphoma cells during chemotherapy. Intrinsic disorder protein properties were extracted from the Database of Disordered Protein Prediction (D²P²), which contains data from nine intrinsic disorder predictors. Proteins encoded by differentially regulated cell-adhesion regulated genes were enriched in intrinsically disordered regions (IDRs) compared to other genes both with regard to IDR number and length. The enrichment was further ascribed to down-regulated genes. Consistently, a higher proportion of proteins encoded by down-regulated genes contained at least one IDR or were completely disordered. We conclude that down-regulated genes in stromal cell-adherent lymphoma cells encode proteins that are characterized by elevated levels of intrinsically disordered conformation, indicating the importance of down-regulating functional mechanisms associated with intrinsically disordered proteins in these cells. Further, the approach provides a generally applicable and complementary alternative to classification of differentially regulated genes using gene ontology or pathway enrichment analysis.Entities:
Keywords: Microarray; RNA sequencing; differentially regulated genes; functional analysis; gene ontology analysis; intrinsic disorder; intrinsic disorder prediction; intrinsically disordered region; protein conformation; transcriptome
Mesh:
Substances:
Year: 2018 PMID: 30308971 PMCID: PMC6213395 DOI: 10.3390/ijms19103101
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Enrichment of intrinsically disordered regions (IDRs) in proteins encoded by genes that are differentially expressed in lymphoma cells upon adhering to stromal cells. The number (n) of IDRs (≥30 residues) for each predictor is shown as well as how the detected IDRs are distributed in relation to length. The number of IDRs in each size category is shown. The blue line shows the percentage of all IDRs encoded by adhesion-related genes (adsu) and non-adhesion-related genes (nadsu) that are associated with the adsu set, while the red line shows the percentage expected if IDRs are equally distributed between the adsu and nadsu sets.
Intrinsically disordered regions are enriched in proteins encoded by down-regulated genes in lymphoma cells upon adherence to stromal cells.
| Gene Set Comparison | Espritz-D # | Espritz-N # | Espritz-X # | IUPred-L # | IUPred-S # | PrDOS # | PV2 # | VLXT # | VSL2b # |
|---|---|---|---|---|---|---|---|---|---|
| Adsu vs. Nadsu | |||||||||
| IDR number in adsu | 487 | 1638 | 1210 | 1462 | 1517 | 2353 | 2445 | 2361 | 1966 |
| IDR number in nadsu * | 382 | 1036 | 796 | 847 | 892 | 1520 | 1794 | 1492 | 1431 |
| Adjusted | 2.38 × 10−8 | 1.32 × 10−27 | 2.81 × 10−23 | 4.34 × 10−27 | 1.32 × 10−27 | 4.24 × 10−34 | 2.85 × 10−19 | 2.75 × 10−25 | 8.29 × 10−19 |
|
| |||||||||
| IDR number in adsu_down | 276 | 1072 | 760 | 983 | 1025 | 1507 | 1508 | 1572 | 1216 |
| IDR number in adsu_up * | 171 | 458 | 363 | 387 | 397 | 758 | 683 | 639 | 606 |
| Adjusted | 3.51 × 10−78 | <1.00 × 10−99 | <1.00 × 10−99 | <1.00 × 10−99 | <1.00 × 10−99 | <1.00 × 10−99 | <1.00 × 10−99 | <1.00 × 10−99 | <1.00 × 10−99 |
* mean of 1000 resamples of n proteins encoded by genes in nadsu or adsu_up, where n = the number of genes in adsu or adsu_down, respectively. Abbreviations: adsu (adhesion-regulated genes); nadsu (non-adhesion-regulated genes); adsu_down (down-regulated adsu); adsu_up (up-regulated adsu). # Predictors of intrinsic disorder that appear in the D2P2 database.
Intrinsically disordered regions tend to be longer in proteins encoded by down-regulated genes in lymphoma cells upon adherence to stromal cells.
| Gene Set Comparison | Espritz-D | Espritz-N | Espritz-X | IUPred-L | IUPred-S | PrDOS | PV2 | VLXT | VSL2b |
|---|---|---|---|---|---|---|---|---|---|
| Adsu vs. Nadsu | |||||||||
| Median (mean) IDR length (adsu) | 64 (125) | 57 (97) | 66 (98) | 54 (87) | 52 (67) | 61 (98) | 61 (90) | 48 (62) | 74 (128) |
| Median (mean) IDR length (nadsu) | 61 (98) | 56 (91) | 62 (93) | 54 (86) | 51 (68) | 55 (86) | 56 (82) | 47 (61) | 66 (107) |
| Adjusted | 6.04 × 10−2 | 7.33 × 10−2 |
| 6.02 × 10−1 | 5.10 × 10−1 |
|
|
|
|
|
| |||||||||
| Median (mean) IDR length (adsu_down) | 68.5 (154) | 60 (104) | 76 (108) | 56 (93) | 53 (70) | 64 (107) | 65 (96) | 49 (65) | 79.5 (142) |
| Median (mean) IDR length (adsu_up) | 59 (89) | 54 (86) | 58 (83) | 50 (75) | 50 (62) | 58 (85) | 57 (80) | 46 (57) | 67 (104) |
| Adjusted |
|
|
|
|
|
|
|
|
|
* Mann–Whitney test; bold text = p < 0.05. Abbreviations: adsu (adhesion-regulated genes); nadsu (non-adhesion-regulated genes); adsu_down (down-regulated adsu); adsu_up (up-regulated adsu).
Distribution of IDRs predicted by VSL2b in proteins encoded by genes that are differentially regulated in lymphoma cells upon interaction with stromal cells.
| Gene Set Comparison | Number of Proteins | Number (%) of Completely Disordered Proteins | Number (%) of Proteins with IDR | Median Percent IDR Per Protein (All Proteins) | Median Percent IDR Per Protein (IDR-Containing Proteins) |
|---|---|---|---|---|---|
| adsu_down | 445 | 19 (4.3) | 367 (82.5) | 38.1 | 48.1 |
| adsu_up | 556 | 11 (2) | 370 (66.5) | 18.1 | 37.6 |
| nadsu | 17,459 | 476 (2.7) | 11,248 (64.4) | 17.8 | 37.6 |
adsu_down (down-regulated adhesion-regulated genes); adsu_up (up-regulated adhesion-regulated genes); nadsu (non-adhesion-regulated genes).
Figure 2Relative frequency distributions of proportion of IDR per protein and length-normalized number of IDRs per protein for proteins encoded by adsu_down genes in relation to adsu_up and nadsu genes. IDR predictions were made using VSL2b. (A) Relative frequency distributions (Density) of IDR-containing proteins with different percent IDR content. The median position and value are shown in blue. (B) Relative frequency distributions (Density) of numbers of IDRs per IDR-containing protein, normalized for differences in protein length (IDR number per 1000 amino acid residues). The median position and value are shown in blue.
Figure 3Comparison of proteins encoded by down- or up-regulated adhesion-regulated genes with regard to longest IDR length per protein and protein length. IDR-containing proteins encoded by (A) down-regulated adhesion-regulated genes (adsu_down) and (B) up-regulated adhesion-regulated genes (adsu_up) are shown. Of the 14 proteins in (A) for which the maximum IDR length is greater than 1000 residues (above dotted line), 6 proteins (red text) were also found in the sets of 14 proteins with the longest IDRs predicted by the PV2 and PrDOS predictors.
Figure 4Examples of proteins with long IDRs. Proteins that are reproducibly found by the VSL2b, PV2 and PrDOS predictors in the set of 14 proteins with the longest predicted IDRs (red text in Figure 3A) are shown. The residue-by-residue intrinsic disorder score (VSL2b) is plotted as a function of residue number throughout the length of the respective proteins. The horizontal gridline at a score of 0.5 distinguishes regions predicted to be ordered (<0.5) or intrinsically disordered (>0.5).