| Literature DB >> 33879268 |
Suyan Tian1, Chi Wang2,3.
Abstract
BACKGROUND: Psoriasis is an immune-mediated, inflammatory disorder of the skin with chronic inflammation and hyper-proliferation of the epidermis. Since psoriasis has genetic components and the diseased tissue of psoriasis is very easily accessible, it is natural to use high-throughput technologies to characterize psoriasis and thus seek targeted therapies. Transcriptional profiles change correspondingly after an intervention. Unlike cross-sectional gene expression data, longitudinal gene expression data can capture the dynamic changes and thus facilitate causal inference.Entities:
Keywords: Integrative clustering (iCluster); Long non-coding RNAs (lncRNAs); Longitudinal data; Psoriasis
Mesh:
Substances:
Year: 2021 PMID: 33879268 PMCID: PMC8056592 DOI: 10.1186/s40246-021-00323-6
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
The 20-lncRNA list identified by the iCluster ensemble
| Gene symbol | Score used to evaluate the importance of a specific genea | Biological relevance (confidence score) |
|---|---|---|
| LINC00936 | 11.07 | |
| FAM13A-AS1 | 9.23 | |
| PSMA3-AS1 | 6.49 | |
| LINC00173 | 4.78 | |
| PCED1B-AS1 | 4.73 | I (0.91) |
| SCARNA9 | 3.87 | I (0.18) |
| TMEM99 | 3.03 | I (0.02) |
| H19 | 2.89 | D (0.18) |
| LINC00640 | 2.65 | |
| PAXIP1-AS1 | 2.63 | |
| MAPKAPK5-AS1 | 2.02 | I (0.11) |
| DICER1-AS1 | 1.92 | I (0.04) |
| MIR600HG | 1.9 | |
| SNHG7 | 1.82 | I (0.1) |
| ZMIZ1-AS1 | 1.79 | I (0.08) |
| TMCC1-AS1 | 1.75 | |
| CD27-AS1 | 1.7 | D (0.71) |
| FRMD6-AS1 | 1.62 | |
| TRHDE-AS1 | 1.59 | |
| URB1-AS1 | 1.56 |
Note: D, directly related to psoriasis according to the GeneCards database; I, indirectly related to psoriasis according to the GeneCards database. The confidence scores are indicative of how much evidence supports the biological relevance, with a higher value corresponding to a stronger support
aA self-defined score is calculated as the sum of absolute weights over the replicates of iCluster modeling, which is used to determine if a certain gene should be selected in the final model. For more details, please refer to the “Materials and Methods” section
Fig. 1Heatmap of identified 20 lncRNAs expression profiles. a Heatmap of the average expression values across four time points for the 20 lncRNAs. b Heatmap of actual expression values at individual time points for the 20 lncRNAs. Type: response status, corresponding to the responder group and the non-responder group that were colored by red and green, respectively. From plot A, it is observed that the responders and non-responders may be clustered into several separate communities by the hierarchical clustering method. If the number of clusters was set at 2, eleven patients were misclassified, resulting in an unsatisfactory performance. In plot B, the similar pattern has been observed. Of note, the expression values have been normalized to have a mean of 0 and a variance of 1, for a clearer representation of these 20 lncRNAs. The color bars on the right side indicate the ranges of gene expression values, with red for high expression values and blue for low expression values
Comparison between iCluster-ensemble and competing methods
| Method | Are feature selection and classifier construction separate | Size | Predictive error |
|---|---|---|---|
| iCluster-ensemble | Yes | 20 | 20% |
| iCluster | Yes | 20 | 46.47% |
| GEE-based screening | Yes | 20 | 33.33% |
| EDGE | Yes | 27 | 43.33% |
| LASSO | No | 0.43a | 46.67% |
| LASSO-ensemble | Yes | 20 | 26.67% |
aSince LASSO builds up the final model simultaneously with feature selection, the sizes of final model differ in single LOO runs. Here, the average of the sizes over resulting 30 LASSO models is given. Predictive error corresponds to the leave-one-out error (LOO) rate
Fig. 2Change trajectories of 12 lncRNAs’ expression values over time stratified by the response status. In these plots, x-axis represents the time points measured, and y-axis represents the expression values. The averages of certain lncRNAs at separate time points were connected to a curve to represent their respective change trajectories over time. It is observed that for the responder group, most genes’ expression values moved back toward the expression values of non-lesional skins. This pattern became very apparent at week 4 and later. For the non-responder group, this pattern is much less apparent. NR, the non-responder group; R, the responder group; NL, non-lesional skin at the baseline; LS, lesional skin at the baseline; 1, lesional skin at week 1; 2, lesional skin at week 2; 4, lesional skin at week 4; 16, lesional skin at week 16
Enriched pathways by target mRNAs of the 20-lncRNA list
| ID | Description | Observed gene count | Background gene count | False discovery rate |
|---|---|---|---|---|
| GO: cellular component | ||||
| GO:0005839 | Proteasome core complex | 7 | 21 | <0.0001 |
| GO:0019773 | Proteasome core complex, alpha-subunit complex | 6 | 8 | <0.0001 |
| GO:0000796 | Condensin complex | 5 | 7 | <0.0001 |
| GO:0000799 | Nuclear condensin complex | 2 | 3 | 0.0033 |
| GO:1904813 | Ficolin-1-rich granule lumen | 4 | 125 | 0.0213 |
| GO:0043232 | Intracellular non-membrane-bounded organelle | 24 | 4005 | 0.0302 |
| GO:0044444 | Cytoplasmic part | 43 | 9377 | 0.0302 |
| GO:0000932 | P-body | 3 | 81 | 0.0475 |
| GO:0005737 | Cytoplasm | 48 | 11238 | 0.0475 |
| GO: molecular function | ||||
| GO:0004298 | Threonine-type endopeptidase activity | 7 | 21 | <0.0001 |
| GO:0005031 | Tumor necrosis factor-activated receptor activity | 3 | 25 | 0.0092 |
| GO:0070011 | Peptidase activity, acting on | 8 | 603 | 0.0362 |
| GO: biological process | ||||
| GO:0007076 | Mitotic chromosome condensation | 5 | 15 | <0.0001 |
| GO:0010032 | Meiotic chromosome condensation | 4 | 5 | <0.0001 |
| GO:0016579 | Protein deubiquitination | 7 | 275 | 0.0099 |
| GO:0006323 | DNA packaging | 6 | 195 | 0.0111 |
| GO:0043687 | Post-translational protein modification | 7 | 365 | 0.0253 |
| KEGG pathway | ||||
| hsa03050 | Proteasome | 7 | 43 | <0.0001 |