| Literature DB >> 24416320 |
Mikhail Pyatnitskiy1, Ilya Mazo2, Maria Shkrob3, Elena Schwartz2, Ekaterina Kotelnikova4.
Abstract
One of the main challenges in modern medicine is to stratify different patient groups in terms of underlying disease molecular mechanisms as to develop more personalized approach to therapy. Here we propose novel method for disease subtyping based on analysis of activated expression regulators on a sample-by-sample basis. Our approach relies on Sub-Network Enrichment Analysis algorithm (SNEA) which identifies gene subnetworks with significant concordant changes in expression between two conditions. Subnetwork consists of central regulator and downstream genes connected by relations extracted from global literature-extracted regulation database. Regulators found in each patient separately are clustered together and assigned activity scores which are used for final patients grouping. We show that our approach performs well compared to other related methods and at the same time provides researchers with complementary level of understanding of pathway-level biology behind a disease by identification of significant expression regulators. We have observed the reasonable grouping of neuromuscular disorders (triggered by structural damage vs triggered by unknown mechanisms), that was not revealed using standard expression profile clustering. For another experiment we were able to suggest the clusters of regulators, responsible for colorectal carcinoma vs adenoma discrimination and identify frequently genetically changed regulators that could be of specific importance for the individual characteristics of cancer development. Proposed approach can be regarded as biologically meaningful feature selection, reducing tens of thousands of genes down to dozens of clusters of regulators. Obtained clusters of regulators make possible to generate valuable biological hypotheses about molecular mechanisms related to a clinical outcome for individual patient.Entities:
Mesh:
Year: 2014 PMID: 24416320 PMCID: PMC3887006 DOI: 10.1371/journal.pone.0084955
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Overall pipeline of the proposed approach for disease subtyping.
See corresponding section for detailed description.
Figure 2Heatmap of activity scores (k-values) for clusters of regulators identified in GSE4183 dataset.
Samples are in columns, clusters of regulators are in rows. Horizontal side bar color encodes true class labels.
Identified significant clusters of regulators discriminating between adenoma, carcinoma and inflammation.
| Cluster of regulators | Number of regulators in cluster | Inflammation vs adenoma, AUC | Carcinoma vs adenoma, AUC | Inflammation vs carcinoma, AUC |
| cluster #10 | 52 | 0.991 | 0.742 | 0.707 |
| cluster #17 | 32 | 0.991 | – | 0.947 |
| cluster #15 | 57 | 0.938 | – | 0.787 |
| cluster #27 | 13 | 0.893 | – | 0.778 |
| cluster #13 | 86 | 0.947 | – | 0.707 |
| cluster #7 | 54 | 0.884 | 0.760 | – |
| cluster #3 | 82 | 0.991 | – | 0.556 |
| cluster #4 | 29 | 0.769 | – | 0.689 |
| cluster #28 | 16 | 0.796 | – | 0.636 |
| cluster #1 | 181 | 0.769 | – | 0.340 |
| cluster #9 | 39 | 0.876 | – | – |
| cluster #24 | 28 | 0.867 | – | – |
| cluster #21 | 37 | 0.813 | – | – |
| cluster #11 | 45 | 0.813 | – | – |
| cluster #6 | 111 | 0.742 | – | – |
| cluster #2 | 32 | 0.662 | – | – |
| cluster #5 | 23 | 0.422 | – | – |
Figure 3Comparison of clustering of 12 diseases of human muscle.
A) Dendrogram obtained using proposed approach based on analysis of regulators activity. B) Dendrogram obtained using Ward's method for clustering gene expression data.