| Literature DB >> 28672922 |
Qian-Song Chen1, Dan Wang1, Bao-Lian Liu2, Shu-Feng Gao1, Dan-Li Gao1, Gui-Rong Li1.
Abstract
The aim of the present study was to investigate key genes in fibroids based on the multiple affinity propogation-Krzanowski and Lai (mAP-KL) method, which included the maxT multiple hypothesis, Krzanowski and Lai (KL) cluster quality index, affinity propagation (AP) clustering algorithm and mutual information network (MIN) constructed by the context likelihood of relatedness (CLR) algorithm. In order to achieve this goal, mAP-KL was initially implemented to investigate exemplars in fibroid, and the maxT function was employed to rank the genes of training and test sets, and the top 200 genes were obtained for further study. In addition, the KL cluster index was applied to determine the quantity of clusters and the AP clustering algorithm was conducted to identify the clusters and their exemplars. Subsequently, the support vector machine (SVM) model was selected to evaluate the classification performance of mAP-KL. Finally, topological properties (degree, closeness, betweenness and transitivity) of exemplars in MIN constructed according to the CLR algorithm were assessed to investigate key genes in fibroid. The SVM model validated that the classification between normal controls and fibroid patients by mAP-KL had a good performance. A total of 9 clusters and exemplars were identified based on mAP-KL, which were comprised of CALCOCO2, COL4A2, COPS8, SNCG, PA2G4, C17orf70, MARK3, BTNL3 and TBC1D13. By accessing the topological analysis for exemplars in MIN, SNCG and COL4A2 were identified as the two most significant genes of four types of methods, and they were denoted as key genes in the progress of fibroid. In conclusion, two key genes (SNCG and COL4A2) and 9 exemplars were successfully investigated, and these may be potential biomarkers for the detection and treatment of fibroid.Entities:
Keywords: affinity propagation; cluster; fibroid; gene; mutual information network
Year: 2017 PMID: 28672922 PMCID: PMC5488419 DOI: 10.3892/etm.2017.4481
Source DB: PubMed Journal: Exp Ther Med ISSN: 1792-0981 Impact factor: 2.447
Figure 1.Preprocessing for microarray data by mean centering, z-score, quantile and cyclic loess methods.
Top 100 genes based on maxT multiple hypothesis testing.
| Number | Gene |
|---|---|
| 1 | |
| 2 | |
| 3 | |
| 4 | |
| 5 | |
| 6 | |
| 7 | |
| 8 | |
| 9 | |
| 10 | |
| 11 | |
| 12 | |
| 13 | |
| 14 | |
| 15 | |
| 16 | |
| 17 | |
| 18 | |
| 19 | |
| 20 | |
| 21 | |
| 22 | |
| 23 | |
| 24 | |
| 25 | |
| 26 | |
| 27 | |
| 28 | |
| 29 | |
| 30 | |
| 31 | |
| 32 | |
| 33 | |
| 34 | |
| 35 | |
| 36 | |
| 37 | |
| 38 | |
| 39 | |
| 40 | |
| 41 | |
| 42 | |
| 43 | |
| 44 | |
| 45 | |
| 46 | |
| 47 | |
| 48 | |
| 49 | |
| 50 | |
| 51 | |
| 52 | |
| 53 | |
| 54 | |
| 55 | |
| 56 | |
| 57 | |
| 58 | |
| 59 | |
| 60 | |
| 61 | |
| 62 | |
| 63 | |
| 64 | |
| 65 | |
| 66 | |
| 67 | |
| 68 | |
| 69 | |
| 70 | |
| 71 | |
| 72 | |
| 73 | |
| 74 | |
| 75 | |
| 76 | |
| 77 | |
| 78 | |
| 79 | |
| 80 | |
| 81 | |
| 82 | |
| 83 | |
| 84 | |
| 85 | |
| 86 | |
| 87 | |
| 88 | |
| 89 | |
| 90 | |
| 91 | |
| 92 | |
| 93 | |
| 94 | |
| 95 | |
| 96 | |
| 97 | |
| 98 | |
| 99 | |
| 100 |
Clusters identified by the mAP-KL method for fibroids.
| Cluster | No. of genes | Genes |
|---|---|---|
| 1 | 31 | |
| 2 | 12 | |
| 3 | 21 | |
| 4 | 18 | |
| 5 | 33 | |
| 6 | 13 | |
| 7 | 20 | |
| 8 | 22 | |
| 9 | 18 |
Figure 2.Mutual information network for clusters related the top 200 genes of fibroid. There were 174 nodes and 820 edges, where nodes represented genes, and edges were the interactions between two genes. The yellow nodes, CALCOCO2, COL4A2, COPS8, SNCG, PA2G4, C17orf70, MARK3, BTNL3 and TBC1D13, stood for exemplars of nine clusters identified via the multiple affinity propogation-Krzanowski and Lai method.
Topological properties of exemplars in mutual information network.
| Exemplar | Degree | Closeness | Betweenness | Transitivity |
|---|---|---|---|---|
| 104 | 174.81 | 3.54 | 1022 | |
| 110 | 237.21 | 3.71 | 1075 | |
| 88 | 147.27 | 3.29 | 657 | |
| 127 | 263.29 | 3.89 | 1122 | |
| 91 | 152.57 | 1.83 | 35 | |
| 85 | 155.03 | 0.50 | 0 | |
| 93 | 149.75 | 3.05 | 44 | |
| 87 | 131.67 | 3.01 | 0 | |
| 90 | 158.98 | 3.55 | 897 |
Figure 3.Sub-network for key genes (SNCG and COL4A2) extracted from the mutual information network, in which a total of 149 nodes and 222 edges were mapped. Nodes represented genes, and edges were the interactions between two genes. The yellow nodes were key genes of fibroid.
Figure 4.MIN for the top 200 ranked genes in the microarray data. There were 174 nodes and 1,002 interactions, where nodes represented genes, and edges were the interactions between two genes. The pink nodes were hub genes with top 5% degree distribution of the MIN. MIN, mutual information network.