| Literature DB >> 34181334 |
Saeedeh Pourahmad1,2, Somayeh Foroozani2, Mehdi Nourelahi3, Ahmad Hosseini4, Mahboobeh Razmkhah4.
Abstract
BACKGROUND: Comparison of gene expression algorithms may be beneficial for obtaining disease pattern or grouping patients based on the gene expression profile. The current study aimed to investigate whether the knowledge within these data is able to group the ovarian cancer patients with similar disease pattern.Entities:
Keywords: Clustering methods; Gene expression; Ovarian Cancer; Prognosis
Mesh:
Year: 2021 PMID: 34181334 PMCID: PMC8418829 DOI: 10.31557/APJCP.2021.22.6.1781
Source DB: PubMed Journal: Asian Pac J Cancer Prev ISSN: 1513-7368
Demographic Data for the Available Information of 37 Women with Ovarian Cancer
| Variable | The categories | no.(%) | Mean (SD)* |
|---|---|---|---|
| Stage | Stage 1 | 22 (59.5) | - |
| Stage 2 | 12 (32.4) | ||
| Stage 3 | 1 (2.7) | ||
| Stage 4 | 2 (5.4) | ||
| Missing | 0 (0) | ||
| Age at diagnosis | Between 20-30 | 10 (27.1) | 25.5 (3.171) |
| Between 31-40 | 6 (16.2) | 34.33 (2.944) | |
| Between 41-50 | 7 (18.9) | 42 (1.414) | |
| Between 51-60 | 9 (24.3) | 55.11 (3.060) | |
| Above 61 | 4 (10.8) | 68.5 (9.469) | |
| Missing | 1 (2.7) | - | |
| Age at marriage | Not Married | 12 (32.4) | 25 (45.227) |
| Below 15 | 3 (8.1) | 14.33 (0.577) | |
| Between 16-20 | 13 (35.2) | 17.69 (1.251) | |
| Between 21-36 | 6 (16.2) | 29 (5.060) | |
| Missing | 3 (8.1) | - | |
| Prognosis | Favourable (live five years after diagnosis) | 20 (54.1) | - |
| Unfavourable (dead) | 17 (45.9) | - |
Mean Value and Standard Deviation of PCR Real-Time Gene Expression Values for All Cases
| Genes’ name (abbreviated) | Full Gene name | Mean (SD) |
|---|---|---|
| IL-4 | Interleukin-4 | 0.279 (1.092) |
| IL-6 | Interleukin-6 | 0.025 (0.089) |
| IL-10 | Interleukin-10 | 0.012 (0.039) |
| IL-12b | Interleukin-12 | 0.064 (0.310) |
| IL-17 | Interleukin-17 | 0.005 (0.012) |
| IL-23 | Interleukin-23 | 0.595 (3.040) |
| IL-27 | Interleukin-27 | 0.006 (0.023) |
| FoxP3 | Forkhead box P3 | 0.028 (0.129) |
| CTLA-4 | Cytotoxic T-lymphocyte-associated Protein 4 | 0.010 (0.020) |
| TGF-β1 | Transforming Growth Factor Beta 1 | 1.761 (3.402) |
| IFN-γ | Interferon Gamma | 0.541 (2.934) |
| BCL-2 | B-cell lymphoma 2 | 0.899 (3.537) |
| Fas | - | 0.143 (0.606) |
| FasL | Fas ligand | 0.014 (0.056) |
| Her2 | Human Epidermal Growth Factor Receptor 2 | 0.016 (0.047) |
| MDM2 | Mouse Double Minute 2 Homolog | 0.025 (0.055) |
| Oct4 | Octamer Binding Transcription Factor 4 | 0.215 (0.721) |
| P53 | - | 0.306 (1.225) |
| SDF-1 | Stromal Cell-derived Factor 1 | 0.014 (0.043) |
| Survivin | - | 0.034 (0.128) |
The Selected Subsets of 20 Genes Using Two Feature Selection Methods and the Result of Genes’ Clustering by DBscan Method
| The selected genes using pearson correlation coefficient (subset 1)* | The selected genes using pearson correlation coefficient (subset 2)* | The selected genes | Genes’ clustering using | ||
|---|---|---|---|---|---|
| Cluster 1 | Cluster 2 | ||||
| IL-4 | IL-4 | IL-4 | TGF-β1 | IL-4 | BCL-2 |
| IL-23 | IL-10 | FoxP3 | IL-6 | Fas | |
| IL-27 | IL-12b | IFN-γ | IL-10 | FasL | |
| CTLA-4 | IL-17 | BCL-2 | IL-12 | Her2 | |
| TGF-β1 | IL-27 | Oct4 | IL-17 | MDM2 | |
| Fas | CTLA-4 | Survivin | IL-23 | Oct4 | |
| Her2 | TGF-β1 | IL-27 | P53 | ||
| MDM2 | FasL | FoxP3 | SDF1 | ||
| P53 | Her2 | CTLA-4 | Survivin | ||
| Survivin | Oct4 | IFN-γ | |||
| SDF1 | IFN-γ | ||||
* The value 0.85 considered for dividing the correlated genes in two different subsets
Figure 1Determining the Number of Optimal Clusters Based on Silhouette Mean Value in k-means Clustering Method
The Results of Patients’ Clustering by Four Methods for Two Clusters
| Clustering method | Feature selection method | Number of genes in analysis | The Silhoute mean values | RPT criteria | The correct | |
|---|---|---|---|---|---|---|
| Favourable | Unfavourable | |||||
| k-means | All genes | 20 | 0.824 | 1.433 | 62.16 | 37.84 |
| Laplacian score | 6 | 0.881 | 1.495 | 56.76 | 43.24 | |
| pearson correlation coefficient (subset 1) | 10 | 0.721 | 1.232 | 54.05 | 45.95 | |
| pearson correlation coefficient (subset 2) | 11 | 0.784 | 1.331 | 54.05 | 45.95 | |
| Hierarchical | All genes | 20 | 0.824 | 1.433 | 62.16 | 37.84 |
| Laplacian score | 6 | 0.881 | 1.495 | 56.76 | 43.24 | |
| pearson correlation coefficient (subset 1) | 10 | 0.82 | 1.433 | 62.16 | 37.84 | |
| pearson correlation coefficient (subset 2) | 11 | 0.778 | 1.293 | 59.46 | 40.54 | |
| DBscan | "All genes ( Ɛ=3 , Minpts=3)" | 20 | 0.703 | 1.062 | 54.05 | 45.95 |
| "Laplacian score ( Ɛ=1.75 , Minpts=3)" | 6 | 0.809 | 1.168 | 54.05 | 45.95 | |
| "pearson correlation coefficient (subset 1) ( Ɛ=2 , Minpts=3)" | 10 | 0.74 | 1.162 | 56.76 | 43.24 | |
| "pearson correlation coefficient (subset 2) ( Ɛ=2 , Minpts=3)" | 11 | 0.663 | 1.029 | 54.05 | 45.95 | |
| EM | All genes | 20 | -0.131 | -0.292 | 51.35 | 48.65 |
| Laplacian score | 6 | 0.324 | 0.994 | 54.05 | 45.95 | |
| pearson correlation coefficient (subset 1) | 10 | 0.338 | 0.536 | 48.65 | 51.35 | |
| pearson correlation coefficient (subset 2) | 11 | 0.712 | 0.543 | 48.65 | 51.35 | |