| Literature DB >> 29286618 |
Pandi M1, Balamurugan R, Sadhasivam N.
Abstract
Objective: A better understanding of functional genomics can be obtained by extracting patterns hidden in gene expression data. This could have paramount implications for cancer diagnosis, gene treatments and other domains. Clustering may reveal natural structures and identify interesting patterns in underlying data. The main objective of this research was to derive a heuristic approach to detection of highly co-expressed genes related to cancer from gene expression data with minimum Mean Squared Error (MSE).Entities:
Keywords: Cancer diagnosis; gene treatments; genomics; gene expression data; Cat Swarm Optimization
Mesh:
Substances:
Year: 2017 PMID: 29286618 PMCID: PMC5980909 DOI: 10.22034/APJCP.2017.18.12.3451
Source DB: PubMed Journal: Asian Pac J Cancer Prev ISSN: 1513-7368
Figure 1Convergence of MCSO-HS and CSO on LeuKaemia Cancer Dataset for 5 Clusters
Figure 2Convergence of MCSO-HS and CSO on Breast Cancer Dataset for 5 Clusters
Figure 3Plot of Number of Clusters Versus FOM Index on Leukaemia Cancer Dataset
Figure 4Plot of Number of Clusters Versus FOM Index on Breast Cancer Dataset
Significant GO Terms for Three Clusters on Breast Cancer Data
| Cluster No. | No. of Genes | Process | Function | Component |
|---|---|---|---|---|
| 3 | 83 | cell cycle process (n=38, p=1.9×10-8) | binding activity (n= 41, p=1.8×10-7) | intracellular organelle (n=62, p=3.3×10-6) |
| 4 | 78 | mitotic cell cycle process (n=42, p=6.5×10-7) | hydrolase activity (n=48, p=3.2×10-6) | cell part (n=58, p=1.9×10-4) |
| 5 | 131 | single-organism process (n=93, p=1.1×10-3) | transferase activity (n=81, p=1.8×10-2) | intracellular part (n=1,344, p=2.9×10-1) |
Figure 5Gene Ontology Biological Process of Breast Cancer Data (10 Genes)
Parameter and its Value for Benchmark Datasets
| Parameter | Value |
|---|---|
| No. of Cats (N) | 100 |
| SMP | 20 |
| SRD | 10 |
| CDC | 20 |
| SPC | 0 or 1 |
| Harmony memory considering rate (HMCR) | 0.9 |
| Pitch Adjustment Rate (PAR) | 0.3 |
| Harmony memory size(HMS) | 100 |
| Number of iteration(NI) | 200 |
| Cluster size | 1 to 15 |
Comparative Analysis on Leukaemia and Breast Cancer Data
| Method | Leukaemia Cancer | Breast Cancer | ||
|---|---|---|---|---|
| Adjusted Rand | FOM | Adjusted Rand | FOM | |
| Genclust random | 0.47 | 57.05 | 0.51 | 57.49 |
| Min kmeans –random | 0.44 | 57.05 | 0.38 | 55.73 |
| Max kmeans-random | 0.49 | 57.05 | 0.51 | 55.73 |
| Cast | 0.529 | 56.66 | 0.68 | 50.21 |
| Kmeans-Avlink | 0.508 | 57.36 | 0.62 | 59.49 |
| Avlink | 0.559 | 58.78 | 0.52 | 62.27 |
| GenClust-Avlink | 0.518 | 57.21 | 0.8 | 59.33 |
| HS | 0.671 | 56.35 | 0.83 | 49.21 |
| CSO | 0.78 | 55.94 | 0.85 | 47.94 |
| MCSO-HS | 0.891 | 54.03 | 0.92 | 45.86 |