| Literature DB >> 20569491 |
Xin Lu1, Ke Zhang, Charles Van Sant, John Coon, Dimitri Semizarov.
Abstract
BACKGROUND: Cancer is a heterogeneous disease caused by genomic aberrations and characterized by significant variability in clinical outcomes and response to therapies. Several subtypes of common cancers have been identified based on alterations of individual cancer genes, such as HER2, EGFR, and others. However, cancer is a complex disease driven by the interaction of multiple genes, so the copy number status of individual genes is not sufficient to define cancer subtypes and predict responses to treatments. A classification based on genome-wide copy number patterns would be better suited for this purpose.Entities:
Mesh:
Year: 2010 PMID: 20569491 PMCID: PMC2901344 DOI: 10.1186/1755-8794-3-23
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Figure 1Workflow of the genomics-based tumor classification procedure.
Cophenetic correlation and BIC of the NSCLC gNMF models under different cluster numbers.
| 3 | 0.8031 | 1032670 |
| 4 | 0.7664 | 992443 |
| 5 | 0.7103 | 1249580 |
| 6 | 0.7166 | 1301055 |
| 7 | 0.7040 | 1301808 |
| 8 | 0.7109 | 1202876 |
a. The number of clusters by gNMF.
Figure 2Heatmap of the NSCLC tumor and cell line CGH data classified into 4 clusters. Each row represents a sample and each column represents a SNP locus. Red, white, and blue colors indicate high, normal, and low copy numbers, respectively. Horizontal black lines separate different subgroups, and vertical spaces separate chromosome 1 to 22. Cell lines representing each cluster are highlighted by green circles.
Number of tumor patients in each of the four NSCLC subtypes and the cell lines selected to represent each subtype.
| Clusters | Number of tumors | Representative Cancer Cell Lines for each subtype |
|---|---|---|
| Cluster A | 19 | HCC827, NCI-H1437, NCI-H1563, NCI-H1568, NCI-H1623, NCI-H1651, NCI-H1693, NCI-H1755, NCI-H1793, NCI-H1838, NCI-H1944, NCI-H1975, NCI-H1993, NCI-H2023, NCI-H2073, NCI-H2085, NCI-H2087, NCI-H2122, NCI-H2126, NCI-H2228, NCI-H2291, NCI-H23, NCI-H2342, NCI-H2347, NCI-H647, NCI-H920, NCI-H969, CLS-54, LX-289, SK-LU-1, H2882, Calu-6, H358, H460 |
| Cluster B | 60 | NCI-H2405, NCI-H522, SK-MES-1, H157, H1819, H2009, H2887, HCC1171, HCC1359, HCC15, HCC193, HCC366, HCC461, HCC515, HCC78, HOP-62, HOP-92, NCI-H266 |
| Cluster C | 42 | A549, Calu-3, NCI-H1734, NCI-H838, HCC95 |
| Cluster D | 124 | |
Figure 3Kaplan-Meier curves of the TTR for clinically annotated samples in the four NSCLC clusters: (a) four clusters considered separately; and (b) clusters 1 and 2 combined.
Figure 4Kaplan-Meier curves of the TTR and OS between the validation samples assigned into the four clusters using all existing tumor and cell lines to represent the clusters: (a) TTR; and (b) OS.
Figure 5Kaplan-Meier curves of the TTR and OS between the validation samples assigned into the four clusters using cell lines to represent the first 3 clusters and tumors to represent the 4th cluster: (a) TTR; and (b) OS.
Performance comparison of the unsupervised clustering algorithms on NSCLC data set.
| Methods | |||
|---|---|---|---|
| our gNMF | 0.001 | 0.250 | 14.24% |
| hierarchical clustering | 0.168 | 0.137 | 23.32% |
| original gNMF | 0.085 | 0.144 | 21.59% |
| K-means | 0.566 | 0.413 | 23.27% |
a The p-values comparing TTR of NSCLC patients classified into different subtypes by each clustering method
b The p-values comparing OS of NSCLC patients classified into different subtypes by each clustering method
c The error rate of 10-fold stability test for each method
Figure 6Heatmap of the CRC tumors and cell lines classified into 5 clusters.
Number of tumor patients in each of the five CRC subtypes and the cell lines selected to represent each subtype.
| Clusters | Number of tumors | Representative Cancer Cell Lines for each subtype |
|---|---|---|
| Cluster A | 0 | HCT-8, LS 174T, SK-CO-1, SW48, DLD-1, HCT-15, HCT116, LoVo, CL-34, CL-40, C170, LS180 |
| Cluster B | 2 | Caco-2, LS1034, LS411N, LS513, NCI-H498, NCI-H747, SW1116, SW1417, SW837, HT-29, SW620, CL-11, CL-14, Colo-678, SW-480 |
| Cluster C | 8 | Colo 320DM, NCI-H508, NCI-H716, SW1463, SW403, SW948, Colo 205, Colo-206F |
| Cluster D | 40 | |
| Cluster E | 51 | |
Figure 7Heatmap of the melanoma tumors and cell lines classified into 6 clusters.
Number of tumor patients in each of the six melanoma subtypes and the cell lines selected to represent each subtype.
| Clusters | Number of tumors | Representative Cancer Cell Lines for each subtype |
|---|---|---|
| Cluster A | 15 | SKMEL119, HS944, WM1366, WM88 |
| Cluster B | 12 | WM3248 |
| Cluster C | 14 | 1205LU |
| Cluster D | 4 | 451LU, SKMEL19, SKMEL28, SKMEL30, SKMEL63, WM35, WM983, WM983C |
| Cluster E | 25 | WM3211, M14, MEWO, SKMEL2, SKMEL5, UACC257, UACC62, WM122, WM13662, WM239A, WM32112, WM32482, WM793B, 501MEL |
| Cluster F | 10 | MALME3M, WM882 |