| Literature DB >> 20731868 |
Dwain Mefford1, Joel A Mefford.
Abstract
BACKGROUND: Two-way hierarchical clustering, with results visualized as heatmaps, has served as the method of choice for exploring structure in large matrices of expression data since the advent of microarrays. While it has delivered important insights, including a typology of breast cancer subtypes, it suffers from instability in the face of gene or sample selection, and an inability to detect small sets that may be dominated by larger sets such as the estrogen-related genes in breast cancer. The rank-based partitioning algorithm introduced in this paper addresses several of these limitations. It delivers results comparable to two-way hierarchical clustering, and much more. Applied systematically across a range of parameter settings, it enumerates all the partition-inducing gene sets in a matrix of expression values.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20731868 PMCID: PMC2996978 DOI: 10.1186/1471-2164-11-482
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1erbb2/17q12 gene set as detected in the Uppsala data set. Expression values are mapped to a red-green continuum. Columns are ordered by column sum.
Instances of the erbb2/17q12 gene set detected in the Uppsala data.
| Table 2 | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 21t4 | 23t4 | 25t5 | 26t5 | 27t5 | 29t6 | 31t6 | 32t5 | 33t6 | 34t6 |
| ERRB2 | ERRB2 | ERRB2 | ERRB2 | ERRB2 | ERRB2 | ERRB2 | ERRB2 | ||
| ERRB2 | ERRB2 | ERRB2 | ERRB2 | ERRB2 | ERRB2 | ERRB2 | ERRB2 | ERRB2 | ERRB2 |
| GRB7 | GRB7 | GRB7 | GRB7 | GRB7 | GRB7 | GRB7 | GRB7 | GRB7 | GRB7 |
| PERLD1 | PERLD1 | PERLD1 | PERLD1 | PERLD1 | PERLD1 | PERLD1 | PERLD1 | PERLD1 | PERLD1 |
| PERLD1 | PERLD1 | PERLD1 | PERLD1 | PERLD1 | PERLD1 | PERLD1 | PERLD1 | PERLD1 | PERLD1 |
| STARD3 | STARD3 | STARD3 | STARD3 | STARD3 | STARD3 | STARD3 | STARD3 | STARD3 | STARD3 |
| CRKRS | CRKRS | CRKRS | CRKRS | CRKRS | CRKRS | CRKRS | |||
| PPARBF | PPARBF | PPARBF | PPARBF | PPARBF | PPARBF | PPARBF | |||
| PPARBF | PPARBF | PPARBF | PPARBF | ||||||
| RPL19 | RPL19 | RPL19 | |||||||
| CASC3 | |||||||||
| PSMD3 | |||||||||
| THRAP4 | |||||||||
| GSDML | |||||||||
| GSDML | |||||||||
The partition size (s) and tolerance (t) are listed at the head of each column. The first instance of this gene set is detected at s = 21, t = 4. As s and t are stepped, the number of erbb2/17q12 genes increases from size to a maximum of 15, at s = 29, t = 6. From that point the number of genes decreases.
Figure 2The survival results for the stromal(5)decorin gene set in the Uppsala cohort. Columns are ordered by column sum, and recurrence events have been added.
Data sets.
| dataset | source | n | survival endpoint | median/average follow-up | reference |
|---|---|---|---|---|---|
| Uppsala | GSE3494 | 251 | BCSS | 10.2 years | Miller 2005 |
| Stockholm | GSE1456 | 159 | DMFS | 6.1 years | Pawitan 2005 |
| TRANSBIG | GSE7390 | 198 | DMFS | 13.6 years | Desmedt 2007 |
| NKI | Chang | 295 | DMFS | 6.7 years | VandeVijver 2002 |
Gene sets and survival results for the Uppsala data.
| all samples | ER positive | ER negative | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| median | quartiles | median | quartiles | median | ||||||
| p | p | p | p | p | ||||||
| stromal(0) | ||||||||||
| stromal(2) | 1.16 | 0.28 | ||||||||
| stromal(3) | ||||||||||
| stromal(5) | 3.22 | 0.07 | ||||||||
| adipose | 0.99 | 0.31 | ||||||||
| immune(5) | 2.84 | 0.09 | 2.95 | 0.08 | 2.64 | 0.1 | ||||
| immune(6) | 3.41 | 0.06 | ||||||||
| immune(0) | 1.85 | 0.17 | 0.07 | 0.79 | 0.9 | 0.34 | 0.02 | 0.89 | ||
| TPSAB1 | 2.92 | 0.08 | 2.81 | 0.09 | 0.95 | 0.32 | ||||
| estrogen | 0.65 | 0.42 | 2.36 | 0.12 | 1.15 | 0.28 | 0.5 | 0.48 | ||
| AFFX-BioC-5 | 0 | 0.95 | 0.4 | 0.52 | 0.16 | 0.68 | 0.49 | 0.48 | 0.59 | 0.44 |
| ACTG1 | 0.12 | 0.72 | 0 | 0.96 | 0.7 | 0.4 | 0.06 | 0.8 | 0.74 | 0.38 |
| basal | 0.94 | 0.33 | 1.13 | 0.28 | 0.98 | 0.32 | 2.32 | 0.12 | 0.37 | 0.54 |
| ERBB2 | 2.83 | 0.09 | 3 | 0.08 | 2.27 | 0.13 | 2.4 | 0.12 | 0 | 0.97 |
| GGT1 | 0 | 0.99 | 0 | 0.96 | 0.05 | 0.81 | 0.1 | 0.74 | 0 | 0.99 |
| hemoglobin | 0.03 | 0.86 | 0.06 | 0.81 | 0.32 | 0.56 | 0.55 | 0.45 | 0 | 0.94 |
| immune(1) | 1.54 | 0.21 | 0.05 | 0.82 | 1.07 | 0.3 | 0.38 | 0.53 | 3.32 | 0.06 |
| immune(2) | 0.06 | 0.8 | 0.44 | 0.5 | 0.18 | 0.67 | 0.24 | 0.62 | 0.46 | 0.49 |
| immune(3) | 1.71 | 0.19 | 1.98 | 0.15 | 1.61 | 0.2 | 1.26 | 0.26 | 1.15 | 0.28 |
| immune(4) | 2.21 | 0.13 | 0.68 | 0.4 | 1.76 | 0.18 | 1.27 | 0.25 | 0 | 0.99 |
| immune(7) | 0.5 | 0.47 | 0.1 | 0.74 | 0.93 | 0.33 | 0.01 | 0.93 | 0.79 | 0.37 |
| LST1 | 3.25 | 0.07 | 0.82 | 0.36 | 1.69 | 0.19 | 0.25 | 0.61 | 1.04 | 0.3 |
| NFIB | 0.55 | 0.45 | 0.03 | 0.85 | 0.02 | 0.87 | 0.01 | 0.91 | 0.22 | 0.63 |
| PPP1R12A | 0.01 | 0.94 | 1.19 | 0.27 | 0 | 0.95 | 1.34 | 0.24 | 0.08 | 0.77 |
| ribosomal(0) | 0.35 | 0.55 | 1.7 | 0.19 | 0.36 | 0.55 | 0.53 | 0.46 | 0 | 0.96 |
| ribosomal(1) | 0.63 | 0.42 | 0.07 | 0.79 | 0.14 | 0.7 | 0.03 | 0.87 | 0.03 | 0.86 |
| ribosomal(5) | 0.19 | 0.66 | 0.51 | 0.47 | 0.22 | 0.64 | 0.12 | 0.73 | 0.97 | 0.32 |
| UBE2D2 | 0 | 0.99 | 0.43 | 0.51 | 0 | 0.99 | 0.78 | 0.37 | 0.13 | 0.72 |
| histone | 0.78 | 0.37 | 3.44 | 0.06 | 1.22 | 0.27 | 3.11 | 0.07 | ||
| GAPDH | 3.07 | 0.07 | 2.35 | 0.12 | 2.09 | 0.14 | 3.7 | 0.05 | ||
| CD24 | 0.45 | 0.5 | 5.25 | 0.02 | 0.82 | 0.36 | 0.51 | 0.47 | ||
| AFFX-M27830_5 | 1.68 | 0.19 | 4.3 | 0.03 | 1.72 | 0.19 | 0.01 | 0.92 | ||
| GNAS | 3.34 | 0.06 | 2.07 | 0.15 | 3.04 | 0.08 | 1.36 | 0.24 | ||
| 16q13 | 2.58 | 0.1 | 2.49 | 0.11 | 2.4 | 0.12 | ||||
| proliferation | 0.71 | 0.39 | ||||||||
Gene sets and survival results for the Stockholm data.
| all 159 samples | ||||
|---|---|---|---|---|
| median | quartiles | |||
| p | p | |||
| stromal(2) | ||||
| stromal(3) | ||||
| stromal(5) | ||||
| adipose | ||||
| hemoglobin | 3.33 | 0.06 | ||
| 16q13 | 2.62 | 0 | 2.12 | 0.14 |
| AFFX-BioC-5 | 0.03 | 0.86 | 0.26 | 0.6 |
| AFFX-M27830_5 | 0.15 | 0.69 | 2.09 | 0.14 |
| basal | 3.42 | 0.06 | 1.14 | 0.28 |
| estrogen | 0.02 | 0.88 | 0.98 | 0.32 |
| GGT1 | 1.61 | 0.2 | 0.76 | 0.38 |
| stromal(0) | 2.49 | 0.11 | 2.35 | 0.12 |
| stromal(1) | 0.98 | 0.32 | 0.08 | 0.78 |
| immune(0) | 0.13 | 0.72 | 0.1 | 0.75 |
| immune(1) | 0 | 0.96 | 0.35 | 0.55 |
| immune(2) | 1.1 | 0.29 | 1.11 | 0.29 |
| immune(5) | 1.25 | 0.26 | 0 | 0.96 |
| LST1 | 0.43 | 0.51 | 0.02 | 0.88 |
| OPHN1 | 0.04 | 0.84 | 0.02 | 0.89 |
| PPP1R12A | 2.16 | 0.14 | 3.11 | 0.07 |
| ribosomal(0) | 0.72 | 0.39 | 0.63 | 0.42 |
| TPSAB1 | 0.22 | 0.64 | 0.49 | 0.48 |
| UBE2D2 | 0 | 0.99 | 1.9 | 0.16 |
| immune(4) | 3.36 | 0.06 | ||
| ezrin | 3.35 | 0.06 | ||
| ACTG1 | 1.65 | 0.19 | ||
| ribosomal(1) | ||||
| histone | ||||
| CD24 | 2.7 | 0.1 | ||
| GAPDH | ||||
| proliferation | ||||
Gene sets and survival results for the TRANSBIG data.
| all 198 samples | ER positive | ER negative | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| median | quartiles | median | quartiles | median | ||||||
| p | p | p | p | p | ||||||
| ribosomal(0) | 0.06 | 0.8 | ||||||||
| ribosomal(3) | 3.38 | 0.06 | 0.46 | 0.49 | ||||||
| ribosomal(4) | 1.94 | 0.16 | 3.18 | 0.07 | 0.02 | 0.88 | ||||
| stromal(2) | 1.54 | 0.21 | 2.45 | 0.11 | 0.44 | 0.5 | 0.22 | 0.63 | ||
| AFFX-BioC-5 | 1.64 | 0.2 | 0.47 | 0.49 | 0.1 | 0.75 | 2.75 | 0.09 | ||
| adipose | 1.67 | 0.19 | 1.1 | 0.29 | 3.31 | 0.06 | 1.17 | 0.27 | 0 | 0.96 |
| ACTG1 | 2.53 | 0.11 | 0.43 | 0.51 | 0.31 | 0.57 | 0 | 0.98 | 0.49 | 0.48 |
| CD24 | 0.17 | 0.67 | 2.84 | 0.09 | 0.45 | 0.5 | 0.81 | 0.36 | 0.13 | 0.72 |
| CD44 | 0.31 | 0.57 | 0.05 | 0.81 | 0.41 | 0.52 | 0.05 | 0.86 | 0.01 | 0.92 |
| CFLAR | 1.66 | 0.19 | 0 | 0.99 | 0.82 | 0.36 | 2.84 | 0.09 | 1.04 | 0.3 |
| ERBB2 | 0.75 | 0.38 | 0.01 | 0.93 | 0.94 | 0.33 | 2.75 | 0.09 | 0.77 | 0.38 |
| estrogen | 0.24 | 0.62 | 0.57 | 0.44 | 0.28 | 0.59 | 2.11 | 0.14 | 0.54 | 0.46 |
| FOXA1 | 1.1 | 0.29 | 0.27 | 0.6 | 0.29 | 0.58 | 0.19 | 0.66 | 0.58 | 0.44 |
| GAPDH | 0.35 | 0.55 | 2.03 | 0.15 | 2.05 | 0.15 | 0.18 | 0.67 | 0.01 | 0.9 |
| GGT1 | 0.87 | 0.35 | 0 | 0.99 | 0.03 | 0.86 | 0.04 | 0.84 | 1.52 | 0.21 |
| hemoglobin | 0.24 | 0.62 | 0.53 | 0.46 | 0.31 | 0.57 | 0.11 | 0.73 | 0.74 | 0.38 |
| immune(0) | 0.05 | 0.81 | 0.57 | 0.44 | 0.22 | 0.64 | 0 | 0.96 | 1.35 | 0.24 |
| immune(1) | 1.01 | 0.31 | 0.56 | 0.45 | 0.87 | 0.34 | 0.94 | 0.33 | 2.2 | 0.13 |
| immune(2) | 0.31 | 0.57 | 0.17 | 0.68 | 0.49 | 0.48 | 0.1 | 0.74 | 0.01 | 0.94 |
| immune(4) | 0.04 | 0.83 | 0.01 | 0.92 | 0.05 | 0.81 | 0 | 0.98 | 2.11 | 0.14 |
| immune(5) | 1.26 | 0.26 | 0.04 | 0.84 | 0.68 | 0.4 | 1.27 | 0.26 | 0.23 | 0.63 |
| immune(6) | 0.08 | 0.77 | 0.1 | 0.75 | 0.19 | 0.66 | 0.67 | 0.41 | 0.97 | 0.32 |
| immune(10) | 1.81 | 0.17 | 0.47 | 0.49 | 1.71 | 0.19 | 1.97 | 0.16 | 0.97 | 0.32 |
| myo | 0.22 | 0.63 | 0.39 | 0.53 | 0.11 | 0.73 | 0 | 0.94 | 3.01 | 0.08 |
| NFIB | 0.56 | 0.45 | 0.12 | 0.73 | 0.41 | 0.52 | 2.12 | 0.14 | 0 | 0.98 |
| PPP1R12A | 1.47 | 0.22 | 0.26 | 0.61 | 0.51 | 0.47 | 1.26 | 0.26 | 0.63 | 0.42 |
| SMARCA4 | 0.08 | 0.77 | 0.43 | 0.51 | 1.04 | 0.3 | 2.79 | 0.09 | 1.99 | 0.15 |
| stromal(0) | 0.29 | 0.59 | 2.37 | 0.12 | 0.41 | 0.52 | 1.6 | 0.2 | 0.01 | 0.92 |
| stromal(1) | 2.51 | 0.11 | 1.46 | 0.22 | 2.21 | 0.13 | 0.08 | 0.77 | 2.36 | 0.12 |
| stromal(5) | 0.58 | 0.44 | 0.25 | 0.61 | 0.03 | 0.85 | 0.8 | 0.37 | 1.69 | 0.19 |
| TCF4 | 0.87 | 0.35 | 0.13 | 0.71 | 0.94 | 0.33 | 0.55 | 0.45 | 0.92 | 0.33 |
| TPSAB1 | 0.21 | 0.64 | 0.03 | 0.86 | 0.23 | 0.63 | 0.29 | 0.59 | 0.03 | 0.86 |
| UBE2D2 | 0.05 | 0.81 | 0 | 0.95 | 1.07 | 0.3 | 2.82 | 0.09 | 0.42 | 0.51 |
| basal | 0.13 | 0.71 | 2.48 | 0.11 | 2.32 | 0.12 | 0.35 | 0.55 | ||
| histone | 1.54 | 0.21 | 0.26 | 0.61 | 2.21 | 0.13 | 0.33 | 0.56 | ||
| NKTR | 1.29 | 0.25 | 0.03 | 0.85 | 1.55 | 0.21 | 1.54 | 0.21 | ||
| 16q13 | 0.86 | 0.35 | 0.23 | 0.63 | 0.4 | 0.52 | 0.27 | 0.6 | ||
| proliferation | ||||||||||
Gene sets detected in the three Affymetrix U133a data sets.
| Uppsala | TRANSBIG | Stockholm |
|---|---|---|
| adipose | adipose | adipose |
| basal | basal | basal |
| estrogen | estrogen | estrogen |
| proliferation | proliferation | proliferation |
| erbb2 | erbb2 | |
| immune(0) | immune(0) | immune(0) |
| immune(1) | immune(1) | immune(1) |
| immune(2) | immune(2) | immune(2) |
| immune(3) | ||
| immune(4) | immune(4) | immune(4) |
| immune(5) | immune(5) | immune(5) |
| immune(6) | immune(6) | |
| immune(7) | ||
| immune(10) | ||
| stromal(0) | stromal(0) | stromal(0) |
| stromal(1) | stromal(1) | |
| stromal(2) | stromal(2) | stromal(2) |
| stromal(3) | stromal(3) | |
| stromal(5) | stromal(5) | stromal(5) |
| ribosomal(0) | ribosomal(0) | ribosomal(0) |
| ribosomal(1) | ribosomal(1) | |
| ribosomal(3) | ||
| ribosomal(4) | ||
| ribosomal(5) | ||
| metallothioneins | metallothioneins | metallothioneins |
| ACTG1 | ACTG1 | ACTG1 |
| AFFX-BioC-5_at | AFFX-BioC-5_at | AFFX-BioC-5_at |
| CD24 | CD24 | CD24 |
| CD44 | ||
| GAPDH | GAPDH | GAPDH |
| GGT1 | GGT1 | GGT1 |
| hemoglobin | hemoglobin | hemoglobin |
| histone | histone | histone |
| PPP1R12A | PPP1R12A | PPP1R12A |
| TPSAB1 | TPSAB1 | TPSAB1 |
| UBE2D2 | UBE2D2 | UBE2D2 |
| NFIB | NFIB | |
| AFFX-M27830_5 | AFFX-M27830_5 | |
| LST1 | LST1 | |
| GNAS | ||
| NKTR | ||
| FOXA1 | ||
| myo | ||
| CFLAR | ||
| SMARCA4 | ||
| TCF4 | ||
| ezrin | ||
| OPHN1 |
Gene sets and survival results for the NKI data.
| 35 gene sets | all 295 samples | ER positive | ER negative | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| median | quartiles | median | quartiles | median | ||||||
| p | p | p | p | p | ||||||
| estrogen | 2.9 | 0.08 | ||||||||
| FOXA1 | 1.72 | 0.18 | 0.09 | 0.76 | ||||||
| stromal(5) | 2.73 | 0.09 | 2.13 | 0.14 | 3.21 | 0.07 | 0.02 | 0.88 | ||
| stromal(4) | 0.01 | 0.9 | ||||||||
| MAGEB1 | 0.82 | 0.36 | 1.83 | 0.17 | 0.19 | 0.66 | 0.41 | 0.52 | ||
| 8p11_12 | 2.56 | 0.1 | 0.4 | 0.52 | ||||||
| immune(0) | 0.01 | 0.92 | 0.11 | 0.74 | 2.8 | 0.09 | 1.25 | 0.26 | ||
| immune(1) | 0.81 | 0.36 | 0.07 | 0.78 | 0.03 | 0.86 | 0.82 | 0.36 | ||
| AANAT | 0.8 | 0.77 | 0.03 | 0.85 | 0.03 | 0.58 | 0.72 | 0.39 | 2.32 | 0.12 |
| ACACB | 0.03 | 0.85 | 0.85 | 0.35 | 1.31 | 0.25 | 0.67 | 0.41 | 1.38 | 0.24 |
| adipose | 0.13 | 0.71 | 0.69 | 0.4 | 0.66 | 0.41 | 0.21 | 0.64 | 0.01 | 0.92 |
| basal | 0.16 | 0.68 | 0.4 | 0.52 | 0.01 | 0.93 | 0.11 | 0.73 | 0.4 | 0.52 |
| BCL2L1 | 0.59 | 0.44 | 0.4 | 0.52 | 3.97 | 0.04 | 1.22 | 0.26 | 0.01 | 0.92 |
| EMX2 | 0.75 | 0.38 | 1 | 0.31 | 0.25 | 0.61 | 0.04 | 0.84 | 0.48 | 0.48 |
| ERBB2 | 0.34 | 0.55 | 0.55 | 0.45 | 0.76 | 0.38 | 0.25 | 0.61 | 0.08 | 0.77 |
| H3_histone | 0.27 | 0.6 | 0.17 | 0.68 | 0.27 | 0.6 | 0.11 | 0.74 | 0.02 | 0.89 |
| immune(5) | 0.8 | 0.37 | 0.06 | 0.8 | 3.01 | 0.08 | 2.23 | 0.13 | 0.98 | 0.32 |
| immune(9) | 0.34 | 0.55 | 0 | 0.99 | 0.85 | 0.35 | 0 | 0.94 | 0 | 0.97 |
| JAG2 | 0.12 | 0.72 | 0.14 | 0.71 | 0.17 | 0.68 | 0.22 | 0.63 | 0.01 | 0.9 |
| KRT1 | 1.7 | 0.19 | 0.81 | 0.36 | 2.58 | 0.1 | 3.21 | 0.07 | 0.71 | 0.4 |
| MMP17 | 0.02 | 0.89 | 1.42 | 0.23 | 0 | 0.95 | 0.11 | 0.74 | 0.65 | 0.42 |
| NRGN | 0.06 | 0.8 | 0.87 | 0.35 | 0.09 | 0.76 | 0.43 | 0.51 | 0.35 | 0.55 |
| RAB18 | 0.34 | 0.55 | 1.77 | 0.18 | 0.57 | 0.44 | 0.15 | 0.69 | 0.06 | 0.8 |
| RAPGEF2 | 1.25 | 0.26 | 1.23 | 0.26 | 1.46 | 0.22 | 0.19 | 0.66 | 2.48 | 0.11 |
| RBM5 | 0.03 | 0.85 | 0.65 | 0.42 | 0.24 | 0.62 | 0.85 | 0.35 | 0.31 | 0.57 |
| ribosomal(0) | 0.01 | 0.93 | 0.37 | 0.54 | 0.09 | 0.77 | 0.14 | 0.71 | 0.12 | 0.73 |
| ribosomal(2) | 0.8 | 0.36 | 1.48 | 0.22 | 0.52 | 0.47 | 1.19 | 0.27 | 0.21 | 0.64 |
| SEMG1 | 2.67 | 0.1 | 2.25 | 0.13 | 2.3 | 0.12 | 2.06 | 0.15 | 2.99 | 0.08 |
| stromal(0) | 0.31 | 0.57 | 0.62 | 0.42 | 2.41 | 0.12 | 2.33 | 0.12 | 0.28 | 0.59 |
| PTPRO | 0.02 | 0.89 | 2.84 | 0.09 | 0.22 | 0.63 | 0 | 0.97 | ||
| ATP12A | 3.04 | 0.08 | 1.78 | 0.18 | 1.49 | 0.22 | ||||
| immune(2) | 2.68 | 0.1 | 0.4 | 0.52 | ||||||
| immune(3) | 1.01 | 0.31 | ||||||||
| H2B_histone | 0.3 | 0.58 | ||||||||
| proliferation | ||||||||||
Figure 3Gene sets detected across Affymetrix and Agilent platforms. Red indicates significant association with decreased survival; green indicates significant association with increased survival.