| Literature DB >> 24339933 |
Isabel Brito1, Philippe Hupé, Pierre Neuvial, Emmanuel Barillot.
Abstract
MOTIVATION: Array-CGH can be used to determine DNA copy number, imbalances in which are a fundamental factor in the genesis and progression of tumors. The discovery of classes with similar patterns of array-CGH profiles therefore adds to our understanding of cancer and the treatment of patients. Various input data representations for array-CGH, dissimilarity measures between tumor samples and clustering algorithms may be used for this purpose. The choice between procedures is often difficult. An evaluation procedure is therefore required to select the best class discovery method (combination of one input data representation, one dissimilarity measure and one clustering algorithm) for array-CGH. Robustness of the resulting classes is a common requirement, but no stability-based comparison of class discovery methods for array-CGH profiles has ever been reported.Entities:
Mesh:
Year: 2013 PMID: 24339933 PMCID: PMC3855312 DOI: 10.1371/journal.pone.0081458
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Description of array-CGH data sets used in this study.
| data set | no. of arrays | no. of probes | platform | tumor tissue |
|
| 98 | 2146 | HumArray 2.0 | bladder |
|
| 25 | 2415 | HumArray 2.0 | pancreas |
|
| 49 | 2385 | HumArray 1.14 | liver |
|
| 85 | 3127 | BAC/PAC | colon |
|
| 49 | 1741 | HumArray 1.11 | bladder |
Figure 1Frequency of input data representation, dissimilarity measure and clustering algorithm among the class discovery methods declared stable for each data set.
The parameters used are Jaccard coefficient and 0.97 threshold.
Figure 2Frequency of input data representation, dissimilarity measure and clustering algorithm among the class discovery methods declared stable for each partition from 2 to 10 clusters.
The parameters used are Jaccard coefficient and 0.97 threshold.
Figure 3Frequency of class discovery methods declared stable.
The parameters used are Jaccard coefficient and 0.97 threshold.
Description of data set "zhang"
| data set | no. of arrays | no. of probes | platform | tumor tissue |
|
| 311 | 58494 | Affymetrix 100K Xka | breast |