| Literature DB >> 31561508 |
Max Pfeffer1, André Uschmajew2, Adriana Amaro3, Ulrich Pfeffer4.
Abstract
Uveal melanoma (UM) is a rare cancer that is well characterized at the molecular level. Two to four classes have been identified by the analyses of gene expression (mRNA, ncRNA), DNA copy number, DNA-methylation and somatic mutations yet no factual integration of these data has been reported. We therefore applied novel algorithms for data fusion, joint Singular Value Decomposition (jSVD) and joint Constrained Matrix Factorization (jCMF), as well as similarity network fusion (SNF), for the integration of gene expression, methylation and copy number data that we applied to the Cancer Genome Atlas (TCGA) UM dataset. Variant features that most strongly impact on definition of classes were extracted for biological interpretation of the classes. Data fusion allows for the identification of the two to four classes previously described. Not all of these classes are evident at all levels indicating that integrative analyses add to genomic discrimination power. The classes are also characterized by different frequencies of somatic mutations in putative driver genes (GNAQ, GNA11, SF3B1, BAP1). Innovative data fusion techniques confirm, as expected, the existence of two main types of uveal melanoma mainly characterized by copy number alterations. Subtypes were also confirmed but are somewhat less defined. Data fusion allows for real integration of multi-domain genomic data.Entities:
Keywords: DNA-methylation; constrained matrix factorization; copy number alteration; data fusion; gene expression profile; metastasis; similarity network fusion; singular value decomposition; tumor classification; tumor subtypes
Year: 2019 PMID: 31561508 PMCID: PMC6826760 DOI: 10.3390/cancers11101434
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1(a) Copy Number variation plot for the clusterings obtained by jSVD; columns = samples, rows = chromosomes. (b) Somatic mutation, transcriptome and methylome data for the clusterings obtained by jSVD; columns = samples, rows = genes. (c) Kaplan Meyer survival curves for the clusterings obtained by jSVD; y-axis = ratio of patients surviving, x-axis: time in months. Gains (red or yellow) and losses (blue) are indicated by conventional color codes.
Figure 2(a) Copy Number variation plot for the clusterings obtained by jCMF; columns = samples, rows = chromosomes. (b) Somatic mutation, transcriptome and methylome data for the clusterings obtained by jCMF; columns = samples, rows = genes. (c) Kaplan Meyer survival curves for the clusterings obtained by jCMF; y-axis = ratio of patients surviving, x-axis: time in months.
Figure 3(a) Copy Number variation plot for the clusterings obtained by SNF; columns = samples, rows = chromosomes. (b) Somatic mutation, transcriptome and methylome data for the clusterings obtained by SNF; columns = samples, rows = genes. (c) Kaplan Meyer survival curves for the clusterings obtained by SNF; y-axis = ratio of patients surviving, x-axis: time in months.
Contingency table for risk prediction.
| Observed | Predicted | n | Pearson Chi-Square * | Odds Ratio | 95% Confidence Interval | ||||
|---|---|---|---|---|---|---|---|---|---|
| low | interm. | high | |||||||
|
|
| low | 38 | − | 16 | 80 |
|
| 3.2–31.0 |
| high | 5 | − | 21 | ||||||
|
| low | 21 | 10 | 23 | 69 |
|
| 2.7–176.5 | |
| high | 1 | 1 | 24 | ||||||
|
| low | 14 | 21 | 19 | 57 |
|
| 2.0–140.9 | |
| high | 1 | 2 | 23 | ||||||
|
|
| low | 21 | 12 | 21 | 66 |
|
| 2.3–52.8 |
| high | 2 | 2 | 22 | ||||||
|
| low | 21 | 15 | 18 | 63 |
|
| 2.6–62.2 | |
| high | 2 | 2 | 22 | ||||||
|
| low | 17 | 19 | 18 | 59 |
|
| 2.6–179.0 | |
| high | 1 | 2 | 23 | ||||||
* Comparison between low and high risk, intermediate risk not considered.