| Literature DB >> 29229936 |
Rahul Metri1, Abhilash Mohan2, Jérémie Nsengimana3, Joanna Pozniak3, Carmen Molina-Paris4, Julia Newton-Bishop3, David Bishop3, Nagasuma Chandra5,6.
Abstract
Understanding the biological factors that are characteristic of metastasis in melanoma remains a key approach to improving treatment. In this study, we seek to identify a gene signature of metastatic melanoma. We configured a new network-based computational pipeline, combined with a machine learning method, to mine publicly available transcriptomic data from melanoma patient samples. Our method is unbiased and scans a genome-wide protein-protein interaction network using a novel formulation for network scoring. Using this, we identify the most influential, differentially expressed nodes in metastatic as compared to primary melanoma. We evaluated the shortlisted genes by a machine learning method to rank them by their discriminatory capacities. From this, we identified a panel of 6 genes, ALDH1A1, HSP90AB1, KIT, KRT16, SPRR3 and TMEM45B whose expression values discriminated metastatic from primary melanoma (87% classification accuracy). In an independent transcriptomic data set derived from 703 primary melanomas, we showed that all six genes were significant in predicting melanoma specific survival (MSS) in a univariate analysis, which was also consistent with AJCC staging. Further, 3 of these genes, HSP90AB1, SPRR3 and KRT16 remained significant predictors of MSS in a joint analysis (HR = 2.3, P = 0.03) although, HSP90AB1 (HR = 1.9, P = 2 × 10-4) alone remained predictive after adjusting for clinical predictors.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29229936 PMCID: PMC5725601 DOI: 10.1038/s41598-017-17330-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1A schematic representation of the biomarker identification pipeline. The pipeline involves 5 major steps. Condition specific network: Construction of weighted network using protein-protein interaction network and gene expression data. Shortest Path analysis: Identification of all-vs.-all nodes shortest paths using Dijkstra’s algorithm. Response Paths: Paths with highest differential activity in diseased condition identified using string matching metric. Influence paths: Prioritizing paths based on influence on the network. Signature genes: Feature ranking of genes to obtain minimal set classifying conditions. The conditions considered for study: Normal Skin (NS), Primary Melanoma (PM) and Metastasis Melanoma (MM). Analysis results in numbers is shown in bottom section of the image
Figure 2(A) A network view of response paths identified using the Jaro-Winkler metric for the MM vs. PM comparison. (B) Functional enrichment of differentially regulated genes in top-response paths of MM vs. PM, MM vs. NS and PM vs. NS. (C) Percentage coverage of genes in 41 communities by genes of top-response paths from 3 comparisons. (D) A subnetwork of response paths of MM vs. PM prioritized based on influence score.
Figure 3Final signature genes for 3 condition comparisons. (A) 6 genes of MM vs. PM (B) 20 genes of PM vs. NS (C) 25 of MM vs. NS. The first two columns after gene name show the differential expression level in cohort GSE15605 and GSE7553, respectively. Third column show Venn diagrams indicating percentage of patients, the gene is differentially regulated in TCGA based on z-score. In the human protein atlas section, first four columns show the antibody stain levels observed in melanoma tissue. The size of each circle is based on a ratio of the number of patients showing particular expression to the total patients and the colouring is based on the intensity of expression. The last column is stain intensity in control tissue.
Figure 4(A) log2 intensity values of the 6 genes for each condition in GSE15605, GSE7553 and TCGA. (B) The combined score which is computed as a ratio of product of the signal intensity (log2) of upregulated genes (HSP90AB1 and ALDH1A1) to the product of the signal intensity (log2) downregulated genes (KIT, KRT16, SPRR3 and TMEM45B).
Biological significance of PM vs. NS and MM vs. NS signature.
| Gene symbol | Description | Association with cancer | GO biological process | Remarks | |||
|---|---|---|---|---|---|---|---|
| A | B | C | |||||
|
| |||||||
| TYR* | ↑ | Tyrosinase | ✓ | pigmentation | Well established biomarker of melanoma[ | ||
| RGS20* | ↑ | Regulator Of G-Protein Signaling 20 | ✓ | cell differentiation | Involved in cancer cell aggregation, migration, invasion and adhesion in other cancers[ | ||
| PLA1A | ↑ | Phospholipase A1 Member A | ✓ | metabolic process | Identified to be related to short survival of melanoma patients[ | ||
| SNCA | ↑ | Synuclein Alpha | ✓ | metabolic process | Protein of many diseases and also reported as biomarker of malignant melanoma[ | ||
| QPRT | ↑ | Quinolinate Phosphoribosyltransferase | ✓ | metabolic process | A potential marker for follicular thyroid carcinoma[ | ||
| MLANA | ↑ | Melan-A | ✓ | An established melanoma biomarker[ | |||
| AP1S2* | ↑ | Adaptor Related Protein Complex 1 Sigma 2 Subunit | ✓ | ✓ | intracellular protein transport | Upregulated in expression profile of 20 cancer types[ | |
| PLAT* | ↑ | Plasminogen Activator, Tissue Type | ✓ | cell mobility | Plasminogen activation system studied in uveal melanoma[ | ||
| MLPH | ↑ | Melanophilin | ✓ | intracellular protein transport | Differentially expressed in melanoma[ | ||
| WIPI1 | ↑ | WD Repeat Domain, Phosphoinositide Interacting 1 | ✓ | metabolic process | Coordinates Melanosome Formation and Melanogenic Gene Transcription[ | ||
| ARPC1B* | ↑ | Actin Related Protein 2/3 Complex Subunit 1B | ✓ | ✓ | cell mobility | Prediction marker for choroidal malignant melanoma and lung cancer[ | |
| S100B | ↑ | S100 Calcium Binding Protein B | ✓ | cell proliferation | An established melanoma biomarker[ | ||
| TIMP1 | ↑ | TIMP Metallopeptidase Inhibitor 1 | ✓ | cell proliferation | Timp1 interacts with CD63 to activate PI3-K signaling pathway in melanoma[ | ||
| CD63 | ↑ | CD63 Molecule | ✓ | cell mobility | |||
| FYN* | ↑ | FYN Proto-Oncogene, Src Family Tyrosine Kinase | ✓ | ✓ | cell mobility | Potential biomarker for melanoma and other cancers[ | |
| FN1* | ↑ | Fibronectin 1 | ✓ | cell mobility | Used in a diagnostic assay of metastatic melanoma[ | ||
| SFN* | ↓ | Stratifin | ✓ | ✓ | cell death | Downregulated in melanoma and other cancers[ | |
| ALOX12* | ↓ | Arachidonate 12-Lipoxygenase, 12S Type | ✓ | ✓ | skin development | Biomarker for prostate cancer and also downregulated in melanoma[ | |
| LGALS7 | ↓ | Galectin 7 | ✓ | ✓ | apoptotic process | Dual role observed in melanoma. Downregulation studied in cervical cancer and gastric cancer[ | |
| PIP* | ↓ | Prolactin Induced Protein | ✓ | regulation of immune system process | Biomarker for Breast Cancer[ | ||
|
| |||||||
| SERPINE2 | ↑ | Serpin Family E Member 2 | ✓ | cell differentiation | Therapeutic target for colorectal cancer[ | ||
| S100A1 | ↑ | S100 Calcium Binding Protein A1 | ✓ | cell proliferation | Established melanoma marker[ | ||
| PHLDA1 | ↑ | Pleckstrin Homology Like Domain Family A Member 1 | ✓ | cell differentiation | Expression involved in intestinal tumorigenesis[ | ||
| TAF1A | ↑ | TATA Box-Binding Protein-Associated Factor 1A | Regulation of transcription | ||||
| UBE2C | ↑ | Ubiquitin Conjugating Enzyme E2 C | ✓ | cell proliferation | Therapeutic target for melanoma[ | ||
| LDHB | ↑ | Lactate Dehydrogenase B | ✓ | metabolic process | Established biomarker of melanoma[ | ||
| PARP1 | ↑ | Poly(ADP-Ribose) Polymerase 1 | ✓ | cell differentiation | Associated with poor survival of melanoma patients[ | ||
| ASPM | ↑ | Abnormal Spindle Microtubule Assembly | ✓ | cell differentiation | Has a pro-invasion role in metastasis[ | ||
| ALDH1A3 | ↑ | Aldehyde Dehydrogenase 1 Family Member A3 | ✓ | metabolic process | Identified as marker and target of melanoma therapeutics[ | ||
| PRKAR1A | ↑ | Protein Kinase A Type 1a Regulatory Subunit | ✓ | cell differentiation | Overexpression studied in cholangiocarcinoma[ | ||
| MCM3 | ↑ | Minichromosome Maintenance Complex Component 3 | ✓ | metabolic process | Is a possible independent prognostic marker for melanoma[ | ||
| AASS | ↑ | Aminoadipate-Semialdehyde Synthase | ✓ | metabolic process | Is an oncogene[ | ||
| SDCBP | ↑ | Syndecan Binding Protein | ✓ | cell mobility | Involved in cancer development and progression[ | ||
| COL4A6 | ↓ | Collagen Type IV Alpha6 Chain | ✓ | ✓ | cell adhesion | Involved in aggressiveness and metastasis of melanoma and other cancers[ | |
| AACS | ↓ | Acetoacetyl-CoA Synthetase | ✓ | cell differentiation | Low expression studied in tumor tissues[ | ||
| SGK2 | ↓ | SGK2,Serine/ThreonineKinase 2 | ✓ | regulation of cell growth | Dowregulated in melanoma[ | ||
*Genes also present in MM vs. NS signature. A: Melanoma biomarker B: Studies related to melanoma C: Studies related to other cancers.
Hazard ratios for MSS for individual genes in the whole dataset&.
| Gene | Univariable MSS | Multivariable unadjusted MSS | Multivariable adjusted MSS | |||
|---|---|---|---|---|---|---|
| HR | P value | HR | P value | HR | P value | |
|
| 0.9 | 0.1 | 0.9 | 0.2 | 0.9 | 0.3 |
|
| 1.9 | 2 × 10−4 | 1.7 | 0.002 | 2.0 | 10−4 |
|
| 0.9 | 0.05 | 0.9 | 0.3 | 1.0 | 0.2 |
|
| 1.03 | 0.5 | 1.1 | 0.03 | 1.0 | 0.5 |
|
| 0.9 | 0.005 | 0.96 | 0.4 | 1.0 | 0.5 |
|
| 0.9 | 0.001 | 0.93 | 0.04 | 0.9 | 0.07 |
&Death hazard ratio (HR) reflects the change from the baseline of 1.0 each time the gene expression is doubled.
Association between each gene and histological features of melanoma.
| Gene | AJCC (Pvalue) | Ulceration (Pvalue) | Mitotic rate correlation (P-value) | Breslow thickness correlation(P-value) |
|---|---|---|---|---|
|
| 0.4 | 0.4 | −0.04 (0.4) | −0.01(0.7) |
|
| 0.03 | 9 × 10−4 | 0.13 (0.001) | 0.2 (1.6 × 10−5) |
|
| 5 × 10−5 | 10−5 | −0.11 (0.005) | −0.2 (2.3 × 10−10) |
|
| 0.7 | 0.6 | −0.08 (0.06) | −0.06 (0.1) |
|
| 5.5 × 10−13 | 2.2 × 10−11 | −0.2 (7.9 × 10−8) | −0.3 (3.1 × 10−16) |
|
| 4.8 × 10−12 | 2 × 10−5 | −0.2 (3.1 × 10−7) | −0.3, (1.3 × 10−20) |
Figure 5Survival curves according to the combined 6-gene score (unadjusted) in test data (1/3 of total sample). The score was dichotomised by the median.
Figure 6The 6-gene score distribution by AJCC stage (A), ulceration status (B), mitotic rate (C) and Breslow thickness (D). Note the log scale for mitotic rate and Breslow thickness.