| Literature DB >> 34828357 |
Emily Z Ma1, Karl M Hoegler1, Albert E Zhou1.
Abstract
Over 100,000 people are diagnosed with cutaneous melanoma each year in the United States. Despite recent advancements in metastatic melanoma treatment, such as immunotherapy, there are still over 7000 melanoma-related deaths each year. Melanoma is a highly heterogenous disease, and many underlying genetic drivers have been identified since the introduction of next-generation sequencing. Despite clinical staging guidelines, the prognosis of metastatic melanoma is variable and difficult to predict. Bioinformatic and machine learning analyses relying on genetic, clinical, and histopathologic inputs have been increasingly used to risk stratify melanoma patients with high accuracy. This literature review summarizes the key genetic drivers of melanoma and recent applications of bioinformatic and machine learning models in the risk stratification of melanoma patients. A robustly validated risk stratification tool can potentially guide the physician management of melanoma patients and ultimately improve patient outcomes.Entities:
Keywords: bioinformatics; deep learning; machine learning; melanoma; melanoma genomics
Mesh:
Year: 2021 PMID: 34828357 PMCID: PMC8621295 DOI: 10.3390/genes12111751
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Key advances in melanoma genomic research. BI: bioinformatics, ML: machine learning.
Summary of major studies in bioinformatic and machine learning risk stratification of melanoma.
| Publication | Methods | Key Finding(s) | Performance | Data |
|---|---|---|---|---|
| Arora et al. 2020 [ | Multiple machine learning algorithms (e.g., SVM 1, decision tree, random forest) | Machine learning model based on clinicopathologic variables outperformed model based on GEP profiles or AJCC 1 staging in predicting OS 1 | RNA expression data of cutaneous melanomas (CMs) ( | |
| Bellomo et al. 2020 [ | Machine learning logistic regression model | Epithelial-to-mesenchymal transition and melanosome function genes were associated with SLN 1 metastasis; model combining clinicopathologic and gene expression variables better predicted SLN metastases than model with clinicopathologic or gene expression variables | AUROC 1: 0.82 (clinicopathologic and gene expression model) | Gene expression data of primary CMs ( |
| Brinker et al. 2021 [ | Artificial neural network (ANN) | ANNs trained with H&E images not matched to SLN status had AUROC of 62% and may not be clinically relevant to predict SLN status | AUROC: 61.8% (matched), 55.0% (unmatched) | Primary melanoma with positive SLN H&E slides ( |
| Cheng et al. 2015 [ | Multi-variate Cox regression analysis | BRAF and MMP2 were prognostic biomarkers for stage I/II, while p27 is a biomarker for stage III/IV | Primary ( | |
| Farrow et al. 2021 [ | Multi-variate Cox regression analysis | 12 genes predicted RFS 1; increased | RNA samples ( | |
| Garg et al. 2021 [ | Random forest classifier | Machine learning models trained with 121 metastasis associated genes performed better in predicting regional lymph node metastasis than models trained with clinical trained with clinical covariates or published prognostic signatures | PAUROC: 7.03 × 10−4 (combined model) | RNA data of primary CMs ( |
| Huang et al. 2021 [ | Decision-tree algorithm (XGBoost) | 5-methylcytosine (m5c) signatures were used to predict CM prognosis; NSUN6 may be a marker for CM progression | Transcriptomic data of CMs ( | |
| Jiang et al. 2021 [ | GO 1 and KEGG 1 enrichment analysis, PPI network analysis | Identified 435 DEGs 1; | Gene expression data of CMs from UCSC Xena ( | |
| Johannet et al. 2021 [ | Deep convolutional neural network (DCNN) | Machine learning algorithm trained with histology and clinicodemographic variables predicted immunotherapy response (PFS 1) in advanced melanoma patients with AUC 1 of 0.800 | AUC: 0.800 | Advanced melanoma patients ( |
| Jönsson et al. 2010 [ | Unsupervised hierarchical clustering, two-group significance of microarray analysis (SAM), support tree analysis | Four distinct subtypes with unique gene signatures are associated with different prognoses | Global gene expression data of stage IV CMs ( | |
| Lee et al. 2019 [ | Multi-variate Cox regression analysis | Pre-operative ctDNA predicts melanoma-specific survival in stage III melanoma | Pre-operative ctDNA from stage III CM patients ( | |
| Mancuso et al. 2021 [ | Multiple machine learning algorithms (e.g., logistic regression, SVM, decision tree, Gaussian naïve Bayes classifier) | Machine learning algorithm classified early-stage melanoma patients with high and low risk of metastasis; select serum cytokines (e.g., IL-4, GM-CSG, DCD) and Breslow thickness were variables that best predicted metastasis | Accuracy: 80% (Breslow thickness and serum markers model) | Stage I and II melanoma patients ( |
| Segura et al. 2010 [ | SAM, KEGG enrichment analysis | 18 overexpressed miRNAs were significantly correlated with longer post-recurrence survival | Accuracy: 80.2% | Total RNA of metastatic CMs ( |
| Sheng et al. 2020 [ | GO and KEGG enrichment analysis, PPI network analysis | Identified 258 DEGs as potential biomarkers of metastasis | Gene expression data of primary ( | |
| Shepelin et al. 2018 [ | Multiple machine learning algorithms (e.g., SVM, random forest) | Identified 44 characteristic signaling pathways associated with melanoma metastasis | Accuracy: 94% (SVM classifier) | Transcriptomic data of primary and metastatic CMs ( |
| Wang et al. 2020 [ | GO enrichment analysis, PPI network analysis | Gene expression data of CD38 positive CMs from TCGA | ||
| Wei et al. 2018 [ | KEGG and GO enrichment analysis, PPI network analysis, SVM classifier | An SVM predictor for melanoma metastasis had greater than 94% prediction accuracy; 798 DEGs 1 were identified | Accuracy: 94.4 to 100% | Gene expression data of primary ( |
| Wong et al. 2005 [ | Nomogram | A nomogram using clinicopathologic information accurately predicted the probability of a positive SLN in melanoma | Accuracy: 69.4% | SLN biopsies ( |
| Yang et al. 2018 [ | Two-way hierarchical clustering analysis, SVM classifier, random forest classifier | SVM classifier of a 6 lncRNA signature risk-stratified patients with 85% accuracy | Accuracy: 84.84% (two-way hierarchical clustering), 85.9% (SVM classifier) | lncRNA data of primary CMs ( |
| Zormpas-Petridis et al. 2019 [ | Spatially constrained-convolution neural network (SC-CNN) | A novel multi-resolution hierarchical framework (SuperCRF) predicted survival based on histology features; SuperCRF had an 12% improvement in accuracy compared to state-of-art SC-CNN cell classifiers | Accuracy: 84.63% | Melanoma H&E slides ( |
1 AJCC: American Joint Committee on Cancer; AUC: area under the curve; AUROC: area under the receiver operating characteristic; DEG: differentially expressed genes; GO: gene ontology; KEGG: Kyoto Encyclopedia of Genes and Genomes; OS: overall survival; PFS: progression-free survival; RFS: recurrence -free survival; SLN: sentinel lymph node; SVM: support vector machine; TCGA: The Cancer Genome Atlas.