| Literature DB >> 33937286 |
Dekker C Deacon1, Eric A Smith2, Robert L Judson-Torres1,3.
Abstract
Despite significant progress in the development of treatment options, melanoma remains a leading cause of death due to skin cancer. Advances in our understanding of the genetic, transcriptomic, and morphologic spectrum of benign and malignant melanocytic neoplasia have enabled the field to propose biomarkers with potential diagnostic, prognostic, and predictive value. While these proposed biomarkers have the potential to improve clinical decision making at multiple critical intervention points, most remain unvalidated. Clinical validation of even the most commonly assessed biomarkers will require substantial resources, including limited clinical specimens. It is therefore important to consider the properties that constitute a relevant and clinically-useful biomarker-based test prior to engaging in large validation studies. In this review article we adapt an established framework for determining minimally-useful biomarker test characteristics, and apply this framework to a discussion of currently used and proposed biomarkers designed to aid melanoma detection, staging, prognosis, and choice of treatment.Entities:
Keywords: biomarkers; diagnostics; likelihood ratio; melanoma; prognostics
Year: 2021 PMID: 33937286 PMCID: PMC8085270 DOI: 10.3389/fmed.2021.642380
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Abbreviated MPATH-DX classification and AJCC 8th edition staging criteria for cutaneous melanoma.
Figure 1Examples of pigmented skin lesion clinical decision points and their outcomes, which could be aided by validated biomarkers. (A) Assessing a melanocytic lesion of unknown significance will result in a decision to either monitor or biopsy that lesion. (B) After biopsy, the histopathological diagnosis made by a pathologist or dermatopathologist will identify the lesion as a benign, atypical or malignant melanocytic proliferation. This determination will dictate whether the region is monitored or additional surgeries are performed. (C) Recommendations for additional therapy, such as potential sentinel lymph node biopsy and systemic adjuvant therapy, are made based on the estimated risk of metastasis, occurrence of metastasis, and evidence of treatment failure.
Figure 2Theoretical framework for biomarker utility analysis. (A) A clinician may be challenged with a state of uncertainty, where the perceived probability of the disease (prior odds, purple) does not surpass the clinician's personal thresholds for recommending either major intervention or no intervention (decision thresholds, red dotted lines). A clinician may hope to reduce uncertainty based upon a biomarker test result. A positive result must alter the probability of the disease sufficiently to breach the decision threshold to recommend major intervention (blue line). A negative result must alter the probability of the disease sufficiently to breach the decision threshold to recommend no intervention (green line). In this example, the prior odds are an equal probability of disease or no disease. Based upon this clinician's decision thresholds, an actionable biomarker test result would require either a 10-fold increase or 10-fold decrease in this perceived probability (posterior odds). The extent by which a biomarker test might change the perceived odds can be calculated as a likelihood ratio (LR). A positive LR (LR+, blue line) increases the likelihood of the disease state being present and a negative LR (LR–, green line) decreases the likelihood of the disease state being present. The posterior odds following a biomarker test can be calculated as the prior odds multiplied by the likelihood ratio. (B) LR+ and LR– can be plotted against sensitivity and specificity on the axes of a receiver operator characteristic (ROC) curve graph, which represents the sensitivity and specificity of the test at different cut-off values where an “abnormal” result flag would be generated (red curved line). The LR values represent the slope at any given point on the ROC curve, and the LR lines for point V are represented. The area under the ROC curve (AUC) describes the overall sensitivity and specificity of a test at all cut-off points. AUC of 1.0 represents 100% sensitivity and specificity, and a useless test (gray line) has an AUC of 0.5. The red ROC curve has an approximate AUC of >0.9. (C) In the example shown in (A), a minimum LR+ of 10 is required to generate posterior odds that breach the clinician's decision threshold for major intervention. The area (shaded in light blue) left of this LR+ slope (dark blue line) represents the sensitivity and specificity characteristics of a test useful for confirming or ruling in a disease state that has an LR+ ≥10. A biomarker test with performance represented by the red ROC curve has a greater range of cut-off values that can produce an LR+ ≥ 10, and has a higher maximal potential sensitivity (point W) compared to the purple ROC curve (point X). This translates to a greater odds of achieving a true test value that has the potential to change the clinician's decision for the red ROC curve, especially with test cut-off values at point W. (D) Similar to (C), the area (shaded in green) above the LR– slope (dark green line) represents the sensitivity and specificity characteristics of a test useful for excluding or ruling out the example disease state from (A). The red ROC curve represents a hypothetically useful test with a higher maximal sensitivity capable of achieving a LR– ≤0.1 (point Y), and the purple ROC curve represents a test that is less likely to influence clinical decision making with a lower maximal sensitivity capable of achieving a LR– ≤ 0.1 (point Z). Figure adapted from (23).
Figure 3Hypothetical biomarker test characteristics to assist in deciding to biopsy. (A) Solitary lesions with perceived prior odds of malignancy estimated at 1:2 to 1:30 are typically biopsied due to the minimal pain and expense of biopsy. Therefore, a test's LR+ characteristic is not generally valuable when evaluating a single lesion. In contrast, a LR– of 0.1 indicates a 10-fold decrease in likelihood of malignancy, and may inform the decision to not biopsy. (B) LR+ and LR- slopes from (A) plotted on a receiver operating characteristic (ROC) curve, indicate the hypothetical test characteristics required of a biomarker to provide clinically meaningful decision to biopsy a suspicious lesion, with the area shaded in green representing estimated test characteristics useful in ruling out biopsy. (C) The estimations and assumptions change when a single patient presents with numerous borderline lesions. Due to limited patient tolerance for multiple biopsies, there may be an increased threshold to biopsy each additional lesion. A test designed to address these parameters would have different test characteristics. For instance, a LR– ≤ 0.2 might convince a provider to defer biopsy on a lesion, and a LR+ ≥5 could prompt a biopsy for a lesion that would have otherwise been monitored. (D) In the setting of multiple lesions, the question of which lesions to biopsy may be more readily influenced with biomarker testing, as indicated by the light blue shaded area (shift from indecision to biopsy) or light green shaded area (shift from indecision to monitor). These assumptions representing a higher burden of biopsy and altered threshold may also apply to biopsies of sensitive areas such as face, genitals, or nail bed.
Figure 4Biomarkers have the potential to clarify the risk of melanoma in the histopathologic interpretation of borderline pigmented lesions. (A) Melanocytic lesions exist on a histologic spectrum from benign-appearing to malignant. At the extremes of this spectrum (e.g., unambiguous benign melanocytic nevus (MPATH-Dx class I) or unambiguous invasive melanoma (MPATH-Dx class IV-V) no biomarker test is likely to change management (gray dots, Table 1). For borderline melanocytic lesions, especially MPATH-Dx class II-III lesions, pathologists are generally conservative and recommend excision of ambiguous lesions that may have no malignant potential to avoid potential adverse consequences of underdiagnosis (36). This results in excision using 5 mm margins of many atypical and dysplastic melanocytic lesions without full knowledge of malignant potential. A LR– of 0.2, corresponding to a 5-fold reduction in estimated malignant potential, may be sufficient to avoid a recommendation of excision in a more benign-appearing atypical/dysplastic specimen if threshold for treatment is set at ~1:200. At the more dysplastic end of the spectrum, a biomarker test with LR+ of 2 may be sufficient to increase recommendation from excision with 5 mm margins to excision with 10 mm margins. This results in a significant increase in size of total excision and repair that is commensurate with the higher estimated increased risk of malignancy. Due to the inherently ambiguous nature of MPATH-Dx class II–III lesions, it is challenging to estimate the decision thresholds for treatment. Depending on the pathologist's level of suspicion and the clinician's threshold for intervention (which may be influenced by patient specific-factors including prior odds and morbidity of procedure), different LR values are almost certainly necessary to justify changing intervention, such as deferring excision or increasing excision margins. (B) The estimated minimum necessary likelihood ratios indicate that there is a significant range of biomarker characteristics that provide clinical utility in deferring excision (light green shaded area) and/or increasing excision margins (light blue shaded area).
Melanoma biomarkers used by reference centers and dermatopathologists.
| Protein expression | IHC | Loss of p16 protein expression | 26 | 11 desmoplastic melanomas, 15 Spitz nevi | Sectioned FFPE slide | Differentiate desmoplastic Spitz nevus and desmoplastic melanoma | NR | NR | 82 | No | Qualitative, requires expert interpretation | ( |
| 37 | 19 Spitzoid melanomas, 18 Spitz nevi | Sectioned FFPE slide | Does not differentiate Spitzoid melanomas from Spitz nevi | Initial diagnosis plus consensus conference with 3+ dermatopathologists | NR | 21 | ( | |||||
| PRAME protein expression (4+ staining) | 400 | 255 primary and metastatic melanomas, 145 melanocytic nevi | Sectioned FFPE slide | Support diagnosis of melanoma by PRAME positivity | 2+ | NR | 87 | Yes | Qualitative, requires expert interpretation | ( | ||
| 110 | 110 ambiguous cases reviewed by 2+ dermato-pathologists | Sectioned FFPE slide | Validate and compare PRAME IHC to FISH for differentiating ambiguous cases | 2+ | NR | 75.0/98.8/NR/NR, >62.5, 0.253 | ( | |||||
| Copy Number Variation | CGH | Copy number variation by comparative genomic hybridization | 186 | 132 melanomas, 54 benign nevi (27 Spitz nevi, 19 blue nevi, 7 congenital nevi) | High tissue requirement; remainder of FFPE block | Differentiate melanoma from benign nevi on basis of copy number variation | NR | NR | 96 | No | Qualitative, requires expert interpretation | ( |
| FISH | FISH | Four probes targeting chromosome 6p23, 6p25, 11q13, and centromere 6 | 22 | 12 ambiguous lesions, 10 unequivocal lesions | Medium to High tissue requirement | Validate FISH on histologically ambiguous lesions based on clinical behavior of lesion | Initial diagnosis plus one dermatopathologist review, ambiguous tumors reviewed by two pathologists | NR | 60/50/NR/NR, 1.2/0.80 | Yes | Qualitative, requires expert interpretation | ( |
| Four probes targeting chromosome 9p21, 6p25, 11q13, and 8q24 | 424 | Training set: 152 melanoma and 170 nevi. Validation set: 51 melanoma and 51 nevi. | Medium to High tissue requirement | Distinguish between melanoma and nevi on basis of chromosomal changes | 1 | 0.94+ | 94/98/NR/NR, 47/0.061 | Yes | ( | |||
| Gene Expression with Myriad myPath | 23 GEP qRT-PCR | Weighted 23 gene expression algorithm by qRT-PCR | 901 | Training set: 254 melanoma and 210 nevi. Validation set: 437 independent lesions | Medium tissue requirement, macro-dissection from sections | Differentiate benign nevi from malignant melanoma | Initial diagnosis, one study dermatopathologist review, if discordance a third dermatopathologist | 0.96 | 90/91/NR/NR, 10/0.11 | Yes | Quantitative algorithm, Qualitative output | ( |
| 1,400 | 204 melanoma, 656 nevi | Prospective validation of differentiating between benign and malignant melanocytic lesions | Concordance between 3 experienced dermatopathologists | NR | 91.5/92/NR/NR, 11.4/0.0924 | ( | ||||||
| 182 | 99 melanomas, 83 nevi | Correlate long-term clinical outcomes with gene signature | 3 | NR | 93.8/96.2/NR/NR, 24.7/0.0644 | ( | ||||||
| 50 | 23 melanomas, 27 nevi | Categorize potential desmoplastic melanomas as likely malignant or likely benign | 2 dermatopathologists, independent review | NR | Sensitivity is “about 80%, better than FISH” | ( | ||||||
| 181 | 125 diagnostically uncertain lesions, 56 diagnostically certain lesions | Test accuracy of GEP to diagnosis and outcomes (gold standard) in cases with uncertain histo-pathological diagnoses | 3 dermatopathologists | NR | 90.4/95.5/NR/NR, 20.1/0.101 | ( | ||||||
| CGH vs. FISH | 30 | 25 melanomas, 5 nevi | Histology | Confirmation of initial diagnosis by 1 dermatopathologist | – | FISH vs. CGH: 90% | – | – | ( | |||
| Histology vs. FISH vs. myPath | 117 | Histologically unequivocal: 15 malignant and 24 benign. 78 Histologically challenging cases | Histology | 1 pathologist for unequivocal and 3 for challenging cases | FISH: 93/100/NR/NR, >90/0.07 | FISH vs. histology: 97%, myPath vs. histology: 83% | FISH: 56/83/NR/NR, 3.3/0.53 | FISH vs. histology: 70%, myPath vs. histology: 64% | ( | |||
| Histology vs. myPath vs. FISH vs. (discordant cases only) CGH | 268 | Histologically unequivocal: 198. Histologically challenging: 70 | Histology and SNP-array | Challenging cases reviewed by 1 study and 2 independent dermatopathologists | myPath: 67/81/NR/NR, 3.5/0.41 | myPath vs. histology: 75% | FISH: 61/100/NR/NR, >60/0.41 | FISH vs. histology: 84%, myPATH vs. histology: 74% | ( | |||
AUC, area under (receiver operating characteristic) curve; CGH, comparative genomic hybridization; FFPE, formalin-fixed paraffin embedded; FISH, Fluorescence in situ hybridization; IHC, immunohistochemistry; LR+, positive likelihood radio; LR–, negative likelihood radio; NPV, negative predictive value; NR, not reported; PPV, positive predictive value; qRT-PCR, quantitative real time polymerase chain reaction; S, sensitivity; Sp, specificity.
Calculated from manuscript data, no validation cohort.
Tests are considered to have independent cohort validation if an independent clinical cohort was included in the original study or at least one follow-up study used an independent cohort with the same diagnoses or a cohort with ambiguous diagnoses.
All LR values are calculated based on reported sensitivity and specificity.
Proposed melanoma diagnosis biomarkers and prospective uses.
| DNA Methylation | 40-CpG classifier | Next generation sequencing | 162 | 89 melanoma, 73 nevi | 250–300 ng of DNA from microdissected slides | Classify uncertain samples as melanoma or nevi | 4 | 0.996 | 96.6/100/ 100/96.2, >90/0.03 | Quantitative | ( |
| Promoter methylation | 1,505 CpG site microarray, analyzed 12 CpG loci highly predictive of melanoma | 49 | 22 melanoma, 27 nevi | 250 ng of DNA from FFPE tissue | Differentiate nevi and malignant melanoma | NR | 0.89–1.0 | ≤ 100/ ≤ 100/NR/NR | Quantitative | ( | |
| CpG island hypermethylation in promoter of | 405 | 199 melanoma, 208 nevi | Not reported, extraction from tissue | Differentiate dysplastic nevi and malignant melanoma | NR | 0.806 | 52/94/ /NR/NR, 8.67/0.51 | Quantitative | ( | ||
| 152 | 84 melanoma, 68 nevi | Serum draw | Differentiate healthy vs. | NR | 0.905 | NR | Quantitative | ( | |||
| Gene expression | microRNA ratio | Sequencing or qRT-PCR to obtain microRNA Ratio Trained Model score | 179 | 106 melanoma, 73 nevi | Single tissue section | Differentiate nevi and malignant melanoma | 5+ | 0.9 | 81/88/ NR/NR, 6.75/0.22 | Quantitative | ( |
| RT-PCR | qRT-PCR for | 193 | 47 melanoma, 48 nevi | 3–12 tissue sections | Differentiate dysplastic nevi and malignant melanoma | NR | 0.94 | NR | Quantitative | ( | |
| Microarray | DNA microarray assay using 14 genes | 120 | 62 melanoma, 58 nevi | 8 μM tissue section, microdissected with laser capture | Differentiate nevi and malignant melanoma | 2–4 | NR | 90/86/NR/ NR | Quantitative | ( | |
| Serum nucleic acid | Circulating miRNA panel | qRT-PCR for six microRNAs in serum | 65 | Serum from 50 uveal melanoma, 5 metastatic uveal melanoma, 10 uveal nevi | Serum draw | Differentiate uveal melanoma from nevi | NR | NR for panel | 93/100 | Quantitative | ( |
| Protein Expression | Ciliation index | Immunofluorescence for acetylated alpha-Tubulin | 124 | 26 melanoma, 42 nevi | Single tissue section | Classification of Spitzoid tumors as malignant or benign | 3+ | 0.84 | 81/65/NR/NR, 2.31/0.29 | Semi- quantitative | ( |
| IHC | IHC for 5-hydroxymethyl-cystosine | 190 | 126 melanoma, 45 nevi | Single tissue section | Differentiate nevi from melanoma | 2 | 0.78 | 93/98/NR/NR, 46.5/0.07 | Semi-quantitative | ( | |
| Multi-IHC semi-quantitative scoring system | IHC scoring of Ki-67, p16, and HMB45 | 308 | 234 melanoma, 74 nevi | Three tissue sections | Differentiate nevi and malignant melanoma | NR | 0.987 | 97.4/97.3/NR/NR, 36.1/0.03 | Semi-quantitative | ( |
AUC, area under (receiver operating characteristic) curve; FFPE, formalin-fixed paraffin embedded; LR+, positive likelihood radio; LR–, negative likelihood radio; NPV, negative predictive value; NR, not reported; PPV, positive predictive value; S, sensitivity; Sp, specificity; qRT-PCR, quantitative real time polymerase chain reaction.
Leave 1-out validation.
All LR values are calculated based on reported sensitivity and specificity.
Proposed prognostic, staging, and treatment-monitoring biomarkers.
| Castle DecisionDx | 31 GEP | TaqMan qRT-PCR from FFPE tissue | 1421 | By T: 613 T1, 452 T2, 240 T3, 116 T4 | macrodissected FFPE sections | Identify T1–T2 melanomas at low risk for positive SLNB | NR | NR | NR | Quantitative | ( |
| Skyline | GEP and clinico-pathologic features | TaqMan qRT-PCR from FFPE tissue | 506 | 160 discovery, 360 development, 146 validation | FFPE blocks | Develop and validate a GEP panel to identify positive SLNB | 2+ dermato-pathologists | 0.78 | 89/76/NR/NR, 3.71/0.14 | Quantitative GEP, qualitative clinicopathologic features | ( |
| 754 | 128 positive SLNB, 626 negative SLNB | FFPE blocks | Identify patients who can forgo SLNB | Initial diagnosis made with 2+ dermato-pathologists. No other review | 0.82 | TIb: 41/82/7/98, 2.28/0.72; TIIa: 80/53/21/95, 1.7/0.24; TIIb: 94/27/21/96, 1.29/0.22; TIIIa: 99/12/33/98, 1.13/0.08; TIIIb: 100/7/45/100, 1.08/ <0.01 | ( | ||||
| Circulating Tumor Cells | Flow and fluorescent microscopy | CD146 cell sorting. DAPI+, HMW-MAA+, CD45/CD34– Immuno-fluorescence | 15 | Tumor: 15, 0. Metastasis 6, 9 | Whole blood | Identification of early mets | NR | NR | CTC: 33/100/100/69, >30/0.67; CTC+5-S-cysteinyldopa: 67/100/100/82, >60/0.33 | Semi-quantitative | ( |
| Serum nucleic acid | Circulating non-coding RNA by qRT-PCR | hsa-miR-1246 miScript assay for RNU2-1f | 122 | 33 training, 16 distant metastasis validation, 16 nevi, 57 healthy control | Serum draw | Use RNU2-1f non-coding RNA serum level to diagnose melanoma mets | NR | 0.9375 (validation) | Regional mets: 69.6/87.2/NR/NR Distant mets: 70.0/87.2/NR/NR | Quantitative | ( |
| Protein Expression | ELISA | ELISA for S100 and OPN | 106 | 6 squamous cell carcinomas, 76 melanomas, 24 metastatic mela-nomas, 3 normal | Plasma draw | Distinguish metastatic melanoma from local disease with OPN +/– S100 and LDH | NR | OPN and S100: 0.97 | 95.5/85.9/NR/NR, 6.77/0.05 | Quantitative | ( |
| Castle DecisionDx | 31 GEP | TaqMan qRT-PCR from FFPE tissue | 479 | 107 discovery, 268 development, 104 validation | macrodissected FFPE sections | Identification of stage I/II tumors with worse prognosis | NR | 0.93 | 100/78/NR/NR, 4.54/ <0.01 | Quantitative | ( |
| 217 | All post-SLNB: 58 positive SLN, 159 negative SLN | macrodissected FFPE sections | Prognostic accuracy of GEP and SLNB in predicting RFS, distant mets, and OS | NR | NR | N.R./N.R./50/82 | ( | ||||
| 205 | By stage: 2 I, 68 IA, 39 IB, 1 II, 40 IIA, 41 IIB, 14 IIC | Biopsy | Compare GEP to AJCC in predicting 5 yr RFS, distant mets, and OS | NR | NR | GEP+AJCC for Recurrence: 90/71/NR/NR, 3.10/0.14; Distant Mets: 88/63/NR/NR., 2.38/0.19; Death: 82/62/NR/NR, 2.16/0.29 | ( | ||||
| Gene Expression | RT-PCR | qRT-PCR for PAX3d to monitor recurrent in stage II-IV disease | 198 | 111 melanoma, 87 healthy controls | Plasma draw | Identify recurrence in stage II-IV disease with plasma PAX3d mRNA | NR | 0.823 | Stage II–III relapse: 67/75/67/75, 2.68/0.44; Stage IV relapse: 75/93/43/98, 10.7/0.27 | Quantitative | ( |
| Serum nucleic acid | Circulating tumor DNA | PCR of BRAF and NRAS mutants | 125 | 20 with progression, 9 with pseudo-progression | Plasma draw | Differentiate radiologic pseudo-progression and true progression | NR | NR | 90/100/100/82, >90/0.1 | Quantitative | ( |
| Serum nucleic acid | Circulating tumor DNA | BRAF V600mut ctDNA by qRT-PCR | 36 | 16 before therapy, 20 after therapy | Plasma draw | Monitor patient response to BRAF/MEK inhibition therapy with BRAF V600mut ctDNA | NR | NR | 70/100/NR/NR., >70/0.30 | Quantitative | ( |
AUC, area under (receiver operating characteristic) curve; CTC, circulating tumor cells; ctDNA, circulating tumor DNA; ELISA, enzyme-linked immunosorbent assay. FFPE, formalin-fixed paraffin embedded; GEP, gene expression panel; LR+, positive likelihood radio; LR–: negative likelihood radio; Mets, metastasis; NPV, negative predictive value; NR, not reported; OS, overall survival; PPV, positive predictive value; RFS, recurrence-free survival; S, sensitivity; qRT-PCR, quantitative real time polymerase chain reaction; SLNB, sentinel lymph node biopsy; Sp, specificity.
All LR values are calculated based on reported sensitivity and specificity.
Figure 5Utility of biomarkers to augment Breslow depth in determining which patients are offered sentinel lymph node biopsy (SLNB). (A) As depth of melanoma increases, from in situ to greater than one millimeter, the probability of sentinel lymph node positivity, and thereby stage and treatment selection, increases. Sentinel lymph node biopsy (SLNB) is positive in ~5% of patients with T1b melanoma, resulting in a decision threshold posterior odds of ~1:20 for many surgical oncologists to offer SLNB and reflecting NCCN guideline recommendations (90). As SLNB is not routinely offered for melanoma in situ or T1a melanomas, the incidence of SLNB positivity is unknown. We therefore estimate that these are substantially and progressively less positive as tumor thickness decreases. SLNB is standard of care for T2+ melanomas (gray dot) and thus no biomarker is likely to alter this recommendation without exceptionally low LR- test characteristics. (B) Currently patients with melanoma in situ are not generally offered SLNB since the likelihood of positivity (prior odds) is very low. A very high LR+, estimated at 25, would be necessary to recommend SLNB. (C) As the melanoma Breslow depth increases, the prior odds of sentinel lymph node metastasis increases (becomes closer to 1:1). Therefore, a lower LR+, such as LR+ ≥10, would be necessary to prompt SLNB. (D) Biomarkers may have the greatest utility for T1b malignant melanomas. Current NCCN guidelines recommend thoughtful consideration of SLNB based on patient-specific factors, and any test that could offer relatively small changes in certainty could be of great clinical utility. Thus, a modest LR+ of 2.0 or LR– of 0.5 may be sufficient for a clinically-actionable biomarker.