| Literature DB >> 34909744 |
Sarem Rashid1, Nikolai Klebanov2, William M Lin2, Hensin Tsao1,2.
Abstract
Pathogenic phenotypes in cutaneous melanoma have been vastly cataloged, although these classifications lack concordance and are confined to either morphological or molecular contexts. In this study, we perform unsupervised k-medoids clustering as a machine learning technique of 2,978 primary cutaneous melanomas at Mass General Brigham and apply this information to elucidate computer-defined subsets within the clinicopathologic domain. We identified five optimally separated clusters of melanoma that occupied two distinct clinicopathologic subspaces: a lower-grade partition associated with common or dysplastic nevi (i.e., nevus-associated melanomas) and a higher-grade partition lacking precursor lesions (i.e., de novo melanomas). Our model found de novo melanomas to be more mitogenic, more ulcerative, and thicker than nevus-associated melanomas, in addition to harboring previously unreported differences in radial and vertical growth phase status. The utilization of mixed clinicopathologic variables, reflective of actual clinical data contained in surgical pathology reports, has the potential to increase the biological relevance of existing melanoma classification schemes and facilitate the discovery of new genomic subtypes.Entities:
Keywords: AJCC, American Joint Committee on Cancer; CP, clinicopathologic; DNM, de novo melanoma; MGB, Mass General Brigham; NAM, nevus-associated melanoma; PL, precursor lesion; RC, replicate set cluster; RGP, radial growth phase; TC, training set cluster
Year: 2021 PMID: 34909744 PMCID: PMC8659382 DOI: 10.1016/j.xjidi.2021.100047
Source DB: PubMed Journal: JID Innov ISSN: 2667-0267
Baseline Summary of Randomized Melanoma Entries for Training and Replicate Data Sets
| Feature | Training Set | Replicate Set | |
|---|---|---|---|
| n | 1,979 | 999 | |
| Thickness (mm), median (IQR) | 1.14 (0.7–2.3) | 1.10 (0.67–2.08) | 0.255 |
| Mitoses (per mm2), median (IQR) | 2.00 (0.8–5.0) | 1.00 (0.0–4.0) | 0.523 |
| Anatomic (Clark) level | 0.101 | ||
| 2 | 92 | 51 | |
| 3 | 535 | 296 | |
| 4 | 1,219 | 604 | |
| 5 | 133 | 48 | |
| Ulceration | 0.793 | ||
| Absent | 1,636 | 822 | |
| Present | 343 | 177 | |
| RGP | 0.208 | ||
| Absent | 489 | 226 | |
| Present | 1,490 | 773 | |
| VGP | 0.658 | ||
| Absent | 19 | 14 | |
| Epithelioid | 1,428 | 713 | |
| Spindled | 114 | 52 | |
| Small | 39 | 21 | |
| Unspecified | |||
| PL | 0.779 | ||
| Absent | 1,504 | 758 | |
| Common | 234 | 111 | |
| Dysplastic | 220 | 116 | |
| Other | 21 | 14 | |
| TILs | 0.944 | ||
| Nonbrisk/absent | 1,829 | 924 | |
| Brisk | 150 | 75 | |
| VI | 0.867 | ||
| Absent | 1,801 | 911 | |
| Present | 178 | 88 | |
| Regression | 0.214 | ||
| Absent | 1,943 | 974 | |
| Present | 36 | 25 | |
| Sex | 0.235 | ||
| Female | 858 | 456 | |
| Male | 1,121 | 543 | |
| Age (y), median (IQR) | 66.00 (55.00–76.00) | 66.00 (54.00–76.00) | 0.611 |
| Month of diagnosis, median (IQR) | 7.0 (4.0–9.0) | 7.0 (4.0–9.0) | 0.729 |
| Predicted AJCC Stage | 0.424 | ||
| IA | 591 | 315 | |
| IB | 758 | 398 | |
| IIA | 292 | 129 | |
| IIB | 214 | 94 | |
| IIC | 124 | 63 |
Abbreviations: AJCC, American Joint Committee on Cancer; IQR, interquartile range; PL, precursor lesion; RGP, radial growth phase; TIL, tumor-infiltrating lymphocyte; VGP, vertical growth phase; VI, vascular invasion.
Statistical tests were performed by the Mann‒Whitney test for continuous variables and chi-square testing for categorical variables.
Other category includes the following precursor lesions: melanocytic hyperplasia, nevus of special sites, small dermal melanocytic nest, and lentigo maligna.
Figure 1Training set CP cluster comparisons and projected outcomes. (a) t-SNE, a computational method for visualizing high dimensional data, of k-medoids clustering assignments that demonstrates how the algorithm conceptualizes the data in CP space. (b) Individual feature comparisons show distinct organization on the basis of PL status. TC1 and 2 were largely comprised of NAMs and were differentiated by the presence of common or dysplastic lesions, respectively (P < 0.0001). Conversely, TC3–5 were made of DNMs. (c) Supercluster comparisons demonstrate significantly increased thickness, mitotic rate, and age in DNMs compared with those in NAMs (P < 0.0001). (d) To the left, we observe a greater proportion of IIA, IIB, and IIC melanomas in TC3–5 (P < 0.0001). To the right, the plot shows the highest prevalence of SSM for all TCs. ∗∗∗∗Significant associations with P < 0.0001. Abs, absent; AJCC, American Joint Committee on Cancer; ALM, acral lentiginous melanoma; CP, clinicopathologic; DN, dysplastic nevi; DNM, de novo melanoma; LM, lentigo maligna; NAM, nevus-associated melanoma; PL, precursor lesion; SSM, superficial spreading melanoma; t-SNE, t-distributed stochastic neighbor embedding; TC, training set cluster; TIL, tumor-infiltrating lymphocyte; VGP, vertical growth phase.
Summary of Clinicopathologic Features for Each Cluster Using the k-Medoids Algorithm (k = 5) on the Training Data Set (n = 1,979)
| Feature | TC1 | TC2 | TC3 | TC4 | TC5 | |
|---|---|---|---|---|---|---|
| n | 235 | 210 | 644 | 610 | 280 | |
| Thickness (mm), median (IQR) | 0.90 (0.60–1.39) | 0.76 (0.56–1.10) | 1.10 (0.70–2.00) | 1.10 (0.66–1.90) | 3.60 (2.25–5.89) | <0.0001 |
| Mitoses (per mm2), median (IQR) | 1.00 (0.00–2.00) | 1.00 (0.00–2.80) | 1.00 (0.00–3.00) | 2.00 (0.80–4.00) | 9.00 (5.80–14.90) | <0.0001 |
| Anatomic (Clark) level | <0.0001 | |||||
| 2 | 21 (9) | 13 (6.3) | 28 (4.6) | 30 (5.2) | 0 (0) | |
| 3 | 82 (34.9) | 87 (41.4) | 177 (27.5) | 162 (26.6) | 27 (9.6) | |
| 4 | 131 (55.7) | 108 (51.4) | 404 (62.7) | 383 (62.8) | 193 (68.9) | |
| 5 | 1 (0.4) | 2 (1) | 35 (5.4) | 35 (5.7) | 60 (21.4) | |
| Ulceration | <0.0001 | |||||
| Absent | 225 (95.7) | 195 (92.9) | 608 (94.4) | 549 (90) | 59 (21.1) | |
| Present | 10 (4.3) | 15 (7.1) | 36 (5.6) | 61 (10) | 221 (78.9) | |
| RGP | <0.0001 | |||||
| Absent | 33 (14) | 10 (4.8) | 99 (15.4) | 128 (21) | 219 (78.2) | |
| Present | 202 (86) | 200 (95.2) | 545 (84.6) | 482 (79) | 61 (21.8) | |
| VGP | <0.0001 | |||||
| Absent | 2 (0.9) | 4 (1.9) | 9 (1.4) | 4 (0.7) | 0 (0) | |
| Epithelioid | 199 (84.7) | 167 (79.5) | 442 (68.6) | 427 (70) | 193 (68.9) | |
| Spindled | 7 (3) | 9 (4.3) | 35 (5.4) | 46 (7.5) | 17 (6.1) | |
| Small | 7 (3) | 9 (4.3) | 11 (1.7) | 12 (2) | 0 (0) | |
| Unspecified | 20 (8.5) | 21 (10) | 147 (22.8) | 121 (19.8) | 70 (25) | |
| PL | <0.0001 | |||||
| Absent | 0 (0) | 0 (0) | 633 (98.3) | 599 (98.2) | 272 (97.1) | |
| Common | 230 (97.9) | 0 (0) | 0 (0) | 0 (0) | 4 (1.4) | |
| Dysplastic | 0 (0) | 210 (100) | 0 (0) | 6 (1) | 4 (1.4) | |
| Other | 5 (2.1) | 0 (0) | 11 (1.7) | 5 (0.8) | 0 (0) | |
| TILs | 0.0082 | |||||
| Nonbrisk/absent | 220 (93.6) | 182 (86.7) | 602 (93.5) | 560 (91.8) | 265 (94.6) | |
| Brisk | 15 (6.4) | 28 (13.3) | 42 (6.5) | 50 (8.2) | 15 (5.4) | |
| VI | <0.0001 | |||||
| Absent | 201 (85.5) | 186 (88.6) | 571 (88.7) | 569 (93.3) | 274 (97.9) | |
| Present | 34 (14.5) | 24 (11.4) | 73 (11.3) | 41 (6.7) | 6 (2.1) | |
| Regression | 0.0023 | |||||
| Absent | 233 (99.1) | 208 (99) | 637 (98.9) | 598 (98) | 267 (95.4) | |
| Present | 2 (0.9) | 2 (1) | 7 (1.1) | 12 (2) | 13 (4.6) | |
| Sex | <0.0001 | |||||
| Female | 104 (44.3) | 79 (37.6) | 28 (4.3) | 585 (95.9) | 62 (22.1) | |
| Male | 131 (55.7) | 131 (62.4) | 616 (95.7) | 25 (4.1) | 218 (77.9) | |
| Age (y), median (IQR) | 60.00 (47.00–73.00) | 62.50 (52.00–72.00) | 70.00 (59.00–78.00) | 62.00 (52.00–74.00) | 71.00 (59.00–80.00) | <0.0001 |
| Month of diagnosis, median (IQR) | 7.0 (4.0–9.0) | 7.0 (4.0–10.0) | 6.0 (4.0–9.0) | 7.0 (4.0–9.0) | 6.0 (4.0–9.0) | 0.233 |
Abbreviations: IQR, interquartile range; PL, precursor lesion; RGP, radial growth phase; TC, training set cluster; TIL, tumor-infiltrating lymphocyte; VGP, vertical growth phase; VI, vascular invasion.
Statistical tests were performed by Mood’s median test for continuous variables and by chi-square testing for categorical variables.
Other category includes the following precursor lesions: melanocytic hyperplasia, nevus of special sites, small dermal melanocytic nest, and lentigo maligna.
Figure 2Enrollment and attrition for patient entries in the clustering analysis. MIS, melanoma in situ; NA, not applicable.
Figure 3CP comparisons and projections for RCs. (a) Visualization of RC assignments using t-SNE. (b) Again, individual feature distributions showed precursor lesion status to be the strongest partition within the CP space. RC1–2 tumors tended to be NAMs, whereas RC3–5 tumors tended to be DNMs with increased thickness and mitotic rate. (c) RC3–5 demonstrated the highest proportion of stage IIC melanomas (P < 0.0001). SSM was the most common pathologic subtype across all replicate set clusters. ∗∗∗∗Significant associations with P < 0.0001. Abs, absent; AJCC, American Joint Committee on Cancer; ALM, acral lentiginous melanoma; CP, clinicopathologic; DN, dysplastic nevi; DNM, de novo melanoma; LM, lentigo maligna; NAM, nevus-associated melanoma; RC, replicate set cluster; SSM, superficial spreading melanoma; t-SNE, t-distributed stochastic neighbor embedding.
Summary of Clinicopathologic Features for Each Cluster Using the k-Medoids Algorithm (k = 5) on the Replicate Data Set (n = 999)
| Feature | TC1 | TC2 | TC3 | TC4 | TC5 | |
|---|---|---|---|---|---|---|
| n | 101 | 116 | 285 | 299 | 198 | |
| Thickness (mm), median (IQR) | 1.00 (0.61–1.69) | 0.69 (0.51–1.13) | 1.30 (0.80–2.50) | 1.10 (0.70–1.80) | 1.23 (0.70–2.77) | <0.0001 |
| Mitoses (per mm2), median (IQR) | 1.00 (0.00–2.00) | 1.00 (0.00–2.00) | 2.00 (0.80–6.00) | 1.00 (0.80–4.00) | 2.00 (0.00–5.00) | <0.0001 |
| Anatomic (Clark) level | <0.0001 | |||||
| 2 | 8 (8.2) | 14 (12.3) | 11 (4) | 8 (2.8) | 10 (5.5) | |
| 3 | 23 (22.8) | 64 (55.2) | 70 (24.6) | 94 (31.4) | 45 (22.7) | |
| 4 | 67 (66.3) | 36 (31) | 194 (68.1) | 180 (60.2) | 127 (64.1) | |
| 5 | 3 (3) | 2 (1.7) | 10 (3.5) | 17 (5.7) | 16 (8.1) | |
| Ulceration | <0.0001 | |||||
| Absent | 97 (96) | 111 (95.7) | 210 (73.7) | 245 (81.9) | 159 (80.3) | |
| Present | 4 (4) | 5 (4.3) | 75 (26.3) | 54 (18.1) | 39 (19.7) | |
| RGP | <0.0001 | |||||
| Absent | 19 (18.8) | 6 (5.2) | 74 (26) | 70 (23.4) | 57 (28.8) | |
| Present | 82 (81.2) | 110 (94.8) | 211 (74) | 229 (76.6) | 141 (71.2) | |
| VGP | <0.0001 | |||||
| Absent | 3 (3) | 3 (2.6) | 0 (0) | 2 (0.7) | 7 (3.5) | |
| Epithelioid | 86 (85.1) | 100 (86.2) | 285 (100) | 242 (80.9) | 0 (0) | |
| Spindled | 3 (3) | 3 (2.6) | 0 (0) | 7 (2.3) | 39 (19.7) | |
| Small | 1 (1) | 3 (2.6) | 0 (0) | 4 (1.3) | 13 (6.6) | |
| Unspecified | 8 (7.9) | 7 (6) | 0 (0) | 44 (14.7) | 139 (70.2) | |
| PL | <0.0001 | |||||
| Absent | 0 (0) | 4 (3.4) | 283 (99.3) | 290 (97) | 181 (91.4) | |
| Common | 90 (89.1) | 13 (11.2) | 1 (0.4) | 0 (0) | 7 (3.5) | |
| Dysplastic | 7 (6.9) | 96 (82.8) | 0 (0) | 6 (2) | 7 (3.5) | |
| Other | 4 (4) | 3 (2.6) | 1 (0.4) | 3 (1) | 3 (1.5) | |
| TILs | 0.0057 | |||||
| Nonbrisk/Absent | 95 (94.1) | 105 (90.5) | 269 (94.4) | 264 (88.3) | 191 (96.5) | |
| Brisk | 6 (5.9) | 11 (9.5) | 16 (5.6) | 35 (11.7) | 7 (3.5) | |
| VI | 0.2833 | |||||
| Absent | 90 (89.1) | 103 (88.8) | 255 (89.5) | 280 (93.6) | 183 (92.4) | |
| Present | 11 (10.9) | 13 (11.2) | 30 (10.5) | 19 (6.4) | 15 (7.6) | |
| Regression | 0.9428 | |||||
| Absent | 99 (98) | 114 (98.3) | 277 (97.2) | 292 (97.7) | 192 (97) | |
| Present | 2 (2) | 2 (1.7) | 8 (2.8) | 7 (2.3) | 6 (3) | |
| Sex | <0.0001 | |||||
| Female | 36 (35.6) | 80 (69) | 269 (94.4) | 10 (3.3) | 148 (74.7) | |
| Male | 65 (64.4) | 36 (31) | 16 (5.6) | 289 (96.7) | 50 (25.3) | |
| Age (y), median (IQR) | 65.00 (50.00–73.00) | 62.00 (49.50–70.75) | 71.00 (59.50–80.00) | 60 (49.00–70.00) | 70.50 (63.00–78.00) | <0.0001 |
| Month of diagnosis, median (IQR) | 8.0 (4.0–10.0) | 5.0 (3.0–9.0) | 6.0 (4.0–9.0) | 7.0 (4.0–9.0) | 6.0 (4.0–9.0) | 0.178 |
Abbreviations: IQR, interquartile range; PL, precursor lesion; RGP, radial growth phase; TC, training set cluster; TIL, tumor-infiltrating lymphocyte; VGP, vertical growth phase; VI, vascular invasion.
Statistical tests were performed by Mood’s median test for continuous variables and by chi-square testing for categorical variables.
Other category includes the following precursor lesions: melanocytic hyperplasia, nevus of special sites, small dermal melanocytic nest, and lentigo maligna.