| Literature DB >> 35216305 |
Alba Machado-Lopez1, Roberto Alonso1,2, Victor Lago3, Jorge Jimenez-Almazan2, Marta Garcia2, Javier Monleon4, Susana Lopez5, Francisco Barcelo6, Amparo Torroba7, Sebastian Ortiz8, Santiago Domingo3, Carlos Simon1,9,10, Aymara Mas1.
Abstract
The absence of standardized molecular profiling to differentiate uterine leiomyosarcomas versus leiomyomas represents a current diagnostic challenge. In this study, we aimed to search for a differential molecular signature for these myometrial tumors based on artificial intelligence. For this purpose, differential exome and transcriptome-wide research was performed on histologically confirmed leiomyomas (n = 52) and leiomyosarcomas (n = 44) to elucidate differences between and within these two entities. We identified a significantly higher tumor mutation burden in leiomyosarcomas vs. leiomyomas in terms of somatic single-nucleotide variants (171,863 vs. 81,152), indels (9491 vs. 4098), and copy number variants (8390 vs. 5376). Further, we discovered alterations in specific copy number variant regions that affect the expression of some tumor suppressor genes. A transcriptomic analysis revealed 489 differentially expressed genes between these two conditions, as well as structural rearrangements targeting ATRX and RAD51B. These results allowed us to develop a machine learning approach based on 19 differentially expressed genes that differentiate both tumor types with high sensitivity and specificity. Our findings provide a novel molecular signature for the diagnosis of leiomyoma and leiomyosarcoma, which could be helpful to complement the current morphological and immunohistochemical diagnosis and may lay the foundation for the future evaluation of malignancy risk.Entities:
Keywords: classification model; diagnostic/prognostic biomarkers; differential gene expression; exome/transcriptome; integrative analysis; leiomyoma; leiomyosarcoma; machine learning; mutational pattern
Mesh:
Year: 2022 PMID: 35216305 PMCID: PMC8877247 DOI: 10.3390/ijms23042190
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Epidemiological, demographic, and clinicopathological outcomes of 56 patients diagnosed with uterine leiomyoma (LM) and 47 patients with leiomyosarcoma (LMS) from the experimental cohort.
| Characteristics | LMS | LM | |
|---|---|---|---|
| Demographic variables | Age | ||
| ≤30 years | - | 2 (3.57%) | |
| 31–40 years | 7 (14.89%) | 17 (30.36%) | |
| 41–50 years | 11 (23.41%) | 33 (58.93%) | |
| 51–60 years | 20 (42.55%) | 2 (3.57%) | |
| ≥61 years | 9 (19.15%) | - | |
| Not available ( | - | 2 (3.57%) | |
| Median (years) | 53 | 44 | |
| Range (years | 35–75 | 28–55 | |
| Ethnicity | |||
| Caucasian | 36 (76.59%) | 41 (73.21%) | |
| African American | 1(2.13%) | 1 (1.79%) | |
| Latin | 4 (8.51%) | 6 (10.71%) | |
| Asian | 1(2.13%) | - | |
| Arabic | 1(2.13%) | - | |
| Not available | 4 (8.51%) | 8 (14.29%) | |
| Body mass index (kg/m2) | |||
| Median | 27.15 | 24 | |
| Range | 21.5–34.9 | 18.20–34.63 | |
| Not available ( | 21 | 13 | |
| Gynecologic background | Parity | ||
| Yes | 23 (48.94%) | 27 (48.21%) | |
| No | - | 1 (1.79%) | |
| Not available | 24 (51.06%) | 28 (50.00%) | |
| Miscarriage | |||
| Yes | 7 (14.89%) | 15 (26.79%) | |
| No | 16 (34.05%) | 13 (23.21%) | |
| Not available | 24 (51.06%) | 28 (50.00%) | |
| Menopausal status | |||
| Premenopausal | 15 (38.30%) | 46 (82.14%) | |
| Postmenopausal | 18 (31.91%) | 2 (3.57%) | |
| Not available | 14 (29.79%) | 8 (14.29%) | |
| Symptoms | Pelvic mass | ||
| Yes | 25 (53.19%) | 28 (50.00%) | |
| No | 7 (14.89%) | 20 (35.71%) | |
| Not available | 15 (31.92%) | 8 (14.29%) | |
| Abnormal uterine bleeding | |||
| Yes | 17 (36.17%) | 26 (46.43%) | |
| No | 11 (23.40%) | 21 (37.50%) | |
| Not available | 19 (40.43%) | 9 (16.07%) | |
| Abdominal pain | |||
| Yes | 16 (34.04%) | 14 (25.00%) | |
| No | 11 (23.41%) | 32 (57.14%) | |
| Not available | 20 (42.55%) | 10 (17.86%) | |
| Imaging | CT | ||
| Yes | 14 (29.79%) | 7 (12.50%) | |
| No | 16 (34.04%) | 42 (75.00%) | |
| Not available | 17 (36.17%) | 7 (12.50%) | |
| MRI | |||
| Yes | 5 (10.64%) | 5 (8.93%) | |
| No | 22 (46.81%) | 44 (78.57%) | |
| Not available | 20 (42.55%) | 7 (12.50%) | |
| Ultrasound | |||
| Yes | 31 (65.96%) | 49 (87.50%) | |
| No | - | - | |
| Not available | 16 (34.04%) | 7 (12.50%) | |
| Tumor size (cm) | |||
| Median | 13 | 7.4 | |
| Range | mar-24 | 0.25–25 | |
| Not available ( | 17 | 10 | |
| Suspected uterine sarcoma | |||
| Yes | 15 (31.91%) | 5 (8.93%) | |
| No | 15 (31.91%) | 44 (78.57%) | |
| NA | 17 (36.18%) | 7 (12.50%) | |
| Surgical treatment | Endometrial biopsy | ||
| Yes | 15 (31.91%) | 39 (69.64%) | |
| No | 17 (27.66%) | 10 (17.86%) | |
| Not available | 19 (40.43%) | 7 (12.50%) | |
| Primary surgery | |||
| Laparoscopic hysterectomy | 1 (2.13%) | 12 (21.43%) | |
| Laparoscopic myomectomy | - | 5 (8.93%) | |
| Laparotomic hysterectomy | 33 (70.21%) | 21 (37.50%) | |
| Laparotomic myomectomy | - | 11 (19.64%) | |
| Not available | 13 (27.66%) | 7 (12.50%) | |
| Clinical follow-up | Recurrence | ||
| Yes | 19 (40.43%) | 49 (87.50%) | |
| No | 9 (19.16%) | - | |
| Not available | 19 (40.43%) | 7 (12.50%) | |
| Status | |||
| Alive | 12 (25.53%) | 48 (85.71%) | |
| Deceased | 12 (25.53%) | - | |
| Not available | 23 (48.84%) | 8 (14.29%) | |
| Follow-up (months) | |||
| Median | 24 | - | |
| Range | 8–116 | - | |
| Not available ( | 29 | - |
Figure 1Comparative analysis of single-nucleotide variants (SNVs), insertions/deletions (indels), and mutational signatures for leiomyosarcoma (LMS) and leiomyoma (LM) samples. (A) Tumor profile of LMS-exclusive variants, including frequency and type of mutations. (B) Tumor profile of LM-exclusive variants, including frequency and type of mutations. In both cases, rows represent individual genes, while columns represent individual tumors. Bars illustrate the number of samples for each exclusive mutation. Types of mutations are annotated according to color. (C) Relative contribution of the indicated mutation types to the point mutation spectrum for each tumor type. Error bars indicate standard deviation over all samples. Total number of mutations for LM and LMS is indicated. (D) Relative contribution of each indicated trinucleotide changes to the two mutational signatures identified by non-negative matrix factorization (NMF) analysis. (E) Heatmap showing relative contribution of each mutational signature described in the COSMIC database for each sample.
Figure 2Comparative analysis of copy number variants (CNVs) in leiomyosarcoma (LMS) and leiomyoma (LM) samples and proximal effects from integrative analysis of CNVs and RNAseq data. (A) Distribution of CNVs per tumor type. (B) Genome-wide CNV distribution in LMS (left) and LM (right). In both cases, rows represent individual samples, while columns represent chromosomes. Types of CNVs are annotated by color, depending on if the deletion/duplication is detected in one sample (purple/blue) or two or more samples (green/yellow). (C) Kaplan–Meier plots showing the association between overall survival and alterations in at least 67% of the most frequent CNVs detected in LMS patients. (D) Heatmap of unsupervised hierarchical clustering based on the 370 genes affected with the most common CNVs related to patient outcome. (E) Proximal effects from the integrative analysis of CNVs and RNAseq data. Boxplots show a region’s expression (y-axis, log of normalized counts per million reads mapped) of genes regulated by the specific region (x-axis) and colored by copy number state, represented as loss (blue), normal (orange), and gain (red) in LMS (upper) and LM (lower) samples. ** p-adjusted value < 0.01; *** p-adjusted value < 0.001.
Figure 3Structural variant plots of chromosomal rearrangements in leiomyoma (LM) and leiomyosarcoma (LMS). (A) Bar plot showing the number of high-confidence fusions per LMS sample (upper). Bar plot and ideograms showing the most frequently affected chromosome regions in LMS samples (middle). Schematic representation of the gene sequence and functional protein domain for the most affected gene, ATRX, validated by immunohistochemistry (lower), using glioma biopsies as a positive control (right). Scale bar represents 75 µM (n = 3). (B) Bar plot showing the number of high-confidence fusions per LM sample (upper). Bar plots and ideograms showing the most frequently affected chromosome regions in LM samples (middle). Schematic representation of the gene sequence and functional protein domain for the most affected gene, RAD51B, validated by immunohistochemistry (lower) and using gallbladder as a positive control (right). Scale bar represents 75 µM (n = 3).
Figure 4Transcriptional analysis and validation of the targeted gene panel on leiomyoma (LM) and leiomyosarcoma (LMS). (A) Heatmap of hierarchical clustering. Top dendrogram shows clustering of samples, and left dendrogram shows clustering of all differentially expressed genes. Colors in the heatmap represent gene expression intensities, with blue indicating low expression and red indicating high expression. The bar on top of the heatmap represents the group by color (green = LMS; pink = LM). (B) Heatmap showing clustering of samples using the normalized coverage data for each of the 19 genes. (C) Class probabilities predicted by the model for the test set, with the “warning range” highlighted in light orange.