| Literature DB >> 26046463 |
Ingrid Cifola1, Marta Lionetti2,3, Eva Pinatel1, Katia Todoerti4, Eleonora Mangano1, Alessandro Pietrelli1, Sonia Fabris2,3, Laura Mosca2,3, Vittorio Simeon4, Maria Teresa Petrucci5, Fortunato Morabito6, Massimo Offidani7, Francesco Di Raimondo8, Antonietta Falcone9, Tommaso Caravita10, Cristina Battaglia1,11, Gianluca De Bellis1, Antonio Palumbo12, Pellegrino Musto13, Antonino Neri2,3.
Abstract
Primary plasma cell leukemia (pPCL) is a rare and aggressive form of plasma cell dyscrasia and may represent a valid model for high-risk multiple myeloma (MM). To provide novel information concerning the mutational profile of this disease, we performed the whole-exome sequencing of a prospective series of 12 pPCL cases included in a Phase II multicenter clinical trial and previously characterized at clinical and molecular levels. We identified 1, 928 coding somatic non-silent variants on 1, 643 genes, with a mean of 166 variants per sample, and only few variants and genes recurrent in two or more samples. An excess of C > T transitions and the presence of two main mutational signatures (related to APOBEC over-activity and aging) occurring in different translocation groups were observed. We identified 14 candidate cancer driver genes, mainly involved in cell-matrix adhesion, cell cycle, genome stability, RNA metabolism and protein folding. Furthermore, integration of mutation data with copy number alteration profiles evidenced biallelically disrupted genes with potential tumor suppressor functions. Globally, cadherin/Wnt signaling, extracellular matrix and cell cycle checkpoint resulted the most affected functional pathways. Sequencing results were finally combined with gene expression data to better elucidate the biological relevance of mutated genes. This study represents the first whole-exome sequencing screen of pPCL and evidenced a remarkable genetic heterogeneity of mutational patterns. This may provide a contribution to the comprehension of the pathogenetic mechanisms associated with this aggressive form of PC dyscrasia and potentially with high-risk MM.Entities:
Keywords: multiple myeloma; mutations; plasma cell leukemia; whole-exome sequencing
Mesh:
Year: 2015 PMID: 26046463 PMCID: PMC4627327 DOI: 10.18632/oncotarget.4028
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Coding somatic non-silent variants identified in pPCL panel
| Sample | Somatic | Somatic sub-clonal | Total No. Non-syn SNVs | Total No. Indels | Total No. Somatic Variants | ||
|---|---|---|---|---|---|---|---|
| Non-syn SNVs | Indels | Non-syn SNVs | Indels | ||||
| PCL-016 | 148 | 4 | 17 | 0 | 165 | 4 | 169 |
| PCL-017 | 160 | 11 | 0 | 0 | 160 | 11 | 171 |
| PCL-018 | 25 | 2 | 9 | 1 | 34 | 3 | 37 |
| PCL-019 | 133 | 5 | 17 | 0 | 150 | 5 | 155 |
| PCL-020 | 195 | 15 | 0 | 0 | 195 | 15 | 210 |
| PCL-026 | 232 | 6 | 34 | 0 | 266 | 6 | 272 |
| PCL-027 | 156 | 16 | 0 | 0 | 156 | 16 | 172 |
| PCL-030 | 179 | 11 | 0 | 0 | 179 | 11 | 190 |
| PCL-032 | 176 | 8 | 0 | 0 | 176 | 8 | 184 |
| PCL-035 | 203 | 14 | 0 | 0 | 203 | 14 | 217 |
| PCL-036 | 146 | 16 | 0 | 0 | 146 | 16 | 162 |
| PCL-038 | 40 | 2 | 17 | 0 | 57 | 2 | 59 |
Figure 1Coding somatic variants identified in the 12 pPCL samples
A. Distribution of coding somatic non-silent variants according to amino acid changes or reading frameshifts. B. Classification of coding somatic SNVs (both syn and non-syn) by nucleotide change. Transitions are in green (C > T, T > C) and transversions are in blue (C > A, C > G, T > A, T > G). C. Mutational signatures in pPCL. Coding somatic SNVs identified in the 12 samples were classified according to nucleotide change and sequence context into the 96 possible mutated trinucleotides (listed horizontally according to the conventional order). Substitutions are displayed in different colors along the horizontal axis and number of mutations attributed to each type is shown on the vertical axis.
Figure 2Copy number analysis
Copy number alterations were identified starting from WES data by using EXCAVATOR software. The Circos plot represents somatic amplifications (red) and deletions (green) found in each case, distributed across all chromosomes. Samples are displayed according to increasing case number from outer to inner track. Colors of track borders indicate translocation groups: t(14;16) in red, t(11;14) in blue, t(14;20) in green, t(4;14) in orange. PCL-020 resulted as outlier with the greatest number of CNAs and a basal tetraploidy with regions till to five copies.
Figure 3Global amount of somatic alterations per sample
The numbers of coding somatic non-silent variants (non-synonymous SNVs and indels) and CNA regions (gains and losses) were summed up by sample, to have an overview of the global burden of somatic alterations carried by each case.
Statistically significant recurrently affected genes with potential driver role in pPCL
| Gene | Gene Name | No. Cases (%) | No. Variants | No. Non-syn SNVs | No. Indels | Function | |
|---|---|---|---|---|---|---|---|
| HLA-DQA1 | major histocompatibility complex, class II, DQ alpha 1 | 3 (25%) | 3 | - | 3 | 3.89E-09 | immune response |
| CIDEC | cell death-inducing DFFA-like effector c | 3 (25%) | 1 | 1 | - | 3.07E-05 | apoptosis |
| CELA1 | chymotrypsin-like elastase family, member 1 | 3 (25%) | 1 | - | 1 | 7.42E-05 | proteolysis, cell membrane and matrix organization |
| SRRM5 | serine/arginine repetitive matrix 5 | 3 (25%) | 2 | 2 | - | 7.89E-05 | unknown |
| CCDC144NL | coiled-coil domain containing 144 family, N-terminal like | 3 (25%) | 2 | - | 2 | 8.59E-05 | unknown |
| SPTB | spectrin, beta, erythrocytic | 3 (25%) | 5 | 5 | - | 0.00280 | cell membrane organization and stability |
| DIS3 | exosome catalytic subunit DIS3 | 3 (25%) | 3 | 3 | - | 0.00427 | RNA binding, RNA degradation |
| FAM166B | family with sequence similarity 166, member B | 2 (16%) | 1 | - | 1 | 0.01062 | unknown |
| RPL17 | ribosomal protein L17 | 2 (16%) | 2 | 1 | 1 | 0.01405 | RNA binding, translation |
| CMYA5 | cardiomyopathy associated 5 | 1 (8%) | 7 | 7 | - | 0.02032 | protein binding |
| UNC80 | unc-80 homolog | 4 (33%) | 4 | 4 | - | 0.02762 | ion transmembrane transport |
| SCN9A | sodium channel, voltage-gated, type IX, alpha subunit | 3 (25%) | 3 | 3 | - | 0.03008 | ion transmembrane transport |
| ZNF598 | zinc finger protein 598 | 3 (25%) | 1 | - | 1 | 0.03008 | translation repressor |
| KIF2B | kinesin family member 2B | 2 (16%) | 2 | 1 | 1 | 0.09976 | chromosome segregation, genome stability |
Note:
Identical recurrent variants found in unpaired pPCL samples annotated in both dbSNP and COSMIC catalogue and confirmed somatic in COSMIC website.
Pathways significantly enriched in mutated genes by ToppGene analysis
| Pathway | Source | FDR | No. Damaging/Total variants | Genes | |
|---|---|---|---|---|---|
| Cadherin signaling pathway | PantherDB (ID P00012) | 1.26E-07 | 3.07E-04 | 24/48 | PCDH15 |
| ECM-receptor interaction | BioSystems: KEGG (ID 83068) | 2.01E-06 | 2.45E-03 | 9/26 | CD36, SV2B, COL1A2 |
| Cell Cycle G2/M Checkpoint | MSigDB C2: BioCarta (ID M8560) | 1.25E-05 | 1.02E-02 | 13/20 | ATM, ATR, BRCA1, CDC25A, CDKN1A, PRKDC, EP300, CHEK2, RPS6KA1, TP53 |
| Wnt signaling pathway | PantherDB (ID P00057) | 2.17E-05 | 1.18E-02 | 36/66 | MYH13, PCDH15 |
| Extracellular matrix organization | BioSystems: REACTOME (ID 576262) | 2.50E-05 | 1.18E-02 | 22/51 | ACTN1, ACAN |
Note:
30 genes showing a cadherin domain according to significant enrichment by ToppGene analysis (InterPro IPR002126 domain, q-value 3.24E-08, FDR correction).
34 genes annotated as proteinaceous extracellular matrix (ECM) cellular components according to significant enrichment by ToppGene analysis (GO Cellular Component GO:0005578, q-value 4.62E-04, FDR correction).
Significant enrichment for damaging variants (q-value < 0.05, Bonferroni correction).
Figure 4Recurrence of the 11 multiple myeloma genes in our pPCL series with respect to MM published datasets
Recently published datasets for multiple myeloma include: Walker et al., Blood 2012 (22 cases, [20]), Lohr et al., Cancer Cell 2014 (203 cases, [17]) and Bolli et al., Nat. Commun. 2014 (67 cases, [16]). Recurrence rates were calculated as the percentage of positive samples on the total number of cases included in each study.
Figure 5Hierarchical clustering of the 55 MM-21pPCL-4N dataset based on the expression levels of the 132 most variable and mutated genes
Data were previously generated on GeneChip Human Gene 1.0 ST array platform [12]. Samples were labelled according to tumor type (MM, multiple myeloma; p, primary plasma cell leukemia), translocation/cyclin D (TC) classification and the main associated cytogenetic alterations. Significant sample clusters, according to major IgH translocations, are highlighted.