| Literature DB >> 32596059 |
Abstract
Alveolar soft-part sarcoma (ASPS) is an extremely rare malignancy characterized by the unbalanced translocation der(17)t(X;17)(p11;q25). This translocation generates a fusion protein, ASPL-TFE3, that drives pathogenesis through aberrant transcriptional activity. Although considerable progress has been made in identifying ASPS therapeutic vulnerabilities (e.g., MET inhibitors), basic research efforts are hampered by the lack of appropriate in vitro reagents with which to study the disease. In this report, previously unmined microarray data for the ASPS cell line, ASPS-1, was analyzed relative to the NCI sarcoma cell line panel. These data were combined with meta-analysis of pre-existing ASPS patient microarray and RNA-seq data to derive a platform-independent ASPS transcriptome. Results demonstrated that ASPS-1, in the context of the NCI sarcoma cell panel, had some similarities to normal mesenchymal cells and connective tissue sarcomas. The cell line was characterized by high relative expression of transcripts such as CRYAB, MT1G, GCSAML, and SV2B. Notably, ASPS-1 lacked mRNA expression of myogenesis-related factors MYF5, MYF6, MYOD1, MYOG, PAX3, and PAX7. Furthermore, ASPS-1 had a predicted mRNA surfaceome resembling an undifferentiated mesenchymal stromal cell through expression of GPNMB, CD9 (TSPAN29), CD26 (DPP4), CD49C (ITGA3), CD54 (ICAM1), CD63 (TSPAN30), CD68 (SCARD1), CD130 (IL6ST), CD146 (MCAM), CD147 (BSG), CD151 (SFA-1), CD166 (ALCAM), CD222 (IGF2R), CD230 (PRP), CD236 (GPC), CD243 (ABCB1), and CD325 (CDHN). Subsequent re-analysis of ASPS patient data generated a consensus expression profile with considerable overlap between studies. In common with ASPS-1, elevated expression was noted for CTSK, DPP4, GPNMB, INHBE, LOXL4, PSG9, SLC20A1, STS, SULT1C2, SV2B, and UPP1. Transcripts over-expressed only in ASPS patient samples included ABCB5, CYP17A1, HIF1A, MDK, P4HB, PRL, and PSAP. These observations are consistent with that expected for a mesenchymal progenitor cell with adipogenic, osteogenic, or chondrogenic potential. In summary, the consensus data generated in this study highlight the unique and highly conserved nature of the ASPS transcriptome. Although the ability of the ASPL-TFE3 fusion to perturb mRNA expression must be acknowledged, the prevailing ASPS transcriptome resembles that of a mesenchymal stromal progenitor. ©2020 Stockwin.Entities:
Keywords: ASPS; Genomics; Mesenchymal; Meta-analysis; Microarray; RNA-seq; Sarcoma; Stromal; Surfaceome; Transcriptome
Year: 2020 PMID: 32596059 PMCID: PMC7307565 DOI: 10.7717/peerj.9394
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Study design.
This study focuses on the analysis of four publicly available GEO gene-expression datasets. GSE68591 comprises exon level expression data for the NCI sarcoma cell line panel, which includes data for the ASPS cell line ASPS-1. The remaining three studies comprise transcriptomic data for ASPS patient samples. GSE13433 comprises Affymetrix U133 plus 2.0 microarray data from our initial gene expression study of seven ASPS patients. GSE32569 uses the same array platform to study ASPS patient sample responses to Cedirinib. Lastly, GSE54729 comprises Illumina HISeq 2000 RNA-seq data for ASPS patient samples generated as part of an ASPS mouse modeling study. These data were re-analyzed using appropriate controls in order to generate a consensus transcriptome and gain insights into ASPS pathobiology.
The top fifty over-expressed genes in ASPS-1 relative to the average of the NCI sarcoma cell line panel.
| CRYAB; FDXACB1 | 3391149 | crystallin alpha B; ferredoxin-fold anticodon binding domain containing 1 | 227.1 |
| MT1G | 3692999 | metallothionein 1G | 202.9 |
| C7orf69 | 3000905 | chromosome 7 open reading frame 69 | 202.0 |
| GCSAML | 2390102 | germinal center-associated, signaling and motility-like | 177.6 |
| SV2B | 3608638 | synaptic vesicle glycoprotein 2B | 132.6 |
| ADGRL4 | 2419432 | adhesion G protein-coupled receptor L4 | 129.9 |
| PLA2G7 | 2955827 | phospholipase A2, group VII | 93.5 |
| SULT1C2 | 2498911 | sulfotransferase family 1C member 2 | 91.3 |
| SLN | 3389954 | sarcolipin | 88.9 |
| CFAP61 | 3878972 | cilia and flagella associated protein 61 | 85.2 |
| PPEF1 | 3970714 | protein phosphatase, EF-hand calcium binding domain 1 | 82.4 |
| ACP5 | 3851072 | acid phosphatase 5, tartrate resistant | 79.8 |
| CD36 | 3010503 | CD36 molecule (thrombospondin receptor) | 79.1 |
| PPARGC1A | 2763550 | peroxisome proliferator-activated receptor gamma, coactivator 1 alpha | 78.9 |
| ASB11 | 4000485 | ankyrin repeat and SOCS box containing 11, E3 ubiquitin protein ligase | 78.1 |
| BMP5 | 2958172 | bone morphogenetic protein 5 | 74.9 |
| PRUNE2 | 3210616 | prune homolog 2 (Drosophila) | 72.3 |
| SUCNR1 | 2648098 | succinate receptor 1 | 66.5 |
| PSG9 | 3864286 | pregnancy specific beta-1-glycoprotein 9 | 66.0 |
| CKMT2 | 2818035 | creatine kinase, mitochondrial 2 (sarcomeric) | 61.7 |
| DPP4 | 2584018 | dipeptidyl-peptidase 4 | 61.2 |
| ABCB1 | 3060182 | ATP binding cassette subfamily B member 1 | 59.7 |
| SCIN | 2990404 | scinderin | 58.7 |
| FABP3 | 2404418 | fatty acid binding protein 3, muscle and heart | 55.8 |
| PRUNE2 | 3210497 | prune homolog 2 (Drosophila) | 55.7 |
| SLC27A2 | 3593575 | solute carrier family 27 (fatty acid transporter), member 2 | 54.7 |
| SLCO4C1 | 2869096 | solute carrier organic anion transporter family, member 4C1 | 51.9 |
| PSG11,5,4,2 | 3863929 | pregnancy specific beta-1-glycoprotein 11,5,4,2 | 50.4 |
| PRLR | 2853102 | prolactin receptor | 47.9 |
| NPY6R | 2830450 | neuropeptide Y receptor Y6 (pseudogene) | 44.6 |
| ANXA3 | 2732844 | annexin A3 | 43.2 |
| TRPC7 | 2876793 | transient receptor potential cation channel, subfamily C, member 7 | 42.9 |
| CD5L | 2439138 | CD5 molecule-like | 41.0 |
| AKR1C2 | 3274758 | aldo-keto reductase family 1, member C2 | 40.9 |
| GPNMB | 2992814 | glycoprotein (transmembrane) nmb | 40.3 |
| IL13RA2 | 4018729 | interleukin 13 receptor, alpha 2 | 39.1 |
| LRRC39 | 2425173 | leucine rich repeat containing 39 | 38.8 |
| CST1 | 3901361 | cystatin SN | 37.4 |
| CDH7 | 3792273 | cadherin 7, type 2 | 36.8 |
| DOK5 | 3889833 | docking protein 5 | 35.1 |
| SEPT3; WBP2NL | 3947227 | septin 3; WBP2 N-terminal like | 34.8 |
| GCNT3 | 3596147 | glucosaminyl (N-acetyl) transferase 3, mucin type | 34.8 |
| ENPP5 | 2955673 | ectonucleotide pyrophosphatase/phosphodiesterase 5 (putative) | 34.7 |
| DOK3 | 2888879 | docking protein 3 | 34.2 |
| LCP1 | 3512874 | lymphocyte cytosolic protein 1 (L-plastin) | 34.0 |
| CDA | 2324084 | cytidine deaminase | 33.4 |
| KLHL4 | 3983324 | kelch-like family member 4 | 33.3 |
| CTSK | 2434609 | cathepsin K | 31.5 |
| LIPC | 3595691 | lipase, hepatic | 31.0 |
| RPSA | 3827218 | ribosomal protein SA | 30.0 |
Figure 2Analysis of ASPS-1 transcriptomic data.
(A) Heirarchical clustering of the NCI sarcoma cell line panel exon array data (B) tissue or tumor derivation for cluster nearest ASPS-1. (C) GTEX tissue of origin analysis for top fifty ASPS-1 transcripts.
Figure 3Expression of myogenesis-related transcripts in ASPS-1 relative to the other sarcoma cell lines.
Cell lines in the NCI sarcoma panel were segregated according to disease type and the average of exon expression data plotted for transcripts encoding myogenesis-related transcription factors and muscle structural proteins. ASPS-1 data is shown first; then RMS, Rhabdomyosarcoma; RT, Rhabdoid tumor; LMS, Leiomyosarcoma; CS, Chondrosarcoma; FS, Fibrosarcoma; EWS, Ewings Sarcoma; OS, Osteosarcoma; SS, Synovial Sarcoma; US, Uterine Sarcoma; SNOS, Sarcoma not otherwise specified; GCS, Giant cell sarcoma; MNS, Malignant peripheral nerve sheath; LPS, Liposarcoma; SPS, Spindle cell sarcoma; EP, Epithelioid; NC, Normal cells; SkMc, skeletal muscle cells. For transcripts; MYF5, Myogenic Factor 5, MYF6, Myogenic Factor 6, MYOD1, Myogenic Differentiation 1, MYOG = Myogenin, PAX3, Paired Box 3, PAX7 = Paired Box 7. Structural proteins; DES, desmin, NEB, nebulin, TNNT1, Troponin T1 - Slow Skeletal Type, TRIM63, Tripartite Motif Containing 63, TTN, titin and MSTN, Myostatin. Transcripts evaluated but not shown included; EYA1, LBX1, MEF2B, MEOX2, MITF, MSX1, PITX1, SIM2, SIX1, SIX4, TFE3 and TFEB.
The top fifty upregulated transcripts generated from each of the four experiments utilizing ASPS patient data from GSE13433, GSE32569 and GSE54729.
| 205502_at | 639.7 | CYP17A1 | 207733_x_at | 608.6 | PSG9 | 211470_s_at | 573.8 | SULT1C2 | ENSG00000159871 | 100.9 | LYPD5 |
| 212992_at | 531.8 | AHNAK2 | 209594_x_at | 601.3 | PSG9 | 205502_at | 420.6 | CYP17A1 | ENSG00000183979 | 423.1 | NPB |
| 205445_at | 434.1 | PRL | 205445_at | 572.3 | PRL | 1554018_at | 375.0 | GPNMB | ENSG00000198203 | 237.6 | SULT1C2 |
| 240717_at | 419.0 | ABCB5 | 205502_at | 537.1 | CYP17A1 | 205342_s_at | 367.4 | SULT1C2 | ENSG00000148795 | 1700.4 | CYP17A1 |
| 205342_s_at | 338.7 | SULT1C2 | 1555786_s_at | 510.2 | LINC00520 | 210809_s_at | 365.1 | POSTN | ENSG00000146678 | 200.6 | IGFBP1 |
| 201850_at | 258.8 | CAPG | 206224_at | 467.6 | CST1 | 238720_at | 351.1 | LOC101927057 | ENSG00000172179 | 357.1 | PRL |
| 211470_s_at | 252.9 | SULT1C2 | 223572_at | 437.6 | HHATL | 210587_at | 341.2 | INHBE | ENSG00000169006 | 142.2 | NTSR2 |
| 238720_at | 248.1 | LOC101927057 | 240717_at | 346.4 | ABCB5 | 206214_at | 262.4 | PLA2G7 | ENSG00000101197 | 113.9 | BIRC7 |
| 1554018_at | 240.8 | GPNMB | 208555_x_at | 322.5 | CST2 | 212992_at | 259.6 | AHNAK2 | ENSG00000170369 | 345.4 | CST2 |
| 205302_at | 207.6 | IGFBP1 | 236972_at | 277.8 | TRIM63 | 1565162_s_a | 241.7 | MGST1 | ENSG00000100167 | 143.8 | SEPT3 |
| 204638_at | 198.2 | ACP5 | 1553663_a_at | 259.2 | NPB | 229831_at | 228.7 | CNTN3 | ENSG00000204632 | 396.4 | HLA-G |
| 206899_at | 170.8 | NTSR2 | 205302_at | 255.8 | IGFBP1 | 200832_s_at | 225.4 | SCD | ENSG00000146070 | 147.4 | PLA2G7 |
| 210587_at | 170.4 | INHBE | 221051_s_at | 200.5 | NMRK2 | 206899_at | 168.5 | NTSR2 | ENSG00000110492 | 313.0 | MDK |
| 212805_at | 157.6 | PRUNE2 | 206239_s_at | 192.8 | SPINK1 | 205302_at | 159.6 | IGFBP1 | ENSG00000227925 | 134.1 | LOC101929771 |
| 209875_s_at | 150.3 | SPP1 | 210587_at | 184.3 | INHBE | 227180_at | 157.3 | ELOVL7 | ENSG00000170373 | 455.2 | CST1 |
| 1557636_a_at | 145.1 | C7orf57 | 206899_at | 171.9 | NTSR2 | 219648_at | 149.3 | MREG | ENSG00000225328 | 118.5 | LINC01594 |
| 210809_s_at | 144.4 | POSTN | 205551_at | 169.9 | SV2B | 210397_at | 148.0 | DEFB1 | ENSG00000118785 | 260.1 | SPP1 |
| 221577_x_at | 134.0 | GDF15 | 219106_s_at | 149.9 | KLHL41 | 209875_s_at | 147.2 | SPP1 | ENSG00000139269 | 137.6 | INHBE |
| 221008_s_at | 133.2 | ETNPPL | 206799_at | 146.3 | SCGB1D2 | 557636_a_a | 145.3 | C7orf57 | ENSG00000102575 | 190.9 | ACP5 |
| 206214_at | 130.1 | PLA2G7 | 229052_at | 143.1 | ANKRD23 | 205825_at | 139.0 | PCSK1 | ENSG00000185518 | 114.7 | SV2B |
| 209035_at | 122.3 | MDK | 221008_s_at | 142.4 | ETNPPL | 219073_s_at | 138.2 | OSBPL10 | ENSG00000205336 | 129.2 | ADGRG1 |
| 202450_s_at | 121.7 | CTSK | 205342_s_at | 125.6 | SULT1C2 | 205343_at | 132.8 | SULT1C2 | ENSG00000185567 | 246.4 | AHNAK2 |
| 200832_s_at | 121.0 | SCD | 212805_at | 124.7 | PRUNE2 | 555778_a_a | 127.5 | POSTN | ENSG00000136235 | 2756.8 | GPNMB |
| 212806_at | 119.7 | PRUNE2 | 1564758_at | 115.7 | LOC643659 | 221008_s_at | 126.1 | ETNPPL | ENSG00000042493 | 172.1 | CAPG |
| 230067_at | 116.8 | FAM124A | 229831_at | 114.5 | CNTN3 | 231736_x_at | 116.1 | MGST1 | ENSG00000143387 | 1718.2 | CTSK |
| 208555_x_at | 116.8 | CST2 | 209738_x_at | 113.4 | PSG6 | 212805_at | 115.1 | PRUNE2 | ENSG00000030582 | 707.5 | GRN |
| 225275_at | 115.8 | EDIL3 | 233389_at | 107.5 | CFAP61 | 558378_a_a | 108.4 | AHNAK2 | ENSG00000107317 | 354.1 | PTGDS |
| 229831_at | 112.1 | CNTN3 | 212992_at | 106.9 | AHNAK2 | 218292_s_at | 98.4 | PRKAG2 | ENSG00000106617 | 161.8 | PRKAG2 |
| 227404_s_at | 111.2 | EGR1 | 233238_s_at | 106.0 | CTB-12O2.1 | 221577_x_at | 94.6 | GDF15 | ENSG00000216490 | 275.5 | IFI30 |
| 212841_s_at | 109.4 | PPFIBP2 | 1569072_s_at | 102.9 | ABCB5 | 218404_at | 94.0 | SNX10 | ENSG00000183696 | 183.0 | UPP1 |
| 216834_at | 108.8 | RGS1 | 206994_at | 87.9 | CST4 | 233748_x_at | 90.8 | PRKAG2 | ENSG00000110092 | 192.1 | CCND1 |
| 208792_s_at | 106.1 | CLU | 239205_s_at | 84.7 | CR1; CR1L | 224918_x_at | 89.1 | MGST1 | ENSG00000212443 | 410.2 | SNORA53 |
| 223362_s_at | 105.2 | SEPT3. | 217871_s_at | 84.7 | MIF | 212070_at | 86.0 | ADGRG1 | ENSG00000185585 | 105.1 | OLFML2A |
| 208791_at | 92.7 | CLU | 226086_at | 83.8 | SYT13 | 205551_at | 85.1 | SV2B | ENSG00000130203 | 285.1 | APOE |
| 1558846_at | 92.7 | PNLIPRP3 | 213175_s_at | 82.5 | SNRPB | 244444_at | 84.5 | PKD1L2 | ENSG00000111412 | 119.3 | C12orf49 |
| 230746_s_at | 92.3 | N/A | 221523_s_at | 81.9 | RRAGD | 208965_s_at | 83.6 | IFI16 | ENSG00000206503 | 1571.8 | HLA-A |
| 218292_s_at | 89.1 | PRKAG2 | 243167_at | 77.0 | ABCB5 | 208146_s_at | 82.8 | CPVL | ENSG00000106066 | 239.3 | CPVL |
| 1565162_s_at | 88.4 | MGST1 | 206372_at | 74.4 | MYF6 | 226847_at | 82.8 | FST | ENSG00000138131 | 108.9 | LOXL4 |
| 205825_at | 83.7 | PCSK1 | 209875_s_at | 70.5 | SPP1 | 223484_at | 80.3 | C15orf48 | ENSG00000118508 | 104.3 | RAB32 |
| 226372_at | 82.7 | CHST11 | 244444_at | 67.3 | PKD1L2 | 234983_at | 78.9 | C12orf49 | ENSG00000174080 | 454.1 | CTSF |
| 202503_s_at | 82.6 | KIAA0101 | 205862_at | 65.9 | GREB1 | 240717_at | 78.3 | ABCB5 | ENSG00000169116 | 207.1 | PARM1 |
| 205343_at | 82.0 | SULT1C2 | 222379_at | 65.8 | KCNE4 | 229177_at | 78.1 | C16orf89 | ENSG00000120885 | 187.0 | MIR6843 |
| 205551_at | 81.7 | SV2B | 1554371_at | 60.2 | PKD1L2 | 205844_at | 75.8 | VNN1 | ENSG00000214435 | 114.9 | AS3MT |
| 1569072_s_at | 81.5 | ABCB5 | 205825_at | 58.2 | PCSK1 | 238376_at | 75.5 | LOC100505564 | ENSG00000130208 | 134.3 | APOC1 |
| 227180_at | 79.8 | ELOVL7 | 222714_s_at | 55.6 | LACTB2 | 205445_at | 73.0 | PRL | ENSG00000100644 | 335.4 | HIF1A |
| 231736_x_at | 79.5 | MGST1 | 218619_s_at | 54.6 | SUV39H1 | 242340_at | 71.7 | N/A | ENSG00000135047 | 458.4 | CTSL |
| 202037_s_at | 76.5 | SFRP1 | 236523_at | 54.3 | LOC285556 | 204285_s_at | 71.3 | PMAIP1 | ENSG00000144136 | 134.6 | SLC20A1 |
| 219648_at | 74.5 | MREG | 1557636_a_at | 53.7 | C7orf57 | 204466_s_at | 71.3 | SNCA | ENSG00000101846 | 109.4 | STS |
| 206685_at | 71.8 | HCG4 | 212070_at | 52.9 | ADGRG1 | 203767_s_at | 70.7 | STS | ENSG00000111775 | 242.3 | COX6A1 |
| 210397_at | 71.0 | DEFB1 | 204830_x_at | 52.8 | PSG5 | 222872_x_at | 70.5 | NABP1 | ENSG00000089101 | 164.4 | CFAP61 |
Figure 4VENN diagram showing overlap between analyses.
Lists of over-expressed transcripts (five-fold cut off) were used to determine extent of overlap between the five datasets. The number of differentially expressed transcripts at five-fold is underlined. Callouts show the 25 transcripts over-expressed in all experiments and the 73 found in all ASPS patient analyses.