| Literature DB >> 28542306 |
Marta Martínez1, Carlos O S Sorzano1,2, Alberto Pascual-Montano1, Jose M Carazo1.
Abstract
Benign neurofibromas, the main phenotypic manifestations of the rare neurological disorder neurofibromatosis type 1, degenerate to malignant tumors associated to poor prognosis in about 10% of patients. Despite efforts in the field of (epi)genomics, the lack of prognostic biomarkers with which to predict disease evolution frustrates the adoption of appropriate early therapeutic measures. To identify potential biomarkers of malignant neurofibroma transformation, we integrated four human experimental studies and one for mouse, using a gene score-based meta-analysis method, from which we obtained a score-ranked signature of 579 genes. Genes with the highest absolute scores were classified as promising disease biomarkers. By grouping genes with similar neurofibromatosis-related profiles, we derived panels of potential biomarkers. The addition of promoter methylation data to gene profiles indicated a panel of genes probably silenced by hypermethylation. To identify possible therapeutic treatments, we used the gene signature to query drug expression databases. Trichostatin A and other histone deacetylase inhibitors, as well as cantharidin and tamoxifen, were retrieved as putative therapeutic means to reverse the aberrant regulation that drives to malignant cell proliferation and metastasis. This in silico prediction corroborated reported experimental results that suggested the inclusion of these compounds in clinical trials. This experimental validation supported the suitability of the meta-analysis method used to integrate several sources of public genomic information, and the reliability of the gene signature associated to the malignant evolution of neurofibromas to generate working hypotheses for prognostic and drug-responsive biomarkers or therapeutic measures, thus showing the potential of this in silico approach for biomarker discovery.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28542306 PMCID: PMC5443557 DOI: 10.1371/journal.pone.0178316
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Prisma flow diagram.
Microarray studies selected from public databases and included in the MPNST vs. NF meta-analysis.
| Tissue | Organism | NF vs. control | MPNST vs. control | MPNST vs. NF | Reference | Platform | Probe | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Accession | Samples | Accession | Samples | Accession | Samples | ||||||||
| NF | control | MPNST | control | MPNST | NF | ||||||||
| Cell | Human | GSE14038 | 22 | 10 | GSE14038 | 13 | 10 | GSE14038 | 22 | 10 | [ | Affy U133 Plus 2.0 | 54675 |
| GSE39764 | 3 | 3 | [ | Agilent-014850 4x44K | 45015 | ||||||||
| Nerve | Human | GSE41747 | 26 | 3 | GSE41747 | 6 | 3 | [ | Affy U133A | 22283 | |||
| [ | Agilent 011521 G4110A | 19061 | |||||||||||
| [ | Affy U133 Plus 2.0 | 54675 | |||||||||||
| [ | ABI Version 2 | 33025 | |||||||||||
| Mouse | GSE41747 | 15 | 15 | GSE41747 | 18 | 15 | [ | Affy 430 2.0 | 45101 | ||||
1Accession number from GEO database.
2Accession number from ArrayExpress database.
3Data included in GSE14038 accession.
Fig 2General schema of the integration of studies to generate the MPNST vs. NF gene signature and its processing by NFFinder.
The upper panel shows the five microarray studies used to compute the MPNST vs. NF gene signature by meta-analysis (central panel). Score values calculated for each gene in each study (sij) and for each signature gene (si) are shown next to rectangles that indicate the respective gene size effect or fold change (FCij and FCi); red for upregulated genes and green for downregulated ones. The lower panel describes the results obtained from NFFinder when GEO and CMap-DrugMatrix databases are interrogated for direct or inverse matching of gene expression patterns, respectively, by using the signature gene names as input. Experimental validation should verify the hypotheses generated by this in-silico predictive workflow.
List of 20 genes with the highest and the lowest scores of the MPNST vs. NF signature.
| ENSGENE | hgnc_symbol | chrom_name | band | score | logFC | logFC_m | studies | mouse | BD_ratio |
|---|---|---|---|---|---|---|---|---|---|
| ENSG00000171848 | RRM2 | 2 | p25.1 | 3.06 | 4.19 | 4.47 | 5 | 1 | 13.68 |
| ENSG00000131747 | TOP2A | 17 | q21.2 | 2.98 | 3.73 | 3.94 | 5 | 1 | 10.01 |
| ENSG00000166803 | KIAA0101 | 15 | q22.31 | 2.80 | 4.02 | 4.08 | 5 | 1 | 8.80 |
| ENSG00000089685 | BIRC5 | 17 | q25.3 | 2.80 | 3.13 | 3.22 | 5 | 1 | 5.24 |
| ENSG00000137804 | NUSAP1 | 15 | q15.1 | 2.59 | 3.67 | 3.75 | 5 | 1 | 16.50 |
| ENSG00000185686 | PRAME | 22 | q11.22 | 2.50 | 3.43 | 3.65 | 4 | 0 | 7.08 |
| ENSG00000149948 | HMGA2 | 12 | q14.3 | 2.44 | 3.54 | 3.57 | 5 | 1 | 21.31 |
| ENSG00000117724 | CENPF | 1 | q41 | 2.43 | 3.08 | 3.20 | 5 | 1 | 9.38 |
| ENSG00000198901 | PRC1 | 15 | q26.1 | 2.43 | 3.05 | 3.14 | 5 | 1 | 16.71 |
| ENSG00000157456 | CCNB2 | 15 | q22.2 | 2.34 | 2.99 | 3.11 | 5 | 1 | 14.17 |
| ENSG00000128045 | RASL11B | 4 | q12 | 2.19 | 3.39 | 3.81 | 4 | 0 | 17.65 |
| ENSG00000156076 | WIF1 | 12 | q14.3 | 2.17 | 4.70 | 4.99 | 4 | 0 | 22.10 |
| ENSG00000134057 | CCNB1 | 5 | q13.2 | 2.16 | 2.54 | 2.48 | 5 | 1 | 20.67 |
| ENSG00000066279 | ASPM | 1 | q31.3 | 2.15 | 2.93 | 3.02 | 5 | 1 | 11.88 |
| ENSG00000088325 | TPX2 | 20 | q11.21 | 2.14 | 2.96 | 3.19 | 5 | 1 | 23.98 |
| ENSG00000123975 | CKS2 | 9 | q22.2 | 2.07 | 2.21 | 2.30 | 5 | 1 | 17.55 |
| ENSG00000143476 | DTL | 1 | q32.3 | 2.07 | 3.20 | 3.30 | 5 | 1 | 26.58 |
| ENSG00000170312 | CDK1 | 10 | q21.2 | 2.01 | 3.12 | 3.17 | 4 | 1 | 17.50 |
| ENSG00000007062 | PROM1 | 4 | p15.32 | 1.99 | 4.19 | 4.54 | 4 | 0 | 12.37 |
| ENSG00000115163 | CENPA | 2 | p23.3 | 1.94 | 2.77 | 2.93 | 5 | 1 | 18.19 |
| ENSG00000021300 | PLEKHB1 | 11 | q13.4 | -1.66 | -2.96 | -3.13 | 5 | 1 | 35.86 |
| ENSG00000109846 | CRYAB | 11 | q23.1 | -1.68 | -3.13 | -3.20 | 5 | 1 | 38.39 |
| ENSG00000100146 | SOX10 | 22 | q13.1 | -1.74 | -2.81 | -2.93 | 5 | 1 | 35.15 |
| ENSG00000100307 | CBX7 | 22 | q13.1 | -1.79 | -2.33 | -2.37 | 5 | 1 | 18.08 |
| ENSG00000149218 | ENDOD1 | 11 | q21 | -1.82 | -2.40 | -2.44 | 4 | 1 | 18.78 |
| ENSG00000197766 | CFD | 19 | p13.3 | -1.87 | -2.98 | -2.53 | 5 | 1 | 43.75 |
| ENSG00000148180 | GSN | 9 | q33.2 | -1.90 | -2.60 | -2.58 | 5 | 1 | 7.053 |
| ENSG00000134121 | CHL1 | 3 | p26.3 | -1.94 | -3.63 | -3.36 | 5 | 1 | 32.93 |
| ENSG00000172005 | MAL | 2 | q11.1 | -1.97 | -3.05 | -3.24 | 5 | 1 | 16.49 |
| ENSG00000174944 | P2RY14 | 3 | q25.1 | -2.01 | -3.39 | -3.47 | 5 | 1 | 35.95 |
| ENSG00000108381 | ASPA | 17 | p13.2 | -2.06 | -2.29 | -2.40 | 5 | 1 | 31.63 |
| ENSG00000168477 | TNXB | 6 | p21.32 | -2.07 | -3.13 | -3.34 | 5 | 1 | 26.58 |
| ENSG00000127951 | FGL2 | 7 | q11.23 | -2.08 | -3.92 | -3.75 | 5 | 1 | 27.43 |
| ENSG00000196616 | ADH1B | 4 | q23 | -2.08 | -3.86 | -3.88 | 5 | 1 | 38.97 |
| ENSG00000071991 | CDH19 | 18 | q22.1 | -2.15 | -4.21 | -4.01 | 5 | 1 | 33.78 |
| ENSG00000147588 | PMP2 | 8 | q21.13 | -2.33 | -4.39 | -4.80 | 5 | 1 | 14.94 |
| ENSG00000171819 | ANGPTL7 | 1 | p36.22 | -2.60 | -3.85 | -4.12 | 5 | 1 | 18.30 |
| ENSG00000148671 | ADIRF | 10 | q23.2 | -2.66 | -3.81 | -4.00 | 4 | 0 | 3.22 |
| ENSG00000160307 | S100B | 21 | q22.3 | -3.22 | -4.31 | -4.38 | 5 | 1 | 11.91 |
| ENSG00000107317 | PTGDS | 9 | q34.3 | -3.47 | -4.89 | -4.85 | 5 | 1 | 7.88 |
1The complete MPNST vs. NF gene signature of 579 genes, embedded in the unfiltered list of 7064 genes, is bold-highlighted in Table A in S2 Table.
2ENSEMBL gene ID.
3Gene symbol from HUGO Gene Nomenclature Committee.
4Human chromosome name.
5Median of logFCij computed for each gene across the studies.
6Median of logFC_mij computed for each gene across the studies.
7Number of studies included in the MPNST vs. NF meta-analysis.
8Inclusion (1) or exclusion (0) of mouse data in MPNST vs. NF meta-analysis. Exclusion may be due to the absence of mouse data or because current data differed from human data.
9Bhattacharyya distance ratio.
Fig 3Chromosome distribution of the MPNST vs. NF gene signature.
The gene signature distribution was calculated from the 336 genes with positive score (a), and from the 243 genes with negative score (b). Bar diagrams compare the observed distribution of MPNST vs. NF gene percentage in the human chromosome arms (dark bars) with the expected distribution according to the human ENSEMBL database (light bars). Statistical significance of the gene signature over-represented chromosome arms is above the bars. Over-represented human chromosome bands in the MPNST vs. NF gene signature are shown below each chart. Their statistical significance is shown at the top right side of band names. (****) P(X≥x) < 0.0001, (***) 0.0001< P(X≥x) < 0.001, (**) 0.001< P(X≥x) < 0.01, (*) 0.01< P(X≥x) < 0.05.
Fig 4Clustering of phenotype comparisons and of NF1-related genes.
Comparisons among MPNST, NF and control phenotypes were grouped through hierarchical clustering. Cell culture and nerve tumor comparisons are shown on the top of the upper dendrogram, and the node height scale is detailed on the left of this tree. The hierarchical relationship among gene clusters obtained by grouping the logFC_m values of 2209 NF1-related genes by the Self-Organizing Tree Algorithm (SOTA) is represented by the dendrogram on the left. Clusters are described by their SOTA centroid vectors. Color scale of logFC_m values is shown below. The right side of the Fig details the number of genes in each cluster, the gene number of the MPNST vs. NF signature in each cluster (*, > 15 genes), the percentage of genes of this signature in each cluster (grey scale shown below), the number of biological process GO terms over-represented in each cluster, and the summary of the GO term enrichment as functional characterization of clusters. A complete list of terms is shown in S9 Table.
MPNST vs. NF signature genes potentially down-regulated by promoter hypermethylation.
| ENSGENE | hgnc | ch | band | NF vs. control | MPNST vs. control | MPNST vs. NF | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Cell cultures | Nerve tumors | Cell cultures | Nerve tumors | Cell cultures | Nerve tumors | ||||||||||||||
| score | logFC | score | logFC | score | logFC | score | logFC | score | logFC | score | logFC | ||||||||
| ENSG00000068028 | RASSF1 | 3 | p21.31 | 0 | -0.36 | -0.08 | -0.47 | -0.07 | -0.91 | 0 | -1.07 | 3.35 | 1.64e-03 | -0.03 | -0.70 | -0.09 | -0.70 | 2.77 | 1.30e-02 |
| ENSG00000159200 | RCAN1 | 21 | q22.12 | 0 | -0.93 | 0 | 0.73 | -0.76 | -3.18 | 0 | -1.11 | 2.99 | 1.88e-07 | -0.19 | -2.37 | -0.64 | -2.08 | 2.41 | 1.28e-02 |
| ENSG00000197971 | MBP | 18 | q23 | -0.56 | -4.38 | -1.28 | -4.28 | -1.69 | -5.89 | -1.54 | -7.13 | 3.80 | 6.60e-59 | -0.02 | -2.99 | -0.69 | -2.96 | 3.41 | 1.06e-53 |
| ENSG00000175785 | PRIMA1 | 14 | q32.12 | 0 | -2.20 | 0 | -0.32 | -0.58 | -2.75 | -0.67 | -2.40 | 3.09 | 5.52e-07 | 0 | -1.43 | -0.71 | -2.43 | 2.87 | 4.01e-05 |
| ENSG00000152061 | RAGAP1L | 1 | q25.1 | 0 | 0.50 | 0 | -0.40 | 0.06 | 1.11 | -0.24 | -1.94 | 2.55 | 5.11e-06 | 0 | 0.86 | -0.73 | -1.54 | 2.24 | 1.74e-05 |
| ENSG00000213088 | ACKR1 | 1 | q23.2 | 0 | 0.07 | 0 | -0.77 | 0 | -0.02 | -0.78 | -3.15 | 2.03 | 2.30e-04 | 0 | 0 | -1.00 | -2.84 | 3.33 | 1.04e-02 |
| ENSG00000158887 | MPZ | 1 | q23.3 | 0 | -1.83 | 0 | -1.45 | -1.22 | -4.66 | -1.01 | -5.09 | 4.37 | 1.95e-08 | -0.70 | -5.51 | -1.11 | -2.49 | 2.27 | 3.46e-03 |
| ENSG00000070731 | ST6GALNAC2 | 17 | q25.1 | -0.18 | -1.97 | -0.25 | -1.26 | -1.33 | -4.69 | -0.69 | -2.93 | 2.74 | 4.47e-06 | -0.47 | -3.54 | -1.56 | -2.81 | 2.01 | 1.31e-02 |
| ENSG00000132470 | ITGB4 | 17 | q25.1 | 0 | 0.02 | 0 | -0.62 | 0 | 0.16 | -0.55 | -2.29 | 2.16 | 4.77e-02 | 0 | -0.12 | -1.62 | -3.13 | 1.90 | 6.70e-05 |
| ENSG00000160307 | S100B | 21 | q22.3 | 0 | -1.40 | 0 | -0.72 | -1.81 | -8.33 | -1.06 | -4.94 | 2.37 | 6.82e-04 | -0.86 | -7.66 | -3.22 | -4.31 | 2.56 | 2.02e-02 |
1Data extracted and ordered from gene list showed in Table J in S2 Table. The complete list of genes potentially down-regulated by hypermethylation in MPNST vs. NF and vs. control comparisons is shown in S10 Table.
2ENSEMBL gene ID.
3Gene symbol from HUGO Gene Nomenclature Committee.
4Human chromosome name.
5Median of logFCij computed for each gene across the studies comparing two phenotypes.
edgeR.logFC and edgeR.adj.pval computed by the analysis of promoter methylation. NA values of edgeR.logFC and edgeR.adj.pval were omitted for NF vs. control comparison.
human
chromosome arms/bands was calculated considering the association of ENSEMBL gene IDs to human chromosome arms/bands. To assess correlation between values of two vectors, Pearson's product-moment correlation was calculated using the cor.test function from the R stats package.