| Literature DB >> 29734672 |
Marko Vidak1, Ivana Jovcevska2, Neja Samec3, Alja Zottel4, Mirjana Liovic5, Damjana Rozman6, Saso Dzeroski7, Peter Juvan8, Radovan Komel9.
Abstract
Glioblastoma (GB) is the most aggressive brain malignancy. Although some potential glioblastoma biomarkers have already been identified, there is a lack of cell membrane-bound biomarkers capable of distinguishing brain tissue from glioblastoma and/or glioblastoma stem cells (GSC), which are responsible for the rapid post-operative tumor reoccurrence. In order to find new GB/GSC marker candidates that would be cell surface proteins (CSP), we have performed meta-analysis of genome-scale mRNA expression data from three data repositories (GEO, ArrayExpress and GLIOMASdb). The search yielded ten appropriate datasets, and three (GSE4290/GDS1962, GSE23806/GDS3885, and GLIOMASdb) were used for selection of new GB/GSC marker candidates, while the other seven (GSE4412/GDS1975, GSE4412/GDS1976, E-GEOD-52009, E-GEOD-68848, E-GEOD-16011, E-GEOD-4536, and E-GEOD-74571) were used for bioinformatic validation. The selection identified four new CSP-encoding candidate genes—CD276, FREM2, SPRY1, and SLC47A1—and the bioinformatic validation confirmed these findings. A review of the literature revealed that CD276 is not a novel candidate, while SLC47A1 had lower validation test scores than the other new candidates and was therefore not considered for experimental validation. This validation revealed that the expression of FREM2—but not SPRY1—is higher in glioblastoma cell lines when compared to non-malignant astrocytes. In addition, FREM2 gene and protein expression levels are higher in GB stem-like cell lines than in conventional glioblastoma cell lines. FREM2 is thus proposed as a novel GB biomarker and a putative biomarker of glioblastoma stem cells. Both FREM2 and SPRY1 are expressed on the surface of the GB cells, while SPRY1 alone was found overexpressed in the cytosol of non-malignant astrocytes.Entities:
Keywords: FREM2; SPRY1; biomarkers; cell surface; data repositories; experimental validation; glioblastoma; glioblastoma stem cells; meta-analysis
Mesh:
Substances:
Year: 2018 PMID: 29734672 PMCID: PMC5983642 DOI: 10.3390/ijms19051369
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Summary of dataset selection
| Database | GEO | ArrayExpress | GLIOMASdb | ||
|---|---|---|---|---|---|
| Query | »glioblastoma« | »glioblastoma, grade« | »glioblastoma, stem cells« | »glioblastoma, neurospheres« | / |
| Filters | »homo sapiens«, »DataSets« | »homo sapiens«, »RNA assay«, »array assay« | / | ||
| All datasets | 36 | 76 | 69 | 14 | 1 |
| Datasets > 20 samples | 11 | 58 | 17 | 3 | 1 |
| Rejected—Reason 1 | 5 | 35 | 11 | 1 | 0 |
| Rejected—Reason 2 | 2 | 16 | 3 | 2 | 0 |
| Rejected—Reason 3 | 0 | 4 | 1 | 0 | 0 |
| Appropriate datasets | 4 | 3 | 1 + 1 conditionally * | 0 | 1 |
| Appropriate datasets (list) | GSE4412/GDS1975 GSE4412/GDS1976 GSE4290/GDS1962 GSE23806/GDS3885 | E-GEOD-16011 E-GEOD-52009 E-GEOD-68848 | E-GEOD-4536 E-GEOD-74571 * | / | GLIOMASdb |
Reason 1—no relevant comparison possible (Comparisons relevant for selection of new GB/GSC marker candidates are: (1) glioblastoma multiforme tissue vs. non-malignant tumor tissue, (2) glioblastoma multiforme tissue vs. tissue from low-grade—i.e., grade II—gliomas, (3) glioblastoma multiforme tissue vs. tissue from oligodendrogliomas, (4) GSCL vs. ordinary glioblastoma multiforme tissue, (5) GSCL vs. CGCL, (6) neurospheres vs. ordinary glioblastoma multiforme tissue, (7) neurospheres vs. CGCL; the group in which elevated expression of GB/GSC markers is expected is marked in bold). Reason 2—relevant comparisons possible, but not with at least 10 samples in both compared groups. Reason 3—dataset is not original (duplicates or combinations of other GEO and ArrayExpress datasets). * two relevant comparisons possible: the first compares two groups with 9 samples each, the second compares a group of 10 samples with a group of 9 samples. Conditionally acceptable because this is the only found dataset that enables comparisons including glioblastoma neurospheres.
Ranking of appropriate datasets
| Dataset | Category | NPC | Rank | Curated | Use of Dataset |
|---|---|---|---|---|---|
| GSE4412/GDS1975 | Grade | 1 | 5 | yes | validation |
| GSE4412/GDS1976 | Grade | 1 | 5 | yes | validation |
| GSE4290/GDS1962 | Grade | 3 | 1 *** | yes | selection (winner) |
| E-GEOD-16011 | Grade | 2 | 3 | no | validation |
| E-GEOD-52009 | Grade | 2 | 3 | no | validation |
| E-GEOD-68848 | Grade | 3 | 1 *** | no | validation |
| GSE23806/GDS3885 | Neuro/GSCL | 4 | 1 | yes | selection (winner) |
| E-GEOD-4536 | Neuro/GSCL | 2 | 2 | no | validation |
| E-GEOD-74571 | Neuro/GSCL | 2 | 2 | no | validation |
| GLIOMASdb | Grade | 1 | / | yes | selection (directly qualified) |
Category: Grade—datasets with samples from gliomas of different grade; Neuro/GSCL—datasets with samples of neurospheres and/or GSCL; NPC = number of possible comparisons; Rank—NPC-based rank within category; *** If rank is equal, curated datasets have precedence.
Summary of statistical tests
| Stage | Test | First Group (Overexpression of GSC Marker Candidates Expected) | Second Group | DATA ORIGIN | |||
|---|---|---|---|---|---|---|---|
| Tissue/Cells | N | Tissue/Cells | N | Database | Dataset | ||
| Selection | 1 | GB | 77 | Non-malignant brain tissue | 23 | GEO | GSE4290/GDS1962 |
| 2 | GB | 77 | Grade II glioma | 45 | GEO | GSE4290/GDS1962 | |
| 3 | GB | 77 | oligodendroglioma | 50 | GEO | GSE4290/GDS1962 | |
| 4 | GSCL | 27 | GB tumor tissue | 12 | GEO | GSE23806/GDS3885 | |
| 5 | GSCL | 27 | Conventional GB cell lines | 32 | GEO | GSE23806/GDS3885 | |
| 6 | neurospheres | 17 | GB tumor tissue | 12 | GEO | GSE23806/GDS3885 | |
| 7 | neurospheres | 17 | Conventional GB cell lines | 32 | GEO | GSE23806/GDS3885 | |
| 8 | GB | 143 | Grade II glioma | 110 | GLIOMASdb | / | |
| Validation | v1-1 | GB | 228 | Non-malignant brain tissue | 28 | ArrayExpress | E-GEOD-68848 |
| v2-1 | GB | 24 | Grade II glioma | 63 | ArrayExpress | E-GEOD-52009 | |
| v2-2 | GB | 134 | Grade II glioma | 99 | ArrayExpress | E-GEOD-68848 | |
| v2-3 | GB | 159 | Grade II glioma | 24 | ArrayExpress | E-GEOD-16011 | |
| v3-1 | GB | 59 | oligodendroglioma | 11 | GEO | GSE4412/GDS1975 | |
| v3-2 | GB | 59 | oligodendroglioma | 11 | GEO | GSE4412/GDS1976 | |
| v3-3 | GB | 24 | oligodendroglioma | 13 | ArrayExpress | E-GEOD-52009 | |
| v3-4 | GB | 228 | oligodendroglioma | 67 | ArrayExpress | E-GEOD-68848 | |
| v3-5 | GB | 159 | oligodendroglioma | 52 | ArrayExpress | E-GEOD-16011 | |
| v4-1 | GSCL | 28 | GB tumor tissue | 17 | ArrayExpress | E-GEOD-4536 | |
| v5-1 | GSCL | 28 | Conventional GB cell lines | 26 | ArrayExpress | E-GEOD-4536 | |
| v6-1 | neurospheres | 9 | GB tumor tissue | 9 | ArrayExpress | E-GEOD-74571 | |
| v7-1 | neurospheres | 9 | Conventional GB cell lines | 10 | ArrayExpress | E-GEOD-74571 | |
N = number of samples, GB = glioblastoma.
Selection of new GB/GSC marker candidates—top 10 genes.
| Gene | Success Rate | CSP Status | Already Linked to Glioblastoma Carcinogenesis | Single Failed Test ( | Results of the Failed Test (Only for CSP Genes with 7/8 Success Rate) | |
|---|---|---|---|---|---|---|
| lfc > 0 | ||||||
|
|
|
|
| / | / | |
|
| 8/8 | no | no | none | / | / |
|
| 8/8 | no | yes [ | none | / | / |
|
| 7/8 | no | yes [ | Test 8 | / | / |
|
| 7/8 | no | no | Test 8 | / | / |
|
| 7/8 | no | no | Test 7 | / | / |
|
|
|
|
|
|
|
|
|
| 7/8 | no | yes [ | Test 3 | / | / |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Genes selected for further analysis are shown in bold.
Comparison of new GB/GSC marker candidates with established GSC marker candidates
| Test | Points | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| New Candidates | Established Candidates | Control | |||||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
| 1 | 4 | 4 | 4 | 3 | 3 | 4 | 4 | 3 | 3 | 4 | 0 | 0 | 4 | 4 | 0 |
| 2 | 4 | 4 | 4 | 4 | 3 | 4 | 4 | 0 | 3 | 4 | 0 | 0 | 0 | 0 | 0 |
| 3 | 4 | 4 | 3 | 4 | 3 | 4 | 4 | 0 | 3 | 4 | 0 | 0 | 0 | 0 | 0 |
| 4 | 4 | 4 | 4 | 4 | 0 | 0 | 4 | 3 | 0 | 3 | 0 | 4 | 0 | 0 | 0 |
| 5 | 4 | 4 | 4 | 4 | 3 | 4 | 0 | 3 | 3 | 0 | 0 | 0 | 3 | 0 | 0 |
| 6 | 4 | 3 | 4 | 4 | 0 | 0 | 3 | 3 | 0 | 1 | 0 | 3 | 0 | 0 | 3 |
| 7 | 4 | 4 | 4 | 4 | 3 | 4 | 0 | 3 | 3 | 0 | 0 | 0 | 0 | 3 | 0 |
| 8 | 4 | 4 | 4 | 4 | / | 4 | 4 | 0 | / | 4 | / | / | 0 | 0 | 0 |
| APPT | 4.000 | 3.875 | 3.875 | 3.875 | 2.143 | 3.000 | 2.875 | 1.875 | 2.143 | 2.500 | 0.000 | 1.000 | 0.875 | 0.875 | 0.375 |
APPT = average points per test (rounded to three decimals); 4 points: FDR-corrected p < 0.001 and lfc > 1 (highly significant and highly overexpressed); 3 points: FDR-corrected p < 0.05 and lfc > 0, while criteria for 4 points are not met (significant and overexpressed); 1 point: p < 0.05 and lfc > 0 (overexpressed but borderline non-significant after application of FDR correction); 0 points: either p > 0.05 or lfc < 0 (non-significant or not overexpressed); /: gene not featured on the platform or in the results table.
Results of bioinformatic validation tests
| Test | Points | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| New Candidates | Established Candidates | Control | |||||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
| v1-1 | 4 | 4 | 4 | 4 | 3 | 4 | 4 | 0 | 3 | 4 | 0 | 0 | 3 | 4 | 0 |
| v2-1 | 3 | / | 3 | 3 | 0 | 0 | 4 | 0 | 0 | 3 | 0 | 0 | 3 | 0 | 0 |
| v2-2 | 3 | 4 | 4 | 4 | 3 | 3 | 4 | 0 | 3 | 4 | 0 | 3 | 0 | 0 | 0 |
| v2-3 | 3 | / | 4 | 4 | 3 | 4 | 4 | 0 | 3 | 4 | 0 | 3 | / | / | / |
| v3-1 | / | / | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| v3-2 | 3 | 0 | 3 | / | / | / | / | 0 | 0 | 0 | / | / | / | / | 0 |
| v3-3 | 3 | / | 0 | 3 | 0 | 0 | 3 | 1 | 0 | 3 | 0 | 0 | 1 | 0 | 0 |
| v3-4 | 3 | 3 | 3 | 3 | 3 | 3 | 4 | 0 | 3 | 4 | 0 | 3 | 0 | 0 | 0 |
| v3-5 | 3 | / | 3 | 0 | 4 | 4 | 3 | 0 | 3 | 4 | 0 | 0 | / | / | / |
| v4-1 | 4 | 4 | 4 | 0 | 0 | 0 | 4 | 0 | 0 | 0 | 3 | 0 | 0 | 3 | 3 |
| v5-1 | 3 | 4 | 4 | 4 | 4 | 4 | 0 | 4 | 3 | 0 | 0 | 0 | 0 | 0 | 0 |
| v6-1 | 3 | 3 | 1 | 3 | 0 | 0 | 0 | 0 | 3 | 0 | 3 | 0 | 0 | 3 | 3 |
| v7-1 | 3 | 4 | 4 | 4 | 3 | 4 | 0 | 3 | 1 | 0 | 3 | 0 | 3 | 1 | 0 |
| APPT | 3.167 | 3.250 | 2.846 | 2.667 | 1.917 | 2.250 | 2.583 | 0.615 | 1.692 | 2.077 | 0.750 | 0.750 | 1.000 | 1.100 | 0.545 |
APPT = average points per test (rounded to three decimals). Legend from Table 4 also applies here.
Figure 1Statistical analysis of qRT-PCR results. Relative mRNA expression levels for FREM2 and SPRY1 for NCH, U251MG, U87MG, and NSC. Mean gene expression value is presented on the scatter plot (middle line) with error bars representing standard deviation (SD). x-axis: samples; y-axis: relative mRNA expression. * p ≤ 0.05, ** p < 0.01, *** p < 0.001, **** p < 0.0001. (A) Kruskal–Wallis test with Dunn’s correction: NCH vs. U87MG: FREM2—significant difference (p < 0.0001). NCH vs. NSC: FREM2—significant difference (p = 0.0489). U251MG vs. U87MG: SPRY1—significant difference (p = 0.0155). U251MG vs. NSC: SPRY1—significant difference (p = 0.0004). (B) Mann–Whitney test: NCH/U251MG/U87MG vs. NSC: SPRY1—significant difference (p = 0.0025). (C) Unpaired t-test with Welch’s correction: NCH vs. U251MG/U87MG: FREM2—significant difference (p = 0.0030). (D) Mann–Whitney test: NCH vs. NSC: FREM2—significant difference (p = 0.0181). NCH vs. NSC: SPRY1—significant difference (p = 0.0227).
Figure 2Statistical analysis of qRT-PCR results. Relative mRNA expression levels for FREM2 and SPRY1 for NCH, U251MG, U87MG, and astrocytes. Mean gene expression value is presented on the scatter plot (middle line) with error bars representing standard deviation (SD). x-axis: samples; y-axis: relative mRNA expression. * p ≤0.05, ** p < 0.01, *** p < 0.001, **** p < 0.0001. (A) Kruskal–Wallis test with Dunn’s correction: NCH vs. U87MG: FREM2—significant difference (p < 0.0001). NCH vs. astrocytes: FREM2—significant difference (p < 0.0001). U251MG vs. U87MG: FREM2—significant difference (p = 0.0295).One-way ANOVA multiple comparisons test: NCH vs. astrocytes: SPRY1—significant difference (p < 0.0001). U251MG vs. astrocytes: SPRY1—significant difference (p < 0.0001). U87MG vs. astrocytes: SPRY1—significant difference (p < 0.0001). (B) Unpaired t-test with Welch’s correction: NCH/U251MG/U87MG vs. astrocytes: FREM2—significant difference (p =0.0044).NCH/U251MG/U87MG vs. astrocytes: SPRY1—significant difference (p = 0.0036). (C) Mann–Whitney test: NCH vs. U251MG/U87MG: FREM2—significant difference (p < 0.0001). (D) Mann–Whitney test: NCH vs. astrocytes: FREM2—significant difference (p = 0.0002). Unpaired t-test with Welch’s correction: NCH vs. astrocytes: SPRY1—significant difference (p = 0.0036).
Figure 3(A) Images of immunofluorescent staining of FREM2: nuclei (left); FREM2 staining with anti-FREM2 antibodies (middle); merged presentation with original staining (right): nuclei (blue); FREM2 (red). White arrows indicate staining of FREM2 in cell membranes; (B) Images of immunofluorescent staining of SPRY1: nuclei (left); SPRY1 staining with anti-SPRY1 antibodies (middle); merged presentation with original staining (right): nuclei (blue); SPRY1 (red). White arrows indicate staining of SPRY1 in cell membranes.
Figure 4Western blot analysis. NCH—protein extract from glioblastoma stem-like cells, NCH cell line (NCH644 & NCH421K). U251, U87—protein extracts from (respectively) glioblastoma U251MG and U87MG cell lines. NSC—protein extract from neural stem cell cells. ASTROCYTES—protein extract from primary human astrocytes. Quantification of Western blotting for FREM2 and SPRY1 expression: relative band intensities were calculated as the ratios between the antigen band intensities for FREM2 and SPRY1 and those of the whole protein loaded on the gel (both measured in arbitrary units [AU]). GAPDH was used as loading control for both western blots. Statistical analysis of Western blot. Mean protein expression value is presented on the graph with error bars representing standard deviation (SD). x-axis: samples; y-axis: relative protein expression. * p ≤0.05, ** p < 0.01. Kruskal-Wallis test with Dunn’s correction: NCH vs. ASTROCYTES: FREM2—significant difference (p = 0.0013); U251 vs. ASTROCYTES: FREM2—significant difference (p = 0.0405); U251MG vs. U87MG: SPRY1—significant difference (p = 0.0027).