| Literature DB >> 25859847 |
Martin A Rijlaarsdam1, David M J Tax2, Ad J M Gillis1, Lambert C J Dorssers1, Devin C Koestler3, Jeroen de Ridder2, Leendert H J Looijenga1.
Abstract
The cell of origin of the five subtypes (I-V) of germ cell tumors (GCTs) are assumed to be germ cells from different maturation stages. This is (potentially) reflected in their methylation status as fetal maturing primordial germ cells are globally demethylated during migration from the yolk sac to the gonad. Imprinted regions are erased in the gonad and later become uniparentally imprinted according to fetal sex. Here, 91 GCTs (type I-IV) and four cell lines were profiled (Illumina's HumanMethylation450BeadChip). Data was pre-processed controlling for cross hybridization, SNPs, detection rate, probe-type bias and batch effects. The annotation was extended, covering snRNAs/microRNAs, repeat elements and imprinted regions. A Hidden Markov Model-based genome segmentation was devised to identify differentially methylated genomic regions. Methylation profiles allowed for separation of clusters of non-seminomas (type II), seminomas/dysgerminomas (type II), spermatocytic seminomas (type III) and teratomas/dermoid cysts (type I/IV). The seminomas, dysgerminomas and spermatocytic seminomas were globally hypomethylated, in line with previous reports and their demethylated precursor. Differential methylation and imprinting status between subtypes reflected their presumed cell of origin. Ovarian type I teratomas and dermoid cysts showed (partial) sex specific uniparental maternal imprinting. The spermatocytic seminomas showed uniparental paternal imprinting while testicular teratomas exhibited partial imprinting erasure. Somatic imprinting in type II GCTs might indicate a cell of origin after global demethylation but before imprinting erasure. This is earlier than previously described, but agrees with the totipotent/embryonic stem cell like potential of type II GCTs and their rare extra-gonadal localization. The results support the common origin of the type I teratomas and show strong similarity between ovarian type I teratomas and dermoid cysts. In conclusion, we identified specific and global methylation differences between GCT subtypes, providing insight into their developmental timing and underlying developmental biology. Data and extended annotation are deposited at GEO (GSE58538 and GPL18809).Entities:
Mesh:
Year: 2015 PMID: 25859847 PMCID: PMC4479500 DOI: 10.1371/journal.pone.0122146
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Tumor types/samples and cell lines analyzed and schematic visualization of genomic functional categories of interest.
(A) GCT subtypes in the hypothesized context of normal germ cell development as proposed in earlier studies (grey box). Developmental schemes are indicated in blue (male), red (female) or when possible in both sexes (white). DG does not originate from CIS but is indicated together with SE for reasons of consistency. (B) Samples included in this study. Abbreviations match Fig 1A and roman numbers indicate the GCT type to which the histological subtypes belongs. n indicates the number of tumor samples per group. All samples are from male patients except the DGs, DCs and a subset of the type I TEs. Please note that when only TE is denoted, this indicates the group of all type I TEs together. Otherwise II.TE (type II pure TE) or the abbreviations for specific localizations are used as indicated in this figure. Four GCT cell lines were included; tumor of origin between brackets. (C) Reference to (abbreviations of) the functional genomic regions as mentioned in the rest of the manuscript. Probes were classified according to their relation to gene coding regions, micro-RNA (MIR) coding regions, CpG islands and/or transposon elements (LINE/SINE). The distance to the transcription start site (TSS) was used in accordance with the Illumina manifest: 200 or 1500 bp. Of note, the TSSAssociated category contains all probes with a distance < 1500 bp to the TSS in contrast to the TSS1500 category from Illumina which is only contains probes 200-1500bp from the TSS. Probes within imprinting associated regions were classified as (1) mapped inside a known imprinting control region (ICR) or (2) either mapped inside an ICR or mapped close to the TSS of a transcript of an imprinted gene (200/1500bp upstream, not mutually exclusive). P/M indicates the expressed allele, i.e. paternal/maternal respectively. Numbers between brackets indicate the number of valid probes within each specific category (total number of valid probes: 437,881). *The visualization did not permit including the probe count for all categories. The counts for the empty categories are: 5’UTR = 59,338; ISLAND = 136,339; IMPR_P200 = 638; IMPR_P1500 = 1,659; IMPR_M200 = 610; IMPR_M1500 = 2,265.
Fig 2Methylation patterns in GCT subtypes and cell lines.
To illustrate differences in methylation status between histological GCT subtypes two (visualization) methods were applied. Firstly, the methylation pattern over the whole genome and specific functional categories (Fig 1C) is visualized using the distribution of the methylation percentage β in all samples of a certain GCT subtype. Next, the discriminatory power of the methylation pattern for each individual sample is shown using principal component analysis. (A) Distribution of methylation percentage. Violin plots: grey areas indicate a kernel density plot of the methylation percentage (β) of all probes in all samples in a certain category. The boxplot indicates the interquartile range (black bars) and median (white squares). X-axis labels indicate histological subgroup according to Fig 1A and 1B. TE indicates type I TE only. (B) Principal Component Analysis. The first two principal components (PC) are plotted to evaluate the discriminative power of the methylation pattern between the subtypes. Abbreviations of histological subtypes are explained in Fig 1A. CL indicates cell lines. Please note that in the legend of the PCA the TE group is subdivided based on gender and localization: I = type I; II = type II/formally part of the mNS group, s = sacrum, t = testis, o = ovary, m = male, f = female. A more detailed visualization of the TE classes is provided in S2 Fig, which also includes the full series of 18 functional categories, bootstrap validation of the PCA and an estimation of the variance explained by the first two principal components.
Pairwise comparison of GCT subtypes.
| A. Seminomatous (SE/DG) versus non-seminomatous (EC/mNS) GCTs | |
|---|---|
| DMProbes[ | DMProbes[ |
| 6 | 110,462 (25%) |
|
| ↓ in tr and ↑ in non-tr, LINE/SINE suggested global difference in methylation status rather than differential methylation of specific regulatory elements. CpG islands were ↓. miRs regions were weakly ↑. ICRs were ↓, suggesting no difference in imprinting status. |
| DMRegions[ | DMRegions[ |
| 0 |
|
| - |
|
|
| |
| DMProbes[ | DMProbes[ |
| 61 | 56,764 (13%) |
| significantly overrepresented on chromosome 12 (13/61) and preferentially located in the 5’UTR of genes and CpG islands | ↓on tr, CpG islands and ICRs and ↑of non-trs, transposons and miRs (≈DMPs[ |
| DMRegions[ | DMRegions[ |
| 3 |
|
|
|
|
|
| |
| DMProbes[ | DMProbes[ |
| 17,407 | 1,520 |
| Enrichments were weak, the strongest being ↓ in transposons. | ≈80% on X chromosome → differences in sex (TE = male+female, EC/mNS = male). ↑ in tr and ↓ in non-tr and transposon elements suggests differential methylation of specific regulatory elements. ↑ in CpG islands. |
| DMProbes[ | DMRegions[ |
| 580 (all autosomal) | 128 (15 autosomal) |
| e.g. | e.g. |
|
| |
| DMProbes[ | DMProbes[ |
| 15,340 | 2,830 |
| ↑ in non-tr and ↓in the tr. ↑ in ICR_P/IMPR_P200/1500 in line with paternal cell of origin of SS. | ↓ at non-tr and CpG islands. |
| DMProbes[ | DMRegions[ |
| 559 | 30 |
| e.g. hotspot genes like |
|
This table concisely summarizes the results of the search for differentially methylated (DM) probes (P) and differentially methylated regions (R) between pairs (A and B) GCT subtypes. Briefly, the number of DMPs and DMRs is shown separately for probes hypermethylated in A or B. The subtype in which the probes are hypermethylated is indicated in bold and underlined. Also, a brief interpretation of the genomic function of the DMPs is provided. For the DMRs the associated genes are discussed in the context of GCTs. (Abbreviations) ↓ significantly underrepresented; ↑ significantly overrepresented; % DMPs is calculated relative to the total number of valid probes (Materials and Methods section). tr = transcription regulation associated regions (TSS200/TSSAssociated/5’UTR/EXON1); non-tr = non transcription regulation associated gene coding regions (GENE.BODY/3’UTR). The other functional categories are depicted in Fig 1C. [global] = global methylation difference between subtypes; no distinguishable potential subtype specific differentially methylated regulatory elements. (Associated sources) Statistical procedures are described in the Materials and methods section. The overall methylation pattern of each histological subtype is visualized in Fig 2. Functional enrichment of DMPs is visualized in Fig 3. Details of enrichment calculations and raw counts and percentages are presented in Table S2. Enrichment of chromosomes is depicted more detailed in S3B Fig. DMRs, recurrent tumor DMR and DMPs are listed in S3 and S1 Tables respectively. DMRs are visualized in Fig 4, S4 Fig and GSE58538: Files S1 and S2.
Fig 3Functional enrichment of DMPs.
DMPs were classified according to their functional genomic location (Fig 1C). Statistical over- and underrepresentation of probes in certain categories provides clues to differences between GCT subtypes in regarding function of methylation. Enrichment was assessed by comparing the number of probes in a functional category in a subset of DMPs with the that in the total dataset (Fisher’s Exact test, see Materials & Methods section). Results are shown for four pairwise (A vs B) comparisons of histological subtypes: (A) SE/DG versus EC/mNS; (B) SE/DG vs type I TE; (C) EC/MNS vs type I TE and (D) SE/DG vs SS. (LEFT) The number (n) of DMPs identified in either the DMP[-B] (hypermethylated in A, green) or DMP[A-] (hypermethylated in B, red) group. (MIDDLE/RIGHT) Functional enrichment in the DMP[-B] and DMP[A-] group respectively. X-axis: positive numbers indicate a significant overrepresentation of DMPs in a functional category compared to non-DMPs while negative numbers indicate a significant underrepresentation. Depicted is the log2 ratio of (1) the % of either DMP group assigned to a category and (2) the % of non-DMPs assigned to that category. Only significant enrichments are depicted (2-sided Fisher’s Exact test, see Methods section for Bonferroni corrected α threshold). DMPs[se/dgvs].IMPR_P1500 showed significant underrepresentation, but could not be plotted on log scale (0 probes in DMP group). Details of calculations and raw counts and percentages are presented in S2 Table. Y-axis: functional categories as specified in Fig 1C.
Fig 4Methylation profile at GCT subtype specific differentially methylated regions (DMRs).
Visualization of the methylation percentage at specific loci is used to zoom in on a predefined region and investigate local methylation differences between GCT subtypes. (A) From top to bottom the following is depicted: (1) Four-color heat map indicating methylation % for each individual probe in the depicted region. For the sample groups specified on the left the median methylation % is shown. (2) Position of all probes in the region of interest (ROI) is annotated as black rectangles. (3) HMM segments are displayed as grey boxes spanning the segment’s width and grouped per state. Numbers indicate the state of each (group of) segment(s). (5) GC% was obtained from the UCSC genome browser database (gc5Base table). (6) Transcripts overlapping with the ROI are plotted at the bottom. Plot generated using the Gviz package. Abbreviations of histological subtypes are explained in Fig 1A. Please note that the TE group is subdivided based on gender and localization: I = type I; II = type II/formally part of the mNS group, s = sacrum, t = testis, o = ovary, m = male, f = female. CL indicates cell lines.
Fig 6Methylation of imprinting control regions and the X chromosome.
Analogous to Fig 2 the differences in methylation status between histological GCT subtypes is illustrated by two methods. Firstly, the methylation pattern is visualized using the distribution of the methylation percentage β. Next, the discriminatory power of the methylation pattern for each individual sample is shown using principal component analysis. (A) All probes associated with paternally expressed genes (ICR_P). (B) All probes associated with maternally expressed genes (ICR_M). (C) All probes located on the X chromosome. (D) Distribution of methylation in individual TE samples ordered by sex and localization. To compare type I and II TE the n = 3 type II pure TEs from the mNS were included in this visualization. Methylation levels of all probes, and probes associated with ICRs (P/M) and probes on the X chromosome are subsequently shown. (Distribution plots of methylation percentage.) Violin plots: grey areas indicate a kernel density plot of the methylation percentage (β) of all probes in all samples in a certain category. The boxplot indicates the interquartile range (black bars) and median (white squares). X-axis labels indicate histological subgroup according to Fig 1A and 1B. TE indicates type I TE only. (Principal Component Analysis.) The first two principal components (PC) are plotted to evaluate the discriminative power of the methylation pattern between the subtypes. Abbreviations of histological subtypes are explained in Fig 1A. CL indicates cell lines. Please note that in the legend of the PCA the TE group is subdivided based on gender and localization: I = type I; II = type II/formally part of the mNS group, s = sacrum, t = testis, o = ovary, m = male, f = female.
GCT (methylation) associated genes.
| Gene & region | Description |
|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Genomic locations and strand were retrieved from genecards.com/UCSC. Detailed visualizations of the methylation status of these genes is presented in Fig 5 and S5A Fig. DMRs in the cell lines are presented in GSE58538: File S2. ICRs are visualized in Fig 7 and S6 Fig.
Fig 5Methylation profile of GCT specific genes and regions of interest (ROIs).
Visualization of the methylation percentage at specific loci is used to zoom in on a predefined region and investigate local methylation differences between GCT subtypes. The genes are reviewed in Table 2. (A) From top to bottom the following is depicted: (1) Four-color heat map indicating methylation % for each individual probe in the depicted region. For the sample groups specified on the left the median methylation % is shown. (2) Position of all probes in the region of interest (ROI) is annotated as black rectangles. (3) HMM segments are displayed as grey boxes spanning the segment’s width and grouped per state. Numbers indicate the state of each (group of) segment(s). (5) GC% was obtained from the UCSC genome browser database (gc5Base table). (6) Transcripts overlapping with the ROI are plotted at the bottom. Plot generated using the Gviz package. Abbreviations of histological subtypes are explained in Fig 1A. Please note that the TE group is subdivided based on gender and localization: I = type I; II = type II/formally part of the mNS group, s = sacrum, t = testis, o = ovary, m = male, f = female. CL indicates cell lines.
Fig 7Methylation status of imprinting control regions.
Visualization of the methylation percentage at specific loci is used to zoom in on a predefined region and investigate local imprinting differences between GCT subtypes. Two illustrative regions are depicted. (A) ICR_P: . The overlapping H19 transcript is an aberrant, long alternative transcript (H19-012, ENST00000428066). This ICR regulates H19 and IGF2 expression and lies upstream all other transcripts of H19. The other ICRs are visualized in S6 Fig and listed in S4 Table. (Visualizations) From top to bottom the following is depicted: (1) Four-color heat map indicating methylation % for each individual probe in the depicted region. For the sample groups specified on the left the median methylation % is shown. (2) Position of all probes in the region of interest (ROI) is annotated as black rectangles. (3) HMM segments are displayed as grey boxes spanning the segment’s width and grouped per state. Numbers indicate the state of each (group of) segment(s). (5) GC% was obtained from the UCSC genome browser database (gc5Base table). (6) Transcripts overlapping with the ROI are plotted at the bottom. Plot generated using the Gviz package. Abbreviations of histological subtypes are explained in Fig 1A. Please note that the TE group is subdivided based on gender and localization: I = type I; II = type II/formally part of the mNS group, s = sacrum, t = testis, o = ovary, m = male, f = female. CL indicates cell lines.
Fig 8GCT methylation status in context of methylation during germ cell development.
The top and bottom line charts depict normal germ cell development in female and male respectively (stages specified in the middle black bar). Methylation status during normal germ cell development is depicted for the global genome, ICRs and chromosome X (see Discussion). Putative cells of origin of the various types of GCTs are indicated in the brown boxes. ICR_P/M = ICR regulating paternally/maternally expressed genes. Bimodal indicates a methylation pattern peaking 0 and 100% with the exception of SE/DG (between 0 and ≈50). The table (bottom) provides a summary of the results, mainly Figs 2 and 6. Abbreviations: pf = primordial follicle. Type I tumors are indicated with their type (I), sex (m = male, f = female) and location (s = sacral, t = testis, o = ovary). Other GCT subtypes are indicated with their type (I, II, IV) and the abbreviation of each histological class, which are explained in the main text. Gradient bars indicate percentages of methylation (0→100%, green-white-grey-red) analogous to the gradient used in the other figures.