| Literature DB >> 35953478 |
Thomas R W Oliver1,2, Lia Chappell1, Rashesh Sanghvi1, Lauren Deighton1, Naser Ansari-Pour3,4, Stefan C Dentro1,5, Matthew D Young1, Tim H H Coorens1, Hyunchul Jung1, Tim Butler1, Matthew D C Neville1, Daniel Leongamornlert1, Mathijs A Sanders1,6, Yvette Hooks1, Alex Cagan1, Thomas J Mitchell1,2, Isidro Cortes-Ciriano5, Anne Y Warren2, David C Wedge3,7, Rakesh Heer8,9, Nicholas Coleman2,10, Matthew J Murray2,10, Peter J Campbell1, Raheleh Rahbari11, Sam Behjati12,13,14.
Abstract
Germ cell tumours (GCTs) are a collection of benign and malignant neoplasms derived from primordial germ cells. They are uniquely able to recapitulate embryonic and extraembryonic tissues, which carries prognostic and therapeutic significance. The developmental pathways underpinning GCT initiation and histogenesis are incompletely understood. Here, we study the relationship of histogenesis and clonal diversification in GCTs by analysing the genomes and transcriptomes of 547 microdissected histological units. We find no correlation between genomic and histological heterogeneity. However, we identify unifying features including the retention of fetal developmental transcripts across tissues, expression changes on chromosome 12p, and a conserved somatic evolutionary sequence of whole genome duplication followed by clonal diversification. While this pattern is preserved across all GCTs, the developmental timing of the duplication varies between prepubertal and postpubertal cases. In addition, tumours of younger children exhibit distinct substitution signatures which may lend themselves as potential biomarkers for risk stratification. Our findings portray the extensive diversification of GCT tissues and genetic subclones as randomly distributed, while identifying overarching transcriptional and genomic features.Entities:
Mesh:
Year: 2022 PMID: 35953478 PMCID: PMC9372159 DOI: 10.1038/s41467-022-31375-4
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 17.694
Fig. 1The mutational profile of GCTs.
a Overview of the experimental design, including micrographs from PD46269 illustrating different tumour histologies amenable to microdissection and sequencing (Supplementary Data 1). Gamma changes have been applied to these micrographs to help distinguish the histological features. G-TER, glandular teratoma; SM-TER, smooth muscle teratoma. Scale bar represents 250 microns. The total number of whole genomes includes both the microdissected (131) and bulk (7) samples. Note that for the mixed tumours PD42034 and PD45545 only one histology was available for microdissection and PD42569 was normal testis from a healthy donor. b Summary plot of key genomic data and relevant metadata pertaining to each GCT analysed (Supplementary Data 1–2, 9). Samples are ordered by patient and age. Each patient is labelled as having either a single bulk whole genome (B) or with the number of individual histological units microdissected for WGS (dark grey circle beneath each patient ID). See Methods for driver annotation. c Trinucleotide context plot of SBS-A. Source data are provided as a Source Data file.
Fig. 2Whole genome duplication timing and tumour diversification.
a Estimated burden of substitutions across the genome prior to the duplication event by age (Supplementary Data 12). Dashed red line is the fitted asymptotic regression with the grey ribbon indicating the 95% confidence intervals. The underlying equation is: pre-duplication substitution burden = −0.59 + 574.02 * e(−0.26 * age). b Bar plot comparing the prevalence and timing of WGD between our postpubertal GCTs (n = 10 tumours) and the tumours analysed by PCAWG (Supplementary Data 13)[16]. Tumour abbreviations used are as per the PCAWG studies (see Source data). c Pairwise comparison of the genetic and transcriptomic similarity of microdissections within GCTs where multiple tissues underwent DNA and mRNA sequencing (Supplementary Data 10) - PD43296 (n = 34 genomes, n = 60 transcriptomes), PD43298 (n = 26 genomes, n = 48 transcriptomes), PD45543 (n = 4 genomes, n = 6 transcriptomes), PD45544 (n = 9 genomes, n = 24 transcriptomes), PD46269 (n = 9 genomes, n = 37 transcriptomes). The p-values for the one-sided permutation test using label-swapping, comparing each tumour’s intra- and inter-histology genomic similarity are 0.355, 0.720, 0.429, 0.257, and <0.001 respectively. Using the same statistical test for the assessment of transcriptomic similarity, the p-values are <0.001, <0.001, 0.015, <0.001 and <0.001. The results are uncorrected for multiple hypothesis testing. Annotations for p-values: ns, not significant; *<0.05; ***<0.001. Where p-values are <0.001, no random permutation of the data for 1000 re-samples captures as large a difference between the two groups. Source data are provided as a Source Data file. d, e Histological images from example testicular d and ovarian e mixed GCTs that underwent extensive multiregional sampling, each annotated with the mutation clusters that define the phylogenetic relationship of each microbiopsy (Supplementary Data 10). Circles on the histological images correspond to a numbered mutation cluster in the associated phylogeny on the right-hand side. The number next to each cluster denotes the number of autosomal substitutions that support it. Circles are coloured to spatially highlight the clonal composition of each tumour. Each cluster is labelled with the number of microdissections for which it is the major clone and a list of the histologies the cluster pervades. MRCA, most recent common ancestor. These figures are simplified versions of the full phylogenies (Supplementary Fig. 6). f Subclonal chromothripsis of chromosome 17 in a prepubertal yolk sac tumour (Supplementary Data 4, 5). Reconstructed breakpoints are illustrated above the copy number calls. Scale bars represent 2.5 mm.
Fig. 3Pathways of GCT histogenesis.
a–c Comparison of the expression of marker genes in example GCT tissues and their corresponding fetal and adult counterparts, using reference single cell datasets[19–23]. Coloured dots denote the median and dashed lines the interquartile range for the underlying data points which represent the expression of a single gene per microbiopsy per tissue. Individual data points are plotted where the total data points supporting a point range are ≤10. The total number of microbiopsies supporting the data in each grey panel is provided in brackets within the figure. Source data are deposited on Mendeley. d–f Example volcano plots illustrating the differential expression analysis between embryonal carcinoma and more mature GCT tissues. The dashed lines represent the cut-offs for the log2 fold-change (>|2 | ) and adjusted p-value (<0.01, Benjamini-Hochberg correction) considered significant. Genes enriched in each differentiated tissue are shifted to the right. Marker genes used to define embryonic stem cells and each differentiated tissue are annotated on the plot. The full list of genes and tissues comparisons can be found in Supplementary Data 15.
Fig. 4The relationship between GCT genome and transcriptome.
a Heatmap showing gene enrichment per GCT tissue relative to healthy seminiferous tubules, binned by cytoband. Colours correspond to significance of enrichment according to the adjusted p-value (false discovery rate correction). The number next to each histology is the number of eligible microbiopsies that informs the analysis (Supplementary Data 16). b Combined plot of the chromosome 12 copy number changes across all invasive tumours and the rolling average log2 fold-change in gene expression compared with healthy seminiferous tubules. The window size for the rolling average is 50 genes. The average log2 fold-change in expression across 12p was significantly higher than across comparable numbers of genes found across regions with near baseline ploidy (one-sided permutation test, p < 1 × 10−5) (Supplementary Fig. 9), as indicated by the three asterisks. Source data are provided as a Source Data file.