Literature DB >> 26343385

Virtual microdissection identifies distinct tumor- and stroma-specific subtypes of pancreatic ductal adenocarcinoma.

Richard A Moffitt1, Raoud Marayati1, Elizabeth L Flate1, Keith E Volmar2, S Gabriela Herrera Loeza1, Katherine A Hoadley1,3, Naim U Rashid1, Lindsay A Williams1,4, Samuel C Eaton5, Alexander H Chung5, Jadwiga K Smyla1, Judy M Anderson6, Hong Jin Kim1,7, David J Bentrem8,9, Mark S Talamonti10, Christine A Iacobuzio-Donahue11, Michael A Hollingsworth6, Jen Jen Yeh1,5,7.   

Abstract

Pancreatic ductal adenocarcinoma (PDAC) remains a lethal disease with a 5-year survival rate of 4%. A key hallmark of PDAC is extensive stromal involvement, which makes capturing precise tumor-specific molecular information difficult. Here we have overcome this problem by applying blind source separation to a diverse collection of PDAC gene expression microarray data, including data from primary tumor, metastatic and normal samples. By digitally separating tumor, stromal and normal gene expression, we have identified and validated two tumor subtypes, including a 'basal-like' subtype that has worse outcome and is molecularly similar to basal tumors in bladder and breast cancers. Furthermore, we define 'normal' and 'activated' stromal subtypes, which are independently prognostic. Our results provide new insights into the molecular composition of PDAC, which may be used to tailor therapies or provide decision support in a clinical setting where the choice and timing of therapies are critical.

Entities:  

Mesh:

Year:  2015        PMID: 26343385      PMCID: PMC4912058          DOI: 10.1038/ng.3398

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


Rigorous sequencing studies have shown that few genetic alterations (KRAS, CDKN2A, SMAD4, and TP53) are prevalent in PDAC[1-3], but these and other analyses of PDAC tumors are hampered by limited tumor cellularity and the presence of abundant stroma intermixed with normal endocrine and exocrine cells. Additionally, metastatic samples often include cell types from the host organ. Thus, PDAC tumors are complex mixtures in which the malignant epithelial cells often represent a minority of the tissue compartment (Fig. 1a). Illustrating this limitation, an important study of exome and copy number in pancreatic cancer removed 43 of 142 patients due to low tumor cellularity affecting the sensitivity of mutation detection[1]. While some studies use microdissection to improve tumor content[4-7], microdissection for precision medicine approaches is not yet feasible. When considering gene expression of bulk tumors, normal pancreas and PDAC tissues often cluster together, separate from cell lines which are assumed to be purely neoplastic[8].
Figure 1

Successful Deconvolution of Normal Tissue with NMF. (a) Cartoon depicting the major cell types in primary tumor and liver metastasis samples. (b) (above) Overlap of sample types (solid colors) with factor weights (grayscale heat maps), and (below) heat maps of five exemplar genes for all tumors and adjacent normal tissues. Gene expression shown in the heat map has been Z-normalized. (c) Box and whiskers plots showing median, quartiles, and range comparing NMF factor weights across tissue types and corresponding t-test result. (d) Percent tumor cellularity versus NMF liver factor weight, and NMF basal tumor factor weight for metastases to the liver and adjacent liver samples. Linear regression lines are shown in red along with corresponding statistics.

Separating molecular signatures of tissue compartments from the measurements of bulk tumor belongs to the general class of problems called blind source separation. Previous studies have used samples of chronic pancreatitis, which is often fibrotic, to control for the presence of desmoplastic stroma in tumor samples[9]. In prostate cancer, Stuart et al. have used pathology assessments of cell types to train models of gene expression signatures of tumor, stroma, and normal tissue[10]. In a follow up study, they used their learned gene lists for in silico estimation of tissue components in a larger set of data[11]. A similar approach has also been used to quantify stromal content across multiple data sets from the cancer genome atlas (TCGA)[12]. Among source separation techniques, nonnegative matrix factorization (NMF) is especially well suited for biological data, because it constrains all sources to be positive in nature, reflecting the goal of identifying positive gene expression exemplars, rather than pairwise differences between tissue types. Briefly, we define NMF as modeling the matrix X of expression for g genes and s samples, as the product of a matrix G of g gene weights for k factors and a matrix S of s sample weights for k factors. Alexandrov et al. have recently demonstrated that NMF is useful for a similar problem of identifying mutational signatures from the aggregate list of somatic mutations in human cancer samples[13,14]. Similarly, Biton et al. have applied a related technique, Independent component analysis, to examine gene expression in bladder cancer[15]. In this study, we have overcome the challenges of bulk tumor analysis where signal is averaged out between normal, tumor and stroma compartments, by using NMF to perform a virtual microdissection of primary and metastatic PDAC samples. This has allowed us to identify tumor-specific and stroma-specific subtypes with prognostic and biologic relevance. In addition, by focusing on tumor autonomous gene expression, we found that intra-patient tumor heterogeneity between primary and metastatic sites was unexpectedly low.

Results

Virtual microdissection of PDAC

We used NMF to analyze gene expression in a cohort of microarray data from 145 primary and 61 metastatic PDAC tumors, 17 cell lines, 46 pancreas and 88 distant site adjacent normal samples using Agilent (Agilent Technologies) human whole genome 4x44K DNA microarrays (106 primary tumors were previously used in a separate bulk analysis of gene expression (GSE21501[16]). To validate our findings, RNA sequencing was performed on 15 primary tumors, 37 pancreatic cancer patient-derived xenografts (PDX), 3 cell lines, and 6 cancer associated fibroblast (CAF) lines derived from deidentified patients with pancreatic cancer. Histology of all available samples was reviewed by a single blinded pathologist (KEV). Table 1 summarizes the demographic and clinical characteristics of patients in our cohorts.
Table 1

Demographics and Univariate Cox analysis

AllResected with SurvivalUnivariate Cox p-valueMicroarray PrimaryRNAseq PrimaryRNAseq PDX
RaceCaucasian1281210.50799925
African American23180.3331038
Other870.821503

GenderF90830.34867523
M80680.34855814

T StageT1440.420212
T222200.5302025
T31311220.74391928
T4110.115100

N StageN049430.06836710
N11121060.06880525

M StageM01601491151235
M130201

Adjuvant TherapyYes74700.05544521
No30280.0552737

DifferentiationWell16130.9401601
Moderate49470.3984913
Poor34310.4073412

PDXGraft Success44370.16411837
Graft Failure18120.164930

MarginPositive58520.02634517
Negative93880.02675717

Total1931631431537

NMF distinguishes normal and tumor compartments

A key obstacle in the analysis of gene expression data, particularly in PDAC, is the removal of confounding normal or stroma gene expression from local and distant organ sites. Supplementary Figure 1 shows example histology of samples with both tumor, normal, and stromal tissue. We used NMF to identify gene expression which we attribute to normal pancreas, liver, lung, muscle, and immune tissues. Expression of exemplar genes from these factors, i.e. genes with distinctly large weights in a single column of G, as well as factor weights for the samples, i.e. rows of S, showed excellent agreement with known tissue labels (Fig. 1b, c, Supplementary Fig. 2). Investigation of the exemplar genes from these factors further confirmed their role as confounding normal tissue. For example, using the Kolmogorov–Smirnov test, the top-weighted genes from the liver factor show significant (p<10-10) enrichment in the MSigDB term SU_LIVER (Supplementary table 1), and the highest weighted gene, fibrinogen beta (FGB) (Supplementary table 2), is specifically expressed in normal human liver tissue. In addition to normal tissue from distant organs, we found two factors which were exclusive to pancreas tissue, but were differentiated from each other by their respective gene lists. One factor described endocrine function including expression of glucagon and insulin (GCG and INS) (Supplementary table 2), while the other factor described exocrine function including expression of digestive enzyme genes such as pancreatic lipase, PNLIP (Supplementary table 2). This unsupervised discovery of two molecularly distinct yet highly co-localized factors related to normal pancreatic function represents an important proof of concept in the use of NMF to identify novel features without pre-defined expression knowledge. To validate our normal expression signatures, all available samples were reviewed by a single pathologist (KEV) to independently assess the amount of tumor, normal, and stroma cellularity. We found that many factor weights were correlated or anti-correlated to tumor cellularity (Supplementary Fig. 3). Among normal and metastatic liver samples, for example, tumor-specific factor weights were correlated with cellularity whereas the normal-specific liver factor weight was inversely related to the tumor content of a sample (Fig. 1d). These findings support our hypothesis that factor weights obtained from NMF are quantitatively indicative of underlying sample composition.

Identification of stroma-specific subtypes

Stroma is particularly important in PDAC. According to pathology assessments, stroma varies (Supplementary Fig. 1 c-e), and comprises on average 48% of our primary tumor samples with a standard deviation of 30%. Our analysis identified two factors, which describe gene expression from the stroma, which were distinctly different from the normal factors shown in Figure 1. Consensus clustering on exemplar genes from these two stroma factors divided tumor samples into two stromal subtypes, which we classified as “normal”, and “activated” (Fig. 2a). Patients with samples with an activated stroma subtype had worse median survival of 15 months and 60% 1-year survival when compared to patients with a normal stroma subtype (median 24 months, 1-year 82%, Fig. 2b). Both were notably absent in PDAC cell lines (Fig. 2c), which exhibited a distinct mitotic expression signature associated with mitotic checkpoints and DNA replication (Supplementary table 1)[17]. The fact that cell lines do not express these stromal factors, and that many metastatic samples express them at low levels (Supplementary Fig. 4), suggest that these genes are not expressed by the tumor epithelium. To further validate the stromal origin of these gene expression signatures, we isolated 6 CAF lines from primary tumors (Supplementary Fig. 5), and found that they robustly overexpressed our hypothesized stromal signatures compared to PDAC tumor cell lines which had no expression of the stromal signatures (Fig. 2c).
Figure 2

Dual action of stroma is described by distinct gene expression patterns which are not expressed in cell lines. (a) Consensus clustered heat map of UNC primary tumor samples, metastases, and cell lines using genes from stromal factors. Samples clustered into 3 groups, describing samples with activated stroma, normal stroma, and samples with low or absent stromal gene expression. (b) Kaplan-Meier survival analysis of resected PDAC patients from the activated and normal stromal clusters shows that samples in the activated stroma group have worse prognosis, with a hazard ratio of 1.94 (CI = [1.11,3.37], p = 0.019). (c) Gene expression of stromal signatures are overexpressed in CAFs as compared to tumor cell lines. (d) Genes from both stromal signatures are specifically overexpressed by the mouse stroma in PDX tumors, and not expressed by the human tumor cells.

The vast majority of collagen gene expression was attributable to stromal compartments, with the lone exception being COL17A1, which was high in tumors. “Normal” stroma was characterized by relatively high expression of known markers for pancreatic stellate cells, smooth muscle actin, vimentin, and desmin, (ACTA2, VIM, and DES). Stellate cells have been shown to promote cancer cell survival in vitro[18], but at the same time may restrain PDAC in mouse models[19,20]. Targeting desmoplastic stroma by hedgehog pathway inhibition has been shown to both accelerate the development of disease[21] and enhance delivery of chemotherapy[22]. In patients, the ratio of smooth muscle actin stained area to the collagen-stained area has been shown to be predictive of poor outcomes[23]. We found that “activated” stroma was characterized by a more diverse set of genes associated with macrophages, such as the integrin ITGAM, and the chemokine ligands CCL13 and CCL18. “Activated” stroma also expressed other genes which point to its role in tumor promotion, including the secreted protein SPARC, WNT family members WNT2, and WNT5A, gelatinase B (MMP9), and stromelysin 3 (MMP11). The presence of fibroblast activation protein (FAP), which has previously been related to worse prognosis, in the activated stroma suggests that an activated fibroblast state may be partially responsible for the poor outcomes for these patients[24]. This lead us to hypothesize that the “normal” stroma factor may describe a “good” version of stroma and that “activated” stroma factor may describe the activated inflammatory stromal response that has been seen in previous studies to be responsible for disease progression[25-27]. Our factor analysis supports a complex, multi-gene model of stroma in PDAC, which may explain why single gene analysis has yielded mixed results.

Identification of tumor-specific subtypes

Independent of normal and stromal factors, we found that two tumor-specific factors define “classical” and “basal-like” subtypes of PDAC. When our samples were split into the two tumor subtypes (Fig. 3a), patients with basal-like subtype tumors had an overall worse median survival of 11 months and 44% 1-year survival compared to 19 months and 70% 1-year survival for those with classical subtype tumors (p=0.007, Fig. 3b). All cell lines assayed in this study (p < 0.001), as well as a majority of metastatic samples (p = 0.002), were classified as “basal-like”, suggesting that cell line models represent only one subset of PDAC. These subtypes as well as their prognostic value were independently validated within the recently published International Cancer Genome Consortium (ICGC) PDAC microarray data set (Fig. 3c, d)[28]. Genes from the “basal-like” factor, including laminins and keratins, were also consistent with basal subtypes previously defined in bladder[14,28,29] and breast[30] cancers (Fig. 3e-h). Interestingly, genes from our “basal-like” subtype reproduced subtype calls (p<0.001) in breast cancer, had prognostic value in breast cancer samples (p<0.001) and reproduced previous subtype calls in bladder cancer (p<0.001). Given these promising results, we developed a single-sample cross-platform classifier of basal-like subtype which was trained on our microarray, TCGA bladder, and Perou breast cancer data, with a 93% cross validation accuracy, and was able to classify TCGA breast cancer data with 92% accuracy during external validation (Supplementary Fig. 6)
Figure 3

Tumor specific gene expression suggests two subtypes of PDAC with similarities to other tumor types. (a) Consensus clustered heat map of primary tumors, metastatic tumors, and cell line models of PDAC using correlation as the underlying distance function shows two subtypes of PDAC. (b) Kaplan-Meier survival analysis of resected primary patients from each tumor subtype (36 basal-like, 89 classical) in a shows differential prognosis among subtypes with a hazard ratio of 1.89, and a 95% CI of [1.19, 3.02]. (c) Consensus clustered heat map of tumors in the ICGC PDAC cohort split by basal and classical factor gene expression into basal-like (n=56) and classical (n=47) tumors. (d) Basal-like tumors in the ICGC data set has a hazard ratio of 2.11, with a 95% CI of [1.14, 3.89]. Median follow up was 20 months (e) Consensus clustered heat map of TCGA Bladder cancer samples split by basal and classical factor gene expression into basal-like (n=128) and classical-like (n=95) tumors strongly agrees with BASE47 basal calls shown above the heat map. (f) Subtyping in the TCGA BLCA data set had a hazard ratio of 1.43, with a 95% CI of [0.84, 2.42] (g) Consensus clustered heat map of the Perou breast cancer data set as split by basal factor genes (n=72 basal-like, n=223 not basal) strongly agrees with the division of samples into previously published basal and non-basal subtypes. (h) Basal-like breast cancer, as defined by our labeling, had a hazard ratio of 3.52, with a 95% CI of [1.94, 6.38].

Potential subtypes of PDAC have previously been described by Collisson et al.[4]. We used the published exemplar genes for “exocrine-like”, “classical”, and “quasimesenchymal” subtypes to cluster normal pancreas, cell lines, and primary PDAC tumors from our cohort (Supplementary Fig. 7a). The three previous classifications were also observed in our data, but did not hold prognostic power either by cluster label or by supervised classification with PAM[31](Supplementary Fig. 7b). Furthermore, inclusion of the Collisson et al. subtypes into a multivariate Cox regression with our proposed tumor subtypes did not remove the predictive power of our subtyping (p = 0.014). By cross-referencing the genes from Collisson et al.'s model with our NMF model, we observed three key findings. First, “exocrine-like” genes overlapped with genes from our exocrine pancreas factor (17/17). Tumors in this cluster had expression indistinguishable from adjacent normal samples from our data set. Second, Collisson et al.'s “classical” genes overlapped with our “classical” subtype genes (20/22), for which we retain the naming convention “classical”. Third, the gene set associated with “quasimesenchymal” subtype appeared to be a mixed collection of genes from our “basal-like” tumor (6/20) and stromal subtypes (6/20). Thus, the appearance of stromal factors in the Collisson et al. list of “quasimesenchymal” class genes may explain the apparent mesenchymal-like gene expression that was observed. “Basal-like” and “classical” tumors were found within both “normal” and “activated” stroma subtypes (Fig. 4a). Differential prognosis among tumor and stroma subtypes was cumulative, as “classical” subtype tumors with “normal” stroma subtypes (n = 24) had the lowest hazard ratio of 0.39, with a 95% CI of [0.21, 0.73], while the “basal-like” subtype tumors with “activated” stroma subtypes (n = 26) had the highest hazard ratio of 2.28 with a 95% CI of [1.34, 3.87] (Fig. 4b). In a multivariate Cox regression model which included tumor subtypes, stromal subtypes, and clinical variables (gender, race, T stage, N stage, margin status, adjuvant therapy, histological grade, and age), both classifications were independently associated with survival (stroma subtypes: p = 0.037, tumor subtypes: p = 0.003).
Figure 4

Multivariate survival analysis of tumor and stromal subtypes. (a) Heat map of tumor samples using 25 genes from each of the tumor and stromal factors, with samples sorted horizontally by classification. Signature scores for selected gene sets appear above for each sample. (b) Combined Kaplan-Meier survival analysis of resected primary patients from basal-like or classical tumor types and normal or activated stroma subtypes with differential survival (p < 0.001 log-rank test). Differential prognosis among subtypes shows complementarity. Classical tumors with normal stroma subtypes (n=24) had the lowest hazard ratio of 0.39, and a 95% CI of [0.21, 0.73], while basal-like tumors with activated stroma subtypes (n=26) had the highest hazard ratio of 2.28 with a 95% CI of [1.34, 3.87]. (c) Kaplan-Meir survival analysis shows that patients with classical subtype tumors show less response to adjuvant therapy (HR = 0.76, 95% CI [0.40, 1.43]) compared to (d) basal-like tumors (HR of 0.38, and a 95% CI of [0.14, 1.09]). (e) Kaplan-Meir survival analysis shows that African-Americans have worse overall survival in both basal-like and classical subtypes, with a Hazard ratio of 2.28 and a 95% CI of [1.16,4.5].

Although basal-like subtype tumors have a worse prognosis, patients with basal-like subtype tumors showed a strong trend towards better response to adjuvant therapy (p=0.072, Fig. 4c,d). Among basal-like subtype patients, adjuvant therapy provided a hazard ratio of 0.38, (95% CI of [0.14, 1.09]), while in patients with classical subtype tumors, adjuvant therapy is associated with a hazard ratio of only 0.76 (95% CI [0.40, 1.43]). In our cohort, there was no association of most clinical variables (race, gender, T stage, N stage, differentiation, tumor cellularity) with survival, although positive nodal status trended towards significance, and positive margin status was significantly associated with worse survival (Table 1). Table 2 shows two-way associations of all subtype calls with clinical and pathological information from our cohort of PDAC patients. We found no association of tumor or stroma subtype with standard clinical or pathological variables, with the notable exception of mucinous features.
Table 2

Summary of associations with clinical covariates and subtypes

Tumor SubtypeFisher's ExactStroma SubtypeFisher's Exact

CovariateClassicalBasal-likep-valueNormalActivatedp-value
GenderMale50160.84915361
Female64191743

RaceCaucasian90270.52126651
African American13237

T StageT21660.595141
T387252559

N StageN03590.53211220.649
N172252154

MarginPositive3880.3857220.629
Negative65222249

Adjuvant TherapyYes48130.43710300.769
No219519

DifferentiationPoor23110.47911180.203
Well49161344

MucinLow Mucin49240.04218430.792
High Mucin233619

StromaNormal3180.144
Activated5731

Tumor-specific subtypes in patient-derived xenografts

To assess the tumor or stromal specificity of our signatures, we performed RNAseq on a group of 37 PDX tumors. PDX tumors are composed of human tumor cells surrounded by mouse stroma (Supplementary Fig. 8)[29]. Genes from both of our tumor signatures were expressed as human transcripts, whereas genes from both of our stromal signatures were expressed as mouse transcripts (Fig. 2d, Supplementary Fig. 9a). Furthermore, we found that PDX RNAseq expression divided PDX into both classical and basal-like groupings (Supplementary Fig. 9b) while predominantly expressing an activated stromal signature (Fig. 2d). We found that while tumor-specific subtype was not predictive of graft success (Fig. 5a), patient tumors with an activated stroma subtype had significantly higher graft success rates than those with normal stroma subtype or low amounts of stroma (Fig. 5b) (p=0.019). Basal-like subtype tumors also exhibited faster growth rates than classical tumors (p=0.032) as measured by the length of time that tumors took to grow to 200 mm3 (TT200, Fig. 5c-d), a previously used metric for PDX growth[30]. Retrospective analysis of patients who had matched PDX tumors found that a shorter TT200 was associated with an unfavorable recurrence-free survival (p=0.035, Fig. 5e), suggesting that PDX tumor growth rate may reflect patient biology.
Figure 5

Associations between tumor and stroma subtypes, PDX tumors, KRAS mutations and SMAD4 expression. (a) Tumor subtype was not associated with PDX graft success rate (p=0.417). (b) Activated stromal subtype samples engraft with higher success rates than low or normal stromal subtype samples (p=0.019) (c) Basal-like tumor subtype PDX reach 200 mm3 faster than classical subtype PDX (p=0.032). (d) PDX from samples with activated stroma subtype or normal stroma subtype do not have significantly different times to reach 200 mm3 (p=0.170). (e) PDX tumors with faster growth rates were associated with earlier recurrences in patients (HR = 0.31, 95% CI [0.10, 0.92]. (f) KRAS mutation type is not uniformly distributed among race or subtype. KRAS G12D mutations are more prevalent in basal-like subtype tumors than classical tumors (p=0.030). (g) African Americans have more G12V mutations, while Caucasians have more G12D mutations (p<0.001). (h) SMAD4 staining in primary tumors is predictive of successful PDX engraftment (p=0.044). (i) Basal-like subtype PDX exhibit weaker SMAD4 staining than classical subtype PDX (p=0.015).

We also measured both mouse and human-specific expression of the Collisson et al. genes in our PDX models. We found that while genes from the “classical” subtype were expressed by human cells in PDX, “quasimesenchymal” transcripts were expressed by a mixture of human and mouse cells, and “exocrine-like” transcripts were infrequently expressed (Supplementary Fig. 7c). This supports our hypothesis that while the “classical” subtype is a bona fide group, the “quasimesenchymal” subtype is partially driven by non-tumor contributions of stroma and the “exocrine-like” subtype by normal pancreas.

KRAS codon mutations, tumor-specific subtypes, and race

Studies of KRAS codon mutations have demonstrated that different codon mutations may have differential functions[31,32] and in some clinical studies, have been shown to be associated with differential outcome. Because PDX tumors are enriched for human-specific tumor cells, we evaluated KRAS codon mutations in our PDX cohort using manually curated RNAseq data. While the overall frequency of KRAS codon mutations was similar to a recent study of PDAC[7], we noted that the KRAS G12D mutation was significantly overrepresented in our basal-like subtype while G12V was isolated to the classical subtype (Figure 5f, p=0.030). Furthermore, we found an overrepresentation of KRAS G12V mutations in African Americans (Figure 5g, p<0.001). In contrast to basal-like breast cancers which occur most frequently in African American women and have a worse prognosis[33], African American patients in our cohort tended to have mainly classical subtype tumors (13 vs 2). We found that similar to other cancers, African Americans had a worse prognosis after adjusting for tumor subtype (Figure 4e, p=0.017). African American patients with classical subtype tumors had a median survival of 13 months compared to Caucasian patients with classical subtype tumors who had a median survival of 19 months.

Other commonly mutated genes and altered pathways in PDAC

Previously, loss of SMAD4 has been shown to promote tumor growth[34,35]. We found that, similar to previous PDX studies of PDAC, loss of SMAD4 was associated with graft success in PDX models[36] (Fig. 5h, Supplementary Fig. 10, p=0.044). Furthermore, in our PDX cohort, we found that SMAD4 expression was significantly higher in classical compared to basal-like subtype PDX tumors (Fig. 5i, p=0.015), consistent with the observation that SMAD4 loss confers a more aggressive phenotype. Using mutation, genomic subtype[3], and gene expression[28] data from publically available ICGC data in which we show recapitulation of our subtypes and prognosis, we evaluated significantly mutated genes and pathways in PDAC, including ones recently identified through whole-exome sequencing of microdissected primary PDAC tumors[1-3,7]. We found no significant associations between our expression subtypes and these mutationally altered pathways, i.e. TGFβ, RB, NOTCH, CTNNB1, SWI/SNF, and DNA repair (Supplementary Fig. 11). Furthermore, we found no overlap between our subtypes and recently identified genomic subtypes, or response to platinum therapy[3]. Consistent with this, a recent comprehensive study of somatic mutations in PDAC long-term survivors suggested that somatic mutations alone will not be sufficient to explain clinical outcome[37]. Given the overlap of our classical subtype with that of Collisson et al.[5], it was not surprising to find that our classical subtype was also enriched for genes associated with GATA6 overexpression[38] (Supplementary Fig. 12a, Fig 4a). GATA6 has been found to promote epithelial cell differentiation[38,39]. This prompted us to perform a more detailed histological markers of differentiation in our samples and found that samples with greater than 10% extracellular mucin, a marker of differentiation, comprised mostly of classical subtype tumors (88.5%, n=23) compared to only 11.5% (n=3) of basal-like subtype tumors (Supplementary Fig. 13, p=0.042, Table 2). Consistent with the increased presence of extracellular mucin, our classical subtype was enriched for genes upregulated in mucinous ovarian cancer (Supplementary Fig. 12b, WAMAUNYOKOLI_OVARIAN CANCER_GRADES_1_2_UP[40]). Interestingly, our basal-like subtype was enriched for genes related to KRAS activation and STK11 loss in a lung cancer mouse model where STK11-deficient tumors demonstrated shorter latency and more frequent metastasis[41] (Supplementary Fig. 12c). We found one sample with STK11 inactivation in the ICGC data; this sample was a basal-like subtype (Supplementary Fig. 11). Notably, our subtypes were not associated with other known signaling pathways in PDAC, including Fanconi anemia, DNA repair, chromatin remodeling, beta-catenin, RB, ARF, G1 (Fig. 4a). However, all of these pathways except for beta-catenin were considerably differentially expressed in cell lines compared to patient tumors, suggesting that gene expression in cell lines may be a deceptive representation of most tumors.

Low intra-patient heterogeneity in tumor-specific genes

We expect that only a subset of genes are relevant to the question of intra- and inter-patient heterogeneity in PDAC. Many methods exist to pre-select genes for supervised analysis[32], but selection of the most differentially expressed genes is a common preprocessing step during unsupervised analysis[33]. When clustering matched samples of metastatic and primary lesions using the 50 most differentially expressed genes among all matched samples, samples separated primarily by organ site instead of by patient (Fig. 6a, c). In contrast, when considering 25 top ranked exemplar genes each from the “basal-like” and “classical” factors, samples from the same patient clustered closer together, and were less dependent of organ site (Fig. 6b, d). This is further illustrated in a focused analysis of two patients (Fig. 6), whose tumor samples appear patient-specific when considering our tumor subtype gene list, but cluster by site when considering differentially expressed genes. Overall, we found that our tumor subtype gene list showed higher similarity (mean Pearson's ρ=0.53) between all other samples from the same patient than did the differentially expressed gene list (ρ=0.32, t-test p<0.001). Furthermore, our tumor subtype gene list produced much lower similarity among all other samples from the same organ site across different patients (ρ = 0.04) than the differentially expressed gene list (ρ=0.34, p<0.001). This observed similarity of tumor gene expression among tumors within the same patient suggests overall high inter-patient tumor heterogeneity and low heterogeneity between primary and metastatic sites. However, we did observe examples of intra-patient heterogeneity between metastatic sites. For example, lung metastases, even those from patients with “basal-like” tumors in other locations, clustered exclusively with the “classical” tumors, suggesting that some intra-patient heterogeneity may exist among metastatic sites, and supporting the previously reported divergent patterns of failure in PDAC[34].
Figure 6

Overcoming tumor cellularity reveals true heterogeneity among matched primary and metastatic sites. (a) Sample-sample correlations of matched primary and metastatic tumors using the 50 most differentially expressed genes across all samples (“DE50”) causes samples to group by organ location. (b) Sample-sample correlations using 25 genes each from classical and basal-like tumor lists (”T50”) caused samples to cluster instead by tumor subtype and patient of origin. (c) Correlation of samples within the same patient is higher when using T50 genes than when using DE50 genes. (d) Correlation of samples originating in the same organ was higher when using DE50 than when using T50. (e) Clustering of multiple samples from two patients using the DE50 divides samples by organ. Genes expressed highly in lung and liver tissue are noted with brackets. Clustering of the same samples using T50 genes separates samples by patient. Brackets note genes which differentiate the two patients. A diagram of sampled locations for these patients indicated by concentric circles, illustrating how samples simultaneously exhibit both patient (inner color) and organ (outer color) specific gene expression.

Discussion

Our study represents the largest investigation of primary and metastatic PDAC gene expression to date. We have used NMF to identify novel prognostic subtypes of PDAC which may have been previously obscured by confounding normal and stromal tissue. Our identification of normal-, tumor-, and stroma-specific gene expression signatures is supported by both their overlap with previously identified gene lists and their expression in appropriate tissue types. Our tumor subtypes are further supported by their relationship to previously identified basal tumor subtypes in breast and bladder cancers and their prognostic relevance in external cohorts. Our findings of two different stroma subtypes may help explain the differential effects of stroma previously seen in preclinical models. Tumor and stroma specific gene expression classified PDAC into four distinct subtypes with prognostic relevance. The orthogonal nature of tumor- and stroma-specific subtypes suggest an important interplay in patient tumors that will need to be taken into account as stroma and immune modulating therapies are studied. In our cohort, patients with basal-like tumors appeared to derive more benefit from adjuvant therapy. Whether basal-like and classical subtypes may be associated with response to specific therapies will need to be studied further as more effective therapies become available. One challenge will be defining preclinical model systems that recapitulate these subtypes as our results suggest that traditional cell lines are lacking in the classical subtype. Although we demonstrate that PDX models recapitulate tumor-specific subtypes, these models alone may not be sufficient due to either the lack of human stroma or overrepresentation of the activated stroma subtype in the tumors that are successfully grafted. Thus more detailed characterization of genetically engineered mouse models of PDAC models will be needed to determine which models best reflect both our tumor- and stroma-specific subtypes. Recent exome sequencing studies have confirmed commonly mutated genes in PDAC but have not uncovered mutations that clearly confer survival differences[2,3,7]. In fact, exome sequencing of a cohort of very long-term survivors of PDAC[37] found no differences in somatic mutations to explain the improved biology of tumors from these rare patients compared to the majority of patients with PDAC, suggesting that examining somatic mutations alone may not be sufficient to understand the biological and clinical differences in PDAC tumors. Furthermore exome sequencing studies and studies of microdissected samples are limited to the tumor compartment and overlook the stroma compartment which has been shown to be biologically critical in PDAC, with both tumor-promoting and tumor-inhibiting effects. Our results suggest that RNA subtypes may better capture the molecular landscape of PDAC and its reflection on patient outcome. We hypothesize that our RNA subtypes may reflect the broad effect of somatic mutations while also capturing the importance of the neoplastic stroma. These results provide new insight into the molecular composition of PDAC which may be used for precision medicine. Furthermore, knowledge of these subtypes and their prognostic value may provide decision support in a clinical setting where the choice and timing of therapies is critical.
  47 in total

1.  An in vivo platform for translational drug development in pancreatic cancer.

Authors:  Belen Rubio-Viqueira; Antonio Jimeno; George Cusatis; Xianfeng Zhang; Christine Iacobuzio-Donahue; Collins Karikari; Chanjusn Shi; Kathleen Danenberg; Peter V Danenberg; Hidekazu Kuramochi; Koji Tanaka; Sharat Singh; Hossein Salimi-Moosavi; Nadia Bouraoud; Maria L Amador; Soner Altiok; Piotr Kulesza; Charles Yeo; Wells Messersmith; James Eshleman; Ralph H Hruban; Anirban Maitra; Manuel Hidalgo
Journal:  Clin Cancer Res       Date:  2006-08-01       Impact factor: 12.531

2.  Tumor engraftment in nude mice and enrichment in stroma- related gene pathways predict poor survival and resistance to gemcitabine in patients with pancreatic cancer.

Authors:  Ignacio Garrido-Laguna; Maria Uson; N V Rajeshkumar; Aik Choon Tan; Elizabeth de Oliveira; Collins Karikari; Maria C Villaroel; Ana Salomon; Gretchen Taylor; Rajni Sharma; Ralph H Hruban; Anirban Maitra; Daniel Laheru; Belén Rubio-Viqueira; Antonio Jimeno; Manuel Hidalgo
Journal:  Clin Cancer Res       Date:  2011-07-08       Impact factor: 12.531

3.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

4.  Pancreatic stellate cells: partners in crime with pancreatic cancer cells.

Authors:  Alain Vonlaufen; Swapna Joshi; Changfa Qu; Phoebe A Phillips; Zhihong Xu; Nicole R Parker; Cheryl S Toi; Romano C Pirola; Jeremy S Wilson; David Goldstein; Minoti V Apte
Journal:  Cancer Res       Date:  2008-04-01       Impact factor: 12.701

5.  Exploration of global gene expression patterns in pancreatic adenocarcinoma using cDNA microarrays.

Authors:  Christine A Iacobuzio-Donahue; Anirban Maitra; Mari Olsen; Anson W Lowe; N Tjarda van Heek; Christophe Rosty; Kim Walter; Norihiro Sato; Antony Parker; Raheela Ashfaq; Elizabeth Jaffee; Byungwoo Ryu; Jessa Jones; James R Eshleman; Charles J Yeo; John L Cameron; Scott E Kern; Ralph H Hruban; Patrick O Brown; Michael Goggins
Journal:  Am J Pathol       Date:  2003-04       Impact factor: 4.307

6.  Xenome--a tool for classifying reads from xenograft samples.

Authors:  Thomas Conway; Jeremy Wazny; Andrew Bromage; Martin Tymms; Dhanya Sooraj; Elizabeth D Williams; Bryan Beresford-Smith
Journal:  Bioinformatics       Date:  2012-06-15       Impact factor: 6.937

7.  Subtypes of pancreatic ductal adenocarcinoma and their differing responses to therapy.

Authors:  Eric A Collisson; Anguraj Sadanandam; Peter Olson; William J Gibb; Morgan Truitt; Shenda Gu; Janine Cooc; Jennifer Weinkle; Grace E Kim; Lakshmi Jakkula; Heidi S Feiler; Andrew H Ko; Adam B Olshen; Kathleen L Danenberg; Margaret A Tempero; Paul T Spellman; Douglas Hanahan; Joe W Gray
Journal:  Nat Med       Date:  2011-04-03       Impact factor: 53.440

8.  Inferring tumour purity and stromal and immune cell admixture from expression data.

Authors:  Kosuke Yoshihara; Maria Shahmoradgoli; Emmanuel Martínez; Rahulsimham Vegesna; Hoon Kim; Wandaliz Torres-Garcia; Victor Treviño; Hui Shen; Peter W Laird; Douglas A Levine; Scott L Carter; Gad Getz; Katherine Stemke-Hale; Gordon B Mills; Roel G W Verhaak
Journal:  Nat Commun       Date:  2013       Impact factor: 14.919

9.  Smad4 loss promotes lung cancer formation but increases sensitivity to DNA topoisomerase inhibitors.

Authors:  Sarah M Haeger; Joshua J Thompson; Sean Kalra; Timothy G Cleaver; Daniel Merrick; Xiao-Jing Wang; Stephen P Malkoski
Journal:  Oncogene       Date:  2015-04-20       Impact factor: 9.867

10.  TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions.

Authors:  Daehwan Kim; Geo Pertea; Cole Trapnell; Harold Pimentel; Ryan Kelley; Steven L Salzberg
Journal:  Genome Biol       Date:  2013-04-25       Impact factor: 13.583

View more
  591 in total

1.  Altered RNA Splicing by Mutant p53 Activates Oncogenic RAS Signaling in Pancreatic Cancer.

Authors:  Luisa F Escobar-Hoyos; Alex Penson; Ram Kannan; Hana Cho; Chun-Hao Pan; Rohit K Singh; Lisa H Apken; G Aaron Hobbs; Renhe Luo; Nicolas Lecomte; Sruthi Babu; Fong Cheng Pan; Direna Alonso-Curbelo; John P Morris; Gokce Askan; Olivera Grbovic-Huezo; Paul Ogrodowski; Jonathan Bermeo; Joseph Saglimbeni; Cristian D Cruz; Yu-Jui Ho; Sharon A Lawrence; Jerry P Melchor; Grant A Goda; Karen Bai; Alessandro Pastore; Simon J Hogg; Srivatsan Raghavan; Peter Bailey; David K Chang; Andrew Biankin; Kenneth R Shroyer; Brian M Wolpin; Andrew J Aguirre; Andrea Ventura; Barry Taylor; Channing J Der; Daniel Dominguez; Daniel Kümmel; Andrea Oeckinghaus; Scott W Lowe; Robert K Bradley; Omar Abdel-Wahab; Steven D Leach
Journal:  Cancer Cell       Date:  2020-06-18       Impact factor: 31.743

Review 2.  Molecular subtypes in cancers of the gastrointestinal tract.

Authors:  Maarten F Bijlsma; Anguraj Sadanandam; Patrick Tan; Louis Vermeulen
Journal:  Nat Rev Gastroenterol Hepatol       Date:  2017-04-12       Impact factor: 46.802

Review 3.  The pancreatic cancer genome revisited.

Authors:  Akimasa Hayashi; Jungeui Hong; Christine A Iacobuzio-Donahue
Journal:  Nat Rev Gastroenterol Hepatol       Date:  2021-06-04       Impact factor: 46.802

Review 4.  Biomarkers in pancreatic ductal adenocarcinoma.

Authors:  J Gallego; C López; R Pazo-Cid; F López-Ríos; A Carrato
Journal:  Clin Transl Oncol       Date:  2017-06-14       Impact factor: 3.405

5.  Aldolase B-Mediated Fructose Metabolism Drives Metabolic Reprogramming of Colon Cancer Liver Metastasis.

Authors:  Pengcheng Bu; Kai-Yuan Chen; Kun Xiang; Christelle Johnson; Scott B Crown; Nikolai Rakhilin; Yiwei Ai; Lihua Wang; Rui Xi; Inna Astapova; Yan Han; Jiahe Li; Bradley B Barth; Min Lu; Ziyang Gao; Robert Mines; Liwen Zhang; Mark Herman; David Hsu; Guo-Fang Zhang; Xiling Shen
Journal:  Cell Metab       Date:  2018-04-26       Impact factor: 27.287

6.  Classification of Pancreatic Cancer: Ready for Practical Application?

Authors:  Danny N Khalil; Eileen M O'Reilly
Journal:  Clin Cancer Res       Date:  2018-05-11       Impact factor: 12.531

7.  Cysteine depletion induces pancreatic tumor ferroptosis in mice.

Authors:  Michael A Badgley; Daniel M Kremer; H Carlo Maurer; Kathleen E DelGiorno; Ho-Joon Lee; Vinee Purohit; Irina R Sagalovskiy; Alice Ma; Jonathan Kapilian; Christina E M Firl; Amanda R Decker; Steve A Sastra; Carmine F Palermo; Leonardo R Andrade; Peter Sajjakulnukit; Li Zhang; Zachary P Tolstyka; Tal Hirschhorn; Candice Lamb; Tong Liu; Wei Gu; E Scott Seeley; Everett Stone; George Georgiou; Uri Manor; Alina Iuga; Geoffrey M Wahl; Brent R Stockwell; Costas A Lyssiotis; Kenneth P Olive
Journal:  Science       Date:  2020-04-03       Impact factor: 47.728

8.  Real-time Genomic Characterization of Advanced Pancreatic Cancer to Enable Precision Medicine.

Authors:  Andrew J Aguirre; Jonathan A Nowak; Nicholas D Camarda; Richard A Moffitt; Arezou A Ghazani; Mehlika Hazar-Rethinam; Srivatsan Raghavan; Jaegil Kim; Lauren K Brais; Dorisanne Ragon; Marisa W Welch; Emma Reilly; Devin McCabe; Lori Marini; Kristin Anderka; Karla Helvie; Nelly Oliver; Ana Babic; Annacarolina Da Silva; Brandon Nadres; Emily E Van Seventer; Heather A Shahzade; Joseph P St Pierre; Kelly P Burke; Thomas Clancy; James M Cleary; Leona A Doyle; Kunal Jajoo; Nadine J McCleary; Jeffrey A Meyerhardt; Janet E Murphy; Kimmie Ng; Anuj K Patel; Kimberly Perez; Michael H Rosenthal; Douglas A Rubinson; Marvin Ryou; Geoffrey I Shapiro; Ewa Sicinska; Stuart G Silverman; Rebecca J Nagy; Richard B Lanman; Deborah Knoerzer; Dean J Welsch; Matthew B Yurgelun; Charles S Fuchs; Levi A Garraway; Gad Getz; Jason L Hornick; Bruce E Johnson; Matthew H Kulke; Robert J Mayer; Jeffrey W Miller; Paul B Shyn; David A Tuveson; Nikhil Wagle; Jen Jen Yeh; William C Hahn; Ryan B Corcoran; Scott L Carter; Brian M Wolpin
Journal:  Cancer Discov       Date:  2018-06-14       Impact factor: 39.397

9.  Generation of an in vitro 3D PDAC stroma rich spheroid model.

Authors:  Matthew J Ware; Vazrik Keshishian; Justin J Law; Jason C Ho; Carlos A Favela; Paul Rees; Billie Smith; Sayeeduddin Mohammad; Rosa F Hwang; Kimal Rajapakshe; Cristian Coarfa; Shixia Huang; Dean P Edwards; Stuart J Corr; Biana Godin; Steven A Curley
Journal:  Biomaterials       Date:  2016-09-02       Impact factor: 12.479

10.  A Novel MIF Signaling Pathway Drives the Malignant Character of Pancreatic Cancer by Targeting NR3C2.

Authors:  Shouhui Yang; Peijun He; Jian Wang; Aaron Schetter; Wei Tang; Naotake Funamizu; Katsuhiko Yanaga; Tadashi Uwagawa; Abhay R Satoskar; Jochen Gaedcke; Markus Bernhardt; B Michael Ghadimi; Matthias M Gaida; Frank Bergmann; Jens Werner; Thomas Ried; Nader Hanna; H Richard Alexander; S Perwez Hussain
Journal:  Cancer Res       Date:  2016-04-20       Impact factor: 12.701

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.