Stephanie Roessler1, Guoling Lin2, Marshonna Forgues1, Anuradha Budhu1, Shelley Hoover3, R Mark Simpson3, Xiaolin Wu4, Ping He5, Lun-Xiu Qin6, Zhao-You Tang6, Qing-Hai Ye6, Xin Wei Wang1. 1. 1. Laboratory of Human Carcinogenesis, National Cancer Institute, NIH, Bethesda, MD, USA; 2. 1. Laboratory of Human Carcinogenesis, National Cancer Institute, NIH, Bethesda, MD, USA; ; 2. Liver Cancer Institute, Fudan University, Shanghai, China; 3. 3. Laboratory of Cancer Biology and Genetics, National Cancer Institute, NIH, Bethesda, MD, USA; 4. 4. Laboratory of Molecular Technology, NCI-Frederick, SAIC-Frederick, Frederick, MD 21701, USA; 5. 5. Division of Hematology, Center for Biologics Evaluation and Research, Food and Drug Administration, Bethesda, Maryland 20892, USA. 6. 2. Liver Cancer Institute, Fudan University, Shanghai, China;
Abstract
Metastasis is the main cause of cancer mortality but its process remains poorly understood and thus hampers more effective treatment and improved cancer prognosis. To search for metastasis driver genes responsible for tumor spread, we integrated genomic and transcriptomic profiles of 61 matched primary tumors and distant metastases of liver or colorectal carcinoma isolated by laser-capture microdissection and assayed by array-based technologies. We found that primary tumor lesions and their matched distant metastases were largely similar at the genomic and transcriptomic levels, but substantial differences could be found between primary tumors with or without accompanying metastases. Interestingly, metastasis genes were principally tumor type and organ site-specific. Despite distinct pathway enrichment, different metastasis gene sets shared common prognostic capacity and were predictive of hepatocellular carcinoma survival in an independent cohort. Thus, the metastatic propensity is inherent to the primary tumor and the lack of general metastasis genes necessitates the development of specific treatment modalities.
Metastasis is the main cause of cancer mortality but its process remains poorly understood and thus hampers more effective treatment and improved cancer prognosis. To search for metastasis driver genes responsible for tumor spread, we integrated genomic and transcriptomic profiles of 61 matched primary tumors and distant metastases of liver or colorectal carcinoma isolated by laser-capture microdissection and assayed by array-based technologies. We found that primary tumor lesions and their matched distant metastases were largely similar at the genomic and transcriptomic levels, but substantial differences could be found between primary tumors with or without accompanying metastases. Interestingly, metastasis genes were principally tumor type and organ site-specific. Despite distinct pathway enrichment, different metastasis gene sets shared common prognostic capacity and were predictive of hepatocellular carcinoma survival in an independent cohort. Thus, the metastatic propensity is inherent to the primary tumor and the lack of general metastasis genes necessitates the development of specific treatment modalities.
Entities:
Keywords:
Colon cancer; Liver cancer; Organ site-specific metastasis; Profiling.
The progression of a primary tumor to metastatic disease is believed to be a multistep process which requires the ability of cells to survive in the circulation, extravasate into the parenchyma of distant organs, adapt to the new environment and outgrow secondary lesions 1, 2. Only few tumor cells are believed to be capable of completing this process. In addition, metastases arise at different rates and different organ sites are favored depending on the cancer type 3. Thus, it is expected that there is a significant difference in tumor biology between primary tumor cells and their metastasized progenies. Although, many metastasis-related genes have been identified, the extent to which these genes are involved in the metastasis development of different tumor entities, their role in organ site-specific metastasis, and how they drive progression remains unclear 4, 5.An effective and physiologically relevant approach to identify candidate metastasis driver genes is to perform genomic characterization of primary and paired metastatic lesions from the same individual. Such a comparison provides the most meaningful and physiological relevance to humancancer. However, such high quality patient samples of primary tumors and matched metastases are generally not available due to the fact that most of the metastatic lesions develop many years after initial cancer diagnosis and intervention. Thus, most studies investigating genes involved in metastasis rely on in vitro assays, animal models or metastases from autopsy without matched primary tumors. One approach which was taken in earlier studies is the selection of metastatic cells via in vivo mouse models, whereby, primary tumor tissues or cell lines are introduced into mice and organ site-specific metastatic cells are selected over multiple cycles and analyzed 6-8. These studies led to the discovery of many pro-metastatic genes 9. But only few studies so far compared a limited amount of primary tumors and paired metastases. Interestingly, gene profiling of 8 primary breast tumors and paired metastases revealed striking similarity between the primary tumor and the distant metastasis of the same patient 10. In addition, the analysis of a metastasis signature obtained from comparison of multiple cancer types and unmatched primary tumors was able to predict patient outcome in breast, lung, prostate adenocarcinoma and medulloblastoma 11. Previously, we have shown that primary hepatocellular carcinoma (HCC) samples and their corresponding intrahepatic metastases derived from surgical resection do not exhibit any statistically significant differences 12. However, these results were based on intrahepatic metastatic lesions and thus their difference may be minimal.To overcome the limitations described above, we collected high quality gastrointestinal tumor specimens derived from patients with primary tumors and corresponding distant metastases, whose metastatic lesions were determined to be potentially treatable by surgery. We collected a set of paired primary and distant metastatic liver and colon carcinomas from surgical resection with the goal of identifying metastasis dissemination-related genes. We performed gene expression profiling of laser capture microdissected tumor and paired metastasis and genome-wide somatic copy number alteration (SCNA) profiling to ensure that the identified genes are tumor-specific. We found that liver metastases display some genomic instability compared to their paired primary tumors but the gene expression profiles of primary liver and colorectal carcinoma did not show any significant differences. In contrast, liver metastases of colorectal carcinomapatients differed greatly from liver carcinoma. The integrative analysis of gene expression and SCNA data showed that potential cancer driver genes are enriched in primary tumor lesions with metastatic potential and that the organ site-specific metastasis gene signatures are associated with HCC prognosis despite of their distinct gene sets and related pathways.
Methods
Primary tumors and metastatic tissues were obtained with informed consent from patients who underwent radical resection or liver transplantation between 2004 and 2006 at the Liver Cancer Institute (LCI, Fudan University, Shanghai, China). The study was approved by the Institutional Review Board of LCI, Fudan University, Shanghai. A total of 39 patients, 7 colorectal adenocarcinoma, one gastric adenocarcima, 26 HCC, 4 cholangiocarcinoma and one angiosarcomapatient were recruited. Synchronous liver metastasis from all 7 colorectal carcinomapatients, synchronous lung metastasis from 2 liver cancerpatients, 5 synchronous lymph node metastasis, and 3 synchronous adrenal gland metastasis were obtained at the time of surgery of the primary tumor. For 11 liver cancerpatients, metachronous lung metastases were resected after primary tumor surgery and 10 HCCpatients did not develop any metastasis at 2 years post resection. All patients were diagnosed by two independent pathologists, with detailed information on clinical presentation and pathological characteristics. In addition, tissue samples of 5 patients with hemangioma were obtained. Tissue samples were fresh frozen in liquid nitrogen within 30 minutes of surgical excision.Enriched tumor cells from 61 primary and metastatic tumor specimens were obtained by laser capture microdissection (LCM). Selected tissue foci were microdissected with a laser microdissection microscope (LMD6000, LeicaMicrosystems, Buffalo Grove, IL) equipped with a 355nm ultraviolet laser diode from Hematoxylin & Eosin stained, 8µm thick, tissue sections mounted on glass slides containing a polyethylene naphthalate (PEN) membrane (Life Technologies, Grand Island, NY). Representative LCM images are shown in Figure 1B and Supplementary Material: Figure S1. The dissected cell population was selected with the aid of a drawing tablet and integrated pen tool (Wacom. Vancouver, WA). One Hematoxylin/eosin-stained section of each specimen was reviewed by a pathologist to confirm diagnosis and presence of tumor. The pathologist indicated which representative sections of the tumors should be microdissected. Tumor cells from primary tumor and metastatic specimens were collected.
Figure 1
Gene expression analysis of colorectal and liver carcinoma and metastases. (A) Colorectal tumors and paired metastasis to the liver are indicated in pink. Liver carcinoma with metastasis to the lymph node and to the lung are depicted in green and yellow, respectively. (B) Representative images of laser capture microdissection (LCM) of a primary cholangiocarcinoma before (top panel) and after LCM (bottom panel) of a case with lymph node metastasis (PEM-02A). In green are two areas selected for LCM. (C) Unsupervised clustering of 61 primary and metastatic clinical tumor specimens analyzed by gene expression microarrays. Colorectal carcinomas (GIM), liver carcinomas (PEM) with metastasis to the lymph node or the lung, non-metastatic liver carcinomas (NM) and normal liver (NL) are indicated. Paired tumor-metastasis samples which cluster next to each other are indicated with black dots and samples which do not cluster together with red arrows. (D) Overlap of gene lists resulting from comparison of non-metastatic (NM) cancer to lung metastases, to lymph node metastases and to liver metastasis of colorectal carcinoma. (E) In this study, 17 liver and colorectal carcinomas with their respective paired metastasis have been used to perform gene expression analysis of LCM-enrich tumor epithelium and to measure somatic copy number alterations (SCNA). The data obtained was subsequently used for integrative comparison of primary carcinoma and paired metastasis.
Following LCM, total RNA was isolated from dissected tissues using the PicoPure protocol (Arcturus, Mountain View, CA). RNA quality was analyzed using the Agilent 2100 Bioanalyzer RNA 6000 Pico Assay (Agilent, Santa Clara, CA). The mRNA was amplified with two linear amplification steps by in vitro transcription using the MEGAscript T7 kit (Ambion, Austin, TX), followed by the labeling step using the BioArray HighYield RNA Transcript Labeling Kit T3 from Enzo Life Sciences (Farmingdale, NY). Labeled cRNA was hybridized onto Affymetix HG-U133 Plus 2.0 Arrays. CEL files with the normalized expression data, and additional tumor marker information, were deposited in the GEO repository (GSE40367).All Affymetrix HG-U133 Plus 2.0 microarrays were normalized with MAS 5.0 in BRB array tools Version 4.2.1 (http://linus.nci.nih.gov/BRB-ArrayTools.html). Arrays were normalized to the median array. After exclusion of probes with more than 50% missing or 50% absent calls, 23,515 probes of a total of 54,675 probes on the HG-U133 Plus 2.0 arrays remained. Class comparison of the gene expression data was performed in BRB-Array Tools (http://linus.nci.nih.gov/BRB-ArrayTools.html). For survival risk prediction, genes whose expression was significantly related to survival by univariate Cox proportional hazards regression were used by principal component analysis. Next, to compute a prognostic index, the weighted average of the principal component values was calculated, using the regression coefficients derived from the Cox regression. Finally, this prognostic index was used to split samples into two groups of equal size by the median of the prognostic index. Thereby, a high value of the prognostic index corresponded to a high value of hazard of death (high risk), and consequently a relatively poor predicted survival (low risk). In order to evaluate the predictive value of the method, 10-fold cross-validation with 1000-fold random permutation of the Cox-Mantel log-rank test was performed. We used clustering applying Euclidean distance and average linkage of the survival groups for each gene signature to analyze patient subgroups.Somatic copy number alterations of 7 colon adenocarcinomas with paired liver metastasis and 10 liver carcinoma with metastasis to the lymph node, adrenal gland or lung were analyzed by Agilent array-based comparative genomic hybridization (CGH) as previously described 13. Briefly, genomic DNA was isolated using proteinase K digestion (P2308, Sigma-Aldrich, St. Louis, MO) and subsequent Phenol/Chlororform extraction. Purified DNA was quantified by the fluorometric assay Quant-iT Pico Green dsDNA (Invitrogen, Carlsbad, CA). ArrayCGH Human-Genome-CGH-105A Oligo Microarrays G4412A were carried out according to the manufacturer's instructions (Agilent, Santa Clara, CA). Five micrograms of genomic DNA and reference DNA of the opposite gender (human genomic DNA, Promega, San Luis Obispo, CA) were digested with the restriction enzymes AluI and RsaI. Digested DNA was labeled using the BioPrime Array CGH Genomic Labeling kit (Invitrogen, Carlsbad, CA) according to the manufacturer's instructions. Tumor and reference DNA were combined and hybridized to Human Genome CGH 105A Oligo Microarray glass slides (G4412A, Agilent, Santa Clara, CA) for 36-40 hours at 65°C, washed and dried. Subsequently, dried array slides were scanned using the DNA Microarray Scanner (Agilent, Santa Clara, CA). Raw image files of the arrays were processed using Feature Extraction software 8.1 (Agilent, Santa Clara, CA). Arrays with derivative log ratio spread (DLRS) higher than 0.2 log units were repeated and according to MIAME guidelines, raw data Feature Extraction files were deposited into the GEO repository (GSE41210).All algorithms were implemented in the R programming language (version 2.15.0; www.r-project.org). Signal intensities were preprocessed by the Agilent feature extraction software followed by segmentation analysis using the snapCGH (version 1.26.0) R package available at the Bioconductor (www.bioconductor.org) as previously reported 13. In brief, sex chromosomes were excluded from the arrayCGH analysis because the opposite gender was used as a reference for quality control. Intensity ratios of the tumor over reference samples were logarithm-2 (log2) transformed, median-centered, and subjected to an improved circular binary segmentation algorithm 14. Segments without significant statistical differences were merged into the same states using the method provided by Willenbrock and Fridlyand 15. Centers of the segmentation means for individual arrays were determined by setting the mode of density distribution as the baseline (log2 ratio = 0). The DNA copy number aberrations at particular genomic locations were determined by the corresponding segmentation means. Regions with segmented log2 ratios greater than 0.3 and smaller than -0.3 were considered regions of gain and loss, respectively. For unsupervised hierarchical clustering of the arrayCGH data, segmented data was converted to 1, -1, 0 according to their respective status of gain, loss and no change, and then weighted by the squares of the frequencies of copy number gain or loss at a particular genomic location. Adjacent probes with identical DNA copy number profiles across all samples were combined to form unique segments. Average-linkage clustering was performed based on the Euclidean distance metric. The significance of the difference in gain or loss status between HCC subgroups in unique segments was determined by the Fisher's exact test and the p-values were adjusted using Benjamini-Hochberg correction. Adjacent significant regions with gaps less than 50kb were combined and considered as one large region with differential genomic aberrations between subgroups.To calculate the Spearman or Pearson correlation coefficients (r) for genes with more than one probe set, the mean gene expression was calculated. For each gene, the correlation coefficient between the segmented log2 ratios of the DNA copy number and the expression values was computed, as done previously 13. The empirical null distribution was obtained from the Spearman or Pearson correlation coefficients calculated from 1,000 random permutations, as indicated. Genes with correlation coefficients greater than the one corresponding to the 95th percentile of the empirical null distribution were considered positively correlated between DNA copy number alteration and expression.Organ site-specific metastasis gene lists resulting from class comparisons were analyzed by Ingenuity Pathway Analysis (IPA; http://www.ingenuity.com/products/ipa). We performed IPA of the respective corresponding probe sets to each organ site specific metastasis gene list with the associated fold change as input. For each of the three gene signatures enrichment of genes in canonical pathways was analyzed separately. The statistical significance was defined as p<0.05. To account for multiple testing, a statistical significance of p<0.001 and False Discovery Rate (FDR) <0.1 (<10% false positives) were used.
Results
Gene expression profiling of matched primary tumors and distant metastases reveals a high degree of similarity
We conducted gene expression profiling of a total of 61 matched primary liver and colon carcinoma lesions with paired synchronous metastasis to the lung, lymph nodes, adrenal glands, or liver, and of liver carcinomas without tumor relapse at 2-year post tumor resection (Supplementary Material: Table S1 and Figure 1A). To exclude gene expression patterns of the tumor stroma, enriched tumor epithelia were obtained by laser capture microdissection (LCM) and representative images of LCM are shown in Figures 1B and S1. Because the liver microenvironment was shown to exhibit a distinct metastasis signature which is characteristic of a Th1/Th2-like cytokine shift, we also attempted to isolate tumor stroma for gene expression profiling by LCM 16. However, we were not able to obtain enough material to perform microarray analysis and thus, focused our analysis on the comparison of tumor epithelia in primary tumors and paired metastasis. Unsupervised hierarchical clustering of the gene expression data revealed four major tumor-site-specific and organ-specific clusters where most of the primary tumors (14/17) preferentially clustered with their paired extrahepatic metastases (Figure 1C), reminiscent of our previous findings with intrahepatic metastases 12. Among primary liver carcinoma with extrahepatic metastasis (PEM), 2 synchronous (PEM-12, PEM-13) and 8 metachronous lung metastases fell into the same cluster. Thus, synchronous and metachronous metastasis to the lung showed more similarities than metastasis to other organs such as the lymph node or adrenal gland. It appears that tumors and their paired metastases share similar gene expression profiles and that liver cancers with different metastatic sites have distinct transcriptomes.
Identification of metastasis-related genes across tissue types
To further search for metastasis-related genes, we compared gene expression profiles of primary colorectal carcinomas (gastrointestinal metastasis: GIM) and paired synchronous liver metastases using pairwise class comparison even when increasing statistical power by comparing all primary liver and colorectal tumor samples (PEM+GIM) to their paired metastasis (Table 1), indicating that these primary carcinomas and their paired metastasis are transcriptomically similar. Taken together, our results indicate that primary colorectal or liver carcinomas and their paired synchronous metastasis do not exhibit any significant transcriptomic differences. These results reinforce the hypothesis that the ability to metastasize is inherent to the primary tumor and metastasis should be compared to non-metastatic tumors.
Table 1
Differentially expressed genes and performance of classifier.
Classifier
Clinical groups
No. of cases
p-valuea
FDRb
No. of genes
Colon
Primary vs. metastasis
7/7
<0.001
1
<0.01
1
Lymph node
Primary vs. metastasis
5/5
<0.001
>0.983
<0.01
>0.983
Adrenal gland
Primary vs. metastasis
3/3
<0.001
NA c
<0.01
0.916
Lung
Primary vs. metastasis
2/2
<0.001
NA
<0.01
1
All cases
Primary vs. metastasis
17/17
<0.001
NA
<0.01
1
Lung
NM vs. lung metastasis
10/12
<0.001
<0.064
280
<0.01
<0.124
Colon
NM vs. colon metastasis
10/7
<0.001
<0.001
1387
<0.01
<0.065
2540
Lymph node
NM vs. lymph node metastasis
10/5
<0.001
<0.024
730
<0.01
<0.094
1783
a Parametric p-value for two-sided t-test. b False Discovery Rate (FDR); bold indicates significant FDR values at FDR < 0.1 (10%). c NA: not available.
In contrast to a minimum difference among primary and paired metastatic lesions, class comparison of non-metastatic liver cancers (NM) to those with lung metastases or lymph node metastasis resulted in 369 and 1011 significant probes sets, which correspond to 280 and 730 genes, respectively. In addition, liver metastases of colorectal carcinomas differed in 2002 probe sets to NM which represent 1387 genes (p<0.001; Supplementary Material: Table S2). However, the identified differential gene sets exhibited a small overlap of only 14 genes, suggesting that most genes are tumor type and organ site-specific (Figure 1D; Supplementary Material: Table S2). Most of the 14 genes' function is related to metabolism (5/14 or 36%). Three of the genes are involved in proliferation, while 3 others are related to cell-cell or cell-matrix interaction and the remaining genes have roles in fibrosis, autoimmunity and histone modification. Taken together, it appears that the metastatic genes of different organ-specific metastases are distinct.
Somatic copy number alterations in primary tumors and metastases
Since genomic alterations have been implicated in metastasis, we performed an integrative analysis of genomic and transcriptomic profiling in order to identify metastasis driver genes (Figure 1E). To determine genomic differences between primary tumors and distant metastases, we analyzed the somatic copy number alteration (SCNA) profiles by array-based comparative genomic hybridization (arrayCGH). We only included cases with fresh frozen metastatic tissues available which left us with 34 clinical specimens of 7 colorectal carcinoma and 10 liver cancerpatients. The comparison of primary and metastatic liver tumors showed that both exhibit differences in their SCNA profiles (Figure 2A and B). In addition, colorectal tumors showed few SCNAs and similarly to liver, colorectal tumors and paired liver metastasis they exhibited differences in their SCNA profiles (Figure 2C and D). Unsupervised hierarchical clustering of the genomic alterations revealed that all 7 primary and paired metastatic colorectal carcinoma pairs each clustered together, whereas, only 5 out of 10 liver carcinoma pairs clustered adjacent to each other (Figure 3A). Next, we analyzed the mean number of SCNAs per sample which was comparable with previous studies (Figure 3B) 17. Thus, liver tumors appeared to have accumulated more genomic aberrations than colorectal tumors. Although not significant, there was a trend of more SCNAs in liver carcinoma with paired extrahepatic metastasis (PEM) than colorectal carcinoma with matched metastases to the liver (GIM). We calculated the difference of SCNA counts in primary versus metastatic tissue pairs for each patient. Consistent with the clustering results (Figure 3A), the difference of SCNA counts in primary versus metastatic tissue pairs was larger in PEM specimens than in GIM specimens (t-test p= 0.044; Figure 3C). This suggests that the genomic instability is higher in PEM than in GIM.
Figure 2
Frequencies of significant aberration in SCNAs of gain and loss areas in gastrointestinal and liver carcinoma. (A) Frequencies of significant SCNAs are plotted as a function of genome location for 10 primary liver cancer cases. Positive values indicate frequencies of samples showing copy number increases, shown in red, and negative values indicate frequencies of samples showing copy number decreases, shown in green. Chromosome boundaries and centromere position are indicated by vertical solid and dashed lines, respectively. (B) Frequencies of SCNAs in the corresponding distant metastases of these 10 liver cancer cases. (C and D) Frequencies of significant SCNAs of 7 primary colorectal cancer and 7 paired colorectal metastases to the liver are plotted as a function of genome location, respectively. Horizontal dashed blue lines indicate frequency of + and -0.2.
Figure 3
Distinct patterns and frequencies of significant aberration in SCNA profiles in primary and metastatic gastrointestinal (GIM) and liver carcinoma (PEM). (A) Unsupervised clustering of SCNA profiles of 17 paired primary and metastatic tumor specimens is shown. Paired tumor-metastasis samples are indicated with black dots and un-paired samples with red arrows. Colorectal carcinomas with liver metastasis (GIM), liver carcinomas (PEM) with lung metastasis and metastasis to the lymph node are shown in pink, yellow and green respectively. (B) Mean number of deletions and amplifications per sample type are shown for primary (P) and metastatic (M) tissues of colorectal (GIM) and liver carcinoma patients (PEM). Mean deletion counts are displayed as grey and amplification counts as black bars, respectively. (C) Difference of SCNA counts in primary versus metastatic tissue pairs for GIM and PEM. (D and E) Differences (y axis) between frequencies of gain (D) and loss (E), respectively, across the genome for GIM versus PEM cases. SCNA frequencies are plotted as a function of positioned location in the genome with positive values indicating higher frequencies in GIM versus PEM.
Comparison of SCNA frequency in GIM and PEM samples showed that the gained and lost regions differ by up to 60% (Figure 3D and E). Chromosome 1q, 8q and 17q gain and loss of large regions of chromosomes 3p, 4q, 9p, 13q, 16q and 21 were mainly associated with liver carcinoma, whereas gain of 13q and loss of 18 were most prevalent in colorectal carcinoma. The statistical analysis of SCNAs showed that 14 gain regions and 27 loss regions are significantly different between GIM and PEM (Supplementary Material: Table S3). Further analysis revealed that most of the loss (13/14) and gain regions (22/27) have a higher frequency of aberration in PEM than GIM. Thus, colorectal and liver tumors have specific genomic aberration profiles. In addition, primary tumors and matched metastases display some genomic differences, which indicate genomic instability in contrast to the transcriptomic profiles that do not show any significant differences.
Integration of gene expression profiling with somatic copy number alterations
To identify potential cancer driver genes, we sought to integrate SCNA with gene expression profiling by performing correlation analysis, a strategy successfully used previously 13, 18. Gene expression profiles of 7 primary colorectal cancer cases with paired metastasis and for 10 primary liver cancers and 9 paired metastases were available (N=33; PEM-11M gene expression microarray failed the quality control). We applied Pearson or Spearman correlation analysis of the gene expression and SCNA profiles for each gene of the 33 specimens and compared the resulting density distribution to the random distribution (Figure 4A and B). We found that 3,593 out of 12,714 genes (28.26%) of the Pearson correlation and 3,566 out of 12,714 genes (28.05%) of the Spearman distribution fell into the 95th percentile of the corresponding random distribution. Thus, Pearson and Spearman correlation performed similarly and about 28% of genes showed correlation between SCNA and gene expression and therefore are potential cancer driver genes.
Figure 4
Genomic and transcriptomic correlation of SCNAs observed in primary colorectal and liver carcinomas with paired metastasis. (A) The density histogram shows the distribution of the Pearson and (B) Spearman correlation coefficients of gene expression and arrayCGH data from 33 tumor and metastasis tissues. The blue line represents the density distribution of the 1000-fold random permutation of the data and the red line represents the density distribution of the Pearson correlation coefficients.
Distinct regulatory pathways are enriched in organ site-specific metastasis
Due to the small overlap of metastasis genes of only 14 genes, we further analyzed the organ site-specific metastasis genes to determine their functional networks. We therefore focused on the organ site-specific genes after integration of gene expression profiling with somatic copy number alteration. Forty-nine (21.3%) of the 230 lung metastasis-specific genes showed significant correlation which corresponds to 52 probe sets (Supplementary Material: Table S4). We found that 66 (22.5%; 81 corresponding probe sets; Supplementary Material: Table S5) out of 293 lymph node-specific genes displayed correlation and 203 (21.3%; 256 corresponding probe sets; Supplementary Material: Table S6) genes of the 952 colorectal liver metastasis-specific genes showed significant correlation. We performed Ingenuity Pathway Analysis (IPA) of the respective corresponding probe sets and the associated fold change as input and found that the top 5 canonical pathways differed greatly. The top 2 pathways obtained from IPA for lymph node metastasis genes (Mevalonate Pathway and Geranylgeranyl-PP Biosynthesis) are involved in the HMG-CoA pathway. Lung metastasis genes on the other hand were enriched in cell-cell signaling and colon metastasis in stearate/fatty acid activation (Figure 5A). However, survival analysis of the respective gene signatures revealed that all three signatures are able to predict patient outcome in an independent HCC cohort (N=242; Figure 5B-D) 19. Lastly, we sought to test if the three gene signatures assigned the same patients as good and poor outcome, respectively. We used clustering applying Euclidean distance and average linkage of the survival groups for each gene signature to analyze patient subgroups (Figure 5E). The results confirmed that all three signatures assigned HCCpatients into similar low and high risk groups. Thus, although the lymph node, lung and colon metastasis-specific genes fall into distinct pathways, each is similarly associated with HCCpatient outcome.
Figure 5
Survival risk prediction analysis and analysis of the function of the organ site-specific metastasis gene signatures. (A) Top 5 canonical pathways identified by Ingenuity Pathway Analysis of the lymph node, lung and colon metastasis specific gene signatures. Displayed is the -log(p-value) of the respective canonical pathways for the lymph node (orange), lung (purple) and colon (blue) signatures. (B) Kaplan-Meier survival curves showing the overall survival of the predicted high and low risk groups in a HCC cohort (N = 242) predicted by the lymph node, (C) lung, (D) colon and (E) overlapping genes. (F) Hierarchical clustering applying Euclidean distance and average linkage of the survival groups for each gene signature (N=242).
Discussion
The high heterogeneity of cancer manifests in the degree and site of metastasis. Intrahepatic metastasis is most common in liver cancer; however, extrahepatic metastases do occur in the lung lymph nodes and bone 20. For multiple solid tumors, the liver is the most common site of metastasis, especially for colorectal cancer 21. In particular, gastrointestinal tumors metastasize preferentially to the liver most likely due to dissemination of tumor cells through the portal vein. Approximately 20% of colorectal cancerpatients present synchronous liver metastasis at diagnosis and an additional 20-30% of colorectal cancerpatients will develop metachronous liver metastases after radical primary colorectal resection 22, 23. Thus, it appears that metastases of different tumor entities occur at varying rates and are organ site-specific; however, the contribution of driver genes and the microenvironment at the distant site to the metastatic process are still poorly understood 24.In this study, we analyzed gene expression profiles of a unique set of fresh-frozen resection specimens derived from paired primary tumors and metastases of liver and colorectal cancerpatients. Although the patients of this study had metastatic disease their primary tumor and distant metastasis were resectable which allowed us to obtain fresh-frozen specimens. We used laser capture microdissection to enrich tumor cells and performed gene expression and SCNA profiling. We found that the gene expression profiles of primary tumors and metastases are similar, but organ site-specific metastasis genes differ greatly. These data suggest that metastasis capacity and organ-site specificity are early events in tumorigenesis, without the requirement of additional mutations beyond those that are needed for primary tumor formation 25.Since cancer is a genetic disease, we also investigated the SCNA profiles of paired primary tumors and metastases. We found that the genomic profiles exhibit higher variability than the gene expression profiles, and that colorectal tumors had fewer genomic aberrations than liver carcinomas. The genomic changes between primary and metastatic tumors also were higher in liver tumors than colorectal tumors. It appears that the genomic instability is not reflected in the gene expression and most of the genetic changes may be passenger events for tumor cells. Interestingly, a recent study in renal-cell carcinoma found evidence for genomic and transcriptomic heterogeneity within one primary tumor and its metastatic lesions 26. This study also shows intratumor genetic heterogeneity because different areas of a single tumor mass carry different mutated genes. One interpretation from these findings is that a majority of genetic changes associated with tumor cells are non-functional as they are passively selected during tumor evolution, which give rise to the appearance of inter- and intratumor heterogeneity. We suggest that integrative genomics through the integration of multiple levels of omics data, such as the approach used in this study, is powerful in identifying key molecular networks that functionally contribute to each tumor type and in minimizing our efforts to chase those passenger events.Taken together, this study focused on the genomic and transcriptomic analysis of tumor cells at the primary tumor site and metastasis. Our results showed that while the SCNA differ between primary tumor and metastasis which appears to be mainly due to passenger effects the gene expression is highly similar. In addition, organ-specific changes that can be linked to HCC prognosis suggest a common prognostic propensity but different tumor biology which most likely requires specific treatment strategies.Fig.S1, Tables S1-S6.Click here for additional data file.
Authors: Rameen Beroukhim; Craig H Mermel; Dale Porter; Guo Wei; Soumya Raychaudhuri; Jerry Donovan; Jordi Barretina; Jesse S Boehm; Jennifer Dobson; Mitsuyoshi Urashima; Kevin T Mc Henry; Reid M Pinchback; Azra H Ligon; Yoon-Jae Cho; Leila Haery; Heidi Greulich; Michael Reich; Wendy Winckler; Michael S Lawrence; Barbara A Weir; Kumiko E Tanaka; Derek Y Chiang; Adam J Bass; Alice Loo; Carter Hoffman; John Prensner; Ted Liefeld; Qing Gao; Derek Yecies; Sabina Signoretti; Elizabeth Maher; Frederic J Kaye; Hidefumi Sasaki; Joel E Tepper; Jonathan A Fletcher; Josep Tabernero; José Baselga; Ming-Sound Tsao; Francesca Demichelis; Mark A Rubin; Pasi A Janne; Mark J Daly; Carmelo Nucera; Ross L Levine; Benjamin L Ebert; Stacey Gabriel; Anil K Rustgi; Cristina R Antonescu; Marc Ladanyi; Anthony Letai; Levi A Garraway; Massimo Loda; David G Beer; Lawrence D True; Aikou Okamoto; Scott L Pomeroy; Samuel Singer; Todd R Golub; Eric S Lander; Gad Getz; William R Sellers; Matthew Meyerson Journal: Nature Date: 2010-02-18 Impact factor: 49.962
Authors: Jittiporn Chaisaingmongkol; Anuradha Budhu; Hien Dang; Siritida Rabibhadana; Benjarath Pupacdi; So Mee Kwon; Marshonna Forgues; Yotsawat Pomyen; Vajarabhongsa Bhudhisawasdi; Nirush Lertprasertsuke; Anon Chotirosniramit; Chawalit Pairojkul; Chirayu U Auewarakul; Thaniya Sricharunrat; Kannika Phornphutkul; Suleeporn Sangrajrang; Maggie Cam; Ping He; Stephen M Hewitt; Kris Ylaya; Xiaolin Wu; Jesper B Andersen; Snorri S Thorgeirsson; Joshua J Waterfall; Yuelin J Zhu; Jennifer Walling; Holly S Stevenson; Daniel Edelman; Paul S Meltzer; Christopher A Loffredo; Natsuko Hama; Tatsuhiro Shibata; Robert H Wiltrout; Curtis C Harris; Chulabhorn Mahidol; Mathuros Ruchirawat; Xin W Wang Journal: Cancer Cell Date: 2017-06-22 Impact factor: 31.743
Authors: Jacinto García; Alberto Orfao; Luis Muñoz-Bellvís; José María Sayagués; María Laura Gutiérrez; Luis Antonio Corchete; María Eugenia Sarasquete; María Del Mar Abad; Oscar Bengoechea; Encarna Fermiñán; María Fernanda Anduaga; Sofía Del Carmen; Manuel Iglesias; Carmen Esteban; María Angoso; Jose Antonio Alcazar Journal: Oncotarget Date: 2017-11-21
Authors: Choon Sen Seah; Shahreen Kasim; Mohd Farhan Md Fudzee; Jeffrey Mark Law Tze Ping; Mohd Saberi Mohamad; Rd Rohmat Saedudin; Mohd Arfian Ismail Journal: Saudi J Biol Sci Date: 2017-11-20 Impact factor: 4.219