| Literature DB >> 25885340 |
Jonas Vikeså1, Anne Kirstine H Møller2, Bogumil Kaczkowski3, Rehannah Borup4, Ole Winther5,6, Ricardo Henao7,8, Anders Krogh9, Katharina Perell10, Flemming Jensen11, Gedske Daugaard12, Finn C Nielsen13.
Abstract
BACKGROUND: Cancers of unknown primary (CUPs) constitute ~5% of all cancers. The tumors have an aggressive biological and clinical behavior. The aim of the present study has been to uncover whether CUPs exhibit distinct molecular features compared to metastases of known origin.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25885340 PMCID: PMC4404593 DOI: 10.1186/s12885-015-1128-x
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Prediction results in CUP patients
| ID | Sex | Biopsy site | Histology | Path Diag. | Stand of Ref | LDA Pred | Outlier score |
|---|---|---|---|---|---|---|---|
| 14. | F | LN neck | PDC | Lung | Lung (CD) | Lung | 975 |
| 17. | F | LN neck | Adenoc. | Lower GI | Colon (RD) | Colon | 746 |
| 22. | M | LN neck | Adenoc. | CUP | Stomach (RD) | Normal | 934 |
| 23. | M | LN retro | PDC | CUP | Kidney (RD) | Kidney | 1085 |
| 28. | F | Peritoneum | PDA | Ovary | Ovary (RD) | Ovary | 810 |
| 31. | F | LN neck | PDA | CUP | Lung (CD) | Stomach | 985 |
| 34 | M | Skin | PDA | Lung | Lung (RD) | Lung | 898 |
| 39. | M | Liver | Adenoc. | CUP | Pancreas (CD) | Pancreas | 1097 |
| 40. | F | Liver | Adenoc. | Colon | Colon (RD) | Colon | 729 |
| 44. | F | Kidney | Carc. | CUP | Bladder (RD) | Bladder | 1286 |
| 49. | M | LN neck | PDA | Kidney | Kidney (RD) | Kidney | 1223 |
| 51. | F | LN pelvis | SCC | CUP-SCC | Cervical (RD) | Cervix | 828 |
| 52. | M | Liver | PDA | CUP | CCC (RD) | CCC | 923 |
| 53. | M | Liver | Adenoc. | Lung | Lung (RD) | Lung | 1047 |
| 57. | M | Liver | Adenoc. | CCC | CCC (RD) | HCC | 965 |
| 66. | F | Liver | PDA | CUP | CCC (RD) | Cervix | 1100 |
| 70. | M | Peritoneum | Adenoc. | Stomach | Stomach (CD) | Colon | 842 |
| 74. | M | Leg | Carc. | Adnex tumor | Adnex tumor (RD) | Normal | 1010 |
| 76. | M | Liver | Adenoc. | Lower GI | Small intestine (RD) | Colon | 912 |
| 77. | M | LN axilla | PDC | CUP | Lung (CD) | Breast | 978 |
| 86. | F | LN axilla | Adenoc. | CUP | Lung (RD) | Stomach | 1108 |
| 88. | F | Peritoneum | Adenoc. | Ovary | Ovary (RD) | Cervix | 1033 |
| 89. | F | Liver | PDA | CCC | CCC (RD) | CCC | 916 |
| 90. | F | Peritoneum | Adenoc | Ovary | Ovary (RD) | Ovary | 781 |
| 92. | M | Liver | Malignant tumor | Angiosarcoma | Angiosarcoma (RD) | Normal | 1097 |
| 95. | M | Peritoneum | PDC | DSRCT | DSRCT (RD) | Breast | 1098 |
| 71 + 72 | M | Bone + Kidney | PDC | Kidney | Kidney (RD) | Kidney | 1096 |
| 1277 | |||||||
| 75 + 87 | F/43 | Liver | PDA | CCC | CCC (RD) | CCC | 925 |
| 1030 | |||||||
|
|
|
|
|
|
|
|
|
| 11. | F | LN neck | PDA | CUP | CUP (SD) | Ovary | 756 |
| 13. | F | Peritoneum | PDA | CUP | CUP (NSD) | Pancreas | 1193 |
| 21. | M | LN neck | PDC | CUP | CUP (NSD) | Breast | 1108 |
| 26. | F | Skin | PDA | CUP | CUP (NSD) | Breast | 971 |
| 32. | M | LN neck | PDSCC | CUP-SCC | CUP (NSD) | Normal | 926 |
| 33. | M | Skin | PDA | CUP | CUP (NSD) | Colon | 1098 |
| 41. | M | Liver | PDA | Pancreas | CUP (NSD) | Stomach | 1040 |
| 42. | M | Liver | Adenoc. | CUP | CUP (NSD) | Pancreas | 994 |
| 43. | F | LN retro | PDA | CUP | CUP (NSD) | Stomach | 797 |
| 45. | M | Liver | PDC | CUP | CUP (NSD) | Colon | 1245 |
| 46. | F | Liver | Adenoc. | CUP | CUP (NSD) | Normal | 1027 |
| 47. | F | Liver | Adenoc. | CUP | CUP (SD) | CCC | 932 |
| 48. | F | LN neck | PDC | CUP | CUP (NSD) | Ovary | 1032 |
| 54. | F | Liver | Adenoc. | CUP | CUP (NSD) | Normal | 1068 |
| 55 | F | Liver | Adenoc. | CUP | CUP (NSD) | Normal | 962 |
| 58. | F | Liver | PDC | CUP | CUP (SD) | CCC | 995 |
| 61. | M | Liver | Carc. | HCC | CUP (NSD) | CCC | 1102 |
| 64. | F | LN inguien | PDA | CUP | CUP (SD) | Lung | 1168 |
| 65. | M | LN neck | PDSCC | CUP-SCC | CUP (NSD) | Breast | 929 |
| 73. | M | LN retro | PDC | CUP | CUP (NSD) | Normal | 1020 |
| 78. | F | Lung | Adenoc. | Lower GI | CUP (NSD) | Lung | 1062 |
| 80. | F | Liver | Adenoc. | CUP | CUP (SD) | CCC | 1111 |
| 81. | F | Liver | PDA | CUP | CUP (NSD) | Breast | 1212 |
| 82. | F | Bone | Adenoc. | CUP | CUP (NSD) | CCC | 1209 |
| 83. | F | Liver | PDA | CUP | CUP (SD) | CCC | 1061 |
| 91. | F | LN axilla | Adenoc. | CUP | CUP (SD) | Lung | 939 |
| 93. | M | Bone | PDSCC | CUP-SCC | CUP (NSD) | Breast | 940 |
| 94. | F | Liver | PDA | CUP | CUP (NSD) | Normal | 984 |
| 50. + 68 | M | Adr gl | PDC | CUP | CUP (NSD) | Stomach | 978 |
| Pancreas | 1079 |
A validation of the LDA predicted diagnoses was performed by comparing with a Standard of Reference (SR). SR was established by an experienced pathologist and two experienced oncologists. In addition to the 23 patients where a primary tumor site was identified (Reference Diagnosis (RD)) within the study period, the Standard of Reference reached a Consensus Diagnosis (CD) in 5 patients based on patient demographics, metastatic pattern, results of clinical and laboratory tests, imaging data and pathologic evaluations (Samples labeled in red). In the 29 remaining CUP labeled in blue, the results from gene expression profiling were compared with clinicopathological features and the predictions were categorized as Supportive (SD) or Non-Supportive (NSD). LN: lymph node; n: neck LN; m: mediastinal LN; a: axilla LN; r: retroperitoneal LN; p: pelvis LN; adr gl: adrenal gland; Adenoc: adenocarcinoma, PDA: poorly differentiated adenocarcinoma; Carc: carcinoma; PDC: poorly differentiated carcinoma; SCC: squamous cell carcinoma; PDSCC: poorly differentiated SCC; CCC: cholangiocarcinoma; HCC: hepatocelluar carcinoma; DSRCT: desmoplastic small round cell tumor.Path Diag: pathological diagnosis; Stand of ref: Standard of reference; LDA pred: Linear discriminant analysis prediction; RD: Reference Diagnosis; CD: Consensus Diagnosis, SD: Supportive Diagnosis; NSD: Non-Supportive Diagnosis.
Figure 1Hierachial cluster and principal component analysis of tumor classes. A. Two-way hierachial cluster of 16 tumor classes by the 641 transcript signature. The tumor classes are shown at the top of the cluster and the transcripts are clustered at the left side. B. Principal component analysis (PCA) of primary tumors and known metastases based on the signature. The tumor classes are colored and indicated in association with the corresponding tumor samples.
Figure 2Patomarkers in primary tumors and CUP. Probeset Ids for 45 common histopatological markers were collected and used to generate a two-way hierarchal cluster with a selection of primary tumors (Panel A) or CUP (Panel B). The variance of the individual markers is shown to the left and the scale is indicated at the top of the clusters. Gene symbols are shown to the right and the different tumor classes are shown below ((Panel A), primary tumors). For the CUP samples (Panel B), groups of markers corresponding to different tumor classes are indicated by the boxes around the gene symbols at the right side of the cluster. The number below the cluster indicated the number of the CUP sample corresponding to the annotation in Table 1.
Figure 3QDA derived outlier scores in CUP. A) To determine the relationship between prediction error and outlier scores the primary cancers and metastases were divided into ten bins according to the outlier scores and the error rate was calculated for each bin. Each point represents the error rate plotted versus the median outlier score of the bin. The vertical lines show the span of outlier scores within the bins. The plot shows that higher outlier score translates into higher error rate. We modeled the relationship between outlier scores and prediction error by fitting polynomial function to the data points (the orange line), and the function allows us to estimate the expected error rate for new samples of unknown origin, once their outlier scores have been determined. B) Samples from CUP patients tend to have higher outlier scores than other cancer patients. The box plot summarizes the distributions of outlier scores within metastases (MET), primary (PRIM) and CUP tumors. There is a clear tendency for CUP samples to have higher outlier score than metastases and primary cancers. The median outlier score of CUP samples of >1000 suggests the origin prediction error above 30%. On the other hand, most primary cancers and metastases have outlier scores below 800, hence the estimated prediction error from 2-10% (see panel A). Since data for CUP and some primary tumors and metastases were generated at Rigshospitalet, the non-CUP samples from Rigshospitalet are presented as separate group (RH_MET and RH-PRIM), this is to show that the shift in outlier scores was not caused by technical bias. Additionally, the normal, non-cancerous tissue group (NORMAL) is included, and shows the whole range of outlier scores.
Enriched or depleted gene sets in CUPs compared to metastases of known origin
| Up-regulated in CUP | ||||
|---|---|---|---|---|
| Gene Set name | Transcripts | Overlap | k/K | p value |
|
| ||||
| Genes constituting the BRCA1-PCC network of transcripts whose expression positively correlated (Pearson correlation coefficient, PCC > = 0.4) with that of BRCA1 | 1671 | 159 | 0.0952 | 0.00E + 00 |
|
| ||||
| Genes up-regulated in TC71 and EWS502 cells (Ewing’s sarcoma) upon knockdown of theEWSR1-FLII fusion | 1281 | 133 | 0.1038 | 0.00E + 00 |
|
| ||||
| Genes constituting the ATM-PCC network of transcripts whose expression positively correlated (Pearson correlation coefficient, PCC > = 0.4) with that of ATM | 1461 | 152 | 0.104 | 0.00E + 00 |
|
| ||||
| Genes constituting the CHEK2-PCC network of transcripts whose expression positively correlates (Pearson correlation coefficient, PCC > 0.4) with that of CHEK2 | 782 | 89 | 0.1138 | 0.00E + 00 |
|
| ||||
| Genes down-regulated in nasopharyngeal carcinoma (NPC) compared to the normal tissue. | 1375 | 157 | 0.1142 | 0.00E+00 |
|
| ||||
| Genes up-regulated in anaplastic thyroid carcinoma (ATC) compared to normal tissue. | 721 | 93 | 0.129 | 0.00+00 |
|
| ||||
| Transcripts enriched in pseudopodia of NIH/3T3 cells (fibroblast) in response to haptotactic migratory stimulus by fibronectin, FN1 | 552 | 74 | 0.1341 | 0.00E+00 |
|
| ||||
| Genes up-regulated in poorly diffrentiated thyroid carcinoma (PDTC) compared to normal thyroid tissue. | 640 | 94 | 0.1469 | 0.00E+00 |
|
| ||||
| Genes down-regulated transcripts in fibrolasts expressing ethier XP/CS or TDD mutant forms of ERCC3 [Gene ID=2071], after UVC irradiation | 855 | 126 | 0.1474 | 0.00E+00 |
|
| ||||
| Common down -regulated transcripts in fibroblasts expressing either XP/CS orTDD mutant forms of ERCC3 [Gene ID=2071], after UVC irradiation | 420 | 64 | 0.1524 | 0.00E+00 |
|
| ||||
| Common down-regulated in blood samples from bladder cancer patients | 402 | 57 | 0.1418 | 5.55E-16 |
|
| ||||
| Genes up-regulated in nasopharyngeal carcinoma (NPC) positive for LMP1 [Gene ID=9260], a latent gene of Epstein Barr virus (EBV) | 399 | 56 | 0.1404 | 155E-15 |
|
| ||||
| Genes up-regulated in nsopharyngeal carcinoma relative to the normal tissue. | 286 | 46 | 0.1608 | 3.33E+15 |
|
| ||||
| Genes constituting the XPRSS-Int network: intersection of genes whose expression correlates with BRCA1, BRCA2, ATM, and CHEK2 [Gene ID=672, 675, 472, 11200] in a compendium of normal tissues. | 167 | 34 | 0.2036 | 1.21E-14 |
|
| ||||
|
|
|
|
|
|
|
| ||||
| Genes whose expression negatively correlated with that of SMARCA2 [GeneID=6595] in prostate cancer samples | 360 | 73 | 032028 | 0.00E+00 |
|
| ||||
| Genes doen-regulated in non-metastic breast cancer tumors having type 1 amplifications in the 20q13 region; involves ZNF217 [Gene ID=7764] locus only. | 336 | 49 | 0.1458 | 7.71E-11 |
Gene set enrichments among up or down regulated mRNAs in the CUP core set were examined in the molecular signatures database (MSig) among the C2 curated gene sets comprising profiles from chemical and genetic perturbations, canonical pathways, BIOCARTA, KEGG and the reactome collections. The uncorrected p values are indicated. In all cases the false positive discovery rate was set to q < 0.01.
Figure 4Two way hierachial clusters of BRCA1 and SMARCA2 networks in metastases and CUP. (A) The PUJANA_BRCA1_PCC_NETWORK was downloaded from the MSig database (http://www.broadinstitute.org/gsea/msigdb) and used to generate a paired two way hierarchical cluster with known metastases and CUPs. Gene symbols were translated into probe sets and because of the probe set redundancy the data were filtered by a p < 0.001 before clustering. Following filtering 1297 probe sets were included in the clustering. Known metastases are indicated in green and CUP samples are labeled with pink above the cluster. The scale is shown at the right side of the cluster. (B) Two-way cluster of the SHEN_SMARCA2_ TARGETS up- and (C) downregulated transcripts. The sets consists of 360 down- and 430 up-regulated genes that translated into 772 and 1211 probe sets, respectively. The known metastases are indicated in pink and CUP samples are labeled with green below the cluster. The scale is shown at the right side of the cluster.
Figure 5Signatures of genomic instability in CUPs. Messenger RNA signatures of chromosomal instability (CIN), DNA double-strand break repair, nucleotide excision repair (NER), base excision repair (BER) and mismatch repair (MMR) in CUPs and MOKO were examined with the Broad Institute GSEA v 2 software. The names of the individual signatures, the number of transcripts and the normalized enrichment scores (NES) are indicated. The right panel depicts the transcript ranking on a colometric scale. With the exception of the CIN signature obtained from [10] all gene lists were retrieved from http://www.broadinstitute.org/gsea/msigdb.
Figure 6Chromosomal instability and outlier scores in CUP. Panel A. Two-way hierachial cluster of the Ferreira_Ewing_Sarcoma_Unstable signature in MOKO and CUPs. MOKO and CUPs are indicated by green and pink labels, respectively. Panel B. Instability scores in Normal tissues, Primary tumors, MOKO and CUPs. The signature of chromosome unstable sarcoma was employed to generate an instability score calculated as the mean of the expression values from the included probe sets of the signature following variance filtering (206 probe sets). Panel C. Linear correlation between outlier scores and instability scores. Primary tumors, metastasis of known origin and CUP are indicated as black, red or green dots, respectively. The lower panel shows the value of the instability scores depicted in a green to red color scale. The p-value of the linear correlation between outlier and instability scores is indicated.