| Literature DB >> 29248889 |
Marcos Tadeu Dos Santos1,2, Bruno Feres de Souza3, Flavio Mavignier Cárcano4, Ramon de Oliveira Vidal2,5, Cristovam Scapulatempo-Neto5, Cristiano Ribeiro Viana5, Andre Lopes Carvalho5.
Abstract
AIMS: Cancers of unknown primary sites account for 3%-5% of all malignant neoplasms. Current diagnostic workflows based on immunohistochemistry and imaging tests have low accuracy and are highly subjective. We aim to develop and validate a gene-expression classifier to identify potential primary sites for metastatic cancers more accurately.Entities:
Keywords: cancer of unknown primary site; metastasis; molecular pathology
Mesh:
Substances:
Year: 2017 PMID: 29248889 PMCID: PMC6204949 DOI: 10.1136/jclinpath-2017-204887
Source DB: PubMed Journal: J Clin Pathol ISSN: 0021-9746 Impact factor: 3.411
Figure 1Flow chart of the necessary steps to obtain the final version of our Reference Database (RefDB) and the metastatic formalin-fixed, paraffin-embedded (FFPE) sample set for validation. Numbers in parentheses represent the step number until reaching the classification of the metastatic FFPE samples against the RefDB by the gene-expression classifier. QC, quality control.
The Reference Database composition
| Tumour superclasses (25) | Tumour subclasses used | Samples (4429) |
| Adrenal | Adrenocortical carcinoma | 68 |
| Breast | Ductal carcinoma | 142 |
| Inflammatory | ||
| Lobular carcinoma | ||
| Gastro-oesophageal | Adenocarcinoma—oesophagus | 78 |
| Adenocarcinoma—stomach | ||
| Germ cell—non-seminomatous | Mixed germ cells | 133 |
| Yolk sac cells | ||
| Teratoma of testis/ovary | ||
| Germ cell—seminomatous | Seminoma/dysgerminoma | 42 |
| GIST | Gastrointestinal stromal tumour | 54 |
| Head and neck (salivary gland) | Adenoid cystic carcinoma—salivary gland | 22 |
| Intestine | Colorectal adenocarcinoma | 212 |
| Kidney | Oncocytoma | 360 |
| Renal cell carcinoma—clear cell | ||
| Renal cell carcinoma—cromophobe | ||
| Renal cell carcinoma—papillary | ||
| Liver | Hepatocellular carcinoma | 139 |
| Lung adenocarcinoma/large cell carcinoma | Lung adenocarcinoma | 327 |
| Large cell/bronchoalveolar | ||
| Lung small cell carcinoma | Small cell carcinoma | 54 |
| Lymphoma | Hodgkin | 151 |
| Large B cell diffuse | ||
| Peripheral T cell | ||
| Melanoma | Uveal | 85 |
| Non-uveal | ||
| Mesothelioma | Mesothelioma | 102 |
| Neuroendocrine | Pheochromocytoma/paraganglioma | 334 |
| Lung carcinoid | ||
| Merkel cell carcinoma | ||
| Ovary | Clear cell adenocarcinoma | 626 |
| Endometrioid adenocarcinoma | ||
| Mucinous adenocarcinoma | ||
| Papillary serous adenocarcinoma | ||
| Serous adenocarcinoma | ||
| Serous or papillary serous carcinoma | ||
| Pancreas | Pancreatic ductal adenocarcinoma | 107 |
| Cholangiocarcinoma | ||
| Prostate | Prostate adenocarcinoma | 119 |
| Sarcoma | Condrosarcoma | 130 |
| Leiomyosarcoma | ||
| Liposarcoma/myxoid liposarcoma | ||
| Malignant fibrous histiocytoma/myxofibrosarcoma | ||
| Synovial sarcoma biphasic or | ||
| Osteosarcoma | ||
| Primitive neuroectodermal/Ewing sarcoma | ||
| Squamous cell carcinoma | Uterus cervix squamous cell carcinoma | 460 |
| Lung squamous cell carcinoma | ||
| Head and neck squamous cell carcinoma | ||
| Oesophagus squamous cell carcinoma | ||
| Thymus | Thymoma | 36 |
| Thyroid | Follicular carcinoma | 230 |
| Papillary carcinoma | ||
| Hurthle cell or anaplastic carcinoma | ||
| Urinary (bladder) | Transitional cell carcinoma | 144 |
| Urothelial adenocarcinoma | ||
| Uterus | Cervix adenocarcinoma | 274 |
| Endometrium endometrioid carcinoma |
The 25 tumour superclasses are composed by 58 tumour subclasses from 4429 samples, which were obtained from 100 different experiments available from the ArrayExpress online platform.
Algorithm performance
| RefDB—10-fold cross-validation (4429 samples) | Metastatic FFPE sample set (real-time PCR vs RefDB—105 samples) | |||||||||||
| Sensitivity | Specificity | Sensitivity | Specificity | |||||||||
| Tumour superclass | Ratio | Positive % agreement | 95% CI | Ratio | Negative % agreement | 95% Cl | Ratio | Positive % agreement | 95% Cl | Ratio | Negative % agreement | 95% Cl |
| Adrenal | 64/68 | 94.12 | 85.62 to 98.37 | 4357/4361 | 99.91 | 99.77 to 99.98 | 0/4 | 0.00 | 0.00 to 0.00 | 97/101 | 96.04 | 90.17 to 98.91 |
| Breast | 104/142 | 73.24 | 65.17 to 80.32 | 4249/4287 | 99.11 | 98.79 to 99.37 | 12/12 | 100.00 | 73.53 to 100.00 | 93/93 | 100.00 | 96.52 to 100.00 |
| Gastro-oesophageal | 61/78 | 78.21 | 67.41 to 86.76 | 4334/4351 | 99.61 | 99.37 to 99.77 | 11/14 | 78.57 | 49.20 to 95.34 | 88/91 | 96.70 | 90.67 to 99.31 |
| Germ cell non-seminomatous | 121/133 | 90.98 | 84.77 to 95.25 | 4284/4296 | 99.72 | 99.51 to 99.86 | – | – | – | – | – | – |
| Germ cell seminomatous | 34/42 | 80.95 | 65.88 to 91.40 | 4382/4387 | 99.89 | 99.73 to 99.96 | – | – | – | – | – | – |
| Gatrointestinal stromal tumour | 53/54 | 98.15 | 90.11 to 99.95 | 4374/4375 | 99.98 | 99.87 to 100.00 | 0/1 | 0.00 | 0.00 to 0.00 | 103/104 | 99.04 | 94.76 to 99.98 |
| Head and neck (salivary gland) | 14/22 | 63.64 | 40.66 to 82.80 | 4399/4407 | 99.82 | 99.64 to 99.92 | 0/1 | 0.00 | 0.00 to 0.00 | 103/104 | 99.04 | 94.76 to 99.98 |
| Intestine (colorectal adenocarcinoma) | 210/212 | 99.06 | 96.63 to 99.89 | 4215/4217 | 99.95 | 99.83 to 99.99 | 13/13 | 100.00 | 75.29 to 100.00 | 92/92 | 100.00 | 96.07 to 100.00 |
| Kidney | 346/360 | 96.11 | 93.56 to 97.86 | 4055/4069 | 99.66 | 99.42 to 99.81 | 3/4 | 75.00 | 19.41 to 99.37 | 100/101 | 99.01 | 94.61 to 99.98 |
| Liver | 120/139 | 86.33 | 79.48 to 91.57 | 4271/4290 | 99.56 | 99.31 to 99.73 | 1/1 | 100.00 | 2.50 to 100.00 | 104/104 | 100.00 | 96.52 to 100.00 |
| Lung adenocarcinoma/large cell carcinoma | 302/327 | 92.35 | 88.92 to 94.99 | 4077/4102 | 99.39 | 99.10 to 99.61 | – | – | – | – | – | – |
| Lung small cell carcinoma | 27/54 | 50.00 | 36.08 to 63.92 | 4348/4375 | 99.38 | 99.10 to 99.59 | – | – | – | – | – | – |
| Lymphoma | 136/151 | 90.07 | 84.15 to 94.33 | 4263/4278 | 99.65 | 99.42 to 99.80 | 2/2 | 100.00 | 15.81 to 100.00 | 103/103 | 100.00 | 96.48 to 100.00 |
| Melanoma | 71/85 | 83.53 | 73.91 to 90.69 | 4330/4344 | 99.68 | 99.46 to 99.82 | 12/12 | 100.00 | 73.53 to 100.00 | 93/93 | 100.00 | 96.11 to 100.00 |
| Mesothelioma | 90/102 | 88.24 | 80.35 to 93.77 | 4315/4327 | 99.72 | 99.52 to 99.86 | – | – | – | – | – | – |
| Neuroendocrine | 326/334 | 97.60 | 95.34 to 98.96 | 4087/4095 | 99.80 | 99.62 to 99.92 | 1/2 | 50.00 | 1.26 to 98.74 | 102/103 | 99.03 | 94.71 to 99.98 |
| Ovary | 519/626 | 82.91 | 79.73 to 85.77 | 3696/3803 | 97.19 | 96.61 to 97.69 | 4/4 | 100.00 | 39.76 to 100.00 | 101/101 | 100.00 | 96.41 to 100.00 |
| Pancreas | 79/107 | 73.83 | 64.45 to 81.85 | 4294/4322 | 99.35 | 99.07 to 99.57 | 0/2 | 0.00 | 0.00 to 0.00 | 101/103 | 98.06 | 93.16 to 99.76 |
| Prostate | 112/119 | 94.12 | 88.26 to 97.60 | 4303/4310 | 99.84 | 99.67 to 99.93 | 1/1 | 100.00 | 2.50 to 100.00 | 104/104 | 100.00 | 96.52 to 100.00 |
| Sarcoma | 105/130 | 80.77 | 72.93 to 87.15 | 4274/4299 | 99.42 | 99.14 to 99.62 | 8/10 | 80.00 | 44.39 to 97.48 | 93/95 | 97.89 | 92.60 to 99.74 |
| Squamous cell carcinomas | 422/460 | 91.74 | 88.84 to 94.09 | 3931/3969 | 99.04 | 98.69 to 99.32 | 13/14 | 92.86 | 66.13 to 99.82 | 90/91 | 98.90 | 94.03 to 99.97 |
| Thymus | 34/36 | 94.44 | 81.34 to 99.32 | 4391/4393 | 99.95 | 99.84 to 99.99 | – | – | – | – | – | – |
| Thyroid | 205/230 | 89.13 | 84.37 to 92.84 | 4174/4199 | 99.40 | 99.12 to 99.61 | 4/5 | 80.00 | 28.36 to 99.49 | 99/100 | 99.00 | 94.55 to 99.98 |
| Urinary (bladder) | 130/144 | 90.28 | 84.23 to 94.58 | 4271/4285 | 99.67 | 99.45 to 99.82 | – | – | – | – | – | – |
| Uterus | 147/274 | 53.65 | 47.55 to 59.67 | 4028/4155 | 96.94 | 96.37 to 97.45 | 3/3 | 100.00 | 29.24 to 100.00 | 102/102 | 100.00 | 96.45 to 100.00 |
| Overall | 3835/4429 | 86.59 | 85.55 to 87.58 | NA | 99.43 | 99.18 to 99.60 | 88/105 | 83.81 | 75.35 to 90.28 | NA | 99.04 | 94.73 to 99.87 |
Sensitivity and specificity of the gene-expression classifier on the RefDB itself by 10-fold cross-validation and on the metastatic FFPE samples from the validation set. See also at online supplementary tables S3, S4, S5 and S6.
FFPE, formalin-fixed, paraffin-embedded; NA, not available; RefDB, Reference Database.
Figure 2Quality control parameters. (A) Box plot of the cycle threshold dispersion range of the three normalizer genes and the three quality control (QC) genes used to determine the QC parameters for the metastatic formalin-fixed, paraffin-embedded samples from the validation set. (B) Box plot of dispersion ranges applied as QC parameters for the Reference Database based on the three QC genes used to calculate the AARRAY, BARRAY and CARRAY correlation values. The whiskers represents the lower and upper fences. (N), normaliser gene; Q1 and Q3, 1st and 3rd quartiles.
Reproducibility of the gene-expression classifier
| First classification | Second classification | Third classification | ||||||
| Sample # (ref. diagnosis —gender) | TLDA card lot | TLDA card # | Tumour superclass | Probability (%) | Tumour superclass | Probability (%) | Tumour superclass | Probability (%) |
| Sample #19 | 1 | 1 | Liver* | 29.6 | Prostate | 29.2 | Gastro-oesophageal | 25.0 |
| 2 | 2 | Liver* | 48.2 | Lung—AC/LCC | 18.5 | Gastro-oesophageal | 17.1 | |
| 3 | Liver* | 36.4 | Gastro-oesophageal | 25.7 | Prostate | 21.7 | ||
| Prostate | 36.9 | Liver* | 27.6 | Gastro-oesophageal | 19.3 | |||
| 3 | 4 | Liver* | 39.5 | Gastro-oesophageal | 23.5 | Lung—AC/LCC | 20.8 | |
| 6 | Liver* | 47.4 | Gastro-oesophageal | 19.8 | Lung—AC/LCC | 16.7 | ||
| Liver* | 40.4 | Prostate | 26.4 | Gastro-oesophageal | 17.0 | |||
| Liver* | 45.4 | Gastro-oesophageal | 19.9 | Prostate | 18.5 | |||
| Liver* | 36.4 | Lung—AC/LCC | 29.4 | Gastro-oesophageal | 18.0 | |||
| Sample #52 (ovary—female) | 1 | 1 | Ovary* | 49.0 | Uterus | 26.2 | Kidney | 8.6 |
| 2 | 2 | Ovary* | 47.6 | Uterus | 24.6 | Squamous CC | 11.6 | |
| 3 | Ovary* | 45.3 | Uterus | 28.9 | Squamous CC | 9.7 | ||
| Ovary* | 40.4 | Uterus | 23.1 | Melanoma | 20.2 | |||
| 3 | 4 | Ovary* | 47.2 | Uterus | 31.8 | Squamous CC | 4.8 | |
| 7 | Ovary* | 45.3 | Uterus | 24.6 | Urinary (bladder) | 13.9 | ||
| Ovary* | 51.1 | Uterus | 21.9 | Melanoma | 10.8 | |||
| Ovary* | 48.9 | Uterus | 25.3 | Sarcoma | 9.7 | |||
| Ovary* | 51.2 | Uterus | 25.3 | Sarcoma | 7.3 | |||
| Sample #58 (thyroid—male) | 1 | 1 | Thyroid* | 53.4 | Thymus | 18.3 | Lymphoma | 12.2 |
| 2 | 2 | Thyroid* | 52.6 | Melanoma | 16.0 | Gastro-oesophageal | 15.2 | |
| 3 | 4 | Thyroid* | 66.7 | Squamous CC/thymus | 8.6 | Lung—AC/LCC | 8.4 | |
| 5 | Thyroid* | 59.9 | Thymus | 13.6 | Lymphoma | 10.3 | ||
| Thyroid* | 45.3 | Gastro-oesophageal | 21.8 | Kidney | 16.8 | |||
| 8 | Thyroid* | 61.1 | Gastro-oesophageal | 11.5 | Thymus | 11.3 | ||
| Thyroid* | 60.8 | Thymus | 14.2 | Lymphoma | 8.7 | |||
| Thyroid* | 44.8 | Gastro-oesophageal | 20.8 | Melanoma | 18.3 | |||
| Thyroid* | 51.3 | Gastro-oesophageal | 18.6 | Squamous CC | 14.0 | |||
| Sample #56 (kidney—female) | 1 | 1 | Kidney* | 43.1 | Ovary | 21.7 | Uterus | 19.0 |
| 2 | 2 | Ovary | 40.7 | Uterus | 24.0 | Kidney* | 19.1 | |
| 3 | 4 | Kidney* | 35.9 | Uterus | 24.3 | Ovary | 23.6 | |
| 5 | Kidney* | 45.3 | Ovary | 20.5 | Uterus | 17.9 | ||
| Kidney* | 52.5 | Uterus | 15.8 | Ovary | 15.5 | |||
| 9 | Kidney* | 47.8 | Ovary | 18.7 | Squamous CC | 17.3 | ||
| Squamous CC | 28.7 | Ovary | 28.4 | Kidney*/breast | 26.7 | |||
| Squamous CC | 41.4 | Ovary | 21.8 | Urinary (bladder) | 20.6 | |||
| Ovary | 36.9 | Kidney* | 23.9 | Squamous CC | 22.9 | |||
*Indicates the correct classifications. See also online supplementary table S6.
AC, adenocarcinoma; CC, cell carcinoma; LCC, large cell carcinoma; TLDA card lot, different lots of low-density array cards customisation provided by the manufacturer.