| Literature DB >> 28574441 |
Bartosz Wojtas1,2, Aleksandra Pfeifer3,4, Malgorzata Oczko-Wojciechowska5, Jolanta Krajewska6, Agnieszka Czarniecka7, Aleksandra Kukulska8, Markus Eszlinger9, Thomas Musholt10, Tomasz Stokowy11,12,13, Michal Swierniak14,15, Ewa Stobiecka16, Ewa Chmielik17, Dagmara Rusinek18, Tomasz Tyszkiewicz19, Monika Halczok20, Steffen Hauptmann21, Dariusz Lange22, Michal Jarzab23, Ralf Paschke24, Barbara Jarzab25.
Abstract
Distinguishing between follicular thyroid cancer (FTC) and follicular thyroid adenoma (FTA) constitutes a long-standing diagnostic problem resulting in equivocal histopathological diagnoses. There is therefore a need for additional molecular markers. To identify molecular differences between FTC and FTA, we analyzed the gene expression microarray data of 52 follicular neoplasms. We also performed a meta-analysis involving 14 studies employing high throughput methods (365 follicular neoplasms analyzed). Based on these two analyses, we selected 18 genes differentially expressed between FTA and FTC. We validated them by quantitative real-time polymerase chain reaction (qRT-PCR) in an independent set of 71 follicular neoplasms from formaldehyde-fixed paraffin embedded (FFPE) tissue material. We confirmed differential expression for 7 genes (CPQ, PLVAP, TFF3, ACVRL1, ZFYVE21, FAM189A2, and CLEC3B). Finally, we created a classifier that distinguished between FTC and FTA with an accuracy of 78%, sensitivity of 76%, and specificity of 80%, based on the expression of 4 genes (CPQ, PLVAP, TFF3, ACVRL1). In our study, we have demonstrated that meta-analysis is a valuable method for selecting possible molecular markers. Based on our results, we conclude that there might exist a plausible limit of gene classifier accuracy of approximately 80%, when follicular tumors are discriminated based on formalin-fixed postoperative material.Entities:
Keywords: follicular thyroid adenoma; follicular thyroid cancer; gene expression; meta-analysis; microarray
Mesh:
Substances:
Year: 2017 PMID: 28574441 PMCID: PMC5486007 DOI: 10.3390/ijms18061184
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Presentation of a study scheme.
Differentially expressed genes selected based on analysis of our own microarray dataset.
| No. | Gene Symbol | Gene Name | Affy ID | Primary Dataset | Primary Dataset—Evaluation of Non-Oncocytic Samples Only | |||
|---|---|---|---|---|---|---|---|---|
| FDR Corrected | Mean Expression in FTC | Mean Expression in FTA | Fold-Change | FDR Corrected | ||||
| 1 | activin A receptor type II-like 1 | 226950_at | 0.07 | 5.52 | 7.02 | 0.35 | 0.12 | |
| 2 | C-type lectin domain family 3, member B | 205200_at | 0.08 | 7.54 | 9.52 | 0.25 | 0.13 | |
| 3 | GABA(A) receptor-associated protein-like 2 | 209046_s_at | 0.08 | 11.05 | 11.84 | 0.58 | 0.15 | |
| 4 | zinc finger, FYVE domain containing 21 | 219929_s_at | 0.07 | 7.39 | 8.67 | 0.41 | 0.04 | |
| 5 | LIM domain kinase 2 | 217475_s_at | 0.07 | 4.32 | 5.84 | 0.35 | 0.12 | |
| 6 | zinc finger, MYND domain containing 11 | 1554159_a_at | 0.10 | 6.60 | 8.05 | 0.37 | 0.15 | |
| 7 | DIP2 disco-interacting protein 2 homolog B (Drosophila) | 224872_at | 0.11 | 8.23 | 7.40 | 1.78 | 0.16 | |
| 8 | v-maf musculoaponeurotic fibrosarcoma oncogene homolog B (avian) | 222670_s_at | 0.08 | 8.23 | 9.78 | 0.34 | 0.13 | |
The genes were selected for validation from the genes differentially expressed both in primary and secondary microarray set. Values represented in the table are from analysis of the primary microarray data set.
The results of a meta-analysis of 14 papers, in which differences in gene expression profile between follicular thyroid cancers (FTC) and follicular thyroid adenomas (FTA) were assessed by a high throughput method. Ten genes (highlighted in bold) were selected for our qRT-PCR validation.
| No. | Entrez Gene ID | Symbol | Name | Number of Papers | References | Gene Regulation |
|---|---|---|---|---|---|---|
| 1 | 762 | carbonic anhydrase IV | 3 | [ | down | |
| 4 | 7881 | potassium voltage-gated channel, shaker-related subfamily, beta member 1 | 3 | [ | down | |
| 8 | 185 | angiotensin II receptor, type 1 | 2 | [ | down | |
| 9 | 822 | capping protein (actin filament), gelsolin-like | 2 | [ | down | |
| 10 | 1306 | collagen, type XV, alpha 1 | 2 | [ | down | |
| 11 | 1363 | carboxypeptidase E | 2 | [ | down | |
| 12 | 3491 | cysteine-rich, angiogenic inducer, 61 | 2 | [ | down | |
| 13 | 1733 | deiodinase, iodothyronine, type I | 2 | [ | down | |
| 14 | 11072 | dual specificity phosphatase 14 | 2 | [ | down | |
| 15 | 129080 | EMI domain containing 1 | 2 | [ | down | |
| 16 | 953 | ectonucleoside triphosphate diphosphohydrolase 1 | 2 | [ | down | |
| 17 | 8857 | Fc fragment of IgG binding protein | 2 | [ | down | |
| 18 | 2354 | FBJ murine osteosarcoma viral oncogene homolog B | 2 | [ | down | |
| 19 | 2697 | gap junction protein, alpha 1, 43 kDa | 2 | [ | down | |
| 20 | 55830 | glycosyltransferase 8 domain containing 1 | 2 | [ | down | |
| 21 | 221395 | G protein-coupled receptor 116 | 2 | [ | down | |
| 22 | 3043 | hemoglobin, beta | 2 | [ | down | |
| 23 | 3309 | heat shock 70 kDa protein 5 (glucose-regulated protein, 78 kDa) | 2 | [ | down | |
| 24 | 3400 | inhibitor of DNA binding 4, dominant negative helix-loop-helix protein | 2 | [ | down | |
| 25 | 3590 | interleukin 11 receptor, alpha | 2 | [ | down | |
| 26 | 9452 | integral membrane protein 2A | 2 | [ | down | |
| 27 | 3708 | inositol 1,4,5-trisphosphate receptor, type 1 | 2 | [ | down | |
| 28 | 3725 | jun proto-oncogene | 2 | [ | down | |
| 29 | 3912 | laminin, beta 1 | 2 | [ | down | |
| 30 | 744 | metallophosphoesterase domain containing 2 | 2 | [ | down | |
| 31 | 22795 | nidogen 2 (osteonidogen) | 2 | [ | down | |
| 32 | 3164 | nuclear receptor subfamily 4, group A, member 1 | 2 | [ | down | |
| 33 | 22925 | phospholipase A2 receptor 1, 180 kDa | 2 | [ | down | |
| 35 | 5583 | protein kinase C, eta | 2 | [ | down | |
| 36 | 23180 | raftlin, lipid raft linker 1 | 2 | [ | down | |
| 37 | 8490 | regulator of G-protein signaling 5 | 2 | [ | down | |
| 38 | 6414 | selenoprotein P, plasma, 1 | 2 | [ | down | |
| 39 | 7038 | Thyroglobulin | 2 | [ | down | |
| 40 | 4982 | tumor necrosis factor receptor superfamily, member 11b | 2 | [ | down | |
| 41 | 7173 | thyroid peroxidase | 2 | [ | down | |
| 43 | 771 | carbonic anhydrase XII | 2 | [ | up | |
| 46 | 2358 | formyl peptide receptor 2 | 2 | [ | up | |
| 48 | 2896 | Granulin | 2 | [ | up | |
| 49 | 3486 | insulin-like growth factor binding protein 3 | 2 | [ | up | |
| 50 | 23089 | paternally expressed 10 | 2 | [ | up |
Table 2 shows the Entrez ID, gene symbol, gene name, number of papers in which a particular gene occurs, references to the papers, regulation direction (up–up-regulated in FTC; down–down-regulated in FTC).
Figure 2Principal component analysis (PCA) results. PCA plots of samples from our own microarray dataset, based on genes selected in the meta-analysis that occurred in at least one paper (upper plot) or at least two papers (lower plot).
Comparison of gene expression between FTC (29 samples) and FTA (40 samples) in qRT-PCR dataset (t-test and two-way ANOVA calculated p-values corrected for multiple tests by FDR method). FDR corrected p-values below 0.05 are highlighted in bold.
| No. | Gene | Gene Selection | Fold Change (FTC/FTA) | Two-Way ANOVA—FDR Corrected | |
|---|---|---|---|---|---|
| 1 | Microarrays | 0.58 | |||
| 2 | Microarrays | 0.69 | |||
| 3 | Microarrays | 0.75 | |||
| 4 | Microarrays | 0.068 | 0.81 | 0.17 | |
| 5 | Microarrays | 0.093 | 0.79 | 0.17 | |
| 6 | Microarrays | 0.23 | 0.86 | ||
| 7 | Microarrays | 0.44 | 0.89 | 0.56 | |
| 8 | Microarrays | Amplification not possible in FFPE samples | |||
| 9 | Meta-analysis | 0.49 | |||
| 10 | Meta-analysis | 0.51 | |||
| 11 | Meta-analysis | 0.48 | |||
| 12 | Meta-analysis | 0.68 | |||
| 13 | Meta-analysis | 0.058 | 1.49 | 0.99 | |
| 14 | Meta-analysis | 0.69 | 1.07 | 0.94 | |
| 15 | Meta-analysis | 0.90 | 1.02 | 0.17 | |
| 16 | Meta-analysis | 0.90 | 0.97 | 0.89 | |
| 17 | Meta-analysis | Amplification not possible in FFPE samples | |||
| 18 | Meta-analysis | Amplification not possible in FFPE samples | |||
Two-way analysis of variance (ANOVA) was used in order to adjust for oncocytic feature and the results are shown in column “two-way ANOVA—FDR corrected p-value”.
Figure 3The normalized relative expression levels of positively validated genes in the FFPE dataset of 69 samples. Boxplots superimposed with scatterplots are shown. The line inside each box corresponds to median. Upper and lower edges of boxes correspond to first (Q1) and third (Q3) quartiles, respectively. The whiskers extend to smallest and largest observations within 1.5 times interquartile range (IQR) from the box. Black dots represent RAS mutation carrying samples, and grey dots represent samples without RAS mutation.
Figure 4Receiver operating characteristc (ROC) curve analysis for the predictive power of 4-gene classifier, estimated in qRT-PCR dataset. Using a cutoff probability of 50% (marked with black dot), we obtained sensitivity of 76% and specificity of 80%. The calculated area under the ROC curve was 0.84.
Fresh-frozen material used for microarray analysis.
| Set | Histotype | Samples | % of Men | Median Age (Years) | Frequency of Oncocytic Tumours | Concordance of Pathologic Diagnosis by 2 Experts |
|---|---|---|---|---|---|---|
| Primary set | FTC | 13 | 38.5% | 66 | 46.2% | 100% |
| FTA | 13 | 0% | 42 | 15.4% | 100% | |
| Secondary set | FTC | 14 | 21.4% | 69 | 7.1% | 28.6% |
| FTA | 12 | 25% | 49.5 | 0% | 75% | |
| Total | - | 52 | 21.2% | 60.5 | 17.3% | 75% |
FFPE material used for qRT-PCR validation.
| Histopathological Diagnosis | Number of Samples | % of Men | Median Age (Years) | Frequency of Oncocytic Tumours |
|---|---|---|---|---|
| FTC | 31 | 32.3% | 59 | 61.3% |
| FTA | 40 | 12.5% | 45 | 15% |
| Total | 71 | 21.1% | 52 | 35.2% |