| Literature DB >> 24099521 |
Aleksandra Pfeifer1, Bartosz Wojtas, Malgorzata Oczko-Wojciechowska, Aleksandra Kukulska, Agnieszka Czarniecka, Markus Eszlinger, Thomas Musholt, Tomasz Stokowy, Michal Swierniak, Ewa Stobiecka, Dagmara Rusinek, Tomasz Tyszkiewicz, Monika Kowal, Michal Jarzab, Steffen Hauptmann, Dariusz Lange, Ralf Paschke, Barbara Jarzab.
Abstract
BACKGROUND: Differential diagnosis between malignant follicular thyroid cancer (FTC) and benign follicular thyroid adenoma (FTA) is a great challenge for even an experienced pathologist and requires special effort. Molecular markers may potentially support a differential diagnosis between FTC and FTA in postoperative specimens. The purpose of this study was to derive molecular support for differential post-operative diagnosis, in the form of a simple multigene mRNA-based classifier that would differentiate between FTC and FTA tissue samples.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24099521 PMCID: PMC3852913 DOI: 10.1186/1755-8794-6-38
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Description of datasets analyzed in the study (see Additional filefor detailed information)
| Borup | 18 | 22 | [ |
| Training microarray dataset B | 13 | 13 | this study |
| Validation qPCR dataset C | 31 | 40 | this study |
| Validation microarray dataset D | 14 | 12 | this study |
| Weber | 12 | 12 | [ |
| Hinsch | 8 | 4 | [ |
| Total | 96 | 103 |
Figure 1Analysis pipeline. First, datasets were collected. Second, the micorarray dataset A was analyzed and 99 genes were selected. Third, the microarray dataset B was analyzed for further selection of 8 genes and classifier cross-validation. Next, qPCR dataset C was analyzed in order to validate the classifier. Finally, public datasets D, E1 and E2 were analysed to test the classifier.
Performance measures of classifiers in different datasets
| B (own, microarray) | DLDA classification based on the 8 best genes chosen from 99 preselected ones.* | 80 | 82 | 78 | 76 | 83 | 50 |
| DLDA classification based on 45 (optimal number) best genes chosen from 99 preselected ones.* | 84 | 85 | 83 | 83 | 85 | 50 | |
| 5-gene DLDA classification (cut-off 0.12)** | 70 | 61 | 88 | 90 | 55 | 44 | |
| D (own, microarray) | 5-gene DLDA classifier trained on dataset B, tested on D | 73 | 77 | 69 | 71 | 75 | 54 |
| E1 (Weber | 5-gene DLDA classifier.** | 92 | 100 | 86 | 83 | 100 | 50 |
| E2 (Hinsch | 5-gene DLDA classifier.** | 83 | 100 | 67 | 75 | 100 | 67 |
Accuracy, proportion of all samples that are correctly classified; PPV, positive predictive value; NPV, negative predictive value; SVM, support vector machines; DLDA, diagonal linear discriminant analysis; CI, confidence interval. *Performance assessed by 10-fold cross-validation. **Performance assessed by leave-one-out cross-validation.
Figure 2Boxplots for the validated genes ( , and ). All genes were under-expressed in follicular thyroid carcinoma (FTC) compared to follicular thyroid adenoma (FTA). All p-values were calculated using Mann–Whitney U test. The boxplots show following values: median: middle line; 25–75 percentile: box; non-outlying range: whiskers; outliers: circles; extreme values: stars.
Results for the 8 genes included in the classifier and chosen for qPCR validation on FFPE samples
| -2.15 | -2.36 | -0.98 | 0.0377 | ||
| -1.94 | -2.25 | -1.17 | 0.0015 | ||
| -1.51 | -1.98 | -0.38 | 0.2214 | ||
| -2.45 | -3.35 | -1.67 | 0.0100 | ||
| -1.86 | -1.45 | -1.23 | 0.0014 | ||
| -2.15 | -2.7 | Insufficient amplification | |||
| -2.77 | -2.92 | Insufficient amplification | |||
| -1.88 | -1.36 | Insufficient amplification | |||
Log ratios of the 8 genes from microarray training datasets A and B, and validation dataset C (qPCR), are presented. Mann–Whitney U test results for genes validated with qPCR are in bold.
Figure 3ROC curve for the DLDA classifier that was cross-validated on the dataset C. The circle marks the classifier with a cut-off of 0.5 (specificity = 71%, sensitivity = 72%). The star marks the classifier with a cut-off of 0.12 (specificity, 90%; sensitivity, 55%).
Accuracy comparison for various classifiers
| 76-gene Borup's classifier | 81% | 81% | 92% | 83% |
| 5-gene own classifier | 77% | 73% | 92% | 83% |
| 3-gene Weber's classifier | 77% | 50% | 91%* | 42% |
| 5-gene Foukakis classifier | 77% | 61% | 71% | 75% |
*Potentially overfitting (tested on the set from which it was developed).