| Literature DB >> 27441241 |
Matthew P A Henderson1, Holger Hirte2, Sebastien J Hotte2, Peter A Kavsak3.
Abstract
OBJECTIVE: We examined a panel of cytokines and cell adhesion molecules in an attempt to identify cancer specific profiles. DESIGN AND METHODS: Cytokines and cell adhesion arrays (Randox Ltd.) were measured in samples from women with a histological diagnosis of ovarian cancer ([Formula: see text]) or breast cancer ([Formula: see text]) or cancer free ([Formula: see text]). Random forest analysis was used for classification.Entities:
Keywords: Cancer diagnostics; Laboratory medicine; Medical informatics; Statistics
Year: 2016 PMID: 27441241 PMCID: PMC4945896 DOI: 10.1016/j.heliyon.2015.e00059
Source DB: PubMed Journal: Heliyon ISSN: 2405-8440
Tumour staging (TNM system) of participants in this study.
| Disease | Progression | ||||||
|---|---|---|---|---|---|---|---|
| ND | T0 | T1 | T2 | T3 | T4 | TX | |
| Breast | 0 | 0 | 17 | 34 | 4 | 5 | 0 |
| Healthy | 32 | 0 | 0 | 0 | 0 | 0 | 0 |
| Ovarian | 0 | 3 | 10 | 8 | 17 | 0 | 4 |
Figure 3Variable importance plot for random forest analysis on the training data set. A mean decrease in accuracy of 0.02 was used as a cut-off for inclusion (triangles) of the variable in subsequent analysis.
Figure 4Test efficiency for classification of breast cancer (blue), healthy (green), and ovarian cancer (red). Test efficiency was calulated for the training data at each vote threshold from 0 to 1 in increments of 0.001.
Summary of the Random Forest algorithm classification accuracy using the optimal classification threshold. Bootstrapped confidence intervals are provided. The asterisk indicates that intervals could not be calculated as there was no variation between the bootstrapped data sets.
| Train (n) | Test (n) | Threshold | Efficiency | Sensitivity | 95% CI | Specificity | 95% CI | |
|---|---|---|---|---|---|---|---|---|
| Breast cancer | 36 | 24 | 0.665 | 86.6 | 70.8 | 47.1–86.4 | 96.4 | 82.1–100 |
| Cancer free | 18 | 14 | 0.495 | 87.8 | 57.1 | 29.6–81.8 | 100.0 | *–* |
| Ovarian cancer | 28 | 14 | 0.402 | 91.5 | 85.7 | 50–100 | 84.2 | 69.4–93.4 |
Figure 2Boxplot summary of analyte concentration z-scores grouped by diagnosis: breast cancer (blue), healthy (green), and ovarian cancer (red).
Figure 1Parallel co-ordinates plots for the three classes: breast cancer (blue), cancer free controls (green) and ovarian cancer (red). The threshold probablity for classification in each group is represented by a horizontal black line. FN: false negative, FP: false postive, TN: true negative, TP: true postive.
Concordance table for predicted classification of the test data set. The values in the “Adjusted” column are the classification error with samples classified as “unknown” removed.
| Observed | Predicted | Error | Adjusted | |||
|---|---|---|---|---|---|---|
| Breast | Cancer free | Ovarian | Unknown | |||
| Breast | 17 | 0 | 2 | 5 | 0.29 | 0.08 |
| Cancer free | 0 | 8 | 4 | 2 | 0.43 | 0.29 |
| Ovarian | 1 | 0 | 12 | 1 | 0.14 | 0.07 |