| Literature DB >> 27701436 |
Steffen Falgreen1, Anders Ellern Bilgrau1,2, Rasmus Froberg Brøndum1, Lasse Hjort Jakobsen1,2, Jonas Have1,2, Kasper Lindblad Nielsen1,2, Tarec Christoffer El-Galaly1,2, Julie Støve Bødker1, Alexander Schmitz1, Ken H Young3, Hans Erik Johnsen1,2, Karen Dybkær1,2, Martin Bøgsted1,2.
Abstract
BACKGROUND: Dozens of omics based cancer classification systems have been introduced with prognostic, diagnostic, and predictive capabilities. However, they often employ complex algorithms and are only applicable on whole cohorts of patients, making them difficult to apply in a personalized clinical setting.Entities:
Mesh:
Year: 2016 PMID: 27701436 PMCID: PMC5049784 DOI: 10.1371/journal.pone.0163711
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Diagram of the hemaClass.org workflow architecture.
Overview of used datasets and GEO accession numbers.
| No. | Dataset | Usage | GEO number | Ref. | |
|---|---|---|---|---|---|
| 1. | LLMPP CHOP | 181 | Training | GSE10846 | [ |
| 2. | Tonsil | 33 | Training | GSE56315 | [ |
| 3. | BCELL26 | 26 | Training | GSE53798 | [ |
| 4. | CHEPRETRO | 89 | Validation | GSE56315 | [ |
| 5. | IDRC | 470 | Validation | GSE31312 | [ |
| 6. | LLMPP R-CHOP | 233 | Validation | GSE10846 | [ |
| 7. | MDFCI | 90 | Validation | GSE34171 | [ |
Comparison of ABC/GCB classification performed using Wright’s naive Bayes classifier [36] and the established elastic net classifier both based on cohort normalization.
The second column shows the accuracy of the classifiers with 95% CI. The third column shows the Cohen’s weighted κ with 95% CI.
| Dataset | Accuracy | Cohen’s |
|---|---|---|
| CHEPRETRO | 0.94 (0.87, 0.98) | 0.94 (0.88, 1.00) |
| MDFCI | 0.78 (0.68, 0.86) | 0.77 (0.66, 0.88) |
| IDRC | 0.82 (0.79, 0.86) | 0.80 (0.76, 0.85) |
| LLMPP R-CHOP | 0.91 (0.86, 0.94) | 0.90 (0.85, 0.96) |
Fig 2Comparison of logit probabilities for the ABC/GCB and REGS classifiers obtained through InLab or ExLab one-by-one normalization against cohort normalization.
The areas marked with green indicate patients with similar classification between cohort based normalization and ExLab one-by-one normalization (A, C, E, G, I), or InLab one-by-one normalization (B, D, F, H, J). The areas marked with red indicate complete misclassifications. For ABC/GCB and REGS the white areas indicate unclassified and intermediate sensitivity, respectively, in at least one of the classifiers. The dashed and solid line show the identity and total least squares line, respectively.
Comparison of classifications obtained using cohort based normalization against Exlab and InLab reference based normalization.
The classifications are compared in terms of accuracy, Cohen’s weighted κ, and Pearson’s correlation coefficient r all supplied with 95% CIs. The comparisons in the first and last three columns are based on the ExLab and InLab reference based normalization method, respectively. For ABC/GCB classification, results from InLab or Exlab classification with the elasitic net classifier is compared against ABC/GCB classes for cohort normalized data obtained using both Wrights Bayes classifier and the elastic net classifier.
| ExLab RMA pre-processing | InLab RMA pre-processing | |||||
|---|---|---|---|---|---|---|
| Accuracy | Cohen’s | Pearson’s | Accuracy | Cohen’s | Pearson’s | |
| CHEPRETRO | .89 (.80, .94) | .89 (.79, .98) | - | .97 (.88, 1.) | .97 (.90, 1.) | - |
| MDFCI | .63 (.52, .73) | .52 (.40, .64) | - | .72 (.59, .83) | .71 (.55, .86) | - |
| IDRC | .67 (.63, .71) | .62 (.56, .67) | - | .84 (.80, .87) | .82 (.77, .86) | - |
| LLMPP R-CHOP | .83 (.77, .87) | .82 (.74, .89) | - | .88 (.83, .92) | .88 (.82, .93) | - |
| CHEPRETRO | .88 (.79, .94) | .87 (.78, .97) | .999 (.998, .999) | .98 (.91, 1.) | .98 (.93, 1.) | 1. (.999, 1.) |
| MDFCI | .69 (.59, .78) | .68 (.53, .82) | .998 (.998, .999) | .98 (.91, 1.) | .98 (.85, 1.) | 1. (.999, 1.) |
| IDRC | .65 (.61, .69) | .62 (.57, .68) | .986 (.983, .988) | .93 (.91, .95) | .93 (.90, .96) | .993 (.991, .994) |
| LLMPP R-CHOP | .82 (.77, .87) | .82 (.74, .89) | .999 (.999, .999) | .94 (.90, .97) | .94 (.90, .98) | .991 (.988, .993) |
| CHEPRETRO | .58 (.47, .69) | .56 (.28, .84) | - | .78 (.65, .88) | .74 (.33, 1.) | - |
| MDFCI | .54 (.43, .64) | .48 (.17, .79) | - | .80 (.68, .89) | .83 (.30, 1.) | - |
| IDRC | .52 (.47, .56) | .41 (.32, .50) | - | .79 (.75, .83) | .79 (.62, .96) | - |
| LLMPP R-CHOP | .56 (.49, .62) | .53 (.36, .70) | - | .88 (.82, .92) | .88 (.60, 1.) | - |
| CHEPRETRO | .73 (.68, .78) | .71 (.64, .77) | .934 (.920, .946) | .84 (.79, .88) | .83 (.76, .89) | .992 (.990, .994) |
| MDFCI | .60 (.55, .65) | .55 (.48, .61) | .824 (.788, .855) | .90 (.86, .94) | .89 (.83, .96) | .997 (.996, .997) |
| IDRC | .52 (.49, .54) | .33 (.30, .36) | .660 (.635, .685) | .85 (.84, .87) | .84 (.81, .86) | .981 (.979, .983) |
| LLMPP R-CHOP | .58 (.54, .61) | .50 (.46, .54) | .810 (.786, .831) | .90 (.87, .92) | .89 (.85, .92) | .992 (.990, .993) |
Fig 3Comparison of logit probabilities for the BAGS classifier obtained through ExLab or InLab one-by-one normalization against cohort normalization.
The coloured regions in the figure correspond to a threshold probability of 0.5. The dashed and solid line show the identity and total least squares line, respectively.