| Literature DB >> 28122501 |
Rob Eisinga1, Tom Heskes2, Ben Pelzer3, Manfred Te Grotenhuis3.
Abstract
BACKGROUND: The Friedman rank sum test is a widely-used nonparametric method in computational biology. In addition to examining the overall null hypothesis of no significant difference among any of the rank sums, it is typically of interest to conduct pairwise comparison tests. Current approaches to such tests rely on large-sample approximations, due to the numerical complexity of computing the exact distribution. These approximate methods lead to inaccurate estimates in the tail of the distribution, which is most relevant for p-value calculation.Entities:
Keywords: Classifier comparison; Exact p-value; Friedman test; Machine learning; Multiple comparison; Nonparametric statistics; Rank sum difference
Mesh:
Year: 2017 PMID: 28122501 PMCID: PMC5267387 DOI: 10.1186/s12859-017-1486-2
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Two-way layout for Friedman test
Recommended critical difference (CD) approximate tests for 1 × N and N × N comparisons of Friedman rank sums
| Comparison | Critical difference | Reference |
|---|---|---|
| 1 × |
| Demšar [ |
|
| Siegel and Castellan [ | |
|
|
| Siegel and Castellan [ |
|
| Nemenyi [ | |
|
| Miller [ |
Note: The constant is the upper αth percentile point for the distribution of the maximum of k − 1 equally correlated (ρ=.5) unit normal N(0, 1) random variables. The constant q is the upper αth percentile point of the Studentized range (q) distribution with (k, ∞) degrees of freedom. The references in the right-most column are ordered by year of publication (of first edition).
Fig. 1Computational time. Time (in seconds) for calculating the exact p-value of d = 1 and d = k(n − 1) − 1, for n = 2, …, 100 and k = 10 (black line) and k = 100 (red line)
Fig. 2Distribution of exact mass point probabilities and exact p-values. a Exact mass point probabilities, and exact p-values, for k = n = 5. (b) Exact p-values, and log10-transformed exact (blue line) and normal approximate p-values (red line), for k = n = 10. (c) Histogram of simulated p-values under the overall null hypothesis with expected null frequency superimposed, and cumulative distribution function of the simulated 1 − p-values with diagonal line overlay, for k = 50, n = 5.
Exact (CD) and approximate critical values of differences in rank sums, at the familywise error rate of α=.05
|
|
| max( | Unadjusted | Bonferroni-adjusted | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
| ||||||||||
|
|
|
|
|
|
|
|
|
| |||
| 5 | 5 | 20 | 11 | 10 | 13 | 13 | 13 | 14 | 15 | 14 | 16 |
| 10 | 40 | 15 | 14 | 18 | 18 | 18 | 20 | 20 | 20 | 22 | |
| 25 | 100 | 23 | 22 | 29 | 28 | 28 | 32 | 33 | 31 | 35 | |
| 50 | 200 | 32 | 31 | 40 | 40 | 39 | 45 | 45 | 44 | 49 | |
| 100 | 400 | 45 | 44 | 57 | 56 | 55 | 64 | 63 | 61 | 69 | |
| 10 | 5 | 45 | 20 | 19 | 27 | 27 | 26 | 30 | 32 | 31 | 40 |
| 10 | 90 | 27 | 27 | 38 | 38 | 37 | 44 | 45 | 43 | 56 | |
| 25 | 225 | 43 | 42 | 60 | 60 | 58 | 70 | 70 | 68 | 89 | |
| 50 | 450 | 60 | 60 | 85 | 84 | 82 | 99 | 99 | 96 | 125 | |
| 100 | 900 | 85 | 84 | 120 | 119 | 115 | 141 | 140 | 136 | 177 | |
| 25 | 5 | 120 | 46 | 46 | 70 | 72 | 69 | 83 | 88 | 86 |
|
| 10 | 240 | 65 | 65 | 100 | 102 | 98 | 121 | 124 | 121 | 199 | |
| 25 | 600 | 103 | 102 | 160 | 161 | 154 | 194 | 196 | 191 | 315 | |
| 50 | 1200 | 145 | 145 | 227 | 227 | 218 | 276 | 278 | 270 | 445 | |
| 100 | 2400 | 205 | 204 | 321 | 321 | 308 | 392 | 392 | 381 | 629 | |
| 50 | 5 | 245 | 91 | 91 | 146 | 152 | 145 | 175 | 190 | 185 |
|
| 10 | 490 | 128 | 128 | 210 | 215 | 205 | 258 | 268 | 261 |
| |
| 25 | 1225 | 203 | 203 | 337 | 339 | 323 | 417 | 423 | 412 | 840 | |
| 50 | 2450 | 287 | 286 | 478 | 479 | 457 | 595 | 599 | 582 | 1188 | |
| 100 | 4900 | 405 | 405 | 677 | 678 | 646 | 844 | 846 | 824 | 1680 | |
| 100 | 5 | 495 | 180 | 180 | 304 | 320 | 302 | 368 | 406 | 395 |
|
| 10 | 990 | 255 | 255 | 441 | 452 | 427 | 548 | 573 | 559 |
| |
| 25 | 2475 | 403 | 403 | 708 | 714 | 676 | 891 | 906 | 883 | 2278 | |
| 50 | 4950 | 569 | 569 | 1005 | 1010 | 955 | 1271 | 1281 | 1249 | 3221 | |
| 100 | 9900 | 805 | 805 | 1425 | 1427 | 1350 | 1805 | 1812 | 1766 | 4555 | |
Note: The tabled values satisfy the relation P(D ≥ tabled value) <.05. For presentational purposes, the approximate critical differences were rounded up to the smallest integer that is not less than the calculated value. Italicized figures in the right-most column represent critical differences exceeding the maximum value of d, denoted max(d), implying that none of the rank sum differences is significant at the α=.05 level
Exact and mid p-values for unadjusted exact CD values
|
|
|
| mid |
|
|
| mid |
|
|
| mid |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 5 | 5 | .0326 | .0440 | 10 | 5 | .0397 | .0457 | 25 | 5 | .0494 |
|
| 10 | .0389 | .0471 | 10 | .0496 |
| 10 | .0494 |
| |||
| 25 | .0437 | .0489 | 25 | .0468 | .0495 | 25 | .0487 | .0498 | |||
| 50 | .0461 | .0498 | 50 | .0492 |
| 50 | .0495 |
| |||
| 100 | .0465 | .0490 | 100 | .0484 | .0497 | 100 | .0494 | .0499 | |||
| 50 | 5 | .0485 | .0498 | 100 | 5 | .0493 | .0500 | ||||
| 10 | .0500 |
| 10 | .0493 | .0497 | ||||||
| 25 | .0493 | .0498 | 25 | .0496 | .0498 | ||||||
| 50 | .0493 | .0497 | 50 | .0499 |
| ||||||
| 100 | .0497 | .0500 | 100 | .0499 | .0500 |
Note: Bold figures indicate mid p-values exceeding the nominal level of α=.05.
Fig. 3Error normal approximation. Percentage relative error R(d) of normal approximation with (red line) and without (black line) continuity correction versus exact p-value, for n = k = 10,100
Friedman rank data for k = 11 methods and n = 4 performance indicators (Ruijter et al. [69])
| Method | Rank sum | Cy0 | LinRegPCR | Standard-Cq | PCR-Miner | MAK2 | LRE-E100 | 5PSM | DART | FPLM | LRE-Emax | FPK-PCR |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Cy0 | 7 | 3 | 3 | 10 | 11 | 15 | 25 | 27 | 29 | 31 | 33 | |
| LinRegPCR | 10 | 1 | 0 | 7 | 8 | 12 | 22 | 24 | 26 | 28 | 30 | |
| Standard-Cq | 10 | 1 | 1 | 7 | 8 | 12 | 22 | 24 | 26 | 28 | 30 | |
| PCR-Miner | 17 | 1 | 1 | 1 | 1 | 5 | 15 | 17 | 19 | 21 | 23 | |
| MAK2 | 18 | 1 | 1 | 1 | 1 | 4 | 14 | 16 | 18 | 20 | 22 | |
| LRE-E100 | 22 | 1 | 1 | 1 | 1 | 1 | 10 | 12 | 14 | 16 | 18 | |
| 5PSM | 32 | 0.423 | 1 | 1 | 1 | 1 | 1 | 2 | 4 | 6 | 8 | |
| DART | 34 | 0.220 | 0.578 | 0.578 | 1 | 1 | 1 | 1 | 2 | 4 | 6 | |
| FPLM | 36 | 0.110 | 0.307 | 0.307 | 1 | 1 | 1 | 1 | 1 | 2 | 4 | |
| LRE-Emax | 38 | 0.052 | 0.156 | 0.156 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | |
| FPK-PCR | 40 |
| 0.076 | 0.076 | 0.782 | 1 | 1 | 1 | 1 | 1 | 1 | |
| Cy0 | 1 | 1 | 0.993 | 0.985 | 0.883 | 0.216 | 0.130 | 0.073 |
|
| ||
| LinRegPCR | 1 | 1 | 1 | 0.999 | 0.972 | 0.403 | 0.271 | 0.169 | 0.098 | 0.053 | ||
| Standard-Cq | 1 | 1 | 1 | 0.999 | 0.972 | 0.403 | 0.271 | 0.169 | 0.098 | 0.053 | ||
| PCR-Miner | 1 | 1 | 1 | 1 | 1 | 0.883 | 0.773 | 0.631 | 0.477 | 0.334 | ||
| MAK2 | 1 | 1 | 1 | 1 | 1 | 0.923 | 0.833 | 0.705 | 0.554 | 0.403 | ||
| LRE-E100 | 1 | 1 | 1 | 1 | 1 | 0.993 | 0.972 | 0.923 | 0.833 | 0.705 | ||
| 5PSM | 0.350 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0.999 | ||
| DART | 0.150 | 0.514 | 0.514 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| FPLM | 0.057 | 0.232 | 0.232 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| LRE-Emax |
| 0.094 | 0.094 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| FPK-PCR |
|
|
| 0.738 | 1 | 1 | 1 | 1 | 1 | 1 |
Note: The upper top triangle displays the rank sum differences, upper bottom triangle the Bonferroni-adjusted normal approximate p-values, lower bottom triangle the Bonferroni-adjusted exact p-values, and lower top triangle the Studentized range approximate p-values. Bold figures indicate p-values ≤ .05
Friedman rank data for k = 12 methods and n = 10 cell differentiation datasets (Zagar et al. [70])
| Method | GDS 2431 | GDS 2666 | GDS 2667 | GDS 2668 | GDS 2669 | GDS 2671 | GDS 2672 | GDS 586 | GDS 587 | GDS 2688 | Rank sum excluding GDS2688 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| MCE-euclid-FC | 1 | 2 | 1 | 6 | 6 | 1 | 1 | 10 | 8 | 1 | 36 |
| PCA-FC | 5 | 1 | 6 | 1 | 1.5 | 12 | 8 | 5.5 | 1 | 3 | 41 |
| PLS-AREA | 6.5 | 8 | 4 | 3 | 4.5 | 5 | 6 | 7.5 | 3 | 6 | 47.5 |
| PCA-AREA | 4 | 6.5 | 3 | 2 | 7 | 11 | 7 | 7.5 | 2 | 2 | 50 |
| MCE-euclid-AREA | 3 | 3.5 | 2 | 5 | 9 | 3.5 | 5 | 11 | 9 | 4 | 51 |
| PLS-FC | 9 | 5 | 8 | 4 | 1.5 | 3.5 | 12 | 5.5 | 5.5 | 5 | 54 |
| SVMRank-FC | 11 | 9 | 5 | 8 | 8 | 6 | 3 | 1 | 5.5 | 7 | 56.5 |
| SVMRank-AREA | 9 | 11 | 9 | 7 | 3 | 10 | 2 | 2 | 4 | 8 | 57 |
| PLS-FC-time | 9 | 3.5 | 11 | 11 | 4.5 | 8 | 10 | 3 | 10 | 9 | 70 |
| PLS-AREA-time | 6.5 | 6.5 | 12 | 12 | 12 | 9 | 4 | 4 | 7 | 10 | 73 |
| Pathrecon | 2 | 12 | 7 | 10 | 11 | 2 | 9 | 9 | 11 | 73 | |
| PCA-Markers | 12 | 10 | 10 | 9 | 10 | 7 | 11 | 12 | 12 | 93 | |
| Exact | |||||||||||
|
|
|
|
|
|
|
| unadjusted | Bonferroni-adjusted | |||
| 1 × |
| ||||||||||
| Excluding GDS2688 | 37 | 12 | 9 |
| 0.174 | 1 | |||||
| Including GDS2688 | 46 | 12 | 9 | 10 | 1 |
|
| 0.230 | |||
Note: Bold figures indicate p-values ≤ .05