| Literature DB >> 30930678 |
Anthony Bagnall1, Jason Lines1, Aaron Bostrom1, James Large1, Eamonn Keogh2.
Abstract
In the last 5 years there have been a large number of new time series classification algorithms proposed in the literature. These algorithms have been evaluated on subsets of the 47 data sets in the University of California, Riverside time series classification archive. The archive has recently been expanded to 85 data sets, over half of which have been donated by researchers at the University of East Anglia. Aspects of previous evaluations have made comparisons between algorithms difficult. For example, several different programming languages have been used, experiments involved a single train/test split and some used normalised data whilst others did not. The relaunch of the archive provides a timely opportunity to thoroughly evaluate algorithms on a larger number of datasets. We have implemented 18 recently proposed algorithms in a common Java framework and compared them against two standard benchmark classifiers (and each other) by performing 100 resampling experiments on each of the 85 datasets. We use these results to test several hypotheses relating to whether the algorithms are significantly more accurate than the benchmarks and each other. Our results indicate that only nine of these algorithms are significantly more accurate than both benchmarks and that one classifier, the collective of transformation ensembles, is significantly more accurate than all of the others. All of our experiments and results are reproducible: we release all of our code, results and experimental details and we hope these experiments form the basis for more robust testing of new algorithms in the future.Entities:
Keywords: Elastic distance measures; Shapelets; Time series classification; Time series similarity
Year: 2016 PMID: 30930678 PMCID: PMC6404674 DOI: 10.1007/s10618-016-0483-9
Source DB: PubMed Journal: Data Min Knowl Discov ISSN: 1384-5810 Impact factor: 3.670
Fig. 1Four cases from two classes of the dataset FiftyWords. The top two series show class 30, the bottom two class 50. The common pattern is clear, but only detectable with reallignment
Fig. 2An example dataset where interval methods should do well. The noise in the early part of the series may confound whole series methods
Fig. 3Average ranks of published results for TSF, LPS and TSBF in a critical difference diagram (explained in detail in Sect. 3)
Fig. 4Example matching between a shapelet and three series of different classes from NonInvasiveFetalECGThorax2. The scale of the shapelet is different on each series to reflect that distance is measured with normalised subseries
Fig. 5Average ranks of the published results for three shapelet algorithms Fast Shapelets (FS), Shapelet Transform (ST) and Learned Shapelets (LS) on the 33 datasets they have in common
Fig. 6An example of the need for detecting recurring patterns rather than unique patterns from WormsTwoClass. The top two series are the motion of normal worms, the bottom two mutant worms. The candidate subseries would not necessarily be a good shapelet, because there are close matches in both mutant and non mutant series
Fig. 7Average ranks of published results on 19 data sets or BOP, SAXVSM, BOSS
Summary of the time and space complexity of the 18 TSC algorithms considered
| Train time | Train space | Parameters | |
|---|---|---|---|
| WDTW |
|
| |
| TWE |
|
| |
| MSM |
|
| |
| CID |
|
| |
|
|
|
| |
|
|
|
| |
| EE |
|
| |
| ST |
|
|
|
| LS |
|
|
|
| FS |
|
| |
| TSF |
|
|
|
| TSBF |
|
|
|
| LPS |
|
| |
| BOP |
|
|
|
| SAXVSM |
|
|
|
| BOSS |
|
| |
|
|
|
| |
| COTE |
|
|
Train time includes the cross validated parameter search. Series length is m, number of series is n and number of classes is c
A summary of algorithm taxonomy by data feature characteristics: weighted DTW (WDTW) (Jeong et al. 2011); time warp edit (TWE) (Marteau 2009); move–split–merge (MSM) (Stefan et al. 2013); complexity invariant distance (CID) (Batista et al. 2014); derivative DTW () (Górecki and Łuczak 2013); derivative transform distance () (Górecki and Łuczak 2014); elastic ensemble (EE) (Lines and Bagnall 2015); time series forest (TSF) (Deng et al. 2013); time series bag of features (TSBF) (Baydogan et al. 2013); learned pattern similarity (LPS) (Baydogan and Runger 2016); fast shapelets (FS) (Rakthanmanon and Keogh 2013); shapelet transform (ST) (Hills et al. 2014); bag of patterns (BOP) (Lin et al. 2012); SAX vector space model (SAXVSM) (Senin and Malinchik 2013); bag of SFA symbols (BOSS) (Schäfer 2015); DTW features (Kate 2016); collective of transformation-based ensembles (COTE) (Bagnall et al. 2015)
| Whole series | Intervals | Shapelets | Dictionary |
|---|---|---|---|
| WDTW | TSF | FS | BOP |
| TWE | TSBF | ST | SAXVSM |
| MSM | LPS | LS | BOSS |
| CID | COTE |
| |
|
| |||
|
| |||
| EE | |||
| COTE |
A summary of algorithms and the component approaches underlying them
| NN | time | deriv | shape | int | dict | auto | ens | |
|---|---|---|---|---|---|---|---|---|
| WDTW | x | x | ||||||
| TWE | x | x | ||||||
| MSM | x | x | ||||||
| CID | x | x | x | |||||
|
| x | x | x | |||||
|
| x | x | x | |||||
| ST | x | x | ||||||
| LS | x | |||||||
| FS | x | |||||||
| TSF | x | x | ||||||
| TSBF | x | x | ||||||
| LPS | x | x | x | x | ||||
| BOP | x | x | ||||||
| SAXVSM | x | x | ||||||
| BOSS | x | x | x | x | ||||
|
| x | x | x | x | ||||
| EE | x | x | x | x | ||||
| COTE | x | x | x | x | x | x |
Approaches are nearest neighbour classification (NN), time domain distance function (time), derivative based distance function (deriv), shapelet based (shape), interval based (int), dictionary based (dict), auto-correlation based (auto) and ensemble (ens)
Fig. 8Summary information for the 85 datasets in the archive
Parameter settings and ranges for TSC algorithms
| Parameters | CV Folds | |
|---|---|---|
| WDTW |
| LOOCV |
| TWE |
| LOOCV |
| MSM |
| LOOCV |
| CID |
| LOOCV |
|
|
| LOOCV |
|
|
| LOOCV |
| ST | min=3, max=m-1 | 0 |
| LS |
| 3 |
| FS |
| 0 |
| TSF |
| 0 |
| TSBF |
| LOOCV |
| LPS |
| LOOCV |
| BOP |
| LOOCV |
| SAXVSM |
| LOOCV |
| BOSS |
| LOOCV |
|
| DTW paras 0 to 0.99, SAX pars as BOP, SVM kernel degree | 10 |
| EE | Constituent classifier parameters only | 0 |
| COTE | Constituent classifier parameters only | 0 |
The notation is overloaded in order to maintain consistency with authors’ original parameter names
Fig. 9Critical difference diagram for 11 potential benchmark classifiers
A summary of algorithm performance based on significant difference to DTW and Rotation Forest
| Comparison to DTW | Comparison to RotF | ||||
|---|---|---|---|---|---|
| Classifier | Prop better (%) | Mean difference (%) | Classifier | Prop better (%) | Mean difference (%) |
| Significantly better than DTW | Significantly better than RotF | ||||
| COTE | 96.47 | 8.12 | COTE | 84.71 | 8.14 |
| EE | 95.29 | 3.51 | ST | 75.29 | 6.15 |
| ST | 80.00 | 6.13 | TSF | 63.53 | 1.93 |
| BOSS | 78.82 | 5.67 | BOSS | 62.35 | 5.70 |
| | 75.29 | 2.87 | LPS | 60.00 | 1.86 |
| TSF | 68.24 | 1.91 | EE | 58.82 | 3.54 |
| TSBF | 65.88 | 2.19 | | 58.82 | 2.89 |
| MSM | 62.35 | 1.89 | MSM | 57.65 | 1.91 |
| LPS | 61.18 | 1.83 | TSBF | 56.47 | 2.22 |
| WDTW | 60.00 | 0.20 | Not significantly different to RotF | ||
| LS | 58.82 | 0.56 | LS | 61.18 | 0.58 |
| | 52.94 | 0.79 | | 48.24 | 0.56 |
| | 50.59 | 0.54 | | 47.06 | 0.82 |
| Not significantly different to DTW | | 45.88 | 0.44 | ||
| | 56.47 | 0.42 | TWE | 45.88 | 0.40 |
| RotF | 56.47 |
| WDTW | 44.71 | 0.22 |
| TWE | 49.41 | 0.37 | DTW | 43.53 | 0.02 |
| Significantly worse than DTW | Significantly worse than RotF | ||||
| SAXVSM | 41.18 |
| BOP | 34.12 |
|
| BOP | 37.65 |
| SAXVSM | 31.76 |
|
| FS | 30.59 |
| FS | 22.35 |
|
The column prop better gives the proportion of problems where the classifier has a significantly higher mean accuracy over 100 resamples than the benchmark. The column mean gives the mean difference in mean accuracy over all 85 problems. Thus, for example, COTE is on average 8.12% more accurate than DTW over the 85 problems
Average accuracy of the best nine classifiers over 85 problems
| Datasets | COTE | ST | BOSS | EE |
| TSF | TSBF | LPS | MSM |
|---|---|---|---|---|---|---|---|---|---|
| Adiac |
| 0.768 | 0.749 | 0.665 | 0.605 | 0.707 | 0.727 | 0.765 | 0.636 |
| ArrowHead |
| 0.851 | 0.875 | 0.86 | 0.776 | 0.789 | 0.801 | 0.806 | 0.815 |
| Beef |
| 0.736 | 0.615 | 0.532 | 0.546 | 0.648 | 0.554 | 0.52 | 0.474 |
| BeetleFly | 0.921 | 0.875 |
| 0.823 | 0.853 | 0.842 | 0.799 | 0.893 | 0.794 |
| BirdChicken | 0.941 | 0.927 |
| 0.848 | 0.865 | 0.839 | 0.902 | 0.854 | 0.866 |
| Car | 0.899 |
| 0.855 | 0.799 | 0.851 | 0.758 | 0.795 | 0.836 | 0.841 |
| CBF | 0.998 | 0.986 |
| 0.993 | 0.979 | 0.958 | 0.977 | 0.984 | 0.972 |
| ChlorineConcentration |
| 0.682 | 0.66 | 0.659 | 0.658 | 0.719 | 0.683 | 0.642 | 0.626 |
| CinCECGtorso |
| 0.918 | 0.9 | 0.946 | 0.714 | 0.974 | 0.716 | 0.743 | 0.935 |
| Coffee |
| 0.995 | 0.989 | 0.989 | 0.973 | 0.989 | 0.982 | 0.95 | 0.945 |
| Computers | 0.77 | 0.785 |
| 0.732 | 0.659 | 0.768 | 0.765 | 0.726 | 0.713 |
| CricketX |
| 0.777 | 0.764 | 0.801 | 0.769 | 0.691 | 0.731 | 0.696 | 0.778 |
| CricketY |
| 0.762 | 0.749 | 0.794 | 0.756 | 0.688 | 0.728 | 0.706 | 0.76 |
| CricketZ |
| 0.798 | 0.776 | 0.804 | 0.785 | 0.707 | 0.738 | 0.714 | 0.779 |
| DiatomSizeReduction | 0.925 | 0.911 | 0.939 |
| 0.942 | 0.941 | 0.89 | 0.915 | 0.939 |
| DistalPhalanxOAG | 0.805 |
| 0.815 | 0.768 | 0.796 | 0.809 | 0.816 | 0.767 | 0.756 |
| DistalPhalanxOC |
| 0.819 | 0.814 | 0.768 | 0.76 | 0.813 | 0.812 | 0.742 | 0.754 |
| DistalPhalanxTW |
| 0.69 | 0.673 | 0.654 | 0.658 | 0.686 | 0.69 | 0.618 | 0.618 |
| Earthquakes | 0.747 | 0.737 | 0.746 | 0.735 |
| 0.747 | 0.747 | 0.668 | 0.695 |
| ECG200 | 0.873 | 0.84 |
| 0.881 | 0.819 | 0.868 | 0.847 | 0.807 | 0.877 |
| ECG5000 |
| 0.943 | 0.94 | 0.939 | 0.94 | 0.944 | 0.938 | 0.917 | 0.93 |
| ECGFiveDays |
| 0.955 | 0.983 | 0.847 | 0.907 | 0.922 | 0.849 | 0.84 | 0.879 |
| ElectricDevices | 0.883 |
| 0.8 | 0.831 | 0.874 | 0.804 | 0.808 | 0.853 | 0.825 |
| FaceAll |
| 0.968 | 0.974 | 0.976 | 0.963 | 0.949 | 0.942 | 0.962 | 0.986 |
| FaceFour | 0.85 | 0.794 |
| 0.879 | 0.909 | 0.891 | 0.862 | 0.889 | 0.92 |
| FacesUCR | 0.967 | 0.909 | 0.951 | 0.948 | 0.889 | 0.897 | 0.849 | 0.91 |
|
| Fiftywords | 0.801 | 0.713 | 0.702 |
| 0.748 | 0.728 | 0.744 | 0.776 | 0.817 |
| Fish | 0.962 |
| 0.969 | 0.913 | 0.931 | 0.807 | 0.913 | 0.912 | 0.897 |
| FordA | 0.955 |
| 0.92 | 0.751 | 0.884 | 0.816 | 0.831 | 0.869 | 0.725 |
| FordB |
| 0.915 | 0.911 | 0.757 | 0.843 | 0.79 | 0.751 | 0.852 | 0.73 |
| GunPoint | 0.992 |
| 0.994 | 0.974 | 0.964 | 0.962 | 0.965 | 0.972 | 0.948 |
| Ham | 0.805 | 0.808 |
| 0.763 | 0.795 | 0.795 | 0.711 | 0.685 | 0.745 |
| HandOutlines | 0.894 |
| 0.903 | 0.88 | 0.915 | 0.909 | 0.879 | 0.868 | 0.864 |
| Haptics |
| 0.512 | 0.459 | 0.451 | 0.464 | 0.467 | 0.463 | 0.415 | 0.444 |
| Herring | 0.632 |
| 0.605 | 0.566 | 0.609 | 0.606 | 0.59 | 0.549 | 0.559 |
| InlineSkate |
| 0.393 | 0.503 | 0.476 | 0.382 | 0.379 | 0.377 | 0.449 | 0.455 |
| InsectWingbeatSound |
| 0.617 | 0.51 | 0.581 | 0.602 | 0.613 | 0.616 | 0.519 | 0.57 |
| ItalyPowerDemand |
| 0.953 | 0.866 | 0.951 | 0.948 | 0.958 | 0.926 | 0.914 | 0.936 |
| LargeKitchenAppliances | 0.9 |
| 0.837 | 0.816 | 0.823 | 0.644 | 0.551 | 0.68 | 0.749 |
| Lightning2 | 0.785 | 0.659 | 0.81 |
| 0.71 | 0.757 | 0.76 | 0.757 | 0.792 |
| Lightning7 |
| 0.724 | 0.666 | 0.763 | 0.671 | 0.723 | 0.68 | 0.631 | 0.713 |
| Mallat |
| 0.972 | 0.949 | 0.961 | 0.929 | 0.937 | 0.951 | 0.908 | 0.918 |
| Meat | 0.981 | 0.966 | 0.98 | 0.978 |
| 0.978 | 0.983 | 0.968 | 0.977 |
| MedicalImages |
| 0.691 | 0.715 | 0.761 | 0.701 | 0.757 | 0.701 | 0.71 | 0.757 |
| MiddlePhalanxOAG | 0.801 |
| 0.808 | 0.782 | 0.798 | 0.794 | 0.8 | 0.77 | 0.751 |
| MiddlePhalanxOC |
| 0.694 | 0.666 | 0.609 | 0.581 | 0.676 | 0.673 | 0.597 | 0.56 |
| MiddlePhalanxTW |
| 0.579 | 0.537 | 0.525 | 0.519 | 0.577 | 0.568 | 0.503 | 0.499 |
| MoteStrain | 0.902 | 0.882 | 0.846 | 0.875 | 0.891 | 0.874 | 0.886 |
| 0.88 |
| NonInvFetalECGThorax1 | 0.929 |
| 0.841 | 0.849 | 0.877 | 0.88 | 0.842 | 0.807 | 0.818 |
| NonInvFetalECGThorax2 | 0.946 |
| 0.904 | 0.914 | 0.898 | 0.914 | 0.862 | 0.826 | 0.894 |
| OliveOil |
| 0.881 | 0.87 | 0.879 | 0.864 | 0.883 | 0.864 | 0.892 | 0.872 |
| OSULeaf | 0.949 | 0.934 |
| 0.812 | 0.809 | 0.637 | 0.678 | 0.763 | 0.787 |
| PhalangesOutlinesCorrect | 0.783 | 0.794 | 0.821 | 0.78 | 0.793 | 0.804 |
| 0.79 | 0.76 |
| Phoneme |
| 0.329 | 0.256 | 0.299 | 0.22 | 0.211 | 0.278 | 0.245 | 0.275 |
| Plane |
| 1 | 0.998 |
| 0.996 | 0.994 | 0.993 | 1 | 0.999 |
| ProximalPhalanxOAG | 0.871 |
| 0.867 | 0.839 | 0.829 | 0.847 | 0.861 | 0.851 | 0.806 |
| ProximalPhalanxOC |
| 0.841 | 0.819 | 0.805 | 0.824 | 0.846 | 0.842 | 0.8 | 0.769 |
| ProximalPhalanxTW |
| 0.803 | 0.773 | 0.759 | 0.774 | 0.808 | 0.798 | 0.722 | 0.729 |
| RefrigerationDevices | 0.742 | 0.761 |
| 0.676 | 0.656 | 0.615 | 0.638 | 0.675 | 0.704 |
| ScreenType | 0.651 |
| 0.586 | 0.554 | 0.499 | 0.573 | 0.538 | 0.506 | 0.493 |
| ShapeletSim | 0.964 | 0.934 |
| 0.827 | 0.888 | 0.51 | 0.913 | 0.874 | 0.85 |
| ShapesAll |
| 0.854 | 0.909 | 0.886 | 0.796 | 0.8 | 0.853 | 0.885 | 0.875 |
| SmallKitchenAppliances | 0.788 | 0.802 | 0.75 | 0.703 | 0.753 |
| 0.674 | 0.724 | 0.717 |
| SonyAIBORobotSurface1 |
| 0.888 | 0.897 | 0.794 | 0.884 | 0.845 | 0.839 | 0.842 | 0.764 |
| SonyAIBORobotSurface2 |
| 0.924 | 0.888 | 0.87 | 0.859 | 0.856 | 0.825 | 0.851 | 0.877 |
| StarlightCurves |
| 0.977 | 0.978 | 0.941 | 0.96 | 0.969 | 0.978 | 0.968 | 0.882 |
| Strawberry | 0.963 | 0.968 |
| 0.959 | 0.97 | 0.963 | 0.968 | 0.963 | 0.958 |
| SwedishLeaf |
| 0.939 | 0.918 | 0.916 | 0.885 | 0.892 | 0.908 | 0.926 | 0.887 |
| Symbols | 0.953 | 0.862 |
| 0.957 | 0.93 | 0.888 | 0.944 | 0.96 | 0.952 |
| SyntheticControl |
| 0.987 | 0.968 | 0.994 | 0.986 | 0.99 | 0.987 | 0.972 | 0.982 |
| ToeSegmentation1 | 0.934 |
| 0.929 | 0.788 | 0.922 | 0.661 | 0.858 | 0.841 | 0.821 |
| ToeSegmentation2 | 0.951 | 0.947 |
| 0.907 | 0.904 | 0.782 | 0.886 | 0.926 | 0.895 |
| Trace |
| 1 | 1 | 0.996 | 0.997 | 0.998 | 0.981 | 0.966 | 0.956 |
| TwoLeadECG | 0.983 | 0.984 |
| 0.958 | 0.958 | 0.842 | 0.91 | 0.928 | 0.941 |
| TwoPatterns | 1 | 0.952 | 0.991 |
| 1 | 0.991 | 0.974 | 0.967 | 0.999 |
| UWaveGestureLibraryX | 0.831 | 0.806 | 0.753 | 0.805 | 0.806 | 0.806 |
| 0.819 | 0.775 |
| UWaveGestureLibraryY |
| 0.737 | 0.661 | 0.731 | 0.717 | 0.727 | 0.746 | 0.753 | 0.69 |
| UWaveGestureLibraryZ | 0.76 | 0.747 | 0.695 | 0.726 | 0.736 | 0.741 |
| 0.766 | 0.701 |
| UWaveGestureLibraryAll |
| 0.942 | 0.944 | 0.968 | 0.963 | 0.962 | 0.944 | 0.968 | 0.96 |
| Wafer | 0.999 |
| 0.999 | 0.997 | 0.996 | 0.997 | 0.996 | 0.995 | 0.996 |
| Wine | 0.904 |
| 0.912 | 0.887 | 0.892 | 0.881 | 0.879 | 0.884 | 0.884 |
| WordSynonyms | 0.748 | 0.582 | 0.659 |
| 0.674 | 0.643 | 0.669 | 0.728 | 0.773 |
| Worms | 0.725 | 0.719 |
| 0.644 | 0.673 | 0.628 | 0.668 | 0.642 | 0.616 |
| WormsTwoClass | 0.785 | 0.779 |
| 0.717 | 0.73 | 0.685 | 0.755 | 0.743 | 0.712 |
| Yoga | 0.898 | 0.823 |
| 0.885 | 0.863 | 0.867 | 0.835 | 0.874 | 0.888 |
| Average rank | 2.11 | 3.56 | 4.04 | 5.02 | 5.61 | 5.71 | 5.92 | 6.40 | 6.62 |
| Wins | 36.5 | 17 | 17 | 6.5 | 2 | 1 | 3 | 1 | 1 |
The best algorithm of these nine is in bold. Some of the problem names are abbreviated and all of the results are rounded to 3 decimal places to save space. Apparent discrepancies such as the fact ST has accuracy of 1 on plane but is not registered as one of the best are caused by rounding (ST average accuracy is 0.99961). Full results, including the accuracy for each fold for every algorithm, are available on the website
Fig. 10Critical difference diagram for the nine classifiers significantly better than both benchmark classifiers
Average deviation in accuracy from COTE, standard deviation in ranks and maximum rank over all 85 data sets
| Algorithm | Difference to COTE | Rank Standard Deviation | Max Rank |
|---|---|---|---|
| COTE | 0 | 3.81 | 22 |
| ST |
| 7.89 | 33 |
| BOSS |
| 6.93 | 35 |
| EE |
| 5.79 | 26 |
|
|
| 6.57 | 30 |
| TSF |
| 7.27 | 32 |
| TSBF |
| 7.37 | 33 |
| LPS |
| 9.11 | 35 |
| MSM |
| 8.79 | 37 |
| RotF |
| 9.84 | 37 |
| DTW |
| 8.07 | 36 |
Best performing algorithms split by problem type
| Problem | COTE (%) | Dictionary (%) | Elastic (%) | Interval (%) | Shapelet (%) | Vector (%) | # |
|---|---|---|---|---|---|---|---|
| Image outline | 20.69 | 17.24 |
| 0.00 | 17.24 | 20.69 | 29 |
| Sensor readings |
| 0.00 | 16.67 | 5.56 | 22.22 | 16.67 | 18 |
| Motion capture |
| 21.43 | 14.29 | 14.29 | 14.29 | 0.00 | 14 |
| Spectrographs | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 7 |
| Electric devices | 0.00 | 33.33 | 0.00 | 16.67 |
| 0.00 | 6 |
| ECG | 33.33 | 16.67 | 0.00 | 0.00 |
| 0.00 | 6 |
| Simulated |
| 20.00 | 20.00 | 0.00 | 20.00 | 0.00 | 5 |
Each entry is the percentage of problems of that type a member of a class of algorithm is most accurate for
Bold values indicate the largest value on the row (i.e. the best performing algorithm for each problem type)
Fig. 11The rank of BOSS against series length, grouped into sets with lengths 1–100, 101–200, etc. The last group contains all series of length 1000 or more. The dotted line is the linear regression fit
Algorithm ranks split by number of classes
| #Classes | MSM | LPS | TSBF | TSF |
| EE | BOSS | ST | COTE |
|---|---|---|---|---|---|---|---|---|---|
| 2 | 11.94 | 10.74 | 9.76 | 9.53 | 8.31 | 8.97 | 4.26 | 4.55 | 3.16 |
| 3 | 12.92 | 10.42 | 8.42 | 8.83 | 8.50 | 7.67 | 3.67 | 4.00 | 2.92 |
| 4–5 | 10.82 | 12.09 | 11.64 | 7.00 | 8.45 | 7.45 | 6.41 | 7.05 | 3.91 |
| 6–7 | 10.91 | 11.27 | 10.09 | 10.45 | 9.73 | 5.77 | 7.36 | 6.05 | 2.05 |
| 8–12 | 8.70 | 10.10 | 8.30 | 9.10 | 7.90 | 4.00 | 11.70 | 6.00 | 1.50 |
| 13+ | 6.00 | 7.19 | 9.63 | 11.31 | 11.94 | 4.00 | 7.06 | 6.63 | 1.81 |
Each data represents the average rank of that algorithm over all problems with the range of classes shown in the first column
Algorithm ranks split by train set sizes
| #Train cases | MSM | LPS | TSBF | TSF |
| EE | BOSS | ST | COTE |
|---|---|---|---|---|---|---|---|---|---|
|
| 10.89 | 10.86 | 11.23 | 11.13 | 9.52 | 7.79 | 5.64 | 6.04 | 3.39 |
| 100–399 (28) | 8.96 | 11.61 | 11.18 | 10.36 | 9.57 | 5.98 | 5.66 | 5.75 | 2.09 |
| 400–799 (15) | 12.40 | 10.73 | 6.73 | 6.93 | 8.80 | 7.53 | 5.60 | 4.33 | 2.53 |
|
| 12.00 | 7.71 | 7.29 | 6.21 | 6.07 | 7.00 | 7.93 | 4.29 | 2.86 |
Each data represents the average rank of that algorithm over all problems with the range of train set size shown in the first column. The number of datasets in each category is in brackets
A summary of the relationship between classes of algorithms
| Best | # | Vector (%) | Elastic (%) | Interval (%) | Shapelet (%) | Dictionary (%) |
|---|---|---|---|---|---|---|
| Vector | 18 | 0.00 |
|
|
|
|
| Elastic | 18 |
| 0.00 |
|
|
|
| Interval | 8 |
|
| 0.00 |
|
|
| Shapelet | 28 |
|
|
| 0.00 |
|
| Dictionary | 13 |
|
|
|
| 0.00 |
All problems are grouped by the type of algorithm which has the highest accuracy. Each table entry is the average difference in accuracy of the average of the best performing algorithms of the best in each category. So, for example, the best of the shapelet approaches (ST, LS and FS) is on average 3.58% less accurate than the dictionary approaches on problems where the dictionary approach is the most accurate overall
Fig. 12Critical difference diagram for ten elastic distance measures. The elastic ensemble (EE) is significantly more accurate than its constituents
Fig. 13Time to classify ten test instances (averaged over 100 parameter options) for varying number of train instances of the problem StarlightCurves. The legend is ordered from fastest algorithm (ED) to slowest algorithm (TWE)
Fig. 14Texas sharp shooter plot for MSM against DTW. The top right quadrant contains the problems where both the train and test accuracy for MSM is higher than DTW
Fig. 15Critical difference diagram for three interval based techniques. There is no significant difference between them
Fig. 16Example series from ToeSegmentation1, left is class 1 (normal walking) and right is class 2 (abnormal walking)
Results of all algorithms on ToeSegmentation1 and ToeSegmentation2
| Algorithm | ToeSegmentation1 | ToeSegmentation2 |
|---|---|---|
| ST |
| 94.72% |
| LS | 93.43% | 94.26% |
| COTE | 93.37% | 95.15% |
| BOSS | 92.88% |
|
| SAXVSM | 92.79% | 92.08% |
| BoP | 92.62% | 91.17% |
|
| 92.20% | 90.38% |
| FS | 90.41% | 87.28% |
| TSBF | 85.82% | 88.58% |
| LPS | 84.12% | 92.64% |
| MSM | 82.13% | 89.52% |
| TWE | 79.59% | 88.85% |
| EE | 78.76% | 90.70% |
|
| 74.18% | 82.80% |
|
| 72.90% | 82.58% |
| WDTW | 72.79% | 86.22% |
| DTW | 72.20% | 85.09% |
|
| 71.80% | 84.39% |
| TSF | 66.10% | 78.24% |
| ED | 61.20% | 78.10% |
| RotF | 57.80% | 64.60% |
Bold values indicate the best overall algorithm for each problem (i.e. ST is best at Toe1, Boss at Toe2)
Fig. 17Example series from LargeKitchenApplicances. The top three are Washing Machine (class 3), the middle three Tumble Dryers (class 2) and the bottom three Dishwashers (class 1)