Skip to content

Latest commit

 

History

History
80 lines (56 loc) · 4.38 KB

README.md

File metadata and controls

80 lines (56 loc) · 4.38 KB

Previous AALTD@ECML-2021

Scalable Classifier-Agnostic Channel Selection for Multivariate Time Series Classification

Accepted for ECML2023, Journal Track

Abstract:

Accuracy is a key focus of current work in time series classification. However, in many applications speed and data reduction is equally important, especially when the data scale and storage requirements increase rapidly. Current multivariate time series classification algorithms need hundreds of compute hours to complete training and prediction. This is due to the nature of multivariate time series data which grows with the number of time series, their length and the number of channels. In many applications not all the channels are useful for the classification task, hence we require methods that can efficiently select useful channels and thus save computational resources. We propose and evaluate two methods for channel selection. Our techniques work by representing each class by a prototype time series and performing channel selection based on the prototype-distance between classes. The main hypothesis is that useful channels enable better separation between classes and hence channels with higher distance between class-prototypes are more useful. On the UEA Multivariate Time Series Classification (MTSC) benchmark we show that these techniques achieve significant data reduction and classifier speedup, for similar levels of classification accuracy. Channel selection is applied as a pre-processing step before training state-of-the-art MTSC algorithms and saves about 70% of computation time and data storage, with preserved accuracy. Furthermore, our methods enable even efficient classifiers, such as ROCKET, to achieve better accuracy as compared to using no channel selection or forward channel selection. To further study the impact of our techniques we present experiments on classifying synthetic multivariate time series datasets with more than 100 channels, as well as a real-world case study on a dataset with 50 channels. We find that our channel selection methods lead to significant data reduction with preserved or improved accuracy.

Result

image

image

Case Study 1: Sythetic Datasets

image

Case Study 2: Military Press Dataset

image

Running instructions

Look into examples

Implementation

Aeon-toolkit : ECS, ECP

Military Press Dataset

Download here

Citation

Please use below two papers to cite the work.

@article{dhariyal2023scalable,
  title={Scalable classifier-agnostic channel selection for multivariate time series classification},
  author={Dhariyal, Bhaskar and Le Nguyen, Thach and Ifrim, Georgiana},
  journal={Data Mining and Knowledge Discovery},
  pages={1--45},
  year={2023},
  publisher={Springer}
}
@inproceedings{dhariyal2021fast,
  title={Fast Channel Selection for Scalable Multivariate Time Series Classification},
  author={Dhariyal, Bhaskar and Nguyen, Thach Le and Ifrim, Georgiana},
  booktitle={International Workshop on Advanced Analytics and Learning on Temporal Data},
  pages={36--54},
  year={2021},
  organization={Springer}
}

If you are using Military Press dataset, please cite:

@inproceedings{singh2021interpretable,
  title={Interpretable classification of human exercise videos through pose estimation and multivariate time series analysis},
  author={Singh, Ashish and Le, Binh Thanh and Nguyen, Thach Le and Whelan, Darragh and O’Reilly, Martin and Caulfield, Brian and Ifrim, Georgiana},
  booktitle={International Workshop on Health Intelligence},
  pages={181--199},
  year={2021},
  organization={Springer}
}