Abstract:Explanation for Multivariate Time Series Classification (MTSC) is an important topic that is under explored. There are very few quantitative evaluation methodologies and even fewer examples of actionable explanation, where the explanation methods are shown to objectively improve specific computational tasks on time series data. In this paper we focus on analyzing InterpretTime, a recent evaluation methodology for attribution methods applied to MTSC. We showcase some significant weaknesses of the original methodology and propose ideas to improve both its accuracy and efficiency. Unlike related work, we go beyond evaluation and also showcase the actionability of the produced explainer ranking, by using the best attribution methods for the task of channel selection in MTSC. We find that perturbation-based methods such as SHAP and Feature Ablation work well across a set of datasets, classifiers and tasks and outperform gradient-based methods. We apply the best ranked explainers to channel selection for MTSC and show significant data size reduction and improved classifier accuracy.

What problem does this paper attempt to address?

### Problems the Paper Aims to Solve This paper primarily focuses on interpretability methods in Multivariate Time Series Classification (MTSC) and attempts to improve the evaluation and practical applicability of these methods. #### Main Research Content: 1. **Evaluating Existing Interpretability Methods**: The authors evaluate current attribution methods used for MTSC, analyzing their strengths and limitations, especially in synthetic and real datasets. 2. **Quantitative Comparison of Various Interpretability Methods**: Based on the existing evaluation framework InterpretTime, the authors identify significant weaknesses, such as the need for training data augmentation, which limits the types of classifiers that can be used, and inefficiency in obtaining interpretability evaluations. 3. **Proposing Improvements**: The authors propose improvements to the InterpretTime framework, including the use of multiple masks instead of data augmentation, and demonstrate that grouping consecutive time points by channel can reduce runtime and improve interpretability performance. 4. **Demonstrating Practical Applicability of Interpretability**: By using the best attribution methods for channel selection tasks in MTSC, the authors show that their proposed methods not only improve classifier accuracy but also significantly reduce the amount of data required. ### Research Background and Contributions - **Research Background**: With the widespread application of machine learning algorithms in daily life, interpreting the results of these models has become particularly important. Especially in multivariate time series classification, reliable classifiers are needed along with methods that can explain the classification results to users. - **Contributions**: The authors quantitatively compare various interpretability methods and propose improvements to the existing evaluation framework, thereby enhancing the quality and efficiency of interpretability. Additionally, by practically applying these interpretability methods for channel selection, they validate their effectiveness in specific tasks. ### Conclusion This paper aims to improve the evaluation framework for interpretability methods in multivariate time series classification and validates the effectiveness of the improved methods through practical application. The research results indicate that perturbation-based methods (such as SHAP and feature ablation) perform well across multiple datasets, classifiers, and tasks, outperforming gradient-based methods. Furthermore, channel selection using the best interpreters significantly reduces data volume and improves classifier accuracy.

Improving the Evaluation and Actionability of Explanation Methods for Multivariate Time Series Classification

XCM: an Explainable Convolutional Neural Network for Multivariate Time Series Classification.

Robust explainer recommendation for time series classification

Ranking by Aggregating Referees: Evaluating the Informativeness of Explanation Methods for Time Series Classification

Visual Explanations with Attributions and Counterfactuals on Time Series Classification

InterpretTime: a new approach for the systematic evaluation of neural-network interpretability in time series classification

The great multivariate time series classification bake off: a review and experimental evaluation of recent algorithmic advances

Explaining deep multi-class time series classifiers

SSET: Swapping-Sliding Explanation for Time Series Classifiers in Affect Detection

A Unified Study of Machine Learning Explanation Evaluation Metrics

TV-Net: Temporal-Variable feature harmonizing Network for multivariate time series classification and interpretation

Explainable AI for Time Series Classification: A Review, Taxonomy and Research Directions

Evaluation of post-hoc interpretability methods in time-series classification

CAFO: Feature-Centric Explanation on Time Series Classification

M-CELS: Counterfactual Explanation for Multivariate Time Series Data Guided by Learned Saliency Maps

TS-MULE: Local Interpretable Model-Agnostic Explanations for Time Series Forecast Models

"Is your explanation stable?": A Robustness Evaluation Framework for Feature Attribution

Time is Not Enough: Time-Frequency based Explanation for Time-Series Black-Box Models

Inherently Interpretable Time Series Classification via Multiple Instance Learning

Towards a Unified Framework for Evaluating Explanations