Improving the Evaluation and Actionability of Explanation Methods for Multivariate Time Series Classification

Davide Italo Serramazza,Thach Le Nguyen,Georgiana Ifrim
2024-08-12
Abstract:Explanation for Multivariate Time Series Classification (MTSC) is an important topic that is under explored. There are very few quantitative evaluation methodologies and even fewer examples of actionable explanation, where the explanation methods are shown to objectively improve specific computational tasks on time series data. In this paper we focus on analyzing InterpretTime, a recent evaluation methodology for attribution methods applied to MTSC. We showcase some significant weaknesses of the original methodology and propose ideas to improve both its accuracy and efficiency. Unlike related work, we go beyond evaluation and also showcase the actionability of the produced explainer ranking, by using the best attribution methods for the task of channel selection in MTSC. We find that perturbation-based methods such as SHAP and Feature Ablation work well across a set of datasets, classifiers and tasks and outperform gradient-based methods. We apply the best ranked explainers to channel selection for MTSC and show significant data size reduction and improved classifier accuracy.
Machine Learning
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve This paper primarily focuses on interpretability methods in Multivariate Time Series Classification (MTSC) and attempts to improve the evaluation and practical applicability of these methods. #### Main Research Content: 1. **Evaluating Existing Interpretability Methods**: The authors evaluate current attribution methods used for MTSC, analyzing their strengths and limitations, especially in synthetic and real datasets. 2. **Quantitative Comparison of Various Interpretability Methods**: Based on the existing evaluation framework InterpretTime, the authors identify significant weaknesses, such as the need for training data augmentation, which limits the types of classifiers that can be used, and inefficiency in obtaining interpretability evaluations. 3. **Proposing Improvements**: The authors propose improvements to the InterpretTime framework, including the use of multiple masks instead of data augmentation, and demonstrate that grouping consecutive time points by channel can reduce runtime and improve interpretability performance. 4. **Demonstrating Practical Applicability of Interpretability**: By using the best attribution methods for channel selection tasks in MTSC, the authors show that their proposed methods not only improve classifier accuracy but also significantly reduce the amount of data required. ### Research Background and Contributions - **Research Background**: With the widespread application of machine learning algorithms in daily life, interpreting the results of these models has become particularly important. Especially in multivariate time series classification, reliable classifiers are needed along with methods that can explain the classification results to users. - **Contributions**: The authors quantitatively compare various interpretability methods and propose improvements to the existing evaluation framework, thereby enhancing the quality and efficiency of interpretability. Additionally, by practically applying these interpretability methods for channel selection, they validate their effectiveness in specific tasks. ### Conclusion This paper aims to improve the evaluation framework for interpretability methods in multivariate time series classification and validates the effectiveness of the improved methods through practical application. The research results indicate that perturbation-based methods (such as SHAP and feature ablation) perform well across multiple datasets, classifiers, and tasks, outperforming gradient-based methods. Furthermore, channel selection using the best interpreters significantly reduces data volume and improves classifier accuracy.