Robust explainer recommendation for time series classification

Thu Trang Nguyen,Thach Le Nguyen,Georgiana Ifrim
DOI: https://doi.org/10.1007/s10618-024-01045-8
IF: 5.406
2024-06-21
Data Mining and Knowledge Discovery
Abstract:Time series classification is a task which deals with temporal sequences, a prevalent data type common in domains such as human activity recognition, sports analytics and general sensing. In this area, interest in explanability has been growing as explanation is key to understand the data and the model better. Recently, a great variety of techniques (e.g., LIME, SHAP, CAM) have been proposed and adapted for time series to provide explanation in the form of saliency maps , where the importance of each data point in the time series is quantified with a numerical value. However, the saliency maps can and often disagree, so it is unclear which one to use. This paper provides a novel framework to quantitatively evaluate and rank explanation methods for time series classification . We show how to robustly evaluate the informativeness of a given explanation method (i.e., relevance for the classification task), and how to compare explanations side-by-side. The goal is to recommend the best explainer for a given time series classification dataset. We propose AMEE, a Model-Agnostic Explanation Evaluation framework, for recommending saliency-based explanations for time series classification. In this approach, data perturbation is added to the input time series guided by each explanation. Our results show that perturbing discriminative parts of the time series leads to significant changes in classification accuracy, which can be used to evaluate each explanation. To be robust to different types of perturbations and different types of classifiers, we aggregate the accuracy loss across perturbations and classifiers. This novel approach allows us to recommend the best explainer among a set of different explainers, including random and oracle explainers. We provide a quantitative and qualitative analysis for synthetic datasets, a variety of time-series datasets, as well as a real-world case study with known expert ground truth.
computer science, information systems, artificial intelligence
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper primarily addresses the issue of evaluating and recommending explanation methods in time series classification. Specifically: 1. **Proposing the AMEE Framework**: - The paper introduces a framework called AMEE (Model-Agnostic Explanation Evaluation) for evaluating and ranking explanation methods in time series classification. This framework aims to recommend the best explainers, thereby helping users better understand the data and models. 2. **Standardized Evaluation Measures**: - It proposes a standardized evaluation metric (explanation capability) that can be compared across different explanation methods, classifier judges, and datasets. 3. **Experimental Validation**: - Experiments were conducted on synthetic and real datasets, and the evaluation method's effectiveness was validated using annotated real datasets. All data, code, and detailed results are publicly available. 4. **Problem Solved**: - In time series classification tasks, the saliency maps obtained from various explanation methods are often inconsistent, necessitating a method to evaluate and compare the effectiveness of these explanation methods. The paper addresses this issue through the AMEE framework, providing an objective evaluation standard. Through these contributions, the paper addresses the shortcomings in the current evaluation of explanation methods in time series classification, offering a more reliable and general evaluation framework.