Compact Flow Diagrams for State Sequences

Kevin Buchin, Maike Buchin, Joachim Gudmundsson, Michael Horton, Stef Sijben
DOI: https://doi.org/10.1145/3150525
2016-02-18
Abstract: We introduce the concept of compactly representing a large number of state sequences, e.g., sequences of activities, as a flow diagram. We argue that the flow diagram representation gives an intuitive summary that allows the user to detect patterns among large sets of state sequences. Simplified, our aim is to generate a small flow diagram that models the flow of states of all the state sequences given as input. For a small number of state sequences we present efficient algorithms to compute a minimal flow diagram. For a large number of state sequences we show that it is unlikely that efficient algorithms exist. More specifically, the problem is W[1]-hard if the number of state sequences is taken as a parameter. We thus introduce several heuristics for this problem. We argue about the usefulness of the flow diagram by applying the algorithms to two problems in sports analysis. We evaluate the performance of our algorithms on a football data set and generated data.
Data Structures and Algorithms, Computational Complexity
What problem does this paper attempt to address?
The paper addresses how to represent a large number of state sequences, such as sequences of activities, compactly enough that a user can read the result as an intuitive summary and detect patterns across the whole set. Specifically, the goal is to generate a small flow diagram that models the flow of states of all the input sequences. For a small number of state sequences, the paper presents efficient algorithms that compute a minimal flow diagram. For a large number of state sequences, it shows that efficient algorithms are unlikely to exist: the problem is W[1]-hard when the number of state sequences is taken as the parameter. The paper therefore introduces several heuristics. It argues for the usefulness of flow diagrams by applying the algorithms to two problems in sports analysis, and it evaluates their performance on a football data set and on generated data.
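To make the idea of summarizing many state sequences in one small diagram concrete, the following is a minimal sketch and not one of the paper's algorithms or heuristics. It assumes a much simpler model than the paper's: consecutive repeated states are collapsed, and sequences that share a prefix are merged into the same node of a prefix tree. The function name `build_prefix_flow` and the sample activity labels are hypothetical, chosen only for illustration.

```python
from collections import defaultdict


def build_prefix_flow(sequences):
    """Merge state sequences that share a prefix into a prefix tree.

    Illustrative assumption only: a node is identified by the tuple of
    states on the path leading to it, and its value is the number of
    input sequences that pass through it.
    """
    counts = defaultdict(int)
    for seq in sequences:
        # Collapse consecutive repeats, so "pass, pass" becomes one state.
        collapsed = [s for i, s in enumerate(seq) if i == 0 or s != seq[i - 1]]
        # Every prefix of the collapsed sequence is one node on its path.
        for k in range(1, len(collapsed) + 1):
            counts[tuple(collapsed[:k])] += 1
    return dict(counts)


# Hypothetical sample data: three short sequences of activities.
sequences = [
    ["pass", "pass", "dribble", "shot"],
    ["pass", "dribble", "shot"],
    ["pass", "cross", "shot"],
]

flow = build_prefix_flow(sequences)
for path, n in sorted(flow.items()):
    print(" -> ".join(path), f"({n} sequences)")
```

In this sample the ten input states reduce to five nodes, with the first two sequences sharing one path. A prefix tree only merges common beginnings, so it is generally far from minimal; finding a minimal diagram is the hard problem the paper studies.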