Vladislav Zhuzhel,Vsevolod Grabar,Galina Boeva,Artem Zabolotnyi,Alexander Stepikin,Vladimir Zholobov,Maria Ivanova,Mikhail Orlov,Ivan Kireev,Evgeny Burnaev,Rodrigo Rivera-Castro,Alexey Zaytsev
Abstract:Event sequences often emerge in data mining. Modeling these sequences presents two main challenges: methodological and computational. Methodologically, event sequences are non-uniform and sparse, making traditional models unsuitable. Computationally, the vast amount of data and the significant length of each sequence necessitate complex and efficient models. Existing solutions, such as recurrent and transformer neural networks, rely on parametric intensity functions defined at each moment. These functions are either limited in their ability to represent complex event sequences or notably inefficient.
We propose COTIC, a method based on an efficient convolution neural network designed to handle the non-uniform occurrence of events over time. Our paper introduces a continuous convolution layer, allowing a model to capture complex dependencies, including, e.g., the self-excitement effect, with little computational expense.
COTIC outperforms existing models in predicting the next event time and type, achieving an average rank of 1.5 compared to 3.714 for the nearest competitor. Furthermore, COTIC`s ability to produce effective embeddings demonstrates its potential for various downstream tasks. Our code is open and available at: <a class="link-external link-https" href="https://github.com/VladislavZh/COTIC" rel="external noopener nofollow">this https URL</a>.
What problem does this paper attempt to address?
### What problems does this paper attempt to solve?
This paper aims to solve **two major challenges in event - sequence modeling**: methodological challenges and computational challenges. Specifically:
1. **Methodological challenges**:
- Event sequences are usually **non - uniform and sparse**, which makes traditional models (such as traditional models based on parameterized intensity functions) inapplicable.
- Existing solutions (such as recurrent neural networks and Transformer neural networks) rely on parameterized intensity functions defined at each time point, and these functions are either limited in their ability to represent complex event sequences or inefficient.
2. **Computational challenges**:
- The large amount of data and the significant length of each sequence require the model to be both complex and efficient.
To solve these problems, the authors propose the **COTIC (COntinuous - TIme Convolutional) model**, a convolutional neural network - based method specifically designed to handle events that occur non - uniformly over time. By introducing continuous convolutional layers, COTIC can capture complex dependencies, including self - excitation effects, while maintaining low computational costs.
### Specific improvements of COTIC
- **Continuous convolutional layer**: COTIC introduces continuous convolutional layers, avoiding closed - form parameter assumptions for intensity functions, thereby being able to model a wider range of event dependencies.
- **Self - supervised representation learning**: Due to its generative nature, COTIC has the ability of self - supervised representation learning and can generate effective embedding representations suitable for various downstream tasks.
- **Performance improvement**: COTIC outperforms existing models in predicting the time and type of the next event, with an average rank of 1.5, while the closest competitor has a rank of 3.714 (see Table 1). In addition, the embedding representation of COTIC also shows higher accuracy in tasks such as age - group prediction.
### Conclusion
Through innovative continuous convolutional layers and efficient architecture design, COTIC successfully solves the problems of non - uniformity and computational complexity in event - sequence modeling, providing a more powerful and flexible tool for event - sequence prediction and analysis.
#### Table 1: Average ranks of different methods on eight datasets (the lower, the better)
| Method | MAE of next - event time, rank | Accuracy of next - event type, rank | Average rank |
| ---- | ---- | ---- | ---- |
| RMTPP | 5.286 | 6.714 | 6 |
| Neural Hawkes | 4.143 | 3.286 | 3.714 |
| ODETPP | 6.571 | 5.571 | 6.071 |
| THP | 7.143 | 5.714 | 6.428 |
| THP2SAHP | 6.571 | 5.143 | 5.857 |
| Attentive NHP | 3.714 | 4.714 | 4.214 |
| WaveNet | 4.143 | |