Abstract:Despite its practical significance, generating realistic synthetic financial time series is challenging due to statistical properties known as stylized facts, such as fat tails, volatility clustering, and seasonality patterns. Various generative models, including generative adversarial networks (GANs) and variational autoencoders (VAEs), have been employed to address this challenge, although no model yet satisfies all the stylized facts. We alternatively propose utilizing diffusion models, specifically denoising diffusion probabilistic models (DDPMs), to generate synthetic financial time series. This approach employs wavelet transformation to convert multiple time series (into images), such as stock prices, trading volumes, and spreads. Given these converted images, the model gains the ability to generate images that can be transformed back into realistic time series by inverse wavelet transformation. We demonstrate that our proposed approach satisfies stylized facts.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to generate realistic synthetic financial time series. Although generating synthetic financial time series has important practical significance, this task is extremely challenging due to its statistical characteristics (i.e., the so - called "stylized facts", such as fat - tailed, volatility clustering, and seasonal patterns). Existing generative models, such as generative adversarial networks (GANs) and variational auto - encoders (VAEs), can capture these characteristics to a certain extent, but have not yet fully met the requirements of all stylized facts.
To solve this problem, the authors propose an approach based on diffusion models (specifically, denoising diffusion probability models, DDPMs) to generate synthetic financial time series. Specifically, this method transforms multiple time series (such as stock prices, trading volumes, and bid - ask spreads) into images through wavelet transforms and uses DDPMs to learn the features of these images. After training, the images generated by the model can be transformed back into real - time series through inverse wavelet transforms. Experimental results show that this method can well reproduce the stylized facts, thereby significantly improving the quality of synthetic financial time series.
### Overview of Stylized Facts
Stylized facts refer to some typical statistical characteristics in financial time series, mainly including the following:
1. **Fat - tailed distribution**: Changes in financial time series usually follow a power - law distribution rather than a normal distribution. This means that the frequency of extreme events is higher than that predicted by the normal distribution.
2. **Volatility clustering**: Volatility shows continuous high - and - low - fluctuation periods, that is, large fluctuations are often accompanied by more large fluctuations, and small fluctuations also tend to cluster together.
3. **Seasonal pattern**: Intra - day volatility, bid - ask spreads, and trading volumes show different patterns at different times of the day, for example, there are larger fluctuations at the opening and closing, and smaller fluctuations at noon.
### Method Innovation Points
- **Wavelet transform**: Transforming time series into two - dimensional images enables the model to better capture multi - scale features in time series.
- **Application of DDPMs**: Compared with GANs and VAEs, DDPMs show higher quality and diversity in image generation, which helps to generate more realistic financial time series.
- **Synchronous generation of multiple time series**: By simultaneously processing three related time series (stock prices, bid - ask spreads, and trading volumes), the model can better capture the inter - relationships between them.
### Experimental Verification
By conducting experiments on the historical data of AAPL.O stocks, the authors verified the effectiveness of the proposed method. Experimental results show that this method can not only reproduce the stylized facts, but also accurately simulate the intra - day seasonal patterns and cross - correlations between time series. Compared with existing methods such as TimeGAN and QuantGAN, this method performs excellently on multiple evaluation indicators.
In conclusion, this paper proposes a novel and effective method, which solves the problem that existing generative models cannot fully meet the stylized facts when generating synthetic financial time series, and provides a powerful tool for financial market simulation and risk assessment.