Bayesian Nonparametric Density Autoregression with Lag Selection

Matthew Heiner,Athanasios Kottas
DOI: https://doi.org/10.1214/21-ba1296
2022-01-01
Bayesian Analysis
Abstract:[6pt] Supplementary_Material.pdf. Contains additional details referenced in the article. Section S1 explores considerations for a model that assumes stationarity of the time series. Section S2 provides a visualization of the effect of prior settings on the prior transition mean function. Section S3 reports a simulation study examining sensitivity of model runtime and posterior inferences to various settings. Section S4 contains additional details on the MCMC sampler presented in Section 2.3. Section S5 provides additional details for posterior simulation and inference, as well as an additional illustration with data simulated from an AR process. [6pt] Julia package. Package containing code to support fitting and post-processing models. [6pt] Example code. Code and data to fit models and perform simulation studies as described in the article. We develop a Bayesian nonparametric autoregressive model applied to flexibly estimate general transition densities exhibiting nonlinear lag dependence. Our approach is related to Bayesian density regression using Dirichlet process mixtures, with the Markovian likelihood defined through the conditional distribution obtained from the mixture. This results in a Bayesian nonparametric extension of a mixtures-of-experts model formulation. We address computational challenges to posterior sampling that arise from the Markovian structure in the likelihood. The base model is illustrated with synthetic data from a classical model for population dynamics, as well as a series of waiting times between eruptions of Old Faithful Geyser. We study inferences available through the base model before extending the methodology to include automatic relevance detection among a pre-specified set of lags. Inference for global and local lag selection is explored with additional simulation studies, and the methods are illustrated through analysis of an annual time series of pink salmon abundance in a stream in Alaska. We further explore and compare transition density estimation performance for alternative configurations of the proposed model. Supplementary materials are available online.
statistics & probability,mathematics, interdisciplinary applications
What problem does this paper attempt to address?