Mutual Information with Parameter Determination Approach for Feature Selection in Multivariate Time Series Prediction.

Tianhong Liu,Haikun Wei,Chi Zhang,Kanjian Zhang
DOI: https://doi.org/10.1007/978-3-319-44188-7_17
2016-01-01
Abstract:For modeling of multivariate time series, input variable selection is a key problem. Feature selection is to select a relevant subset to reduce the dimensionality of the problem without significant loss of information. This paper presents the estimation of mutual information and its application in feature selection problem. Mutual information is one of the most common strategies borrowed from information theory for feature selection. However, the calculation of probability density function (PDF) according to the definition of mutual information is difficult, especially for high dimensional variables. A k-nearest neighbor (k-NN) method based estimator is widely used to estimate the mutual information between two variables directly from the data set. Nevertheless, this estimator depends on smoothing parameter. There is no theoretically method to choose the parameter. This paper purposes to solve two problems: one is to employ resampling methods to help the mutual information estimator to improve feature selection and the other is to apply these methods to a wind power prediction problem.
What problem does this paper attempt to address?