Evaluation of feature selection using information gain and gain ratio on bank marketing classification using naïve bayes

B Prasetiyo,Alamsyah,M A Muslim,N Baroroh
DOI: https://doi.org/10.1088/1742-6596/1918/4/042153
2021-06-01
Journal of Physics: Conference Series
Abstract:Abstract One of the efforts of banks to do marketing is by telephone to offer their products, such as deposits. There are many variables that influence whether the customer decides to subscribe or not. In this study, we present a comparison of feature selection from high features dataset. We use a bank marketing dataset which has 20 features and consists of 4,119 instances. We consider 2 ranking methods entropy-based, namely Information Gain (IG) and Gain Ratio (GR). In our experiment, we classified the various selected based on the ranking of the selected features using Naïve Bayes. We show that the selection of different features is important for classification accuracy. The different combinations of feature selection can affect the accuracy results.
What problem does this paper attempt to address?