An Improved Feature Representation Method for Maximum Entropy Model.

Guan Yi,Zhao Jian
DOI: https://doi.org/10.1109/icdmw.2006.29
2006-01-01
Abstract:In maximum entropy model (MEM), features are typically represented by either 0-1 binary-valued function or real-valued function. However, both representations only examine the impact of specific value of some attributes but not their types. Such negligence not only causes the decreasing of classification precision, but also slows the convergence speed of the generalized iterative scaling (GIS) algorithm, as more apparent to incomplete data. In this paper, an improved feature representation method is presented. The feature is composed of two parts: the first one is for specific value of an attribute; the second one is for the type of corresponding attribute. The experimental results on Mushroom dataset of UCI data repository showed that the average classifying precisions on incomplete dataset and complete dataset were improved by 1.5% and 3.0% respectively, and the average convergence speed was improved by 42.9% and 90.7% respectively
What problem does this paper attempt to address?