An Information-Based Method for Selecting Feature Types for Word Prediction

Dekai Wu,Zhifang Sui,Jun Zhao
DOI: https://doi.org/10.21437/eurospeech.1999-454
1999-01-01
Abstract:This paper uses an information-based approach to conduct feature types selection for language modeling in a systematic manner. We describe a quantitative analysis of the information gain and the information redundancy for various combinations of feature types inspired by both dependency structure and bigram structure through analyzing an English treebank corpus and taking word prediction as the object. The experiments yield several conclusions on the predictive value of several feature types and feature types combinations for word prediction, which are expected to provide reliable reference for feature type selection in language modeling.
What problem does this paper attempt to address?