Massively parallel learning of part of speech disambiguation

Bao-Liang Lu,Qing Ma,Michinori Ichikawa,Hitoshi Isahara
2000-01-01
Abstract:This paper presents a method for massively parallel learning of part-of-speech disambiguation based on a minmax modular neural network model. The method has three main steps. Firstly, a large-scale tagging problem is decomposed into a number of relatively smaller and simpler subproblems according to the class relations among a given training corpus. Secondly, all of the subproblems are learned by small network modules in parallel. Finally, following two module combination principles, all of the trained network modules are integrated into a modular parallel tagging system that produces solutions to the original tagging problem. The proposed method has several advantages over the existing tagging systems based on multilayer perceptrons.(1) Training times can be drastically reduced and desired learning accuracy can be easily achieved;(2) the method can scale up to larger tagging problems; and (3) the tagging system has quick response and facilitates hardware implementation. In order to demonstrate the effectiveness of the proposed method, we perform simulations on two different language corpora: a Thai corpus and a Chinese corpus, which have 29,028 and 45,595 ambiguous words, respectively. We also compare our method with several existing tagging models such as hidden Markov models, multilayer perceptrons and neuro-taggers. The results show that both the learning accuracy and generalization performance of the proposed tagging model are better than statistical models and multilayer perceptrons, and they are comparable to the most successful tagging models.
What problem does this paper attempt to address?