EFFICIENT INFERENCE AND TRAINING FOR CONDITIONAL LATENT VARIABLE MODELS¡¢£¤¥¦ § ©

SUN Xu
2009-01-01
Abstract:Real-world problems may contain latent dependencies (ie, hidden sub-structures) that are difficult to capture with conventional structured classifiers, such as conditional random fields. In such cases, models that exploit latent variables are advantageous in learning, and conditional latent variable models have been applied successfully into real-world tasks in natural language processing and vision processing communities, including syntactic parsing and vision recognition. In the first part of this thesis, I perform experiments in a variety of tasks to confirm the advantages of conditional latent variable models, and investigate what kind of latent dependencies are learned by the model. While the experiments confirmed the advantages of conditional latent variable models, the same experiments revealed two problems in applying such models for practical usages. First, establishing an efficient inference method on latent conditional models remains an open question. Second, because of the incorporation of latent variables, training a latent conditional model brings a heavy computational cost. To deal with those critical problems, I propose efficient methods for inference and training on latent conditional models.In the inference stage, I propose the “latent-dynamic inference”, which is able to produce the exact optimal label sequence on latent conditional models by systematically combining efficient search strategy (the A* search) and dynamic programming (the Forward-Backward method). I also describe a straightforward solution on approximating the exact method, and show that the approximated version performs as well as the exact one, while the speed …
What problem does this paper attempt to address?