Large Margin Boltzmann Machines and Large Margin Sigmoid Belief Networks

Xu Miao, Rajesh P.N. Rao
DOI: https://doi.org/10.48550/arXiv.1003.4781
2010-03-25
Abstract: Current statistical models for structured prediction make simplifying assumptions about the underlying output graph structure, such as assuming a low-order Markov chain, because exact inference becomes intractable as the tree-width of the underlying graph increases. Approximate inference algorithms, on the other hand, force one to trade off representational power with computational efficiency. In this paper, we propose two new types of probabilistic graphical models, large margin Boltzmann machines (LMBMs) and large margin sigmoid belief networks (LMSBNs), for structured prediction. LMSBNs in particular allow a very fast inference algorithm for arbitrary graph structures that runs in polynomial time with a high probability. This probability is data-distribution dependent and is maximized in learning. The new approach overcomes the representation-efficiency trade-off in previous models and allows fast structured prediction with complicated graph structures. We present results from applying a fully connected model to multi-label scene classification and demonstrate that the proposed approach can yield significant performance gains over current state-of-the-art methods.
Machine Learning, Artificial Intelligence, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem this paper addresses is: **in structured prediction, how to overcome the trade-off between representational power and computational efficiency, so that inference and learning on complex graph structures can be both fast and accurate**.

Specifically, current structured prediction models (such as conditional random fields and maximum-margin Markov networks) usually make simplifying assumptions about the output graph structure to keep inference tractable. These assumptions limit the representational power of the model. Approximate inference algorithms can operate on more complex graph structures, but they force a compromise between representational power and computational efficiency. As a result, existing methods struggle to achieve efficient inference and strong representational power at the same time.

To solve this problem, the paper proposes two new probabilistic graphical models: **large margin Boltzmann machines (LMBMs) and large margin sigmoid belief networks (LMSBNs)**. LMSBNs in particular allow fast inference on arbitrary graph structures, and inference becomes faster as the model fits the data distribution better. This lets LMSBNs handle complex graph structures while keeping inference efficient, which avoids the trade-off between representational power and computational efficiency.

### Main contributions of the paper:

1. **LMBMs and LMSBNs**: Two models that can perform structured prediction on complex graph structures.
2. **Fast inference algorithm**: A branch-and-bound inference algorithm for LMSBNs. It runs in polynomial time with high probability for arbitrary graph structures, and it speeds up as the fit between the model and the data improves.
3. **Theoretical analysis**: Tools for analyzing the generalization performance of LMBMs and LMSBNs, together with a proof of the advantage of LMSBNs on complex graph structures.
4. **Experimental verification**: Multi-label scene classification experiments that verify the performance improvement of the proposed method over existing methods.

### Summary:

This paper targets the trade-off between representational power and computational efficiency in structured prediction. By introducing LMBMs and LMSBNs, it achieves fast inference and efficient learning on complex graph structures.
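To make the branch-and-bound idea concrete, here is a minimal Python sketch of exact MAP inference in a fully connected sigmoid belief network. It is an illustration under stated assumptions, not the paper's algorithm: the summary above does not say which bound or search order the authors use, so this version relies only on the generic fact that every conditional log-probability is at most zero, which makes the log-probability of a partial labeling an upper bound on any completion. The names `map_inference`, `bias`, and `weights` are hypothetical, and `bias[i]` is assumed to already include any input-dependent term.

```python
import math
from typing import List, Sequence, Tuple


def log_sigmoid(z: float) -> float:
    """Numerically stable log(sigmoid(z))."""
    if z >= 0:
        return -math.log1p(math.exp(-z))
    return z - math.log1p(math.exp(z))


def map_inference(
    bias: Sequence[float],
    weights: Sequence[Sequence[float]],
) -> Tuple[List[int], float, int]:
    """Exact MAP labeling by depth-first branch and bound.

    Assumed (hypothetical) parameterization:
        P(y_i = 1 | y_0..y_{i-1}) = sigmoid(bias[i] + sum_j weights[i][j] * y_j)
    where weights[i] has one entry per earlier label j < i.

    Returns (best labels, best log-probability, number of expanded nodes).
    """
    n = len(bias)
    best_labels: List[int] = []
    best_logp = -math.inf
    expanded = 0

    def search(prefix: List[int], logp: float) -> None:
        nonlocal best_labels, best_logp, expanded
        # Bound: each remaining factor contributes a log-probability <= 0,
        # so logp is an upper bound on every completion of this prefix.
        if logp <= best_logp:
            return
        i = len(prefix)
        if i == n:
            best_labels, best_logp = list(prefix), logp
            return
        expanded += 1
        z = bias[i] + sum(weights[i][j] * prefix[j] for j in range(i))
        # Branch: try the more probable value of y_i first.
        branches = [(1, log_sigmoid(z)), (0, log_sigmoid(-z))]
        branches.sort(key=lambda b: b[1], reverse=True)
        for value, step in branches:
            prefix.append(value)
            search(prefix, logp + step)
            prefix.pop()

    search([], 0.0)
    return best_labels, best_logp, expanded


if __name__ == "__main__":
    # A "confident" toy model: each conditional is far from 0.5.
    labels, logp, expanded = map_inference(
        bias=[4.0, -4.0, 0.0],
        weights=[[], [8.0], [5.0, 1.0]],
    )
    print(labels, round(logp, 4), expanded)
```

In this sketch, a model whose conditionals are close to 0 or 1 lets the first greedy path set a tight incumbent, so almost every other branch is pruned immediately. A model with conditionals near 0.5 prunes far less. That is one intuitive reading of the claim that inference speeds up as the model fits the data better, though the paper's own analysis may differ.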