Abstract:In the pattern theoretical framework developed by Grenander and advocated by Mumford for computer vision and pattern recognition, different patterns are represented by statistical generative models. The FRAME (Filters, Random fields, And Maximum Entropy) model is such a generative model for texture patterns. It is a Markov random field model (or a Gibbs distribution, or an energy-based model) of stationary spatial processes. The log probability density function of the model (or the energy function of the Gibbs distribution) is the sum of translation-invariant potential functions that are one-dimensional non-linear transformations of linear filter responses. In this paper, we review two generalizations of this model. One is a sparse FRAME model for non-stationary patterns such as objects, where the potential functions are location specific, and they are non-zero only at a selected collection of locations. The other generalization is a deep FRAME model where the filters are defined by a convolutional neural network (CNN or ConvNet). This leads to a deep convolutional energy-based model. The local modes of the energy function satisfies an auto-encoder which we call the Hopfield auto-encoder. The model can be learned by an “analysis by synthesis” algorithm that iterates a sampling step for synthesis and a learning step for analysis. The algorithm admits an adversarial interpretation where the learning step and sampling step play a minimax game based on a value function. We can recruit a generator model as a direct and approximate sampler of the deep energy-based model to speed up the sampling step, and the two models can be learned simultaneously by a cooperative learning algorithm.

Generative Hierarchical Learning of Sparse FRAME Models

Learning Sparse Frame Models for Natural Image Patterns

Sparse and Deep Generalizations of the FRAME Model

Learning FRAME Models Using CNN Filters

Learning Inhomogeneous Frame Models for Object Patterns

Adaptive Hierarchical Motion-Focused Model for Video Prediction.

Learning FRAME Models Using CNN Filters for Knowledge Visualization.

How Deep Networks Learn Sparse and Hierarchical Data: the Sparse Random Hierarchy Model

Object Tracking with Hierarchical Multiview Learning

Adaptive Recurrent Frame Prediction with Learnable Motion Vectors.

Unsupervised Learning of Dictionaries of Hierarchical Compositional Models

Learning Adaptive Filter Banks for Hierarchical Image Representation

LeaF: Learning Frames for 4D Point Cloud Sequence Understanding

Learning Generative Models of Scene Features

Inducing Hierarchical Compositional Model by Sparsifying Generator Network

Exploring Generative Perspective of Convolutional Neural Networks by Learning Random Field Models

Learning Hierarchical Features with Joint Latent Space Energy-Based Prior

Learning Redundant Sparsifying Transform Based on Equi-Angular Frame.

Learning Sparse Latent Representations for Generator Model

Nested Diffusion Models Using Hierarchical Latent Priors