Abstract:Kernel methods have been successfully applied in various applications. To succeed in these applications, it is crucial to learn a good kernel representation, whose objective is to reveal the data similarity precisely. In this paper, we address the problem of multiple kernel learning (MKL), searching for the optimal kernel combination weights through maximizing a generalized performance measure. Most MKL methods employ the L(1)-norm simplex constraints on the kernel combination weights, which therefore involve a sparse but non-smooth solution for the kernel weights. Despite the success of their efficiency, they tend to discard informative complementary or orthogonal base kernels and yield degenerated generalization performance. Alternatively, imposing the L(p)-norm (p > 1) constraint on the kernel weights will keep all the information in the base kernels. This leads to non-sparse solutions and brings the risk of being sensitive to noise and incorporating redundant information. To tackle these problems, we propose a generalized MKL (GMKL) model by introducing an elastic-net-type constraint on the kernel weights. More specifically, it is an MKL model with a constraint on a linear combination of the L(1)-norm and the squared L(2)-norm on the kernel weights to seek the optimal kernel combination weights. Therefore, previous MKL problems based on the L(1)-norm or the L(2)-norm constraints can be regarded as special cases. Furthermore, our GMKL enjoys the favorable sparsity property on the solution and also facilitates the grouping effect. Moreover, the optimization of our GMKL is a convex optimization problem, where a local solution is the global optimal solution. We further derive a level method to efficiently solve the optimization problem. A series of experiments on both synthetic and real-world datasets have been conducted to show the effectiveness and efficiency of our GMKL.

Triply Stochastic Gradients on Multiple Kernel Learning

Generalized Convexity-Based Inexact Projection Method for Multiple Kernel Learning

Multi-kernel Passive Stochastic Gradient Algorithms and Transfer Learning

Parallel Multiple Kernel Learning: a Hybrid Alternating Direction Method of Multipliers

Stein Variational Gradient Descent with Multiple Kernel

Parallel multiple kernel learning

Efficient Multiple Kernel Learning Algorithms Using Low-Rank Representation.

A Primal Method for Multiple Kernel Learning

Multiple kernel dimensionality reduction via spectral regression and trace ratio maximization

Demystifying SGD with Doubly Stochastic Gradients

Asynchronous Doubly Stochastic Sparse Kernel Learning.

Two-stage multiple kernel learning with multiclass kernel polarization.

Efficient sparse generalized multiple kernel learning

Fast randommultiple kernel learning for classification

Regularization Strategies and Empirical Bayesian Learning for MKL

On the Consistency and Large-Scale Extension of Multiple Kernel Clustering

Scalable Kernel Ordinal Regression Via Doubly Stochastic Gradients.

Joint Learning of Distance Metric and Kernel Classifier via Multiple Kernel Learning.

Multiple Empirical Kernel Learning with Discriminant Locality Preservation

A Novel Multiple Nyström-approximating Kernel Discriminant Analysis

Linear discriminant multiple kernel learning for multispectral image classification